×
Register Here to Apply for Jobs or Post Jobs. X
More jobs:

Member of Technical Staff - Safety Lead

Job in San Francisco, San Francisco County, California, 94199, USA
Listing for: Reflection
Full Time position
Listed on 2026-01-16
Job specializations:
  • Software Development
    AI Engineer
Salary/Wage Range or Industry Benchmark: 120000 - 160000 USD Yearly USD 120000.00 160000.00 YEAR
Job Description & How to Apply Below

Our Mission

Reflection’s mission is to build open superintelligence and make it accessible to all
.

We’re developing open weight models for individuals, agents, enterprises, and even nation states. Our team of AI researchers and company builders come from Deep Mind, OpenAI, Google Brain, Meta, Character.

AI, Anthropic and beyond.

About the Role
  • Own the red-teaming and adversarial evaluation pipeline for Reflection’s models, continuously probing for failure modes across security, misuse, and alignment gaps.

  • Work hand-in-hand with the Alignment team to translate safety findings into concrete guardrails, ensuring models behave reliably under stress and adhere to deployment policies.

  • Validate that every release meets the lab’s risk thresholds before it ships, serving as a critical gatekeeper for our open weight releases.

  • Develop scalable, automated safety benchmarks that evolve alongside our model capabilities, moving beyond static datasets to dynamic adversarial testing.

  • Research and implement state-of-the‑art jail breaking techniques and defenses to stay ahead of potential vulnerabilities in the wild.

About You
  • Graduate degree (MS or PhD) in Computer Science, Machine Learning, or related discipline, or equivalent practical experience in AI Safety.

  • Deep technical understanding of LLM safety, including adversarial attacks, red‑teaming methodologies, and interpretability.

  • Strong software engineering capabilities with experience building automated evaluation pipelines or large‑scale ML systems.

  • Experience with Reinforcement Learning (RLHF/RLAIF) and how it impacts model safety and alignment is a strong plus.

  • Thrive in a fast‑paced, high‑agency startup environment with bias toward action.

  • Willing to make high‑stakes decisions regarding model release and safety thresholds.

  • Passionate about advancing the frontier of intelligence.

What We Offer:

We believe that to build superintelligence that is truly open, you need to start at the foundation. Joining Reflection means building from the ground up as part of a small talent‑dense team. You will help define our future as a company, and help define the frontier of open foundational models.

We want you to do the most impactful work of your career with the confidence that you and the people you care about most are supported.

  • Top‑tier compensation: Salary and equity structured to recognize and retain the best talent globally.

  • Health & wellness: Comprehensive medical, dental, vision, life, and disability insurance.

  • Life & family: Fully paid parental leave for all new parents, including adoptive and surrogate journeys. Financial support for family planning.

  • Benefits & balance: paid time off when you need it, relocation support, and more perks that optimize your time.

  • Opportunities to connect with teammates: lunch and dinner are provided daily. We have regular off‑sites and team celebrations.

#J-18808-Ljbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary