Research Scientist
Listed on 2026-02-02
-
Research/Development
Artificial Intelligence, Data Scientist -
Engineering
Artificial Intelligence, AI Engineer
Overview
The Center for AI Safety is a research and field-building nonprofit located in San Francisco. Our mission is to reduce catastrophic and existential risks from artificial intelligence through field-building and technical research.
The Center for AI Safety is a research and field-building nonprofit dedicated to ensuring the safety of future artificial intelligence systems. We believe that artificial intelligence will be a powerful technology which will dramatically change society and that AI safety must therefore be pursued proactively. To this end, we conduct research into machine learning safety and facilitate field-building projects which accelerate the growth of the safety community.
Join us in steering the future of AI.
- As a research scientist, pursue a variety of research projects in fields such as AI Honesty, Utility Engineering, Trojans, Transparency, and Robustness. Set research directions and strategies to make our AI systems safer, more aligned and more robust.
- Assist in writing and submitting articles for publication at top conferences.
- Collaborate with internal research staff as well as academics at top universities (including Stanford, UC Berkeley, CMU, or MIT).
- Leverage our compute cluster to run experiments at scale on large language models.
- Fine tuning large-scale transformers and evaluating them under different data domains
- Creating and designing new datasets to evaluate the robustness of different models
- Scaling machine learning systems to thousands of GPUs
- Evaluating models in sequential decision-making games
- Developing and launching ML competitions (e.g., Trojan Detection Challenge)
- Collaborating with academics on research ranging from transparency, proxy gaming, honest AI, interpretable uncertainty, and so on
- Ph.D. in computer science, machine learning, or a related field, with 5+ years of related research experience
- Familiar with relevant frameworks and libraries (e.g., pytorch and huggingface)
- Have experience launching and training distributed ML jobs
- Communicate clearly and promptly with teammates
- Have co-authored an NLP or RL paper in a top conference
The Center for AI Safety is a non-profit dedicated to ensuring the safety of future artificial intelligence systems. We believe that artificial intelligence will be a powerful technology which will dramatically change society and that AI safety must therefore be pursued proactively. To this end, we conduct research into machine learning safety and facilitate field-building projects which accelerate the growth of the safety community.
Join us in steering the future of AI.
For this role we are considering research scientists with salary pay ranges of 140-180K.
Benefits- Health insurance for you and your dependents
- 401(k) plan with 4% matching
- Unlimited PTO
- Free lunch and dinner at the office
- Commuter Benefit program
- Annual learning & development stipend
If you have any questions about the role, feel free to reach out to hiring.
The Center of AI Safety is an equal opportunity employer and all qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability status, protected veteran status, or any other characteristic protected by law.
Some studies have found that a higher percentage of women and underrepresented minority candidates won't apply if they don't meet every listed qualification. The Center for AI Safety values candidates of all backgrounds. If you find yourself excited by the position but you don't check every box in the description, we encourage you to apply anyway!
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).