More jobs:
AI Research Scientist, Evaluations - Meta Superintelligence Lab
Job in
Menlo Park, San Mateo County, California, 94029, USA
Listed on 2026-02-24
Listing for:
Meta Platforms, Inc.
Full Time
position Listed on 2026-02-24
Job specializations:
-
IT/Tech
Data Scientist, AI Engineer
Job Description & How to Apply Below
You'll work in tandem with world-class researchers to envision, develop, and validate novel evaluations that shape the future of AI capability measurement. This is a technical research role requiring good scientific judgment, creativity, and the ability to drive ambitious research agendas with independence. The evaluations you develop will directly influence research direction and major model lines within MSL, making scientific validity, methodological rigor, and clear communication important.
You will collaborate closely with technical leadership to ensure evaluations capture the most important capabilities, translating organizational priorities into measurable benchmarks, and translating evaluation insights back into research direction. We are looking for exceptional research talent - researchers who have shaped the field of machine learning, and are ready to do so again at the frontier of AI. If you are passionate about defining how we measure AI progress and want to shape the scientific foundations of frontier AI development, we encourage you to apply for this exciting opportunity at the core of MSL.
Minimum Qualifications
* Bachelor's degree in Computer Science, Computer Engineering, relevant technical field, or equivalent practical experience
* PhD degree in Computer Science, Machine Learning, or a related technical field
* 3+ years of experience in machine learning engineering, machine learning research, or a related technical role
* Proficiency in Python and experience with ML frameworks such as Py Torch
* Experience identifying, designing and completing medium to large technical features independently, without guidance
* Proven success in software engineering practices including version control, testing, and code review practices
* Ability to work independently and adapt to rapidly changing priorities
Preferred Qualifications
* Publications at peer-reviewed venues (NeurIPS, ICML, ICLR, ACL, EMNLP, or similar) related to language model evaluation, benchmarking, or deep learning
* Hands-on experience with language model post-training and deep learning systems, or building reinforcement learning environments
* Experience implementing or developing evaluation benchmarks for large language models and multimodal models (e.g., vision-language, audio, video)
* Experience working with large-scale distributed systems and data pipelines
* Familiarity with language model evaluation frameworks and metrics
* Track record of open-source contributions to ML evaluation tools or benchmarks
Responsibilities
* Curate and integrate publicly available and internal benchmarks to direct the capabilities of frontier model development
* Develop and implement evaluation environments, including environments for novel model capabilities and modalities
* Collaborate with external data vendors to source and prepare high-quality evaluation datasets
* Execute on the technical vision of research scientists designing new benchmarks and evaluations
* Build robust, reusable evaluation pipelines that scale across multiple model lines and product areas
* Contribute to evaluation tooling that measures the quality and reliability of evaluation suites
About Meta
Meta builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed the way people connect. Apps like Messenger, Instagram and Whats App further empowered billions around the world. Now, Meta is moving beyond 2D screens toward immersive experiences like augmented and virtual reality to help build the next evolution in social technology. People who choose to build their careers by building with us at Meta help shape a future that will take us beyond what digital connection makes possible today-beyond the constraints of screens, the limits of distance, and even the rules of physics.
Equal Employment Opportunity
Meta is proud to be an Equal Employment Opportunity employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, reproductive health decisions, or related medical conditions), sexual orientation, gender identity, gender expression, age, status as a protected veteran, status as an individual with a disability, genetic information, political views or activity, or other applicable legally protected characteristics.
You may view our Equal Employment Opportunity notice here.
Meta is committed to providing reasonable accommodations for qualified individuals with disabilities and disabled veterans in our job application procedures. If you need assistance or an accommodation due to a…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×