Applied Scientist, AI Evaluation Platform
Job in
Seattle, King County, Washington, 98113, USA
Listed on 2026-02-28
Listing for:
Apple Inc.
Full Time
position Listed on 2026-02-28
Job specializations:
-
Software Development
AI Engineer, Machine Learning/ ML Engineer, Data Scientist
Job Description & How to Apply Below
It's the diversity of our people and their thinking that inspires the innovation that runs through everything we do. When we bring everybody in, we can do the best work of our lives. Here, you'll do more than join something - you'll add something.
Our team, part of Apple Services Engineering, is looking for an Applied Scientist to lead the design and continuous development of automated benchmarking methodologies for AI-powered code assistant tools. In this role, you will investigate how coding-focused LLM agents behave, create rigorous evaluation frameworks, and establish scientific standards for assessing their quality and reliability. This role is crafted to enable the development of scalable evaluation frameworks that ensure our engineers have the right tools to create products that surprise and delight our customers.
The successful candidate will have a proactive approach with the ability to work independently and collaboratively on a wide range of projects. In this role, you will work alongside a small but impactful team, collaborating with ML and data scientists, software developers, project managers and other teams at Apple to understand requirements and translate them into scalable, reliable, and efficient evaluation frameworks.
Publications in ML evaluation or related fields.
Experience with automated testing frameworks. Experience constructing human-in-the-loop or multi-turn evaluation setups. Prior work on agentic developer tools.
Advanced degree (MS or PhD) in Computer Science, Software Engineering, or equivalent research/work experience. Strong research background in empirical evaluation, experimental design, or benchmarking. Strong proficiency in Python. Intermediate proficiency in Swift. Deep familiarity with software engineering workflows and developer tools. Experience working with or evaluating AI/ML models, preferably LLMs or program synthesis systems. Strong analytical and communication skills, including the ability to write clear reports.
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×