
AI Evaluations Engineer – Healthcare

Job in San Francisco, San Francisco County, California, 94199, USA
Listing for: Ellipsis Health
Full Time position
Listed on 2026-02-07
Job specializations:
  • Software Development
    AI Engineer, Software Engineer
Salary/Wage Range or Industry Benchmark: 150,000 - 180,000 USD yearly

Location: Remote, located in the US
Type: Full-time
Department: Engineering
Reports to: Director of Engineering

Responsibilities
  • Build and maintain infrastructure and tooling for the AI evaluations platform used by internal teams, including an automated testing platform for AI voice agents and tools for debugging and observability.
  • Develop and productionize evaluation frameworks for individual system components such as ASR, LLMs, TTS, knowledge bases, and guardrails.
  • Partner with ML, engineering, and QA teams to translate evaluation requirements into robust, maintainable infrastructure and tooling.
  • Improve developer experience by making evaluation systems easy to extend, well-documented, and reliable in day-to-day use.
  • Ensure evaluation tooling meets production standards for reliability, performance, and maintainability.
Qualifications
  • 5+ years of professional software engineering experience, with a strong focus on building backend systems, platforms, or developer tooling.
  • Proven experience designing and maintaining production-grade infrastructure with code, including APIs, services, and data pipelines.
  • Experience using test automation frameworks, evaluation pipelines, or CI/CD-integrated testing systems.
  • Familiarity with observability and debugging tools (logging, metrics, tracing) and building internal tools that improve developer and QA workflows.
  • Strong debugging skills and a methodical approach to diagnosing production and evaluation issues.
  • Ability to collaborate effectively across engineering, QA, and operations teams, translating requirements into reliable, maintainable systems.
  • Product-minded approach to infrastructure, with attention to usability, documentation, and long-term maintainability.
Preferred
  • Experience working with complex, multi-component systems (e.g., ASR, LLMs, TTS, or other ML-powered services).
  • Experience working in healthcare or other regulated environments, including awareness of HIPAA and PHI handling.
  • Familiarity with conversational AI or voice agents, including multi-turn dialogue, latency constraints, and error recovery.
  • Familiarity with LLM observability or evaluation tools (e.g., Langfuse, prompt eval frameworks).
  • Background in digital health, care coordination, or patient-facing systems.
  • As a health technology company, we reserve the right to run background checks on candidates to whom we extend offers, in compliance with applicable laws. We evaluate candidates holistically and comply with all “ban the box” regulations.
Salary and Benefits
  • We offer competitive salary and benefits, including 401(k) matching, health, vision, and dental insurance, and very flexible paid time off.
  • The typical salary range for this role is $150,000 to $180,000 USD, depending on skills, qualifications, and relevant experience.
Assistance
  • If you have a disability or require accommodations during the application or recruitment process, please contact .