Senior Software Engineer, Research
Listed on 2026-01-20
-
Software Development
AI Engineer
Job type:
Full Time
· Department:
Engineering
· Work type:
Hybrid
· USD 175000
-220000 / year
Mountain View, California, United States
About Us ,
We’re Building the Most Impactful Healthcare Company on Earth
We believe that access to a great doctor is a basic human right. Today, that’s not a reality. Delays, misdiagnoses, administrative chaos, and burnout plague the system.
Our Mission: One Human, One Doctor. We build AI teammates that augment clinicians — scribes, nurses, receptionists, translators — all powered by our own world-class models and deployed in real-world care.
Our Traction
450+ organizations signed 16 months
AI agents cut admin by ~2.8 hours daily and reduce onboarding 85%.
5M+ Clinical Tasks completed to date, serving 36+ specialties.
Raised $25M from YC, Eric Yuan, Amity, Semper Virens
Patented AI architecture (Med Con-1)
outperforms GPT-4.5, Gemini, Claude on clinical reasoning tasks
Sully requires A-players capable of 4 months = 1 year output.
What You’ll DoBuild and optimize core research infrastructure: evaluation pipelines, agent workflows, hallucination detectors, coding benchmarks, and research→production integrations.
Design, implement, and scale agentic systems across backend, frontend, and model integrations, collaborating closely with research and co-founders.
Own reliability, observability, and performance across agents (logging, tracing, instrumentation, safety checks).
Ship research-proven features into production within 7 days, end-to-end.
Develop shared tools, SDKs, and internal products that accelerate iteration across Research, QA, and Engineering.
What You Must BringSenior-level full-stack engineering experience in React, Type Script, and Node.js
.
Proven ability to design, ship, and scale LLM-powered applications
.
Expertise in API design, streaming, and CI/CD pipelines
.
Strong cloud infrastructure background (
AWS, GCP, or Azure
).
Track record of building reliable systems with measurable performance and error budgets.
First-Month FocusAudit all cross-agent flows for UI/UX consistency, correctness, and performance gaps.
Implement shared components, typed schemas, and contract-driven interfaces for reliability.
Establish instrumentation for frontend performance, agent consistency, latency, and model round-trip tracing.
Improve or replace brittle evaluation or agent pipelines identified during onboarding.
Partner with Research to product ionize at least one new capability.
90 Day OKRsDeliver production-grade agentic workflows with
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).