Data Scientist | Clinical AI Job Palo Alto area,California USA,IT/Tech

Overview

Machinify is a leading healthcare intelligence company with expertise across the payment continuum, delivering value, transparency, and efficiency to health plan clients. We are growing quickly and build machine learning models to identify erroneous healthcare payments and reduce costs. We are hiring a Data Scientist to advance our AI system for clinical criteria evaluation, translating policy requirements into correct, testable code (SQL and Python), and building measurement and quality systems that improve an AI pipeline extracting structured clinical signals from medical records.

Responsibilities

Translate medical policy into executable logic - Read and interpret medical policies and clinical criteria (e.g., lab thresholds, temporal windows, trend logic, exclusions).
Convert requirements into correct, maintainable SQL and Python implementations (e.g., creatinine-based AKI rules, bilirubin thresholds, troponin dynamics, ABG-derived criteria).
Design rule representations that are composable and auditable (clear inputs, outputs, assumptions, edge cases).
Prompt engineering and system parameter tuning for AI configuration that extracts clinical information from medical records.
Build robust clinical feature pipelines.
Create and maintain pipelines that compute clinical features from extracted signals (labs, vitals, flow sheets, notes-derived facts).
Handle tricky realities: missing timestamps, multiple measurement sources, unit normalization, deduplication, conflicting values, provenance tracking.
Own measurement, evaluation, and continuous quality improvement.
Define and instrument accuracy metrics for the AI system that extracts data from medical records.
Build gold datasets, sampling strategies, and review workflows with clinical/operations partners.
Perform error analysis, identify root causes (retrieval failures, OCR issues, extraction ambiguity, policy interpretation gaps), and drive improvements.
Establish engineering frameworks and tooling.
Create reusable tooling for policy-to-code translation: templates, test harnesses, validation suites, regression checks, and monitoring dashboards.
Improve infrastructure for large-scale runs: orchestration, logging, lineage, versioning, and reproducibility.
Implement guardrails and QA gates so policy logic changes are safe, traceable, and measurable.
Partner deeply with domain experts.
Work with clinicians, policy specialists, and operations to clarify ambiguous requirements and ensure implementations reflect real-world intent.
Produce clear documentation that explains what the code is doing and why, with examples and edge-case handling.

Qualifications

Strong SQL and Python engineering skills - Ability to translate nuanced requirements into correct SQL (CTEs, window functions, joins at scale, performance tuning) and production-quality Python.

- Experience building testable pipelines, not just ad hoc analysis.
Experience operationalizing rules + models - Track record of implementing complex business/clinical logic and deploying it reliably.

- Comfort working with imperfect, messy, high-volume datasets.
Evaluation/Metric mindset - Experience designing metrics, building ground truth, running experiments, and improving system quality through structured iteration.

- Ability to connect technical quality measures to business outcomes (e.g., accuracy vs reviewer burden vs downstream decisions).
Systems thinking and rigor - You build frameworks that make other engineers/scientists faster: shared libraries, patterns, tooling, and clear interfaces.
You sweat details - edge cases, provenance, temporal logic, unit conversions, and regression safety.
Healthcare curiosity - Interest in medical records, clinical data, and how policies translate into decision criteria.

- Prior healthcare experience is a plus, but not required if you bring the aptitude and motivation.

Nice to Have

Experience with clinical data standards or lab data normalization (LOINC familiarity, units conversion, reference ranges).
Experience evaluating LLM/IE systems (information extraction) or building human-in-the-loop QA workflows.
Familiarity with distributed data systems (Spark, Big Query/Snowflake, Databricks) and workflow orchestrators.

Why This Role

If you enjoy turning ambiguous real-world policies into precise code, building quality systems that make AI reliable, and operating at the boundary of clinical domain nuance and production engineering, this role is designed for you.

Equal Employment Opportunity

Machinify is committed to hiring talented and qualified individuals with diverse backgrounds for all of its positions. Machinify believes that the gathering and celebration of unique backgrounds, qualities, and cultures enriches the workplace.

See our Candidate Privacy Notice at:

#J-18808-Ljbffr


Increase/decrease your Search Radius (miles)



Job Posting Language