Lead ML Engineer
Listed on 2026-02-28
-
IT/Tech
AI Engineer, Machine Learning/ ML Engineer
This is OUR story... and YOUR next chapter
At HCA Healthcare, our Digital Transformation and Innovation (DT&I) team is redefining what’s possible inpatient care. By leveraging the power of artificial intelligence, automation, and digital technologies, DT&I is helping drive meaningful improvements in clinical outcomes, reduce manual workload, and expand the reach of our care teams. If you're passionate about using technology to improve human life, this is where your work truly matters
What you will accomplish in this roleThe Lead Machine Learning Engineer will build and scale the core platform infrastructure supporting enterprise ML and Generative AI initiatives. This role focuses on developing production-grade AI systems, robust Python SDKs, and cloud-native architectures to enable reliable, secure, and cost-efficient deployment of ML and LLM applications.
Working across MLOps and LLMOps, the engineer will drive automation, observability, governance, and scalable model serving within GCP environments, ensuring AI solutions are resilient, high-performing, and enterprise-ready.
What you will do in this role:- Architect and build scalable ML/LLM platform infrastructure to support multi-team AI development and deployment.
- Design and maintain robust Python-based SDKs, reusable frameworks, and internal AI tooling.
- Lead end-to-end MLOps and LLMOps platform implementation (CI/CD, model registry, feature store integration, evaluation pipelines).
- Build standardized GenAI application frameworks (RAG orchestration, prompt pipelines, evaluation harnesses, guardrails).
- Develop scalable model serving and inference infrastructure optimized for latency, throughput, and cost efficiency.
- Implement enterprise-grade observability (logging, tracing, monitoring, drift detection, usage tracking).
- Deploy and manage AI workloads on GCP using containerized, Kubernetes-based, and serverless architectures.
- Establish governance, security, and compliance standards for ML and LLM systems.
- Drive infrastructure-as-code and automation practices for reproducible AI environments.
- Partner with security, data, and Dev Ops teams to ensure reliability and platform resilience.
- Bachelor's degree - Required
- Master's degree - Preferred
- 7+ years of experience in software engineering with a focus on ML Engineering and AI Engineering or platform engineering - Preferred
- Expert-level Python proficiency with experience building production SDKs and scalable backend systems - Required
- Strong experience designing and scaling GenAI platforms (RAG systems, vector DBs, embedding pipelines, LLM orchestration layers) - Required
- Deep hands-on experience with MLOps & LLMOps tooling (Vertex AI, MLflow, Kubeflow, model registry, CI/CD automation) - Required
- Strong background in GCP (GKE, Cloud Run, Vertex AI, Big Query, Pub/Sub, IAM, networking) - Preferred
- Strong understanding of API design, microservices architecture, and scalable backend engineering - Required
- Experience implementing monitoring/observability frameworks (Prometheus, Open Telemetry, logging pipelines) - Preferred
- Deep hands-on experience with Terraform or other IaC tools – Preferred
Schedule:
- Remote - M-F, 8am – 5pm - Central Time
- This job requires travel to Nashville, TN to attend final interview, 3-day New Hire Orientation, quarterly team meetings, and other travel on as-needed basis
- Not Offered, now or in the future
At HCA Healthcare, we are committed to fostering a culture of growth that allows you to build the career of a lifetime. We encourage you to apply for our Lead AI Engineer today. We review all applications promptly, and qualified candidates will be contacted to continue the process. Join us!
We are an equal opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).