Senior Machine Learning Engineer - Healthcare Job Houston area,Texas USA,IT/Tech

Summary

The mission of The University of Texas M.

D. Anderson Cancer Center is to eliminate cancer in Texas, the nation, and the world through outstanding programs that integrate patient care, research, prevention, and education. Core to the success of our mission is the ability to orchestrate multidimensional data, data analytics, and machine learning to create sustainable impact within a framework of responsible AI. We are building a dynamic team of machine learning engineers and data scientists that can help us consistently and responsibly accelerate the impact of AI across the enterprise, driving long‑lasting improvements in cancer care.

We are actively seeking a Senior MLOps Engineer who will play a pivotal role in advancing MLOps initiatives across the enterprise. This role is critical for orchestrating an AI lifecycle management framework, encompassing the development, deployment, and maintenance of production‑quality machine learning models to support clinical and business operations. Additionally, the Senior MLOps Engineer will support the assessment and validation of external machine learning models and AI‑driven products.

The role extends beyond technical expertise, as it is also about forging team dynamics, cultivating a culture of innovation, and supporting processes and technological foundations necessary to accelerate strong MLOps practices across the enterprise.

Key Responsibilities

Oversee the lifecycle of AI models, encompassing training, evaluation, deployment, monitoring, and maintenance of production quality machine learning models, in compliance with standards and best practices.
Develop CI/CD pipelines for ML model training, deployment, and monitoring while upholding security, scalability, reliability, reproducibility, and performance.
Provide rigorous testing, versioning, and documentation, ensuring impact, risk mitigation, and reproducibility.
Develop and support a culture responsible AI by minimizing bias, enhancing fairness, and maximizing transparency in AI models.
Maintain diligent records of model development experiments, data and model lineage tracking, as well as data and model scorecards.
Engage with stakeholders to gather requirements, convey AI concepts understandably, and capture feedback.
Design fallback and decommissioning strategies for AI solutions to ensure operational continuity.
Support the evaluation and onboarding of third‑party machine learning models, ensuring they meet institutional standards, enhance institutional value, and minimize organizational risk.
Deliver training on AI solutions to enhance understanding and application across the organization.
Engage with technology trends, contribute to tech communities, and foster a culture of continuous learning and innovation.

Technical Expertise

Proficient in developing, deploying, and maintaining AI/ML algorithms in production environments.
Skilled in constructing scalable data pipelines, feature and artifact management, and analytics.
Experienced with MLOps tools and processes for data, code, and model management.
Strong proficiency in Python and either C++ or C#, with practical knowledge of Tensor Flow, PyTorch, and Scikit‑learn.
Knowledgeable about AI/ML platform infrastructure, including cloud and on‑premises architectures.
Familiar with cloud‑native tools, services, and computing environments (eg. Azure, AWS, GCP).
Proficient in Dev Ops practices and CI/CD pipelines, including Azure Dev Ops and Git Hub Actions.
Experienced with containerization using Docker and orchestration with Kubernetes, along with DAGs tools.

Analytical Expertise

Skilled in project management methodologies (SAFe agile, PRINCE2, Lean) for end‑to‑end AI/ML project lifecycle management, ensuring timely delivery, adherence to budget, and quality compliance.
In‑depth knowledge of AI/ML Model Lifecycle Management aligned with ISO standards for software and AI development.
Proficient in decision‑making, problem‑solving, and executing AI/ML healthcare solutions.
Skilled at the quantitatively assessing machine learning models for performance, workflow impact, and potential risks.
Adept at collaborating with vendors and partners for evaluating and…


Increase/decrease your Search Radius (miles)



Job Posting Language