×
Register Here to Apply for Jobs or Post Jobs. X

Machine Learning Scientist​/Sr Scientist, Federated Benchmarking & Validation Engineering

Job in Indianapolis, Hamilton County, Indiana, 46262, USA
Listing for: BioSpace
Full Time position
Listed on 2026-01-13
Job specializations:
  • IT/Tech
    Data Scientist, Data Engineer
Salary/Wage Range or Industry Benchmark: 151500 - 244200 USD Yearly USD 151500.00 244200.00 YEAR
Job Description & How to Apply Below
Location: Indianapolis

Machine Learning Scientist/Sr Scientist, Federated Benchmarking & Validation Engineering

Purpose:

Lilly Tune Lab is an AI-powered drug discovery platform that provides biotech companies with access to machine learning models trained on Lilly's extensive proprietary pharmaceutical research data. Through federated learning, the platform enables Lilly to build models on broad, diverse datasets from across the biotech ecosystem while preserving partner data privacy and competitive advantages.

This role centers on constructing robust validation frameworks for federated models, creating privacy-preserving test sets across partner datasets, establishing standardized benchmarks against public datasets, and ensuring model reproducibility and generalization in diverse deployment scenarios.

Key Responsibilities
  • Federated Test Set Design:
    Architect and implement privacy-preserving protocols for constructing representative test sets across distributed partner datasets, ensuring statistical validity while maintaining data isolation.
  • Benchmark Suite Development:
    Create comprehensive benchmark suites covering small molecules, antibodies, and RNA therapeutics.
  • Cross-Domain Validation:
    Develop validation strategies that assess model generalization across different experimental protocols, cell lines, species, and therapeutic indications while respecting partner data boundaries.
  • Public Dataset Integration:
    Systematically benchmark federated models against public datasets to establish performance baselines and identify gaps.
  • Validation Frameworks:
    Implement proper scaffold-split validation protocols that assess model performance on prospective data.
  • Reproducibility Infrastructure:
    Build robust MLOps pipelines ensuring complete reproducibility of federated experiments, including versioning of data snapshots, model checkpoints, and hyperparameter configurations.
  • Statistical Rigor:
    Design statistically powered validation studies accounting for multiple testing, hierarchical data structures, and non-independent observations common in drug discovery datasets.
  • Performance Profiling:
    Develop comprehensive performance profiling across diverse molecular scaffolds, target classes, and property ranges, identifying systematic biases and failure modes.
  • Platform Integration:
    Collaborate with engineering teams to integrate validation frameworks with the Tune Lab federated learning platform built on NVIDIA FLARE.
Basic Qualifications
  • PhD in Computational Biology, Bioinformatics, Cheminformatics, Computer Science, Statistics, or related field from an accredited college or university.
  • Minimum of 2 years of experience in the biopharmaceutical industry or related fields, with demonstrated expertise in drug discovery and early development.
  • Strong foundation in experimental design, statistical validation, and hypothesis testing.
  • Experience with ML model validation, cross-validation strategies, and performance metrics.
  • Proficiency in data engineering, pipeline development, and automation.
Additional Preferences
  • Experience with federated learning platforms and distributed computing.
  • Knowledge of regulatory requirements for AI/ML in pharmaceutical development.
  • Expertise in ADMET assay development and validation.
  • Understanding of antibody engineering and characterization methods.
  • Familiarity with RNA therapeutic design and delivery systems.
  • Experience with clinical biomarker validation and translational research.
  • Proficiency in workflow orchestration tools.
  • Strong knowledge of containerization and cloud computing.
  • Publications on model validation, benchmarking, or reproducibility.
  • Experience with GxP compliance and quality management systems.
  • Exceptional attention to detail and commitment to scientific rigor.
  • Strong technical writing skills for regulatory documentation.
Location & Travel

This role is based at a Lilly site in Indianapolis, South San Francisco, or Boston with up to 10% travel (attendance expected at key industry conferences). Relocation is provided.

Equal Opportunity

Lilly is a proud EEO Employer and does not discriminate on the basis of age, race, color, religion, gender identity, sex, gender expression, sexual orientation, genetic information, ancestry, national origin, protected veteran status, disability, or any other legally protected status. Lilly is dedicated to helping individuals with disabilities to actively engage in the workforce. If you require accommodation to submit a resume for a position at Lilly, please complete the accommodation request form.

Compensation

Anticipated wage: $151,500 - $244,200. Full-time employees also eligible for company bonus and comprehensive benefit program.

#J-18808-Ljbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary