Senior/Applied Scientist, Multimodal Representation Learning; Oncology
Listed on 2026-03-01
-
Software Development
AI Engineer, Machine Learning/ ML Engineer, Data Scientist
Location: New York
Drug development shouldn’t be guesswork, not when patients are waiting.
Pathos is building a next-generation biotech with AI at the core. Not as a feature, but as the operating system for how medicines get developed. We believe most drugs don’t fail because the science was wrong. They fail because they were tested in the wrong patients, with the wrong assumptions, in trials that couldn’t answer the real question: who benefits, and why?
Pathos exists to change that. We’re building the largest foundation model in oncology and pairing it with proprietary AI systems, deep oncology expertise, and 200+ petabytes of multimodal data linked to patient outcomes, so we can make development decisions with more precision, much earlier.
This is not theoretical. We’re well-capitalized and have the leadership to build a generational company. We invest in and advance our own clinical-stage programs, using our AI platform to sharpen trial design, patient selection and biomarker strategy. So therapies reach the patients most likely to benefit, sooner.
If you’re driven by purpose, energized by complexity, and want to apply AI, biology, or both to redefine the future of drug development, come build Pathos with us.
About the role:Where Frontier AI Meets Frontier Biology to Deliver Frontier Medicine
We are hiring specialized scientists to accelerate development of our Oncology Foundation Model (OFM) stack. This is not a generic “model tinkering” role. The person in this seat will help define and build the modeling strategy that turns multimodal oncology data (clinical text/EHR, genomics, transcriptomics, pathology imaging, and derived features) into useful representations and predictive capabilities that directly support drug discovery and development.
You’ll operate at the intersection of:
- Frontier AI (representation learning, multimodal learning, alignment, evaluation)
- Messy biomedical reality (clinical endpoints, censoring, confounding, missingness, batch effects)
- Mechanism + translation (models that can be interrogated, stress-tested, and connected to biology and outcomes)
This role complements (not duplicates) the computational biology roles that focus on our program-facing biomarker analyses and trial decisions.
What You Will Do- Design and implement multimodal pretraining and fine-tuning strategies for oncology data (e.g., contrastive objectives, masked modeling, multitask learning, retrieval-augmented training, late/early fusion variants).
- Build model components that improve cross-modality grounding (e.g., aligning clinical narratives with molecular state and pathology signals).
- Develop robust approaches for missing-modality settings (train-time and inference-time), ensuring the OFM remains useful when only subsets of modalities exist.
- Work with domain partners to define prediction targets and representation tests that matter: response, durability, toxicity, survival, progression, resistance, subtype stability, etc.
- Incorporate oncology-specific realities into modeling and evaluation (censoring, treatment lines, temporal leakage, cohort shift, annotation noise).
- Create evaluation harnesses that go beyond leader board metrics: ablations, cohort-shift tests, missingness stress tests, temporal generalization, calibration, and failure-mode analysis.
- Define and maintain benchmark suites that reflect Pathos priorities and are reproducible across model iterations.
- Partner with engineering to support scalable training/inference (multi-node GPU training, data pipelines, throughput optimization), while keeping scientific intent front-and-center
- Package model outputs so they can be consumed by internal science teams: embeddings, uncertainty estimates, interpretable signals, retrieval tools, and model cards that clearly state what’s reliable vs. not.
- Collaborate with computational biologists, translational scientists, and clinicians to ensure the OFM supports mechanism discovery and patient stratification workflows
Minimum Qualifications
- Advanced degree (PhD strongly preferred) in ML/AI, CS, Statistics, Computational Biology, Bioinformatics, or a related…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).