×
Register Here to Apply for Jobs or Post Jobs. X

Research Scientist, AI​/ML Foundational Models

Job in Boston, Suffolk County, Massachusetts, 02298, USA
Listing for: Takeda
Full Time position
Listed on 2026-01-18
Job specializations:
  • IT/Tech
    Artificial Intelligence, Data Scientist
Job Description & How to Apply Below

By clicking the “Apply” button, I understand that my employment application process with Takeda will commence and that the information I provide in my application will be processed in line with Takeda’s Privacy Notice and Terms of Use. I further attest that all information I submit in my employment application is true to the best of my knowledge.

Job Description

Title
:
Scientist, AI/ML – Foundational Models

Position Overview

We are seeking Scientists to develop and deploy foundational AI models that will transform drug discovery across Takeda. As part of the AI/ML Foundation team, you will build large-scale models including large language models (LLMs), diffusion models, and multimodal architectures that integrate diverse data types—omics, biomedical imaging, protein 3D structures, and molecular representations. This role requires deep expertise in modern deep learning architectures combined with foundational knowledge of biology, chemistry, and disease biology to ensure models are scientifically grounded and impactful.

You will train models from scratch, fine-tune pre-trained models for Takeda-specific applications, and deploy foundation model capabilities that accelerate discovery across all therapeutic platforms.

Key Responsibilities
  • Develop and train foundational AI models (LLMs, diffusion models, flow-matching architectures) for drug discovery applications, with capability to pre-train on large-scale scientific corpora and molecular datasets.
  • Fine-tune and adapt pre-trained foundation models (protein language models, chemical LLMs, vision transformers) for Takeda-specific applications in target identification, disease modeling, and molecular design and discovery.
  • Build multimodal foundation models integrating diverse data types including omics (genomics, transcriptomics, proteomics), biomedical imaging, protein 3D structures, and molecular representations.
  • Apply and extend state-of-the-art approaches including graph neural networks, transformer-based protein language models, and multimodal learning frameworks.
  • Apply domain expertise in biology, chemistry, and/or disease biology to guide model architecture decisions, training data curation, and evaluation strategies ensuring scientific validity.
  • Implement state-of-the-art generative architectures (diffusion, score-based models, autoregressive transformers) for molecular generation, protein design, and multi-objective optimization.
  • Collaborate with computational scientists across domains to deploy foundation models that address diverse discovery needs across small molecules, biologics, and emerging modalities.
  • Stay current with advances in foundation models, generative AI, and multimodal learning; contribute to internal knowledge sharing and external publications.
Qualifications

Required:

  • PhD degree in a scientific discipline (or equivalent), or
  • MS with 6+ years relevant experience, or BS with 8+ years relevant experience Deep expertise in modern deep learning architectures including transformers, diffusion models, and/or generative models.
  • Strong experience training large-scale models with proficiency in PyTorch and distributed training frameworks.
  • Foundational knowledge of biology, chemistry, or disease biology sufficient to guide scientifically meaningful model development.
  • Experience with at least one of: protein language models (ESM, Prot Trans), molecular generative models, or biomedical vision models.
  • Experience with cloud computing (AWS, GCP) and GPU cluster training at scale.

Preferred:

  • Experience building or fine-tuning foundation models in pharmaceutical or life sciences settings.
  • Expertise in multimodal learning integrating text, images, and structured molecular data.
  • Experience with omics data analysis (genomics, transcriptomics, proteomics) and knowledge graph
  • Familiarity with protein structure prediction and 3D molecular representations.
  • Publications in top-tier ML venues (NeurIPS, ICML, ICLR) or computational biology journals.
  • Experience with model compression, efficient inference, or production deployment of large models.
  • Strong background in large-scale data integration and multimodal modeling for biological systems.
  • Proficiency in Python and…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary