Machine Learning Engineer, Foundation Model
Listed on 2026-01-17
-
IT/Tech
Robotics, Machine Learning/ ML Engineer -
Engineering
Robotics
Machine Learning Engineer, Foundation Model
San Jose, CA
About The CompanyDiDi's autonomous driving unit was established in 2016 with the mission of developing Level 4 autonomous driving (AD) technology to make transportation safer and more efficient. In August 2019, the unit became an independent company, DiDi Autonomous Driving, dedicated to advanced AD R&D, product application, and business expansion. We believe integrating AD technology into a shared-mobility fleet will generate immense social value.
By leveraging DiDi's specialized technology, operational expertise, and integrated ecosystem, we are positioned to build and operate a highly efficient, user-oriented autonomous fleet.
The Role
The Foundation Model Team focuses on building large-scale foundation models for multi-agent behavior prediction and autonomous vehicle planning
. By leveraging DiDi Voyager’s unparalleled driving data, we train highly robust and generalizable deep learning systems that enable safe and intelligent autonomous driving at scale.
Our models serve as the core intelligence of the autonomous driving stack, enabling vehicles to understand complex traffic scenarios, anticipate agent behavior, and make safe and efficient driving decisions.
We operate at the intersection of large-scale machine learning, autonomous driving, and foundation model research
, pushing the frontier of multi-agent prediction and planning.
- Design and train large-scale deep learning models for multi-agent trajectory prediction, behavior and intent prediction, and planning and decision-making.
- Build foundation model architectures (Transformers, Diffusion, Flow-based models, Decision models, VLM/VLA).
- Develop scalable training pipelines across hundreds to thousands of GPUs.
- Work with massive real-world datasets and build high-quality data pipelines.
- Optimize models for latency, reliability, and on-vehicle deployment.
- Collaborate closely with perception, mapping, simulation, and systems teams.
- Drive research ideas into production systems used by real autonomous vehicles.
- Strong background in machine learning, deep learning, or robotics.
- Experience with PyTorch / JAX / Tensor Flow.
- Solid understanding of modern neural architectures (transformers, diffusion, auto-regressive).
- Strong coding skills in Python and C++.
- Passion for building real-world, safety-critical AI systems.
- BS, MS or PhD in Computer Science, Machine Learning, Robotics, or a related field.
- Experience in autonomous driving, robotics, or embodied AI.
- Experience training large models on distributed GPU clusters.
- Experience with trajectory prediction, planning, or decision-making systems.
- Publications in top ML / robotics conferences (NeurIPS, ICML, ICLR, CVPR, RSS, CoRL, etc.).
The base salary range for this position is $129,189-$247,038 annually in addition to bonus, equity and benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training.
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).