Engineering Manager, HPC Storage
Listed on 2026-03-01
-
Software Development
Software Engineer, AI Engineer, Machine Learning/ ML Engineer
Zoox is looking for an experienced Software Engineering Manager to lead our High Performance Computing Storage infrastructure team. Zoox HPC Storage provides abstraction layers for petabyte-scale data movement and management for critical, high-throughput use cases, such as ML foundation model training, synthetic data generation, and more. You will take on a breadth of end-to-end responsibilities, including distributed system design, optimization of storage-related GPU utilization bottlenecks, and cost-effective resource management.
The position comes with a high degree of independence and the opportunity to help define Zoox’s scaling strategy, both technically and organizationally. You will be responsible for hiring and maintaining the health of your team, as well as growing and coaching them to support the continued success of their careers.
In this role, you will:- Work closely with AI teams and other software customers to holistically address pain points, find optimization opportunities, and ultimately chart systems-solutions for broad categories of storage use cases
- Develop a multi-year vision and roadmap for storage at Zoox, including investment into new data movement and management paradigms to meet Zoox’s ever growing computational and storage needs in a cost-effective manner
- Own the hiring process end-to-end, from thoughtful role definition to interview loop design to successfully hiring bar raisers
- Mentor, coach, and advocate for your direct reports
- Experience managing teams of 5-10
- Demonstrated ability to prioritize development work and build cross-functional consensus across ML stakeholders
- Experience with high performance storage systems deployed on cloud providers, such as FSx for Lustre on Amazon Web Services (AWS)
- Strong operational background with highly available systems
- Bachelor's degree in computer science (or related field)
Qualifications:
- Experience with ML-specific data formats such as Mosaic Streaming Datasets (MDS)
- Experience with end-to-end hosted ML services such as AWS Sage Maker Hyper Pod
- Proficiency with Python, Java, or other managed languages
$230,000 - $285,000 a year
Base Salary RangeThere are three major components to compensation for this position: salary, Amazon Restricted Stock Units (RSUs), and Zoox Stock Appreciation Rights. A sign‑on bonus may be offered as part of the compensation package. The listed range applies only to the base salary. Compensation will vary based on geographic location and level. Leveling, as well as positioning within a level, is determined by a range of factors, including, but not limited to, a candidate's relevant years of experience, domain knowledge, and interview performance.
The salary range listed in this posting is representative of the range of levels Zoox is considering for this position.
Zoox also offers a comprehensive package of benefits, including paid time off (e.g. sick leave, vacation, bereavement), unpaid time off, Zoox Stock Appreciation Rights, Amazon RSUs, health insurance, long‑term care insurance, long‑term and short‑term disability insurance, and life insurance.
About ZooxZoox is developing the first ground‑up, fully autonomous vehicle fleet and the supporting ecosystem required to bring this technology to market. Sitting at the intersection of robotics, machine learning, and design, Zoox aims to provide the next generation of mobility‑as‑a‑service in urban environments. We’re looking for top talent that shares our passion and wants to be part of a fast‑moving and highly execution‑oriented team.
Follow us on Linked In
AccommodationsIf you need an accommodation to participate in the application or interview process please reach out to or your assigned recruiter.
A FinalNote:
You do not need to match every listed expectation to apply for this position. Here at Zoox, we know that diverse perspectives foster the innovation we need to be successful, and we are committed to building a team that encompasses a variety of backgrounds, experiences, and skills.
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).