Senior Manager, Compute Bothell, Washington Boulder, Color
Listed on 2026-02-28
-
IT/Tech
Systems Engineer -
Engineering
Systems Engineer
Senior Manager, High Performance Compute
Bothell, Washington, United States;
Boulder, Colorado, United States;
College Park, Maryland, United States
IonQ is developing the world's most powerful full-stack quantum computer based on trapped‑ion technology. We are pushing past the limits of classical physics and current supercomputing technology to unlock a new era of computing. Quantum computing has the potential to impact every area of human society for the better. IonQ’s computers will soon redefine industries such as medicine, materials science, finance, artificial intelligence, machine learning, cryptography, and more.
IonQ is at the forefront of this technological revolution.
Before a quantum circuit runs on trapped ions, it often lives as a massive simulation on classical hardware. We are seeking a Senior Manager, High Performance Compute to lead the team responsible for the hybrid HPC computational platform that powers our physics simulations, hardware verification, and quantum algorithm development.
This is a Player‑Coach role for a technical leader who refuses to lose their edge. You will manage a team of talented engineers while remaining hands‑on with the technology. One day you might be hiring a HPC engineer and the next you might be debugging a race condition in a Slurm scheduler or profiling a simulation kernel for GPU efficiency. You will sit at the intersection of classical supercomputing and quantum simulations, building the hybrid infrastructure that allows us to push past the limits of classical physics.
Responsibilities- Lead, mentor, and grow a team of HPC engineers, fostering a culture of technical rigor where “it works” isn’t enough; it has to be performant.
- Own the strategy for our hybrid HPC environment, balancing workloads between on‑premise clusters and burst capacity in the cloud to maximize simulation throughput per dollar and create a fantastic user experience.
- Partner directly with quantum physicists and application teams to understand their simulation needs, translate complex scientific requirements into concrete infrastructure roadmaps, and deliver on those roadmaps.
- Manage relationships with hardware vendors and cloud providers, negotiating specialized compute instances such as H200s and high‑memory nodes required for our workloads.
- Architect and tune our job schedulers (leveraging Slurm) to handle massive, spiky workloads involving many concurrent simulation jobs.
- Dive deep into the stack to optimize I/O patterns, memory usage, and parallelization strategies.
- Build and maintain the “glue” that allows users to submit jobs seamlessly to on‑prem hardware or cloud clusters.
- Troubleshoot complex failures in simulation pipelines, distinguishing between infrastructure issues and algorithmic bugs, and providing a best‑in‑class user experience for submitting, running, and troubleshooting jobs.
- Bachelor’s degree in Computer Science, Physics, Engineering, or equivalent practical experience.
- 7+ years of HPC experience with deep expertise in Linux systems administration and cluster management.
- 3+ years of experience leading engineering teams, managing backlogs, and conducting performance reviews, with a desire to remain hands‑on.
- 3+ years of experience with Slurm, configuring fair‑share scheduling, backfill, and preemption.
- Proven experience deploying HPC clusters in the public cloud (AWS, Azure, or GCP) using tools like AWS Parallel Cluster, Batch, or equivalent.
- Strong proficiency in Python and Bash, treating infrastructure as code (Ansible, Terraform, Packer).
- 10+ years of HPC experience.
- 5+ years of experience in engineering management.
- Experience running and optimizing large‑scale scientific simulations (e.g., molecular dynamics, CFD, or electronic design automation).
- Understanding of MPI and GPU acceleration (CUDA/ROCm) frameworks.
- Background in Physics or experience with quantum simulation software (Qiskit, Cirq, or proprietary solvers).
- Experience with high‑performance parallel file systems (Lustre, GPFS/Spectrum Scale, or WEKA).
This is a hybrid role based at our office in College Park, MD, Bothell, WA, or Boulder, CO.
TravelUp to…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).