×
Register Here to Apply for Jobs or Post Jobs. X

Founding Engineer, ML Infrastructure

Job in San Francisco, San Francisco County, California, 94199, USA
Listing for: Reactor
Full Time position
Listed on 2026-03-01
Job specializations:
  • IT/Tech
    Systems Engineer, Cloud Computing
Salary/Wage Range or Industry Benchmark: 200000 - 250000 USD Yearly USD 200000.00 250000.00 YEAR
Job Description & How to Apply Below

We're looking for a Founding Infrastructure Engineer with deep expertise in building and scaling cloud-native systems. This is a highly technical, high-impact role focused on designing and evolving the foundation that powers our AI platform.

You'll work across the entire infrastructure stack, from GPU orchestration to networking and observability, ensuring our systems are reliable, performant, and cost-efficient. You'll shape the architecture that supports large-scale AI workloads, set best practices for how we operate, and establish the infrastructure patterns that will carry Reactor forward over the next 1, 2, and 5 years.

We want to build a world‑class infrastructure platform for serving AI at scale, and you'll own this critical part of our stack.

What You'll Do
  • • Design and scale the infrastructure for real‑time AI inference, delivering ultra‑low latency, high throughput, and cost efficiency.
  • • Orchestrate GPUs and manage multi‑tenant workloads with Kubernetes, service mesh, and global traffic routing.
  • • Build and operate core systems including Terraform‑based infrastructure, Kubernetes, observability (Prometheus, Grafana), distributed storage, and networking.
  • • Implement cross‑cutting capabilities: authentication, rate limiting, monitoring, alerting, and telemetry for inference systems.
  • • Define the roadmap for infrastructure growth, making tradeoffs across performance, reliability, and cost.
  • • Partner closely with ML engineers to product ionize and optimize model serving pipelines.
Required Skills
  • • Proven experience in infrastructure engineering, Dev Ops, or ML platform engineering.
  • • Deep expertise in Kubernetes at scale, GPU orchestration, service mesh, and cloud‑native automation.
  • • Experience designing and operating global load balancing and high‑availability traffic routing.
  • • Fluency in infrastructure‑as‑code, modern CI/CD, and observability stacks.
  • • Strong systems background: distributed systems, performance tuning, caching, concurrency, and cost optimization.
  • • Hands‑on experience with ML inference serving frameworks (e.g., Triton, ONNX Runtime, vLLM, Tensor

    RT).
  • • Solid understanding of cloud security and data management strategies for inference workloads.
  • • Startup mindset: thrive in fast‑paced environments, embrace ambiguity, and own projects end to end.
Logistics

We are based in‑person in San Francisco. We believe the best ideas and work come from being together.

  • • Competitive San Francisco salary and meaningful early equity.
  • • We sponsor visas. We are committed to working through the process together for the right candidates. If you're currently outside the US, we're also committed to helping you relocate to the US throughout this process.
  • • We offer generous health, dental, and vision coverage, and relocation support as needed.

If this sounds like you, we'd love to hear from you.

#J-18808-Ljbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary