Founding Engineer, ML Inference

Job in San Francisco, San Francisco County, California, 94199, USA
Listing for: Reactor
Full Time position
Listed on 2026-03-01
Job specializations:
  • IT/Tech
    AI Engineer, Systems Engineer, Data Engineer, Data Scientist
Salary/Wage Range or Industry Benchmark: 150000 - 200000 USD Yearly
Job Description & How to Apply Below

We're looking for a Founding Engineer, ML Inference with deep expertise in high-performance ML engineering. This is a highly technical, high-impact role focused on squeezing every drop of performance from real-time generative media models.

You'll work across the model-serving stack, designing novel inference frameworks, optimizing inference performance, and shaping Reactor's competitive edge in ultra-low-latency, high-throughput environments. We want to establish new inference frameworks in this domain, and you will own this part of our stack.

What You'll Do
  • Drive our frontier position on real-time model performance for diffusion models
  • Design and implement a high-performance in-house inference runtime
  • Implement optimizations using torch.compile, custom CUDA kernels, and specialized inference frameworks
  • Optimize neural network models for inference through quantization, pruning, and architectural modifications while maintaining accuracy
  • Profile and benchmark model performance to identify computational bottlenecks
  • Collaborate directly with model partner teams to integrate their models into our platform
Required Skills
  • Strong foundation in systems programming, with a track record of identifying and resolving bottlenecks
  • Deep expertise in the ML infrastructure stack:
    ◦ PyTorch, TensorRT, Transformer Engine, Nsight, ONNX Runtime
    ◦ Model compilation, quantization (INT8/FP16), and advanced serving architectures
  • Working knowledge of GPU hardware (NVIDIA) and the ability to dive deep into the stack as needed
  • Strong understanding of transformer architectures and modern ML model optimization techniques
Logistics

We are based in-person in San Francisco. We believe the best ideas and work come from being together.

  • Competitive San Francisco salary and meaningful early equity.
  • We sponsor visas. We are committed to working through the process together for the right candidates. If you're currently outside the US, we're also committed to helping you relocate to the US throughout this process.
  • We offer generous health, dental, and vision coverage, and relocation support as needed.

If this sounds like you, we'd love to hear from you.
