
Site Reliability/Production Engineer

Job in San Francisco, San Francisco County, California, 94199, USA
Listing for: Hebbia, Inc.
Full Time position
Listed on 2026-02-28
Job specializations:
  • Software Development
    DevOps, Cloud Engineer - Software, Software Engineer
Salary/Wage Range or Industry Benchmark: 160,000 - 300,000 USD yearly

The AI platform for investors and bankers that generates alpha and drives upside.

Founded in 2020 by George Sivulka and backed by Peter Thiel and Andreessen Horowitz, Hebbia powers investment decisions for BlackRock, KKR, Carlyle, Centerview, and 40% of the world’s largest asset managers. Our flagship product, Matrix, delivers industry‑leading accuracy, speed, and transparency in AI‑driven analysis. It is trusted to help manage over $30 trillion in assets globally.

We deliver the intelligence that gives finance professionals a definitive edge. Our AI uncovers signals no human could see, surfaces hidden opportunities, and accelerates decisions with unmatched speed and conviction. We do not just streamline workflows; we transform how capital is deployed, how risk is managed, and how value is created across markets.

Hebbia is not a tool. Hebbia is the competitive advantage that drives performance, alpha, and market leadership.

The Role

We are looking for a Site Reliability Engineer who thinks like a software engineer first. You will own critical production services end‑to‑end, designing, building, and improving them rather than simply operating them. You will write production‑quality code that keeps the platform reliable at scale, embed with product engineering teams to influence architecture from the start, and build the internal tooling that every engineer at Hebbia depends on. This is not a ticket‑driven ops role.

You will spend most of your time writing code: instrumenting services, eliminating performance bottlenecks, building deployment platforms, and translating incident post‑mortems into lasting architectural improvements.

Responsibilities
  • Own critical production services end‑to‑end, from design and code review through deployment, operation, and incident response
  • Profile, benchmark, and rewrite hot paths to eliminate bottlenecks as Hebbia scales
  • Lead incident response and drive post‑mortem culture, translating findings into code changes and architectural improvements rather than runbooks
  • Design and build observability frameworks from scratch, writing custom instrumentation, alerting logic, and debugging tooling that surfaces production issues before customers feel them
  • Define and enforce SLOs across platform services and build the feedback loops that keep engineering teams accountable to them
  • Own capacity planning and cost efficiency: model growth, right‑size infrastructure, and write automation that prevents over‑provisioning and resource exhaustion
  • Build robust, well‑tested internal platforms and deployment tooling held to the same engineering standards as customer‑facing code
  • Own and continuously improve CI/CD systems so engineering teams can ship safely and quickly
  • Embed with product engineering teams as a peer software engineer, contributing directly to production codebases and co‑designing systems for reliability from the start
  • Partner on infrastructure security through threat modeling, hardening, and automated compliance tooling
Who You Are
  • 5+ years of software development experience, with a track record of writing, shipping, and maintaining production services, not just operating infrastructure
  • Production‑grade proficiency in at least one systems or backend language: Go, Python, C++, or Rust
  • Proven experience as a Production Engineer, SRE, or software engineer with a deep infrastructure focus, comfortable owning services end‑to‑end across the full stack
  • Deep understanding of distributed systems
  • Container orchestration expertise and hands‑on experience debugging complex distributed failures in production
  • Working knowledge of OS‑level concepts
  • Experience in building and maintaining observability stacks
  • Strong CI/CD pipeline expertise and a track record of improving developer velocity without sacrificing safety
  • Background at a company with a Production Engineering or software‑focused SRE culture is a strong plus
  • Experience building platforms for AI/ML workloads or high‑throughput document processing pipelines is a plus
Compensation

The salary range for this role is $160,000 to $300,000. This range may be inclusive of several career levels at Hebbia and will be narrowed during the interview process based on the candidate’s experience and qualifications. Adjustments outside of this range may be considered for candidates whose qualifications significantly differ from those outlined in the job description.

PTO: Unlimited

Insurance: Medical + Dental + Vision + 401K

Eats: Catered lunch daily + DoorDash dinner credit if you ever need to stay late

Parental leave policy: 3 months for non‑birthing parent, 4 months for birthing parent

Fertility benefits: $15k lifetime benefit

New hire equity grant: competitive equity package with unmatched upside potential

Equal Employment Opportunity

Hebbia’s Equal Employment Opportunity policy: we do not discriminate on the basis of any protected group status under any applicable law.
