Senior Software Engineer, Site Reliability Engineer; SRE
Listed on 2026-01-12
-
IT/Tech
Systems Engineer, SRE/Site Reliability, Cloud Computing, IT Support
Senior Software Engineer, Site Reliability Engineer (SRE)
Why Harvey
At Harvey, we’re transforming how legal and professional services operate — not incrementally, but end‑to‑end. By combining frontier agentic AI, an enterprise‑grade platform, and deep domain expertise, we’re reshaping how critical knowledge work gets done for decades to come.
We’re scaling fast and defining a new category in real time. Our team is sharp, motivated, and deeply committed to the mission.
Role OverviewAs a Software Engineer on the Site Reliability team at Harvey, you will ensure the reliability, scalability, and performance of our legal AI platform. You’ll join a high‑leverage team that sits at the intersection of infrastructure and product, owning the systems that keep our platform fast, secure, and always on. From scaling across 50+ regions to automating mission‑critical operations, your work will help Harvey remain resilient as we grow.
If you’re passionate about building robust systems and reducing complexity through automation, we’d love to work with you. This role is based in San Francisco, CA, and we offer relocation assistance.
- Design, implement, and manage monitoring, alerting, and infrastructure resources across 50+ global regions
- Lead incident management processes, including post‑mortems, root cause analyses, and driving actionable improvements
- Automate operational tasks and workflows, building tools and processes for capacity planning, graceful rollouts, and safe data access to maintain high reliability and reduce manual intervention
- Collaborate across teams to drive reliability, security, and compliance throughout the software lifecycle
- Optimize infrastructure costs through strategic capacity planning and build‑versus‑buy decisions while maintaining system performance, reliability, and functionality
- 5+ years of experience in Site Reliability Engineering or similar roles supporting production environments
- Expertise in infrastructure as code (IaC) tools (Pulumi, Terraform, Cloud Formation, etc.)
- Deep familiarity with observability tools (Datadog, Sentry, etc.) and incident response practices (Pager Duty, Incident
IO, etc.) - Proficiency with cloud infrastructure platforms (Azure, GCP, AWS, etc.)
- Strong programming skills (Python, Bash, Go, or similar)
- Proven track record of diagnosing complex system problems and implementing durable solutions
- Solid understanding of CI/CD, Kubernetes, containerization, networking, databases, and cloud security principles
- Excellent problem‑solving skills, meticulous attention to detail, and a commitment to operational excellence
$200,000 – $260,000 USD
Harvey is an equal opportunity employer and does not discriminate on the basis of race, gender, sexual orientation, gender identity/expression, national origin, disability, age, genetic information, veteran status, marital status, pregnancy or related condition, or any other basis protected by law. We are committed to providing reasonable accommodations to applicants with disabilities, and requests can be made by emailing accommodations.
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).