
Senior Site Reliability Engineer

Job in Santa Clara, Santa Clara County, California, 95053, USA
Listing for: LeanData
Full Time position
Listed on 2026-03-01
Job specializations:
  • IT/Tech
    Cloud Computing, Systems Engineer, IT Project Manager
Salary/Wage Range or Industry Benchmark: 80000 - 100000 USD Yearly
Job Description & How to Apply Below

LeanData helps the world’s fastest-growing companies automate, simplify, and accelerate revenue.

We are looking for a Senior Site Reliability Engineer to lead the strategic evolution of our cloud infrastructure. Reporting directly to the SVP of Engineering, this role is designed for a builder: someone who wants to move beyond maintenance and into architectural transformation.

You will have the autonomy to evaluate our existing AWS footprint and lead the charge in modernizing our environment. Your mission is to take a high-velocity system and implement the best practices, guardrails, and automated architectures that will support our next 10x of scale. You will be the primary authority on reliability, performance, and infrastructure security.

Please note: This is a hybrid role based in our Santa Clara, CA office, with an in-office schedule of two days per week – Monday and Wednesday.

Key Responsibilities
  • Architectural Modernization:
    Lead the design and implementation of a scalable, "Cloud-First" AWS architecture. You will drive the transition toward fully automated, state-of-the-art Infrastructure as Code (Terraform).
  • High Availability & Resilience:
    Design and implement robust Disaster Recovery (DR) and Business Continuity plans, moving our services toward a zero-downtime deployment model.
  • Performance & Capacity Engineering:
    Own the strategy for capacity planning and autoscaling. You will optimize our compute resources (EC2, Lambda) to handle bursty traffic patterns with precision and cost-efficiency.
  • Advanced Observability:
    Define our monitoring and alerting philosophy using New Relic for deep APM and system insights. Pair this with Incident.io to ensure we catch and resolve issues before they impact customers.
  • Streamlined CI/CD:
    Partner with feature teams to refine Change Management and CI/CD pipelines, ensuring code moves from "commit" to "production" safely and predictably.
  • Cloud Security:
    Harden our network architecture and application security posture, including WAF management and secure service-to-service communication.
The Tech Stack
  • Cloud Infrastructure: AWS (EC2, Lambda, SQS, SNS, ALB, API Gateway, S3, WAF).
  • Observability & Incident Response:
    New Relic (APM/Infrastructure), Incident.io.
  • Automation & Tools:
    Terraform, Redis/ElastiCache, Shell Scripting, NPM/PM2.
  • Application Ecosystem:
    NodeJS, Python, C#, Angular, Apex.
  • Integration:
    Salesforce Managed Packages, MSFT Dynamics 365.
Who You Are
  • Experienced Architect: 5+ years of experience in SRE, DevOps, or Systems Engineering, with a proven track record of managing complex AWS environments.
  • Proven Incident Commander:
    You demonstrate calm, decisive leadership during high-pressure outages. You have extensive experience running blameless postmortems and, crucially, driving the remediation work needed to prevent recurrence.
  • Observability Pro:
    You have deep experience configuring New Relic (or similar platforms) to create meaningful dashboards, SLIs, and SLOs.
  • Automation Advocate:
    You believe that manual intervention is a bug. You have deep experience with Terraform and a "Code-First" approach to infrastructure.
  • Strategic Problem Solver:
    You can look at a complex, "needs-based" architecture and formulate a clear, prioritized roadmap to move it toward industry best practices.
  • Collaborative Leader:
    You enjoy working with feature engineers to help them build "reliability-by-design" into their services.
  • Education:
    A Bachelor’s degree in Computer Science, Engineering, or a related technical field (or equivalent professional experience).
Why Work At LeanData
  • LeanData covers employee insurance premiums up to 90%
  • Stock options in LeanData for all full-time employees
  • Flexible PTO
  • 401K plan
Position Requirements
10+ Years work experience