×
Register Here to Apply for Jobs or Post Jobs. X

DevOps Engineer

Remote / Online - Candidates ideally in
San Jose, Santa Clara County, California, 95199, USA
Listing for: Zoom
Remote/Work from Home position
Listed on 2026-02-16
Job specializations:
  • IT/Tech
    SRE/Site Reliability, Cloud Computing
Salary/Wage Range or Industry Benchmark: 80000 - 100000 USD Yearly USD 80000.00 100000.00 YEAR
Job Description & How to Apply Below

What you can expect

We’re looking for a Senior Site Reliability / Dev Ops Engineer to help operate, scale, and continuously improve highly reliable SaaS production platforms. These are running across large-scale, distributed environments. In this role, you’ll focus on operational excellence, automation, observability, and performance to ensure availability, reliability, and scalability for customer-facing services. You’ll own impactful initiatives end to end, lead incident response when it matters most.

Also you'll partner closely with engineering teams to deliver durable improvements that directly impact customers.

About The Team

You’ll join an experienced SRE/Dev Ops team passionate about reliability and automation. We foster collaboration, fair on‑call rotations, and data‑driven decisions. Together, we build resilient systems and efficient operations that power our SaaS platform.

Responsibilities
  • Operating, scaling, and continuously improving SaaS production platforms across distributed environments
  • Designing and implementing zero‑downtime solutions for highly available services (99.999%)
  • Developing and maintaining disaster recovery (DR) strategies across datacenters in multiple regions. Developing and maintaining automation, tooling, and scripts to improve deployment efficiency and reduce manual operations
  • Implementing and enhancing monitoring, alerting, and observability to proactively detect and prevent issues. Analyzing system behavior and performance data to identify bottlenecks and optimization opportunities
  • Owning system performance, availability, and scalability for customer‑facing services. Leading incident response efforts, conducting root cause analysis, and implementing long‑term remediation
  • Creating and maintaining runbooks and operational documentation to standardize procedures. Define, track, and improve service reliability using SLOs, SLIs, and operational metrics
  • Providing operational input into platform and architecture decisions affecting SaaS services. Mentor engineers and share operational best practices across teams
  • Participating in on‑call rotations, incident management, and after‑hours or weekend work for application releases and deployments
What we’re looking for
  • 6–10 years of experience supporting and operating SaaS production systems in Dev Ops or SRE roles
  • Bring experience designing highly available systems with dynamic uptime targets (up to 99.999%) and hands‑on experience planning and executing multi‑region disaster recovery (DR) strategies.
  • Bring deep experience operating distributed systems in customer‑facing production environments. Experience building and maintaining automation and operational tooling
  • Have deep understanding of system reliability, availability, scalability, and performance optimization with hands‑on experience with monitoring, observability and alerting platforms.
  • Provide background in incident management, including on‑call operations and root cause analysis
  • Able to deeply understand the services you support, including dependencies and failure modes. Proven track record of owning and delivering operational improvements end to end
  • Champion collaboration and communication skills
Ways of Working

Our structured hybrid approach is centered around our offices and remote work environments. The work style of each role, Hybrid, Remote, or In‑Person is indicated in the job description/posting.

Benefits

As part of our award‑winning workplace culture and commitment to delivering happiness, our benefits program offers a variety of perks, benefits, and options to help employees maintain their physical, mental, emotional, and financial health; support work‑life balance; and contribute to their community in meaningful ways. Click Learn for more information.

About Us

Zoomies help people stay connected so they can get more done together. We set out to build the best collaboration platform for the enterprise, and today help people communicate better with products like Zoom Contact Center, Zoom Phone, Zoom Events, Zoom Apps, Zoom Rooms, and Zoom Webinars. We’re problem‑solvers, working at a fast pace to design solutions with our customers and users in mind.

Here, you’ll work…

To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary