Senior Lead Site Reliability Engineer
San Jose, Santa Clara County, California, 95199, USA
Listed on 2026-03-01
-
IT/Tech
Systems Engineer, IT Support, Cloud Computing
Immigration sponsorship is not available for this position
What you can expectAs a Senior Lead Site Reliability Engineer, you can anticipate opportunities to work on our hybrid systems across the globe. You will be responsible for installing, configuring, and monitoring new systems within a network of global data centers. Additionally, you will patch and maintain thousands of physical and cloud systems worldwide. To streamline operations, you will develop automation to reduce repetitive tasks and analyze and address performance bottlenecks.
Furthermore, you will update and troubleshoot user access permissions, resolve network connectivity issues, and maintain system firewalls.
Zoom's SRE team is committed to delivering customer happiness, improving business efficiency, and promoting agility through innovation, data-driven insights, and automation. Our impact is reflected in smooth user experiences, optimized processes, and support for Zoom's expansion in the realm of communication and collaboration.
ResponsibilitiesProviding technical direction for cross-team initiatives and major incidents. Mentor SREs and developers; define best practices and design patterns. Partner with Security, Networking, and Platform teams on architecture roadmaps. Influence vendor and hardware strategy for on-prem and cloud workloads. Design self-healing platforms using automation, chaos engineering, and fault-tolerant patterns. Optimize Linux systems at scale: performance tuning, kernel parameters, networking, storage, and security hardening.
Define best practices and advocate for them across the company. Excellent communication skills and experience driving cross team projects as a technical lead. Able to participate in on-call shifts and incident management and work after hours/weekends for application releases/deployments.
- 10+ years in SRE, production engineering, or large-scale systems administration
- Experience of Linux system administration (systemd, cgroups, networking, file systems, performance analysis)
- Demonstrate coding ability with at least one programming language e.g. Python
- Experience with configuration management (Ansible), IaC (Terraform, Packer), CI/CD pipelines (Jenkins, Git Lab), container orchestration (k8s, Docker) and observability platforms
- Experience with incident response for mission-critical environments
- Security-first mindset (TPM, secure boot, identity, secrets management)
- Networking expertise: BGP, load balancing, DNS, TLS, traffic engineering
- Experience with chaos engineering and resilience testing. Experience with distributed storage systems such as Ceph
Salary Range or On Target Earnings:
Minimum: $;
Maximum: $. In addition to the base salary and/or OTE listed Zoom has a Total Direct Compensation philosophy that takes into consideration base salary, bonus and equity value.
Note:
Starting pay will be based on a number of factors and commensurate with qualifications & experience.
We also have a location based compensation structure; there may be a different range for candidates in this and other locations.
Closing DateAnticipated Position Close Date: 03/27/26
Ways of WorkingOur structured hybrid approach is centered around our offices and remote work environments. The work style of each role, Hybrid, Remote, or In-Person is indicated in the job description/posting.
BenefitsAs part of our award-winning workplace culture and commitment to delivering happiness, our benefits program offers a variety of perks, benefits, and options to help employees maintain their physical, mental, emotional, and financial health; support work-life balance; and contribute to their community in meaningful ways. Learn more information.
About UsZoomies help people stay connected so they can get more done together. We set out to build the best collaboration platform for the enterprise, and today help people communicate better with products like Zoom Contact Center, Zoom Phone, Zoom Events, Zoom Apps, Zoom Rooms, and Zoom Webinars. We’re problem-solvers, working at a fast pace to design solutions with our customers and users in mind.
Find room to grow with opportunities to…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).