Sr. Site Reliability Engineer, DevOps
Job in
Carlsbad, San Diego County, California, 92008, USA
Listed on 2026-03-03
Listing for:
Atec Spine
Full Time
position Listed on 2026-03-03
Job specializations:
-
IT/Tech
SRE/Site Reliability, Systems Engineer, Cloud Computing, IT Support
Job Description & How to Apply Below
Essential Duties and Responsibilities
* Serve as a primary contributor to the on-call rotation to maintain 24/7 uptime for production systems.
* Proactively, monitor, and continuously improve SLAs, SLOs, and SLIs across critical services.
* Develop and maintain robust observability tooling including logging, metrics, and tracing (e.g., Azure Monitor, Open Telemetry, Prometheus).
* Proactively conduct postmortems and root cause analysis; implement fixes to prevent repeat incidents.
* Identify and eliminate manual operational toil through scripting and automation.
* Design and maintain automated incident detection and response systems.
* Establish and maintain runbooks, playbooks, and escalation protocols for system support.
* Contribute to chaos testing and failure injection to proactively uncover weaknesses.
* Promote a culture of operational excellence through data-driven reliability practices.
* Proactively communicating status
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×