×
Register Here to Apply for Jobs or Post Jobs. X

Sr. SRE

Job in Coos Bay, Coos County, Oregon, 97458, USA
Listing for: Lytx
Full Time position
Listed on 2026-03-01
Job specializations:
  • IT/Tech
    Systems Engineer, Cloud Computing, SRE/Site Reliability
Salary/Wage Range or Industry Benchmark: 100000 - 125000 USD Yearly USD 100000.00 125000.00 YEAR
Job Description & How to Apply Below
Position: Sr. Staff SRE

Why Lytx:

At Lytx, our engineering culture is built around being hungry, low-ego, and highly capable. We are pragmatic engineers who take ownership, collaborate openly, and focus on delivering measurable operational impact . Our mission is to design, operate , and continuously evolve the cloud infrastructure and operational platforms that power mission-critical SaaS and IoT services at global scale.

We are growing rapidly and expanding the use of AI across our platform and engineering operations. As our systems scale in complexity and business criticality, we are investing in next-generation observability, intelligent automation, and AIOps capabilities to enable proactive, insight-driven operations.

The Site Reliability Engineering (SRE) organization is responsible for the availability, reliability, observability, and resilience of our cloud-native environments. This includes building the operational platforms, telemetry strategy, and automation frameworks that allow engineering teams to operate confidently and efficiently.

This role sits at the center of operational intelligence for the company. As a Sr. Staff SRE, you will define the technical vision for observability and operational automation, influence architecture across the organization, and lead initiatives that reduce operational risk, improve system insight, and enable predictive, automated response at scale.

If you enjoy building foundational platforms, shaping engineering standards, and driving the evolution toward AI-enabled operations, this role provides an opportunity to have broad organizational impact.

Responsibilities / You’ll get to

Strategic Technical Leadership : Define and drive the long-term strategy for observability, operational intelligence, and reliability engineering across the organization, aligning technical direction with business growth, customer experience, and service-level objectives .

Operational Intelligence & AIOps : Lead the evolution toward intelligent operations by designing capabilities such as event correlation, anomaly detection, alert noise reduction, predictive signal detection, and automated remediation to improve MTTD, MTTR, and operational efficiency.

Observability Platform Architecture : Architect and lead the end-to-end observability platform across metrics, logs, traces, and events. Establish scalable telemetry standards, instrumentation patterns, and onboarding models that enable consistent visibility across AWS and cloud-native services.

Automation at Scale : Drive large-scale automation initiatives that reduce operational toil, including self-service infrastructure workflows, policy-as-code guardrails, reliability automation, and automated response for common failure scenarios.

Reliability & Resilience Engineering : Partner with product, platform, and data teams to embed reliability, performance, cost efficiency, and fault tolerance into system design. Lead capacity modeling, resilience planning, and architecture improvements for multi-AZ and multi-region environments.

Incident Leadership & Continuous Learning : Provide technical leadership during high-severity incidents and guide blameless postmortems that identify systemic issues and drive long-term reliability improvements.

Organizational Standards & Governance : Define and standardize SLO/SLI frameworks, error budget practices, telemetry conventions, and infrastructure patterns to ensure consistent operational excellence across teams.

Innovation & Technology Evaluation : Evaluate and introduce emerging AWS-native, cloud-native, and AI-enabled observability and automation technologies. Lead proofs-of-concept and guide organization-wide adoption.

Mentorship & Influence : Mentor Staff and Senior SREs, raising the bar for system design, operational rigor, and engineering judgment while fostering a culture of ownership, learning, and continuous improvement.

Cross-Organizational Influence : Act as a senior technical authority for reliability and observability, shaping engineering roadmaps and influencing architectural decisions across product and platform domains.

Requirements / You’ll Need
  • 8–10+ years of experience in SRE, platform engineering, or cloud…

To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary