Engineering Manager, Site Reliability (SRE)

Job in Myrtle Point, Coos County, Oregon, 97458, USA
Listing for: SentinelOne
Full Time position
Listed on 2026-01-12
Job specializations:
  • IT/Tech
    Systems Engineer, Cloud Computing, IT Support, SRE/Site Reliability
Salary/Wage Range or Industry Benchmark: 100,000 - 125,000 USD yearly
Job Description & How to Apply Below
Position: Engineering Manager, Site Reliability (SRE)
Location: Myrtle Point

About Us

At SentinelOne, we’re redefining cybersecurity by pushing the limits of what’s possible, leveraging AI-powered, data-driven innovation to stay ahead of tomorrow’s threats.

From building industry-leading products to cultivating an exceptional company culture, our core values guide everything we do. We’re looking for passionate individuals who thrive in collaborative environments and are eager to drive impact. If you’re excited about solving complex challenges in bold, innovative ways, we’d love to connect with you.

What are we looking for?

Please note that under Federal & FedRAMP regulations, hiring for this role is limited to US citizens.

FedRAMP staff may be subject to customer or third‑party background checks, up to and including Secret clearance, if required by their role at SentinelOne.

We are seeking an experienced engineering and operational manager to lead a Site Reliability Engineering (SRE) team. As the Manager of SRE, you will manage a team of SRE professionals responsible for ensuring the reliability and scalability of our products and production services, focusing on the experience our customers have in production every day. You will work closely with other engineering teams to identify and address availability, performance, and capacity issues, and you’ll be a key partner for our externally facing teams, including Support, Customer Success, and Sales Engineering.

This is a highly visible role within S1 with frequent executive communication opportunities, and is a great opportunity to do good work with good people all around the world.

As a team we value:
  • Thinking from first principles, understanding second order impacts
  • Curiosity to understand new systems, their operating principles and limitations
  • Strong operational ownership and a desire to reduce toil via automation
  • A drive to learn, especially from prior failures
  • Courage to take risks and make things happen
  • Empathy and humility to collaborate effectively with peers and across teams

What will you do?
  • Grow and lead a team of SRE professionals, including setting performance goals and measuring deliverables against key metrics, while evolving those metrics as S1 grows and needs develop
  • Invest in data‑driven deep triage on recurring issues, collaborating with other engineering teams to identify and address issues related to reliability, performance, and capacity
  • Develop, improve, and implement processes for the full incident lifecycle, including incident management, post‑incident analysis, and learning from incidents. Lead incident response efforts, including coordinating with other teams to investigate and resolve customer‑impacting incidents
  • Design a support model for SRE covering service maturity and service ownership, including monitoring and alerting improvements, and SLI / SLO design and implementation
  • Analyze production metrics and signals to identify areas for improvement and take proactive steps to mitigate issues
  • Develop and implement best practices and standards for Site Reliability Engineering, from day‑to‑day operations to hiring and planning
  • Communicate effectively with cross‑functional teams to ensure alignment on objectives and priorities. Deliver outcomes, not just stories and tasks.

What skills and knowledge should you bring?
  • 8+ years of related engineering experience, with at least 2 years in a management role
  • Demonstrated experience leading technical and operational teams at various stages of maturity
  • Excellent analytical and problem‑solving skills
  • Familiarity with modern software development methodologies, tools, and techniques, including CI/CD
  • Experience working with cloud‑native applications and large‑scale distributed systems, including a working knowledge of technologies such as Kubernetes and Terraform/IaC, and cloud providers such as AWS or GCP
  • Experience with various monitoring and alerting techniques and tools, including frameworks and concepts such as SLOs, OTel and Golden Signals as well as tooling such as Prometheus and Grafana
  • Extensive experience with incident response and management at various layers of the stack across different business needs and applications, including both hands‑on experience leading…