More jobs:
Senior Site Reliability Engineer
Remote / Online - Candidates ideally in
Myrtle Point, Coos County, Oregon, 97458, USA
Listed on 2025-12-07
Myrtle Point, Coos County, Oregon, 97458, USA
Listing for:
Octopus Deploy
Per diem, Remote/Work from Home
position Listed on 2025-12-07
Job specializations:
-
IT/Tech
Systems Engineer, SRE/Site Reliability
Job Description & How to Apply Below
Octopus Deploy sets the standard for Continuous Delivery, empowering software teams to deliver value in an agile way. Over 4,000 organizations globally – including Ubisoft, Xero, Stack Overflow, NASA, and Disney – rely on our Continuous Delivery, Git Ops, and release orchestration solutions.
We’re a profitable scale-up of 300+ people, growing steadily. We’ve built a high-trust, remote-first, and value-driven culture where people are given space to do their best work.
The Builds team at Octopus Deploy is looking for a Senior Site Reliability Engineer (SRE) to:- Share SRE expertise with teams across the company.
- Keep our build systems running with high reliability and availability.
- Improve and iterate on our existing reliability practices.
- Bring fresh ideas and practices to increase reliability and reduce toil.
- Lead the implementation of new capabilities.
- Naturally work in line with our Senior SRE expectations.
- You collaborate effectively, even across wide organisational distances, to solve problems, combining passion, pragmatism, and empathy.
- Thrive in an environment focused on availability, reliability, and observability.
- Are a strong systems engineer and may have deeper expertise in particular domains.
- See value in applying safety culture lessons from other industries to software and operations.
- Are comfortable leading postmortems and designing deployment and monitoring pipelines.
- Care deeply about automation across builds, tests, deployments, infrastructure, and operational tasks.
- Embrace a “you build it, you run it” culture, with a strong commitment to quality and system availability, and are happy to participate in a humane on‑call program.
- Are self‑motivated, work independently with high‑quality output, and proactively seek help or new work when needed.
- Are results‑oriented, adapt quickly when business direction changes, and encourage the same in others.
- Welcome candid feedback, enjoy solving complex problems, and like helping other engineers succeed while working on genuinely valuable projects.
- You don’t need to know all of this – it’s here to give you a feel for our environment.
- Our primary focus and flagship product.
- Written in .NET and backed by a SQL database.
- Experience with the C# application SDLC (e.g. building, testing) is highly regarded.
- Team City is our primary build system for Octopus Server.
- Git Hub Actions is used for some internal tools.
- Continuous delivery is powered by Octopus Deploy.
- A mix of internally developed applications and third‑party software (e.g. Team City).
- Run in Azure using App Services, AKS clusters, and Azure Functions.
- Container workloads run on AKS, with Docker Hub and Artifactory as container registries.
- Terraform is our primary IaC tool.
- IaC workloads run mostly in Octopus Deploy, with some running via Git Hub Actions.
- Our team operates a multiregion Open Telemetry processing system for the rest of R&D.
- We’ve adopted Open Telemetry across many of our Builds systems.
- We help other teams adopt Open Telemetry for more use cases company‑wide.
- We use Sumo Logic and Honeycomb for analysis and troubleshooting.
- Building new capabilities to increase reliability (we don’t want you staring at dashboards all day).
- Working where you do your best work – from your home office, with your preferred setup, tools, and soundtrack.
- Consulting with another team on how to operate their services at the right level of reliability, or how best to use our build and observability platforms.
- Pairing with another engineer over Zoom to solve a complex technical problem or explore the problem space for future improvements.
- Responding to an actionable alert and working to maintain the reliability of the platform used across the company.
- Improving documentation so engineers can discover solutions themselves and reduce lead time.
- Writing a blog post or preparing a talk to share something interesting you’ve learned with other engineers.
- Facilitating an incident review and turning the learnings into practical changes.
- Proactively reducing toil by building thoughtful automation.
Position Requirements
10+ Years
work experience
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×