Senior Site Reliability / Git Ops Engineer
Listed on 2026-01-12
IT/Tech
Systems Engineer, Cloud Computing, IT Support
Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is widely used in breakthrough enterprise initiatives such as public cloud, data science, AI, engineering innovation, and IoT. Our customers include the world’s leading public cloud and silicon providers and other industry leaders. The company is a pioneer of global distributed collaboration, with 1200+ colleagues in 75+ countries and very few office‑based roles.
We are hiring a Senior Site Reliability / Git Ops Engineer to join the Information Systems (IS) team. This role is an opportunity for an "automation‑first" senior technologist with a passion for and proficiency in Linux to build a career with Canonical and drive success for those leveraging Ubuntu and open‑source products.
Job Summary
The IS team at Canonical supports and maintains all of Canonical’s IT production services. The team is in charge of running services used by over 60 million Ubuntu users. As a Senior SRE & Git Ops engineer you will be uniquely positioned to drive operations automation to the next level, both in our own private clouds and in public clouds, using open‑source infrastructure‑as‑code tools, CI/CD pipelines and Canonical’s leading products for software operation automation.
In addition to defining the infrastructure as code, you will improve Canonical products and open‑source technologies by providing critical feedback to developers. You will also collaborate on design and implementation with other teams, contributing to bug fixes, pull requests, and shared tooling.
As a Senior Site Reliability / Git Ops Engineer you will
- Drive the development of automation and Git Ops in your team as an embedded tech lead
- Collaborate closely with the IS architect to align solutions with the IS architecture vision
- Design and architect services that IS can offer to the organization as products
- Apply your IaC experience to develop and improve infrastructure‑as‑code practice within IS
- Automate software operations for re‑usability and consistency across private and public clouds, handling distributed‑system complexities
- Maintain operational responsibility for all of Canonical’s core services, networks, and infrastructure
- Develop skills in troubleshooting, capacity planning, and performance investigation
- Set up, maintain, and use observability tools such as Prometheus, Grafana, and Elasticsearch
- Design, implement, and maintain monitoring and alerting for various systems and services
- Provide assistance and work with globally distributed engineering, operations, and support peers
- Receive uninterrupted development time to focus on larger projects and automation of manual tasks
- Share your experience, know‑how, and best practices with other team members in design sessions, mentorship, and collaborative work
- Carry final responsibility for time‑critical escalations
- A modern view on hosting architecture, driven by infrastructure as code across private and public clouds
- A product mindset that thrives on developing products rather than solutions
- Python software development experience on large projects
- Experience working with Kubernetes or other container orchestration systems
- Proven exposure to managing and deploying cloud infrastructure with code
- Practical knowledge of Linux networking, routing, and firewalls
- Affinity with various forms of Linux storage, from Ceph to databases
- Hands‑on experience administering enterprise Linux servers
- Extensive knowledge of cloud computing concepts and technologies
- Bachelor’s degree or greater, preferably in computer science or related engineering field
- Clear and effective communication in English over email, chat, video or voice calls, and in person
- Motivation to troubleshoot from kernel to web, and willingness to ask for help when appropriate
- Flexibility and eagerness to learn new things quickly
- Passion for fast‑changing environments and open source, especially Ubuntu or Debian
- Distributed work environment with twice‑yearly team sprints in person
- P…