Senior Lead - Site Reliability Engineer
Listed on 2026-02-27
-
Software Development
Software Engineer, DevOps, Cloud Engineer - Software, AI Engineer
Location: Greater London
Overview
In short At On, we believe technology should move as fast as a runner. We are building the foundation that allows our engineering organization to scale, innovate, and deliver value without friction. We are looking for a Staff Software Engineer to join the Developer Experience (Dev Ex) team. In this hands-on IC role, you will build the platform that empowers dozens of technology teams to build, ship, and operate services effortlessly.
You will be the architect and primary builder of our "Golden Path," ensuring that speed, safety, and reliability are baked into every developer s workflow.
- Engineer the Golden Path:
Design and implement highly automated CI/CD pipelines (Git Hub Actions) and common templating (Typescript/Node) systems that allow a developer to go from "idea" to "production-ready service" in minutes. - Graph
QL
Infrastructure: Act as a core contributor to our central Apollo Graph
QL API Gateway. You will manage the supergraph composition, runtime stability, and schema governance standards to ensure a consistent contract for all consumers. - Build the Internal Developer Platform (IDP):
Develop our Cloud Abstraction Layer and Developer Portal. This includes building self-service tools, CLIs, and service catalogs that reduce cognitive load for engineers. - SRE & Observability:
Integrate "secure-by-default" practices and robust observability into the platform. You build dashboards (New Relic) and monitoring patterns that provide teams with deep insights into their service health. - Technical Excellence & Advocacy:
Conduct code reviews, write high-quality documentation, and advocate for Dev Ex best practices across the organization. - Friction Reduction:
Actively hunt for bottlenecks in the software development lifecycle. Whether it s a slow build or a complex deployment process, you are responsible for fixing it.
- Product Thinking:
You treat internal developers as your customers. You listen to their pain points and iterate on the platform based on real-world feedback. - Infrastructure as Code:
You are comfortable with Terraform and Kubernetes (GCP experience is a plus), treating infrastructure with the same rigor as application code. - Automation Mindset:
You are passionate about CI/CD (Git Hub Actions) and building developer tooling (CLIs, SDKs, or Portals). - AI-Augmented Engineering Workflows:
You leverage the latest agentic coding tools to 10x your productivity, blending deep technical principles with AI-assisted workflows. As a power user of AI, you orchestrate complex builds and bypass boilerplate to deliver robust, scalable code at pace, ensuring our Type Script/Node.js environment remains lean and efficient. - Architectural API Design:
Extensive experience in crafting robust API contracts with a focus on Graph
QL Federation; familiarity with the Apollo tech stack is highly regarded. - Data-Driven:
You understand DORA metrics and Dev Ex signals, using them to measure the success of the tools you build.
On is a place that is centered around growth and progress. We offer an environment designed to give people the tools to develop holistically – to stay active, to learn, explore and innovate. Our distinctive approach combines a supportive, team-oriented atmosphere, with access to personal self-care for both physical and mental well-being, so each person is led by purpose.
On is an Equal Opportunity Employer. We are committed to creating a work environment that is fair and inclusive, where all decisions related to recruitment, advancement, and retention are free of discrimination.
#J-18808-LjbffrTo Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search: