Engineering Manager, Cloud Capacity, SaaS Production Engineering
Listed on 2026-03-01
IT/Tech
Cloud Computing, Systems Engineer, SRE/Site Reliability, IT Project Manager
Overview
GitLab is an open-core software company that develops the most comprehensive AI-powered DevSecOps Platform, used by more than 100,000 organizations. Our mission is to enable everyone to contribute to and co-create the software that powers our world. When everyone can contribute, consumers become contributors, significantly accelerating human progress. Our platform unites teams and organizations, breaking down barriers and redefining what's possible in software development.
Thanks to products like Duo Enterprise and Duo Agent Platform, customers get AI benefits at every stage of the SDLC.
The same principles built into our products are reflected in how our team works: we embrace AI as a core productivity multiplier, with all team members expected to incorporate AI into their daily workflows to drive efficiency, innovation, and impact. GitLab is where careers accelerate, innovation flourishes, and every voice is valued. Our high-performance culture is driven by our values and continuous knowledge exchange, enabling our team members to reach their full potential while collaborating with industry leaders to solve complex problems.
Co-create the future with us as we build technology that transforms how the world develops software.
An overview of this role
As the Engineering Manager for Cloud Capacity, GitLab SaaS Production Engineering, you will build and lead a high-performing, fully distributed team that operates and scales GitLab's multi-tenant SaaS infrastructure. You'll guide the team through a strategic consolidation effort that aligns multi-tenant and single-tenant deployments around shared tooling and processes, reducing duplication, simplifying operations, and improving reliability across all production environments. You'll own cloud capacity planning and vendor relationships, collaborate closely with Product Management and other Infrastructure Platforms and Engineering teams on roadmap and backlog health, and participate in incident management to help ensure GitLab SaaS remains available, secure, and scalable for customers.
Alongside the technical mandate, you'll focus on developing a strong, collaborative engineering culture and growing team members into capable technical leaders.
What you'll do
- Lead a high-performing Cloud Capacity team within GitLab SaaS Production Engineering, creating an environment where team members can do their best work and grow.
- Drive the consolidation of multi-tenant and single-tenant SaaS infrastructure tooling and processes into cohesive, standardized approaches that simplify operations and improve reliability.
- Own cloud capacity planning and operations, including maintaining effective relationships with cloud partners and other infrastructure vendors.
- Manage the team's roadmap and project work in partnership with Product Management, ensuring priorities are clear and the backlog remains in a healthy state.
- Participate in the Incident Management on-call rotation, working with reliability and development teams to meet availability goals for GitLab's SaaS offerings.
- Collaborate across Infrastructure Platforms, other Infrastructure teams, Support, and Customer Success Management to deliver a consistent, high-quality customer experience.
- Champion automation, secure-by-default practices, and sound engineering principles to strengthen the availability, security, and scalability of GitLab SaaS production environments.
- Mentor and develop individual contributors into strong technical leaders, fostering a collaborative, inclusive, and results-focused engineering culture.
What you'll bring
- Experience leading production, platform engineering, or site reliability engineering teams, including guiding engineers through complex technical and operational change.
- Strong technical background in distributed systems, SaaS infrastructure, and cloud capacity, sufficient to make informed decisions.
- Background running and operating consumer-scale platforms in a product company environment, with a focus on availability, security, and scalability.
- Experience participating in and navigating incident response, collaborating across teams to resolve outages and improve reliability practices.
- Demonstrated ability to build, develop, and coach engineering teams, including supporting individual contributors in growing into technical leaders.
- Effective cross-functional collaboration skills, working closely with Product Management, Infrastructure, Support, and Customer Success on shared outcomes.
- Clear and adaptable communication style, with the ability to explain complex systems to both technical and non-technical audiences in an all-remote, fully distributed context.
- Openness to candidates with diverse backgrounds and transferable experience in related infrastructure, reliability, or platform leadership roles.
The Cloud Capacity team sits within the Infrastructure Platforms department, which ensures GitLab operates, delivers, and scales efficiently across , GitLab…