SRE Technical Manager - Transport
Listed on 2026-01-19
-
IT/Tech
Systems Engineer, SRE/Site Reliability, IT Project Manager, Cloud Computing
Description
Leidos currently has an opening on the Service Management Integration and Transport (SMIT) Contract for a Site Reliability Engineering (SRE) Technical Manager. This is an exciting opportunity to use your experience and leadership skills to successfully execute the mission of the Navy's largest IT services program. Under the SMIT Contract, the Leidos team is responsible for the core backbone for the Navy-Marine Corps Intranet, including cybersecurity services, network operations, network engineering, service desk, seat support services, and data transport.
MoreAbout the Role
We are seeking a highly skilled and experienced SRE Technical Manager to lead our Transport Site Reliability Engineering (SRE) team. In this role, you will manage a group of talented engineers responsible for ensuring the reliability, performance, and scalability of critical systems across 5-6 SRE Pods. You will work closely with engineering, product, and operations teams to implement best practices in automation, incident management, and system monitoring.
This role will focus on both the strategic and operational aspects of site reliability, ensuring that the team meets performance objectives while fostering a culture of innovation and continuous improvement. The SRE Technical Manager will collaborate with the Director of Site Reliability Engineering and is responsible for supporting, migrating, automation and optimization of software development and deployment process, infrastructure as code, and maturing the Site Reliability Engineering program.
The manager will mentor and coach technical staff performing collaborative code reviews to strengthen the SRE skills across the teams.
- Manage and mentor 5-6 SRE teams (pods) and 60+ FTEs, providing guidance, setting performance expectations, and fostering professional development.
- Work collaboratively with SRE Resource Managers to staff and maintain engineering resources for your SRE vertical teams' reliability and scalability goals.
- Responsible for the P&L across the Transport Services vertical. Manage the SRE team's resources, including budget planning, tool selection, and infrastructure investments to meet reliability and scalability needs.
- Meet regularly with your team members, participate in performance reviews and interviews, and development planning.
- Oversee the reliability, availability, and performance of critical systems by leading the SRE teams within the data center vertical in implementing monitoring, incident response, and performance optimization strategies.
- Ensure the team adheres to best practices for system reliability, automation, and operational efficiency.
- Drive continuous improvement initiatives by analyzing performance metrics (e.g., SLOs, MTTR, MTBF) and identifying areas for enhancement.
- Collaborate with operations, quality, cybersecurity and other SRE engineering teams to define and enforce Service Level Objectives (SLOs) and manage error budgets.
- Act as a liaison between the SRE team and other departments to prioritize reliability and operational needs in the product development process.
- Collaborate with senior leadership to define the SRE strategy, set long-term reliability goals, and ensure alignment with business objectives.
- Lead efforts to reduce operational toil through automation. Work with the team to build or enhance automation tools that manage infrastructure, monitor systems, and respond to incidents.
- Oversee the development and adoption of Infrastructure as Code (IaC) tools, CI/CD pipelines, and other automation processes.
- Ensure that SRE practices align with organizational security policies and compliance requirements.
- Collaborate with security teams to integrate reliability‑focused security practices into the design and operation of systems.
- Ensure systems meet or exceed agreed‑upon service levels by proactively addressing potential issues and working with stakeholders to align on reliability expectations.
- Work within a SRE team, collaborating with other Developers, Security, and Operations, to continuously deliver products and increase the value stream for the organization and customers.
- Embrace and champion Agile…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).