Senior DevOps/SRE Engineer
Listed on 2026-01-24
-
IT/Tech
Systems Engineer, Cloud Computing, IT Project Manager
Overview
Job Category: Engineer
Job Type: Onsite
Job Location: District of Columbia Washington
Compensation: Depends on Experience
W2: W2-Contract Only;
Kindly note that applications on a C2C basis will not be considered for this role.
Job Description: We are seeking a high-caliber Senior Dev Ops/SRE Engineer to join a mission-critical team in Washington, DC, focused on maturing our cloud-native ecosystem. In this role, you will bridge the gap between development and operations by architecting resilient infrastructure, implementing robust SRE principles like SLOs and error budgeting, and driving a “Security-First” CI/CD culture. As a technical leader, you will not only manage high-availability AWS environments and Kubernetes orchestration but also serve as a force multiplier for our engineering teams by building self-service platforms and automated governance tools that ensure operational excellence and cost-efficiency.
Key Responsibilities- Reliability & Incident Management:
Define and maintain SLOs/SLIs, manage error budgets, and lead high-level incident response and blameless postmortem analyses. - Infrastructure as Code (IaC):
Architect and maintain secure, scalable environments using Terraform, Ansible, and Cloud Formation to ensure repeatable deployments. - CI/CD & Deployment Strategy:
Design secure delivery pipelines (Git Hub Actions/Jenkins) incorporating automated rollbacks, canary releases, and blue-green deployment patterns. - Comprehensive Observability:
Build and manage full-stack telemetry pipelines, dashboards, and alerting systems using Prometheus, Grafana, Datadog, or ELK. - Sec Dev Ops Integration:
Enforce security-as-code by integrating SAST/DAST, secrets scanning, and SBOM validation into the automated software development lifecycle. - Efficiency & Fin Ops:
Monitor cloud spend trends and implement right-sizing strategies to ensure high performance at an optimal cost-to-value ratio. - Internal Enablement:
Develop shared playbooks, reusable automation modules, and self-service tools to boost developer velocity and reduce friction. - Technical Leadership:
Mentor cross-functional teams and establish organization-wide best practices for fault tolerance and operational readiness.
- Education:
Bachelor’s degree in Computer Science, Engineering, or a related technical field. - Experience:
5+ years in Dev Ops, SRE, or Platform Engineering, including leadership experience in production automation. - Cloud Expertise: 3+ years of hands-on experience managing high-availability production environments, specifically within AWS (IAM, Networking, Compute).
- Containerization:
Deep proficiency with Kubernetes, Docker, and Linux systems administration. - Tooling Mastery:
Advanced experience with Terraform, Git Ops patterns, and CI/CD security tollgates. - Automation
Skills:
Strong scripting proficiency in Python, Go, or Bash for custom tool development. - Operational Mindset:
Proven track record in chaos engineering, capacity modeling, and managing complex observability stacks. - Communication:
Exceptional ability to document technical architectures and lead collaborative engineering initiatives.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).