More jobs:
Senior DevOps/SRE Engineer
Job in
Washington, District of Columbia, 20022, USA
Listed on 2026-01-24
Listing for:
JSM Consulting Inc.
Full Time
position Listed on 2026-01-24
Job specializations:
-
IT/Tech
Systems Engineer, Cloud Computing, SRE/Site Reliability
Job Description & How to Apply Below
We are seeking a high-caliber Senior Dev Ops/SRE Engineer to join a mission-critical team in Washington, DC, focused on maturing our cloud-native ecosystem. In this role, you will bridge the gap between development and operations by architecting resilient infrastructure, implementing robust SRE principles like SLOs and error budgeting, and driving a "Security-First" CI/CD culture. As a technical leader, you will not only manage high‑availability AWS environments and Kubernetes orchestration but also serve as a force multiplier for our engineering teams by building self‑service platforms and automated governance tools that ensure operational excellence and cost‑efficiency.
Key Responsibilities- Reliability & Incident Management:
Define and maintain SLOs/SLIs, manage error budgets, and lead high‑level incident response and blameless post‑mortem analyses. - Infrastructure as Code (IaC):
Architect and maintain secure, scalable environments using Terraform, Ansible, and Cloud Formation to ensure repeatable deployments. - CI/CD & Deployment Strategy:
Design secure delivery pipelines (Git Hub Actions/Jenkins) incorporating automated rollbacks, canary releases, and blue‑green deployment patterns. - Comprehensive Observability:
Build and manage full‑stack telemetry pipelines, dashboards, and alerting systems using Prometheus, Grafana, Datadog, or ELK. - Sec Dev Ops Integration:
Enforce security‑as‑code by integrating SAST/DAST, secrets scanning, and SBOM validation into the automated software development lifecycle. - Efficiency & Fin Ops:
Monitor cloud spend trends and implement right‑sizing strategies to ensure high performance at an optimal cost‑to‑value ratio. - Internal Enablement:
Develop shared playbooks, reusable automation modules, and self‑service tools to boost developer velocity and reduce friction. - Technical Leadership:
Mentor cross‑functional teams and establish organization‑wide best practices for fault tolerance and operational readiness.
- Education:
Bachelor's degree in Computer Science, Engineering, or a related technical field. - Experience:
5+ years in Dev Ops, SRE, or Platform Engineering, including leadership experience in production automation. - Cloud Expertise: 3+ years of hands‑on experience managing high‑availability production environments, specifically within AWS (IAM, Networking, Compute).
- Containerization:
Deep proficiency with Kubernetes, Docker, and Linux systems administration. - Tooling Mastery:
Advanced experience with Terraform, Git Ops patterns, and CI/CD security tollgates. - Automation
Skills:
Strong scripting proficiency in Python, Go, or Bash for custom tool development. - Operational Mindset:
Proven track record in chaos engineering, capacity modeling, and managing complex observability stacks. - Communication:
Exceptional ability to document technical architectures and lead collaborative engineering initiatives.
Position Requirements
10+ Years
work experience
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×