Site Reliability Engineer
Job in Atlanta, Fulton County, Georgia, 30383, USA
Listed on 2026-02-28
Listing for: TrueSkilla
Full-time position
Job specializations:
- IT/Tech: Cloud Computing, SRE/Site Reliability, Systems Engineer, IT Support
Job Description
Job Title: Site Reliability Engineer (SRE) – Azure | Banking Domain
Experience Required: 7–12 Years
Location: Atlanta, GA (Hybrid)
We are looking for an experienced Site Reliability Engineer (SRE) with strong expertise in Microsoft Azure and prior experience in the Banking or Financial Services industry.
This role focuses on building reliable, scalable, and secure cloud infrastructure while ensuring high availability and performance of mission-critical banking applications.
The ideal candidate understands production environments in regulated industries and can balance reliability with speed of delivery.
Key Responsibilities
- Design, implement, and manage highly available and scalable infrastructure on Azure.
- Improve system reliability, performance, and uptime through automation and monitoring.
- Build and maintain CI/CD pipelines for cloud-native and enterprise applications.
- Define and manage SLIs, SLOs, and SLAs.
- Implement observability solutions (logging, monitoring, alerting).
- Support incident management, root cause analysis, and post-mortem reviews.
- Automate infrastructure provisioning using Infrastructure as Code (IaC).
- Ensure compliance with banking security and regulatory standards.
- Collaborate with DevOps, development, and security teams.
Qualifications
- 7–12 years of experience in SRE / DevOps / Production Engineering roles.
- Experience with Infrastructure as Code (Terraform, ARM, or Bicep).
- Knowledge of CI/CD tools (Azure DevOps, Jenkins, GitHub Actions).
- Strong scripting skills (PowerShell, Python, or Bash).
- Experience with containerization and orchestration (Docker, Kubernetes).
- Experience working in Banking / Financial Services environments.
- Strong understanding of security, compliance, and risk management in regulated industries.
- Experience with monitoring tools (Azure Monitor, Prometheus, Grafana, Splunk, etc.).
- Exposure to high-availability and disaster recovery architecture.
- Knowledge of ITIL processes and incident management frameworks.
- Certifications in Azure (AZ-104, AZ-400, etc.) are a plus.