×
Register Here to Apply for Jobs or Post Jobs. X

Site Reliability Engineering Manager

Job in Miami, Miami-Dade County, Florida, 33222, USA
Listing for: Tata Consultancy Services
Full Time position
Listed on 2026-01-11
Job specializations:
  • IT/Tech
    IT Support, Cloud Computing
Job Description & How to Apply Below

2 days ago Be among the first 25 applicants

Base pay range

$/yr - $/yr

Desired Experience Range- 10 + Years

Location of Requirement
- Miami, FL, USA

Required Technical Skill Set

.NET/Java, Service Now, App Dynamics, Pager Duty, Grafana, Splunk, Azure ADO, Git Hub, Maven, Gradle, Selenium, JMeter, Postman, Sonar Qube, Google Analytics, Figma, Swagger, Giga Fox, Ready API, Load Runner, Pager Duty, Slack

Must-Have Competencies
  • Experience in .NET and Java frameworks.
  • Proven leadership managing SRE and Dev Ops teams.
  • Incident and problem management using Service Now.
  • Expertise in Observability:
    App Dynamics, Pager Duty, Grafana, Splunk.
  • Deep understanding of CI/CD with Azure ADO, Git Hub, Maven, Gradle.
  • Automated regression and performance testing experience with Selenium, JMeter.
  • Experience building self-healing systems.
  • Strong skills in root cause analysis (RCA) and problem identification.
  • Ability to define and enforce SLAs and response metrics.
  • Document and maintain version-controlled knowledge repositories.
  • Exposure to self-healing systems in SRE or Dev Ops context.
  • Self-Healing
  • Ticketing Automation
  • Observability Dashboard
  • Proactive incident & problem analysis framework
  • Dynamic Thresholds
  • Test Data Quality & Automation
Nice-to-have Competencies
  • Certifications in AWS/GCP/Azure
  • Experience working in a Travel/Tourism industry
Responsibilities / Expectations
  • Lead implementation of automation and self-healing deliverables across teams.
  • Design and drive observability dashboards and proactive alert systems.
  • Define, track and report SLA adherence, MTTR, and ticket lifecycle metrics.
  • Establish an incident audit and RCA review processes with version control.
  • Coordinate cross-functional teams in incident escalation and resolution.
  • Mentor SRE engineers and drive continuous improvement efforts.
  • Ensure usage of standardized templates for knowledge management and RCA.
  • Implement risk-based testing strategies and shift-left practices.
Seniority level
  • Mid-Senior level
Employment type
  • Full-time
Job function
  • Information Technology
Industries
  • IT Services and IT Consulting
#J-18808-Ljbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary