Job Description & How to Apply Below
About T-Mobile:
T-Mobile US, Inc. (NASDAQ: TMUS), headquartered in Bellevue, Washington, is America’s supercharged Un-carrier, connecting millions through its strong nationwide network and flagship brands, T-Mobile and Metro by T-Mobile. Customers benefit from an unmatched combination of value, quality, and exceptional service experience.
About TMUS Global Solutions:
TMUS Global Solutions is a world-class technology powerhouse accelerating the company’s global digital transformation. With a culture built on growth, inclusivity, and global collaboration, the teams here drive innovation at scale, powered by bold thinking.
TMUS India Private Limited operates as TMUS Global Solutions.
Sr. Engineer, Systems Reliability – Privacy (L08)
About the Role:
Senior Engineer, Systems Reliability (SRE) - Privacy ensures the stability, performance, and reliability of IT services and infrastructure. This role combines software engineering and operations expertise to build and maintain highly available, scalable systems. As a leader in Dev Ops and cloud reliability practices, the engineer supports continuous improvement of automation, deployment pipelines, observability, and incident management, while mentoring junior engineers and optimizing production workflows.
The position plays a critical part in enabling software to be delivered faster, better, and more reliably to support business and customer needs.
What You’ll Do:
Build and maintain CI/CD pipelines for data engineering deployments using Git Lab and Azure Dev Ops
Design and maintain CI/CD pipelines and Dev Ops automation solutions for REST APIs and microservices.
Implement robust monitoring, alerting, and logging for data pipelines, Snowflake and Azure services.
Respond to production incidents, troubleshoot failures and restore services quickly.
Perform root cause analysis and implement preventive measures.
Ensure high availability and disaster recovery planning for critical data systems.
Tune SQL queries, Snowflake features and Databricks clusters for optimal performance and cost efficiency.
Automate operational tasks to improve deployment reliability and reduce manual intervention.
Manage secrets and credentials using Azure Key Vault and Cyber Ark.
Hands-on experience with Terraform, Helm, or Ansible for infrastructure provisioning
Working knowledge of containerization (Docker) and Kubernetes orchestration
Hands-on experience with cloud platforms (Azure; AWS or GCP)
Understanding of deployment strategies (blue/green, rolling, canary), Git Ops, and artifact management
Ensure compliance with data governance, privacy regulations and organizational security standards.
Work closely with data engineers, analysts and cloud teams to ensure smooth operations.
Maintain detailed runbooks, operational documentation and incident reports.
Perform regular OS patching on Unix and Windows servers to address security vulnerabilities and maintain system stability.
Apply critical and cumulative updates for middleware components such as Oracle Data Integrator (ODI), Web Logic and related software to mitigate risks and enhance performance.
Coordinate patching schedules with application and infrastructure teams to minimize downtime and ensure business continuity.
What You’ll Bring :
Bachelor’s degree in computer science, Engineering, or equivalent practical experience
5–7 years of experience in systems reliability, software engineering, Dev Ops, or related technical roles
Experience working in Agile and Dev Ops delivery environments
Demonstrated ability to mentor engineers and influence technical outcomes
Strong problem-solving skills with a systems-level perspective
Must Have
Skills:
CI/CD tooling and automation experience (gitlab, azure devops, jenkins)
Experience working in public or private cloud environments
Proficiency in one or more programming or scripting languages (Python, Java, Shell, etc.)
Experience with monitoring, logging, and APM tools such as App Dynamics, Splunk, or equivalents
Strong understanding of system reliability concepts including scalability, performance, availability, and resilience
Strong experience in writing SQLs, analyzing logs and troubleshooting issues.
Databases : SQL…
Position Requirements
10+ Years
work experience
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
Search for further Jobs Here:
×