Site Reliability Engineer II

Job in New York, New York County, New York, 10001, USA
Listing for: The Walt Disney Company
Full Time position
Listed on 2026-03-05
Job specializations:
  • IT/Tech
    Cloud Computing, Systems Engineer, SRE/Site Reliability, IT Support
Location: New York

Job Summary:
The Walt Disney Company, a global leader in media and entertainment, is seeking a Site Reliability Engineer II to enhance the reliability and scalability of its critical systems. The role involves building automation, improving operational workflows, and collaborating with cross-functional teams to implement SRE principles across the organization.

Responsibilities:

• Contribute to the design, implementation, and improvement of systems to enhance reliability, scalability, and performance.

• Build and maintain automation for deployment, monitoring, alerting, and operational workflows.

• Collaborate with software engineering teams to implement SRE best practices, including SLIs, SLOs, error budgets, and automated remediation.

• Support CI/CD pipelines and participate in optimizing the software delivery lifecycle.

• Develop tools, dashboards, and instrumentation to improve observability across metrics, logs, and distributed tracing.

• Participate in incident response, root cause analysis (RCA), and corrective actions to prevent recurrence.

• Assist in capacity planning, performance tuning, and scaling strategies for distributed systems.

• Maintain and improve Infrastructure-as-Code (IaC) definitions and cloud environment configurations.

• Contribute to documentation, runbooks, architectural diagrams, and operational standards.

• Collaborate with cross-functional teams to identify reliability risks and recommend improvements.

• Participate in incident-based escalations and rotations to support high-availability production systems.

• Continuously evaluate system architecture, tools, and practices to drive operational excellence and efficiency.

Qualifications:

Required:

• Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent experience)

• 3+ years of experience in Site Reliability Engineering, DevOps, Platform Engineering, or a related discipline

• Hands-on experience with cloud platforms: AWS (preferred), GCP, Azure

• Proficiency in Python, Go, JavaScript, Bash, or equivalent scripting languages

• Working knowledge of Linux or Unix-based systems

• Experience with CI/CD systems (e.g., GitHub Actions, GitLab CI, Jenkins)

• Familiarity with Infrastructure-as-Code (Terraform, CloudFormation, etc.)

• Experience with containerization technologies such as Docker and Kubernetes

• Understanding of networking fundamentals, distributed systems, and system design basics

• Strong analytical and troubleshooting skills, including the ability to diagnose complex system issues

• Ability to work both independently and collaboratively

• Strong communication skills and the ability to collaborate effectively with cross-functional teams

Preferred:

• Hands-on experience with observability stacks (Prometheus, Grafana, ELK/EFK, Datadog, Splunk, New Relic)

• Exposure to GitOps tooling (Argo CD, Flux)

• Experience contributing to SLO/SLI frameworks and implementing error budgets

• Knowledge of service mesh architectures (Istio, Linkerd)

• Familiarity with performance testing and load testing tools

• Experience with message queues, event-driven systems, or distributed data platforms

• Cloud or DevOps-related certifications (AWS Associate/Specialty, GCP Professional, Kubernetes CKA/CKS)

• Experience working in large-scale enterprise environments or with distributed global teams

• Experience using modern AI-assisted development tools (e.g., Copilot, Cursor, or similar) to improve code quality, accelerate development, and enhance documentation

• Understanding of foundational AI/ML concepts, familiarity with cloud-native AI services such as model hosting, and/or the ability to use AI tools to automate cloud operations tasks

Company:
The Walt Disney Company started as a cartoon studio and has since expanded into sports coverage and television. Founded in 1923, the company is headquartered in Burbank, California, USA, and has more than 10,000 employees. The company is currently late stage.