Director, Site Reliability Engineering

Job in New York, New York County, New York, 10261, USA
Listing for: NBCUniversal
Full Time position
Listed on 2026-01-15
Job specializations:
  • IT/Tech
    IT Project Manager, Cloud Computing, Systems Engineer
Location: New York

NBCUniversal is one of the world’s leading media and entertainment companies. We create world‑class content, distribute it across film, television, and streaming, and bring it to life through global theme parks and consumer products.

Job Description
  • As a member of NBCUniversal’s Production Software Engineering team, lead and perform custom architectural design, implementation, monitoring, and maintenance for a portfolio of production application environments.
  • Responsible for hands‑on configuration and support as well as managing the work of other architects and engineers.
  • Work closely with our Principal Software Engineer on technical architecture and design based on customer product requirements, translating product requirements to technical designs and implementations.
  • Collaborate with cross‑functional team members such as Scrum Leads, Software Engineers, QA Engineers, UX Designers, Product Managers, other Architects & Site Reliability Engineers (Contractors and/or Staff), and third‑party vendors.
  • Effectively delegate responsibilities to team members, mentoring and providing them with repeatable processes, and verifying the quality of their work.
  • Utilize metrics to measure accomplishments and monitor progress, ensuring milestones and projects are completed on time.
  • Communicate progress and the impact of solutions in technical terms to technology partners and in business terms to business partners.
  • Establish a reputation as the subject matter expert for every tech stack used in Production Software Engineering applications and how they all fit together while keeping current with new technologies, developing innovative technical ideas, and generating proposals.
  • Work with product teams to learn business objectives, development teams to plan platform needs, QA to understand test strategy, and SRE on environments and deployments.
  • Participate in Scrums, demos, and other Agile ceremonies and ensure accurate and timely status updates to the team.
  • Serve as primary interface with the NBCU Cyber Security team for all security‑related initiatives, patching, remediations, etc.
  • Hands‑on commissioning, configuration, administration, documentation, and support for all on‑prem & cloud (AWS) environments (Servers, Storage, Databases, Networking, Security, etc.).
  • Technical impact analysis, implementation, and monitoring of all cyber, technology audit, enterprise engineering, & IT (Databases, Monitoring, etc.) activities related to Production Software Engineering applications and platforms.
  • Create and manage CI/CD pipelines using tools like CloudFormation, Foreman, Jenkins, Nexus, Rundeck, Ansible, and Puppet.
  • Lead implementation of a monitoring and reporting framework using tools like Grafana, Influx, Graylog/Splunk, Selenium, New Relic, and Icinga.
  • Recognize and identify potential technical impacts of enterprise change controls which could affect our applications and customers.
  • Help improve performance, scalability, and reliability.
  • Build and maintain distributed infrastructure and automation.
  • Solve problems quickly and automate processes for the future.
  • Direct management of other engineers and architects (Contractors and/or Staff).
  • 24x7x365 availability for production outages, emergencies, and deployments.
  • 100% telecommuting is permitted for this role.
Qualifications
  • Bachelor’s degree in Computer Science, Information Technology, or related field (or foreign degree equivalent), plus 10 years of experience as a Software Architect or in a related occupation.
  • Hands‑on systems engineering experience on Linux/Unix platforms.
  • Experience with technical leadership and people management.
  • Experience with Continuous Delivery and SDLC practices.
  • Understanding of DevOps principles, and experience with operational tools (Ansible, Puppet, Chef, Terraform) and best practices for infrastructure (on‑prem or cloud) and software deployment.
  • Operational experience with large‑scale applications.
  • Experience with NoSQL data stores (MarkLogic, MongoDB, Cassandra, DynamoDB, Couchbase, PostgreSQL, etc.).
  • Experience with a broad range of enterprise technologies.
  • Experience building real‑time, large‑scale, low‑latency distributed…