Engineer – Network Observability Platform and Automation
Listed on 2026-01-12
-
IT/Tech
Cloud Computing, Systems Engineer
Overview
Manager – Network Observability Platform and Automation – Digital Realty
Location:
Austin, Boston, Dallas, Ashburn, Chicago
In this role you will be responsible for oversight of Digital Realty’s Observability stack. The ideal candidate combines network engineering, network operations, and software understanding with engineering principles. You will focus on delivering operational discipline and embracing key operational principles including automation, agile development, and scripting.
- Team Leadership
- Manage and mentor a team of SREs, fostering their growth and development.
- Set team goals, prioritize projects, and ensure alignment with organizational objectives.
- Conduct performance reviews and provide constructive feedback.
- Build a positive and collaborative team environment.
- Technical Oversight
- Oversee the design, implementation, and maintenance of reliable infrastructure and services.
- Collaborate with other teams to define requirements, standards, and best practices.
- Identify and address performance bottlenecks and ensure system stability.
- Implement and improve monitoring and observability frameworks.
- Operational Excellence
- Manage on-call rotations and incident response to minimize downtime and ensure swift resolution.
- Drive automation efforts to reduce manual tasks and improve efficiency.
- Implement structured engineering and operations processes.
- Analyze and evaluate existing processes to identify opportunities for improvement.
- Strategic Planning
- Develop and implement the long-term reliability strategy for the organization.
- Make decisions about build vs. buy for tools and technologies.
- Ensure alignment with business goals and customer expectations.
- Manage relationships with vendors and other stakeholders.
- Communication and Collaboration
- Act as a bridge between technical teams and other departments.
- Represent the SRE team to stakeholders and communicate effectively.
- Collaborate with other engineering teams to ensure efficient workflows.
- Foster a blameless postmortem culture and continuous learning.
- Strong technical background in distributed systems, cloud computing, and related technologies.
- Proven experience in managing and mentoring technical teams.
- Excellent problem-solving and communication skills.
- Experience with monitoring, automation, and incident management.
- Understanding of SLOs, SLIs, and SLAs.
- Familiarity with Dev Ops and Agile practices.
- 10+ years of operations and engineering experience
- 5+ years of team building and management
- 3+ years of network engineering in large scale data center environments
- Bachelor’s degree in computer science (or equivalent training) preferred
- Expertise in Layer 3 routing (BGP, IS-IS, etc) and Layer 2 switching (802.1Q, STP, etc) protocols
- Experience with virtual networking concepts such as EVPN, VXLAN, Open vSwitch
- Experience working with automation tools (Ansible, Terraform, etc)
- Comfort with Python (or equivalent language)
- Strong experience working with Linux systems and tools
- Experience with virtual routing in Linux with FRR or similar software preferred
- Experience with AWS preferred
- A basic understanding of software development tools (Git Hub, Jenkins, etc) and software development practices
- Ability to understand high-level network design and its impacts across the infrastructure
- Ability to work independently on complex and unique enterprise engineering projects
- Strong analytical and troubleshooting skills
- Strong communication skills
Digital Realty brings companies and data together by delivering the full spectrum of data center, colocation and interconnection solutions. Platform
DIGITAL, the company’s global data center platform, provides customers with a secure data meeting place and a PDx solution methodology for powering innovation and efficiently managing Data Gravity challenges. Digital Realty gives its customers access to the connected data communities that matter to them with a global data center footprint of 300+ facilities in 50+ metros across 28 countries on six continents.
We Can Offer You
Our rapidly evolving business sector offers the opportunity to be part of a courageous and passionate team who work…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).