Data Center Site Reliability Engineer
Listed on 2026-02-28
-
IT/Tech
Systems Engineer, Cloud Computing, IT Support
Overview
We Are:
At Synopsys, we drive the innovations that shape the way we live and connect. Our technology is central to the Era of Pervasive Intelligence, from self-driving cars to learning machines. We lead in chip design, verification, and IP integration, empowering the creation of high-performance silicon chips and software content. Join us to transform the future through continuous technological innovation. This team handles design, vision, and driving major initiatives for Data Center and cloud.
We support a global network consisting of 100+ office locations, 12 data centers, 30MW of Colocation capacity, and a growing cloud presence.
You Are:
You are a motivated and passionate Site Reliability Engineer, technically savvy with a strong affinity for Data Centers. You bring a deep understanding of troubleshooting and problem-solving, with a proven knack for fixing servers and maintaining high-performance infrastructure. Your excellent communication skills allow you to collaborate seamlessly with cross-functional teams and confidently represent your work to senior leadership. You thrive in a matrixed, international, and team-oriented environment, adept at managing priorities and aligning stakeholders across a diverse organization.
Your experience as a process-oriented individual contributor prepares you for the new wave of transformation underway at Synopsys’s Dallas Data Center. You possess a drive for operational excellence, a commitment to reliability, and a keen eye for detail, ensuring that every aspect of the data center functions at its best. Your ability to adapt, learn, and apply best practices sets you apart, and your enthusiasm for innovation and automation fuels your desire to make a lasting impact.
If you value collaboration, continuous improvement, and the opportunity to work on cutting-edge data center technologies, you will thrive in this role and contribute to Synopsys’s global success.
What You’ll Be Doing:
- Oversee all aspects of Data Center’s critical infrastructure, ensuring high-quality execution of all work performed within the DC space.
- Monitor support queues, address tickets promptly, and communicate updates clearly and succinctly.
- Coordinate with colocation and external vendors to complete engineering, maintenance, and outsourcing tasks in compliance with Data Center operation requirements.
- Develop, maintain, and govern high-quality technical documentation, including network architectures, deployment playbooks, policies, standards, and guidelines for effective knowledge sharing and AI-assisted workflows.
- Assist in the design and adoption of AI-enabled capabilities—such as predictive analytics, anomaly detection, and automated incident response—to proactively identify, prevent, and resolve complex issues.
- Maintain asset inventory in DCIM, performing shipping and receiving duties including RMA activities.
- Perform regular maintenance tasks, such as cleaning, inspecting, and replacing components to prevent problems and ensure equipment longevity.
- Assist with diagnosing, fixing, and installing server components, providing expertise in hardware reliability engineering and break/fix for servers and systems.
- Implement automation strategies to streamline operations and reduce manual intervention.
- Track system performance and availability, implementing proactive measures to prevent downtime.
- Diagnose and resolve issues related to Linux OS, services, and system components.
- Participate in Data Center capacity management forums and Data Center on-call schedules.
The Impact
You Will Have:
- Strengthen Synopsys’s Data Center reliability and operational excellence, directly supporting critical engineering and business workloads.
- Reduce downtime and operational risks by proactively maintaining and optimizing infrastructure.
- Enable faster incident resolution and improved service quality through AI-driven monitoring and response.
- Enhance asset management and lifecycle tracking, ensuring data accuracy and compliance.
- Support innovation and adoption of advanced automation, contributing to greater efficiency and scalability.
- Foster cross-team collaboration and knowledge sharing, improving productivity…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).