Manager, Reliability Engineering, NA
Remote / Online - Candidates ideally in
Santa Clara, Santa Clara County, California, 95053, USA
Listed on 2026-01-12
Listing for:
Vantage Data Centers
Full Time, Remote/Work from Home position
Job specializations:
- Engineering
- Management
Job Description & How to Apply Below
Location: Santa Clara, California
Time type: Full time
Posted on: Posted Today
Time left to apply: End Date: January 14, 2026 (6 days left to apply)
Job requisition: R20975
**About Vantage Data Centers**
Vantage Data Centers powers, cools, protects and connects the technology of the world’s well-known hyperscalers, cloud providers and large enterprises. Developing and operating across North America, EMEA and Asia Pacific, Vantage has evolved data center design in innovative ways to deliver dramatic gains in reliability, efficiency and sustainability in flexible environments that can scale as quickly as the market demands.
**Reliability Engineering Department**
The Reliability Engineering Team is responsible for the overall operating health of critical systems across Vantage’s global facilities. For each of the major systems (Electrical, Mechanical, and Controls), the team is responsible for ensuring success in the commissioning stages of new construction, evaluating and improving the reliability and performance of existing critical infrastructure, sustaining equipment operational availability through maintenance program design, and providing ongoing technical support to the Site Operations Teams. The team also provides systems reliability and maintainability feedback to the Design Engineering teams for future design considerations.
**Position Overview**
This role can be based at our data center campus in the West and has remit over the West Coast data centers. This is a hybrid role with 3 days in the office and 2 days home-based. The role also requires 25% business travel.
The Manager of Reliability Engineering is a hands-on leadership role responsible for guiding a team of engineers in the execution of reliability-focused initiatives across Vantage’s data center operations. This role plays a key part in ensuring system uptime, performance, and operational excellence by applying reliability engineering principles and supporting the implementation of preventive and predictive maintenance programs. The Manager will work closely with cross-functional teams to drive improvements and ensure consistency in reliability practices.
**Essential Job Functions**
* Lead a team of reliability engineers in the day-to-day execution of reliability programs and initiatives.
* Support the implementation of reliability strategies that align with organizational goals and operational needs.
* Coordinate with Operations, Engineering, and Construction teams to ensure reliability considerations are integrated into facility design and maintenance planning.
* Oversee the execution of root cause analysis (RCA), failure mode effects analysis (FMEA), and other reliability tools to identify and address system vulnerabilities.
* Monitor system performance using data analytics and reliability metrics to identify trends and recommend improvements.
* Ensure team adherence to industry standards, safety protocols, and regulatory requirements related to reliability and maintenance.
* Collaborate with vendors and service providers to support reliability initiatives and ensure quality of service.
* Contribute to the development and standardization of reliability engineering processes across multiple sites.
* Provide coaching, feedback, and development opportunities to team members to build technical and leadership capabilities.
* Prepare and present operational updates and reliability reports to senior leadership as needed.
* Handle additional duties as assigned by Management.
**Job Requirements**
* Bachelor’s degree in Engineering (Mechanical, Electrical, or a related field) required.
* 5+ years of experience in reliability engineering, maintenance, or operations, preferably in mission-critical or data center environments.
* Experience leading or supervising technical teams in an engineering or operations setting.
* Working knowledge of reliability engineering principles and tools such as FMEA, RCA, and predictive maintenance.
* Familiarity with data analysis tools and monitoring systems used in…