
SiteOps Data Center Production Operations Engineer

Job in Aurora, Kane County, Illinois, 60505, USA
Listing for: META
Full Time position
Listed on 2026-01-12
Job specializations:
  • Engineering
    Systems Engineer, Data Engineer
  • IT/Tech
    Systems Engineer, Data Engineer
Job Description & How to Apply Below

Meta is seeking a forward-thinking, experienced engineer to join the Production Operations team within our Data Centers. These Data Centers are the foundation upon which our rapidly scaling infrastructure efficiently operates and upon which our innovative services are delivered. Meta is at the leading edge of the global data center industry in terms of both how data centers are designed and how they are operated.

This person should enjoy working in a fast-paced, technical environment where adaptability and flexibility will be key to their success. We seek an IT professional with advanced, hands-on technical skills in server hardware and Linux, ideally in a Data Center environment. Broad knowledge of server administration and experience participating in projects in a large-scale distributed data center environment are core competencies for this role.

The candidate should also have working knowledge and experience in a few of the following core areas:
Hardware repair, OS management, Tooling and Automation, Networking, or Technical Project Management.

SiteOps Data Center Production Operations Engineer Responsibilities:

Support platform health by resolving and closing tickets while addressing the underlying issue (i.e., the root cause), including, but not limited to, remote troubleshooting and physical inspection of services in data halls

Participate in root cause analysis of highly technical issues within the data center, ranging from automated tooling to hardware failures and network issues

Collaborate with cross-functional teams on projects and initiatives related to topics such as process, hardware and automation

Serve as the point of contact for the introduction of new platforms and hardware to the site, in collaboration with partners and global resources, accelerating the time it takes to bring these products to sustained mass production

Use tools and data analysis effectively to identify issues. Take actions to communicate with all stakeholders appropriately and manage or elevate as needed

Identify corrective actions for hardware issues, working with internal teams and vendors

Influence future design changes to ensure ease of serviceability

Solve systemic hardware and/or software issues at scale using scripting, automation, and tooling to drive global resolution

Continuously evaluate and identify areas for improvement in processes, tools, and systems to optimize efficiency and quality of repairs

Use data analytics to drive maximum server up-time and utilization rates, understanding hardware failure rates and service level agreements

Support and train team members to evaluate and identify better ways to resolve issues, and define updates to tools and processes

Provide engineering support and be a go-to technical resource for the team, leadership, and cross-functional teams in operating and maintaining data center servers

Maintain and update documentation, i.e., procedures, runbooks, and guides

Build cross-functional relationships and influence policies and procedures that improve global data center operations

Participate in 24/7 on‑call rotation

Travel up to 15% of the time

Minimum Qualifications:

BS, BA or BEng in technical field or commensurate experience

5+ years of technical IT experience within an infrastructure environment, in a role such as Systems Administrator, Dev Ops Engineer, or Site Reliability Engineer

-level understanding in Linux (or equivalent OS) in a complex IT environment with the capacity to triage, debug, and troubleshoot server issues

Hands‑on experience and knowledge of server hardware and components, including storage

Intermediate-level knowledge of the interdependencies of data center functions and technologies including electrical, cooling, structured cabling, security, and network

Experience managing technical issues and driving to the root cause

Experience participating in technical projects related to areas such as process improvement, technology, and/or automation

Capacity to communicate effectively, in a clear and concise manner, appropriately tailoring messages to the audience

Intermediate-level knowledge of technologies such as HTTP, DNS, RAID, and DHCP

Experience in providing…
