Senior HPC Linux Systems Engineer
Listed on 2026-01-12
-
IT/Tech
Systems Engineer
Overview
The High-Performance Computing Systems Section within the National Center for Computational Sciences (NCCS) is seeking a Senior HPC Linux Systems Engineer to join the HPC Infrastructure team. The preferred candidate will possess commensurate knowledge, skills and abilities in addition to relevant education, certifications, experience and demonstrated ability to work as a member of a team.
NCCS provides state-of-the-art computational and data science infrastructure coupled with dedicated technical and scientific professionals tackling large-scale problems across a broad range of scientific domains for accelerating scientific discovery and engineering advances. NCCS hosts the Oak Ridge Leadership Computing Facility (OLCF), one of the Department of Energy s (DOE) National User Facilities which operates Frontier, the nation s first exascale supercomputer.
MajorDuties/Responsibilities
Systems Administration:
- Lead the architecture and deployment of HPC-scale services
- Create and maintain internal documentation of system architectures, configurations and procedures
- Serve as the highest tier of support for complex issues, providing quick and efficient resolution
- Develop, maintain and review high quality code for internal tools using programming languages such as Python, Golang, or Rust
Virtualization and Automation:
- Design, deploy and manage resources in the NCCS VMware environment
- Identify potential automation targets and lead efforts to automate processes
- Define policies and procedures for automation and configuration management for the team and organization as a whole
Identity Management and Security:
- Design and administration of RSA Secure
ID and Ping Federate servers - Deploy, configure and support identity and access management services such as single-sign on (SSO), OAuth, two-factor auth, zero trust, etc...
Project Management and Leadership:
- Lead infrastructure projects through all phases from planning to design, implementation and support
- Mentor and train junior staff, creating training documentation, holding knowledge sharing sessions, and fostering skill growth throughout the team
- Propose and implement improvements to existing infrastructure systems as well as new systems, processes and procedures
- Bachelor's degree in computer science or closely related field and a minimum of 7 years of experience in Linux systems administration, or a Master s Degree and a minimum of 4 year of experience in Linux systems administration. An equivalent combination of education and experience will be considered.
- Excellent interpersonal/communication skills and the ability to work within a team
- Strong experience in Identity Management, supporting SSO, OAuth, two-factor authentication primarily in Ping Federate and RSA Secure
ID. Entra a bonus. - Strong working knowledge of Linux system fundamentals and common network protocols
- Programming and scripting skills in common languages such as Python and bash
- Understanding of versioning and code review tools like Git Hub and Git Lab
- Experience implementing and supporting highly-available systems and services
- Experience with configuration management tools such as Puppet or Ansible
- Experience deploying and maintaining virtual environments using VMware
- Experience deploying, maintaining and troubleshooting a variety of infrastructure services such as OpenLDAP, DNS, DHCP, etc...
- Ability to plan, prioritize and complete assigned projects with minimal supervision
For employment at Oak Ridge National Laboratory (ORNL), a Real form of identification will be required. ORNL is subject to Department of Energy (DOE) access restrictions. All employees must be able to obtain and maintain a federal Personal Identity Verification (PIV) card as mandated by Homeland Security Presidential Directive 12 (HSPD-12) and DOE Order 473.1A, which requires a favorable post-employment background investigation.
To obtain this credential, new employees must successfully complete and pass a Federal Tier 1 background check investigation. This investigation includes a declaration of illegal drug activities, including use,…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).