
Senior System Administrator (HPC)

Job in Bengaluru, 560001, Bangalore, Karnataka, India
Listing for: Tata Consultancy Services
Full Time position
Listed on 2026-02-05
Job specializations:
  • IT/Tech
    Cloud Computing, IT Support, Systems Administrator, Systems Engineer
Job Description & How to Apply Below
Position: Senior System Administrator (HPC)
Location: Bengaluru

Role:
Sr. HPC Administrator

Desired Experience Range: 7 - 12 yrs

Notice Period:
Immediate to 60 Days only

Location of Requirement:
Bangalore

JOB DESCRIPTION

● Strong experience in providing support for Linux HPC clusters.

● Strong working knowledge of the following:

o IBM Platform LSF 9 and 10 administration.

o Red Hat Enterprise Linux administration.

o Lustre parallel file system.

o Mellanox InfiniBand connectivity.

o Cluster manager administration (HPCM or xCAT).

o SSSD & NIS Authentication mechanisms.

o Bash & Python scripting.

o Ansible playbooks.

● Experience with Abaqus and CFD applications (Fluent, StarCCM+, etc.).

● Strong knowledge of application installations and version management on shared file systems.

● IT infrastructure technical operations management under the ITIL framework.

● Security compliance and remediation management.

Intermediate Level

● DevOps, ITIL, Agile, SAFe (certifications are desirable)

Responsibilities

● Installation, configuration, troubleshooting and administration of Linux HPC clusters (compute, storage, and network) and applications in support of CAE environments.

● Monitor and analyze LSF job queues and resource utilization to optimize workload management.

● Troubleshoot and resolve any issues with LSF and its components, including master servers, compute nodes, and resource managers.

● Collaborate with users to understand their HPC requirements and design LSF job workflows to meet their needs.

● Develop and maintain LSF documentation, including standard operating procedures, installation guides, and troubleshooting procedures.

● Develop and maintain LSF scripts for automation and task scheduling.

● Diagnose and troubleshoot complex RHEL OS, application and HPC cluster technical problems.

● Interact with hardware and software vendors for external support.

● Develop and maintain technical solution documents (TSDs) and standard operating procedures (SOPs).

● Keep all HPC infrastructure systems, servers, and devices up to date and in working condition to support business continuity.

● Design and implement HPC network topology, including Mellanox connectivity.

● Create and maintain HPC capacity plans and periodic cluster utilization reports.

● Troubleshoot Abaqus, StarCCM+ and Fluent applications, and resolve any issues in a timely manner.

● Develop and maintain Python and Bash scripts for automation and task scheduling.
Position Requirements
10+ Years work experience