InfiniBand - L3

Job in 400001, Mumbai, Maharashtra, India
Listing for: Yotta Data Services Private Limited
Full Time position
Listed on 2026-02-04
Job specializations:
  • IT/Tech
    Systems Engineer, Network Engineer, Data Engineer, Cloud Computing
Job Description & How to Apply Below
Yotta Data Services Private Limited

Datacenter | Cloud | Managed IT | Network & Connectivity | Application Modernization | Cyber Security

CSPs and hyperscalers around the world are using InfiniBand products to revolutionize deep learning and data analytics, and to power data centers. Join the team building many of the largest and fastest AI/HPC systems in the world!

About the Role
We are looking for someone who can work on a dynamic, customer-focused team and who has excellent interpersonal skills. In this role you will interact with customers, partners, and internal teams to analyze, define, and implement large-scale networking projects. The scope includes a combination of networking, system design, and automation, as well as being the face of the team to the customer.

Responsibilities
8 to 12 years of relevant experience
Primary responsibilities include maintaining the InfiniBand interconnect for AI/HPC infrastructure
Day-to-day operations include diagnosing the InfiniBand fabric, collecting and analysing logs, and resolving issues
Work closely with the server operations team
Support operational and reliability aspects of large-scale AI clusters, focusing on performance at scale, real-time monitoring, logging, and alerting.
Engage in and improve the whole lifecycle of services—from inception and design through deployment, operation, and refinement.
Maintain services once they are live by measuring and monitoring availability, latency, and overall system health.
Work with OEMs by opening support tickets and documenting workarounds

Qualifications

8-12 years of professional experience in networking fundamentals, the TCP/IP stack, InfiniBand fundamentals, and data center architecture

Required Skills
Proficiency in configuring, testing, validating, and resolving issues in InfiniBand networks, especially in medium to large-scale HPC/AI environments.
Advanced knowledge of HPC/AI networking protocols.
Hands-on experience with InfiniBand network switch/router platforms.
Strong focus on customer needs and satisfaction.
Self-motivated with leadership skills to work collaboratively with customers and internal teams.
Strong written, verbal, and listening skills are essential.
InfiniBand certification and storage operational experience managing large HPC clusters with IB as the interconnect.
Knowledge of Mellanox OS, Cumulus Linux, or SONiC, or networking certifications.
Knowledge of link-level performance and diagnostics.

Experience with high-performance computing architectures.

Experience with GPU (Graphics Processing Unit) focused hardware/software.
Knowledge of cluster and GPFS management technologies.

Preferred Skills
Bonus credit for experience with node provisioning software such as Base Command Manager, xCAT, or HPCM.

Qualification
MS/PhD or equivalent experience in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, or related fields.