×
Register Here to Apply for Jobs or Post Jobs. X

HPC​/AI Infrastructure Engineer - KSA; Onsite

Job in Riyadh, Riyadh Region, Saudi Arabia
Listing for: K20s - Kinetic Technologies Private Limited
Seasonal/Temporary position
Listed on 2025-12-06
Job specializations:
  • IT/Tech
    Systems Engineer, Cloud Computing
Salary/Wage Range or Industry Benchmark: 200000 - 300000 SAR Yearly SAR 200000.00 300000.00 YEAR
Job Description & How to Apply Below
Position: HPC/AI Infrastructure Engineer - KSA (Onsite)

Job Role:

HPC/AI Infrastructure Engineer

Experience:

5+ years

Location:

KSA
- Saudi Arabia

Contract Duration: 1 year

Seniority Level: Mid-Senior level

Employment Type:

Contract

Job Function:
Information Technology

Overview

We are seeking a highly skilled HPC/AI Infrastructure Engineer to design, deploy, and manage advanced computing environments leveraging NVIDIA technologies, Kubernetes, and Linux systems. This role is critical to ensuring the performance, scalability, and reliability of AI workloads across GPU‑accelerated clusters.

Key Responsibilities
  • Deploy, configure, and manage NVIDIA Base Command Manager for orchestrating GPU workloads (critical).
  • Implement and maintain NVIDIA AI Enterprise Suite to support enterprise‑grade AI frameworks.
  • Operate and optimize NVIDIA GPU and Network Operators within Kubernetes environments.
  • Utilize NVIDIA NIMs and Blueprints to streamline AI model deployment and infrastructure automation.
  • Administer and scale Slurm workload manager for HPC job scheduling (critical).
  • Manage vanilla Kubernetes clusters, ensuring high availability and resource efficiency.
  • Maintain and secure systems running on Canonical Ubuntu OS, including patching and performance tuning.
Required

Skills & Qualifications
  • Strong expertise with NVIDIA GPU technologies and AI infrastructure.
  • Hands‑on experience with Slurm in HPC environments.
  • Proficiency in Kubernetes cluster administration.
  • Deep knowledge of Linux (Ubuntu) system administration.
  • Familiarity with network operators and GPU scheduling in containerized environments.
  • Ability to troubleshoot complex distributed systems.
Preferred Skills
  • Experience with automation tools (e.g., Ansible, Terraform).
  • Knowledge of cloud‑native architectures and hybrid HPC/AI deployments.
  • Familiarity with observability tools (Prometheus, Grafana).
  • Background in AI/ML workflows and performance optimization.
Work Environment
  • Collaborative team working on cutting‑edge AI and HPC solutions.
  • Opportunity to shape infrastructure supporting enterprise‑scale AI workloads.
  • Exposure to NVIDIA’s latest ecosystem of AI and GPU technologies.
#J-18808-Ljbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary