Senior Software Engineer, Distributed Systems - NIM Factory
Listed on 2026-01-13
Software Development
Cloud Engineer - Software, Software Engineer, Senior Developer, DevOps
Join NVIDIA as a Senior Software Engineer, Distributed Systems – NIM Factory and help build the next generation of AI inference microservices.
NVIDIA is the platform upon which every new AI-powered application is built. We are seeking a senior engineer to design and build factory infrastructure and automation for NVIDIA Inference Microservices (NIMs). The right person brings technical drive and creativity to change the way NVIDIA optimizes and serves performant inferencing across heterogeneous cluster environments. Our NIM offerings are easy to use, highly performant, and tested in all deployment scenarios—from the cloud to customer self‑hosted infrastructure and locally on all NVIDIA GPUs.
You will apply your deep technical expertise to design an efficient, scalable, and reliable automated factory infrastructure that turns AI models into NIMs validated for best‑in‑class performance and accuracy.
NVIDIA is building a new category of products by intersecting our prowess in deep learning and computing with industry‑leading technologies. You will harness groundbreaking technologies and build a highly efficient factory that powers how NVIDIA builds and validates NIMs for inference everywhere. You will influence and drive technical advances in NVIDIA’s workflows and build the infrastructure that accelerates the delivery of every AI model on NVIDIA GPUs anywhere.
We are looking for technical talent to design and build our factory capabilities, including the underlying infrastructure, pipelines, backends, Docker builds, test harness, metrics, performance engineering, log ingestion, and more.
- Develop a factory pipeline that takes an AI model and produces a deployable service validated across cloud, on‑prem, and Kubernetes environments.
- Work with technical leaders to design and develop scalable and reliable factory components, collaborating with multiple AI model teams to understand requirements and build efficient infrastructure that improves every team’s productivity.
- Define metrics and drive improvements based on user feedback; mentor colleagues and collaborate with them so that both they and you keep growing.
- A history of using advanced programming skills to build distributed compute systems, backend services, microservices, and cloud technologies.
- Experience working effectively with multi‑functional teams, principals, and architects across organizational boundaries.
- A track record of mentorship and team growth, and the flexibility to adjust direction and expectations based on customer needs.
- Deep technical expertise in distributed containerized applications using Docker, Kubernetes, Cloud Endpoints, Helm, and Prometheus.
- Passion for building rich microservice applications and test automation pipelines.
- Excellent interpersonal skills and the ability to lead multi‑functional efforts.
- Proven experience debugging and analyzing the performance of distributed microservices or cloud systems.
- BS or MS in Computer Science, Computer Engineering or related field (or equivalent experience).
- 8+ years of experience in roles developing performant microservices, cloud software, and/or tooling.
- Experience delivering event‑driven applications using services such as Temporal, Kafka, Redis, or similar.
- A history of building and deploying containers for microservices, cloud, and on‑prem deployments, and their associated CI/CD pipelines.
- Prior experience with large‑scale full‑stack development.
Salary range: $168,000 – $270,250 USD (Level 4) or $200,000 – $322,000 USD (Level 5). You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until January 13, 2026.
This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.