Product Manager, AI Platform Kernels and Communication Libraries
Listed on 2026-01-14
-
Software Development
AI Engineer, Software Engineer
Product Manager, AI Platform Kernels and Communication Libraries
NVIDIA AI Software Platforms team seeks a technical product manager to accelerate next-generation inference deployments through innovative libraries, communication runtimes, and kernel optimization frameworks. This role bridges low‑level GPU programming with ecosystem‑wide developer enablement for products including CUTLASS, cuDNN, NCCL, NVSHMEM, and open‑source contributions to Triton/Flash Infer.
As NVIDIA Product Managers, our goal is to enable developers to be successful on the NVIDIA Platform and push the boundaries of what is possible with AI deployments. For inference, we are the champions inside NVIDIA for AI developers looking to accelerate their deployments on GPUs. We work directly with developers inside and outside the company to identify improvements, create roadmaps, and stay alert on the inference landscape.
We also work with NVIDIA leaders to define clear product strategy and with marketing to build go‑to‑market plans.
- Architect developer‑focused products that simplify high‑performance inference and training deployment across diverse GPU architectures.
- Define the multi‑year strategy for kernel and communication libraries by analyzing performance bottlenecks in emerging AI workloads.
- Collaborate with CUDA kernel engineers to design intuitive, high‑level abstractions for memory and distributed execution.
- Partner with open‑source communities like Triton and Flash Infer to shape and drive ecosystem‑wide roadmaps.
- 7+ years of technical PM experience shipping developer products for GPU acceleration, with expertise in HPC optimization stacks.
- Expert‑level understanding of CUDA execution models and multi‑GPU protocols, with a proven track record to translate hardware capabilities into software roadmaps.
- BS or MS or equivalent experience in Computer Engineering or demonstrated expertise in parallel computing architectures.
- Strong technical interpersonal skills with experience communicating complex optimizations to developers and researchers.
- PhD or equivalent experience in Computer Engineering or a related technical field.
- Contributed to performance‑critical open‑source projects like Triton, Flash Attention, or TVM with measurable adoption impact.
- Crafted Git Hub‑first developer tools with >1k stars or similar community engagement metrics.
- Published research on GPU kernel optimization, collective communication algorithms, or ML model serving architectures.
- Experience building cost‑per‑inference models incorporating hardware utilization, energy efficiency, and cluster scaling factors.
Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. The base salary range is $,750 for Level 4, and $,750 for Level 5.
You will also be eligible for equity and benefits.
Applications for this job will be accepted at least until January 13, 2026.
NVIDIA is committed to fostering a diverse work environment and is proud to be an equal‑opportunity employer. We do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).