
Solutions Architect, Generative AI

Job in Santa Clara, Santa Clara County, California, 95053, USA
Listing for: NVIDIA
Full Time position
Listed on 2026-01-24
Job specializations:
  • IT/Tech
    AI Engineer, Machine Learning/ ML Engineer, Data Scientist
Salary/Wage Range or Industry Benchmark: 152,000 - 218,500 USD Yearly
Job Description & How to Apply Below

Overview

NVIDIA is seeking an outstanding AI Engineer or Solutions Architect to join our growing team focused on ecosystem partner enablement for Generative AI. You will lead by example as a strategic technical expert and hands-on developer, building proof-of-concept solutions and reference architectures for innovative AI agents to demonstrate NVIDIA's full-stack accelerated Generative AI platforms. You will provide partners with technical blueprints and expert guidance to architect and deploy transformative applications using the full NVIDIA AI stack, from GPU systems and CUDA to NeMo and Nemotron.

The Generative AI Partners Enablement Solutions Architect team leverages advanced technologies to address and expedite the deployment of solutions for customers’ real-world challenges. As a member of the NPN Generative AI Solution Architecture team, you will work in a diverse, supportive environment where everyone is encouraged to do their life’s work. Join the team to make a lasting impact on the world by applying accelerated computing AI and solving category-defining systems and production-grade AI solutions at scale.

What You Will Be Doing
  • Build end-to-end agentic AI applications that solve real-world enterprise problems across various industries.
  • Serve as the primary technical domain expert for pre- and post-sale for partners, embedding with them to design and deploy Generative AI solutions. Maintain relationships with leadership and technical teams to drive adoption and utilization of NVIDIA GenAI platforms.
  • Accelerate partner/customer time to value by providing repeatable reference architecture guidance, building hands-on prototypes, and advising on scalable methodologies for production deployments.
  • Establish the scope, success metrics, and evaluation criteria for partner-led customer projects, aligned to standardized and reproducible GPU-accelerated workflows.
  • Enable strategic partners to build their own Professional Services, platforms, and products by integrating and accelerating NVIDIA technologies for high-impact customer workloads; proactively identify opportunities to drive deeper adoption and utilization of NVIDIA Generative AI products.
  • Codify knowledge and operationalize technical success practices to help partners scale impact across industries and workloads.
What We Need To See
  • MS or PhD in Computer Science/Engineering, Machine Learning, Data Science, Electrical Engineering or a closely related field (or equivalent experience).
  • 5+ years of meaningful work experience deploying AI models at scale as a Software Engineer or Deep Learning engineer.
  • Consistent track record of building enterprise-grade agentic AI systems using open-source models with a solid foundation in deep learning, emphasizing LLM and VLM.
  • Hands-on experience with LLM and agentic frameworks (NeMo Agent Toolkit, LangChain, Semantic Kernel, CrewAI, AutoGen) and evaluation/observability platforms. Comfortable building prototypes or proofs of concept.
  • Strong coding skills and proficiency in Python, C++, and deep learning frameworks (PyTorch, TensorFlow).
  • Excellent communication and presentation skills to effectively collaborate with internal executives, partners, and customers.
Ways To Stand Out From The Crowd
  • Demonstrate expertise in building applications and systems using NeMo Framework, Nemotron, Dynamo, TensorRT-LLM, NIMs, AI Blueprints, and actively contribute to the open-source community.
  • Take end-to-end ownership of projects and proactively acquire new skills to drive success.
  • Excel in fast-paced environments, managing multiple work streams and prioritizing for maximum customer impact.
  • Understanding of different advanced agent architectures and emerging communication protocols (MCP, OpenAI Agentic SDK, or Google A2A).
  • Familiarity with NVIDIA GPUs and system software stacks (e.g., NCCL, CUDA) and HPC technologies such as InfiniBand, MPI, NVLink.

Salary: base salary will be determined based on location, experience, and pay of employees in similar positions. The base salary range is 152,000 USD - 218,500 USD. Equity and benefits are also included.

Applications for this job will be accepted at least until January 13, 2026. This posting is for an existing vacancy. NVIDIA uses AI tools in its recruiting processes. NVIDIA is committed to fostering a diverse work environment and is proud to be an equal opportunity employer. We value diversity and do not discriminate on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

JR2007265
