×
Register Here to Apply for Jobs or Post Jobs. X

Principal GPU​/NPU AI System Architect

Job in Austin, Travis County, Texas, 78716, USA
Listing for: Advanced Micro Devices
Full Time position
Listed on 2026-02-28
Job specializations:
  • IT/Tech
    AI Engineer, Systems Engineer, Robotics
Salary/Wage Range or Industry Benchmark: 150000 - 200000 USD Yearly USD 150000.00 200000.00 YEAR
Job Description & How to Apply Below

WHAT YOU DO AT AMD CHANGES EVERYTHING

At AMD, our mission is to build great products that accelerate next‑generation computing experiences—from AI and data centers, to PCs, gaming and embedded systems. Grounded in a culture of innovation and collaboration, we believe real progress comes from bold ideas, human ingenuity and a shared passion to create something extraordinary. When you join AMD, you’ll discover the real differentiator is our culture.

We push the limits of innovation to solve the world’s most important challenges—striving for execution excellence, while being direct, humble, collaborative, and inclusive of diverse perspectives. Join us as we shape the future of AI and beyond.

Together, we advance your career.

THE ROLE

The AI Architect will define and drive end‑to‑end AI system architecture for embedded and edge platforms, with deep expertise in GPU/NPU micro‑architecture, AI software stacks, and model behavior. This role bridges silicon capabilities, system software, and AI models, enabling performant, power‑efficient, and safe AI deployments across robotics, automotive, and industrial markets. The architect will own technical solutioning from model selection through deployment, working closely with silicon, compiler, software, and product teams, and will represent the AI architecture vision with customers and partners.

Location:

Austin or San Jose

THE PERSON

We are seeking a senior AI systems architect with deep expertise across GPU/NPU architecture, AI software stacks, and model behavior. This individual operates at the intersection of silicon, system software, and applied AI — translating real‑world robotics, automotive, and industrial workloads into scalable, production‑ready AI platform architectures.

The ideal candidate combines hardware‑aware AI model understanding with embedded deployment experience, and can drive full‑stack architectural trade‑offs across performance, power, memory, safety, and lifecycle constraints. They are technically hands‑on when needed, yet comfortable influencing silicon roadmaps, guiding cross‑functional teams, and representing architectural strategy with customers and ecosystem partners.

This is a high‑impact technical leadership role requiring strong architectural judgment, cross‑functional influence without direct authority, and the ability to bridge research, productization, and long‑term platform evolution.

KEY RESPONSIBILITIES GPU / NPU Architecture & HW–SW Co‑Design
  • Develop deep architectural understanding of GPU, NPU, and heterogeneous SoC designs, including memory hierarchies, interconnects, scheduling, and power/performance trade‑offs.
  • Guide HW–SW co‑optimization strategies for AI workloads across vision, perception, planning, and control.
  • Influence silicon and platform roadmaps using model‑driven architectural insights from robotics, automotive, and industrial workloads.
  • Collaborate across silicon, system engineering, software, thermal/mechanical, security, and product teams.
  • Technically lead internal AI engineers and work closely with partners, ISVs, and customers.
  • Act as a technical authority and mentor, influencing architecture decisions without direct reporting authority.
  • Architect AI solutions with strong understanding of model internals (CNNs, Transformers, multi‑modal models, sensor fusion, perception stacks).
  • Evaluate and map model characteristics (latency, memory bandwidth, precision, sparsity) onto GPU/NPU execution.
  • Drive model optimization strategies (quantization, pruning, distillation, compilation flows) aligned with embedded constraints.
Model‑Aware AI System Architecture
  • Software Stack & Deployment Solutioning
    • Define and optimize AI software stacks spanning frameworks (PyTorch, ONNX, Tensor

      RT‑like runtimes), compilers, graph optimizers, and runtime schedulers, drivers, firmware, and OS integration.
  • Lead solutioning for edge and embedded deployment, including OTA updates, lifecycle management, and production‑grade robustness.
  • Ensure scalability from prototype → production → long‑term maintenance.
Domain‑Focused Architecture Leadership
  • Robotics: perception, localization, SLAM, manipulation, real‑time decision pipelines.
  • Automotive: ADAS,…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary