
Senior LLM Inference Architect; Distributed GPUs

Job in Santa Clara, Santa Clara County, California, 95053, USA
Listing for: Advanced Micro Devices
Full Time position
Listed on 2026-01-18
Job specializations:
  • Software Development
    AI Engineer, Machine Learning/ML Engineer
Job Description & How to Apply Below
Position: Senior LLM Inference Architect (Distributed GPUs)
A leading technology company in California seeks a senior ML engineer to optimize LLM inference runtimes for AMD GPUs. You will collaborate across teams to drive performance and scalability, working with frameworks like vLLM and SGLang. The role requires expertise in distributed inference and GPU architecture. Ideal candidates will have a Master's or PhD in a relevant field and experience with machine learning frameworks.

Competitive benefits include career advancement opportunities and an inclusive workplace.
Position Requirements
10+ years of work experience