
Senior Principal Performance Engineering

Job in Austin, Travis County, Texas, 78716, USA
Listing for: Cerebras
Full Time position
Listed on 2026-03-01
Job specializations:
  • IT/Tech
    AI Engineer, Systems Engineer
  • Engineering
    AI Engineer, Systems Engineer
Salary/Wage Range or Industry Benchmark: 125000 - 150000 USD Yearly

Graphcore is a globally recognised leader in Artificial Intelligence computing systems. The company designs advanced semiconductors and data centre hardware that provide the specialised processing power needed to drive AI innovation, while delivering the efficiency required to support its broader adoption.

As part of the SoftBank Group, Graphcore is a member of an elite family of companies responsible for some of the world’s most transformative technologies. We are opening a new AI Engineering Campus in Austin, which will play a central role in Graphcore's work building the future of AI computing.

Job Overview:

As a Performance Engineer, you will lead benchmarking, performance analysis, and system optimization across AI and HPC workloads on Arm-based architectures. You will collaborate with hardware architects, software developers, and customer engineering teams to enhance system efficiency and scalability, ensuring Arm technology delivers industry-leading datacenter solutions.

Responsibilities:

  • Design, implement, and analyze performance experiments for AI training, inference, and HPC applications across distributed clusters.
  • Develop tools and workflows to monitor, measure, and validate system and workload scalability.
  • Partner with system architects and software teams to identify bottlenecks and propose optimizations across the hardware/software stack.
  • Lead performance bring-up and validation of new hardware platforms, interconnects, and accelerators.
  • Collaborate with customers and Tier-1 partners to provide guidance on performance tuning and cluster-level deployment strategies.
  • Drive innovation in performance methodology, including predictive modeling, profiling frameworks, and benchmark development.
  • Present findings to engineering leadership, customers, and partners to influence architectural and design decisions.
Required Skills and Experience:

  • Demonstrated ability in HPC and AI performance engineering, with proven hands-on expertise in distributed systems.
  • Solid understanding of CPU/GPU/accelerator performance analysis, workload profiling, and scalability optimization.
  • Proven experience with ARM64, x86, and GPU architectures in large-scale datacenter environments.
  • Proficiency in performance tools such as VTune, Nsight, Rocprof, PyTorch Profiler, MPI/OpenMP profilers, Cray/Allinea tools.
  • Strong programming skills in Python, C/C++, Fortran, CUDA, and parallel frameworks (MPI, OpenMP, SYCL).
  • Experience with large AI frameworks (PyTorch, TensorRT, Megatron-LM, vLLM, SGLang, TorchTitan).
  • Familiarity with distributed training at scale (multi-node, multi-GPU clusters).
  • Excellent communication skills and experience working with cross-functional engineering teams.
“Nice To Have” Skills and Experience:
  • Experience with datacenter-scale benchmarking and system acceptance testing.
  • Knowledge of interconnect fabrics (InfiniBand, Slingshot, Omni-Path, RoCE, EFA) and distributed storage systems (Lustre, GPFS, Weka).
  • Hands-on background with cloud HPC/AI deployments (AWS, Azure, GCP).
  • Familiarity with containerization and orchestration (Docker, Kubernetes, SLURM, PBS).
  • Background in exascale or pre-exascale performance co-design projects.
  • Strong publication record in HPC/AI performance analysis.
  • Experience leading small teams or cross-company performance projects.
In Return:
  • Be part of a groundbreaking team influencing the next generation of data center systems.
  • Collaborate with premier engineers and vendors to develop industry-leading AI hardware.
  • Drive innovation in performance methodology with global impact.
  • Access professional growth through sophisticated project involvement and multidisciplinary teamwork.
  • Join a company committed to diversity and inclusion, where your work matters and drives global progress.
Accommodations at Arm

At Arm, we want to build extraordinary teams. If you need an adjustment or an accommodation during the recruitment process, please email  To note, by sending us the requested information, you consent to its use by Arm to arrange for appropriate accommodations. All accommodation or adjustment requests will be treated with confidentiality, and information concerning these requests will only be disclosed as…

Position Requirements
10+ years of work experience