×
Register Here to Apply for Jobs or Post Jobs. X

Model Accuracy Development and Test Engineer; Datacentre AI Engineering KSA

Job in Riyadh, Riyadh Region, Saudi Arabia
Listing for: Qualcomm Technologies, Inc
Full Time position
Listed on 2025-12-04
Job specializations:
  • IT/Tech
    Data Scientist, Machine Learning/ ML Engineer
Salary/Wage Range or Industry Benchmark: 150000 - 200000 SAR Yearly SAR 150000.00 200000.00 YEAR
Job Description & How to Apply Below
Position: Model Accuracy Development and Test Engineer (Data centre AI Engineering KSA

Company: Qualcomm Middle East Information Technology Company LLC Job Area: Engineering Group, Engineering Group >
Software Engineering

About Us

Qualcomm is enabling a world where everyone and everything can be intelligently connected. You interact with products and technologies made possible by Qualcomm every day, including 5G‑enabled smartphones that double as pro‑level cameras and gaming devices, smarter vehicles and cities, and the technology behind the smart, connected factories that manufactured your latest purchase. Qualcomm 5G and AI innovations are the power behind the connected intelligent edge.

You’ll find our technologies behind and inside the innovations that deliver significant value across multiple industries and to billions of people every day.

About the Role

We are seeking an Inference Accuracy engineer to design, develop, and validate model accuracy of deep learning models deployed  role focuses on deep accuracy analysis, debugging, accuracy evaluation, and recovery during inference on large data‑centre hardware platforms. You will have strong problem‑solving ability, excellent Python programming skills, and hands‑on expertise with inference pipelines.

Key Responsibilities
  • Define and implement accuracy KPIs across precision modes
  • Develop scalable Python‑based accuracy evaluation tools and automated pipelines.
  • Implement accuracy‑preserving optimizations for inference frameworks (Tensor

    RT, ONNX Runtime, AI Template, Triton).
  • Build and maintain automated pipelines for accuracy evaluation across multiple frameworks (ONNX, Tensor Flow, PyTorch).
  • Develop reusable plugins for pre‑processing, post‑processing, and metric evaluation.
  • Execute comprehensive accuracy tests for large‑scale models (LLMs, vision, diffusion).
  • Validate accuracy under various quantization and precision settings (FP32, FP16, INT8).
  • Perform accuracy analysis with deep understanding of model architecture, including layers, attention mechanisms, and parameter configurations.
  • Identify architecture‑driven accuracy degradation trends and propose optimization strategies.
  • Identify issues related to pre‑processing drift, tokenization mismatches, operator fallback, and quantization effects.
  • Analyse accuracy differences across hardware targets, firmware versions, and runtime backends.
  • Perform slice‑based accuracy analysis (batch size, concurrency, sequence length, domain shifts).
  • Design and run experiments to recover accuracy, including fine‑tuning, calibration, and hyperparameter adjustments.
  • Debug accuracy failures by tracing root causes across data pre‑processing, model layers, quantization steps, and deployment pipelines.
  • Compare results across different hardware/software stacks and generate actionable insights.
  • Document workflows, maintain dashboards, and publish accuracy results for stakeholders.
Required Skills & Experience
  • Strong background in AI/ML model evaluation and accuracy metrics.
  • Solid understanding of model architectures (transformers, CNNs, RNNs, MoE) and their impact on accuracy.
  • Experience with large language models (LLMs) and generative AI accuracy validation.
  • Expertise with inference runtimes (Tensor

    RT, ONNX Runtime, Triton).
  • Understanding of quantization (INT8/FP8/INT4), calibration, QAT, and accuracy trade-offs.
  • Experience with model graph conversion (PyTorch → ONNX → backend engines).
  • Hands‑on experience with accuracy pipeline development and automation frameworks. Understanding of video generation model accuracy and multi‑modal evaluation benchmarking.
  • Proficiency in Python and familiarity with ML toolkits (ONNX Runtime, Tensor Flow, PyTorch).
  • Expertise in accuracy analysis, including statistical methods and visualization tools.
  • Ability to design experiments for accuracy recovery and debug accuracy failures effectively.
  • Knowledge of quantization techniques and mixed‑precision workflows.
  • Experience with data‑centre accelerators (NVIDIA A100/H100/B200, AI100 Ultra, Gaudi, TPU).
  • Knowledge of LLM accuracy evaluation tools (lm‑eval, HELM, synthetic benchmarks) is an advantage.
  • Strong problem‑solving and analytical skills with the ability to isolate complex accuracy issues.
  • Familiarity with distributed deployment systems (Kubernetes,…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary