Senior Software Engineer - Generative AI
Listed on 2026-02-28
-
Software Development
AI Engineer, Machine Learning/ ML Engineer, Software Engineer
Senior Software Engineer - Generative AI
Location:
San Jose
Team:
Technology
Employment Type:
Regular
Job Code: A117674
ResponsibilitiesTeam Intro The Speech team's mission is to empower content interaction and creation using speech & audio related technologies. The team focuses on cutting‑edge R&D in areas like speech & audio, music processing, natural language understanding and multimodal deep learning. The team builds AI training and inference systems based on GPUs and advances the state‑of‑the‑art of AI system technologies to accelerate large audio/music language models.
The team is also responsible for the development of the complete engineering cycle of large models, including data preparing/processing, model training/evaluation/deployment, etc.
The major responsibilities include:
- Responsible for the design and development of the architecture of large‑scale AI systems, solving technical difficulties such as high concurrency, high reliability, and high scalability of the system.
- Covering various sub‑directions of machine learning system, including model training, model inference, data management, and workflow orchestration.
- Working closely with the algorithm teams to optimize the algorithm and system jointly.
Minimum Qualifications:
- Master's or PhD in Computer Science, Artificial Intelligence, Machine Learning, or related fields, with strong expertise in large‑scale ML systems.
- Over 5 years of experience in AI/ML, specializing in innovative model training, inference, and optimization.
- Proven track record in developing AI‑driven features for large‑scale user platforms, with a focus on user engagement and interactivity.
- Proficiency in Python and advanced skills in ML frameworks like Tensor Flow, PyTorch, and CUDA.
- Demonstrated experience with distributed training and inference frameworks such as Deep Speed, Megatron‑LM, VLLM, Tensor
RT‑LLM.
Preferred Qualifications:
- In‑depth knowledge of inference optimization techniques, such as quantization, pruning, network fusion, etc.
- Knowledge of Kubernetes, Docker, and cloud platforms (AWS, GCP, Azure) for scalable, efficient deployment of AI solutions.
- Deep understanding of advanced deep learning architectures (Transformers, reinforcement learning) and their application in real‑world scenarios.
- Exceptional mentoring and team‑building skills, fostering an environment of continuous learning and innovation.
- Excellent problem‑solving capabilities, with a detail‑oriented and user‑focused approach.
- Strong communication and interpersonal skills, capable of engaging effectively with both technical and non‑technical stakeholders.
The base salary range for this position in the selected city is $187,040 - $438,000 annually.
Compensation may vary outside of this range depending on a number of factors, including a candidate’s qualifications, skills, competencies and experience, and location. Base pay is one part of the Total Package that is provided to compensate and recognize employees for their work, and this role may be eligible for additional discretionary bonuses/incentives, and restricted stock units.
Benefits may vary depending on the nature of employment and the country work location. Employees have day one access to medical, dental, and vision insurance, a 401(k) savings plan with company match, paid parental leave, short‑term and long‑term disability coverage, life insurance, wellbeing benefits, among others. Employees also receive 10 paid holidays per year, 10 paid sick days per year and 17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure).
The Company reserves the right to modify or change these benefits programs at any time, with or without notice.
For Los Angeles County (unincorporated) CandidatesQualified applicants with arrest or conviction records will be considered for employment in accordance with all federal, state, and local laws including the Los Angeles County Fair Chance Ordinance for Employers and the California Fair Chance Act. Our company believes that criminal history may have a direct, adverse and negative relationship on the following job duties, potentially resulting in the withdrawal of the…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).