Principal Engineer, Storage AI
Listed on 2026-02-28
-
IT/Tech
AI Engineer, Data Engineer, Machine Learning/ ML Engineer, Systems Engineer
Apply
In accordance with Washington state law, we are highlighting our comprehensive benefits package, which is available to all eligible US based employees.
- Health, dental, vision, life, disability insurance
- Retirement Benefits: 401(k) with company match
- Paid Time Off: 20 days of vacation per year, accruing at a rate of 6.15 hours per pay period for the first five years of employment
- Sick Time: 40 hours/year (statutory, where applicable); 5 days/event (discretionary)
- Maternity Leave (Short-Term Disability + Baby Bonding): 28-30 weeks
- Baby Bonding Leave: 18 weeks
- Holidays: 13 paid days per year
Note:
By applying to this position you will have an opportunity to share your preferred working location from the following:
Sunnyvale, CA, USA;
Kirkland, WA, USA;
New York, NY, USA;
Seattle, WA, USA
.
- Bachelor's degree in Computer Science, Engineering, or a related technical field, or equivalent practical experience.
- 15 years of experience in software engineering, with 8 years in a technical engineering role.
- Experience building and scaling distributed systems and platforms (e.g., Kubernetes, microservices, Kafka/Rabbit
MQ, Redis/Cassandra, load balancing, sharding, CAP Theorem, service mesh). - Experience in machine learning concepts, AI architectures, and related technologies (e.g., transformer architectures, LLM fine‑tuning (LoRA, QLoA), Retrieval‑Augmented Generation, latency optimization, model quantization, hyperparameter tuning, PyTorch/Tensor Flow).
- Master's degree or PhD in Computer Science, Artificial Intelligence, or a related field.
- Experience with model training, inference, AI infrastructure, Agents and unstructured data platforms.
- Understanding of industry trends and competitive landscapes in AI and machine learning.
- Ability to drive innovation and foster a culture of technical excellence.
- Excellent communication and presentation skills, with the ability to articulate complex technical concepts to unique audiences.
Generative AI is revolutionizing all aspects of technology. Storage is a foundational technology needed for building the next generation models and using them in applications. Storage also represents a huge opportunity to deliver value to enterprises using the rich unstructured data sets available. As Director, you will drive and develop the next generation Storage infrastructure to act as the foundation of training, inference, and Reinforcement Learning (RL) platforms as they evolve.
You will also drive the capabilities in the platform that allow Agents for our customers to maximize the value of the unstructured data that they have stored. You'll collaborate with an excellent team of engineers across Google, driving full stack innovation from data center, hardware, and software, and ensure Google and Google Cloud provide the best AI products in the industry.
Google Cloud accelerates every organization’s ability to digitally transform its business and industry. We deliver enterprise‑grade solutions that leverage Google’s cutting‑edge technology, and tools that help developers build more sustainably. Customers in more than 200 countries and territories turn to Google Cloud as their trusted partner to enable growth and solve their most critical business problems.
The US base salary range for this full‑time position is $294,000-$414,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job‑related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.
Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about benefits at Google.
Responsibilities- Identify key opportunities and challenges in the model training, inference, RL and Agentic platforms where Storage can accelerate the development and performance of production systems. Build a strategy to stay ahead of the needs of the next generation systems.
- Drive the…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).