SRE and DevOps Engineer
Job in
Santa Clara, Santa Clara County, California, 95053, USA
Listed on 2025-12-01
Listing for:
Sustainable Talent
Full Time
position Listed on 2025-12-01
Job specializations:
-
IT/Tech
Cloud Computing, Systems Engineer, Systems Administrator, SRE/Site Reliability
Job Description & How to Apply Below
Overview
Sustainable Talent is partnering with Nvidia, a global leader who has been transforming computer graphics, PC gaming, and accelerated computing for over 25 years. We are looking for a SRE & Dev Ops Engineer to support NVIDIA's Infrastructure, Planning and Processes organization. This is a W-2 full-time contract based in Santa Clara, CA, onsite. We offer competitive pay based on factors like experience, education, location, etc.,
and provide full benefits, PTO, and a strong company culture.
- Working on systems deployed in NVIDIA's internal infrastructure products and ensuring they are available and reliable for our end users.
- Monitor system performance and troubleshoot issues related to NVIDIA hardware and software stack.
- Provide high quality user support.
- Monitor KPIs and ensure that the team’s SLAs are met.
- Manage and maintain production Kubernetes clusters and Jenkins pipelines.
- Drive automation of monitoring to gain more insight into applications and system health.
- Experience maintaining cloud and on-prem infrastructure and highly-available production environments.
- Expert level proficiency in CI/CD systems such as ArgoCD, Jenkins, Git Lab CI, Git Hub Actions, etc.
- Background in databases like SQL (MySQL) and time series DBs like Prometheus.
- Experience with data analytics/visualization tools (ELK, Grafana, Splunk) and alerting tools (Zabbix, Alert manager, Pager Duty).
- Proficiency with Ansible, Kubernetes, Containers & Virtualization platforms.
- 5+ years of proven experience and a Bachelor's degree in Computer Science, Information Technology, or a related field, or equivalent experience.
- Previous experience with SRE teams managing on-prem infrastructure.
- Experience managing NVIDIA hardware like GPUs and Tegra devices.
- Thrives in a multi-tasking environment with evolving priorities.
- Prior experience with a large-scale operations team.
Sustainable Talent is a M/F+, disabled, and veteran equal employment opportunity and affirmative action employer.
#J-18808-LjbffrTo View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×