×
Register Here to Apply for Jobs or Post Jobs. X

SRE and DevOps Engineer

Job in Santa Clara, Santa Clara County, California, 95053, USA
Listing for: Sustainable Talent
Full Time position
Listed on 2025-12-01
Job specializations:
  • IT/Tech
    Cloud Computing, Systems Engineer, Systems Administrator, SRE/Site Reliability
Job Description & How to Apply Below

Overview

Sustainable Talent is partnering with Nvidia, a global leader who has been transforming computer graphics, PC gaming, and accelerated computing for over 25 years. We are looking for a SRE & Dev Ops Engineer to support NVIDIA's Infrastructure, Planning and Processes organization. This is a W-2 full-time contract based in Santa Clara, CA, onsite. We offer competitive pay based on factors like experience, education, location, etc.,

and provide full benefits, PTO, and a strong company culture.

What you’ll be doing
  • Working on systems deployed in NVIDIA's internal infrastructure products and ensuring they are available and reliable for our end users.
  • Monitor system performance and troubleshoot issues related to NVIDIA hardware and software stack.
  • Provide high quality user support.
  • Monitor KPIs and ensure that the team’s SLAs are met.
  • Manage and maintain production Kubernetes clusters and Jenkins pipelines.
  • Drive automation of monitoring to gain more insight into applications and system health.
What we need to see
  • Experience maintaining cloud and on-prem infrastructure and highly-available production environments.
  • Expert level proficiency in CI/CD systems such as ArgoCD, Jenkins, Git Lab CI, Git Hub Actions, etc.
  • Background in databases like SQL (MySQL) and time series DBs like Prometheus.
  • Experience with data analytics/visualization tools (ELK, Grafana, Splunk) and alerting tools (Zabbix, Alert manager, Pager Duty).
  • Proficiency with Ansible, Kubernetes, Containers & Virtualization platforms.
  • 5+ years of proven experience and a Bachelor's degree in Computer Science, Information Technology, or a related field, or equivalent experience.
Ways to stand out from the crowd
  • Previous experience with SRE teams managing on-prem infrastructure.
  • Experience managing NVIDIA hardware like GPUs and Tegra devices.
  • Thrives in a multi-tasking environment with evolving priorities.
  • Prior experience with a large-scale operations team.

Sustainable Talent is a M/F+, disabled, and veteran equal employment opportunity and affirmative action employer.

#J-18808-Ljbffr
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary