Site Reliability Engineer

Job in Chicago, Cook County, Illinois, 60290, USA
Listing for: Request Technology, LLC
Full Time position
Listed on 2026-01-09
Job specializations:
  • IT/Tech
    Systems Engineer, Cloud Computing, SRE/Site Reliability
Job Description & How to Apply Below

Senior Executive Recruiter at Request Technology

*** Hybrid, 3 days onsite, 2 days remote ***

*** We are unable to sponsor as this is a permanent full-time role ***

A prestigious company is looking for a Site Reliability Engineer. This role is focused on observability, logging, and capacity planning. This engineer will need experience with or exposure to Linux systems, Kubernetes/Docker, Terraform, Jenkins, Ansible, Harness, and Kafka.

Responsibilities:

  • Collaborate with development, operations and infrastructure teams to ensure availability of services, and to work through implementation issues
  • Develop automation for incident response and to prevent problem recurrence
  • Create and enhance runbooks to respond to service outages or degradations
  • Assess the production readiness of services
  • Define and track operational metrics for production performance, reliability, scalability and availability
  • Architect, develop and maintain shared services and tools to improve reliability and reduce toil across the organization

Qualifications:

  • Bachelor’s or Master’s degree in Computer Science, Information Systems or another related field, or equivalent work experience
  • At least 4 years of experience in Site Reliability Engineering / DevOps
  • Experience with maintaining and troubleshooting large-scale distributed systems
  • Experience managing infrastructure in public cloud environments like AWS (preferred), Azure or GCP
  • Experience with AIOps and predictive analysis for anomaly detection and system capacity forecasting, using monitoring and alerting tools like Splunk, AppDynamics, Datadog, Stackdriver, Sysdig, Prometheus or Grafana
  • Programming/scripting experience in languages like Java, Bash, Python or Go
  • Experience with distributed messaging systems like Kafka, RabbitMQ, or ActiveMQ
  • Experience with container orchestration systems like Kubernetes, Mesos, Docker Swarm or Rancher
  • Experience using Continuous Integration and Continuous Delivery (CI/CD) tools like Jenkins, Travis, Harness, AppVeyor, CodeBuild or CodePipeline
  • Familiarity with leveraging large language models (LLMs) to automate and optimize SRE workflows. This may include using AI-powered tools to perform tasks such as writing scripts, summarizing incident reports, or even creating and maintaining AI workloads.
Seniority level
  • Mid-Senior level
Employment type
  • Full-time
Job function
  • Information Technology