More jobs:
Reliability Engineer
Job in
Menomonee Falls, Waukesha County, Wisconsin, 53051, USA
Listed on 2026-03-08
Listing for:
Kohl's
Full Time
position Listed on 2026-03-08
Job specializations:
-
IT/Tech
Systems Engineer, Cloud Computing
Job Description & How to Apply Below
About The Role
As Reliability Engineer, you will ensure the resilience and availability of Kohl’s systems and applications and collaborate closely with development teams to review designs, conduct risk assessments, and implement robust monitoring and failover mechanisms.
What You’ll Do- Drive incident response efforts, perform root cause analysis and implement preventative measures to enhance system reliability
- Establish consistent practices that elevate Kohl’s operational excellence through automation and process improvements
- Follow software lifecycle and drive reliability, observability and efficiency across product teams within an assigned domain
- Identify repeated toil and find opportunities for automation and risk reduction
- On‑call on a rotation to respond to production incidents and conduct blameless retros and root‑cause analyses (RCAs) to drive a culture of continuous improvements
- Proactively identify failures before they cause outages using chaos engineering techniques such as edge cases, failure modes and design review
- Advise on capacity planning and provide continuous assessments on systems behavior and consumption
- Work with product managers to identify and prioritize work for reliability best practices (i.e., leveraging SLIs/SLOs/Error Budgets)
- Additional tasks may be assigned
- Bachelor's Degree or equivalent in MIS, Computer Science or related field
- 2+ years of experience in software development
- Strong programming skills in one or more languages (Java, Python, Go or Node.js)
- Working knowledge of systems architecture, operating system internals and network fundamentals
- Experience working with one cloud platform (e.g., GCP, AWS, or Azure)
- Experience with monitoring techniques and tools (e.g., Cloud Watch, Grafana, Prometheus, Open Telemetry, Tracing)
- Working knowledge around containerization and container orchestration (e.g., Docker, Kubernetes, Rancher)
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×