More jobs:
Senior Devops/SRE Engineer
Job in
California, Moniteau County, Missouri, 65018, USA
Listed on 2026-01-15
Listing for:
N-iX
Full Time
position Listed on 2026-01-15
Job specializations:
-
IT/Tech
Systems Engineer, Cloud Computing
Job Description & How to Apply Below
Location: California
Site Reliability Engine+ Java Engineer (#4485)
India
Work type:
Office/Remote
Technical Level:
Senior
Job Category:
Software Development
Project:
We’re hiring experienced Site Reliability Engineer+ Java Engineer hybrids. You’ll be a key player in ensuring the health, reliability, and agility of over 200 mission‑critical microservices—while shaping the future of AI‑driven commerce.
Our California‑based customer is an American global e‑commerce leader, one of the most popular and successful websites on the Internet. It provides platform services by connecting millions of sellers and buyers in more than 190 markets around the world.
Responsibilities- Monitor, analyze, and enhance the health of 200+ distributed microservices.
- Own incident response and drive operational excellence as a member of our 24/7 SRE on‑call rotation, ensuring uptime and meeting strict SLAs.
- Deliver key Dev Ops outcomes—CVEs, SWUs, software upgrades, automated failover, resilience engineering, robust security design, and infrastructure improvements.
- Collaborate cross‑functionally to design, implement, and maintain monitoring, alerting, and automation frameworks.
- Build standardized tooling and practices supporting rapid recovery, continuous improvement, and compliance.
- Develop in Java for backend, including debugging, optimizing, and maintaining high‑availability server‑side applications gemstones and distributed systems.
- 5 years hands‑on expertise with cloud infrastructure (AWS, GCP or Azure), containers and orchestration, CI/CD, monitoring stacks, automation
- 3‑4 + years of extensive Java development experience
- Strong experience in Site Reliability Engineering, Dev Ops, or Production Operations—preferably supporting large‑scale systems
- Solid understanding of incident management, reliability engineering, and microservice architectures
- Background in security best practices, system resilience, and disaster recovery is a plus
- Wgability to participate in a rotating 24/7 on call schedule
- Excellent problem‑solving, communication, and teamwork skills
- Upper‑Intermediate/Advanced English level (there will be a lot of communication with the client)
- Flexible working format - remote, office‑based or flexible
- A competitive salary and good compensation package
- Professional development tools (mentorship program, tech talks and trainings, centers of excellence, and more)
- Active tech communities with regular knowledge sharing
Position Requirements
10+ Years
work experience
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×