×
Register Here to Apply for Jobs or Post Jobs. X

Site Reliability Engineer II

Job in 110006, Delhi, Delhi, India
Listing for: DevRabbit IT Solutions
Full Time position
Listed on 2026-02-27
Job specializations:
  • IT/Tech
    Cloud Computing, Systems Engineer
Job Description & How to Apply Below
Job Title :  Site Reliability Engineer II

Job Description
About your role:
We are seeking a highly skilled Technical Lead for AI Development to drive the architecture, design, and execution of advanced AI systems using LLM frameworks, multi-agent architectures, RAG pipelines, and Model Context Protocol (MCP) integrations. The ideal candidate has strong hands-on experience building production-grade AI features, orchestrating agent ecosystems, evaluating model performance, and iterating through continual refinements.
You will lead a team of engineers, collaborate with product and research teams, and play a key role in shaping our AI strategy and platform capabilities.
We are looking for a Staff Site Reliability Engineer to help us grow our domain expertise and provide support in a new global region to enable 24x7 development velocity as a global company. From AWS cloud provisioning as code to improving the developer experience in your working timezone, to acting as a guide to best practices around building and delivering software globally, we need an SRE with the passion, motivation, and great ideas to make everything better.

What you’ll do
Automate the provisioning of all of Juniper Square’s infrastructure in code. Everything we do is in code!
Partner with our Platform Engineering team on building developer tooling / improving developer experiences via joint initiatives and enhancements.
Partner with our Data Engineering team on improving our data posture and driving operational excellence.
Evolve our deployment pipelines to automate infrastructure deployments with the latest and greatest (and reliable) technologies.
Improve metrics on our main services, and act as a subject matter expert for our global dev teams.
Enable observability, SLO/SLI reporting, and respond to business impacting incidents as it pertains to infrastructure.
Adopt and drive solutions that align with AWS Well Architected frameworks and Juniper Square’s business objectives.
Identify performance bottlenecks and provide recommendations for improvement.
Proactively identify and solve problems that we didn’t even know we had.
Help build, deploy, and scale a load testing environment that is analogous to production.
Enforce security and operational safety controls.
Participate in technical roadmap planning and estimation.
Participate and contribute in production readiness and architecture review board (ARB) meetings and forums.
Train and mentor future engineers in the same region.
Contribute to the architectural improvements to meet future scaling and observability requirements

Qualifications
A profound love for solving hard problems and overcoming challenging obstacles.
Putting your customers first, whether they be internal or external, and making them more productive, happy, and successful.

Experience with AWS. Other public cloud providers are a bonus.

Experience with Postgre

SQL is a must. Additional experience with document databases is a nice-to-have.

Experience with cloud security best practices (CSPM, CDR, CWPP, SIEM, etc) to keep our customers and cloud posture secure.

Experience with containers (builds, registries, vulnerabilities scanning, run-time with docker-compose, run-time with TILT, run-time in schedulers/orchestration systems).
Multi-year hands-on experience and fluency with Kubernetes and helm charts are an absolute skill requirement. We live and breathe the k8s ecosystem.

Experience with a CI/CD pipeline. We use a combination of Github Actions, ArgoCD, Helm and Git Ops in our deployment process, but again, any are fine.
Some sort of infrastructure-as-code system:
Ansible, Terraform, Cloud Formation, CDK, etc.
We use Python and Typescript, so knowledge and exposure with either is a strong plus.
Experience breaking up monolithic architectures into microservices

Experience with service meshes and service discovery solutions.

Experience with an observability solution:
New Relic, Prometheus, Data Dog, etc.

Experience with logging systems:
Cloud Watch, ELK, Splunk, etc.
Bachelor’s degree in Computer Science or similar or equivalent experience

Key Responsibilities:

AI Architecture & Development
Design and implement multi-agent systems, including agent…
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
 
 
 
Search for further Jobs Here:
(Try combinations for better Results! Or enter less keywords for broader Results)
Location
Increase/decrease your Search Radius (miles)

Job Posting Language
Employment Category
Education (minimum level)
Filters
Education Level
Experience Level (years)
Posted in last:
Salary