Site Reliability Engineer CIAM
Listed on 2026-01-11
-
IT/Tech
Systems Engineer, Cloud Computing
Join us as a Site Reliability Engineer for CIAM at Barclays, where you will bring to life a new digital platform capability, transforming and modernizing our digital estate to build a market-leading digital offering with customer experience at its heart. This is an exciting and key role, partnering with business-aligned engineering and product teams to ensure a collaborative team culture is at the heart of what we do.
To be successful as a Site Reliability Engineer for CIAM, you should have:
- Experience in designing, implementing, deploying, and running highly available, fault‑tolerant, auto‑scaling, and auto‑healing systems
- Experience with AWS (essential);
Azure and GCP are a plus. Familiarity with Kubernetes (ECS is essential; Fargate and GCE are a plus) and serverless architectures - Experience in running disaster recovery and zero‑downtime solutions, and in designing and implementing continuous delivery across large-scale, distributed, cloud-based microservices and API service solutions with 99.9%+ uptime
- Exposure to coding in Python, Bash, and JSON/YAML (Configuration as Code)
- The ability to drive reliability best practices across engineering teams, embed SRE principles into the Dev Sec Ops lifecycle, and partner with engineering, security, and product teams to balance reliability and feature velocity
Some other highly desirable skills include:
- Experience in configuration, deployment, and operation of Forge Rock COTS‑based IAM solutions (Ping Gateway, PingAM, PingIDM, PingDS) with embedded security gates
- Experience with HTTP header signing, access token and data‑at‑rest encryption, PKI‑based self‑sovereign identity, or open‑source equivalents
- Cloud Infrastructure Management
You may be assessed on the key critical skills relevant for success in this role, such as risk and controls, change and transformation, business acumen strategic thinking and digital and technology, as well as job‑specific technical skills.
This role is in our Whippany, NJ office.
Minimum Salary: $120,000
Maximum Salary: $175,000
The minimum and maximum salary/rate information above includes only base salary or base hourly rate. It does not include any other type of compensation or benefits that may be available.
Barclays employees are eligible for a suite of competitive and generous employee benefits, including medical, dental and vision coverage, 401(k), life insurance, and other paid leave for qualifying circumstances.
This position is eligible for an incentive award.
Purpose of the roleTo apply software engineering techniques, automation, and best practices in incident response, to ensure the reliability, availability, and scalability of the systems, platforms, and technology through them.
Accountabilities- Availability, performance, and scalability of systems and services through proactive monitoring, maintenance, and capacity planning.
- Resolution, analysis and response to system outages and disruptions, and implement measures to prevent similar incidents from recurring.
- Development of tools and scripts to automate operational processes, reducing manual workload, increasing efficiency, and improving system resilience.
- Monitoring and optimisation of system performance and resource usage, identify and address bottlenecks, and implement best practices for performance tuning.
- Collaboration with development teams to integrate best practices for reliability, scalability, and performance into the software development lifecycle, and work closely with other teams to ensure smooth and efficient operations.
- Stay informed of industry technology trends and innovations, and actively contribute to the organization's technology communities to foster a culture of technical excellence and growth.
- To advise and influence decision making, contribute to policy development and take responsibility for operational effectiveness. Collaborate closely with other functions/ business divisions.
- Lead a team performing complex tasks, using well developed professional knowledge and skills to deliver on work that impacts the whole business function. Set objectives and coach employees in pursuit of those objectives, appraisal of…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).