Senior Site Reliability Engineer
Job in
Coventry, West Midlands, CV1, England, UK
Listed on 2025-10-17
Listing for:
Moneycorp
Full Time
position Listed on 2025-10-17
Job specializations:
-
IT/Tech
Systems Engineer, Cloud Computing, SRE/Site Reliability, IT Support
Job Description & How to Apply Below
Senior Site Reliability Engineer – Moneycorp
Moneycorp is a global payments ecosystem transforming from a consumer‑focused foreign exchange provider to a cloud‑native payment infrastructure. This role plays a key part in shaping the future of our payments and FX platforms.
Key Responsibilities- Define and maintain SLOs/SLIs and error budgets for critical services.
- Build and improve observability pipelines (metrics, logs, traces) and maintain dashboards for golden signals.
- Develop incident runbooks, lead post‑incident reviews, and drive root‑cause analysis.
- Implement anomaly detection, predictive monitoring, and forecast capacity for cloud workloads.
- Automate backup, restore, failover processes and validate RTO/RPO through regular DR testing.
- Design and run chaos engineering experiments and enhance self‑healing automation.
- Lead SEV‑1/SEV‑2 incidents, authorize critical decisions, and eliminate toil through automation.
- Map dependencies for key business services, conduct scenario‑based resilience testing, and produce compliance evidence.
- Identify and refactor platform reliability issues, engineer modern replacements, and lead migrations with measurable outcomes.
- 7+ years in SRE, platform, or systems roles with production ownership of high‑availability, low‑latency platforms.
- Deep experience with Azure services (IaaS, AKS, VNets, App Gateway, SQL, Service Bus, Event Hubs, Kafka, Key Vault) and IaC with Terraform.
- Strong background in security‑by‑design, Zero Trust principles, and regulatory compliance.
- Experience with Azure Dev Ops or Git Hub Actions for CI/CD pipelines.
- Hands‑on knowledge of Prometheus, Grafana, Open Telemetry, and alerting policies.
- Experience with Fin Ops practices, cost optimization, and cloud commercials.
- Led SEV‑1/SEV‑2 incident management and post‑mortem delivery.
- Designed and validated disaster recovery, chaos engineering, and automated resilience testing.
- Proficient in Windows Server (2019/2022/2025) and Linux (RHEL/Ubuntu) on Azure IaaS.
- Familiarity with payments orchestration, FX workflows, and platform refactoring for scale and resilience.
- Understanding of UK regulatory expectations (FCA/PRA) for operational resilience and scenario testing.
- Experience with Temenos or similar core banking platforms.
- Bachelor’s degree in Computer Science, Engineering, or a related technical discipline, or equivalent hands‑on experience.
- Optional certifications:
Microsoft Azure AZ‑104, AZ‑400, AZ‑700;
Kubernetes CKA/CKAD;
Hashi Corp Terraform Associate.
If the role sounds like you, we invite you to upload a copy of your CV by clicking on the Apply Now button.
Equal OpportunityWe're committed to creating a workplace where every individual feels valued, respected, and included. As an Equal Opportunity Employer, we actively cultivate an inclusive culture where diversity thrives and empower our colleagues to drive meaningful change through initiatives like our DE&I focus groups and value champion network.
#J-18808-LjbffrPosition Requirements
10+ Years
work experience
Note that applications are not being accepted from your jurisdiction for this job currently via this jobsite. Candidate preferences are the decision of the Employer or Recruiting Agent, and are controlled by them alone.
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
To Search, View & Apply for jobs on this site that accept applications from your location or country, tap here to make a Search:
Search for further Jobs Here:
×