Site Reliability Engineer
Listed on 2026-03-01
-
IT/Tech
Systems Engineer, SRE/Site Reliability, Cloud Computing, IT Support
We are seeking a Site Reliability Engineer (SRE) with strong experience in Application Support and Automation to join a high-performing team in Dublin.
You will take ownership of the production environment, ensure application reliability, and drive automation initiatives to improve system stability and reduce operational overhead. This role requires a proactive engineer who thrives in fast-paced, production-critical environments.
About the ClientOur client is a leading digital transformation and technology services organization delivering scalable, high-performance systems across global markets. They specialize in building resilient platforms, modernizing enterprise environments, and driving operational excellence through automation and Dev Ops practices.
Key Responsibilities- Plan, manage, and oversee all aspects of the production environment.
- Define strategies for application performance monitoring and optimization.
- Respond to incidents and implement continuous improvement initiatives to reduce recurrence.
- Design and standardize monitoring and alerting mechanisms.
- Optimize Mean Time to Recovery (MTTR) through structured incident management.
- Support pre-live activities including capacity planning and system design reviews.
- Analyze ITSM activities and provide feedback on operational gaps and resiliency risks.
- Support CI/CD pipelines with validation, operational gating, and Dev Ops best practices.
- Monitor availability, latency, and overall system health.
- Scale systems sustainably through automation and reliability engineering improvements.
- Collaborate with global teams across multiple time zones.
- Participate in rotational on-call support.
- Experience with Splunk and Dynatrace for monitoring and observability.
- Knowledge of AWS API Gateway and Event Gateway
. - Infrastructure monitoring expertise.
- Experience with Chef Habitat, Ansible, and XLR automation tools
. - Familiarity with Nginx (nice to have).
- Strong understanding of CI/CD processes and Dev Ops automation principles.
- Experience supporting live production systems in enterprise environments.
- 4–8 years of experience in SRE, Production Support, Dev Ops, or Systems Engineering roles.
- Strong background in application monitoring and performance optimization.
- Proven experience reducing incidents and improving system reliability.
- Hands-on experience with automation and deployment pipelines.
- Strong troubleshooting skills in complex production environments.
- Ability to collaborate effectively with cross-functional and global teams.
- Must be based in Ireland or hold a valid work permit for Dublin.
I look forward to receiving your application. Once submitted, we will review your profile and assess your fit against other candidates in the process. While we aim for an efficient recruitment process, occasional delays may occur.
About SpertonSperton Norway is part of Sperton Global, an international recruitment and consulting company. We support organizations in securing top talent and help professionals advance their careers by connecting them with the right opportunities.
#J-18808-Ljbffr(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).