SRE Manager
Posted on 2026-01-24
IT/Technology
IT Project Manager, Site Reliability Engineering/Site Reliability
Radisson Hotel Group is a leading hospitality company serving as a true host and best partner to guests, owners, business partners and talent. Our unique hotel brands offer award‑winning and exceptional hotel experiences, originating from our strong Scandinavian heritage of design and innovation. Our brands embody our modern vision of hospitality, including authentic local tastes, stylish living design, unique locations and vibrant social scenes.
Radisson Hotel Group brings a refreshed commitment to hospitality leadership to meet the changing travel industry and the bespoke needs of our guests. We provide exceptional service in all of our hotels across the globe and strive to deliver a hospitality experience that is beyond guest expectations.
The SRE Manager ensures the reliability, scalability, and performance of Radisson Hotel Group’s digital web and app platforms.
Responsibilities:
- Lead, coach, and grow a team of SREs, fostering a culture of ownership, collaboration, and innovation.
- Drive automation of operational tasks, deployments, and monitoring to reduce manual effort and human error.
- Oversee incident management processes, ensuring timely communication, root cause analysis, and postmortems.
- Collaborate with software engineering, product, and infrastructure teams to design scalable, secure, and reliable systems.
- Report on system health, reliability metrics, and operational risks to senior leadership.
- Lead and mentor the SRE team to design, implement, and operate resilient systems.
- Establish and enforce best practices for monitoring, incident response, automation, and capacity planning.
- Partner with product, engineering, and infrastructure teams to embed reliability into the software development lifecycle.
- Highly available and performant digital platforms that enhance guest experience.
- Reduced downtime and faster incident resolution across services.
- A culture of reliability, automation, and continuous improvement within Digital services.
Location: Madrid, Spain.
Language skills: Fluency in English is a must.
Must have experience:
- 7+ years of experience in Site Reliability Engineering, DevOps, or Infrastructure roles.
- 2+ years in a leadership/managerial role, leading distributed teams.
- Proven track record of managing mission‑critical, customer‑facing digital platforms.
- Experience with hybrid cloud environments (Azure, AWS, GCP).
- Strong knowledge of observability tools (Dynatrace, Prometheus, Grafana, Splunk, etc.).
- Expertise in automation and Infrastructure-as-Code (Terraform, Ansible, Pulumi).
- Familiarity with CI/CD pipelines, Kubernetes, and microservices architectures.
Desirable experience:
- Hospitality, travel, or e‑commerce industry background.
- Solid understanding of networking, security, and distributed systems.
- Expertise in scripting languages (Python, Go, Bash).
Travel needs: Approximately 10% to Madrid and/or Brussels HQ.
Soft skills:
- Strong leadership and people management skills.
- Excellent communication and stakeholder management.
- Strategic thinker with hands‑on problem‑solving ability.
- Ability to thrive in a fast‑paced, global, customer‑centric environment.
Education: University Degree in Computer Science, Engineering, or related field.
Certifications: Cloud, agile, and/or DevOps certifications preferred.
Compensation:
To be discussed.