Manager, Resiliency and Platform Engineering
Listed on 2026-01-14
-
IT/Tech
Systems Engineer -
Engineering
Systems Engineer
Manager, Resiliency and Platform Engineering
Be among the first 25 applicants. 3 days ago.
- This role is not eligible for sponsorship and is a four‑day onsite hybrid at our N. Scottsdale office.
Choice Hotels has an exciting new opportunity as our Manager, Resiliency and Platform Engineering in the Sky Touch Technology division. Sky Touch Technology is an independently operated division of Choice Hotels that provides the most widely used cloud‑based (SaaS) hotel property management system.
This is not a traditional Site Reliability Engineering role. In our environment, resiliency and platform engineering are proactive, year‑round disciplines focused on eliminating failure modes, modernising foundational technology, and enabling engineering teams to build and operate systems safely at scale.
As a leader in this role, you will manage teams with dedicated capacity to drive platform modernisation, engineering resiliency, and developer enablement while feature delivery continues in parallel. You will work across enablement teams and application teams to turn lessons learned into durable standards, tools, and paved roads that reduce operational risk and engineering toil.
Are you someone who can operate effectively in complex, always‑on environments, hold teams accountable for reliability outcomes, and balance immediate operational demands with long‑term platform resilience? The #Skys The Limit when you #Make It Your Choice ! We encourage you to apply today!
Your Responsibilities- Serve as a senior technical authority for resiliency and reliability, influencing platform architecture, operational design standards, and cross‑domain engineering decisions beyond the immediate team.
- Own and drive year‑round resiliency initiatives that reduce the likelihood and impact of outages in mission‑critical systems, particularly during peak travel season when system stability directly impacts revenue and guest experience.
- Lead and manage multiple engineering teams focused on application resiliency, data resiliency enablement, and engineering platform improvements, ensuring clear direction, priorities, and accountability.
- Eliminate recurring production issues by addressing systemic root causes such as runtime instability, memory leaks, incorrect failure propagation, unsafe operational practices, and brittle defaults.
- Define and enforce standards for observability, monitoring, alerting, and operational readiness that improve signal quality, reduce unnecessary paging, and accelerate diagnosis and recovery.
- Lead platform and runtime modernisation efforts including upgrades, containerisation, serverless development standards, and engineering toolchain improvements while feature delivery continues in parallel.
- Improve developer experience as a reliability and quality lever by delivering tooling, frameworks, templates, documentation, and paved‑road solutions adopted across engineering teams.
- Partner with enablement teams and application teams to drive adoption of standards and practices without owning their feature roadmaps.
- Apply automation and pragmatic use of AI where it demonstrably reduces engineering toil, accelerates remediation, or improves operational outcomes, prioritising durability and adoption over experimentation.
- Translate technical risk into clear business context and communicate progress, outcomes, and priorities to engineering and technology leadership.
- Extensive experience in software engineering, platform engineering, or reliability‑focused roles, including multiple years leading engineering teams.
- Demonstrated success improving the reliability of distributed systems through systemic fixes rather than reactive incident management.
- Strong technical background in Java‑based systems and modern application architectures, including Spring Boot applications, microservices, event‑driven systems, and legacy platforms operating in cloud environments such as AWS.
- Bachelor’s degree in a related field required or equivalent experience.
- Experience leading large‑scale platform, runtime, or foundational technology modernisation efforts in production environments.
- Proven ability to define…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).