More jobs:
Manager, Resiliency and Platform Engineering
Job in
Scottsdale, Maricopa County, Arizona, 85261, USA
Listed on 2026-01-12
Listing for:
Choice Hotels International, Inc.
Full Time
position Listed on 2026-01-12
Job specializations:
-
IT/Tech
Systems Engineer, Cloud Computing
Job Description & How to Apply Below
Scottsdale AZ - Technology & Digital Commerce Center time type:
Full time posted on:
Posted Yesterday job requisition :
R20951
** This role is not eligible for sponsorship AND is four days onsite hybrid at our N. Scottsdale office
** Choice Hotels has an exciting new opportunity as our
** Manager, Resiliency and Platform Engineering
** in the
** Sky Touch Technology
** division. Sky Touch Technology is an independently operated division of Choice Hotels that provides the most widely used cloud-based (SaaS) hotel property management system. As a key member of the Sky Touch Technology organization, this role is responsible for leading the teams and practices that ensure the reliability, resilience, and operational readiness of a mission-critical, multi-tenant SaaS platform operating at enterprise scale.**#Skys
The Limit
** when you **#Make It Your Choice **! We encourage you to apply today!
** Your Responsibilities**
* ** Serve as a senior technical authority for resiliency and reliability**, influencing platform architecture, operational design standards, and cross-domain engineering decisions beyond the immediate team.
* Lead and manage multiple engineering teams focused on application resiliency, data resiliency enablement, and engineering platform improvements, ensuring clear direction, priorities, and accountability.
* Own and drive year-round resiliency initiatives that reduce the likelihood and impact of outages in mission-critical systems, particularly during peak travel season when system stability directly impacts revenue and guest experience.
* Eliminate recurring production issues by addressing systemic root causes such as runtime instability, memory leaks, incorrect failure propagation, unsafe operational practices, and brittle defaults.
* Define and enforce standards for observability, monitoring, alerting, and operational readiness that improve signal quality, reduce unnecessary paging, and accelerate diagnosis and recovery.
* Lead platform and runtime modernization efforts including upgrades, containerization, serverless development standards, and engineering toolchain improvements while feature delivery continues in parallel.
* Improve developer experience as a reliability and quality lever by delivering tooling, frameworks, templates, documentation, and paved-road solutions adopted across engineering teams.
* Partner with enablement teams and application teams to drive adoption of standards and practices without owning their feature roadmaps.
* Apply automation and pragmatic use of AI where it demonstrably reduces engineering toil, accelerates remediation, or improves operational outcomes, prioritizing durability and adoption over experimentation.
* Translate technical risk into clear business context and communicate progress, outcomes, and priorities to engineering and technology leadership.
** Your Experience, Skills & Competencies
*** Bachelor's degree in related field required or equivalent experience
* Master’s degree in related field preferred
* At least 10 years’ experience in in software engineering, platform engineering, or reliability-focused roles, with multiple years leading engineering teams.
* Proficient in Microsoft Outlook, Excel, PowerPoint and Word
* Demonstrated success improving the reliability of distributed systems through
** systemic fixes rather than reactive incident management**.
* Strong technical background in Java-based systems, cloud environments, and modern application architectures, including microservices, event-driven systems, and legacy platforms.
* Experience leading large-scale platform, runtime, or foundational technology modernization efforts in production environments.
* Proven ability to define engineering standards and drive adoption across teams
** without direct ownership of their delivery commitments**.
* Experience improving observability, monitoring, and operational practices to reduce noise, accelerate diagnosis, and improve recovery outcomes.
* Comfort operating in environments where reliability issues have direct revenue and customer experience impact.
* Practical experience applying automation and AI-enabled tools to improve engineering productivity, quality, or operational outcomes.
* Strong communication and leadership skills, with the ability to set technical direction, make decisions under uncertainty, and hold teams accountable for durable outcomes.
* Successful candidates for this role consistently demonstrate the following leadership competencies:
* ** Manages Complexity**:
Effectively navigates complex technical and organizational environments by synthesizing incomplete or conflicting information, identifying root causes, and making sound decisions that balance short-term delivery with long-term system health.
* ** Decision Quality**:
Makes timely, well-reasoned decisions under uncertainty, particularly when reliability, revenue, and customer…
To View & Apply for jobs on this site that accept applications from your location or country, tap the button below to make a Search.
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).
Search for further Jobs Here:
×