Principal Engineer, Operational Excellence & Resilience; Remote
Coos Bay, Coos County, Oregon, 97458, USA
Listed on 2026-01-12
-
IT/Tech
Systems Engineer, IT Project Manager, Cybersecurity, IT Consultant
As a global leader in cybersecurity, Crowd Strike protects the people, processes and technologies that drive modern organizations. Since 2011, our mission hasn’t changed — we’re here to stop breaches, and we’ve redefined modern security with the world’s most advanced AI-native platform. Our customers span all industries, and they count on Crowd Strike to keep their businesses running, their communities safe and their lives moving forward.
We’re also a mission-driven company. We cultivate a culture that gives every Crowd Striker both the flexibility and autonomy to own their careers. We’re always looking to add talented Crowd Strikers to the team who have limitless passion, a relentless focus on innovation and a fanatical commitment to our customers, our community and each other. Ready to join a mission that matters?
The future of cybersecurity starts with you.
The Technology Resilience Principal Engineer will lead Crowd Strike's Technology Resilience function within the Resilience Organization. This role drives the strategy and execution of resilience practices across Crowd Strike's technology stack, including infrastructure, applications, and products. Working in parallel with the Business Resilience team, this position creates and maintains comprehensive technical resilience standards and practices that ensure service reliability, system redundancy, and rapid recovery capabilities.
The Technology Resilience Principal provides dedicated support specifically for our technology business units, ensuring enterprise-wide consistency in how we apply resilience standards across infrastructure, applications, and products. As a senior individual contributor, this position serves as the central "hub" in our hub‑and‑spoke model for technology resilience, partnering with technology business units for implementation and operational delivery.
With potential to grow as the function matures, this role bridges the gap between our established enterprise resilience capabilities and the specialized needs of our technology organizations.
What You’ll Do:- Cross‑Organizational Coordination:
Facilitate coordination between stakeholders across IT, Product, Engineering, and business units, serving as the central point for technology resilience initiatives and ensuring alignment with business objectives - Enterprise Standards & Governance:
Own and maintain enterprise‑wide technology resilience standards, ensuring consistent implementation and reducing organizational drift from established frameworks across infrastructure, application, and product domains - Technology Resilience Strategy:
Drive comprehensive technical resilience architecture including infrastructure redundancy and fault tolerance, application resilience and graceful degradation strategies, and chaos engineering frameworks for continuous resilience validation - Disaster Recovery Leadership:
Lead enterprise technical recovery strategy development and implementation, including backup and redundancy systems, recovery time/point objectives (RTO/RPO) for technical systems, and data recovery/restoration procedures - Product Performance and Scalability:
Partner to define and implement resilience standards, including feature flagging, release, testing, multi‑tenancy frameworks, and scalability frameworks to manage growth - Risk Oversight & Metrics:
Provide technical oversight and aggregation of technology resilience risks across the enterprise, establishing and monitoring key performance indicators including system uptime
- Resilience Engineering Leadership:
Drive chaos engineering and resilience testing programs, establishing enterprise‑wide practices for proactive resilience validation and continuous improvement - Shared Tooling Strategy:
Own shared resilience tooling strategy, evaluation, and implementation to support enterprise‑wide capabilities including monitoring, testing, and recovery automation - Stakeholder Engagement:
Build and maintain formal networks with key constituents across business units, engineering teams, and external partners - Crisis Leadership:
Serve as senior technical advisor during major incident response, providing…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).