Principal Site Reliability Engineer - FlightAware; Remote
Coos Bay, Coos County, Oregon, 97458, USA
Listed on 2026-03-05
IT/Tech
Systems Engineer, Cloud Computing, SRE/Site Reliability
Country: United States of America
Location: US-WA-REMOTE
Position Role Type: Remote
U.S. Citizen, U.S. Person, or Immigration Status Requirements: Must be authorized to work in the U.S. without the company’s immigration sponsorship now or in the future. The company will not offer immigration sponsorship for this position. The company will not seek an export authorization for this role.
Security Clearance Type: None/Not Required
Security Clearance Status: Not Required
Are you interested in building and maintaining the infrastructure to power the world’s largest flight tracking platform?
FlightAware, part of Collins Aerospace, has built the world's leading aviation software platform, processing over 300 million incoming messages an hour from almost 40,000 individual data feeds, more than 5 terabytes a day and growing! We provide the best, most complete, and most accurate real-time flight‑tracking service, and we are proud to have built on this foundation a wide variety of successful products that have become central to the aviation industry at large.
FlightAware is searching for an enthusiastic and process‑driven Site Reliability Engineer (SRE) to "automate themselves out of a job." The FlightAware Site Reliability Engineering team embraces infrastructure automation, release engineering, and continuous delivery. As part of the Operations team, our SREs work alongside highly effective and talented counterparts, interacting closely with all facets of the Engineering org. This role requires a fusion of skills in development, analytics, and hardware to solve problems in an exciting and demanding environment.
FlightAware Engineering consists of teams that develop and deliver an array of services powering its commercial products. These teams tackle a broad set of technologies and solve challenging technical and product problems daily. From collecting and interpreting aviation datasets to enriching them with procedural and AI/ML solutions and delivering them through APIs, web interfaces, and reporting products, our engineers work in a dynamic environment that tests their ability to marshal over 100 resources to achieve the company’s vision.
Regardless of role, we expect excellent interpersonal and communication skills across all hires. We look for candidates who will thrive here, meaning they demonstrate clear communication, embrace open feedback, trust their colleagues, and are driven to execute, deliver, and complete projects independently and efficiently.
Learn more about the history of our reliability team and the FlightAware engineering interview process.
What You Will Do:
- Spend your days automating and improving reliability while continuing to push FlightAware's infrastructure forward, ensuring it is resilient and reproducible.
- Be responsible for service availability, performance, monitoring, incident response, and capacity planning.
- Create, improve, and manage environments to ensure decisions on resource allocation, problem identification, and capacity planning are based on accurate data‑driven insights.
- Maintain physical infrastructure using Kubernetes, Linux, and Ceph, and cloud infrastructure in AWS, as part of the Site Reliability Engineering team.
- Influence technology decisions and direction to grow and support the FlightAware platform.
- Collaborate closely with fellow SREs on your team and extend your collaboration across other FlightAware teams and disciplines to design dependable and scalable solutions and services.
- Identify, implement, and champion process improvements to enhance productivity, collaboration, and delivery efficiency, while ensuring alignment with company goals and industry best practices.
- Gain a deep understanding of the systems and infrastructure that support FlightAware’s applications and services, including networking, operating systems, cloud platforms, databases, and other relevant technologies.
- Learn the intricacies of handling and processing real‑time flight data, including ensuring the reliability of systems dealing with dynamic and time‑sensitive information.
- Gain expertise in designing and maintaining high‑availability architectures for…