Principal SaaS Operations Engineer
Fairfield, Fairfield County, Connecticut, 06828, USA
Listed on 2026-01-14
-
IT/Tech
Cloud Computing, Systems Engineer, Cybersecurity, SRE/Site Reliability
and the job listing Expires on January 16, 2026
RSA provides trusted identity and access management for 12,000 organizations around the world, managing 25 million enterprise identities and providing secure, convenient access to millions of users. RSA specializes in empowering security‑first organizations in financial services, healthcare, energy, technology services, and other industries to thrive in a digital world, delivering complete capabilities for modern authentication, access, lifecycle management, and identity governance.
Whether in the cloud or on‑premises, RSA connects people with the digital resources they depend on everywhere they live, work, and play.
For decades, RSA has pioneered many of the encryption, authentication, and identity federation technologies that still power the internet. And now RSA is transforming the industry yet again, paving the way for the future of digital identity through the RSA Unified Identity Platform; next‑generation hybrid and cloud solutions; the first ever and only multi‑functional, passwordless hardware authenticator; and a frictionless, mobile‑optimised experience for the modern workforce.
If you are self‑motivated and looking for a fast‑paced challenge doing something that truly matters, come join our winning team! For more information, go to
RSA is seeking a Principal SaaS Operations Engineer to serve as a technical leader and hands‑on expert within the Global SaaS Operations organization supporting the RSA and Secur
ID Cloud Platforms.
This role bridges operations engineering and Dev Ops practices, driving architecture, automation, and reliability initiatives across RSA’s cloud environments. The ideal candidate combines deep cloud expertise with strong execution and leadership in cross‑functional projects, mentoring other engineers, and improving operational maturity at scale.
- Act as the technical lead across SaaS Operations initiatives — mentoring team members, setting technical direction, and leading incident response and recovery efforts.
- Design, automate, and optimise cloud operations in Azure and AWS environments with emphasis on scalability, reliability, and security.
- Partner with Dev Ops and Cloud Engineering to implement CI/CD pipelines, IAC frameworks (Terraform, Ansible, Jenkins), and automated configuration management.
- Develop and maintain monitoring, alerting, and observability across distributed systems using Azure Monitor, Dynatrace, and Prometheus/Grafana stacks.
- Lead root cause analysis (RCA) and continuous improvement processes following incidents, ensuring lessons learned translate into durable engineering improvements.
- Architect and enforce secure operational practices, including key management, access controls, and change management workflows.
- Support multi‑cloud networking: routing, firewalls, load balancers, VPNs, DNS, and proxies.
- Drive automation for backup, DR, and compliance validation, aligning with FedRAMP, SOC 2, and DISA STIG frameworks.
- Collaborate with Engineering and Product teams to ope rationalise new services and define production readiness standards.
- Participate in on‑call rotation and act as escalation point for critical incidents.
- 10+ years in SaaS Operations, Site Reliability Engineering, or Dev Ops within large‑scale production environments.
- Experience operating in FedRAMP, DoD IL, or regulated cloud environments.
- Extensive Linux/UNIX administration experience (SUSE, RHEL, Ubuntu) with practical scripting in Python, Bash, or Power Shell.
- Expertise with Azure (preferred) and AWS cloud ecosystems — including virtual networking, IaaS/PaaS services, monitoring, and automation.
- Proven ability to build and maintain CI/CD pipelines using Jenkins, Git Hub Actions, or Azure Dev Ops.
- Hands‑on experience with infrastructure‑as‑code and configuration management (Terraform, Ansible, Puppet, or Chef).
- Strong knowledge of networking fundamentals and security best practices.
- Proficiency in monitoring and logging systems (Dynatrace, ELK stack, Cloud Watch, Azure Monitor, Pager Duty, etc.).
- Solid understanding of high availability and DR architectures.
- U.S. Citizenship required (due to FedRAMP and DoD compliance…
(If this job is in fact in your jurisdiction, then you may be using a Proxy or VPN to access this site, and to progress further, you should change your connectivity to another mobile device or PC).