Senior Engineering Manager, Site Reliability
Relay is a digital banking platform that gives self-made business owners the tools and know-how to be great with money—bringing clarity, confidence, and control to every dollar earned, so they can turn hard work into lasting success. We do this by replacing financial guesswork with real visibility, transforming cash flow from a constant source of stress into a clear signal owners can use to run stronger, more resilient businesses.
As Relay continues to scale, the reliability, performance, and resilience of our platform are no longer just technical concerns. They are core to our customer experience and business success.
This is a senior leadership role responsible not only for guiding a strong team of Site Reliability Engineers, but for shaping how reliability strategy influences engineering and product decisions across the organization. You will define the direction of the SRE function, drive alignment on operational excellence, and help the company anticipate and navigate scale inflection points before they become risks.
If you’re energized by complex systems, organizational leadership, and building resilient platforms customers rely on every day, we'd love to meet you!
What You'll Be Doing
Lead and evolve Relay’s Site Reliability Engineering function, setting strategic direction as the company scales
Define and drive the long-term reliability roadmap, making principled tradeoffs under real business and capacity constraints
Serve as the senior reliability voice in engineering and product leadership discussions
Influence how reliability considerations are embedded into product planning, architecture decisions, and delivery processes
Act as a senior escalation point during critical production incidents, ensuring clear communication and durable follow-through
Strengthen Relay’s observability, performance, and operational maturity practices across teams
Establish and reinforce standards around SLOs, operational readiness, incident management, and continuous improvement
Partner with Engineering, Product, Data, and Finance stakeholders to balance velocity, risk, performance, and cost
Build and develop a high-performing SRE organization capable of supporting future growth
You bring deep SRE expertise and senior-level leadership experience. You’ve operated at scale, navigated inflection points, and understand how reliability strategy must evolve as companies grow.
You have 5+ years of experience managing engineering teams and 8+ years in Site Reliability, Platform, or Infrastructure roles
You’ve owned and materially improved reliability, scalability, and performance in production systems
You’ve defined and driven reliability or platform strategy across teams or at an organizational level
You’ve built, evolved, or restructured SRE or platform functions in growing companies
You’ve led teams through significant production incidents and operational challenges, acting as a credible escalation leader
You demonstrate strong technical judgment in cloud-native systems (e.g., AWS) and modern infrastructure practices (IaC, CI/CD, observability)
You’ve influenced engineering and product leadership on reliability tradeoffs, long-term investments, and operational risk
You’re comfortable operating at multiple altitudes, from technical design discussions to executive-level conversations about impact and strategy
You lead with calm authority, set a high bar for ownership and accountability, and develop strong, opinionated engineers into even stronger leaders
You thrive in fast-moving environments where reliability practices must continuously evolve
You’ve scaled an SRE or platform organization through a significant growth phase
You have experience in fintech or other regulated, high-availability environments
You’ve implemented SLO frameworks, error budgets, and capacity planning at scale
You’ve led through organizational change or multi-team transformation
The Interview Process:
Stage 1: A 1-hour Google Meet call with a member of the People team
Stage 2: A 1-hour Google Meet video call with the hiring manager (VP of Engineering)
Stage 3: A 1-hour in-person interview with a member of Relay’s senior leadership team
Stage 4: A…