Senior Engineering Manager, Site Reliability

Job in Toronto, Ontario, C6A, Canada
Listing for: Relay
Full Time position
Listed on 2026-03-01
Job specializations:
  • IT/Tech
    Systems Engineer, SRE/Site Reliability
Salary/Wage Range or Industry Benchmark: 243,000 - 297,000 CAD yearly
Job Description & How to Apply Below

Relay is a digital banking platform that gives self-made business owners the tools and know-how to be great with money—bringing clarity, confidence, and control to every dollar earned, so they can turn hard work into lasting success. We do this by replacing financial guesswork with real visibility, transforming cash flow from a constant source of stress into a clear signal owners can use to run stronger, more resilient businesses.

As Relay continues to scale, the reliability, performance, and resilience of our platform are no longer just technical concerns. They are core to our customer experience and business success.

This is a senior leadership role responsible not only for guiding a strong team of Site Reliability Engineers, but for shaping how reliability strategy influences engineering and product decisions across the organization. You will define the direction of the SRE function, drive alignment on operational excellence, and help the company anticipate and navigate scale inflection points before they become risk.

If you’re energized by complex systems, organizational leadership, and building resilient platforms customers rely on every day, we'd love to meet you!

What You'll Be Doing
  • Lead and evolve Relay’s Site Reliability Engineering function, setting strategic direction as the company scales

  • Define and drive the long-term reliability roadmap, making principled tradeoffs under real business and capacity constraints

  • Serve as the senior reliability voice in engineering and product leadership discussions

  • Influence how reliability considerations are embedded into product planning, architecture decisions, and delivery processes

  • Act as a senior escalation point during critical production incidents, ensuring clear communication and durable follow-through

  • Strengthen Relay’s observability, performance, and operational maturity practices across teams

  • Establish and reinforce standards around SLOs, operational readiness, incident management, and continuous improvement

  • Partner with Engineering, Product, Data, and Finance stakeholders to balance velocity, risk, performance, and cost

  • Build and develop a high-performing SRE organization capable of supporting future growth

Who You Are

You bring deep SRE expertise and senior-level leadership experience. You’ve operated at scale, navigated inflection points, and understand how reliability strategy must evolve as companies grow.

  • You have 5+ years of experience managing engineering teams and 8+ years in Site Reliability, Platform, or Infrastructure roles

  • You’ve owned and materially improved reliability, scalability, and performance in production systems

  • You’ve defined and driven reliability or platform strategy across teams or at an organizational level

  • You’ve built, evolved, or restructured SRE or platform functions in growing companies

  • You’ve led teams through significant production incidents and operational challenges, acting as a credible escalation leader

  • You demonstrate strong technical judgment in cloud-native systems (e.g., AWS) and modern infrastructure practices (IaC, CI/CD, observability)

  • You’ve influenced engineering and product leadership on reliability tradeoffs, long-term investments, and operational risk

  • You’re comfortable operating at multiple altitudes, from technical design discussions to executive-level conversations about impact and strategy

  • You lead with calm authority, set a high bar for ownership and accountability, and develop strong, opinionated engineers into even stronger leaders

  • You thrive in fast-moving environments where reliability practices must continuously evolve

Bonus Points
  • You’ve scaled an SRE or platform organization through a significant growth phase

  • You have experience in fintech or other regulated, high-availability environments

  • You’ve implemented SLO frameworks, error budgets, and capacity planning at scale

  • You’ve led through organizational change or multi-team transformation

The Interview Process:

  • Stage 1: A 1-hour Google Meet call with a member of the People team

  • Stage 2: A 1-hour Google Meet video call with the hiring manager (VP of Engineering)

  • Stage 3: A 1-hour in-person interview with a member of Relay’s senior leadership team

  • Stage 4: A…

Position Requirements
10+ years of work experience