Senior Engineering Manager guiding the Site Reliability Engineering function for Relay, a digital banking platform. Defining reliability strategies and leading engineering teams to enhance operational excellence.
Responsibilities
Lead and evolve Relay’s Site Reliability Engineering function, setting strategic direction as the company scales.
Define and drive the long-term reliability roadmap, making principled tradeoffs under real business and capacity constraints.
Serve as the senior reliability voice in engineering and product leadership discussions.
Influence how reliability considerations are embedded into product planning, architecture decisions, and delivery processes.
Act as a senior escalation point during critical production incidents, ensuring clear communication and durable follow-through.
Strengthen Relay’s observability, performance, and operational maturity practices across teams.
Establish and reinforce standards around SLOs, operational readiness, incident management, and continuous improvement.
Partner with Engineering, Product, Data, and Finance stakeholders to balance velocity, risk, performance, and cost.
Build and develop a high-performing SRE organization capable of supporting future growth.
Requirements
You have 5+ years of experience managing engineering teams and 8+ years in Site Reliability, Platform, or Infrastructure roles.
You’ve owned and materially improved reliability, scalability, and performance in production systems.
You’ve defined and driven reliability or platform strategy across teams or at an organizational level.
You’ve built, evolved, or restructured SRE or platform functions in growing companies.
You’ve led teams through significant production incidents and operational challenges, acting as a credible escalation leader.
You demonstrate strong technical judgment in cloud-native systems (e.g., AWS) and modern infrastructure practices (IaC, CI/CD, observability).
You’ve influenced engineering and product leadership on reliability tradeoffs, long-term investments, and operational risk.
You’re comfortable operating at multiple altitudes, from technical design discussions to executive-level conversations about impact and strategy.
You lead with calm authority, set a high bar for ownership and accountability, and develop strong, opinionated engineers into even stronger leaders.
You thrive in fast-moving environments where reliability practices must continuously evolve.
Benefits
Competitive salary and meaningful equity: Relay employees are Relay owners, complete with equity and a competitive salary.
Comprehensive health benefits: enjoy full health benefits from day one, with no probation period required. We offer flexible Health or Wellness Spending Accounts and medical, dental, and vision coverage for you and your dependents.
Flexible vacation and time off: every team member starts with 15 vacation days and 5 flex days to use as needed, plus an extra week of office closure during the end-of-year holidays so you can take time off to recharge and come back better for our customers.
Parental leave with top-up: we offer 12 weeks off with a 100% salary top-up for all full-time employees, regardless of location. It is available to all parents, whether birthing, non-birthing, or adoptive.
Hybrid work environment: we value meaningful collaboration and connection at our Toronto office twice a week, with lunch, snacks, and beverages on us.
Dog-friendly space: can dogs really make you happy and healthy? We don’t know for sure, but since we don’t want to chance it, our office is 100% floof-friendly.
Personal and professional growth: work with peers and leaders who invest in your growth and success through ongoing feedback, mentorship, and coaching.
Top-tier equipment: as a Mac-first company, our Toronto offices have everything you need to produce your best work comfortably, from multiple screens to ergonomic seating.
Social connection: we believe in celebrating our wins with two annual company-wide get-togethers, quarterly team events, and happy hours, as well as special events and networking opportunities.