About the role

  • Leading a Technical Operations team at Quicknode to ensure service reliability and oversee critical incidents. Responsible for team growth, project delivery, and operational efficiency.

Responsibilities

  • Own team delivery: set quarterly goals, run sprints, ship infra projects and upgrades on time
  • Drive reliability: maintain SLAs/SLOs, reduce incident recurrence
  • Lead escalations: coordinate SEV-0/1 response, stakeholder comms, post-mortems
  • Grow the team: hire, mentor, run performance reviews, and build a culture of accountability
  • Champion automation initiatives to reduce toil
  • Align with Product, Engineering, Security, and Support on priorities

Requirements

  • 3+ years (M3) or 5+ years (M4) managing SRE/DevOps/TechOps teams
  • Hands-on background in Linux, Kubernetes, cloud (AWS/GCP/OCI/Azure), and observability (Prometheus/Grafana/Datadog)
  • Led SEV-0/1 responses, written RCAs, and built processes for lower MTTR
  • Strong SLO/SLA ownership, on-call scheduling, workload distribution
  • Partner with Product, Engineering, Security, and Support
  • Articulate complex failures simply to engineers, execs, and customers

Benefits

  • Compensation: region-aligned, bonus-eligible, transparent from the start
  • Remote-first: async culture, follow-the-sun coverage, sustainable on-call
  • Inclusive and respectful workplace

Job type

Full Time

Experience level

Mid levelSenior

Salary

Not specified

Degree requirement

No Education Requirement

Tech skills

AWSAzureCloudGoogle Cloud PlatformGrafanaKubernetesLinuxPrometheus

Location requirements

RemoteCanada

Report this job

Found something wrong with the page? Please let us know by submitting a report below.