About the role

You will lead a Technical Operations team at Quicknode, ensuring service reliability and overseeing critical incidents. You will be responsible for team growth, project delivery, and operational efficiency.

Responsibilities

Own team delivery: set quarterly goals, run sprints, ship infra projects and upgrades on time
Drive reliability: maintain SLAs/SLOs, reduce incident recurrence
Lead escalations: coordinate SEV-0/1 response, stakeholder comms, post-mortems
Grow the team: hire, mentor, run performance reviews, and build a culture of accountability
Champion automation initiatives to reduce toil
Align with Product, Engineering, Security, and Support on priorities

Requirements

3+ years (M3) or 5+ years (M4) managing SRE/DevOps/TechOps teams
Hands-on background in Linux, Kubernetes, cloud (AWS/GCP/OCI/Azure), and observability (Prometheus/Grafana/Datadog)
Track record of leading SEV-0/1 responses, writing RCAs, and building processes that reduce MTTR
Strong SLO/SLA ownership, on-call scheduling, workload distribution
Experience partnering with Product, Engineering, Security, and Support
Ability to explain complex failures simply to engineers, execs, and customers