Senior Site Reliability Engineer maintaining and optimizing large-scale distributed infrastructure at Branch. Collaborating with cross-functional teams to support mission-critical services across the organization.
Responsibilities
Architect, design, and evolve complex distributed systems to improve reliability, operational efficiency, and performance at scale
Partner closely with product, security, and data engineering teams to translate business needs into resilient and scalable system designs
Drive reliability through automation and advanced observability
Lead and mentor in high stakes situations
Perform deep infrastructure cost audits
Own and maintain key distributed data platforms
Guide teams in defining SLIs/SLOs and operational best practices
Continuously identify and eliminate bottlenecks
Champion Infrastructure as Code (IaC) to automate provisioning, configuration, and lifecycle management
Lead our GitOps and deployment strategy using Argo CD
Requirements
6+ years in SRE, systems engineering, or software engineering roles
Proven track record as a senior reliability or production engineer
Expert level proficiency in Kubernetes, AWS, Linux internals, and distributed system fundamentals
Strong programming skills in Go, Python, Java, Kotlin, Bash, or similar languages
Hands-on experience with modern observability stacks (Prometheus, Grafana, AlertManager, Loki, PagerDuty)
Familiarity with large scale data and streaming ecosystems such as Kafka, Spark, Aerospike, FoundationDB, and the broader Hadoop ecosystem
Deep experience with Terraform, CloudFormation, or related IaC tooling
Proven incident management leadership in production SaaS systems
Deployment Engineer at Maneva bringing AI - powered vision systems to manufacturing environments in Canada and the US, ensuring production - ready installations.
Senior DevOps Engineer operating AWS infrastructure and Kubernetes for BlueCat Cloud SaaS platform. Focused on automation and operational stability while collaborating with cross - functional teams.
System Analyst in Alberta Blue Cross supporting SharePoint Online and M365 collaboration tools for over 1.8 million members. Collaborating with teams to enhance digital workplace environment.
Senior DevOps Specialist ensuring the reliability, scalability, and efficiency of Experlogix's SaaS platforms. Collaborating with development and operations teams to streamline deployment processes.
Senior DevOps Engineer designing and operating cloud - native infrastructure for distributed systems at ELITS. Collaborating with teams to ensure reliable streaming and high availability in production.
Senior Data DevOps Engineer at Scene+, supporting reliability and deployment of data platforms. Collaborating across teams to design automated pipelines and ensure operational stability.
Director of Software Engineering at Affirm focusing on site reliability engineering. Leading a global team and establishing risk management practices in a remote environment.
Hands - on Senior DevOps Developer designing, building, and operating secure cloud infrastructure. Enabling engineering teams to deploy mission - critical digital solutions into the nuclear industry.
DevSecOps Engineer responsible for building CI/CD pipelines and collaborating with security and operations teams at Aviso Wealth. Contributes to a culture of continuous improvement by implementing best practices.