Senior Site Reliability Engineer maintaining and optimizing large-scale distributed infrastructure at Branch. Collaborating with cross-functional teams to support mission-critical services across the organization.
Responsibilities
Architect, design, and evolve complex distributed systems to improve reliability, operational efficiency, and performance at scale
Partner closely with product, security, and data engineering teams to translate business needs into resilient and scalable system designs
Drive reliability through automation and advanced observability
Lead and mentor in high stakes situations
Perform deep infrastructure cost audits
Own and maintain key distributed data platforms
Guide teams in defining SLIs/SLOs and operational best practices
Continuously identify and eliminate bottlenecks
Champion Infrastructure as Code (IaC) to automate provisioning, configuration, and lifecycle management
Lead our GitOps and deployment strategy using Argo CD
Requirements
6+ years in SRE, systems engineering, or software engineering roles
Proven track record as a senior reliability or production engineer
Expert level proficiency in Kubernetes, AWS, Linux internals, and distributed system fundamentals
Strong programming skills in Go, Python, Java, Kotlin, Bash, or similar languages
Hands-on experience with modern observability stacks (Prometheus, Grafana, AlertManager, Loki, PagerDuty)
Familiarity with large scale data and streaming ecosystems such as Kafka, Spark, Aerospike, FoundationDB, and the broader Hadoop ecosystem
Deep experience with Terraform, CloudFormation, or related IaC tooling
Proven incident management leadership in production SaaS systems
Senior Developer / DevOps Specialist joining large - scale digital modernization initiative. Building secure, scalable cloud - native applications within an agile delivery environment.
Senior Deployment Engineer addressing complex technical integrations in AI agent deployments for customer experience. Collaborative role with technical teams and customers to optimize solutions.
We are hiring a CI/CD Engineer with strong Platform Engineering and DevOps expertise to design, build, and optimize scalable and secure CI/CD pipelines and cloud - based platforms in Toronto, ON.
DevOps Lead needed for a 6 - 12 month remote contract in Toronto, ON. Must have 10 - 12 years experience, CI/CD with Azure DevOps, Docker, Kubernetes, and scan integration.
Co - op or Intern, DevOps Engineer joining BDO Digital's AppDev team. Responsibilities include managing Azure cloud environments and building CI/CD pipelines.
Senior DevOps Engineer designing and implementing scalable AWS network architectures at Magnet Forensics. Collaborating with diverse teams for secure, efficient connectivity across services.
Site Reliability Engineer ensuring high availability, scalability, and performance of Emburse’s systems. Collaborating on distributed systems while mentoring junior engineers.
Associate DevOps Engineer supporting the Continuous Integration and Delivery pipeline of Sun Life's Canadian IT API applications. Ideal for Computer Science students graduating December 2026 or later, seeking industry experience.
Reliability Engineering Intern working with experienced engineers on mining operations. Gaining hands - on experience with Caterpillar equipment and engineering challenges.
Senior Reliability Engineer at IKO Industries optimizing asset reliability and equipment performance across manufacturing operations. Applying advanced reliability methodologies and leading multi - site initiatives.