DevOps role managing critical systems across platforms at CBC/Radio-Canada. Ensuring effective client support and infrastructure optimization in a hybrid work model.
Responsibilities
install, monitor and manage critical systems that span multiple platforms across the country
plan, coordinate, implement, manage system configuration for new installations or upgrades
responsible for initiating, planning and coordinating the effective client support for the system
troubleshoot problems and coordinate activities to support the administration of systems
assist in the development and implementation of standards for running new computer system processes and procedures
build automation and internal tooling to reduce operational overhead and improve developer productivity
document infrastructure standards, patterns, and operational practices
collaborate closely with software engineers, product teams, and other stakeholders
Requirements
strong hands-on experience with IaC using Terraform and Ansible
proven experience designing and operating Kubernetes platforms
implementing GitOps workflows based on core principles and best practices
hands-on experience with GitOps tools such as Argo CD, Helm, and GitLab CI
Kubernetes-native controllers like AWS Controllers for Kubernetes (ACK)
solid understanding of cloud-native architectures, containers, and serverless services on AWS
experience building and operating observability systems
practical knowledge of GNU/Linux systems
experience managing and scaling relational databases like MySQL and PostgreSQL
proficiency in one or more programming languages (Go, Python) for automation and tooling
experience supporting production systems in high-availability environments
familiarity with SRE principles and reliability metrics
flexibility to work outside of regular working hours
bachelor’s degree in Computer Science or Engineering or 5+ years working with high availability systems
Benefits
excellent benefits package
pension plan noted as one of the best in the country
Site Reliability Engineer responsible for the installation, configuration, maintenance of middleware technologies at Hyve Solutions. Managing applications on container platforms and ensuring reliable operation of critical middleware components.
Director of IT Operations & DevOps leading infrastructure and DevOps at CanadaHelps. Focus on operational reliability, improvements, and team collaboration in a technology - driven environment.
Own operational reliability of cloud load balancing infrastructure serving global customers. Design and implement frameworks reflecting customer experience for reliability management.
Senior Site Reliability Engineer ensuring platform reliability at Circle. Managing systems and database infrastructure to support high growth in user engagement and system performance.
DevOps II role providing production support for Java - based applications. Involves incident management, CI/CD operations, and collaboration on cloud platforms.
Senior DevOps Engineer at Ad Hoc contributing to DevOps and software engineering strategies. Collaborating across teams and mentoring members to improve software delivery processes.
Senior DevOps Engineer responsible for enhancing CI/CD processes at EQ Bank's IT team. Collaborating with developers to streamline software delivery and operations.
Senior DevOps Engineer designing and managing cloud infrastructure at Borrowell, a company helping Canadians with their finances. Collaborating with development, security, and QA teams to enhance service delivery.
Senior Site Reliability Engineer joining SaaS - Ops team at Magnet Forensics. Overseeing Kubernetes clusters and operational reliability in cloud environments for law enforcement customers.
Senior Site Reliability Engineer establishing infrastructure to support Thunderbird’s privacy - respecting tools. Collaborates remotely with a distributed team across various time zones.