CloudOps – Digital Strategy and Product, English Services

Posted 5 days ago

About the role

  • A DevOps role managing the largest news site in Canada at CBC/Radio-Canada: installing, monitoring, and managing critical systems that span multiple platforms across the country.

Responsibilities

  • We're looking for a DevOps engineer to join our growing team, reporting to the Senior Manager, Web and Infrastructure.
  • It’s an opportunity to help shape the way the corporation works internally and make a contribution towards our “Space for us all” strategic plan.
  • As a part of the production solutions team, you will install, monitor and manage critical systems that span multiple platforms across the country.
  • You are a DevOps managing the largest news site in Canada and you want to step up to the challenges of working with bigger systems, bigger infrastructure, and bigger demands.
  • Or, you are leaving the old way of working behind and getting ready to learn new tools, work with exciting digital product teams and be part of a public broadcaster.
  • We spend our days solving problems on an unbelievable scale.
  • Media files are highly nuanced and incredibly complicated; moving our broadcasting content from on-prem servers to the cloud is a complex technical feat.
  • And that’s just the beginning.
  • In this role, you will continuously be challenged to apply your judgment, knowledge, experience and analytical skills to:
    • Enhance application functionality. You will plan, coordinate, implement and manage system configuration for new installations or upgrades.
    • Support. You will be responsible for initiating, planning and coordinating effective client support for the system.
    • Maintain. You will troubleshoot problems and coordinate activities to support the administration of systems.
    • Oversee operational procedures. You will assist in the development and implementation of standards for running new computer system processes and procedures.
    • Think in terms of platforms and products, not one-off solutions.
    • Favor automation over manual processes and reproducibility over ad hoc fixes.
    • Work comfortably in fast-evolving cloud-native ecosystems and learn new tools when they solve real problems.
    • Collaborate openly, share knowledge, and take pride in team outcomes rather than individual ownership.
    • Develop systems that combine operational simplicity with long-term scalability.
    • Communicate clearly and work effectively with engineers, product teams, and stakeholders.
    • Oversee infrastructure usage and load during major events such as federal elections and the Olympics.

Requirements

  • Strong hands-on experience with IaC using Terraform and Ansible with working knowledge of CloudFormation and AWS SAM.
  • Proven experience designing and operating Kubernetes platforms and implementing GitOps workflows based on core principles and best practices.
  • Hands-on experience with GitOps tools such as Argo CD, Helm, and GitLab CI, as well as Kubernetes-native controllers like AWS Controllers for Kubernetes (ACK) to manage cloud resources declaratively.
  • Hands-on experience implementing Kubernetes-native scaling strategies, including event-driven and on-demand scaling using tools such as KEDA and Karpenter.
  • Solid understanding of cloud-native architectures, containers, and serverless services on AWS.
  • Experience building and operating observability systems for logs, metrics, tracing, and alerting in distributed systems.
  • Practical knowledge of GNU/Linux systems and networking fundamentals, including DNS, TLS, HTTP, CDNs and load balancing.
  • Experience managing and scaling relational databases such as MySQL and PostgreSQL, including on Amazon RDS.
  • Proficiency in one or more programming languages (Go, Python) for automation and tooling.
  • Hands-on experience working from the command line in production environments.
  • Exposure to FinOps practices and cost-aware infrastructure design.
  • Experience migrating workloads to different architectures, including from x86 to ARM-based AWS Graviton EC2 instances, and transitioning workloads into containers or virtual machines.
  • Experience supporting production systems in high-availability environments.
  • Familiarity with SRE principles and reliability metrics such as SLOs, SLIs, error budgets, and DORA metrics.
  • Willingness to participate in a 24/7 on-call rotation and drive continuous improvement of operational playbooks and postmortem practices.
  • The flexibility. You are able and willing to work outside of regular working hours. You can travel between Montreal and Toronto when required.

Benefits

  • Work with purpose and impact at scale;
  • Flexible work schedules, allowing you to find balance for yourself, your family and your work;
  • A hybrid environment where you can enjoy the benefits of working from home and in-office collaboration;
  • Competitive total rewards package including robust health benefits and a best-in-class defined benefit pension plan;
  • Dedicated time for innovation, learning and development, wherever your interests lie;
  • Opportunities to work with emerging technology;
  • Opportunities for continued learning and professional development;
  • Opportunities to become a member of our Employee Resource Groups;
  • Pair programming and mentorship opportunities, where you can learn from the best in the industry and help coach new talent;
  • A creative and dynamic work environment, where your ideas and contributions can be heard, valued and respected;
  • A supportive management team committed to upholding the highest standards of diversity and inclusivity;
  • An environment which favors experimentation and an iterative approach in order to achieve the highest form of technical innovation.

Job type

Contract

Experience level

Mid level, Senior

Salary

Not specified

Degree requirement

Bachelor's Degree

Tech skills

Ansible, AWS, Cloud, Distributed Systems, DNS, EC2, Kubernetes, Linux, MySQL, Postgres, Python, Terraform, Go

Location requirements

Hybrid, Toronto, Canada
