Senior Site Reliability Engineer (SRE) – Hybrid role in Mississauga. Design, build, and maintain cloud infrastructure through code, automate CI/CD, and manage Kubernetes clusters.
Responsibilities
Build and maintain Kubernetes clusters and cloud-native infrastructure. Automate and optimize CI/CD pipelines and operational workflows. Develop monitoring and observability systems (Grafana, Prometheus, Kibana). Collaborate with development teams to streamline environments and processes. Own infrastructure through code, ensuring reliability, scalability, and security.
Requirements
6+ years as an SRE and 6+ years in software development. Expert in Terraform, Azure, AKS, Kubernetes. Strong scripting skills (PowerShell, Shell). Hands-on experience with CI/CD pipelines, Docker, Helm, Ansible. Excellent problem-solving and communication skills.
Production Support Engineer / SRE role supporting critical digital applications with SRE practices. Requires 5+ years experience with Ansible, Elasticsearch, MongoDB, Redis, OpenShift, Azure, and Linux/Windows administration.
Production Support Engineer ensuring system stability and reliability for Manulife's critical services. Collaborative role bridging development and infrastructure, providing seamless service for customers.
Senior SRE Engineer for cloud - native solutions, CI/CD automation, and infrastructure - as - code. Hybrid role in Mississauga, ON with Azure/Kubernetes focus.
Production Support Engineer at Miratech ensuring reliability for mission - critical contact center environments through proactive monitoring and troubleshooting. Join a global IT services company focused on digital transformation.
Senior SRE role building Kubernetes infrastructure, CI/CD pipelines, and automation. Hybrid contract in Mississauga with potential for full - time conversion.
Production Engineer ensuring compliance with manufacturing procedures and standards at Galderma. Optimizing production processes and supporting autonomous work cells for operational improvements.
Production Engineering Specialist providing support to the Production and Planning departments at Coperion. Implementing design improvements and ensuring efficiency of manufacturing processes.
Senior SRE role designing secure, scalable AKS clusters and automating infrastructure using Terraform. Requires 6+ years SRE/software engineering experience with Azure, Kubernetes, and CI/CD pipelines.
Contract Site Reliability Engineer role in Brampton, ON requiring 5 - 8 years of OpenShift, Azure, Kubernetes experience with monitoring tools expertise.
Site Reliability Engineer (SRE) role focused on automation, resilience, and scale across cloud - native platforms. Responsibilities include monitoring, Kubernetes, AWS, disaster recovery, and mentoring teams.