Site Reliability Engineer (SRE) role focused on automation, resilience, and scale across cloud-native platforms. Responsibilities include monitoring, Kubernetes, AWS, disaster recovery, and mentoring teams.
Responsibilities
Drive automation, resilience, and scale across cloud-native platforms. Work on monitoring & observability (alerts, dashboards), automation & IaC (Python, Terraform, CloudFormation), Kubernetes (K8s) & AWS Cloud, disaster recovery (DR) strategies, ServiceNow workflows (incident management), production troubleshooting (on-call rotations), coaching delivery teams on SRE best practices, and blameless postmortems for continuous learning.
Senior Java Engineer - Production Management needed for a 6-month contract-to-hire in Mississauga, Ontario (hybrid). Lead transformation, automation, and operational excellence.
Production Support Engineer / SRE role supporting critical digital applications with SRE practices. Requires 5+ years of experience with Ansible, Elasticsearch, MongoDB, Redis, OpenShift, Azure, and Linux/Windows administration.
Production Support Engineer ensuring system stability and reliability for Manulife's critical services. Collaborative role bridging development and infrastructure, providing seamless service for customers.
Senior SRE Engineer for cloud-native solutions, CI/CD automation, and infrastructure-as-code. Hybrid role in Mississauga, ON with Azure/Kubernetes focus.
Production Support Engineer at Miratech ensuring reliability for mission-critical contact center environments through proactive monitoring and troubleshooting. Join a global IT services company focused on digital transformation.
Senior SRE role building Kubernetes infrastructure, CI/CD pipelines, and automation. Hybrid contract in Mississauga with potential for full-time conversion.