Sr. Site Reliability Engineer (SRE – Azure) for global inventory/data services company. 8-month contract-to-hire, 3 days/week onsite in Mississauga, ON or Dallas, TX.
Responsibilities
Senior Site Reliability Engineer role supporting production infrastructure for a global inventory/data collections services company. Responsibilities include managing Azure infrastructure, container orchestration with Kubernetes, infrastructure as code using Terraform, Docker, Helm, Packer, Ansible, ARM, configuration management, PowerShell/Shell scripting, observability tools (Grafana, Kibana, Prometheus), enterprise-grade software, microservices architecture, and Kubernetes production systems.
Requirements
6+ years of experience as an SRE supporting production infrastructure. 6+ years of overall software engineering experience in a development environment. Extensive experience with Azure. Experience with container orchestration platforms such as Kubernetes. Experience using IAC tools such as Terraform, Docker, Helm, Packer, Ansible, ARM. Experience with configuration management tools such as Ansible, YAML and Terraform. Experience with PowerShell and Shell scripting. Experience managing observability tools such as Grafana, Kibana and Prometheus. Experience with enterprise-grade software. Experience with software development. Experience with microservices architecture. At least two years of experience managing Kubernetes production systems.
Storage Technical Analyst providing global support for RBC's storage and backups infrastructure. Mentor operations staff and manage automation solutions for advanced incident management.
Infrastructure Engineer/SRE responsible for core infrastructure design and building tools for AI - driven contact center solutions. Join a leading AI company impacting the future of work.
DevOps Engineer intern at Sun Life focusing on Java applications and working with Docker and Kubernetes. Engage in collaborative, agile practices with the DevOps team.
Senior Developer, DevOps responsible for Azure infrastructure and automation at Radio - Canada. Collaborating with development teams to ensure optimal performance, availability, and security for digital media services.
Senior Analyst on Data Platform DevOps at AIMCo, responsible for building data operations and collaborating with teams on innovative solutions. Focused on ensuring data quality and integrity across technologies.
Site Reliability Engineer ensuring reliability, availability, and performance of Hiive's platform. Collaborating with cross - functional teams to build scalable and resilient infrastructure while supporting AI systems.
AI Security Control Developer/Site Reliability Engineer for RBC's enterprise AI ecosystem. Design, implement, and validate security controls to protect AI systems with 24/7 reliability.
DevOps Engineering Manager leading a team to improve SDLC at Vancity, Canada's largest Living Wage Employer. Collaborating across teams for reliable delivery of mission - critical systems.