DevOps Specialist creating and overseeing Azure hybrid cloud infrastructures for EVLO's battery energy storage solutions. Collaborating with teams to implement cutting-edge technologies in a dynamic environment.
Responsibilities
Play a key role in designing and evolving Azure hybrid cloud infrastructures within critical industrial and technological environments.
Have a direct impact on national-scale projects by contributing to large-scale solutions in IT and OT contexts.
Deploy and evolve modern CI/CD pipelines for applications, industrial systems (OT), and AI/MLOps projects.
Automate infrastructures and environments using Infrastructure as Code best practices (Terraform, ARM/Bicep, Ansible).
Design and operate containerized and orchestrated environments (Docker, Kubernetes / AKS).
Work closely with multidisciplinary teams (software development, data, AI, automation, cybersecurity) to industrialize and operate innovative solutions.
Contribute to the deployment and operation of MLOps platforms covering training, deployment, monitoring, and reliability of AI models.
Orchestrate and automate data and compute pipelines for advanced industrial analytics projects using tools like Dagster.
Participate in the integration and operation of industrial data using time-series and analytical databases (e.g., InfluxDB).
Ensure stability, traceability, and monitoring of AI models and critical systems in production.
Contribute to the secure convergence of IT, cloud, and industrial (OT) environments.
Participate in continuous improvement of availability, resilience, and observability of platforms and data flows.
Integrate cybersecurity by design, securing cloud infrastructures, CI/CD pipelines, Kubernetes, and MLOps platforms.
Collaborate with cybersecurity teams on risk analyses, audits, and incident response.
Automate development and operations processes using scripts and modern tools.
Participate in testing, commissioning, and continuous improvement phases of environments.
Maintain and evolve historian databases and key platform components.
Document solutions and share best practices with teams.
Contribute to project estimation, planning, and resolution of complex problems.
Requirements
Bachelor’s degree in software engineering, computer science, electrical engineering, automation, or a related field
3 to 6 years of relevant experience in DevOps, cloud infrastructure, or IT/OT operations
Hands-on experience with industrial, critical, or high-availability environments
Languages: French and English (spoken and written)
Good knowledge of version control and continuous integration tools such as Git and GitLab
Understanding of industrial communication protocols (e.g., OPC UA, Modbus, MQTT)
Ability to write and maintain automation scripts (Python, Bash, PowerShell, YAML)
Strong knowledge of industrial programming and systems, including PLCs, SCADA systems, and embedded programming (C, C++, IEC 61131-3 (ST)…)
Experience with monitoring, observability, and log management tools to ensure system performance and reliability (Prometheus, Grafana, Azure Monitor, Telegraf, Elastic Stack [ELK / OpenSearch], Azure Log Analytics, Graylog, and monitoring for GitLab, Kubeflow, and Dagster)
Experience with containerized and orchestrated environments (Docker, Kubernetes / AKS)
Familiarity with AI and MLOps tools and platforms
Ability to orchestrate and automate data and compute pipelines
Experience with Infrastructure as Code to automate environment deployments
Good knowledge of the Microsoft Azure platform (networking, security, compute, storage)
Understanding of IT and industrial (OT) architectures and related cybersecurity challenges
Solid fundamentals in networking and cloud security, including access and secrets management
Knowledge of security best practices and system hardening
Ability to read and understand technical diagrams (electrical or control schematics)
Asset: experience or interest in the energy sector
Benefits
Comprehensive group health insurance
Group RRSP program with employer contribution
Employee Assistance Program (EAP)
Recognition of years of experience for vacation entitlement
Collaborative, people-focused work environment that emphasizes learning
Challenging, impactful projects with tangible results