DevOps Engineer designing, building, and optimizing cloud infrastructure for machine learning operations at a gaming company. Scaling AI models for production and ensuring system reliability and performance.
Responsibilities
Manage, configure, and automate cloud infrastructure using tools such as Terraform and Ansible.
Implement CI/CD pipelines for ML models and data workflows, focusing on automation, versioning, rollback, and monitoring with tools like Vertex AI, Jenkins, and DataDog.
Build and maintain scalable data and feature pipelines for both real-time and batch processing using BigQuery, BigTable, Dataflow, Composer, Pub/Sub, and Cloud Run.
Set up infrastructure for model monitoring and observability — detecting drift, bias, and performance issues using Vertex AI Model Monitoring and custom dashboards.
Optimize inference performance, improving latency and cost-efficiency of AI workloads.
Ensure overall system reliability, scalability, and performance across the ML/Data platform.
Define and implement infrastructure best practices for deployment, monitoring, logging, and security.
Troubleshoot complex issues affecting ML/Data pipelines and production systems.
Ensure compliance with data governance, security, and regulatory standards, especially for real-money gaming environments.
Requirements
3+ years of experience as a DevOps Engineer, ideally with a focus on ML and Data infrastructure.
Strong hands-on experience with Google Cloud Platform (GCP) — especially BigQuery, Dataflow, Vertex AI, Cloud Run, and Pub/Sub.
Proficiency with Terraform (and bonus points for Ansible).
Solid grasp of containerization (Docker, Kubernetes) and orchestration platforms like GKE.
Experience building and maintaining CI/CD pipelines, preferably with Jenkins.
Strong understanding of monitoring and logging best practices for cloud and data systems.
Scripting experience with Python, Groovy, or Shell.
Familiarity with AI orchestration frameworks (LangGraph or LangChain) is a plus.
Bonus points if you’ve worked in gaming, real-time fraud detection, or AI-driven personalization systems.
Senior Developer / DevOps Specialist joining large - scale digital modernization initiative. Building secure, scalable cloud - native applications within an agile delivery environment.
Senior Deployment Engineer addressing complex technical integrations in AI agent deployments for customer experience. Collaborative role with technical teams and customers to optimize solutions.
We are hiring a CI/CD Engineer with strong Platform Engineering and DevOps expertise to design, build, and optimize scalable and secure CI/CD pipelines and cloud - based platforms in Toronto, ON.
DevOps Lead needed for a 6 - 12 month remote contract in Toronto, ON. Must have 10 - 12 years experience, CI/CD with Azure DevOps, Docker, Kubernetes, and scan integration.
Co - op or Intern, DevOps Engineer joining BDO Digital's AppDev team. Responsibilities include managing Azure cloud environments and building CI/CD pipelines.
Senior DevOps Engineer designing and implementing scalable AWS network architectures at Magnet Forensics. Collaborating with diverse teams for secure, efficient connectivity across services.
Site Reliability Engineer ensuring high availability, scalability, and performance of Emburse’s systems. Collaborating on distributed systems while mentoring junior engineers.
Associate DevOps Engineer supporting the Continuous Integration and Delivery pipeline of Sun Life's Canadian IT API applications. Ideal for Computer Science students graduating December 2026 or later, seeking industry experience.
Reliability Engineering Intern working with experienced engineers on mining operations. Gaining hands - on experience with Caterpillar equipment and engineering challenges.
Senior Reliability Engineer at IKO Industries optimizing asset reliability and equipment performance across manufacturing operations. Applying advanced reliability methodologies and leading multi - site initiatives.