Platform Engineer developing secure and reliable cloud technology solutions for a global retail network. Collaborating with various engineering teams to optimize supply chains and enhance deployment automation.
Responsibilities
Design, implement, and maintain secure, highly available, and cost-efficient container orchestration platforms, including Kubernetes and ECS.
Develop and optimize Continuous Integration and Continuous Delivery (CI/CD) pipelines to streamline and enhance deployment processes, enabling high efficiency for Product Engineering teams.
Build and refine tools and patterns for monitoring and observability to strengthen failure detection, response, and recovery capabilities.
Collaborate with Technology Engineering, Development, and Product Management teams to scale, improve, and support production systems and services.
Partner with service teams to provide comprehensive documentation, knowledge sharing, architecture planning, capacity assessments, and recommendations for future optimizations.
Engineer solutions aimed at failure prevention and minimizing the likelihood of system issues.
Write clean, maintainable code, develop thorough test plans, and assess code quality while providing constructive feedback during code reviews.
Design and operationalize AI agent infrastructure: Deploy and manage orchestration frameworks (e.g., LangChain, AutoGen) on enterprise platform infrastructure to automate complex, multi-step workflows—including self-healing pipelines, infrastructure drift detection, and automated on-call triage running on Kubernetes.
Integrate AI coding assistants into platform engineering: Evaluate, configure, and govern AI-powered development tools (e.g., GitHub Copilot, Claude) to ensure secure, seamless integration within CI/CD pipelines, code review processes, and Infrastructure-as-Code (IaC) toolchains like Terraform and CloudFormation.
Requirements
Minimum of 2 years experience in a Platform Engineer or similar role
Proficiency in Python and/or Golang with a strong software engineering mindset
Experience managing and administering Linux systems
Experience with Docker and Kubernetes
Experience with AWS (EC2, RDS, Dynamo DB, Route53, Elastic Load Balancers, AMIs, IAM Roles, Ops Works, and Cloud Formation)
Knowledge of or experience building CI/CD pipelines
Experience with infrastructure as code concepts such as immutable and scalable infrastructure
Solid understanding of networking systems as well as identity and authorization mechanisms
Experience with using AI coding tools (GitHub, Copilot, Claude) to help Platform Engineering workflows
Understanding of Agentic AI and MCP Servers
Benefits
Work in a collaborative environment with strong ambitions and goals
Senior Infrastructure & Platform Engineering Lead needed for hybrid role in GTA. Must have Azure, OpenShift/Kubernetes, Terraform, Ansible, CI/CD, and Linux experience.
Director of Platform Engineering at TELUS Digital, leading innovative solutions in AI and cloud transformations. Focus on building talent, defining strategies, and delivering client impact.
Software Update Platform Developer in Ford's Electric Vehicles team focusing on embedded systems. Collaborating on the next generation Phoenix and ECG modules to enhance electric vehicle experience.
Platform Engineering Co - Op Student analyzing platform infrastructure and operational metrics at MealSuite. Collaborating with teams to improve system performance and manage change requests.
Hiring Platform Developer (C++) in Toronto, ON. Contract role requiring 6 - 8 years experience in system - level programming and high - performance applications.
Join Wealthsimple as a Senior Software Developer to build systems that enhance developer productivity. Work remotely to design, build, and improve production - grade distributed systems.
Join Clio as a ML Platform Engineer, developing AI solutions for legal technology. Collaborate with cross - functional teams on machine learning projects and enhance operational efficiency.
Site Reliability Engineer ensuring Emburse’s systems are highly available, scalable, and performant. Collaborating across teams to drive automation and operational excellence in cloud infrastructure.