Principal Software Engineer leading MLOps team within Analytics Platform at Sun Life. Emphasizing AWS cloud services and machine learning operations responsibilities.
Responsibilities
Provide Principal-level technical leadership for the MLOps squad within the Analytics Platform, influencing architecture and standards across multiple teams
Design, build, and evolve production-grade MLOps and ML platform capabilities, including model lifecycle management, CI/CD, evaluation, monitoring, and governance
Spend ~ 60% hands-on in software engineering, building high-quality, testable, maintainable services, frameworks, APIs, and shared libraries that enable ML and GenAI at scale
Spend ~ 40% on platform, cloud, and MLOps enablement, ensuring solutions are secure, scalable, observable, and cost-effective
Act as a technical authority for AWS-based ML platforms, leading design decisions and guiding adoption of new cloud and GenAI services
Establish and evolve engineering and MLOps standards, patterns, and best practices across squads
Partner closely with Architecture, Security, Risk, and Product teams to reduce delivery, operational, and model governance risk
Lead by example through code reviews, design reviews, incident analysis, and operational improvements
Mentor senior and intermediate engineers, raising the overall technical bar without direct people management.
Requirements
A deep software engineering background, with a strong track record of building and operating large-scale, production systems
Extensive hands-on experience with AWS in production, including infrastructure design, deployment, and optimization
Strong experience with Distributed systems and cloud-native architectures
CI/CD pipelines and automation
Observability, reliability, and incident response
Practical experience with MLOps, ModelOps, ML platforms, or LLMOps in enterprise environments
Proficiency in one or more of: Python, Java, Scala, and Infrastructure-as-Code tooling (Terraform / CloudFormation)
Ability to operate at Principal scope, influencing architecture and decisions across multiple teams and domains
Exceptional communication skills, with the ability to explain complex technical concepts to both technical and non-technical stakeholders.
Benefits
Wellness programs that support the three pillars of your health – mental, physical, and financial
Principal Applied AI/ML Engineer designing and delivering high - impact AI systems for Autodesk's Forma Construction Cloud. Collaborating across teams to tackle complex technical challenges.
Senior Machine Learning Engineer architecting ranking systems for Instacart's search and recommendations. Collaborating with teams to optimize personalization, revenue, and user experience.
Audio ML Engineer II developing state - of - the - art audio deepfake detection models for Reality Defender. Tuning and deploying models in real - world client environments with a focus on performance and robustness.
AI/ML Engineer developing and deploying solutions for pharmacy technology at VXForward. Collaborating with cross - functional teams to enhance workflows and operational efficiency.
Technical leader for ML - powered Supply & Fleet Optimization Systems at Lime. Driving scalable, high - impact systems for fleet optimization and forecasting.
Software Engineer responsible for MLOps technology at BMO, focusing on software development lifecycle and system designs. Collaborating with functional teams to translate user requirements into technical specifications.
Machine Learning Engineer at Red Hat focused on optimizing AI models and contributing to open - source AI tooling. Collaborating on model compression algorithms and deploying deep learning research solutions.
Principal ML Engineer at a global networking leader shaping ML strategy and building intelligent systems for network management. Driving innovation in fields like Generative AI and Data Mining.