Cloud Full Stack MLOps Engineer developing cloud applications and MLOps at Ontario Brain Institute. Collaborating with stakeholders to enhance workflows and support data models in production.
Responsibilities
Design, develop, enhance, and maintain CfA cloud-based web applications software and supporting services.
Support the productization of CfA applications and services for third-party use, including deployment and operational readiness.
Implement and support MLOps capabilities, including data pipelines, model packaging, deployment, monitoring, and versioning.
Support federated learning (FL) data and model deployment workflows and associated web applications, as applicable.
Provide timely technical support to application users; triage issues, resolve incidents, and coordinate fixes with the development team.
Develop and maintain technical documentation, including architecture, runbooks, and user guidance.
Participate in application security audits, access reviews, and incident-response drills; implement remediation actions as needed.
Work closely with other cloud team members to ensure secure and robust cloud resources are configured and managed with high standards.
Requirements
Bachelor’s degree in computer science, software engineering, data science, or a related field, or an equivalent combination of education and experience.
5+ years of professional experience in software engineering, including delivery of production web applications and services.
Demonstrated experience developing modern web applications (Python; React and/or Vue.js preferred).
Proficiency with front-end user interface development and back-end services, including database integration.
Hands-on experience deploying and operating ML models in production, including versioning and containerization.
Experience developing data ingestion and processing pipelines (ETL/ELT), including custom data loaders.
Experience working in cloud environments with CPU/GPU and distributed/grid computing (Microsoft Azure preferred).
Experience delivering software in an Agile/Scrum environment, with strong collaboration and communication skills.
Demonstrated experience supporting cloud MLOps deployments and operations for model lifecycle management (e.g., CI/CD for ML, model registries, experiment tracking, and monitoring).
Experience applying engineering best practices, including source control, code review, automated testing, and CI/CD (GitHub experience preferred).
Experience with Azure services (e.g., compute, storage, networking, identity) and deploying production workloads to Azure.
Familiarity with security, privacy, and compliance practices for cloud applications (e.g., secure SDLC, threat modeling, vulnerability management).
Experience supporting regulated or research environments requiring auditability and traceability.
Lead AI/ML & MLOps Engineer executing projects from data foundations to model deployment. Collaborating with sales to drive AI/ML engagements for our clients.
Applied ML Engineer working on AI - driven insights at Kaseya. Collaborating with product teams to enhance features with machine learning and data analysis.
Adversarial Machine Learning Engineer conducting adversarial testing and simulations on LLM - driven AI systems for enterprise security. Collaborating with teams to validate and document findings.
MLOps Engineer managing infrastructure for large 2D and 3D media datasets at NBCUniversal. Responsible for automation, reproducibility, and performance of machine learning lifecycles.
Senior ML Engineer leading the strategic direction of machine learning infrastructure for global food delivery platform. Collaborating with Data Science team for seamless model deployment and innovation.
Machine Learning Intern/Co - op at Cohere working on developing and training models for AI applications. Join a team focused on advancing AI technology in an inclusive environment.
Machine Learning Engineer designing and deploying detection ML systems for social engineering defense platform at Doppel. Collaborating to mitigate evolving digital threats using AI.
Senior Software Developer responsible for designing and developing solutions in data engineering and machine learning. Collaborating with teams to deliver scalable software solutions with agile methodologies.
Senior ML Engineer responsible for designing and building ML pipelines for a Trust Scoring platform. Involves productionizing models and implementing MLOps best practices.
Principal Machine Learning Engineer designing the core ML systems for AI agents at Workday. Collaborating in cross - functional teams to integrate ML solutions into the platform.