Senior MLOps Engineer at Deep Genomics, maintaining ML infrastructure for drug discovery. Enjoy collaborating with scientists to ensure reliable and scalable ML systems in an innovative environment.
Responsibilities
Maintain and improve cloud infrastructure (GCP) using Infrastructure-as-Code tools (Terraform).
Manage IAM, RBAC, and permission policies across cloud environments.
Own and evolve CI/CD pipelines (CircleCI, GitHub Actions) and ensure best practices are followed across the engineering and ML teams.
Administer and support workflow orchestration platforms (e.g., Seqera/Nextflow, Argo, Kubeflow).
Operate and configure ML experiment tracking and registry tooling (e.g., W&B, MLflow).
Build and maintain containerized environments (Docker) and manage Kubernetes clusters.
Manage GPU resources – provisioning, scheduling, and debugging hardware and driver issues.
Write and maintain Python tooling, scripts, and integrations that support ML infrastructure.
Help deploy ML models to production environments and monitor their performance.
Requirements
4+ years of experience operating production infrastructure.
Proficiency with cloud platforms (GCP preferred; AWS/Azure acceptable) and Infrastructure-as-Code (Terraform).
Extensive Hands-on experience with Kubernetes and containerization (Docker).
Solid background in CI/CD systems (CircleCI, GitHub Actions, or similar).
Familiarity with Python package and environment management (e.g., pip, conda, pixi).
Strong Python programming skills.
Self-motivated problem solver with excellent communication skills.
Benefits
Highly competitive compensation, including meaningful stock ownership.
Comprehensive benefits - including health, vision, and dental coverage for employees and families, employee and family assistance program.
Flexible work environment - including flexible hours, extended long weekends, holiday shutdown, unlimited personal days.
Maternity and parental leave top-up coverage, as well as new parent paid time off.
Focus on learning and growth for all employees - learning and development budget & lunch and learns.
Facilities located in the heart of Toronto - the epicenter of machine learning and AI research and development, and in Kendall Square, Cambridge, Mass. - a global center of biotechnology and life sciences.
Senior Machine Learning Engineer at DraftKings shaping player experiences through machine learning solutions. Leading initiatives to improve engagement and retention via advanced data strategies.
Machine Learning Developer at Rocket Innovation Studio designing frameworks for automated decision - making. Collaborating with data scientists to develop algorithms and hosting trained models into business processes.
MLOps Developer leading major workstreams for AI - enhanced aerial surveillance platforms. Focused on deploying deep learning models and influencing MLOps strategy.
Senior Staff Machine Learning Engineer at Workiva defining enterprise - level AI architecture and solutions. Leading technical direction and influencing secure AI platform design across multiple teams.
Principal Applied AI/ML Engineer designing and delivering high - impact AI systems for Autodesk's Forma Construction Cloud. Collaborating across teams to tackle complex technical challenges.
Senior Machine Learning Engineer architecting ranking systems for Instacart's search and recommendations. Collaborating with teams to optimize personalization, revenue, and user experience.
Audio ML Engineer II developing state - of - the - art audio deepfake detection models for Reality Defender. Tuning and deploying models in real - world client environments with a focus on performance and robustness.
AI/ML Engineer developing and deploying solutions for pharmacy technology at VXForward. Collaborating with cross - functional teams to enhance workflows and operational efficiency.
Technical leader for ML - powered Supply & Fleet Optimization Systems at Lime. Driving scalable, high - impact systems for fleet optimization and forecasting.