Resume Score

Check how well your resume matches this job before you apply.

Sign in to check score

About the role

  • Senior MLOps Engineer at Deep Genomics, maintaining ML infrastructure for drug discovery. Enjoy collaborating with scientists to ensure reliable and scalable ML systems in an innovative environment.

Responsibilities

  • Maintain and improve cloud infrastructure (GCP) using Infrastructure-as-Code tools (Terraform).
  • Manage IAM, RBAC, and permission policies across cloud environments.
  • Own and evolve CI/CD pipelines (CircleCI, GitHub Actions) and ensure best practices are followed across the engineering and ML teams.
  • Administer and support workflow orchestration platforms (e.g., Seqera/Nextflow, Argo, Kubeflow).
  • Operate and configure ML experiment tracking and registry tooling (e.g., W&B, MLflow).
  • Build and maintain containerized environments (Docker) and manage Kubernetes clusters.
  • Manage GPU resources – provisioning, scheduling, and debugging hardware and driver issues.
  • Write and maintain Python tooling, scripts, and integrations that support ML infrastructure.
  • Help deploy ML models to production environments and monitor their performance.

Requirements

  • 4+ years of experience operating production infrastructure.
  • Proficiency with cloud platforms (GCP preferred; AWS/Azure acceptable) and Infrastructure-as-Code (Terraform).
  • Extensive Hands-on experience with Kubernetes and containerization (Docker).
  • Solid background in CI/CD systems (CircleCI, GitHub Actions, or similar).
  • Experience managing GPU compute (provisioning, debugging, driver management).
  • Familiarity with Python package and environment management (e.g., pip, conda, pixi).
  • Strong Python programming skills.
  • Self-motivated problem solver with excellent communication skills.

Benefits

  • Highly competitive compensation, including meaningful stock ownership.
  • Comprehensive benefits - including health, vision, and dental coverage for employees and families, employee and family assistance program.
  • Flexible work environment - including flexible hours, extended long weekends, holiday shutdown, unlimited personal days.
  • Maternity and parental leave top-up coverage, as well as new parent paid time off.
  • Focus on learning and growth for all employees - learning and development budget & lunch and learns.
  • Facilities located in the heart of Toronto - the epicenter of machine learning and AI research and development, and in Kendall Square, Cambridge, Mass. - a global center of biotechnology and life sciences.

Job type

Full Time

Experience level

Senior

Salary

CA$175,000 - CA$200,000 per year

Degree requirement

Bachelor's Degree

Tech skills

AWSAzureCloudDockerGoogle Cloud PlatformKubernetesPythonTerraform

Location requirements

HybridTorontoCanada

Report this job

Found something wrong with the page? Please let us know by submitting a report below.