Resume Score

Check how well your resume matches this job before you apply.

Sign in to check score

About the role

  • Data Scientist responsible for ensuring reliability of ML models at Jobber. Collaborating closely with product teams and contributing to AI systems.

Responsibilities

  • Design, implement, and maintain ML model validation frameworks, including custom evaluation metrics, loss functions, and statistical tests, to ensure model quality before and after deployment.
  • Build and own regression test suites for ML and LLM models, catching performance regressions and unexpected behaviour across model updates and data drift scenarios.
  • Develop and execute MCP evaluations, systematically assessing model capabilities, edge cases, and failure modes across relevant business contexts.
  • Monitor models in production using statistical process control, drift detection, and alerting pipelines; proactively surface issues before they impact customers.
  • Collaborate with senior data scientists to contribute to the design and refinement of ML model architectures, offering feedback grounded in validation results.
  • Document evaluation methodologies, test results, and monitoring runbooks clearly enough that stakeholders across technical and business teams can understand model health.
  • Stay current with advancements in LLM evaluation techniques, AI safety, and model observability, and apply emerging best practices to our workflows.
  • Communicate findings clearly and concisely to stakeholders, translating model performance signals into actionable recommendations.

Requirements

  • Industry experience in data science, machine learning, or a closely related quantitative field.
  • Proficiency in Python and the core DS stack: Pandas, Scikit-Learn, XGBoost, and at least one deep learning framework (PyTorch or TensorFlow).
  • Solid grasp of statistical concepts underpinning model evaluation: bias–variance tradeoff, calibration, confidence intervals, A/B testing, and data drift.
  • Experience with LLM evaluation frameworks (e.g. RAGAS, Eleuther AI Eval Harness, or custom LLM eval pipelines).
  • Hands-on experience designing custom evaluation metrics; you've gone beyond off-the-shelf metrics when the problem demanded it.
  • Strong understanding of ML and LLM model architectures — you can reason about how a model is built and why it behaves the way it does.
  • High proficiency in SQL for data exploration, feature validation, and debugging model inputs.
  • Exceptional attention to detail — you treat model validation with the same rigour as software QA.
  • Strong written and verbal communication skills; comfortable presenting findings to both technical peers and non-technical stakeholders.

Benefits

  • equity rewards
  • annual stipends for health and wellness
  • retirement savings matching
  • extended health package with fully paid premiums for body and mind
  • access to a dedicated talent development program including career coaching and opportunities for career development

Job type

Full Time

Experience level

Mid levelSenior

Salary

CA$125,800 - CA$170,100 per year

Degree requirement

Bachelor's Degree

Tech skills

PandasPythonPyTorchScikit-LearnSQLTensorflow

Location requirements

HybridCanada

Report this job

Found something wrong with the page? Please let us know by submitting a report below.