Data Scientist responsible for ensuring reliability of ML models at Jobber. Collaborating closely with product teams and contributing to AI systems.
Responsibilities
Design, implement, and maintain ML model validation frameworks, including custom evaluation metrics, loss functions, and statistical tests, to ensure model quality before and after deployment.
Build and own regression test suites for ML and LLM models, catching performance regressions and unexpected behaviour across model updates and data drift scenarios.
Develop and execute MCP evaluations, systematically assessing model capabilities, edge cases, and failure modes across relevant business contexts.
Monitor models in production using statistical process control, drift detection, and alerting pipelines; proactively surface issues before they impact customers.
Collaborate with senior data scientists to contribute to the design and refinement of ML model architectures, offering feedback grounded in validation results.
Document evaluation methodologies, test results, and monitoring runbooks clearly enough that stakeholders across technical and business teams can understand model health.
Stay current with advancements in LLM evaluation techniques, AI safety, and model observability, and apply emerging best practices to our workflows.
Communicate findings clearly and concisely to stakeholders, translating model performance signals into actionable recommendations.
Requirements
Industry experience in data science, machine learning, or a closely related quantitative field.
Proficiency in Python and the core DS stack: Pandas, Scikit-Learn, XGBoost, and at least one deep learning framework (PyTorch or TensorFlow).
Solid grasp of statistical concepts underpinning model evaluation: bias–variance tradeoff, calibration, confidence intervals, A/B testing, and data drift.
Experience with LLM evaluation frameworks (e.g. RAGAS, Eleuther AI Eval Harness, or custom LLM eval pipelines).
Hands-on experience designing custom evaluation metrics; you've gone beyond off-the-shelf metrics when the problem demanded it.
Strong understanding of ML and LLM model architectures — you can reason about how a model is built and why it behaves the way it does.
High proficiency in SQL for data exploration, feature validation, and debugging model inputs.
Exceptional attention to detail — you treat model validation with the same rigour as software QA.
Strong written and verbal communication skills; comfortable presenting findings to both technical peers and non-technical stakeholders.
Benefits
equity rewards
annual stipends for health and wellness
retirement savings matching
extended health package with fully paid premiums for body and mind
access to a dedicated talent development program including career coaching and opportunities for career development
Senior Data Scientist leading high priority data initiatives to support Sales at Wealthsimple, Canada's leading financial innovator. Collaborate with diverse teams to model sales data, improve processes, and develop ML solutions.
Statistical Methodology Data Scientist enhancing research methodologies at Roche. Supporting decision - making through expert guidance and collaboration with various teams in clinical research.
Data Scientist developing custom AI solutions by extracting insights from data at BDO. Collaborating cross - functionally to leverage advanced machine learning and data processing techniques.
Lead Data Scientist at McKesson focusing on Generative AI and ML solutions. Address complex healthcare challenges and drive business transformation via innovative technology.
Data Scientist II responsible for developing innovative AI solutions and supporting data - driven strategies at Intact. Collaborating across departments to enhance data insights and recommendations.
Senior Clinical Data Manager managing clinical trial data management processes with Precision for Medicine. Lead data management, ensure timelines and quality throughout clinical trials, requiring 8+ years of experience in the role.
Senior Clinical Data Manager managing clinical trial data from start to post lock. Overseeing quality control, timelines, and data entry processes for assigned projects.
Senior Data Scientist II optimizing delivery experience at Instacart. Identifying strategic opportunities and leveraging data to enhance customer satisfaction.
Freelance Data Scientist supporting Klick Consulting by converting complex data into actionable insights. Collaborating with teams to drive meaningful impact in life sciences with creativity and innovation.