Data Engineer focused on developing and sustaining Machine Learning solutions for ICBC's data needs. Collaborating with Data Scientists and Statistical Analysts on data-driven projects.
Responsibilities
Understanding Data Science, Machine Learning, Performance & Evaluative Analytics model requirements, working closely with Data Scientists & Statistical Analysts, supporting them with their data and Machine Learning operational needs.
Operationalizing data science models into Machine Learning pipelines, optimizing the model code, conducting model training and re-training, deploying the models and sustaining them in Production.
Responding to data requests and performing data discovery and data profiling to support various data science, evaluative and machine learning solutions and projects, reviewing and clarifying data requirements, ensuring the data artifacts are acceptable within policy and privacy protocols.
Providing subject matter & data expertise to the Strategic Analytics, Actuarial and Regulatory Affairs departments as well as ICBC divisional clients on data sources, reporting workflows, business processes, and the appropriate tools with which to analyze their data.
Participating with corporate data user teams, developing data science model validation and test plans, performing user acceptance testing, and supporting data scientists and evaluative & performance metrics analysts in sustaining their end products.
Conducting analysis for moderate to complex strategic solutions and POCs, defining data fields and determining data availability, developing information layout, format and interactivity. Presenting findings and providing clarification.
Requirements
Proven work-based experience coding in Python and the PySpark data framework is required.
Experience working with ML libraries & frameworks including Scikit-Learn for traditional ML, TensorFlow and PyTorch for deep learning.
Proficiency in the data science stack, such as NumPy, PySpark and Pandas, for data manipulation.
Technical knowledge in cleaning, transforming and preparing un-curated data, including handling of missing values and feature scaling.
Exposure to Machine Learning Operations (MLOps) supporting model development, including skills with Docker for containerization, API development and cloud platforms.
Knowledge of and experience with Machine Learning algorithms and techniques.
Experience or exposure to working with pre-trained models such as Large Language Models (LLMs), using Retrieval-Augmented Generation (RAG) and working with Hugging Face pre-trained models.
Experience with processing structured and unstructured data.
Intermediate to advanced experience writing SQL queries and working with NoSQL databases.
Knowledge of experiment tracking and management using tools like MLflow and Data Version Control (DVC) to manage model versions, parameters and results.
Pipeline orchestration using Apache Airflow to automate training, testing and deployment workflows.
Setting up automated pipelines for continuous integration and continuous deployment (CI/CD) using GitLab.
Excellent interpersonal, verbal and written communication skills to work with customers.
Strong understanding of data quality management processes, data analysis and data profiling.
Ability to apply critical thinking skills to troubleshoot and perform root cause analysis on technical problems and Machine Learning model deployments.
Understanding of Agile Methodologies.
Experience with reporting and visualization tools such as Tableau or Jupyter would be an asset.
Google Cloud Data Engineer implementing data ingestion and analytics frameworks at Fueled. Specializing in Google Cloud Platform and modern data modeling.
Consulting Senior Data Architect specializing in Microsoft Fabric solutions for digital products. Engage in hands-on delivery, architecture, and governance for data engineering in a remote capacity.
Data Engineer at Motive delivering data infrastructure for the AI era. Collaborating with stakeholders, building models, and implementing innovative tooling.
Data Architect designing and governing data foundations for analytics and AI applications at Clio. Collaborating cross-functionally to develop high-quality data models and standards.
IAM/Data Engineer role in Toronto (Hybrid). Requires 4+ years in ETL, data pipelines, cloud platforms, and skills in Windows IAM, Ansible, Terraform, SQL, Python/Java, Spark/Kafka.
Data Migration Specialist managing client data migrations to gaiia's platform. Collaborating with teams to ensure accurate and timely data transitions.
Senior Data Architect/Strategist at Robots & Pencils blending advanced data knowledge with problem solving to drive intelligent products and smarter business decisions.
Principal Data Architect at PointClickCare ensuring coherent and scalable data architecture. Driving unified data direction while collaborating with Engineering Architecture team for AI enablement.
Senior Data Engineer developing the data management layer for a financial brokerage platform with scalability for larger customers. Collaborating with teams in a fully remote, diverse environment.
Technical Lead overseeing data engineers, analysts, and architects to implement data solutions. Leading modernization of data infrastructures for diverse business objectives.