Software Engineer, Labelling, Data & Automation

Posted yesterday

About the role

  • Develop tools and pipelines for Waabi's AI-focused self-driving technology, collaborating with teams to create reliable data solutions and support machine learning projects.

Responsibilities

  • Design and implement tools, pipelines, and metrics to accelerate the development of our AI-first autonomy system and generative AI simulator.
  • Own the process, criteria, and tooling for efficiently finding interesting and relevant data across the petabytes of real-world data that Waabi has collected.
  • Build high-reliability systems for extracting the interesting data, labelling it with various vendors, and integrating it back into our system.
  • Work with both internal and third-party stakeholders to define taxonomy, validation rules, and success criteria for our labelling projects.
  • Design and manage the end-to-end deployment of data solutions to deliver high-quality labelled data for various ML teams to use in experiments and model improvement.
  • Deploy open-set / embedding models to our production environment, enabling new search and curation modalities.
  • Champion engineering excellence, ensuring high-quality, well-structured, and tested code.
  • Contribute to project roadmap planning, prioritization, and delivery.

Requirements

  • 4+ years of industry experience.
  • Bachelor's in computer science, engineering, machine learning, or a related technical discipline.
  • Proficient in Python programming and strong software engineering fundamentals, with real-world experience writing high-quality, well-structured, and well-tested code.
  • A willingness and ability to learn new skills, technologies, and software libraries as required.
  • Strong experience with data pipelines for large-scale processing and analysis.
  • Strong communication and organizational skills.
  • Understanding of cloud job orchestration, monitoring, and instrumentation best practices.
  • Open-minded and collaborative team player with the willingness to help others.
  • Passionate about self-driving technologies, solving hard problems, and creating innovative solutions.

Bonus Points

  • Experience with ML pipelines, including dataset curation, labelling, training and evaluation.
  • Previous experience in self-driving technology or related fields.
  • Familiar with linear algebra (projections, transforms) and 3D geometry.
  • Experience with MapReduce frameworks (Apache Hadoop/Spark) or orchestration frameworks (Apache Airflow/Apache Beam/Google Dataflow/AWS Step Functions).
  • Experience with front-end development.
  • Experience working with open-set / embedding models and deploying them in a production setting.
  • Experience working with infrastructure as code (Terraform, CloudFormation, etc.).

Benefits

  • Competitive compensation and equity awards.
  • Health and Wellness benefits encompassing Medical, Dental and Vision coverage (for full-time employees only).
  • Unlimited Vacation.
  • Flexible hours and Work from Home support.
  • Daily drinks, snacks and catered meals (when in office).
  • Regularly scheduled team-building activities and social events, held on-site, off-site, and virtually.
  • As we grow, this list continues to evolve!

Job type

Full Time

Experience level

Mid level, Senior

Salary

$127,000 - $225,000 per year

Degree requirement

Bachelor's Degree

Tech skills

Airflow, Apache, AWS, Cloud, Hadoop, MapReduce, Python, Spark, Terraform

Location requirements

Hybrid, Toronto, Canada
