Geospatial Data Engineer at GHGSat integrating geospatial data and optimizing AI/ML pipelines. Supporting climate impact mission through data systems and analytics in a hybrid role.
Responsibilities
Design, implement, and optimise scalable geospatial data and AI/ML pipelines.
Integrate new data sources, including satellite and terrestrial, both public and proprietary.
Re-engineer and validate existing pipelines, ensuring high-quality and performance standards.
Blend and process various geospatial data sources to create artifacts for exploratory analysis and insights.
Build scripts and automations for geospatial data processing, using tools like QGIS, GeoPandas, Rasterio, Xarray and rioxarray.
Conduct geospatial analysis and contribute to mapping and visualization.
Data testing and quality control of geospatial datasets.
Contribute to the automation of testing, deployment, and monitoring of data pipelines and AI/ML models using DBT, Airflow, Docker, and AWS services.
Work collaboratively with the Analytics team, Subject Matter Experts, and cross-teams to prototype new data solutions.
Explore applications of AI/ML for geospatial data and integrate emerging technologies where possible.
Present findings and recommendations to both technical and non-technical stakeholders, fostering a data-driven culture.
Communicate complex geospatial data insights in a clear, accessible manner to support informed outcomes.
Requirements
2-4 years of experience in data engineering, with specific expertise in geospatial data processing and analysis.
Proficiency in SQL and geospatial databases e.g., PostgresSQL/PostGIS
Experience with Airflow, DBT, and dashboarding tools such as Grafana
Proficiency in Python and experience with libraries like Pandas, pytest, NumPy, sqlalchemy.
Comfortable with cloud infrastructure (AWS preferred), containerization tools (Docker), and version control (Git).
Experience with geospatial packages such as GeoPandas, Rasterio, and QGIS is beneficial.
Knowledge of AI/ML concepts applied to geospatial data is a plus.
Knowledge of ClearML is beneficial.
Benefits
Competitive salary + stock options for all full-time employees
Salesforce Data Architect designing and optimizing enterprise - grade data architectures across Salesforce platforms. Collaborating with team members to ensure data quality and readiness for analytics.
Senior Data Engineer with a strong background in Google Cloud services at Valtech. Leading data engineering projects and developing highly available data pipelines.
Sr. Databricks Spark Developer role designing and optimizing data pipelines for banking. Requires Databricks/Spark experience in financial services with strong communication skills.
Data Integration Developer for market risk systems. Responsible for ETL/ELT development, SQL database programming, and supporting risk management systems in a hybrid Mississauga contract role.
Azure & Databricks Data Engineer role designing and building data pipelines using Microsoft tech stack. 11 - month contract, hybrid work in Oshawa, $90 - 95/hr.
Data Engineering Developer responsible for designing and implementing data flows using cloud technologies like AWS and Databricks. Collaborating within a strong data science team to optimize data for machine learning.
Sr. Manager leading data engineering team to optimize data infrastructure for insurance. Driving innovative data solutions and managing cross - functional collaborations within a remote setup.