Geospatial Data Engineer at GHGSat integrating geospatial data and optimizing AI/ML pipelines. Supporting climate impact mission through data systems and analytics in a hybrid role.
Responsibilities
Design, implement, and optimise scalable geospatial data and AI/ML pipelines.
Integrate new data sources, including satellite and terrestrial, both public and proprietary.
Re-engineer and validate existing pipelines, ensuring high-quality and performance standards.
Blend and process various geospatial data sources to create artifacts for exploratory analysis and insights.
Build scripts and automations for geospatial data processing, using tools like QGIS, GeoPandas, Rasterio, Xarray and rioxarray.
Conduct geospatial analysis and contribute to mapping and visualization.
Data testing and quality control of geospatial datasets.
Contribute to the automation of testing, deployment, and monitoring of data pipelines and AI/ML models using DBT, Airflow, Docker, and AWS services.
Work collaboratively with the Analytics team, Subject Matter Experts, and cross-teams to prototype new data solutions.
Explore applications of AI/ML for geospatial data and integrate emerging technologies where possible.
Present findings and recommendations to both technical and non-technical stakeholders, fostering a data-driven culture.
Communicate complex geospatial data insights in a clear, accessible manner to support informed outcomes.
Requirements
2-4 years of experience in data engineering, with specific expertise in geospatial data processing and analysis.
Proficiency in SQL and geospatial databases e.g., PostgresSQL/PostGIS
Experience with Airflow, DBT, and dashboarding tools such as Grafana
Proficiency in Python and experience with libraries like Pandas, pytest, NumPy, sqlalchemy.
Comfortable with cloud infrastructure (AWS preferred), containerization tools (Docker), and version control (Git).
Experience with geospatial packages such as GeoPandas, Rasterio, and QGIS is beneficial.
Knowledge of AI/ML concepts applied to geospatial data is a plus.
Knowledge of ClearML is beneficial.
Benefits
Competitive salary + stock options for all full-time employees
Cloud Data Engineer responsible for modern Data & AI solutions on Microsoft Azure. Collaborating with clients and teams to develop production - ready data platforms and support analytics.
Senior Data Engineer at Solana Foundation collaborating with blockchain engineers on data indexing and pipeline creation. Ensuring efficient data processing and metrics formulation for decentralized applications.
Senior Engineer on Data Platform team designing and building systems for data flow at Movable Ink. Collaborating with engineering, analytics, and infrastructure teams to power data ingestion and processing.
Senior Data Engineer responsible for designing and maintaining event streaming pipelines at Movable Ink. Working with modern technologies to enhance data availability and reliability.
Senior Data Engineer architecting and owning Snowflake layer for Knak’s Data Infrastructure and AI enablement. Collaborating across departments to ensure data accessibility and governance standards.
Data Engineer designing and implementing cloud - native data ecosystem for sports analytics. Building scalable infrastructure to transform raw data into valuable consent assets.
Data Engineer owning infrastructure that turns raw events from mobile users into trustworthy data. Building scalable data architecture and collaborating with cross - functional teams for data management.
Data Architect engaging with companies on transformational data programs to enhance AI and data capabilities. Leading architectural frameworks and mentoring data teams against industry best practices.
ML Data Engineer responsible for designing and developing AI platforms at Newfold Digital. Collaborating across teams to integrate and optimize data sources for AI - driven applications.