Data Engineer building scalable ETL/ELT data pipelines using GCP and BigQuery. Collaborating across teams to ensure robust data infrastructure and analytics ready datasets.
Responsibilities
Design, build, and maintain scalable and reliable batch and real-time ETL/ELT data pipelines using cloud services such as GCP Dataflow, Cloud Functions, Pub/Sub, and Cloud Composer.
Architect and implement robust data infrastructure capable of handling high-volume data ingestion and processing.
Develop and manage our central data warehouse in Google BigQuery.
Design and implement data models, schemas, and table structures optimized for performance, scalability, and long-term maintainability.
Write clean, efficient, and maintainable SQL and Python code to transform raw data into curated, analysis-ready datasets.
Build reliable transformation workflows that support analytics, reporting, and data science initiatives.
Monitor, troubleshoot, and optimize data infrastructure to ensure high performance, reliability, and cost efficiency.
Implement BigQuery best practices, including partitioning, clustering, query optimization, and materialized views.
Build and maintain curated data models that serve as the “source of truth” for business intelligence and reporting.
Ensure data is optimized and readily accessible for BI tools such as Looker and other analytics platforms.
Implement automated data quality checks, validation rules, and monitoring frameworks to ensure the integrity and reliability of data pipelines and warehouse systems.
Establish processes for data governance, observability, and lineage tracking.
Work closely with software engineers, data analysts, and data scientists to understand their data requirements and provide the necessary infrastructure and data products.
Lead and support client and stakeholder communication, working with enterprise clients to translate business needs into scalable data solutions.
Partner with product teams and leadership to ensure that technical data solutions align with business strategy and client expectations.
Take ownership of data platforms and architecture decisions, helping shape the future direction of our analytics and data infrastructure.
Identify opportunities to improve data reliability, automate workflows, and generate new insights through data.
Contribute to a collaborative, high-performing engineering culture with strong communication and teamwork.
Requirements
5+ years of hands-on experience in data engineering, data integration, or data platform development.
Degree in Computer Science, Engineering, Mathematics, or related STEM discipline.
Strong programming and query skills in SQL and Python.
Experience working with distributed version control systems such as Git in an Agile/Scrum environment.
Experience designing and orchestrating ETL pipelines, particularly with Databricks.
Experience working within cloud environments (GCP, AWS, or Azure).
Experience with database systems such as MongoDB and Elasticsearch.
Strong understanding of data warehousing and dimensional modeling methodologies.
Hands-on experience with Airflow and Hadoop.
Experience using Docker for containerized workflows and reproducible environments.
Ability to identify opportunities to improve data quality, reliability, and automation.
Strong business awareness and communication skills, with the ability to collaborate with both technical teams and business stakeholders.
Senior Data Architect delivering enterprise - scale data and analytics solutions at 3Pillar. Leading design and delivery of analytics platforms in mixed prem and cloud environments.
Cloud Data Engineer responsible for modern Data & AI solutions on Microsoft Azure. Collaborating with clients and teams to develop production - ready data platforms and support analytics.
Senior Data Engineer at Solana Foundation collaborating with blockchain engineers on data indexing and pipeline creation. Ensuring efficient data processing and metrics formulation for decentralized applications.
Senior Data Engineer responsible for designing and maintaining event streaming pipelines at Movable Ink. Working with modern technologies to enhance data availability and reliability.
Senior Engineer on Data Platform team designing and building systems for data flow at Movable Ink. Collaborating with engineering, analytics, and infrastructure teams to power data ingestion and processing.
Senior Data Engineer architecting and owning Snowflake layer for Knak’s Data Infrastructure and AI enablement. Collaborating across departments to ensure data accessibility and governance standards.
Data Engineer designing and implementing cloud - native data ecosystem for sports analytics. Building scalable infrastructure to transform raw data into valuable consent assets.
Data Engineer owning infrastructure that turns raw events from mobile users into trustworthy data. Building scalable data architecture and collaborating with cross - functional teams for data management.
Data Architect engaging with companies on transformational data programs to enhance AI and data capabilities. Leading architectural frameworks and mentoring data teams against industry best practices.