Data Engineer managing ingestion pipelines in a cloud-native data ecosystem for analytics, marketing, and reporting use cases, ensuring data quality and governance throughout the process.
Responsibilities
Design, build, and operate **robust ingestion pipelines** for batch and near-real-time data using AWS-native services
Implement **CDC-based ingestion patterns** for databases, SaaS platforms, and external partners
Standardize ingestion frameworks for files, APIs, event streams, and cross-account data sharing
Define and maintain **raw and staging data models** that preserve source fidelity and lineage
Partner with source system owners to define ingestion SLAs, contracts, schemas, and change management strategies
Ensure ingestion pipelines meet **data quality, observability, and reliability standards**
Implement metadata capture, schema evolution handling, and data validation at ingestion time
Automate infrastructure using **AWS CDK** and integrate CI/CD pipelines via CodeCommit and CodePipeline
Optimize ingestion workflows for scalability, cost efficiency, and fault tolerance
Support Agile delivery and collaborate closely with offshore engineering teams
Requirements
Bachelor’s or Master’s degree in Computer Science, Engineering, or a quantitative field
5+ years of experience in data engineering, with a strong focus on **data ingestion and integration**
Strong understanding of **change data capture (CDC)** concepts and ingestion patterns
Hands-on experience with AWS services such as **Lambda, Step Functions, MWAA, Glue, Redshift**
Experience building ingestion pipelines for **APIs, files, databases, and event-based systems**
Proficiency in **Python** and familiarity with data serialization formats (JSON, Parquet, Avro)
Experience implementing infrastructure as code using **AWS CDK**
Working knowledge of CI/CD, version control, and automated deployments
Strong collaboration skills and comfort working with distributed offshore teams
Detail-oriented, proactive, and ownership-driven mindset
Salesforce Data Architect designing and optimizing enterprise-grade data architectures across Salesforce platforms. Collaborating with team members to ensure data quality and readiness for analytics.
Senior Data Engineer with a strong background in Google Cloud services at Valtech. Leading data engineering projects and developing highly available data pipelines.
Sr. Databricks Spark Developer role designing and optimizing data pipelines for banking. Requires Databricks/Spark experience in financial services with strong communication skills.
Data Integration Developer for market risk systems. Responsible for ETL/ELT development, SQL database programming, and supporting risk management systems in a hybrid Mississauga contract role.
Azure & Databricks Data Engineer role designing and building data pipelines using Microsoft tech stack. 11-month contract, hybrid work in Oshawa, $90-95/hr.
Data Engineering Developer responsible for designing and implementing data flows using cloud technologies like AWS and Databricks. Collaborating within a strong data science team to optimize data for machine learning.
Sr. Manager leading data engineering team to optimize data infrastructure for insurance. Driving innovative data solutions and managing cross-functional collaborations within a remote setup.