Data Engineer managing ingestion pipelines in cloud-native data ecosystem for analytics, marketing, and reporting use cases. Ensuring data quality and governance throughout the process.
Responsibilities
Design, build, and operate **robust ingestion pipelines** for batch and near-real-time data using AWS-native services
Implement **CDC-based ingestion patterns** for databases, SaaS platforms, and external partners
Standardize ingestion frameworks for files, APIs, event streams, and cross-account data sharing
Define and maintain **raw and staging data models** that preserve source fidelity and lineage
Partner with source system owners to define ingestion SLAs, contracts, schemas, and change management strategies
Ensure ingestion pipelines meet **data quality, observability, and reliability standards**
Implement metadata capture, schema evolution handling, and data validation at ingestion time
Automate infrastructure using **AWS CDK** and integrate CI/CD pipelines via CodeCommit and CodePipeline
Optimize ingestion workflows for scalability, cost efficiency, and fault tolerance
Support Agile delivery and collaborate closely with offshore engineering teams
Requirements
Bachelor’s or Master’s degree in Computer Science, Engineering, or a quantitative field
5+ years of experience in data engineering, with a strong focus on **data ingestion and integration**
Strong understanding of **change data capture (CDC)** concepts and ingestion patterns
Hands-on experience with AWS services such as **Lambda, Step Functions, MWAA, Glue, Redshift**
Experience building ingestion pipelines for **APIs, files, databases, and event-based systems**
Proficiency in **Python** and familiarity with data serialization formats (JSON, Parquet, Avro)
Experience implementing infrastructure as code using **AWS CDK**
Working knowledge of CI/CD, version control, and automated deployments
Strong collaboration skills and comfort working with distributed offshore teams
Detail-oriented, proactive, and ownership-driven mindset
Senior Data Architect delivering enterprise - scale data and analytics solutions at 3Pillar. Leading design and delivery of analytics platforms in mixed prem and cloud environments.
Cloud Data Engineer responsible for modern Data & AI solutions on Microsoft Azure. Collaborating with clients and teams to develop production - ready data platforms and support analytics.
Senior Data Engineer at Solana Foundation collaborating with blockchain engineers on data indexing and pipeline creation. Ensuring efficient data processing and metrics formulation for decentralized applications.
Senior Data Engineer responsible for designing and maintaining event streaming pipelines at Movable Ink. Working with modern technologies to enhance data availability and reliability.
Senior Engineer on Data Platform team designing and building systems for data flow at Movable Ink. Collaborating with engineering, analytics, and infrastructure teams to power data ingestion and processing.
Senior Data Engineer architecting and owning Snowflake layer for Knak’s Data Infrastructure and AI enablement. Collaborating across departments to ensure data accessibility and governance standards.
Data Engineer designing and implementing cloud - native data ecosystem for sports analytics. Building scalable infrastructure to transform raw data into valuable consent assets.
Data Engineer owning infrastructure that turns raw events from mobile users into trustworthy data. Building scalable data architecture and collaborating with cross - functional teams for data management.
Data Architect engaging with companies on transformational data programs to enhance AI and data capabilities. Leading architectural frameworks and mentoring data teams against industry best practices.