Data Scientist specializing in data architecture and ETL workflows at Xsolla. Collaborating with engineering and data teams to optimize data processes for game developers.
Responsibilities
Architecture & Development
Design, build, and optimize data pipelines and ETL workflows in Snowflake using Snowpark, Streams/Tasks, and Snowpipe.
Develop scalable data models, Algorithm supporting user 360 views, churn prediction, and recommendation engine inputs.
Lead integration across data sources: MySQL, BigQuery, Redis, Kafka, GCP Storage, and API Gateway.
Implement CI/CD for data pipelines using Git, dbt, and automated testing.
Define data quality checks and auditing pipelines for ingestion and transformation layers.
Leadership & Collaboration
Mentor and guide junior data engineers on data modeling, performance tuning, and Snowflake best practices.
Partner with Data Science, ML, and Backend teams to productionize machine learning features in Snowflake.
Work closely with Legal, Security, and Infrastructure teams to ensure compliance, privacy, and governance of user data (PII).
Collaborate with the Director of Data Platforms and product stakeholders to translate business requirements into technical specifications.
Performance & Scalability
Tune algorithm performance.
Establish data partitioning, clustering, and materialized views for fast query execution.
Build dashboards and monitors for pipeline health, job success, and data latency metrics (e.g., via Looker, Tableau, or Snowsight).
Governance & Best Practices
Establish and enforce naming conventions, data lineage, and metadata standards across schemas.
Lead code reviews, enforce documentation standards, and manage schema versioning.
Contribute to the company’s evolving data mesh and streaming architecture vision.
Requirements
5+ years of experience in Data Scientist, with **3+ years in Spark framework**.
Strong SQL and Python skills, with proven experience building **ETL/ELT** at scale.
Deep understanding of algorithm** performance tuning**, **query optimization**, and **warehouse orchestration**.
Experience with **data pipeline orchestration** (Airflow, Prefect, dbt, or similar).
Solid understanding of **data modeling** (Kimball, Data Vault, or hybrid).
Proficiency in **Kafka**, **GCP**, or **AWS** for real-time or batch ingestion.
Familiarity with **API-based data integration** and **microservice architectures**.
**Preferred**
Experience lead **machine learning teams** or/and deploying **ML feature pipelines**.
Background in **ad-tech, gaming, or e-commerce** recommendation systems.
Familiarity with **data contracts** and **feature stores** (Feast, Tecton, or custom-built).
Experience managing small data engineering teams and setting technical direction.
Strong ownership and ability to work autonomously in a fast-paced environment.
Excellent cross-functional communication — can translate between engineering and business.
Hands-on problem solver who balances velocity with reliability.
Collaborative mentor who raises the bar for team quality and discipline
Data Scientist involved in GCP - driven data and AI initiatives at Valtech. Contributing to production readiness and building impactful data solutions.
Senior Data Scientist at Thumbtack leading data - driven decision - making through analytics and collaboration with cross - functional partners. Delivering insights and mentoring others for company - wide impact.
Senior Data Scientist at EvenUp analyzing treatment journeys to drive business insights. Collaborating with cross - functional teams to build data solutions and insights for product direction.
Junior Data Scientist developing and maintaining analytical workflows for home and auto insurance segmentation initiatives. Supporting statistical models and data pipelines in a collaborative environment within a regulated insurance industry.
Contracts Data Manager ensuring contract data integrity in Salesforce for Create Music Group. Collaborating with cross - functional teams and managing operational metrics.
Staff Data Scientist at EvenUp employing data science and economics to drive business decisions. Collaborating with leadership and cross - functional teams on product monetization and strategic insights.
Data Scientist/Economist at EvenUp applying econometric techniques to shape data - driven decisions. Collaborating with cross - functional teams to uncover insights and drive growth for a legal tech startup.
Data Scientist driving insights and improvements for Reddit's advertising platform through advanced data analysis and modeling techniques. Collaborating across teams to maximize effectiveness and user experience.
Senior Data Scientist improving Reddit's advertising platform through data - driven insights and collaboration with product managers. Joining a passionate team in leveraging machine learning and statistical modeling.
Staff Data Scientist for Reddit's Ads Data Science team designing models to optimize advertising performance. Collaborating with cross - functional teams to lead strategic initiatives and improve advertiser experience.