Data Scientist specializing in data architecture and ETL workflows at Xsolla. Collaborating with engineering and data teams to optimize data processes for game developers.
Responsibilities
Architecture & Development
Design, build, and optimize data pipelines and ETL workflows in Snowflake using Snowpark, Streams/Tasks, and Snowpipe.
Develop scalable data models, Algorithm supporting user 360 views, churn prediction, and recommendation engine inputs.
Lead integration across data sources: MySQL, BigQuery, Redis, Kafka, GCP Storage, and API Gateway.
Implement CI/CD for data pipelines using Git, dbt, and automated testing.
Define data quality checks and auditing pipelines for ingestion and transformation layers.
Leadership & Collaboration
Mentor and guide junior data engineers on data modeling, performance tuning, and Snowflake best practices.
Partner with Data Science, ML, and Backend teams to productionize machine learning features in Snowflake.
Work closely with Legal, Security, and Infrastructure teams to ensure compliance, privacy, and governance of user data (PII).
Collaborate with the Director of Data Platforms and product stakeholders to translate business requirements into technical specifications.
Performance & Scalability
Tune algorithm performance.
Establish data partitioning, clustering, and materialized views for fast query execution.
Build dashboards and monitors for pipeline health, job success, and data latency metrics (e.g., via Looker, Tableau, or Snowsight).
Governance & Best Practices
Establish and enforce naming conventions, data lineage, and metadata standards across schemas.
Lead code reviews, enforce documentation standards, and manage schema versioning.
Contribute to the company’s evolving data mesh and streaming architecture vision.
Requirements
5+ years of experience in Data Scientist, with **3+ years in Spark framework**.
Strong SQL and Python skills, with proven experience building **ETL/ELT** at scale.
Deep understanding of algorithm** performance tuning**, **query optimization**, and **warehouse orchestration**.
Experience with **data pipeline orchestration** (Airflow, Prefect, dbt, or similar).
Solid understanding of **data modeling** (Kimball, Data Vault, or hybrid).
Proficiency in **Kafka**, **GCP**, or **AWS** for real-time or batch ingestion.
Familiarity with **API-based data integration** and **microservice architectures**.
**Preferred**
Experience lead **machine learning teams** or/and deploying **ML feature pipelines**.
Background in **ad-tech, gaming, or e-commerce** recommendation systems.
Familiarity with **data contracts** and **feature stores** (Feast, Tecton, or custom-built).
Experience managing small data engineering teams and setting technical direction.
Strong ownership and ability to work autonomously in a fast-paced environment.
Excellent cross-functional communication — can translate between engineering and business.
Hands-on problem solver who balances velocity with reliability.
Collaborative mentor who raises the bar for team quality and discipline
Data Scientist developing AI agents and advanced analytics solutions for Ciena. Focused on machine learning, generative AI, and delivering business value through intelligent data - driven tools.
Data Scientist driving detection and mitigation of fraud across audio verticals on our platform. Collaborating with data scientists and ML engineers to ensure fair engagement and accuracy for users and creators.
Senior Marketing Data Scientist for Mozilla providing analytical support to marketing team's investment decisions. Measuring marketing performance and uncovering insights to enhance impact.
Senior Data Scientist developing advanced machine learning models and managing data science services. Working in a hybrid environment to address client needs and optimize business functions.
Data Science Specialist at Nasdaq analyzing financial crime data for insights. Collaborating with teams to create impactful reports and presentations for fraud detection solutions.
Data Scientist at Dropbox partnering with product, engineering, and design teams for analytics and business growth. Focusing on revenue growth, product optimization, and launching high - impact initiatives.
Senior Data Science Manager driving data science projects and mentoring teams for AI product development at Dropbox. Collaborating cross - functionally to influence product strategies and analyze impactful data - driven insights.
Data Science Manager at Instacart leading a data science team to optimize consumer app experiences. Focusing on analytics and experimentation across critical shopper surfaces.
Clinical Data Manager overseeing comprehensive data management activities in clinical trials. Ensuring regulatory compliance and data quality while collaborating with clients and teams.