Unlock the secrets to a successful job application and accelerate your journey to your next opportunity.
Senior Big Data Engineer (Core Data)
About the company
Playrix is one of the most successful mobile game development companies in the world. In terms of mobile application revenue, we rank first in Europe and are in the top 3 worldwide. Our games have been installed 2.3 billion times (that's a third of the world's population!) and more than 30 million people around the world play them every day.
Team, about the role
Our data lake contains over 2.5 petabytes of marketing metrics, game events, and operational parameters. We do everything possible to ensure there's no doubt about the completeness, relevance, and reliability of the data we provide. We pay special attention to processing speed and data quality. This allows us to make the right decisions when developing our games.
We want to boost the gaming and marketing event attribution business and are looking for a Senior Data Engineer to join our Data Core team, which is responsible for organizing the storage, transformation and access to gaming and marketing analytics data.
Tasks
Automating external source data extraction processes
Automating quality control and verification of supplied data, configuring monitoring and regulations
Transforming and uploading data for further analysis within game analytics and marketing in the most client-friendly form
Developing and maintaining a continuous data delivery pipeline
Developing and maintaining the integration services for interaction with partners
Our stack
Python is our main language, and data access is in SQL. Our Data Lake is built on the S3 and Delta Lake format in Databricks. For our DWH, we use Redshift/PostgreSQL. We work in the AWS cloud infrastructure.
We use solutions from mainstream vendors, including MonteCarlo and DBT. We use a serverless approach when working with resources, as well as horizontal scaling and predictive models. We pay close attention to both code and architecture refactoring. For CI/CD, we use TeamCity.
Requirements
5+ years of general experience in Data Engineering
3+ years of development experience in Python: OOP, skills in designing frameworks and libraries, ability to read and analyze code, experience in profiling and performance/scalability optimization
Skills in working with popular DE/ML frameworks: PySpark, Airflow, pandas, SQLAlchemy, MLflow
Skills in writing and optimizing SQL queries, ability to work with query plans, and experience working with repositories of different architectures: MPP, columnar, relation, Hadoop, in-memory
Experience with storage of different architectures like mpp, columnar, relation, hadoop, and in-memory
Fluency in Russian
Our Perks
Flexibility at work
For most positions we offer a flexible schedule, and employees can work from anywhere in the world (except for the Russian Federation and the Republic of Belarus). You can do your work however and wherever you like — we only evaluate your results.
Caring for health and well-being
We provide voluntary health insurance for employees and their children, and reimburse online sessions with a psychologist.
We offer fully paid vacation days and sick leave. In certain circumstances, you can request additional days off.
We support leading a healthy lifestyle and reimburse the cost of gym memberships and fitness app subscriptions.
Professional development and education
We pay for participation in specialized conferences and courses and hold our own internal conferences.
We offer discounts on English language courses and courses for the languages of the countries in which the company operates.
Events and merch
We regularly hold offline events like team meetings, big corporate parties, and informal office parties.
We also regularly organize fun competitions, quizzes, hackathons, and online fitness challenges.
We have a merch store with an internal currency you can use to buy unique gifts. We give out this currency as a gift for significant dates, and you can earn more by participating in events and company activities.