Big Data Engineer, Data Lake / Feature Store

5 Months ago • 2 Years + • Monetization

Job Summary

Job Description

The batch processing team at ByteDance is responsible for offline data processing and distributed training. You will be developing and optimizing the in-house Feature Store functionality based on Iceberg, participating in optimizing the integration of Iceberg with various upper-level computing engines, and being involved in platform-related infrastructure development.
Must have:
  • Bachelor's Degree or above in Computer Science or related fields
  • 2+ years of relevant development experience
  • Strong programming ability in Java, Python, C++
  • Experience with large-scale distributed systems
  • In-depth knowledge of data lake formats like Delta, Hudi, or Iceberg
Good to have:
  • In-depth research or practical experience in Hadoop, Spark, Flink, Presto
  • Experience with open-source big data computing frameworks

Job Details

Responsibilities
About ByteDance
Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.

Why Join Us
Creation is the core of ByteDance's purpose. Our products are built to help imaginations thrive. This is doubly true of the teams that make our innovations possible. Together, we inspire creativity and enrich life - a mission we work toward every day. To us, every challenge, no matter how ambiguous, is an opportunity to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always. At ByteDance, we create together and grow together. That's how we drive impact - for ourselves, our company, and the users we serve. Join us.

About The Team
The batch processing team is responsible for the company's offline data processing and distributed training, supporting business scenarios such as offline ETL and machine learning within the company. The components involved include the offline computing engine Spark, the in-house distributed training framework Primus, feature storage solutions like Iceberg and Hudi, as well as Ray, a next-generation distributed application framework. To handle massive-scale scenarios, the team has carried out extensive functional and performance optimizations in Spark, Primus, and Feature Store, and supports the adoption of Ray in relevant company scenarios.

What you will be doing:
  • Develop and optimize the performance of the in-house Feature Store functionality based on Iceberg
  • Participate in optimizing the integration of Iceberg with various upper-level computing engines
  • Be involved in platform-related infrastructure development
Qualifications
Minimum Qualifications
  • Bachelor's Degree or above in Computer Science or a related field, with 2+ years of relevant development experience, strong programming ability, proficiency in Java, Python, C++, and the ability to develop and optimize large-scale distributed systems
  • In-depth research and relevant experience in one or more data lake formats such as Delta, Hudi, or Iceberg

Preferred Qualifications
  • In-depth research or practical experience in open-source big data computing frameworks and scenarios such as Hadoop, Spark, Flink, Presto, and more

ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and bring joy. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.
