Big Data Engineer - Data Lake / Feature Store

2 Months ago • All levels • Data Analysis

Job Summary

Job Description

The Big Data Engineer will be responsible for developing and optimizing the in-house Feature Store functionality based on Iceberg. They will also participate in optimizing the integration of Iceberg with various upper-level computing engines and be involved in platform-related infrastructure development. The role requires working with Spark, Primus, and Ray to support offline data processing and distributed training for various business scenarios within the company. The team focuses on functional and performance optimizations for large-scale scenarios and on the adoption of new-generation distributed application frameworks.

Job Details

About ByteDance

Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.

Why Join Us

Creation is the core of ByteDance's purpose. Our products are built to help imaginations thrive. This is doubly true of the teams that make our innovations possible. Together, we inspire creativity and enrich life - a mission we work towards every day. To us, every challenge, no matter how ambiguous, is an opportunity: to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always. At ByteDance, we create together and grow together. That's how we drive impact - for ourselves, our company, and the users we serve. Join us.

About The Team

The batch processing team is responsible for the company's offline data processing and distributed training, supporting business scenarios such as offline ETL and machine learning. The components involved include the offline computing engine Spark, the in-house distributed training framework Primus, feature storage solutions such as Iceberg and Hudi, and Ray, a next-generation distributed application framework. To handle massive-scale scenarios, the team has carried out extensive functional and performance optimizations in Spark, Primus, and the Feature Store, and supports the adoption of Ray in relevant company scenarios.

What you will be doing:

- Develop and optimize the performance of the in-house Feature Store functionality based on Iceberg;
- Participate in optimizing the integration of Iceberg with various upper-level computing engines;
- Be involved in platform-related infrastructure development.
