Backend Engineer (Machine Learning Infrastructure), AML Engine - 2025 Start

8 Months ago • All levels • Research Development

Job Summary

Job Description

ByteDance's AML team seeks a Backend Engineer specializing in Machine Learning Infrastructure to build and enhance a global-scale system for recommendation, ads, and search ranking models. This role involves designing and implementing robust ML infrastructure, optimizing training and serving workflows, managing data pipelines, and mentoring interns. The ideal candidate possesses strong programming skills in C/C++/Python, familiarity with deep learning frameworks (TensorFlow/PyTorch), and experience in deploying large-scale systems. A background in big data frameworks (Spark/Hadoop/Flink) and expertise in resource management for distributed systems are highly valued.
Must have:
  • Proficient in C/C++/Python
  • Solid programming skills
  • Familiar with deep learning frameworks (TensorFlow/PyTorch)
  • Experience in developing and deploying large-scale systems
  • Ability to work independently and complete projects in a timely manner
  • Good communication and teamwork skills
Good to have:
  • Experience contributing to an open-source ML framework (TensorFlow/PyTorch)
  • Experience in big data frameworks (e.g., Spark/Hadoop/Flink)
  • Experience in resource management and task scheduling for large-scale distributed systems
  • Strong background in Hardware-Software Co-Design, High Performance Computing, ML Hardware Acceleration (e.g., GPU/TPU/RDMA) or ML for Systems.

Job Details

ByteDance will be prioritizing applicants who have a current right to work in Singapore and do not require ByteDance's sponsorship of a visa.

Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok and Helo as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.

Why Join Us
Creation is the core of ByteDance's purpose. Our products are built to help imaginations thrive. This is doubly true of the teams that make our innovations possible. Together, we inspire creativity and enrich life, a mission we work towards every day. To us, every challenge, no matter how ambiguous, is an opportunity: to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always. At ByteDance, we create together and grow together. That's how we drive impact for ourselves, our company, and the users we serve. Join us.

About the Team
The mission of our AML team is to advance next-generation machine learning algorithms and platforms for the recommendation system, ads ranking, and search ranking in our company. We also drive substantial impact for the company's core businesses. We are currently looking for a Software Engineer (Machine Learning Infrastructure) to join our team to support and advance that mission.

We are looking for talented individuals to join us in 2025. As a graduate, you will get unparalleled opportunities to kickstart your career, pursue bold ideas, and explore limitless growth opportunities. Co-create a future driven by your inspiration with ByteDance.

Candidates can apply to a maximum of two positions and will be considered for jobs in the order they apply. The application limit is applicable to ByteDance and its affiliates' jobs globally. Applications will be reviewed on a rolling basis, so we encourage you to apply early.

Responsibilities
  • Design and implement a global-scale machine learning system for feeds, ads, and search ranking models.
  • Improve the usability and flexibility of the machine learning infrastructure.
  • Improve the workflow of model training and serving, data pipelines, and resource management for the multi-tenancy machine learning systems.
  • Design and develop key components of ML infrastructure and mentor interns.
  • Research, design, and develop computer and network software or specialised utility programs.
  • Analyse user needs and develop software solutions, applying principles and techniques of computer science, engineering, and mathematical analysis.
  • Update software, enhance existing software capabilities, and develop and direct software testing and validation procedures.
  • Work with computer hardware engineers to integrate hardware and software systems and develop specifications and performance requirements.
Qualifications
Minimum Qualifications
  • Final-year student or recent graduate with a background in Software Development, Computer Science, Computer Engineering, or a related technical discipline.
  • Proficient in C/C++/Python, with solid programming skills.
  • Familiar with deep learning frameworks (TensorFlow/PyTorch).
  • Experience in developing and deploying large-scale systems.
  • Ability to work independently and complete projects from beginning to end in a timely manner.
  • Good communication and teamwork skills to clearly communicate technical concepts with other teammates.
  • Experience in improving core machine learning infrastructure (TensorFlow, PyTorch, and JAX).

Preferred Qualifications
  • Experience contributing to an open-source machine learning framework (TensorFlow/PyTorch).
  • Experience in big data frameworks (e.g., Spark/Hadoop/Flink) and in resource management and task scheduling for large-scale distributed systems.
  • Strong background in one of the following fields: Hardware-Software Co-Design, High Performance Computing, ML Hardware Acceleration (e.g., GPU/TPU/RDMA), or ML for Systems.

ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and bring joy. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.

By submitting an application for this role, you accept and agree to our global applicant privacy policy, which may be accessed here: https://jobs.bytedance.com/en/legal/privacy.

If you have any questions, please reach out to us at apac-earlycareers@bytedance.com.

About The Company

Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.