Backend Engineer (Machine Learning Infrastructure), AML Engine - 2025 Start

5 Months ago • All levels • Research & Development

Job Summary

Job Description

ByteDance's AML team seeks a Backend Engineer specializing in Machine Learning Infrastructure to build and enhance a global-scale system for recommendation, ads, and search ranking models. This role involves designing and implementing robust ML infrastructure, optimizing training and serving workflows, managing data pipelines, and mentoring interns. The ideal candidate possesses strong programming skills in C/C++/Python, familiarity with deep learning frameworks (TensorFlow/PyTorch), and experience in deploying large-scale systems. A background in big data frameworks (Spark/Hadoop/Flink) and expertise in resource management for distributed systems are highly valued.
Must have:
  • Proficient in C/C++/Python
  • Solid programming skills
  • Familiar with deep learning frameworks (TensorFlow/PyTorch)
  • Experience in developing and deploying large-scale systems
  • Ability to work independently and complete projects in a timely manner
  • Good communication and teamwork skills
Good to have:
  • Experience contributing to an open-source ML framework (TensorFlow/PyTorch)
  • Experience in big data frameworks (e.g., Spark/Hadoop/Flink)
  • Experience in resource management and task scheduling for large-scale distributed systems
  • Strong background in Hardware-Software Co-Design, High Performance Computing, ML Hardware Acceleration (e.g., GPU/TPU/RDMA) or ML for Systems

Job Details

ByteDance will be prioritizing applicants who have a current right to work in Singapore and do not require ByteDance's sponsorship of a visa.

Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok and Helo as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.

Why Join Us
Creation is the core of ByteDance's purpose. Our products are built to help imaginations thrive. This is doubly true of the teams that make our innovations possible. Together, we inspire creativity and enrich life - a mission we aim towards achieving every day. To us, every challenge, no matter how ambiguous, is an opportunity: to learn, to innovate, and to grow as one team. Status quo? Never. Courage? Always. At ByteDance, we create together and grow together. That's how we drive impact - for ourselves, our company, and the users we serve. Join us.

About the Team
The mission of our AML team is to push next-generation machine learning algorithms and platforms for the recommendation system, ads ranking and search ranking in our company. We also drive substantial impact for core businesses of the company. Currently we are looking for a Software Engineer - Machine Learning Infrastructure to join our team to support and advance that mission.

We are looking for talented individuals to join us in 2025. As a graduate, you will get unparalleled opportunities to kickstart your career, pursue bold ideas and explore limitless growth opportunities. Co-create a future driven by your inspiration with ByteDance.

Candidates can apply to a maximum of two positions and will be considered for jobs in the order in which they apply. The application limit is applicable to ByteDance and its affiliates' jobs globally. Applications will be reviewed on a rolling basis - we encourage you to apply early.

Responsibilities
  • Design and implement a global-scale machine learning system for feeds, ads and search ranking models.
  • Improve the usability and flexibility of the machine learning infrastructure.
  • Improve the workflow of model training and serving, data pipelines and resource management for the multi-tenancy machine learning systems.
  • Design and develop key components of ML infrastructure and mentor interns.
  • Research, design, and develop computer and network software or specialised utility programs.
  • Analyse user needs and develop software solutions, applying principles and techniques of computer science, engineering, and mathematical analysis.
  • Update software, enhance existing software capabilities, and develop and direct software testing and validation procedures.
  • Work with computer hardware engineers to integrate hardware and software systems and develop specifications and performance requirements.
Qualifications
Minimum Qualifications
  • Final-year student or recent graduate with a background in Software Development, Computer Science, Computer Engineering, or a related technical discipline.
  • Proficient in C/C++/Python, with solid programming skills.
  • Familiar with deep learning frameworks (TensorFlow/PyTorch).
  • Experience in developing and deploying large-scale systems.
  • Ability to work independently and complete projects from beginning to end in a timely manner.
  • Good communication and teamwork skills to clearly communicate technical concepts with other teammates.
  • Experience in improving core machine learning infrastructure (TensorFlow, PyTorch, and JAX).

Preferred Qualifications
  • Experience contributing to an open-source machine learning framework (TensorFlow/PyTorch).
  • Experience in big data frameworks (e.g., Spark/Hadoop/Flink) and in resource management and task scheduling for large-scale distributed systems.
  • Strong background in one of the following fields: Hardware-Software Co-Design, High Performance Computing, ML Hardware Acceleration (e.g., GPU/TPU/RDMA) or ML for Systems.

ByteDance is committed to creating an inclusive space where employees are valued for their skills, experiences, and unique perspectives. Our platform connects people from across the globe and so does our workplace. At ByteDance, our mission is to inspire creativity and bring joy. To achieve that goal, we are committed to celebrating our diverse voices and to creating an environment that reflects the many communities we reach. We are passionate about this and hope you are too.

By submitting an application for this role, you accept and agree to our global applicant privacy policy, which may be accessed here: https://jobs.bytedance.com/en/legal/privacy. If you have any questions, please reach out to us at apac-earlycareers@bytedance.com

