Machine Learning Engineer - Model Training Infrastructure

1 Month ago • 5 Years + • $334,000 PA - $435,000 PA

Job Summary

Job Description

The Machine Learning Engineer will be responsible for designing and implementing a global-scale machine learning system for feeds, ads, and search ranking models. The role involves improving the usability and flexibility of the machine learning infrastructure, enhancing model training and serving workflows, data pipelines, storage systems, and resource management for multi-tenancy machine learning systems. The engineer will also design and develop key components of ML infrastructure, mentor interns, and contribute to the overall advancement of the company's AI infrastructure and recommendation platform. This role demands a strong understanding of large-scale system development and experience with deep learning frameworks and core machine learning infrastructure.
Must have:
  • 5+ years of experience in developing and deploying large-scale systems.
  • Proficiency in C/C++/CUDA/Python and solid programming skills.
  • Familiarity with deep learning frameworks (TensorFlow/Pytorch).
Good to have:
  • Experience contributing to an open-sourced machine learning framework (TensorFlow/PyTorch).
  • Experience in using/designing open-source machine learning lifecycle management systems: TFX
Perks:
  • Day one access to medical, dental, and vision insurance.
  • 401(k) savings plan with company match.
  • Paid parental leave.
  • Short-term and long-term disability coverage.
  • Life insurance.
  • Wellbeing benefits.
  • 10 paid holidays per year.
  • 10 paid sick days per year.
  • 17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure).

Job Details

The mission of our AML team is to push the next-generation AI infrastructure and recommendation platform for the ads ranking, search ranking, live & ecom ranking in our company. We also drive substantial impact on core businesses of the company. Currently, we are looking for Machine Learning Engineer in Model Training Infrastructure to join our team to support and advance that mission. Responsibilities: - Responsible for the design and implementation of a global-scale machine learning system for feeds, ads and search ranking models. - Responsible for improving use-ability and flexibility of the machine learning infrastructure. - Responsible for improving the workflow of model training and serving, data pipelines, storage system and resource management for multi-tenancy machine learning systems. - Responsible for designing and developing key components of ML infrastructure and mentoring interns.
Qualifications
Minimum Qualifications - At least 5 years of experience in developing and deploying large-scale systems. - Proficient in C/C++/CUDA/Python, and have solid programming skills. - Familiar with deep learning frameworks (TensorFlow/Pytorch). - Experience on improving core machine learning infrastructure(TensorFlow, Pytorch, and Jax). Preferred Qualifications: - Experience contributing to an open sourced machine learning framework (TensorFlow/PyTorch). - Experience in using/designing open-source machine learning lifecycle management systems: TFX

Similar Jobs

Reddit - Senior Machine Learning Engineer

Reddit

(Remote)
2 Months ago
Inkittt - Senior Machine Learning Engineer, Recommendations

Inkittt

San Francisco, California, United States (Hybrid)
4 Months ago
Rackspace Technology - Senior Machine Learning Engineer

Rackspace Technology

Vietnam (Remote)
3 Months ago
PrizePicks - Staff Data Science Engineer

PrizePicks

Atlanta, Georgia, United States (Remote)
1 Month ago
bytedance - Machine Learning Engineer - Machine Learning Infrastructure

bytedance

San Jose, California, United States (On-Site)
7 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

whoop - Machine Learning Engineer II

whoop

Boston, Massachusetts, United States (On-Site)
4 Weeks ago
Canva - Senior Computer Vision Engineer - Photo AI

Canva

Vienna, Vienna, Austria (Remote)
2 Months ago
Autodesk - Senior Construction Research Scientist

Autodesk

Toronto, Ontario, Canada (Hybrid)
1 Week ago
bytedance - Imaging System Architect

bytedance

San Jose, California, United States (On-Site)
1 Month ago
Scale AI - Head of Frontier Data Operations

Scale AI

San Francisco, California, United States (On-Site)
1 Month ago
Microsoft - Research Intern - Microsoft Teams CMD Labs

Microsoft

Redmond, Washington, United States (On-Site)
1 Month ago
ManyChat - Lead Machine Learning Scientist

ManyChat

Amsterdam, North Holland, Netherlands (Hybrid)
1 Week ago
Rackspace Technology - Principal MLOps Engineer

Rackspace Technology

Toronto, Ontario, Canada (Remote)
2 Months ago
Inworld AI - Staff / Principal Machine Learning Engineer - USA

Inworld AI

Mountain View, California, United States (Remote)
6 Months ago
bytedance - Machine Learning Engineer Intern (E-commerce-Supply Chain & Logistics)

bytedance

Seattle, Washington, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in San Jose, California, United States

Playstation - Senior Manager - Content Personalization

Playstation

San Mateo, California, United States (Hybrid)
1 Week ago
Epic Games - Senior Desktop Engineer, Fortnite Tech

Epic Games

United States (On-Site)
5 Months ago
bytedance - Software Development Engineer, Network Automation

bytedance

San Jose, California, United States (On-Site)
1 Month ago
Axonius - Sr. Sales Engineer - SLED/Public Sector

Axonius

United States (Remote)
1 Week ago
aspyr - Senior Game Producer

aspyr

Austin, Texas, United States (On-Site)
4 Weeks ago
Sail Point - Solution Architect IIQ

Sail Point

United States (On-Site)
2 Weeks ago
Google - Senior Software Engineer, Infrastructure, Google Cloud NetInfra

Google

Sunnyvale, California, United States (On-Site)
1 Month ago
Qualcomm - SoC Power/Performance Post-Si Validation & Emulation Engineer

Qualcomm

San Diego, California, United States (On-Site)
2 Weeks ago
Somewear Labs - Android Engineer

Somewear Labs

United States (Remote)
10 Months ago
Evolution  - Casino Game Presenter (Live Chat Agent Alternative) - up to $25/hr

Evolution

Atlantic City, New Jersey, United States (On-Site)
7 Months ago

Get notifed when new similar jobs are uploaded

Similar Category Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

About The Company

Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.

Seattle, Washington, United States (On-Site)

San Jose, California, United States (On-Site)

Tokyo, Japan (On-Site)

San Jose, California, United States (On-Site)

San Jose, California, United States (On-Site)

San Jose, California, United States (On-Site)

San Jose, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by bytedance