Machine Learning Engineer - Model Training Infrastructure

2 Months ago • 5 Years + • Devops • $334,000 PA - $435,000 PA

Job Summary

Job Description

The Machine Learning Engineer will be responsible for designing and implementing a global-scale machine learning system for feeds, ads, and search ranking models. The role involves improving the usability and flexibility of the machine learning infrastructure, enhancing model training and serving workflows, data pipelines, storage systems, and resource management for multi-tenancy machine learning systems. The engineer will also design and develop key components of ML infrastructure, mentor interns, and contribute to the overall advancement of the company's AI infrastructure and recommendation platform. This role demands a strong understanding of large-scale system development and experience with deep learning frameworks and core machine learning infrastructure.
Must have:
  • 5+ years of experience in developing and deploying large-scale systems.
  • Proficiency in C/C++/CUDA/Python and solid programming skills.
  • Familiarity with deep learning frameworks (TensorFlow/Pytorch).
Good to have:
  • Experience contributing to an open-sourced machine learning framework (TensorFlow/PyTorch).
  • Experience in using/designing open-source machine learning lifecycle management systems: TFX
Perks:
  • Day one access to medical, dental, and vision insurance.
  • 401(k) savings plan with company match.
  • Paid parental leave.
  • Short-term and long-term disability coverage.
  • Life insurance.
  • Wellbeing benefits.
  • 10 paid holidays per year.
  • 10 paid sick days per year.
  • 17 days of Paid Personal Time (prorated upon hire with increasing accruals by tenure).

Job Details

The mission of our AML team is to push the next-generation AI infrastructure and recommendation platform for the ads ranking, search ranking, live & ecom ranking in our company. We also drive substantial impact on core businesses of the company. Currently, we are looking for Machine Learning Engineer in Model Training Infrastructure to join our team to support and advance that mission. Responsibilities: - Responsible for the design and implementation of a global-scale machine learning system for feeds, ads and search ranking models. - Responsible for improving use-ability and flexibility of the machine learning infrastructure. - Responsible for improving the workflow of model training and serving, data pipelines, storage system and resource management for multi-tenancy machine learning systems. - Responsible for designing and developing key components of ML infrastructure and mentoring interns.
Qualifications
Minimum Qualifications - At least 5 years of experience in developing and deploying large-scale systems. - Proficient in C/C++/CUDA/Python, and have solid programming skills. - Familiar with deep learning frameworks (TensorFlow/Pytorch). - Experience on improving core machine learning infrastructure(TensorFlow, Pytorch, and Jax). Preferred Qualifications: - Experience contributing to an open sourced machine learning framework (TensorFlow/PyTorch). - Experience in using/designing open-source machine learning lifecycle management systems: TFX

Similar Jobs

IGG - Senior Backend Engineer

IGG

Singapore (On-Site)
8 Months ago
Yahoo - Senior Software Engineer - Anti-Spam

Yahoo

United States (Hybrid)
1 Month ago
Riot Games - Senior Software Engineer, Game/UI - Teamfight Tactics, Events

Riot Games

Los Angeles, California, United States (On-Site)
1 Month ago
Capgemini - C++ Developer with Python

Capgemini

Bengaluru, Karnataka, India (Hybrid)
1 Month ago
Riot Games - Senior Software Engineer - VALORANT, Foundations, Build Platforms

Riot Games

Los Angeles, California, United States (On-Site)
2 Months ago
Unity - Senior ML Infrastructure Engineer

Unity

San Francisco, California, United States (On-Site)
9 Months ago
Apple - Senior SRE Manager, iCloud

Apple

Seattle, Washington, United States (On-Site)
4 Weeks ago
London stock Exchange - Senior Lead DevOps Engineer

London stock Exchange

Bangkok, Thailand (On-Site)
1 Day ago
luxsoft - DevOps Automation Engineer

luxsoft

Bengaluru, Karnataka, India (On-Site)
3 Weeks ago
Cubic corporation - Solutions Architect

Cubic corporation

Hamburg, Hamburg, Germany (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Wind River - Technical Leader - V&V

Wind River

Bengaluru, Karnataka, India (Hybrid)
1 Week ago
Rockstar Games - Senior Animation R&D Programmer: Retargeting

Rockstar Games

Oakville, Ontario, Canada (On-Site)
2 Months ago
Google - Software Engineering Manager, People with Disabilities

Google

São Paulo, State Of São Paulo, Brazil (On-Site)
8 Months ago
Playtech - Games Designer Mathematician

Playtech

New South Wales, Australia (On-Site)
3 Months ago
Activision - Rigger

Activision

Shanghai, China (On-Site)
1 Month ago
Osome studios - Stage UI Programmer Unreal

Osome studios

Lyon, Auvergne-Rhône-Alpes, France (On-Site)
3 Weeks ago
Epic Games - Senior Software Engineer

Epic Games

Canada (On-Site)
3 Months ago
Activision - Expert Software Engineer, Graphics

Activision

Santa Monica, California, United States (Remote)
2 Months ago
we are unseen  - Senior Gameplay Engineer

we are unseen

Tokyo, Japan (Hybrid)
1 Year ago
Rocket Science - Software Engineer - UI

Rocket Science

Wales, United Kingdom (Hybrid)
3 Months ago

Get notifed when new similar jobs are uploaded

Jobs in San Jose, California, United States

whoop - Senior Data Scientist (Insights)

whoop

Boston, Massachusetts, United States (On-Site)
2 Months ago
Apple - Senior Controls Engineer

Apple

Austin, Texas, United States (On-Site)
1 Month ago
Sleeper - Performance Creative Associate (TikTok Ads)

Sleeper

Los Angeles, California, United States (On-Site)
3 Months ago
Evolution  - In Studio Game Presenter (Retail Alternative)

Evolution

Atlantic City, New Jersey, United States (On-Site)
2 Weeks ago
Funko - Retail Stock Lead

Funko

Buckeye, Arizona, United States (On-Site)
1 Week ago
Next Level Business Services - Java Developer with Oracle SOA

Next Level Business Services

Cincinnati, Ohio, United States (On-Site)
8 Months ago
Ziff Davis - Lead Product Manager

Ziff Davis

Los Angeles, California, United States (Hybrid)
2 Months ago
Next Level Business Services - Data Scientist -  Full Time Only

Next Level Business Services

Redmond, Washington, United States (On-Site)
8 Months ago
Google - Software Developer III, Front End, Google Cloud AI

Google

Sunnyvale, California, United States (On-Site)
2 Months ago
Univision - Director, Travel & Hospitality

Univision

New York, United States (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Devops Jobs

Aristocrat - DevOps Lead

Aristocrat

Austin, Texas, United States (Hybrid)
1 Month ago
miniclip - Cloud Infrastructure Engineer

miniclip

Lisbon, Lisbon, Portugal (On-Site)
2 Days ago
bytedance - Software Engineer, Multi Cloud CDN - San Jose / Seattle / Boston

bytedance

San Jose, California, United States (On-Site)
6 Months ago
miniclip - Cloud Infrastructure Engineer - Cloud Engineer II

miniclip

Lisbon, Lisbon, Portugal (On-Site)
1 Month ago
gitlab - Intermediate Site Reliability Engineer, Foundations

gitlab

Canada (Remote)
1 Month ago
Salesforce - Revenue Cloud Solution Engineer

Salesforce

London, England, United Kingdom (On-Site)
1 Month ago
PowerSchool - Associate Cloud Operations Engineer 2

PowerSchool

Bengaluru, Karnataka, India (On-Site)
8 Months ago
BigID - Site Reliability Engineer

BigID

Hyderabad, Telangana, India (Hybrid)
1 Month ago
Dream Sports - SDE - 1 - DevOps

Dream Sports

Mumbai, Maharashtra, India (On-Site)
8 Months ago
Zazz - Solutions Architect - Backend Development

Zazz

India (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Founded in 2012, ByteDance's mission is to inspire creativity and enrich life. With a suite of more than a dozen products, including TikTok as well as platforms specific to the China market, including Toutiao, Douyin, and Xigua, ByteDance has made it easier and more fun for people to connect with, consume, and create content.
View All Jobs

Get notified when new jobs are added by bytedance