Staff Software Engineer, ML Hardware, YouTube Discovery

47 Minutes ago • 8-12 Years • Artificial Intelligence

About the job

Job Description

This Staff Software Engineer role at YouTube focuses on optimizing YouTube's recommender systems for Google's TPUs (Tensor Processing Units). You'll develop and execute strategies to leverage new ML hardware, including adapting models for quantized training and inference. Responsibilities include leading engineering efforts to improve model efficiency, evaluating new hardware, and collaborating with model developers and hardware experts. The role requires deep experience in ML infrastructure, model deployment, and large-scale model development, especially with recommender systems and LLMs. You will also play a key role in shaping YouTube's ML hardware adoption strategy.
Must have:
  • 8+ years software development experience
  • 5+ years ML infrastructure experience
  • 3+ years developing large-scale ML models
  • 2+ years technical leadership
  • Experience with post-training quantization
Good to have:
  • Experience with AI infrastructure, compilers, or performance engineering
  • Experience optimizing ML models for ML hardware accelerators
  • Excellent communication skills
Perks:
  • Bonus
  • Equity
  • Benefits

Minimum qualifications:

  • Bachelor's degree or equivalent practical experience.
  • 8 years of experience in software development, and with data structures/algorithms.
  • 5 years of experience with ML infrastructure (e.g., model deployment, model evaluation, optimization, debugging).
  • 3 years of experience in developing large-scale ML models (recommender systems, LLMs, computer vision, etc) utilizing ML hardware accelerators.
  • 2 years of experience in a technical leadership role.
  • Experience with post-training quantization, quantized aware training, or quantized training for ML models.

Preferred qualifications:

  • Experience with building efficient and reusable AI infrastructure, compilers, or performance engineering.
  • Experience with optimizing ML models to efficiently run on ML hardware accelerators.
  • Excellent communication skills, with the ability to convey complex technical concepts clearly and concisely.

About the job

YouTube's growth is generated by Machine Learning (ML) powered personalized recommendations. Over time, these models are growing larger and it's imperative they be efficiently trained and served using Google's ML hardware Tensor Processing Unit (TPU) Simultaneously, Gemini and Large Language Model offer exciting new model architectures and are developing new advancements in ML hardware development.

In this role, you will be responsible for ensuring YouTube's business-critical recommender models best utilize the available ML hardware. To do this, you will be responsible for managing YouTube's participation in hardware development and evaluation programs. Additionally, you will initiate and drive efforts to adapt YouTube's models to take advantage of new accelerator capabilities, for example by adopting quantized training and inference.

At YouTube, we believe that everyone deserves to have a voice, and that the world is a better place when we listen, share, and build community through our stories. We work together to give everyone the power to share their story, explore what they love, and connect with one another in the process. Working at the intersection of cutting-edge technology and boundless creativity, we move at the speed of culture with a shared goal to show people the world. We explore new ideas, solve real problems, and have fun — and we do it all together.

The US base salary range for this full-time position is $189,000-$284,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. The range displayed on each job posting reflects the minimum and maximum target salaries for the position across all US locations. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.

Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about .

Responsibilities

  • Develop YouTube Discovery's ML hardware adoption strategy.
  • Initiate and lead engineering efforts to adapt YouTube's recommender models to perform efficiently on future generations of ML hardware.
  • Lead YouTube's evaluation of new ML hardware, in collaboration with model developers and Google-wide ML hardware and software experts.
View Full Job Description
$189.0K - $284.0K/yr (Outscal est.)
$236.5K/yr avg.
San Bruno, California, United States

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.

Mountain View, California, United States (On-Site)

Virginia, United States (On-Site)

Bengaluru, Karnataka, India (On-Site)

Reston, Virginia, United States (On-Site)

Mexico City, Mexico City, Mexico (On-Site)

Sydney, New South Wales, Australia (On-Site)

View All Jobs

Get notified when new jobs are added by Google

Similar Jobs

Playrix - UA Marketing Producer

Playrix, Georgia (Remote)

G5 Games - C++ Gameplay Programmer

G5 Games, Kazakhstan (Remote)

Elevation Capital - Data Scientist

Elevation Capital, India (On-Site)

Red Point Labs - Java Backend Developer (Remote OK)

Red Point Labs, Argentina (Remote)

Hyper Verge - Machine Learning Engineer II

Hyper Verge, India (On-Site)

Redhorse Corp - Systems Architect

Redhorse Corp, United States (On-Site)

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

OpenGov - Software Engineer III - PRO

OpenGov, United States (Hybrid)

MyGames - Unity Developer

MyGames, (Remote)

Auros Global - Strategy Developer - Asia

Auros Global, (Remote)

Netflix - Machine Learning Intern - Spring or Summer 2025

Netflix, United States (On-Site)

SingleStore - Senior Engineer, Engine Security (C/C++)

SingleStore, India (Hybrid)

Tango Eye - Sr. Computer Vision Developer

Tango Eye, India (On-Site)

Google - Software Engineer III, TPU Backend, Complier

Google, United States (On-Site)

Autodesk - Principal Software Developer  (C++)

Autodesk, Canada (Hybrid)

Get notifed when new similar jobs are uploaded

Jobs in San Bruno, California, United States

Next Level Business Services - SQL BI Developer

Next Level Business Services, United States (On-Site)

Funko - Senior Accountant

Funko, United States (On-Site)

The Mill - Senior Systems Engineer

The Mill, United States (On-Site)

Alphasense - Senior Director of Product, Enterprise Intelligence

Alphasense, United States (On-Site)

The Walt Disney Company - Manager, Product Design (Disney +)

The Walt Disney Company, United States (On-Site)

Blizzard Entertainment - Principal Designer, Open World - Unannounced Game

Blizzard Entertainment, United States (Hybrid)

The Walt Disney Company - Manager, Employee Comms & Engagement

The Walt Disney Company, United States (On-Site)

Mattel  Inc  - Senior Manager, Brand Creative

Mattel Inc , United States (On-Site)

IGT - Systems Engineer

IGT, United States (Remote)

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Rackspace Technology - Senior MLOPs Engineer

Rackspace Technology, United States (Remote)

ByteDance - Research Scientist- Foundation Model, Generative AI

ByteDance, United States (On-Site)

Stonewall Collision & Auto Painting - Senior Data Scientist

Stonewall Collision & Auto Painting, India (On-Site)

Spell Brush - AI Infrastructure Engineer

Spell Brush, United States (On-Site)

Get notifed when new similar jobs are uploaded