Staff Software Engineer, ML Hardware, YouTube Discovery

4 Weeks ago • 8-12 Years • Artificial Intelligence

About the job

Job Description

This Staff Software Engineer role at YouTube focuses on optimizing YouTube's recommendation models for Google's TPU hardware. Responsibilities include developing a hardware adoption strategy, leading engineering efforts to adapt models for efficient performance on new hardware generations, and evaluating new ML hardware. The ideal candidate will have extensive experience in software development, ML infrastructure, large-scale model development (specifically recommender systems), and technical leadership. Experience with quantized training and inference is crucial. The role requires collaboration with model developers and Google-wide ML experts to ensure YouTube's recommender models best utilize available ML hardware.
Must have:
  • 8+ years software development experience
  • 5+ years ML infrastructure experience
  • 3+ years developing large-scale ML models
  • 2+ years technical leadership
  • Experience with post-training quantization
  • Develop YouTube's ML hardware adoption strategy
  • Lead engineering efforts to adapt YouTube's models
Good to have:
  • Experience with AI infrastructure, compilers, or performance engineering
  • Experience optimizing ML models for ML hardware accelerators
  • Excellent communication skills
Perks:
  • Bonus
  • Equity
  • Benefits
Not hearing back from companies?
Unlock the secrets to a successful job application and accelerate your journey to your next opportunity.

Minimum qualifications:

  • Bachelor's degree or equivalent practical experience.
  • 8 years of experience in software development, and with data structures/algorithms.
  • 5 years of experience with ML infrastructure (e.g., model deployment, model evaluation, optimization, debugging).
  • 3 years of experience in developing large-scale ML models (recommender systems, LLMs, computer vision, etc) utilizing ML hardware accelerators.
  • 2 years of experience in a technical leadership role.
  • Experience with post-training quantization, quantized aware training, or quantized training for ML models.

Preferred qualifications:

  • Experience with building efficient and reusable AI infrastructure, compilers, or performance engineering.
  • Experience with optimizing ML models to efficiently run on ML hardware accelerators.
  • Excellent communication skills, with the ability to convey complex technical concepts clearly and concisely.

About the job

YouTube's growth is generated by Machine Learning (ML) powered personalized recommendations. Over time, these models are growing larger and it's imperative they be efficiently trained and served using Google's ML hardware Tensor Processing Unit (TPU) Simultaneously, Gemini and Large Language Model offer exciting new model architectures and are developing new advancements in ML hardware development.

In this role, you will be responsible for ensuring YouTube's business-critical recommender models best utilize the available ML hardware. To do this, you will be responsible for managing YouTube's participation in hardware development and evaluation programs. Additionally, you will initiate and drive efforts to adapt YouTube's models to take advantage of new accelerator capabilities, for example by adopting quantized training and inference.

At YouTube, we believe that everyone deserves to have a voice, and that the world is a better place when we listen, share, and build community through our stories. We work together to give everyone the power to share their story, explore what they love, and connect with one another in the process. Working at the intersection of cutting-edge technology and boundless creativity, we move at the speed of culture with a shared goal to show people the world. We explore new ideas, solve real problems, and have fun — and we do it all together.

The US base salary range for this full-time position is $189,000-$284,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. The range displayed on each job posting reflects the minimum and maximum target salaries for the position across all US locations. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.

Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about .

Responsibilities

  • Develop YouTube Discovery's ML hardware adoption strategy.
  • Initiate and lead engineering efforts to adapt YouTube's recommender models to perform efficiently on future generations of ML hardware.
  • Lead YouTube's evaluation of new ML hardware, in collaboration with model developers and Google-wide ML hardware and software experts.
View Full Job Description
$189.0K - $284.0K/yr (Outscal est.)
$236.5K/yr avg.
Worldwide

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.

View All Jobs

Get notified when new jobs are added by Google

Similar Jobs

Intelliswift Software - Senior Frontend Developer

Intelliswift Software, India (Hybrid)

OpenGov - Software Engineer III

OpenGov, United States (On-Site)

Qualcomm - Engineer- MM Audio Test

Qualcomm, India (On-Site)

Inworld AI - Forward Deployed Engineer - USA

Inworld AI, United States (On-Site)

Eleven Labs - Technical Investigator / Data Scientist - AI Safety

Eleven Labs, United States (Remote)

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Blinkhealth - Senior Data Engineer

Blinkhealth, India (On-Site)

Dew Software - Data Scientist

Dew Software, United States (On-Site)

PENN Interactive - Senior Data Engineer, Fulfillment

PENN Interactive, United States (Hybrid)

Snowed In Studios - Principle Software Developer

Snowed In Studios, Canada (Remote)

CleverTap - Junior Backend Engineer (Java)

CleverTap, India (Hybrid)

Lurkit - Software Engineer

Lurkit, Sweden (On-Site)

NK Securities Research - ML- Quantitative Researcher

NK Securities Research, India (Hybrid)

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Get notifed when new similar jobs are uploaded