Senior Engineering Manager, ML Performance

1 Month ago • 8-13 Years • Artificial Intelligence • $237,000 PA - $337,000 PA

Job Summary

Job Description

The Senior Engineering Manager, ML Performance will focus on LLM performance analysis and optimizations for partner teams including Google Gemini, Search Magi, Cloud LLM APIs, etc. Responsibilities include identifying and maintaining LLM training and serving benchmarks, exploring numeric and algorithmic optimizations, engaging with various Google product teams to solve their LLM performance challenges, analyzing performance and efficiency metrics, and implementing solutions at Google fleet-wide scale. This role requires strong technical leadership, experience managing engineering teams, and expertise in machine learning systems and performance analysis. The ideal candidate will have experience with TensorFlow/JAX, TPUs, and large-scale model training.
Must have:
  • 8+ years software development experience (Python, C++)
  • 5+ years technical leadership & people management
  • Performance analysis expertise
  • LLM performance optimization
  • TensorFlow/JAX TPU experience
Good to have:
  • Master's/PhD in CS
  • Experience in matrixed organization
  • Compiler optimization experience
  • Numeric/algorithmic optimization (quantization, sparsity)
  • Machine learning systems experience
Perks:
  • Bonus
  • Equity
  • Benefits

Job Details

Minimum qualifications:

  • Bachelor’s degree or equivalent practical experience.
  • 8 years of experience with software development in one or more programming languages (e.g., Python, C++).
  • 5 years of experience in a technical leadership role; overseeing projects, with 5 years of experience in a people management, supervision/team leadership role.
  • Experience with performance analysis.

Preferred qualifications:

  • Master's degree or PhD in Computer Science or related technical field.
  • 5 years of experience working in a matrixed organization.
  • Experience working on compiler optimizations or related fields.
  • Experience with numeric and algorithmic optimization like quantization, sparsity, or model architecture improvement.
  • Experience in machine learning systems (e.g., background theory, TensorFlow, or other ML tools).

About the job

Like Google's own ambitions, the work of a Software Engineer goes way beyond just Search. Software Engineering Managers have not only the technical expertise to take on and provide technical leadership to major projects, but also manage a team of engineers. You not only optimize your own code but make sure engineers are able to optimize theirs. As a Software Engineering Manager you manage your project goals, contribute to product strategy and help develop your team. Teams work all across the company, in areas such as information retrieval, artificial intelligence, natural language processing, distributed computing, large-scale system design, networking, security, data compression, user interface design; the list goes on and is growing every day. Operating with scale and speed, our exceptional software engineers are just getting started -- and as a manager, you guide the way.

With technical and leadership expertise, you manage engineers across multiple teams and locations, a large product budget and oversee the deployment of large-scale projects across multiple sites internationally.

Google Cloud accelerates every organization’s ability to digitally transform its business and industry. We deliver enterprise-grade solutions that leverage Google’s cutting-edge technology, and tools that help developers build more sustainably. Customers in more than 200 countries and territories turn to Google Cloud as their trusted partner to enable growth and solve their most critical business problems.

The US base salary range for this full-time position is $237,000-$337,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. The range displayed on each job posting reflects the minimum and maximum target salaries for the position across all US locations. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.

Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about .

Responsibilities

  • Focus on LLM performance analysis and optimizations for partner teams including Google Gemini, Search Magi, Cloud LLM APIs, etc.
  • Identify and maintain LLM training and serving benchmarks and use them to identify performance opportunities and drive TensorFlow/JAX TPU out-of-the-box performance toward state-of-the-art.
  • Explore numeric and algorithmic optimizations, new ML model architecture/optimizer/training techniques to solve ML tasks more efficiently, and new techniques to reduce the label/unlabeled ML data needed to train a model to target accuracy. 
  • Engage with various Google product teams to solve their LLM performance challenges, including onboarding new LLM models and products on Google’s TPU hardware and enabling LLMs to train efficiently on a very large scale (i.e., thousands of TPUs).
  • Analyze performance and efficiency metrics to identify bottlenecks, design, and implement solutions at Google fleet-wide scale.

Similar Jobs

PwC - Senior AI Developer - MILANO [DIG]

PwC

Milan, Lombardy, Italy (On-Site)
4 Months ago
ByteDance - Research Scientist, Reinforcement Learning

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
ByteDance - Senior Research Scientist, Foundation Model, Speech Understanding

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
PwC - IN-Senior Associate_ML Engineer_Data and Analytics_Advisory_Bangalore

PwC

Bengaluru, Karnataka, India (On-Site)
3 Months ago
PwC - Conversational AI Developer- Manager

PwC

Bengaluru, Karnataka, India (On-Site)
4 Months ago
Intel Corporation - CCG CGAI: AI Software Solutions Engineer

Intel Corporation

(On-Site)
1 Month ago
DeepSight AI Labs   - Intern/Computer Vision Engineer

DeepSight AI Labs

Gurugram, Haryana, India (On-Site)
8 Months ago
Token Metrics - Crypto Data Scientist / Machine Learning Engineer  (Remote)

Token Metrics

Tirana, Tirana County, Albania (Remote)
3 Months ago
Social Discovery Group - Senior NLP Engineer

Social Discovery Group

Serbia (Remote)
3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

ByteDance - Engineering Manager Machine Learning Infrastructure

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
ByteDance - Video Analysis and Quality Algorithm Engineer - 2023 Start (MS)

ByteDance

San Diego, California, United States (On-Site)
3 Months ago
Visa - Senior Manager Data Science - Visa Consulting & Analytics

Visa

Mumbai, Maharashtra, India (On-Site)
3 Months ago
Frost & Sullivan - AI Engineer

Frost & Sullivan

Tamil Nadu, India (On-Site)
4 Months ago
The Walt Disney Company - Principal Machine Learning Engineer

The Walt Disney Company

San Francisco, California, United States (On-Site)
2 Months ago
Web Secure AI - Data Scientist

Web Secure AI

Bengaluru, Karnataka, India (On-Site)
2 Months ago
Microsoft - Research Intern - Applied Science in Viva Insights

Microsoft

Redmond, Washington, United States (On-Site)
1 Month ago
ByteDance - Software Engineer in Large Model System Graduate (Machine Learning Sys-US) - 2024 Start (BS/MS)

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
Autodesk - Machine Learning Developer

Autodesk

Toronto, Ontario, Canada (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Sunnyvale, California, United States

Luxoft - Senior C++ AUTOSAR Adaptive Software Developer with Security Knowledge

Luxoft

Poland, Ohio, United States (Remote)
2 Months ago
Meta - Marketing Science Partner (Financial Services)

Meta

Menlo Park, California, United States (On-Site)
3 Months ago
Fabric - Applied Researcher, Cryptography Proof Systems

Fabric

Seattle, Washington, United States (Remote)
4 Months ago
Amazon Games - Senior Software Engineer, Amazon Games AI Research

Amazon Games

San Diego, California, United States (On-Site)
1 Month ago
Axinous - Senior Network Engineer

Axinous

United States (Remote)
1 Month ago
Zoox - Software Systems Engineer - Software Health and Complexity

Zoox

Foster City, California, United States (Hybrid)
3 Months ago
Backbone - Technical Program Manager, Mechanical

Backbone

Atherton, California, United States (Hybrid)
6 Months ago
Fabric - Applied Cryptographer, ZKP Research

Fabric

Los Angeles, California, United States (Remote)
4 Months ago
Netflix - Machine Learning Scientist (L5) - Content and Studio

Netflix

United States (Remote)
1 Month ago
Hasbro - Senior Business Analyst

Hasbro

Pawtucket, Rhode Island, United States (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Canva - Machine Learning Engineering Manager (m/f/x) - Canva Austria

Canva

Vienna, Vienna, Austria (Remote)
3 Months ago
CharacterAI - Research Engineer, Post-Training

CharacterAI

Menlo Park, California, United States (On-Site)
6 Months ago
CharacterAI - Software Engineer, Machine Learning Infrastructure

CharacterAI

New York, New York, United States (On-Site)
2 Months ago
Level AI - Enterprise Account Executive  (Remote, US)

Level AI

United States (Remote)
4 Months ago
Eleven Labs - Machine Learning Researcher

Eleven Labs

London, England, United Kingdom (Remote)
2 Months ago
Google - AI Sales Specialist, Startups, Google Cloud

Google

San Francisco, California, United States (On-Site)
3 Months ago
ARRISE powering Pragmatic Play - Sr.Data Scientist

ARRISE powering Pragmatic Play

India (Remote)
5 Months ago
ByteDance - Product Solution Architect, Volcano ARK (Singapore)

ByteDance

Singapore (On-Site)
3 Months ago
Microsoft - Senior Researcher: Machine Learning – Microsoft Research AI for Science

Microsoft

Cambridge, England, United Kingdom (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.

View All Jobs

Get notified when new jobs are added by Google

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug