Technical Program Manager, Cloud ML Compute Services

2 Days ago • 8-10 Years • Artificial Intelligence • $183,000 PA - $271,000 PA

Job Summary

Job Description

As a Technical Program Manager at Google, you'll lead complex, multi-disciplinary projects from start to finish. You'll collaborate with stakeholders to plan requirements, identify risks, manage schedules, and communicate with cross-functional partners. Responsibilities include defining project scope, developing plans, managing timelines, communicating with stakeholders (engineering, product, research), identifying and mitigating risks, understanding ML workload monitoring and diagnostics (distributed systems, performance optimization, ML model convergence), and translating business requirements into technical solutions. This role requires expertise in Machine Learning and program management, working with Cloud ML Compute Services within the MSCA organization.
Must have:
  • Bachelor's degree or equivalent experience
  • 8+ years ML experience
  • 8+ years program management experience
  • Cross-functional project management
  • Understanding of ML Infra/TPU/GPU systems
Good to have:
  • 10+ years managing complex projects
  • Software development & distributed systems knowledge
Perks:
  • Bonus
  • Equity
  • Benefits

Job Details


Minimum qualifications:

  • Bachelor's degree in a relevant field, or equivalent practical experience.
  • 8 years of experience working on Machine Learning (ML).
  • 8 years of experience in program management.

Preferred qualifications:

  • 10 years of experience managing complex cross-functional or cross-team projects.
  • Understanding of software development, distributed systems, and ML Infra, or TPU/GPU systems.

About the job

A problem isn’t truly solved until it’s solved for all. That’s why Googlers build products that help create opportunities for everyone, whether down the street or across the globe. As a Technical Program Manager at Google, you’ll use your technical expertise to lead complex, multi-disciplinary projects from start to finish. You’ll work with stakeholders to plan requirements, identify risks, manage project schedules, and communicate clearly with cross-functional partners across the company. You're equally comfortable explaining your team's analyses and recommendations to executives as you are discussing the technical tradeoffs in product development with engineers.

The ML, Systems, & Cloud AI (MSCA) organization at Google designs, implements, and manages the hardware, software, machine learning, and systems infrastructure for all Google services (Search, YouTube, etc.) and Google Cloud. Our end users are Googlers, Cloud customers and the billions of people who use Google services around the world.

We prioritize security, efficiency, and reliability across everything we do - from developing our latest TPUs to running a global network, while driving towards shaping the future of hyperscale computing. Our global impact spans software and hardware, including Google Cloud’s Vertex AI, the leading AI platform for bringing Gemini models to enterprise customers.

The US base salary range for this full-time position is $183,000-$271,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.

Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about .

Responsibilities

  • Collaborate with cross-functional teams to define project scope, goals, and deliverables. Develop detailed project plans, identify dependencies, and manage timelines.
  • Communicate with stakeholders across engineering, product, and research to ensure alignment and drive progress.
  • Identify and mitigate risks that could impact project success.
  • Understand the technical aspects of ML workload monitoring and diagnostics, including distributed systems, performance optimization, and ML model convergence.
  • Work with engineers, researchers, and product managers to translate business requirements into technical solutions.

Similar Jobs

Google - Technical Program Manager III, Hardware Compliance, Geo

Google

Mountain View, California, United States (On-Site)
2 Weeks ago
Google - Staff Firmware Engineer, Pixel System Software

Google

New Taipei, New Taipei City, Taiwan (On-Site)
3 Days ago
ByteDance - Machine Learning Engineer Intern (Global E-commerce Risk Control)

ByteDance

San Jose, California, United States (On-Site)
3 Days ago
Survay Monkey - Content Strategist

Survay Monkey

Ottawa, Ontario, Canada (Hybrid)
9 Hours ago
Gloss Genius - Strategy and Operations Manager, Payments

Gloss Genius

New York, New York, United States (Hybrid)
8 Hours ago
NVIDIA - Senior Software Engineer - Automated Parallel Programming

NVIDIA

North Carolina, United States (Remote)
1 Month ago
Virtuos - Senior Machine Learning Engineer (Game)

Virtuos

Vietnam (On-Site)
2 Weeks ago
Interface AI - Software Development Engineer IV - Backend

Interface AI

India (Remote)
2 Months ago
Google - Customer Engineer, Data and AI

Google

Melbourne, Victoria, Australia (On-Site)
2 Weeks ago
Hedra - Research Scientist

Hedra

San Francisco, California, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Egnyte - Sr. Software Engineer

Egnyte

Mountain View, California, United States (Hybrid)
5 Months ago
NetEase Games - Procurement Business Partner

NetEase Games

Dublin, County Dublin, Ireland (On-Site)
4 Months ago
Google - Sales Operations Manager, Devices and Services

Google

Tokyo, Japan (On-Site)
2 Weeks ago
Google - Technical Program Manager II, Cloud Networking, Telecommunications

Google

Addison, Texas, United States (On-Site)
2 Days ago
Aerospike - Principal Product Manager

Aerospike

Mountain View, California, United States (On-Site)
23 Hours ago
Riot Games - Senior Manager, Game Production

Riot Games

Shanghai, China (On-Site)
1 Day ago
Monzo - Senior Product Manager, Homeownership

Monzo

London, England, United Kingdom (On-Site)
8 Hours ago
JustPlay - Senior UI/Visual Designer

JustPlay

(Remote)
1 Month ago
Reversing Labs - Senior Customer Success Manager

Reversing Labs

United States (Remote)
3 Weeks ago
Corsair - HR Operations Manager

Corsair

Milpitas, California, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Sunnyvale, California, United States

Google - Program Manager, YouTube Operations (Fixed-Term Contract)

Google

New York, New York, United States (On-Site)
2 Days ago
Google - Senior Software Engineer, Machine Learning, Search

Google

Mountain View, California, United States (On-Site)
2 Weeks ago
Google - Software Engineer III, Mobile (Android), Google Workspace

Google

Sunnyvale, California, United States (On-Site)
2 Weeks ago
CRB workforce  - IT Field Technician

CRB workforce

Orange, California, United States (On-Site)
1 Day ago
Sony Pictures Animation - Story Artist - Series

Sony Pictures Animation

Culver City, California, United States (On-Site)
6 Months ago
Highspot - Principal Frontend Web Engineer

Highspot

Seattle, Washington, United States (Hybrid)
6 Months ago
Google - PhD Software Engineer

Google

Sunnyvale, California, United States (On-Site)
1 Week ago
Nexon America - Security Compliance Analyst

Nexon America

El Segundo, California, United States (Hybrid)
1 Day ago
Google - Customer Engineer IV, Platform, Greenfield, Google Cloud

Google

Miami, Florida, United States (On-Site)
2 Days ago
Jane Street - Financial Reporting Accountant

Jane Street

New York, New York, United States (On-Site)
7 Hours ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

FTF Studios - FTF Senior Programmer

FTF Studios

(Remote)
1 Year ago
ByteDance - Research Scientist Graduate (Foundation Model - Vision and Language)

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
Hedra - Research Engineer

Hedra

San Francisco, California, United States (On-Site)
1 Month ago
Trend Micro - Large Language Models (LLM) Expert (VicOne_Automotive Security)

Trend Micro

Taipei City, Taiwan (On-Site)
7 Months ago
Alpha Sense - Lead AI Platform Engineer

Alpha Sense

New York, New York, United States (On-Site)
5 Months ago
GoTo Group - Senior Data Scientist - Computer Vision - KYC

GoTo Group

Singapore (On-Site)
6 Months ago
ByteDance - Student Researcher (Doubao (Seed) - Foundation Model - MultiModal Generative Model)

ByteDance

San Jose, California, United States (On-Site)
1 Month ago
Ello - Tech Lead, Machine Learning

Ello

San Francisco, California, United States (On-Site)
1 Month ago
Meta - Postdoctoral Researcher, Embodied AI (PhD)

Meta

Seattle, Washington, United States (On-Site)
5 Months ago
NVIDIA - Technical Marketing Engineer - AI Platform Software

NVIDIA

Santa Clara, California, United States (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.

Mountain View, California, United States (On-Site)

Mountain View, California, United States (On-Site)

Bengaluru, Karnataka, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

View All Jobs

Get notified when new jobs are added by Google

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug