Staff Software Engineer, GPU Performance, Core ML

9 Hours ago • 8-13 Years • Artificial Intelligence

Job Summary

Job Description

This Staff Software Engineer role focuses on GPU performance optimization within Google's Core ML organization. Responsibilities include building optimizations for critical products and services, shaping the GPU software stack (influencing model design, optimizing low-level kernels and compilers), resolving performance bottlenecks, and collaborating with experts in ML, compiler design, and systems architecture. The ideal candidate possesses extensive experience in software development, machine learning, and GPU programming, along with strong technical leadership skills. The role involves impacting billions of users worldwide and driving significant cloud business growth.
Must have:
  • 8+ years software development experience
  • 5+ years ML design & infrastructure experience
  • GPU experience
  • Technical leadership experience
  • C++ or Python proficiency
  • Data structures & algorithms expertise
Good to have:
  • Master's/PhD in related field
  • Compiler optimization experience
  • Low-level GPU programming (CUDA, OpenCL)
  • Experience with OpenXLA, MLIR, Triton
  • Performance modeling and benchmarking expertise

Job Details

Minimum qualifications:

  • Bachelor’s degree or equivalent practical experience.
  • 8 years of experience in software development (e.g., C++ or Python) and with data structures/algorithms.
  • 5 years of experience with Machine Learning (ML) design and ML infrastructure (e.g., model deployment, model evaluation, data processing, debugging, fine-tuning).
  • Experience working with GPUs.
  • Experience in a technical leadership role leading project teams and setting technical direction.

Preferred qualifications:

  • Master’s degree or PhD in Engineering, Computer Science, or a related technical field.
  • Experience with compiler optimization, code generation, and runtime systems for GPU architectures (OpenXLA, MLIR, Triton, etc.).
  • Expertise in tailoring algorithms and ML models to exploit GPU strengths and minimize weaknesses.
  • Knowledge of low-level GPU programming (CUDA, OpenCL, etc.) and performance tuning techniques.
  • Understanding of modern GPU architectures, memory hierarchies, and performance bottlenecks.
  • Ability to develop and utilize sophisticated performance models and benchmarks to guide optimization efforts and hardware roadmap decisions.

About the job

Google's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google’s needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full stack as we continue to push technology forward.

In recognition of hardware as a strength, Google’s Core Machine Learning (ML) organization is heavily invested in growing a powerhouse team of GPU experts, and we invite you to be at its vanguard! This is your opportunity to move beyond incremental improvements and architect truly transformative solutions, shaping the future of AI and accelerated computing for Google and the world.

While Google is known for pioneering work with TPUs, GPUs are an equally vital and rapidly expanding frontier within Google's machine learning infrastructure. GPUs are indispensable to Google’s ever-evolving landscape for strategic, pragmatic, and performance-driven reasons: ensuring top performance for our ML models, adapting to ML workloads, achieving results, and influencing next-generation GPU architectures via strategic partnerships.

The ML, Systems, & Cloud AI (MSCA) organization at Google designs, implements, and manages the hardware, software, machine learning, and systems infrastructure for all Google services (Search, YouTube, etc.) and Google Cloud. Our end users are Googlers, Cloud customers and the billions of people who use Google services around the world.

We prioritize security, efficiency, and reliability across everything we do - from developing our latest TPUs to running a global network, while driving towards shaping the future of hyperscale computing. Our global impact spans software and hardware, including Google Cloud’s Vertex AI, the leading AI platform for bringing Gemini models to enterprise customers.

Responsibilities

  • Build optimizations that not only improve benchmarks but also power Google's most critical products and services, impacting billions of users worldwide and driving significant cloud business growth.
  • Shape the entire GPU software stack through influencing model design, optimizing low-level kernels and compilers (OpenXLA, JAX, Triton), and bridging the gap between model developers and hardware for optimal co-design and performance.
  • Resolve the most challenging performance bottlenecks and explore groundbreaking optimization techniques through Google’s unparalleled access to the latest generation of GPUs, tooling, and a decade of experience building AI accelerators. 
  • Collaborate with experts in ML, compiler design, and systems architecture through internal and external partnerships, as well as open-source projects. 

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.
