Staff Software Engineer, GPU Performance, Core ML

2 Weeks ago • 8-13 Years • Artificial Intelligence

Job Summary

Job Description

This Staff Software Engineer role focuses on GPU performance optimization within Google's Core ML organization. Responsibilities include building optimizations for critical products and services, shaping the GPU software stack (influencing model design, optimizing low-level kernels and compilers), resolving performance bottlenecks, and collaborating with experts in ML, compiler design, and systems architecture. The ideal candidate possesses extensive experience in software development, machine learning, and GPU programming, along with strong technical leadership skills. The role involves impacting billions of users worldwide and driving significant cloud business growth.
Must have:
  • 8+ years software development experience
  • 5+ years ML design & infrastructure experience
  • GPU experience
  • Technical leadership experience
  • C++ or Python proficiency
  • Data structures & algorithms expertise
Good to have:
  • Master's/PhD in related field
  • Compiler optimization experience
  • Low-level GPU programming (CUDA, OpenCL)
  • Experience with OpenXLA, MLIR, Triton
  • Performance modeling and benchmarking expertise

Job Details

Minimum qualifications:

  • Bachelor’s degree or equivalent practical experience.
  • 8 years of experience in software development and with data structures/algorithms (e.g., C++ or Python).
  • 5 years of experience with Machine Learning (ML) design and ML infrastructure (e.g., model deployment, model evaluation, data processing, debugging, fine-tuning).
  • Experience working with GPUs.
  • Experience in a technical leadership role leading project teams and setting technical direction.

Preferred qualifications:

  • Master’s degree or PhD in Engineering, Computer Science, or a related technical field.
  • Experience with compiler optimization, code generation, and runtime systems for GPU architectures (OpenXLA, MLIR, Triton, etc.).
  • Expertise in tailoring algorithms and ML models to exploit GPU strengths and minimize weaknesses.
  • Knowledge of low-level GPU programming (CUDA, OpenCL, etc.) and performance tuning techniques.
  • Understanding of modern GPU architectures, memory hierarchies, and performance bottlenecks.
  • Ability to develop and utilize sophisticated performance models and benchmarks to guide optimization efforts and hardware roadmap decisions.

About the job

Google's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google’s needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full stack as we continue to push technology forward.

In recognition of hardware as a strength, Google’s Core Machine Learning (ML) organization is heavily invested in growing a powerhouse team of GPU experts, and we invite you to be at its vanguard! This is your opportunity to move beyond incremental improvements and architect truly transformative solutions, shaping the future of AI and accelerated computing for Google and the world.

While Google is known for its pioneering work with TPUs, GPUs are an equally vital and rapidly expanding frontier within its machine learning infrastructure. GPUs are indispensable to Google’s ever-evolving landscape for strategic, pragmatic, and performance-driven reasons: ensuring top performance for our ML models, adapting to ML workloads, achieving results, and influencing next-generation GPU architectures via strategic partnerships.

The ML, Systems, & Cloud AI (MSCA) organization at Google designs, implements, and manages the hardware, software, machine learning, and systems infrastructure for all Google services (Search, YouTube, etc.) and Google Cloud. Our end users are Googlers, Cloud customers and the billions of people who use Google services around the world.

We prioritize security, efficiency, and reliability across everything we do - from developing our latest TPUs to running a global network, while driving towards shaping the future of hyperscale computing. Our global impact spans software and hardware, including Google Cloud’s Vertex AI, the leading AI platform for bringing Gemini models to enterprise customers.

Responsibilities

  • Build optimizations that not only improve benchmarks but also power Google's most critical products and services, impacting billions of users worldwide and driving significant cloud business.
  • Shape the entire GPU software stack through influencing model design, optimizing low-level kernels and compilers (OpenXLA, JAX, Triton), and bridging the gap between model developers and hardware for optimal co-design and performance.
  • Resolve the most challenging performance bottlenecks and explore groundbreaking optimization techniques through Google’s unparalleled access to the latest generation of GPUs, tooling, and a decade of experience building AI accelerators. 
  • Collaborate with experts in ML, compiler design, and systems architecture through internal and external partnerships, as well as open-source projects. 

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.
