Staff Software Engineer, GPU Performance, Google Scale

2 Hours ago • 8-13 Years • Artificial Intelligence • $197,000 PA - $291,000 PA

Job Summary

Job Description

This Staff Software Engineer role focuses on GPU performance optimization at Google scale. Responsibilities include building optimizations for critical Google products and services, shaping the entire GPU software stack (influencing model design, optimizing low-level kernels and compilers), managing performance bottlenecks, and collaborating with ML, compiler, and systems architecture teams. The ideal candidate will have extensive experience in software development, testing, ML design and infrastructure, GPU programming, and performance tuning. They will utilize Google's resources and work across teams to improve benchmarks and drive cloud business growth.
Must have:
  • 8+ years software development experience
  • 5+ years testing and launching software
  • 5+ years ML design and infrastructure experience
  • Experience with GPUs
  • Optimize performance, improve benchmarks
Good to have:
  • Low-level GPU programming (CUDA, OpenCL)
  • Compiler optimization experience
  • Knowledge of modern GPU architectures
  • Performance modeling and benchmarking skills
Perks:
  • Bonus
  • Equity
  • Benefits

Job Details


Minimum qualifications:

  • Bachelor's degree or equivalent practical experience.
  • 8 years of experience in software development.
  • 5 years of experience testing, and launching software products, and 3 years of experience with software design and architecture.
  • 5 years of experience with ML design and ML infrastructure (e.g., model deployment, model evaluation, data processing, debugging, fine tuning).
  • Experience working with GPUs.

Preferred qualifications:

  • Experience in low-level GPU programming (e.g., CUDA, OpenCL, etc.) and performance tuning techniques.
  • Experience with compiler optimization, code generation, and runtime systems for GPU architectures (e.g., OpenXLA, MLIR, Triton, etc.).
  • Experience in algorithms and ML models to leverage GPUs.
  • Knowledge of modern GPU architectures (e.g., NVIDIA, AMD, etc.), memory hierarchies, and performance bottlenecks.
  • Ability to develop and utilize performance models and benchmarks to guide optimization efforts and hardware roadmap decisions.

About the job

Google Cloud's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google Cloud's needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. You will anticipate our customer needs and be empowered to act like an owner, take action and innovate. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full-stack as we continue to push technology forward.

In this role, you will support the future of AI and accelerate computing for Google.

The ML, Systems, & Cloud AI (MSCA) organization at Google designs, implements, and manages the hardware, software, machine learning, and systems infrastructure for all Google services (Search, YouTube, etc.) and Google Cloud. Our end users are Googlers, Cloud customers and the billions of people who use Google services around the world.

We prioritize security, efficiency, and reliability across everything we do - from developing our latest TPUs to running a global network, while driving towards shaping the future of hyperscale computing. Our global impact spans software and hardware, including Google Cloud’s Vertex AI, the leading AI platform for bringing Gemini models to enterprise customers.

The US base salary range for this full-time position is $197,000-$291,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.

Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about .

Responsibilities

  • Build optimizations that improve benchmarks, but also power Google's most critical products and services, impacting billions of users worldwide and driving cloud business growth.
  • Shape the entire GPU software stack through influencing model design, optimizing low-level kernels and compilers (e.g., OpenXLA, JAX, Triton, etc.), and bridging the gap between model developers and hardware for optimal co-design and performance.
  • Manage performance bottlenecks in tests and explore optimization techniques through Google’s unparalleled access to the latest generation of GPUs, tools, and build AI accelerators.
  • Collaborate with ML, compiler design, and systems architecture teams through internal and external partnerships, as well as open-source projects.

Similar Jobs

Google - Senior Software Engineer, AI/ML, Google Cloud Technical Infrastructure

Google

Kirkland, Washington, United States (On-Site)
1 Day ago
Blitz app - Senior Software Engineer (C++)

Blitz app

India (Remote)
1 Month ago
Google - Software Engineer II, Health and Trackers, Data Foundation

Google

Warsaw, Masovian Voivodeship, Poland (On-Site)
1 Day ago
RoofStack - Software Architect

RoofStack

İstanbul, İstanbul, Türkiye (On-Site)
3 Weeks ago
Google - Software Engineer, System Software, Pixel

Google

New Taipei, New Taipei City, Taiwan (On-Site)
1 Day ago
Google - Staff Software Engineer, AI/ML GenAI, Google Ads

Google

Mountain View, California, United States (On-Site)
1 Day ago
QUANTIC DREAM - Programmeur Intelligence Artificielle

QUANTIC DREAM

Montreal, Quebec, Canada (Hybrid)
6 Months ago
Ubisoft - Senior ML Data Scientist

Ubisoft

Montreal, Quebec, Canada (On-Site)
3 Months ago
Google - Field Solutions Architect, Generative AI, Google Cloud

Google

Madrid, Community Of Madrid, Spain (On-Site)
1 Day ago
Google - Conversational AI Consultant

Google

Buenos Aires, Buenos Aires, Argentina (On-Site)
1 Day ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Google - Software Developer III, AI/ML Natural Language Processing, Google Workspace

Google

Waterloo, Ontario, Canada (On-Site)
1 Day ago
ByteDance - Quality Assurance Engineer Graduate (Global E-commerce-US) - 2025 Start (BS/MS)

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
ByteDance - Student Researcher (Doubao (Seed) - Foundation Model - MultiModal Generative Model)

ByteDance

San Jose, California, United States (On-Site)
3 Weeks ago
Fairmatic - Senior Data Scientist

Fairmatic

Tel Aviv-Yafo, Tel Aviv District, Israel (Hybrid)
6 Months ago
ByteDance - Student Researcher (Doubao (Seed) - Music Foundation Model) - 2025 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
Google - Staff Software Engineer, AI/ML, Google Workspace

Google

Sunnyvale, California, United States (On-Site)
1 Day ago
Google - Senior Staff Software Engineer, GPU Performance, Google Scale

Google

Sunnyvale, California, United States (On-Site)
1 Day ago
Google - Staff Software Engineer, Embedded Systems

Google

Sunnyvale, California, United States (On-Site)
1 Day ago
CloudLinux - Senior Go Developer for Imunify360

CloudLinux

Masovian Voivodeship, Poland (Remote)
3 Weeks ago
Avathon - Software Engineering Manager

Avathon

Bengaluru, Karnataka, India (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Kirkland, Washington, United States

Next Level Business Services - IIB, DP, ODM Admin

Next Level Business Services

Burbank, California, United States (On-Site)
6 Months ago
Google - Technical Program Manager III, Security Compliance, Google Cloud

Google

Reston, Virginia, United States (On-Site)
1 Day ago
Google - Revenue Accelerator Specialist, Apps and Gaming

Google

New York, New York, United States (On-Site)
2 Hours ago
Starkflow - AI Product Engineer

Starkflow

San Francisco, California, United States (On-Site)
1 Month ago
Probably Monsters - Principal Player Combat & Gameplay Designer

Probably Monsters

Washington, District Of Columbia, United States (On-Site)
4 Months ago
Netflix - Analytics Engineer (L5) - Consumer Insights DSE

Netflix

Los Gatos, California, United States (On-Site)
1 Day ago
ByteDance - Software Engineer in Large Model System Graduate (Machine Learning Sys-US) - 2024 Start (BS/MS)

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
Riot Games - Principal VFX Artist - Unpublished R&D Product

Riot Games

Los Angeles, California, United States (On-Site)
2 Months ago
Fluence - Sales Engineer/Senior Sales Engineer - Battery Energy Storage

Fluence

San Francisco, California, United States (Hybrid)
6 Months ago
DraftKings - Customer Experience Associate

DraftKings

United States (Remote)
6 Days ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

NVIDIA - Machine Learning Engineer Intern - 2025

NVIDIA

Beijing, Beijing, China (On-Site)
1 Month ago
Windranger Labs - Technical AI Researcher

Windranger Labs

Singapore (On-Site)
3 Weeks ago
Keywords Studios - Research Associate - AI

Keywords Studios

(Remote)
3 Weeks ago
NVIDIA - Senior Applied LLM Engineer, AI – Chip Design

NVIDIA

Santa Clara, California, United States (On-Site)
2 Months ago
Google - Software Engineer, Machine Learning, Edge Tensor Processing Unit

Google

Bengaluru, Karnataka, India (On-Site)
1 Day ago
ByteDance - Student Researcher (Doubao (Seed) - Foundation Model AI Platform) - 2025 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
Hedra - Applied Research Scientist

Hedra

San Francisco, California, United States (On-Site)
3 Weeks ago
Keywords Studios - Research Associate - AI

Keywords Studios

(Remote)
4 Weeks ago
Trend Micro - Large Language Models (LLM) Expert (VicOne_Automotive Security)

Trend Micro

Taipei City, Taiwan (On-Site)
6 Months ago
Google - Sensitive Content Analyst, Emerging AI Strategy

Google

Austin, Texas, United States (On-Site)
1 Day ago

Get notifed when new similar jobs are uploaded

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.

Seoul, South Korea (On-Site)

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)

Hyderabad, Telangana, India (On-Site)

Atlanta, Georgia, United States (On-Site)

Fremont, California, United States (On-Site)

Milan, Lombardy, Italy (On-Site)

Eemshaven, Groningen, Netherlands (On-Site)

Bengaluru, Karnataka, India (On-Site)

Sunnyvale, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Google

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug