Senior Staff Software Engineer, GPU Performance, Google Scale

8 Hours ago • 8-8 Years • Artificial Intelligence • $248,000 PA - $349,000 PA

Job Summary

Job Description

This Senior Staff Software Engineer role focuses on GPU performance at Google scale. Responsibilities include architecting accelerated computing, building optimizations for critical products and services, shaping the GPU software stack (OpenXLA, JAX, Triton), managing performance bottlenecks, and collaborating cross-functionally. The role requires expertise in GPU programming, compiler optimization, and ML model optimization for GPU architectures. The engineer will impact users and drive significant cloud business by improving benchmarks and bridging the gap between model developers and hardware for optimal co-design and performance.
Must have:
  • 8+ years software development experience
  • 7+ years leading technical projects, ML design, and working with industry-scale ML infrastructure
  • 5+ years testing and launching software products
  • GPU programming experience
  • Compiler optimization experience
Good to have:
  • Experience with OpenXLA, MLIR, Triton
  • Knowledge of CUDA, OpenCL
  • Understanding of modern GPU architectures
  • Ability to develop performance models and benchmarks
Perks:
  • Bonus
  • Equity
  • Benefits

Job Details


Minimum qualifications:

  • Bachelor's degree or equivalent practical experience.
  • 8 years of experience in software development.
  • 7 years of experience leading technical project strategy, ML design, and working with industry-scale ML infrastructure (e.g., model deployment, model evaluation, data processing, debugging, fine tuning).
  • 5 years of experience testing, and launching software products, and 3 years of experience with software design and architecture.
  • Experience working with GPUs.

Preferred qualifications:

  • Experience with compiler optimization, code generation, and runtime systems for GPU architectures (e.g., OpenXLA, MLIR, Triton, etc.).
  • Knowledge of low-level GPU programming (e.g., CUDA, OpenCL, etc.) and performance tuning techniques.
  • Understanding of modern GPU architectures (e.g., NVIDIA, AMD, etc.), memory hierarchies, and performance bottlenecks.
  • Ability to tailor algorithms and ML models to utilize GPU strengths.
  • Ability to develop and utilize performance models and benchmarks to guide optimization efforts and hardware roadmap decisions.

About the job

Google Cloud's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google Cloud's needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. You will anticipate our customer needs and be empowered to act like an owner, take action and innovate. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full-stack as we continue to push technology forward.

Graphics Processing Unit (GPUs) are indispensable to Google’s ever-evolving landscape for strategic, pragmatic, and performance-driven reasons, ensuring top performance for our Machine Learning models, adapting to machine learning workloads, achieving results, and influencing next-generation GPU architectures via strategic partnerships. While known for pioneering work with the Tensor Processing Unit (TPUs), GPUs are an important and rapidly expanding frontier within Google's machine learning infrastructure.

The ML, Systems, & Cloud AI (MSCA) organization at Google designs, implements, and manages the hardware, software, machine learning, and systems infrastructure for all Google services (Search, YouTube, etc.) and Google Cloud. Our end users are Googlers, Cloud customers and the billions of people who use Google services around the world.

We prioritize security, efficiency, and reliability across everything we do - from developing our latest TPUs to running a global network, while driving towards shaping the future of hyperscale computing. Our global impact spans software and hardware, including Google Cloud’s Vertex AI, the leading AI platform for bringing Gemini models to enterprise customers.

The US base salary range for this full-time position is $248,000-$349,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.

Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about .

Responsibilities

  • Help architect the future of accelerated computing.
  • Build optimizations that improve benchmarks, and power Google's most critical products and services, impacting users and driving significant cloud business.
  • Shape the entire GPU software stack through influencing model design, optimizing low-level kernels and compilers (OpenXLA, JAX, Triton), and bridging the gap between model developers and hardware for optimal co-design and performance.
  • Manage challenging performance bottlenecks and explore optimization techniques through Google’s access to the latest generation of GPUs, tooling, and experience building AI accelerators.
  • Collaborate cross-functionally with machine learning, compiler design, and systems architecture through internal and external partnerships, as well as open-source projects.

Similar Jobs

Blind Squirrel Games - Technical Director

Blind Squirrel Games

Auckland, Auckland, New Zealand (On-Site)
4 Months ago
Google - Senior Software Engineer, Full Stack, Google Cloud

Google

Warsaw, Masovian Voivodeship, Poland (On-Site)
10 Hours ago
Regent Craft - Flight Controls Engineering Intern

Regent Craft

North Kingstown, Rhode Island, United States (On-Site)
6 Months ago
Fluence - Jr. Controls Engineer (m/f/d) - German speaker

Fluence

Erlangen, Bavaria, Germany (Hybrid)
6 Months ago
NVIDIA - Deep Learning Engineer, Generative AI and 3D Reconstruction

NVIDIA

Tokyo, Japan (On-Site)
2 Months ago
NVIDIA - Machine Learning Software Platform Architect

NVIDIA

Santa Clara, California, United States (On-Site)
2 Months ago
Zoox - Senior Machine Learning Engineer - Collision Avoidance System

Zoox

Foster City, California, United States (Hybrid)
6 Months ago
AI Fund - Curriculum Developer

AI Fund

(Remote)
6 Months ago
ByteDance - Student Researcher (Doubao (Seed) - Foundation Model - MultiModal Generative Model)

ByteDance

San Jose, California, United States (On-Site)
3 Weeks ago
Netflix - Software Engineer L4/L5, Training Platform, Machine Learning Platform

Netflix

California, United States (Remote)
3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Google - Software Engineer III, Front End, Google Cloud

Google

Ramat Gan, Tel Aviv District, Israel (On-Site)
10 Hours ago
Snowed In Studios - Principle Software Developer

Snowed In Studios

Quebec, Canada (Remote)
4 Weeks ago
Google - Senior Software Engineer, Google Cloud Networking

Google

Bengaluru, Karnataka, India (On-Site)
9 Hours ago
ByteDance - Backend Software Engineer

ByteDance

Dubai, Dubai, United Arab Emirates (On-Site)
3 Weeks ago
Google - Staff Software Engineer, Infrastructure, Google Cloud

Google

Cambridge, Massachusetts, United States (On-Site)
5 Months ago
Flying Wild Hog - Animation Programmer

Flying Wild Hog

(Remote)
1 Month ago
Google - Site Reliability Engineer, Databases

Google

Bengaluru, Karnataka, India (On-Site)
9 Hours ago
Electronic Arts - Data Science Engineer

Electronic Arts

Hyderabad, Telangana, India (On-Site)
2 Days ago
Google - Software Engineer III, Google Play

Google

Mountain View, California, United States (On-Site)
10 Hours ago
Trendyol - Data Science Professionals - Trendyol GO

Trendyol

İzmir, İzmir, Türkiye (Hybrid)
5 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Sunnyvale, California, United States

King - Senior Sales Account Executive

King

New York, New York, United States (Hybrid)
18 Hours ago
The Walt Disney Company - Oiler - Truck Driver

The Walt Disney Company

Anaheim, California, United States (On-Site)
2 Days ago
Rockstar Games - Senior Production Coordinator: Motion Capture

Rockstar Games

New York, New York, United States (On-Site)
2 Weeks ago
ByteDance - Frontend Software Engineer Intern (Web and Client - Global Payment)

ByteDance

San Jose, California, United States (On-Site)
3 Weeks ago
Epic Games - Concept Outsource Lead

Epic Games

United States (On-Site)
2 Days ago
Google - Software Engineering Manager II, Infrastructure, Google Cloud Security and Privacy

Google

Sunnyvale, California, United States (On-Site)
9 Hours ago
Microsoft - Member of Technical Staff, Machine Learning Engineer

Microsoft

Mountain View, California, United States (Hybrid)
13 Hours ago
The Walt Disney Company - Senior Software Engineer (Project Hire)

The Walt Disney Company

Celebration, Florida, United States (On-Site)
2 Days ago
Next Level Business Services - Sr. SAP Hana Modeler

Next Level Business Services

Bentonville, Arkansas, United States (On-Site)
6 Months ago
Google - Failure Analysis Engineer, Materials and Defect Analysis

Google

Fremont, California, United States (On-Site)
9 Hours ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Google - Product Manager, Assurance Evaluations, Google Cloud

Google

Sunnyvale, California, United States (On-Site)
11 Hours ago
Microsoft - Principal Researcher-Artificial Specialized Intelligence

Microsoft

Vancouver, British Columbia, Canada (On-Site)
19 Hours ago
CloudHire - ML Engineer

CloudHire

Telangana, India (Remote)
4 Weeks ago
ByteDance - Research Scientist- Foundation Model, Video Generation

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
Google - Senior Software Engineer, AI/ML, Google Cloud Technical Infrastructure

Google

Kirkland, Washington, United States (On-Site)
11 Hours ago
Google - Field Solutions Architect, Generative AI, Google Cloud

Google

Berlin, Berlin, Germany (On-Site)
8 Hours ago
NVIDIA - Technical Marketing Engineer - AI Platform Software

NVIDIA

Canada (Hybrid)
1 Month ago
DNEG - Head of Machine Learning

DNEG

London, England, United Kingdom (Remote)
3 Weeks ago
Google - Staff Software Engineer, Applied AI

Google

Kraków, Lesser Poland Voivodeship, Poland (On-Site)
9 Hours ago
Google - Software Engineering Manager, Visual Language and Multimodal Modeling

Google

Sydney, New South Wales, Australia (On-Site)
9 Hours ago

Get notifed when new similar jobs are uploaded

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.

Bucharest, Bucharest, Romania (On-Site)

Bengaluru, Karnataka, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

Dublin, County Dublin, Ireland (On-Site)

Bengaluru, Karnataka, India (On-Site)

Sunnyvale, California, United States (On-Site)

Sunnyvale, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Google

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug