Senior Software Engineer, GPU Performance, Google Scale

7 Hours ago • 5-8 Years • Artificial Intelligence • $166,000 PA - $244,000 PA

Job Summary

Job Description

This Senior Software Engineer role focuses on GPU performance at Google scale. Responsibilities include building optimizations for critical products and services, shaping the GPU software stack (influencing model design, optimizing low-level kernels and compilers), managing performance bottlenecks, and collaborating with experts in ML, compiler design, and systems architecture. The role requires expertise in GPU programming, performance tuning, and ML infrastructure. The engineer will leverage Google's resources and contribute to groundbreaking optimization techniques, impacting billions of users.
Must have:
  • 5+ years software development experience
  • 3+ years ML infrastructure experience
  • 3+ years testing/launching software
  • GPU programming experience
  • Performance optimization expertise
Good to have:
  • Low-level GPU programming (CUDA, OpenCL)
  • Compiler optimization experience
  • Experience with OpenXLA, MLIR, Triton
  • Performance modeling and benchmarking
  • Knowledge of modern GPU architectures
Perks:
  • Bonus
  • Equity
  • Benefits

Job Details


Minimum qualifications:

  • Bachelor’s degree or equivalent practice experience.
  • 5 years of experience with software development in one or more programming languages, and with data structures/algorithms.
  • 3 years of experience testing, maintaining, or launching software products, and 1 year of experience with software design and architecture.
  • 3 years of experience with ML infrastructure (e.g., model deployment, model evaluation, optimization, data processing, debugging).
  • Experience working with GPUs.

Preferred qualifications:

  • Experience in algorithms and ML models to leverage GPUs effectively.
  • Experience in low-level GPU programming (e.g., CUDA, OpenCL, etc.) and performance tuning techniques.
  • Experience with compiler optimization, code generation, and runtime systems for GPU architectures (e.g., OpenXLA, MLIR, Triton, etc).
  • Ability to develop and utilize sophisticated performance models and benchmarks to guide optimization efforts and hardware roadmap decisions.
  • Knowledge of modern GPU architectures (e.g., NVIDIA, AMD, etc.), memory hierarchies, and performance bottlenecks.

About the job

Google Cloud's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google Cloud's needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. You will anticipate our customer needs and be empowered to act like an owner, take action and innovate. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full-stack as we continue to push technology forward.

Google is known for its pioneering work with TPUs, GPUs are equally vital and rapidly expanding within its machine learning infrastructure. GPUs are indispensable to Google’s diverse and ever-evolving landscape for strategic, pragmatic, and performance-driven reasons ensuring performance for ML models, adapting to diverse ML workloads, achieving results, and influencing next-generation GPU architectures through strategic partnerships

In recognition of hardware diversity as a strength, Google’s Core ML organization is heavily invested in growing a powerhouse team of GPU experts.

In this role, you will shape the future of AI and accelerate computing for Google.

The ML, Systems, & Cloud AI (MSCA) organization at Google designs, implements, and manages the hardware, software, machine learning, and systems infrastructure for all Google services (Search, YouTube, etc.) and Google Cloud. Our end users are Googlers, Cloud customers and the billions of people who use Google services around the world.

We prioritize security, efficiency, and reliability across everything we do - from developing our latest TPUs to running a global network, while driving towards shaping the future of hyperscale computing. Our global impact spans software and hardware, including Google Cloud’s Vertex AI, the leading AI platform for bringing Gemini models to enterprise customers.

The US base salary range for this full-time position is $166,000-$244,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process. Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about .

Responsibilities

  • Build optimizations that improve benchmarks, but also power Google's most critical products and services, impacting billions of users worldwide and driving significant cloud business growth.
  • Shape the entire GPU software stack through influencing model design, optimizing low-level kernels and compilers (e.g., OpenXLA, JAX, Triton, etc.), and bridging the gap between model developers and hardware for optimal co-design and performance.
  • Manage performance bottlenecks in tests and explore groundbreaking optimization techniques through Google’s unparalleled access to the latest generation of GPUs, tools, and over a decade of experience in building AI accelerators.
  • Collaborate with some of the resourceful minds in ML, compiler design, and systems architecture through internal and external partnerships, as well as open-source projects.

Similar Jobs

ByteDance - Content Insights Analyst, Lifestyle - Lemon8

ByteDance

Los Angeles, California, United States (On-Site)
2 Days ago
Google - Staff Software Engineer, Google Cloud Platforms

Google

Kirkland, Washington, United States (On-Site)
3 Months ago
NVIDIA - Software Advanced Developer

NVIDIA

Washington, United States (On-Site)
2 Months ago
ByteDance - Software Engineer, Global Payment

ByteDance

San Jose, California, United States (On-Site)
3 Weeks ago
Epic Games - Software Engineer, Developer Relations

Epic Games

Seoul, South Korea (On-Site)
2 Months ago
Appier - Software Engineer, Machine Learning Platform

Appier

Taipei City, Taiwan (On-Site)
5 Months ago
Google - Field Solutions Architect, Generative AI, Google Cloud

Google

Stockholm, Stockholm County, Sweden (On-Site)
7 Hours ago
NVIDIA - Senior Deep Learning Performance Architect

NVIDIA

Canada (On-Site)
1 Month ago
NVIDIA - Senior System Networking Engineer, InfiniBand

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Google - Senior Software Engineer, Software Development Life Cycle

Google

New Taipei, New Taipei City, Taiwan (On-Site)
8 Hours ago
NVIDIA - Senior ASIC Verification Engineer - GPU Memory Subsystem

NVIDIA

Durham, North Carolina, United States (On-Site)
3 Weeks ago
Google - Software Engineer II, Device Integrity

Google

Zürich, Zurich, Switzerland (On-Site)
7 Hours ago
ByteDance - Research Scientist in Large Model System

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago
Rackspace Technology - Sr Big Data Engineer - Oozie and Pig (GCP)

Rackspace Technology

United States (Remote)
1 Week ago
Twitch - Software Engineer

Twitch

San Francisco, California, United States (On-Site)
1 Month ago
Zoox - Machine Learning Engineer - Collision Avoidance System

Zoox

Foster City, California, United States (Hybrid)
6 Months ago
ByteDance - Research Scientist/Engineer, Large Language Model - 2025 Start

ByteDance

Singapore (On-Site)
4 Months ago
KBG Blockchain Game Studios - Blockchain Developer (BSC)

KBG Blockchain Game Studios

Thành Phố Hồ Chí Minh, Vietnam (On-Site)
9 Months ago
ByteDance - Software Development Engineer - Distributed NoSQL Database Systems

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Sunnyvale, California, United States

Crunchyroll - iOS Engineering Manager

Crunchyroll

San Francisco, California, United States (Remote)
4 Months ago
Fluence - Product Manager - Battery Systems

Fluence

Houston, Texas, United States (Hybrid)
6 Months ago
Netflix - Manager, Performance Marketing

Netflix

Los Angeles, California, United States (On-Site)
13 Hours ago
The Walt Disney Company - Disney Culinary Program Alumni 2025

The Walt Disney Company

Florida, United States (On-Site)
2 Months ago
Varonis  - Cloud Security Architect

Varonis

United States (Remote)
1 Month ago
NVIDIA - Senior System Software Engineer, Robotics Simulation

NVIDIA

Santa Clara, California, United States (Hybrid)
1 Month ago
Light Speed Studios - Associate Producer

Light Speed Studios

California, United States (On-Site)
6 Days ago
CD PROJEKT RED - Engineering Director, Engine

CD PROJEKT RED

Boston, Massachusetts, United States (On-Site)
1 Week ago
Saviynt - Principal Engineer, Quality Engineering

Saviynt

El Segundo, California, United States (Hybrid)
6 Months ago
WebFX - Jr. Business Data Analyst

WebFX

Harrisburg, Pennsylvania, United States (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Google - Group Product Manager, Generative AI, Google Cloud AI

Google

Sunnyvale, California, United States (On-Site)
8 Hours ago
Tencent - UA Manager - AI Integrated

Tencent

Shenzhen, Guangdong Province, China (On-Site)
1 Month ago
Microsoft - Applied Researcher II

Microsoft

Redmond, Washington, United States (On-Site)
16 Hours ago
PlayStation Global - Staff Machine Learning Engineer, Enterprise Enablement

PlayStation Global

Carlsbad, California, United States (On-Site)
3 Weeks ago
Meta - Software Engineer, Machine Learning

Meta

Austin, Texas, United States (Remote)
2 Days ago
NetEase Games - Game AI Research Leader

NetEase Games

Singapore (On-Site)
3 Weeks ago
Meta - Software Engineer, Machine Learning

Meta

United States (Remote)
8 Hours ago
ByteDance - Research Engineer - Multimodal Model

ByteDance

Singapore (On-Site)
5 Months ago
Meta - Research Scientist, Machine Learning (PhD)

Meta

Sunnyvale, California, United States (On-Site)
5 Months ago
Google - Senior Software Engineer, Machine Learning, Google Ads

Google

Los Angeles, California, United States (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.

Dublin, County Dublin, Ireland (On-Site)

Sunnyvale, California, United States (On-Site)

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)

Warsaw, Masovian Voivodeship, Poland (On-Site)

Hyderabad, Telangana, India (On-Site)

Sunnyvale, California, United States (On-Site)

Sydney, New South Wales, Australia (On-Site)

Waterloo, Ontario, Canada (On-Site)

View All Jobs

Get notified when new jobs are added by Google

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug