Software Engineer III, AI/ML

2 Weeks ago • 3-5 Years • Artificial Intelligence • $141,000 PA - $202,000 PA

Job Summary

Job Description

Google is seeking a Software Engineer III, AI/ML to develop and optimize custom GPU kernels using Pallas/Mosaic for NVIDIA GPUs to improve LLM inference performance. The role involves working on a project critical to Google's needs, with opportunities to switch teams and projects. Candidates need experience in software development (in one or more programming languages), ML infrastructure (model deployment, evaluation, optimization, data processing, debugging), GPU programming, and LLMs. The ideal candidate also has experience with NVIDIA GPU architecture, writing or optimizing CUDA kernels, and optimizing inference performance for LLMs such as Gemini. Experience with JAX/Pallas/Mosaic or Triton is preferred.
Must have:
  • Bachelor's degree or equivalent experience
  • 2+ years software development experience
  • 1+ year experience with speech/audio, reinforcement learning, ML infrastructure, or other ML field
  • 1+ year experience with ML infrastructure (model deployment, evaluation, etc.)
  • GPU programming experience
  • LLM experience
Good to have:
  • Master's/PhD in Computer Science/Engineering
  • NVIDIA GPU architecture, performance, and profiling experience
  • NVIDIA GPU CUDA kernel writing/optimization experience
  • LLM inference performance optimization experience
  • Performance and resource optimization experience
  • Experience with JAX/Pallas/Mosaic or Triton

Job Details

Minimum qualifications:

  • Bachelor’s degree or equivalent practical experience.
  • 2 years of experience with software development in one or more programming languages, or 1 year of experience with an advanced degree.
  • 1 year of experience with one or more of the following: Speech/audio (e.g., technology duplicating and responding to the human voice), reinforcement learning (e.g., sequential decision making), ML infrastructure, or specialization in another ML field.
  • 1 year of experience with ML infrastructure (e.g., model deployment, model evaluation, optimization, data processing, debugging).
  • Experience with GPU programming.
  • Experience with LLMs (large language models).

Preferred qualifications:

  • Master's or PhD in Computer Science or Computer Engineering or equivalent practical experience.
  • Experience with NVIDIA GPU architecture, performance and profiling.
  • Experience writing or optimizing NVIDIA GPU CUDA kernels.
  • Experience optimizing inference performance for LLMs such as Gemini or open-source models.
  • Experience in performance and resource optimization.
  • Experience with JAX/Pallas/Mosaic or Triton.

About the job

Google's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google’s needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full stack as we continue to push technology forward.

The ML, Systems, & Cloud AI (MSCA) organization at Google designs, implements, and manages the hardware, software, machine learning, and systems infrastructure for all Google services (Search, YouTube, etc.) and Google Cloud. Our end users are Googlers, Cloud customers and the billions of people who use Google services around the world.

We prioritize security, efficiency, and reliability across everything we do - from developing our latest TPUs to running a global network, while driving towards shaping the future of hyperscale computing. Our global impact spans software and hardware, including Google Cloud’s Vertex AI, the leading AI platform for bringing Gemini models to enterprise customers.

The US base salary range for this full-time position is $141,000-$202,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.

Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits.

Responsibilities

  • Develop and optimize custom GPU kernels using Pallas/Mosaic for NVIDIA GPUs to improve LLM inference performance.

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.
