Software Engineer III, AI/ML

7 Hours ago • 3-5 Years • Artificial Intelligence • $141,000 PA - $202,000 PA

Job Summary

Job Description

Google is seeking a Software Engineer III, AI/ML to develop and optimize custom GPU kernels using Pallas/Mosaic for NVIDIA GPUs to enhance LLM model inference performance. The role involves working on projects critical to Google's needs, offering opportunities for team and project changes. Responsibilities include leveraging experience in software development (multiple languages), ML infrastructure (model deployment, evaluation, optimization, data processing, debugging), GPU programming, and LLMs. The ideal candidate will possess experience with NVIDIA GPU architecture, CUDA kernel writing/optimization, and inference performance optimization for LLMs like Gemini. Experience with Jax/Pallas/Mosaic or Triton is preferred.
Must have:
  • Bachelor's degree or equivalent experience
  • 2+ years software development experience
  • 1+ year experience with speech/audio, reinforcement learning, ML infrastructure, or other ML field
  • 1+ year experience with ML infrastructure (model deployment, evaluation, etc.)
  • GPU programming experience
  • LLM experience
Good to have:
  • Master's/PhD in Computer Science/Engineering
  • NVIDIA GPU architecture, performance, and profiling experience
  • NVIDIA GPU CUDA kernel writing/optimization experience
  • LLM inference performance optimization experience
  • Performance and resource optimization experience
  • Experience with Jax/Pallas/Mosaic or Triton

Job Details

Minimum qualifications:

  • Bachelor’s degree or equivalent practical experience.
  • 2 years of experience with software development in one or more programming languages, or 1 year of experience with an advanced degree.
  • 1 year of experience with one or more of the following: Speech/audio (e.g., technology duplicating and responding to the human voice), reinforcement learning (e.g., sequential decision making), ML infrastructure, or specialization in another ML field.
  • 1 year of experience with ML infrastructure (e.g., model deployment, model evaluation, optimization, data processing, debugging).
  • Experience with GPU programming.
  • Experience with LLMs (large language models).

Preferred qualifications:

  • Master's or PhD in Computer Science or Computer Engineering or equivalent practical experience.
  • Experience with NVIDIA GPU architecture, performance and profiling.
  • Experience writing or optimizing NVIDIA GPU CUDA kernels.
  • Experience optimizing inference performance for LLM models like Gemini or other open source models.
  • Experience in performance and resource optimization.
  • Experience with Jax/Pallas/Mosaic or Triton.

About the job

Google's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google’s needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full-stack as we continue to push technology forward.

The ML, Systems, & Cloud AI (MSCA) organization at Google designs, implements, and manages the hardware, software, machine learning, and systems infrastructure for all Google services (Search, YouTube, etc.) and Google Cloud. Our end users are Googlers, Cloud customers and the billions of people who use Google services around the world.

We prioritize security, efficiency, and reliability across everything we do - from developing our latest TPUs to running a global network, while driving towards shaping the future of hyperscale computing. Our global impact spans software and hardware, including Google Cloud’s Vertex AI, the leading AI platform for bringing Gemini models to enterprise customers.

The US base salary range for this full-time position is $141,000-$202,000 + bonus + equity + benefits. Our salary ranges are determined by role, level, and location. Within the range, individual pay is determined by work location and additional factors, including job-related skills, experience, and relevant education or training. Your recruiter can share more about the specific salary range for your preferred location during the hiring process.

Please note that the compensation details listed in US role postings reflect the base salary only, and do not include bonus, equity, or benefits. Learn more about .

Responsibilities

  • Develop/optimize custom GPU kernels using Pallas/Mosaic for NVIDIA GPU to improve LLM model inference performance.

Similar Jobs

ByteDance - Student Researcher (Foundation Models - Reasoning, Planning & Agent - Doubao (Seed)) - 2025 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
Krafton  - Head of Deep Learning PM & Ops Dept.

Krafton

Seoul, South Korea (On-Site)
3 Weeks ago
ByteDance - Student Researcher (Doubao (Seed) - Foundation Model - MultiModal Generative Model)

ByteDance

San Jose, California, United States (On-Site)
2 Days ago
ByteDance - Seed - LLM Performance Operation Analyst (Non-safety)

ByteDance

Singapore (On-Site)
4 Months ago
Krafton  - Applied Research Engineer - Reinforcement Learning

Krafton

Seoul, South Korea (On-Site)
3 Weeks ago
Microsoft - Applied Researcher II

Microsoft

Redmond, Washington, United States (On-Site)
16 Hours ago
Lionbridge Games - Games Language AI Specialist (Linguist)

Lionbridge Games

Masovian Voivodeship, Poland (On-Site)
1 Month ago
Microsoft - Member of Technical Staff – Voice & Vision

Microsoft

Mountain View, California, United States (Hybrid)
3 Weeks ago
Meta - Software Engineer, Machine Learning

Meta

Burlingame, California, United States (On-Site)
5 Months ago
Resemble AI - Deep Learning Speech Researcher

Resemble AI

Mountain View, California, United States (On-Site)
8 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

DNEG - Head of Machine Learning

DNEG

London, England, United Kingdom (Remote)
2 Months ago
Microsoft - Member of Technical Staff, AI

Microsoft

Mountain View, California, United States (On-Site)
3 Weeks ago
Meta - AI Research Scientist, Language - Generative AI

Meta

New York, New York, United States (On-Site)
5 Months ago
ByteDance - Senior Site Reliability Engineer, ML System - Foundation Model

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
PlayStation Global - Staff Machine Learning Engineer, Enterprise Enablement

PlayStation Global

California, United States (On-Site)
3 Months ago
Microsoft - Senior Researcher

Microsoft

Singapore (On-Site)
10 Hours ago
Electronic Arts - Senior Software Engineer

Electronic Arts

Austin, Texas, United States (On-Site)
4 Weeks ago
Google - Staff Software Engineer, AI/ML, Content Safety Platform

Google

State Of Minas Gerais, Brazil (On-Site)
7 Hours ago
Meta - Research Scientist Intern, Language and Multimodal Research for MetaAI (PhD)

Meta

New York, New York, United States (On-Site)
5 Months ago
ByteDance - Student Researcher (Doubao (Seed) - Foundation Model - Video Generative Model)

ByteDance

San Jose, California, United States (On-Site)
3 Weeks ago

Get notifed when new similar jobs are uploaded

Jobs in Sunnyvale, California, United States

Bonfire Studios - HR Business Partner

Bonfire Studios

California, United States (Hybrid)
3 Weeks ago
On Location - Manager, Accounting - Affiliate Program

On Location

Atlanta, Georgia, United States (On-Site)
1 Month ago
ByteDance - Cloud Site Reliability Engineer

ByteDance

Seattle, Washington, United States (On-Site)
2 Days ago
The Walt Disney Company - Outside Machinist - Auto Mechanic

The Walt Disney Company

Anaheim, California, United States (On-Site)
3 Weeks ago
Netflix - Engineering Manager - OnlineDataStores

Netflix

Los Gatos, California, United States (On-Site)
5 Months ago
Inworld AI - Senior Software Development Engineer in Test (SDET) – Game Engine SDKs - USA

Inworld AI

Mountain View, California, United States (On-Site)
5 Months ago
Riot Games - Researcher III - RDS Central User Research Team

Riot Games

Los Angeles, California, United States (On-Site)
3 Weeks ago
Feld Entertainment - Warehouse Associate

Feld Entertainment

Jessup, Maryland, United States (On-Site)
6 Months ago
Kabam - Graphic and Motion Designer

Kabam

Los Angeles, California, United States (Hybrid)
2 Months ago
Crunchyroll - DevOps Engineer - Cloud Reliability

Crunchyroll

San Francisco, California, United States (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Soul AI - Subject Matter Expert (AI Trainer)

Soul AI

Hyderabad, Telangana, India (On-Site)
7 Months ago
Google - Product Strategy and Operations Lead, Machine Learning Systems

Google

Sunnyvale, California, United States (On-Site)
7 Hours ago
Airlab Inc  - Junior Programmer Artificial Intelligence

Airlab Inc

Quebec, Canada (On-Site)
3 Weeks ago
Genies - Machine Learning Engineer: 3D Generative AI

Genies

San Mateo, California, United States (Remote)
5 Months ago
Interface AI - Vice President of Product Management

Interface AI

United States (Remote)
2 Months ago
Google - Staff Cloud Solutions Architect, Rapid Innovation

Google

Reston, Virginia, United States (On-Site)
8 Hours ago
Keywords Studios - AI - Senior Research Associate (Prompts)

Keywords Studios

Silesian Voivodeship, Poland (On-Site)
4 Weeks ago
Canva - Senior Machine Learning Engineer - Photo AI

Canva

Prague, Czechia (Remote)
2 Months ago
Zazz - Machine Learning Engineer

Zazz

(Remote)
2 Months ago

Get notifed when new similar jobs are uploaded

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.

Dublin, County Dublin, Ireland (On-Site)

Sunnyvale, California, United States (On-Site)

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)

Warsaw, Masovian Voivodeship, Poland (On-Site)

Hyderabad, Telangana, India (On-Site)

Sunnyvale, California, United States (On-Site)

Sydney, New South Wales, Australia (On-Site)

Waterloo, Ontario, Canada (On-Site)

View All Jobs

Get notified when new jobs are added by Google

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug