Software Engineer III, AI/ML, Cloud AI Infrastructure

1 Day ago • 2-4 Years • Artificial Intelligence

Job Summary

Job Description

Google is seeking a Software Engineer III to focus on AI/ML and Cloud AI infrastructure. Responsibilities include measuring and optimizing AI/ML model performance on Google Cloud, identifying and resolving performance bottlenecks, developing training and demos, contributing to product improvement through bug fixes and code enhancements, and conducting performance profiling and troubleshooting. The role requires collaboration with internal teams and adherence to best practices. The ideal candidate will have experience with software development, data structures and algorithms, ML infrastructure, and cloud services (particularly GCP).
Must have:
  • 2+ years software development experience
  • Experience with data structures/algorithms
  • Measure and optimize AI/ML model performance
  • Identify and resolve performance bottlenecks
  • Develop high-quality training and demos
Good to have:
  • Master's/PhD in CS or related field
  • 4+ years software development experience
  • 1 year experience with ML infrastructure
  • Proficient in GCP cloud services

Job Details


Minimum qualifications:

  • Bachelor’s degree or equivalent practical experience.
  • 2 years of experience with software development in one or more programming languages, or 1 year of experience with an advanced degree.
  • 2 years of experience with data structures or algorithms.

Preferred qualifications:

  • Master's degree or PhD in Computer Science or related technical fields.
  • 4 years of experience in software development using one or more programming languages, with expertise in data structures and algorithms.
  • 1 year of experience with ML infrastructure or performance.
  • Proficient in cloud services such as Compute, Storage, and Networking, particularly on Google Cloud Platform.

About the job

Google's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google’s needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full-stack as we continue to push technology forward.

The ML, Systems, & Cloud AI (MSCA) organization at Google designs, implements, and manages the hardware, software, machine learning, and systems infrastructure for all Google services (Search, YouTube, etc.) and Google Cloud. Our end users are Googlers, Cloud customers and the billions of people who use Google services around the world.

We prioritize security, efficiency, and reliability across everything we do - from developing our latest TPUs to running a global network, while driving towards shaping the future of hyperscale computing. Our global impact spans software and hardware, including Google Cloud’s Vertex AI, the leading AI platform for bringing Gemini models to enterprise customers.

Responsibilities

  • Measure and optimize AI/ML model performance on Google Cloud infrastructure.
  • Identify and resolve performance bottlenecks, collaborating with internal infrastructure teams to enhance support for demanding AI workloads as needed.
  • Develop and deliver high-quality training and demos for both customers and internal teams.
  • Contribute to ongoing product improvement by identifying bugs, recommending enhancements, and writing and testing production-quality code.
  • Conduct in-depth performance profiling, debugging, and troubleshooting of training and inference workloads, ensuring adherence to best practices through design and code reviews.

Similar Jobs

Meetelise - Junior Research Scientist

Meetelise

(On-Site)
6 Months ago
Google - Business Data Scientist

Google

Chicago, Illinois, United States (On-Site)
1 Week ago
Microsoft - Member of Technical Staff, AI - Pre-Training

Microsoft

Redmond, Washington, United States (On-Site)
1 Month ago
NVIDIA - Machine Learning Intern - 2025

NVIDIA

(On-Site)
3 Months ago
Netflix - Machine Learning Intern, Fall 2025

Netflix

Los Gatos, California, United States (On-Site)
1 Week ago
Google - Director, Development, Ads Safety, Platform and Experiences

Google

Los Angeles, California, United States (On-Site)
1 Week ago
Google - Technical Program Manager III, AI/ML, Cloud AI Systems

Google

Austin, Texas, United States (On-Site)
1 Week ago
Virtuos - Senior Machine Learning Engineer (Game)

Virtuos

Malaysia (On-Site)
2 Weeks ago
DNEG - Head of Machine Learning

DNEG

London, England, United Kingdom (Remote)
1 Month ago
Tencent - Senior Researcher: Artificial General Intelligence (Natural Language Processing)

Tencent

Washington, United States (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Google - Software Engineer II, Mobile, Android Settings

Google

Bucharest, Bucharest, Romania (On-Site)
1 Week ago
NVIDIA - Senior Research Engineer for Reinforcement Learning

NVIDIA

Santa Clara, California, United States (On-Site)
3 Months ago
Google - Site Reliability Manager, Platforms and Devices, SRE

Google

Bengaluru, Karnataka, India (On-Site)
1 Week ago
Google - Senior Software Engineer, Infrastructure, Platforms Infrastructure Engineering

Google

Sunnyvale, California, United States (On-Site)
1 Week ago
Sleeper - Senior Frontend Engineer (Mobile)

Sleeper

Las Vegas, Nevada, United States (On-Site)
1 Month ago
ByteDance - Software Development Engineer (SDN Traffic Intelligence & Control)

ByteDance

Seattle, Washington, United States (On-Site)
1 Month ago
Scopely - Senior Software Engineer (PHP)

Scopely

Bengaluru, Karnataka, India (Hybrid)
3 Months ago
Google - Software Engineer II, Cloud AI Agent Space Backend

Google

Warsaw, Masovian Voivodeship, Poland (On-Site)
1 Week ago
Cricketpedia - Backend Engineer - PHP only

Cricketpedia

Gurugram, Haryana, India (Remote)
2 Years ago
Google - Staff Software Engineer, Site Reliability Engineering

Google

Dublin, County Dublin, Ireland (On-Site)
1 Week ago

Get notifed when new similar jobs are uploaded

Jobs in New Taipei, New Taipei City, Taiwan

Google - Senior CPU Subsystem RTL Design Engineer, Silicon

Google

New Taipei, New Taipei City, Taiwan (On-Site)
1 Day ago
Google - Senior Bluetooth Firmware Engineer

Google

New Taipei, New Taipei City, Taiwan (On-Site)
1 Day ago
NVIDIA - Software Engineering Intern, Autonomous Vehicles (RDSS)

NVIDIA

Taipei City, Taiwan (On-Site)
3 Months ago
Google - Product Design Engineer, Pixel Camera

Google

New Taipei, New Taipei City, Taiwan (On-Site)
1 Day ago
Corsair - Technical Marketing Manager – Gaming Marketing

Corsair

Taipei City, Taiwan (On-Site)
2 Weeks ago
NVIDIA - Solutions Architect

NVIDIA

Taipei City, Taiwan (On-Site)
3 Months ago
Google - Silicon Quality and Reliability Engineer

Google

Taipei City, Taiwan (On-Site)
1 Week ago
Google - Firmware Engineer, AS Layer 3, Modem Reliability Engineering

Google

New Taipei City, Taiwan (On-Site)
1 Week ago
Netflix - Senior Software Engineer, Partner Engineering - APAC

Netflix

Hsinchu, Hsinchu City, Taiwan (On-Site)
6 Months ago
Google - SoC ATE Test Engineer

Google

Taipei City, Taiwan (On-Site)
1 Day ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

NVIDIA - Senior Research Scientist, Multimodal Foundation Models and Robotics

NVIDIA

Santa Clara, California, United States (On-Site)
3 Weeks ago
NVIDIA - Senior Solutions Architect, Autonomous Vehicles and Robotics

NVIDIA

Santa Clara, California, United States (On-Site)
3 Weeks ago
ByteDance - Research Scientist- Foundation Model, Generative AI

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago
Zazz - Machine Learning Engineer

Zazz

(Remote)
2 Months ago
DraftKings - Technical Business Analyst

DraftKings

Boston, Massachusetts, United States (On-Site)
5 Days ago
ByteDance - Student Researcher (Doubao (Seed) - Foundation Model - Generative AI)

ByteDance

Seattle, Washington, United States (On-Site)
2 Months ago
Microsoft - Senior Researcher

Microsoft

Singapore (On-Site)
1 Week ago
ByteDance - AI Model Optimization Engineer

ByteDance

San Jose, California, United States (On-Site)
1 Month ago
Microsoft - Director - Responsible AI

Microsoft

Redmond, Washington, United States (On-Site)
2 Weeks ago

Get notifed when new similar jobs are uploaded

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.

Dublin, County Dublin, Ireland (On-Site)

New York, New York, United States (On-Site)

Waterloo, Ontario, Canada (On-Site)

Taipei City, Taiwan (On-Site)

San Francisco, California, United States (On-Site)

Saint-Ghislain, Wallonia, Belgium (On-Site)

Bengaluru, Karnataka, India (On-Site)

Austin, Texas, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Google

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug