Software Engineer III, AI/ML, Cloud AI Infrastructure

2 Days ago • 2-4 Years • Artificial Intelligence

Job Summary

Job Description

Google is seeking a Software Engineer III to focus on AI/ML and Cloud AI infrastructure. Responsibilities include measuring and optimizing AI/ML model performance on Google Cloud, identifying and resolving performance bottlenecks, developing training and demos, contributing to product improvement through bug fixes and code enhancements, and conducting performance profiling and troubleshooting. The role requires collaboration with internal teams and adherence to best practices. The ideal candidate will have experience with software development, data structures and algorithms, ML infrastructure, and cloud services (particularly GCP).
Must have:
  • 2+ years software development experience
  • Experience with data structures/algorithms
  • Measure and optimize AI/ML model performance
  • Identify and resolve performance bottlenecks
  • Develop high-quality training and demos
Good to have:
  • Master's/PhD in CS or related field
  • 4+ years software development experience
  • 1 year experience with ML infrastructure
  • Proficient in GCP cloud services

Job Details


Minimum qualifications:

  • Bachelor’s degree or equivalent practical experience.
  • 2 years of experience with software development in one or more programming languages, or 1 year of experience with an advanced degree.
  • 2 years of experience with data structures or algorithms.

Preferred qualifications:

  • Master's degree or PhD in Computer Science or related technical fields.
  • 4 years of experience in software development using one or more programming languages, with expertise in data structures and algorithms.
  • 1 year of experience with ML infrastructure or performance.
  • Proficient in cloud services such as Compute, Storage, and Networking, particularly on Google Cloud Platform.

About the job

Google's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google’s needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full-stack as we continue to push technology forward.

The ML, Systems, & Cloud AI (MSCA) organization at Google designs, implements, and manages the hardware, software, machine learning, and systems infrastructure for all Google services (Search, YouTube, etc.) and Google Cloud. Our end users are Googlers, Cloud customers and the billions of people who use Google services around the world.

We prioritize security, efficiency, and reliability across everything we do - from developing our latest TPUs to running a global network, while driving towards shaping the future of hyperscale computing. Our global impact spans software and hardware, including Google Cloud’s Vertex AI, the leading AI platform for bringing Gemini models to enterprise customers.

Responsibilities

  • Measure and optimize AI/ML model performance on Google Cloud infrastructure.
  • Identify and resolve performance bottlenecks, collaborating with internal infrastructure teams to enhance support for demanding AI workloads as needed.
  • Develop and deliver high-quality training and demos for both customers and internal teams.
  • Contribute to ongoing product improvement by identifying bugs, recommending enhancements, and writing and testing production-quality code.
  • Conduct in-depth performance profiling, debugging, and troubleshooting of training and inference workloads, ensuring adherence to best practices through design and code reviews.

Similar Jobs

Fliff  Inc  - Data Scientist

Fliff Inc

Austin, Texas, United States (On-Site)
9 Months ago
NVIDIA - Principal Engineer

NVIDIA

(Remote)
2 Months ago
General arcade studio - Senior C++ Developer

General arcade studio

(Remote)
1 Day ago
Netflix - Research Engineer (L4) - Member Lifecycle and Monetization

Netflix

United States (Remote)
2 Weeks ago
Google - Software Engineering Manager II, Namespaces Site Reliability Engineering

Google

Dublin, County Dublin, Ireland (On-Site)
2 Days ago
Google - Intel Strategist, Scaled Intel Collection, Trust and Safety

Google

Austin, Texas, United States (On-Site)
1 Week ago
GT - AI/ML Engineer

GT

(Remote)
1 Month ago
Gameopedia - Data Scientist

Gameopedia

Norway (Hybrid)
1 Month ago
NVIDIA - AI Computing Software Development Engineer, TensorRT

NVIDIA

Hsinchu, Hsinchu City, Taiwan (On-Site)
2 Months ago
Mashgin - Senior Software Engineer, Computer Vision and Deep Learning

Mashgin

Palo Alto, California, United States (Hybrid)
6 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

ByteDance - Research Scientist for Generative AI, LLM and Multimodal 【Talent Spotters】

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
NVIDIA - Silicon Performance, Power, and Binning Tools Engineer

NVIDIA

Shanghai, Shanghai, China (On-Site)
1 Month ago
Twitch - Software Engineer I - iOS

Twitch

Seattle, Washington, United States (On-Site)
3 Months ago
Google - Staff Software Engineer, Site Reliability Engineering

Google

Poland (On-Site)
2 Days ago
ByteDance - Senior Site Reliability Engineer - Applied Machine Learning

ByteDance

San Jose, California, United States (On-Site)
1 Month ago
GoMotive - Engineering Manager, Full Stack (MarTech)

GoMotive

(Remote)
1 Day ago
Socialpoint - Senior Software Engineer (GameOps Tools)

Socialpoint

Barcelona, Catalonia, Spain (Hybrid)
2 Weeks ago
ByteDance - Data Engineer, Cloud and System

ByteDance

Seattle, Washington, United States (On-Site)
1 Month ago
Google - Wearable Telemetry Tech Lead

Google

Bucharest, Bucharest, Romania (On-Site)
2 Weeks ago
Ubisoft - Senior ML Data Scientist

Ubisoft

Montreal, Quebec, Canada (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in New Taipei, New Taipei City, Taiwan

NVIDIA - Data Center NPI Program Manager

NVIDIA

Hsinchu, Hsinchu City, Taiwan (On-Site)
4 Weeks ago
Google - Software Engineer, Linux Embedded Systems, Silicon

Google

New Taipei, New Taipei City, Taiwan (On-Site)
2 Weeks ago
NVIDIA - Senior Memory Engineer

NVIDIA

Taipei City, Taiwan (On-Site)
1 Month ago
Google - Lead Software Engineer, Engineering Productivity

Google

New Taipei, New Taipei City, Taiwan (On-Site)
2 Days ago
Google - Operations Program Manager, NPI

Google

Taipei City, Taiwan (On-Site)
2 Days ago
AI Fund - Frontend Engineer

AI Fund

Taipei City, Taiwan (Hybrid)
6 Months ago
Appier - Machine Learning Scientist

Appier

Taipei City, Taiwan (On-Site)
9 Hours ago
Google - Technical Solutions Consultant, Android TV Partner Engineering

Google

Taipei City, Taiwan (On-Site)
2 Weeks ago
Corsair - Mechanical Engineer

Corsair

Taipei City, Taiwan (On-Site)
1 Month ago
Google - Firmware Engineer, Pixel Modem

Google

New Taipei, New Taipei City, Taiwan (On-Site)
2 Weeks ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Interface AI - Sr. Implementation Engineer

Interface AI

United States (Remote)
5 Months ago
Google - Sales Specialist, Generative AI, SMB, Google Cloud

Google

Austin, Texas, United States (On-Site)
2 Days ago
Google - Software Engineer III, Education AI Platform

Google

Mexico City, Mexico City, Mexico (On-Site)
2 Days ago
Hedra - Senior Research Engineer

Hedra

New York, New York, United States (On-Site)
1 Month ago
Google - Senior Staff Software Engineer, BigQuery Generative AI

Google

Kirkland, Washington, United States (On-Site)
2 Days ago
Canva - Senior Machine Learning Engineer - Photo AI

Canva

Vienna, Vienna, Austria (Remote)
3 Months ago
Electronic Arts - Senior Manager, Generative AI Software Engineering

Electronic Arts

Orlando, Florida, United States (On-Site)
3 Weeks ago
Canva - Senior Computer Vision Engineer - Photo AI

Canva

London, England, United Kingdom (Remote)
3 Months ago
Genies - Machine Learning Engineer, Character Animation & Motion AI

Genies

San Mateo, California, United States (On-Site)
1 Month ago
ByteDance - Backend Engineer (Model Inference), Machine Learning Systems

ByteDance

Singapore (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.

Mountain View, California, United States (On-Site)

Mountain View, California, United States (On-Site)

Bengaluru, Karnataka, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

View All Jobs

Get notified when new jobs are added by Google

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug