Software Engineer, GPU System, Google Cloud Platforms

7 Hours ago • 2 Years + • Research & Development

Job Summary

Job Description

This Software Engineer role at Google focuses on developing, integrating, debugging, and validating Data Center Graphics Processing Units (GPUs) system software. Responsibilities include resolving GPU machine issues, integrating and validating GPU kernel drivers and firmware, collaborating with cross-functional teams to improve reliability and stability, writing software architecture specifications, and developing comprehensive test suites. The position requires experience in embedded systems, C/C++ programming, Linux/Unix environments, and working with GPUs and related peripherals. The ideal candidate will also possess skills in developer operations, release management, and scripting languages.
Must have:
  • Bachelor's degree in CS or related field
  • 2+ years embedded system software development experience
  • 2+ years C/C++ coding experience
  • Linux/Unix development environment experience
  • GPU system software development, integration, debugging, and validation
Good to have:
  • Experience with developer operations, release management, and integration testing
  • Device driver design and development for peripherals (GPUs, PCIe, I2C, USB)
  • Python or scripting language experience
  • Open source development experience

Job Details


Minimum qualifications:

  • Bachelor's degree in Computer Science, a related technical field, or equivalent practical experience.
  • 2 years of experience in embedded system software development.
  • 2 years of experience coding in C or C++.
  • Experience with Linux/Unix development environments.

Preferred qualifications:

  • Experience with developer operations, release management, and integration testing.
  • Experience with designing and developing device drivers for peripherals (e.g., GPUs, PCIe Switches, and connectivity buses like I2C, USB, PCIe).
  • Experience coding in Python or with scripting languages (e.g., shell).
  • Experience with open source development.

About the job

Google's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google’s needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full-stack as we continue to push technology forward.

The ML, Systems, & Cloud AI (MSCA) organization at Google designs, implements, and manages the hardware, software, machine learning, and systems infrastructure for all Google services (Search, YouTube, etc.) and Google Cloud. Our end users are Googlers, Cloud customers and the billions of people who use Google services around the world.

We prioritize security, efficiency, and reliability across everything we do - from developing our latest TPUs to running a global network, while driving towards shaping the future of hyperscale computing. Our global impact spans software and hardware, including Google Cloud’s Vertex AI, the leading AI platform for bringing Gemini models to enterprise customers.

Responsibilities

  • Develop, integrate, debug and validate Data Center Graphics Processing Units (GPUs) system software, resolve Data Center GPU machines issues.
  • Integrate and validate GPU kernel drivers and firmware, enabling GPU Software bundle on the Data Center machines.
  • Collaborate with Google Cloud cross-teams to enable software and solve the issues, improve Data Center GPU machines reliability, stability and repairability.
  • Write detailed specifications for software architecture and GPU systems we build.
  • Develop comprehensive test suites that enable unit, integration, and system level testing of our system software.

Similar Jobs

ByteDance - Security Operation Engineer, Security Assurance

ByteDance

Singapore (On-Site)
1 Month ago
ByteDance - Linux System Engineer

ByteDance

London, England, United Kingdom (On-Site)
3 Months ago
Werplay - QA Engineer

Werplay

Islamabad, Islamabad Capital Territory, Pakistan (On-Site)
3 Months ago
Paytm - Devops - Senior DevOps Engineer

Paytm

Noida, Uttar Pradesh, India (On-Site)
6 Months ago
Epic Games - Senior DevOps Programmer

Epic Games

Vancouver, British Columbia, Canada (On-Site)
2 Days ago
GreenWave™ Radios - Tech Lead, Design Verification

GreenWave™ Radios

Bengaluru, Karnataka, India (On-Site)
6 Months ago
NVIDIA - ASIC Design Engineer

NVIDIA

Toronto, Ontario, Canada (On-Site)
1 Week ago
Google - Software Engineer (For Women in Tech Candidates)

Google

State Of Minas Gerais, Brazil (On-Site)
3 Months ago
NVIDIA - Senior ASIC Verification and Infrastructure Engineer – GPU

NVIDIA

Austin, Texas, United States (Hybrid)
4 Weeks ago
Tesla - Jr. SPS Programmer

Tesla

Rhineland-Palatinate, Germany (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

ByteDance - Research Scientist, Multimodality

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago
Feld Entertainment - Body Refurb Technician

Feld Entertainment

Ellenton, Florida, United States (On-Site)
6 Months ago
NVIDIA - System Test Design Engineer

NVIDIA

(Remote)
2 Months ago
Zeta - Site Reliability Engineer I (Payzapp)

Zeta

Bengaluru, Karnataka, India (On-Site)
6 Months ago
DEVOTEAM - Tech Lead Java

DEVOTEAM

Casablanca, Casablanca-Settat, Morocco (Remote)
6 Months ago
Zoox - Site Reliability Engineer

Zoox

Foster City, California, United States (Hybrid)
6 Months ago
ByteDance - Software Development Engineer - Cloud Native Databases

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
The Workshop - Data Software Engineer

The Workshop

Málaga, Andalusia, Spain (Hybrid)
7 Months ago
NVIDIA - Interconnect Failure Analysis Hardware Engineer

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
2 Months ago
NVIDIA - Senior SWQA Test Development Engineer

NVIDIA

Shanghai, Shanghai, China (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Taipei City, Taiwan

Trend Micro - Sr. Engineer

Trend Micro

Taipei City, Taiwan (On-Site)
6 Months ago
WildBrain - Senior Licensing Manager

WildBrain

Taipei City, Taiwan (Hybrid)
1 Day ago
WongDoody - SERVICE DESIGNER, HONG KONG

WongDoody

Taipei City, Taiwan (On-Site)
6 Months ago
Corsair - IE Engineer

Corsair

Taoyuan City, Taiwan (On-Site)
3 Weeks ago
PwC - C.顧問類-ESG經理/協理(氣候變遷&企業永續發展管理)

PwC

Taipei City, Taiwan (On-Site)
6 Months ago
Corsair - Supply Chain Planner

Corsair

Taipei City, Taiwan (On-Site)
3 Weeks ago
NVIDIA - Senior Generalist Software Engineer -- Omniverse

NVIDIA

Taipei City, Taiwan (On-Site)
2 Months ago
Visa - Senior Consulting Manager, Visa Managed Services

Visa

Taipei City, Taiwan (On-Site)
6 Months ago
Garena - Data Engineer (Warehouse)

Garena

Taipei City, Taiwan (On-Site)
2 Days ago
Appier - Software Engineer, Site Reliability Engineering

Appier

Taipei City, Taiwan (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

Research & Development Jobs

ByteDance - Software Engineer - Machine Learning Training

ByteDance

Singapore (On-Site)
2 Days ago
The Walt Disney Company - Software Engineer, Platform

The Walt Disney Company

California, United States (On-Site)
3 Weeks ago
Google - Software Engineer III, Embedded Systems/Firmware, Google TV

Google

Taipei City, Taiwan (On-Site)
8 Hours ago
NVIDIA - Senior Firmware Design Engineer

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (Hybrid)
2 Months ago
NVIDIA - Senior Post Silicon Hardware Engineer

NVIDIA

Santa Clara, California, United States (Hybrid)
1 Month ago
Tencent - Senior Regional Game Operation Manager

Tencent

London, England, United Kingdom (On-Site)
1 Month ago
NVIDIA - Senior Physical Design Backend Engineer

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
2 Weeks ago
Google - Lead CPU Design Verification Engineer, Silicon

Google

Austin, Texas, United States (On-Site)
8 Hours ago
Ubisoft - Architecte de Stockage

Ubisoft

Montreal, Quebec, Canada (On-Site)
4 Months ago
Zoox - Senior/Staff Software Engineer - HIL Sensor Simulation

Zoox

Foster City, California, United States (Hybrid)
5 Months ago

Get notifed when new similar jobs are uploaded

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.

Dublin, County Dublin, Ireland (On-Site)

Sunnyvale, California, United States (On-Site)

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)

Warsaw, Masovian Voivodeship, Poland (On-Site)

Hyderabad, Telangana, India (On-Site)

Sunnyvale, California, United States (On-Site)

Sydney, New South Wales, Australia (On-Site)

Waterloo, Ontario, Canada (On-Site)

View All Jobs

Get notified when new jobs are added by Google

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug