Software Engineer, GPU System, Google Cloud Platforms

4 Days ago • 2 Years + • Research & Development

Job Summary

Job Description

This Software Engineer role at Google focuses on developing, integrating, debugging, and validating Data Center Graphics Processing Units (GPUs) system software. Responsibilities include resolving GPU machine issues, integrating and validating GPU kernel drivers and firmware, collaborating with cross-functional teams to improve reliability and stability, writing software architecture specifications, and developing comprehensive test suites. The position requires experience in embedded systems, C/C++ programming, Linux/Unix environments, and working with GPUs and related peripherals. The ideal candidate will also possess skills in developer operations, release management, and scripting languages.
Must have:
  • Bachelor's degree in CS or related field
  • 2+ years embedded system software development experience
  • 2+ years C/C++ coding experience
  • Linux/Unix development environment experience
  • GPU system software development, integration, debugging, and validation
Good to have:
  • Experience with developer operations, release management, and integration testing
  • Device driver design and development for peripherals (GPUs, PCIe, I2C, USB)
  • Python or scripting language experience
  • Open source development experience

Job Details


Minimum qualifications:

  • Bachelor's degree in Computer Science, a related technical field, or equivalent practical experience.
  • 2 years of experience in embedded system software development.
  • 2 years of experience coding in C or C++.
  • Experience with Linux/Unix development environments.

Preferred qualifications:

  • Experience with developer operations, release management, and integration testing.
  • Experience with designing and developing device drivers for peripherals (e.g., GPUs, PCIe Switches, and connectivity buses like I2C, USB, PCIe).
  • Experience coding in Python or with scripting languages (e.g., shell).
  • Experience with open source development.

About the job

Google's software engineers develop the next-generation technologies that change how billions of users connect, explore, and interact with information and one another. Our products need to handle information at massive scale, and extend well beyond web search. We're looking for engineers who bring fresh ideas from all areas, including information retrieval, distributed computing, large-scale system design, networking and data storage, security, artificial intelligence, natural language processing, UI design and mobile; the list goes on and is growing every day. As a software engineer, you will work on a specific project critical to Google’s needs with opportunities to switch teams and projects as you and our fast-paced business grow and evolve. We need our engineers to be versatile, display leadership qualities and be enthusiastic to take on new problems across the full-stack as we continue to push technology forward.

The ML, Systems, & Cloud AI (MSCA) organization at Google designs, implements, and manages the hardware, software, machine learning, and systems infrastructure for all Google services (Search, YouTube, etc.) and Google Cloud. Our end users are Googlers, Cloud customers and the billions of people who use Google services around the world.

We prioritize security, efficiency, and reliability across everything we do - from developing our latest TPUs to running a global network, while driving towards shaping the future of hyperscale computing. Our global impact spans software and hardware, including Google Cloud’s Vertex AI, the leading AI platform for bringing Gemini models to enterprise customers.

Responsibilities

  • Develop, integrate, debug and validate Data Center Graphics Processing Units (GPUs) system software, resolve Data Center GPU machines issues.
  • Integrate and validate GPU kernel drivers and firmware, enabling GPU Software bundle on the Data Center machines.
  • Collaborate with Google Cloud cross-teams to enable software and solve the issues, improve Data Center GPU machines reliability, stability and repairability.
  • Write detailed specifications for software architecture and GPU systems we build.
  • Develop comprehensive test suites that enable unit, integration, and system level testing of our system software.

Similar Jobs

Epic Games - Machine Learning Ops Engineer

Epic Games

London, England, United Kingdom (On-Site)
3 Months ago
NVIDIA - Senior SRAM Engineer, Circuit Design

NVIDIA

Canada (Hybrid)
1 Month ago
NVIDIA - Senior Test Product Engineer

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
3 Months ago
Google - Senior Bluetooth Firmware Engineer

Google

New Taipei, New Taipei City, Taiwan (On-Site)
4 Days ago
Samsung Semiconductor - Senior Staff Engineer, Process Integration

Samsung Semiconductor

San Jose, California, United States (Hybrid)
6 Days ago
ByteDance - Linux System Engineer

ByteDance

London, England, United Kingdom (On-Site)
1 Week ago
NVIDIA - Software Engineering Intern, Autonomous Vehicles (RDSS)

NVIDIA

Taipei City, Taiwan (On-Site)
1 Month ago
Hawk Eye Innovations - Senior Computer Vision Engineer

Hawk Eye Innovations

Budapest, Hungary (Hybrid)
3 Weeks ago
Google - Software Engineer, People with Disabilities

Google

Belo Horizonte, State Of Minas Gerais, Brazil (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Epic Games - Senior DevOps Programmer

Epic Games

Vancouver, British Columbia, Canada (On-Site)
1 Week ago
ByteDance - Site Reliability Engineer, ML System

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago
NVIDIA - Senior DevOps Engineer, Deep Learning Frameworks

NVIDIA

Warsaw, Masovian Voivodeship, Poland (On-Site)
3 Months ago
Interactive Brokers - Software Developer - C++

Interactive Brokers

Greenwich, Connecticut, United States (On-Site)
6 Months ago
Google - Bluetooth Firmware Engineer

Google

New Taipei, New Taipei City, Taiwan (On-Site)
4 Days ago
Paytm - DevOps Engineer/Senior DevOps-Paytm Money

Paytm

Bengaluru, Karnataka, India (On-Site)
4 Months ago
Google - CPU Design Verification Engineer

Google

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
4 Days ago
Google - Senior Software Engineer, Databases Site Reliability Engineering

Google

Dublin, County Dublin, Ireland (On-Site)
4 Days ago
ByteDance - Site Reliability Engineer, Edge Services

ByteDance

Boston, Massachusetts, United States (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Taipei City, Taiwan

NVIDIA - System Software Engineer - Embedded and Automotive (RDSS Intern)

NVIDIA

Taipei City, Taiwan (On-Site)
2 Weeks ago
Google - Firmware Engineer, Pixel Modem

Google

New Taipei, New Taipei City, Taiwan (On-Site)
4 Days ago
Google - AICore Software Engineer

Google

Taipei City, Taiwan (On-Site)
4 Days ago
Google - Program Manager, Product Data Management

Google

Taipei City, Taiwan (On-Site)
1 Day ago
Microsoft - Security Sales Specialist

Microsoft

Taipei City, Taiwan (Hybrid)
5 Days ago
Appier - Campaign Analyst (US) 02:00 AM-11:00 AM working hours

Appier

Taipei City, Taiwan (On-Site)
5 Months ago
Google - Software Engineer, Backend, Pixel Camera AI Experiences

Google

New Taipei, New Taipei City, Taiwan (On-Site)
4 Days ago
NVIDIA - Research Scientist, Deep Learning and Computer Vision

NVIDIA

Hsinchu, Hsinchu City, Taiwan (On-Site)
1 Month ago
NVIDIA - Safety Engineer

NVIDIA

Taipei City, Taiwan (On-Site)
2 Months ago
Google - Software Engineer, AICore, Knowledge and Information

Google

Taipei City, Taiwan (On-Site)
4 Days ago

Get notifed when new similar jobs are uploaded

Research & Development Jobs

NVIDIA - Software Manager, DOCA Verification

NVIDIA

Ra'anana, Center District, Israel (On-Site)
3 Months ago
Samsung Semiconductor - Staff Software Engineer – Storage Systems & Protocols

Samsung Semiconductor

San Jose, California, United States (Hybrid)
2 Months ago
Google - Silicon Quality and Reliability Engineer

Google

Taipei City, Taiwan (On-Site)
3 Days ago
NVIDIA - Senior System Software Architect, HPC Networking

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
3 Months ago
Ubisoft - Lead R&D Programmer - La Forge

Ubisoft

Montreal, Quebec, Canada (Hybrid)
4 Days ago
NVIDIA - Senior Math Libraries Engineer - Dense Linear Algebra

NVIDIA

California, United States (Hybrid)
3 Months ago
Netflix - Software Engineer L6 - Server Platform Architect

Netflix

United States (Remote)
5 Days ago
NVIDIA - CAD Engineer

NVIDIA

Santa Clara, California, United States (On-Site)
1 Month ago
ByteDance - Research Scientist, Vision Foundation Model

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
NVIDIA - Senior Networking Electrical Validation Engineer

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
2 Weeks ago

Get notifed when new similar jobs are uploaded

About The Company

A problem isn't truly solved until it's solved for all. Googlers build products that help create opportunities for everyone, whether down the street or across the globe. Bring your insight, imagination and a healthy disregard for the impossible. Bring everything that makes you unique. Together, we can build for everyone.

Fremont, California, United States (On-Site)

Mountain View, California, United States (On-Site)

Bengaluru, Karnataka, India (On-Site)

Dublin, County Dublin, Ireland (On-Site)

Atlanta, Georgia, United States (On-Site)

San Francisco, California, United States (On-Site)

Fremont, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Google

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug