Diagnostic Software Manager - Server

2 Months ago • 8 Years + • Research & Development

Job Summary

Job Description

NVIDIA seeks a Diag Software Manager - Server to lead a team of software engineers responsible for developing and improving system stress applications for its data center products. This role involves collaborating with cross-functional teams (architecture, ASIC, systems engineering, operations) to create software that rigorously tests GPU servers in customer and partner environments. Responsibilities include managing multiple concurrent projects, mentoring engineers, recruiting new talent, driving root cause analysis, and developing long-term team strategies to address future challenges. The ideal candidate possesses strong system software expertise (8+ years), team management experience (4+ years), and proficiency in C/C++ and Python. The position is crucial for improving product quality and production efficiency, directly impacting NVIDIA's gross margin.
Must have:
  • 8+ years system software experience
  • 4+ years team management experience
  • Proficiency in C/C++
  • Strong system design skills
  • Understanding of computer architecture
Good to have:
  • Python programming
  • GPU compute/server tech knowledge
  • Experience with BMC, InfiniBand, PCIe, NVLink
  • RAS software engineering experience
Perks:
  • Competitive salary
  • Generous benefits package

Job Details

We seek a manager to lead all aspects of a team of software engineers tasked with improving and crafting a collection of system stress applications tailored for NVIDIA's forthcoming data center products, operational within customer and partner infrastructures. Our focus lies in crafting software that subjects GPU servers to the most thorough testing scenarios imaginable. Our team collaborates closely with architecture, ASIC, systems engineering, and operations teams to devise methodologies aimed at pushing every hardware component to its limits. Situated at the core of NVIDIA's data center enterprise, from GPU baseboards to standalone servers and entire clusters, we are responsible for developing the comprehensive suite of system stress applications. We partner with NVIDIA operations teams to find an efficient balance between product quality, test yield, and manufacturing efficiency. Wouldn't you want to be a key factor in NVIDIA's gross margin?

What you will be doing:

  • Collaborate with cross-functional teams on NPI projects, and improve and refine software deployed on our customers' servers and environments, facilitating detailed identification of hardware or software issues.

  • As the manager, you will run multiple concurrent projects through active prioritization and communication.

  • On the engineer management side, we want the manager to continue to groom future technical leaders in the team and recruit new talent.

  • Continuous improvement is another area of responsibility. We look for candidates who are proactive and seek opportunities to improve NVIDIA product quality and production efficiency.

  • We also need our candidates to be reactive: able to drive root cause analysis of critical issues and embrace corrective actions.

  • Finally, we need our leaders to develop long-range strategies for the team to prepare for new challenges and drive execution.

What we need to see:

  • Bachelor of Science in Computer Science, Computer Engineering, or Electrical Engineering (or equivalent experience).

  • 8+ years of overall system software experience with 4+ years of team management experience, a deep understanding of software development principles, and comfort working in a large codebase and a deep driver stack.

  • Good system design skills

  • Good programming skills in C/C++; Python programming is a plus.

  • Solid understanding of computer architecture, operating systems, kernel drivers, and device programming.

  • Experience driving feature development and multi-team debug.

Ways to stand out from the crowd:

  • Knowledge of GPU compute or server product technologies like BMC (Baseboard Management Controller), InfiniBand, PCIe, NVLink.

  • Extensive experience collaborating with customer software teams

  • Strong experience engineering software with RAS considerations

  • Comfortable with the unknown and with change

With competitive salaries and a generous benefits package, NVIDIA is widely considered to be one of the most desirable employers in the world. We have some of the most brilliant and talented people in the world working for us. If you are creative, autonomous and love a challenge, we want to hear from you. We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

#LI-Hybrid 

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.
