Senior Datacenter Product Development Engineer

3 Days ago • 8 Years + • Quality Assurance • $160,000 PA - $304,750 PA

Job Summary

Job Description

As a Senior Datacenter Product Development Engineer at NVIDIA, you'll collaborate with engineering teams to launch new boards for GPU-accelerated server platforms (HGX/MGX/DGX). This involves planning processes, defining test requirements, optimizing production lines, and ensuring cost and quality targets are met. Responsibilities include developing diagnostic tests, debugging complex hardware/software interactions, ensuring DFx compliance, owning product lifecycles, and creating test specifications. You'll also collaborate with contract manufacturers, analyze data to solve yield problems, and support customer teams during escalations. This role requires deep expertise in high-speed signals, server architectures, and GPU technology.
Must have:
  • 8+ years HW design/validation/manufacturing test experience
  • Proficient in HW interfaces (PCIe Gen4+, InfiniBand, etc.)
  • Strong problem-solving & troubleshooting expertise
  • Experience defining test specs for complex HW systems
  • BS/MS in EE/CE/CS or equivalent
Good to have:
  • Experience in HW board/system electrical design
  • HW device drivers or HW diagnostics software development
  • Proficient in Python or Shell scripting
  • Familiar with FPGA implementation, FW secure-boot
Perks:
  • Competitive salary
  • Generous benefits package

Job Details

NVIDIA Corporation is a world leader in visual computing technology. The GPU, which the company invented, serves as the visual cortex of modern computers and is at the heart of their products and services. NVIDIA has transformed into a specialized platform company that targets four large markets – Gaming, Professional Visualization, Datacenter and Automotive – where visual computing is essential and deeply valued. Their work also uncovers new universes to explore and enable amazing creativity and discovery by powering what was once thought to be science fiction inventions like artificial intelligence and autonomous cars.

Collaborating with your peers across various engineering groups, you will successfully launch new boards for NVIDIA GPU Accelerated Server Platforms (HGX/MGX/DGX) to production. These purpose-built systems are optimized for the growing Deep Learning, Artificial Intelligence, and Analytics environments. With world-class technology enabling never-been-seen-before performance levels, NVIDIA’s HGX/DGX portfolio is arguably the most complicated Server platform ever developed by humans. This product family represents the company’s fastest growing line of business as well as its largest total available market opportunity. You will bring to bear your knowledge of Server architectures, CPU baseboards and GPU technology in order to productize new GPU boards for Server architectures with GPU-accelerated clusters. Your responsibilities will include planning and establishing processes, defining test requirements and optimizing the production line to deliver new NVIDIA GPU boards. You will also be instrumental in helping the team to achieve the desired cost and quality metrics considered best-in-class.

What you will be doing:

  • Leverage your in-depth experience with high-speed signals to plan and develop new diagnostic tests and debug procedures for next gen products.

  • Use your knowledge of system power-up and handshakes during boot to debug complex interactions between HW, FW and SW on faulty boards.

  • Recommend, drive and ensure compliance to DFx requirements for robust signal integrity performance as related to layout, mechanical components, assembly procedures, etc.

  • Own a product or series of products end-to-end through the entire product lifecycle; your role would be to ensure successful production ramps are achieved working through a large matrixed team.

  • Develop and deliver test specs for system level manufacturing screens for all new products to meet the required HW coverage, quality and product requirements for various business units.

  • Collaborate with CM to define product assembly line, number of test stations and number of assembly fixtures, optimized for cost and throughput.

  • Craft creative solutions and WARs through volume data analysis and lab experimentation to solve challenging yield and test problems seen on the production floor.

  • Lead optimization and continuous improvement efforts on the production screen spec definition processes to minimize waste and meet test time, yield, DPPM requirements.

  • Support customer facing and quality teams during customer escalations to understand the issue and fix gaps identified in coverage.

What we need to see:

  • BS/MS in EE/CE/CS or equivalent experience

  • 8+ years of experiences in HW design or diagnostics/validation or manufacturing test of PCIe IPs, Chips or Systems

  • Proficient in HW interfaces, including PCIe (Gen4+), InfiniBand, Ethernet, I3C/I2C, SPI, USB, etc.

  • In depth understanding of HPC server architecture and Out-of-Band management

  • Strong problem-solving and trouble-shooting expertise; and institutionalizing root-cause analysis

  • Experience in defining test and validation specifications for complex HW systems or HPC servers

  • Motivated to continually improve/optimize processes

  • Self-initiative, strong interpersonal skills, and flexibility to adapt to new technologies

Ways to stand out from the crowd:

  • Prior experience in HW board/system electrical design, HW device drivers or HW diagnostics software development

  • On-hand experience in debugging and triaging HW faults using testing equipment and Linux commands/tools

  • Proficient in Python or Shell scripting for HW testing automation and log parsing

  • Familiar with FPGA implementation, FW secure-boot and encrypted images

  • Operations Research/Industrial/Engineering statistics skills

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers; we have some of the most forward-thinking and hardworking people in the world working for us and, due to unparalleled growth, best-in-class teams are rapidly growing. If you’re creative and autonomous with a real passion for your work, we want to hear from you!

The base salary range is 160,000 USD - 304,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

ION - Principal Technical Consultant - Endur

ION

Berlin, Berlin, Germany (On-Site)
6 Months ago
Nielsen Holdings - Senior Software Engineer - Bigdata ( Java / Scala / Python , Spark, SQL , AWS)

Nielsen Holdings

Mumbai, Maharashtra, India (Hybrid)
5 Months ago
NVIDIA - Interconnect Failure Analysis Hardware Engineer

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
2 Months ago
Next Level Business Services - Java/J2EE Developer

Next Level Business Services

Tampa, Florida, United States (On-Site)
5 Months ago
Offworld - DevOps Engineer

Offworld

New Westminster, British Columbia, Canada (On-Site)
1 Month ago
PlaySimple - Associate QA Engineer

PlaySimple

Karnataka, India (On-Site)
6 Months ago
Scorewarrior - Game QA Engineer

Scorewarrior

Limassol, Limassol, Cyprus (On-Site)
2 Months ago
Altair - QA Engineer

Altair

Bengaluru, Karnataka, India (On-Site)
7 Months ago
ByteDance - Senior Test Development Engineer - Global Payment - San Jose

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
Breach - XR Quality Assurance (QA) Lead

Breach

Trondheim, Trøndelag, Norway (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Tencent - Site Reliability Engineer

Tencent

(On-Site)
3 Months ago
Next Level Business Services - SDE Web Developer

Next Level Business Services

Redmond, Washington, United States (On-Site)
5 Months ago
ION - Senior Linux Systems Administrator - Trumbull, CT

ION

Trumbull, Connecticut, United States (Hybrid)
6 Months ago
Power Integrations - IT Support Manager (APAC)

Power Integrations

Penang, Malaysia (On-Site)
5 Months ago
Feld Entertainment - Monster Jam Truck Body Technician

Feld Entertainment

Ellenton, Florida, United States (On-Site)
6 Months ago
PwC - Senior Associate

PwC

Gurugram, Haryana, India (On-Site)
6 Months ago
ByteDance - Cloud Technical Support

ByteDance

Singapore (On-Site)
15 Hours ago
Feld Entertainment - Body Refurb Technician

Feld Entertainment

Ellenton, Florida, United States (On-Site)
6 Months ago
CleverTap - Senior Backend Engineer - Platform

CleverTap

Mumbai, Maharashtra, India (Hybrid)
6 Months ago
ByteDance - Site Reliability Engineer, Edge Services

ByteDance

Seattle, Washington, United States (On-Site)
2 Weeks ago

Get notifed when new similar jobs are uploaded

Jobs in Santa Clara, California, United States

AGS - American Gaming Systems - Field Service Technician II

AGS - American Gaming Systems

Michigan, United States (On-Site)
1 Month ago
Google - Software Engineer, PhD, Early Career, Campus, Systems and Infrastructure, 2025 Start

Google

Atlanta, Georgia, United States (On-Site)
5 Months ago
PlayStation Global - Technical Product Manager II

PlayStation Global

San Mateo, California, United States (Hybrid)
2 Weeks ago
Microsoft - Service Engineer II

Microsoft

Redmond, Washington, United States (On-Site)
1 Day ago
Riot Games - Staff Software Engineer, Gameplay/Characters

Riot Games

Los Angeles, California, United States (On-Site)
2 Months ago
Probably Monsters - Design Director

Probably Monsters

Washington, District Of Columbia, United States (On-Site)
4 Months ago
Zones - Client Solutions Architect

Zones

Washington, United States (Remote)
4 Days ago
Oculus VR - Senior Tools Programmer

Oculus VR

Burlingame, California, United States (Remote)
4 Days ago
Aristocrat Gaming - Business Analyst

Aristocrat Gaming

Austin, Texas, United States (Hybrid)
2 Weeks ago
Rockstar Games - Senior Animation R&D Programmer: Retargeting

Rockstar Games

New York, New York, United States (On-Site)
2 Weeks ago

Get notifed when new similar jobs are uploaded

Quality Assurance Jobs

Brillio - QA Engineer - R01542503

Brillio

Guadalajara, Jalisco, Mexico (Hybrid)
5 Months ago
NVIDIA - Senior ICT and JTAG Test Engineer

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
2 Months ago
SciPlay - QA Engineer

SciPlay

Kyiv, Kyiv City, Ukraine (On-Site)
1 Month ago
Playrix - QA Director

Playrix

Serbia (Remote)
5 Months ago
Scorewarrior - QA Engineer (Core Team)

Scorewarrior

Limassol, Limassol, Cyprus (On-Site)
2 Months ago
Nagarro - Principal Engineer, QA Automation

Nagarro

India (Remote)
5 Months ago
Epic Games - Compliance Assurance Lead

Epic Games

Cary, North Carolina, United States (On-Site)
3 Months ago
ByteDance - Software Development Engineer in Test Graduate

ByteDance

San Jose, California, United States (On-Site)
15 Hours ago
Saviynt - Senior Engineer SDET, Quality Engineering

Saviynt

El Segundo, California, United States (Hybrid)
5 Months ago
The Walt Disney Company - Manager, Quality Assurance, Business Technology

The Walt Disney Company

Minato City, Tokyo, Japan (On-Site)
3 Weeks ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.


Hanoi, Hanoi, Vietnam (On-Site)

Shenzhen, Guangdong Province, China (On-Site)

Bengaluru, Karnataka, India (On-Site)

Shanghai, Shanghai, China (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)

Shanghai, Shanghai, China (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug