Senior Datacenter Product Development Engineer

2 Weeks ago • 8 Years + • Quality Assurance • $160,000 PA - $304,750 PA

Job Summary

Job Description

As a Senior Datacenter Product Development Engineer at NVIDIA, you'll collaborate with engineering teams to launch new boards for GPU-accelerated server platforms (HGX/MGX/DGX). This involves planning processes, defining test requirements, optimizing production lines, and ensuring cost and quality targets are met. Responsibilities include developing diagnostic tests, debugging complex hardware/software interactions, ensuring DFx compliance, owning product lifecycles, and creating test specifications. You'll also collaborate with contract manufacturers, analyze data to solve yield problems, and support customer teams during escalations. This role requires deep expertise in high-speed signals, server architectures, and GPU technology.
Must have:
  • 8+ years HW design/validation/manufacturing test experience
  • Proficient in HW interfaces (PCIe Gen4+, InfiniBand, etc.)
  • Strong problem-solving & troubleshooting expertise
  • Experience defining test specs for complex HW systems
  • BS/MS in EE/CE/CS or equivalent
Good to have:
  • Experience in HW board/system electrical design
  • HW device drivers or HW diagnostics software development
  • Proficient in Python or Shell scripting
  • Familiar with FPGA implementation, FW secure-boot
Perks:
  • Competitive salary
  • Generous benefits package

Job Details

NVIDIA Corporation is a world leader in visual computing technology. The GPU, which the company invented, serves as the visual cortex of modern computers and is at the heart of their products and services. NVIDIA has transformed into a specialized platform company that targets four large markets – Gaming, Professional Visualization, Datacenter and Automotive – where visual computing is essential and deeply valued. Their work also uncovers new universes to explore and enables amazing creativity and discovery by powering what were once thought to be science fiction inventions, like artificial intelligence and autonomous cars.

Collaborating with your peers across various engineering groups, you will successfully launch new boards for NVIDIA GPU Accelerated Server Platforms (HGX/MGX/DGX) to production. These purpose-built systems are optimized for the growing Deep Learning, Artificial Intelligence, and Analytics environments. With world-class technology enabling never-been-seen-before performance levels, NVIDIA’s HGX/DGX portfolio is arguably the most complicated Server platform ever developed by humans. This product family represents the company’s fastest growing line of business as well as its largest total available market opportunity. You will bring to bear your knowledge of Server architectures, CPU baseboards and GPU technology in order to productize new GPU boards for Server architectures with GPU-accelerated clusters. Your responsibilities will include planning and establishing processes, defining test requirements and optimizing the production line to deliver new NVIDIA GPU boards. You will also be instrumental in helping the team to achieve the desired cost and quality metrics considered best-in-class.

What you will be doing:

  • Leverage your in-depth experience with high-speed signals to plan and develop new diagnostic tests and debug procedures for next gen products.

  • Use your knowledge of system power-up and handshakes during boot to debug complex interactions between HW, FW and SW on faulty boards.

  • Recommend, drive and ensure compliance with DFx requirements for robust signal integrity performance as related to layout, mechanical components, assembly procedures, etc.

  • Own a product or series of products end-to-end through the entire product lifecycle, working through a large matrixed team to ensure successful production ramps.

  • Develop and deliver test specs for system level manufacturing screens for all new products to meet the required HW coverage, quality and product requirements for various business units.

  • Collaborate with the contract manufacturer (CM) to define the product assembly line, number of test stations and number of assembly fixtures, optimized for cost and throughput.

  • Craft creative solutions and workarounds (WARs) through volume data analysis and lab experimentation to solve challenging yield and test problems seen on the production floor.

  • Lead optimization and continuous improvement efforts on the production screen spec definition processes to minimize waste and meet test time, yield and DPPM requirements.

  • Support customer facing and quality teams during customer escalations to understand the issue and fix gaps identified in coverage.

What we need to see:

  • BS/MS in EE/CE/CS or equivalent experience

  • 8+ years of experience in HW design, diagnostics/validation or manufacturing test of PCIe IPs, chips or systems

  • Proficient in HW interfaces, including PCIe (Gen4+), InfiniBand, Ethernet, I3C/I2C, SPI, USB, etc.

  • In-depth understanding of HPC server architecture and Out-of-Band management

  • Strong problem-solving and troubleshooting expertise, and experience institutionalizing root-cause analysis

  • Experience in defining test and validation specifications for complex HW systems or HPC servers

  • Motivated to continually improve/optimize processes

  • Self-initiative, strong interpersonal skills, and flexibility to adapt to new technologies

Ways to stand out from the crowd:

  • Prior experience in HW board/system electrical design, HW device drivers or HW diagnostics software development

  • Hands-on experience in debugging and triaging HW faults using test equipment and Linux commands/tools

  • Proficient in Python or Shell scripting for HW testing automation and log parsing

  • Familiar with FPGA implementation, FW secure-boot and encrypted images

  • Operations Research/Industrial/Engineering statistics skills

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers; we have some of the most forward-thinking and hardworking people in the world working for us and, due to unparalleled growth, best-in-class teams are rapidly growing. If you’re creative and autonomous with a real passion for your work, we want to hear from you!

The base salary range is 160,000 USD - 304,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.
