Senior Datacenter Product Development Engineer

2 Months ago • 8 Years + • Quality Assurance • $160,000 PA - $304,750 PA

Job Summary

Job Description

As a Senior Datacenter Product Development Engineer at NVIDIA, you'll collaborate with engineering teams to launch new boards for GPU-accelerated server platforms (HGX/MGX/DGX). This involves planning processes, defining test requirements, optimizing production lines, and ensuring cost and quality targets are met. Responsibilities include developing diagnostic tests, debugging complex hardware/software interactions, ensuring DFx compliance, owning product lifecycles, and creating test specifications. You'll also collaborate with contract manufacturers, analyze data to solve yield problems, and support customer teams during escalations. This role requires deep expertise in high-speed signals, server architectures, and GPU technology.
Must have:
  • 8+ years HW design/validation/manufacturing test experience
  • Proficient in HW interfaces (PCIe Gen4+, InfiniBand, etc.)
  • Strong problem-solving & troubleshooting expertise
  • Experience defining test specs for complex HW systems
  • BS/MS in EE/CE/CS or equivalent
Good to have:
  • Experience in HW board/system electrical design
  • HW device drivers or HW diagnostics software development
  • Proficient in Python or Shell scripting
  • Familiar with FPGA implementation, FW secure-boot
Perks:
  • Competitive salary
  • Generous benefits package

Job Details

NVIDIA Corporation is a world leader in visual computing technology. The GPU, which the company invented, serves as the visual cortex of modern computers and is at the heart of their products and services. NVIDIA has transformed into a specialized platform company that targets four large markets – Gaming, Professional Visualization, Datacenter and Automotive – where visual computing is essential and deeply valued. Their work also uncovers new universes to explore and enables amazing creativity and discovery by powering what were once thought to be science-fiction inventions, such as artificial intelligence and autonomous cars.

Collaborating with your peers across various engineering groups, you will successfully launch new boards for NVIDIA GPU Accelerated Server Platforms (HGX/MGX/DGX) to production. These purpose-built systems are optimized for the growing Deep Learning, Artificial Intelligence, and Analytics environments. With world-class technology enabling never-been-seen-before performance levels, NVIDIA’s HGX/DGX portfolio is arguably the most complicated Server platform ever developed by humans. This product family represents the company’s fastest growing line of business as well as its largest total available market opportunity. You will bring to bear your knowledge of Server architectures, CPU baseboards and GPU technology in order to productize new GPU boards for Server architectures with GPU-accelerated clusters. Your responsibilities will include planning and establishing processes, defining test requirements and optimizing the production line to deliver new NVIDIA GPU boards. You will also be instrumental in helping the team to achieve the desired cost and quality metrics considered best-in-class.

What you will be doing:

  • Leverage your in-depth experience with high-speed signals to plan and develop new diagnostic tests and debug procedures for next-gen products.

  • Use your knowledge of system power-up and handshakes during boot to debug complex interactions between HW, FW and SW on faulty boards.

  • Recommend, drive and ensure compliance with DFx requirements for robust signal integrity performance as related to layout, mechanical components, assembly procedures, etc.

  • Own a product or series of products end-to-end through the entire product lifecycle; your role would be to ensure successful production ramps are achieved working through a large matrixed team.

  • Develop and deliver test specs for system level manufacturing screens for all new products to meet the required HW coverage, quality and product requirements for various business units.

  • Collaborate with the contract manufacturer (CM) to define the product assembly line, the number of test stations and the number of assembly fixtures, optimized for cost and throughput.

  • Craft creative solutions and workarounds (WARs) through volume data analysis and lab experimentation to solve challenging yield and test problems seen on the production floor.

  • Lead optimization and continuous improvement efforts on the production screen spec definition processes to minimize waste and meet test time, yield, and DPPM requirements.

  • Support customer facing and quality teams during customer escalations to understand the issue and fix gaps identified in coverage.

What we need to see:

  • BS/MS in EE/CE/CS or equivalent experience

  • 8+ years of experience in HW design, diagnostics/validation, or manufacturing test of PCIe IPs, chips or systems

  • Proficient in HW interfaces, including PCIe (Gen4+), InfiniBand, Ethernet, I3C/I2C, SPI, USB, etc.

  • In-depth understanding of HPC server architecture and Out-of-Band management

  • Strong problem-solving and troubleshooting expertise, and experience institutionalizing root-cause analysis

  • Experience in defining test and validation specifications for complex HW systems or HPC servers

  • Motivated to continually improve/optimize processes

  • Self-initiative, strong interpersonal skills, and flexibility to adapt to new technologies

Ways to stand out from the crowd:

  • Prior experience in HW board/system electrical design, HW device drivers or HW diagnostics software development

  • Hands-on experience in debugging and triaging HW faults using testing equipment and Linux commands/tools

  • Proficient in Python or Shell scripting for HW testing automation and log parsing

  • Familiar with FPGA implementation, FW secure-boot and encrypted images

  • Operations Research/Industrial/Engineering statistics skills

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers; we have some of the most forward-thinking and hardworking people in the world working for us and, due to unparalleled growth, best-in-class teams are rapidly growing. If you’re creative and autonomous with a real passion for your work, we want to hear from you!

The base salary range is 160,000 USD - 304,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.
