Senior Datacenter Product Development Engineer

1 Month ago • 8 Years + • Quality Assurance • $160,000 PA - $304,750 PA

Job Summary

Job Description

As a Senior Datacenter Product Development Engineer at NVIDIA, you'll collaborate with engineering teams to launch new boards for GPU-accelerated server platforms (HGX/MGX/DGX). This involves planning processes, defining test requirements, optimizing production lines, and ensuring cost and quality targets are met. Responsibilities include developing diagnostic tests, debugging complex hardware/software interactions, ensuring DFx compliance, owning product lifecycles, and creating test specifications. You'll also collaborate with contract manufacturers, analyze data to solve yield problems, and support customer teams during escalations. This role requires deep expertise in high-speed signals, server architectures, and GPU technology.
Must have:
  • 8+ years HW design/validation/manufacturing test experience
  • Proficient in HW interfaces (PCIe Gen4+, InfiniBand, etc.)
  • Strong problem-solving & troubleshooting expertise
  • Experience defining test specs for complex HW systems
  • BS/MS in EE/CE/CS or equivalent
Good to have:
  • Experience in HW board/system electrical design
  • HW device drivers or HW diagnostics software development
  • Proficient in Python or Shell scripting
  • Familiar with FPGA implementation, FW secure-boot
Perks:
  • Competitive salary
  • Generous benefits package

Job Details

NVIDIA Corporation is a world leader in visual computing technology. The GPU, which the company invented, serves as the visual cortex of modern computers and is at the heart of its products and services. NVIDIA has transformed into a specialized platform company that targets four large markets – Gaming, Professional Visualization, Datacenter and Automotive – where visual computing is essential and deeply valued. Its work also uncovers new universes to explore and enables amazing creativity and discovery by powering what were once thought to be science-fiction inventions, such as artificial intelligence and autonomous cars.

Collaborating with your peers across various engineering groups, you will launch new boards for NVIDIA GPU Accelerated Server Platforms (HGX/MGX/DGX) to production. These purpose-built systems are optimized for the growing Deep Learning, Artificial Intelligence, and Analytics environments. With world-class technology enabling never-before-seen performance levels, NVIDIA’s HGX/DGX portfolio is arguably the most complicated Server platform ever developed. This product family represents the company’s fastest growing line of business as well as its largest total available market opportunity. You will bring to bear your knowledge of Server architectures, CPU baseboards and GPU technology to productize new GPU boards for Server architectures with GPU-accelerated clusters. Your responsibilities will include planning and establishing processes, defining test requirements and optimizing the production line to deliver new NVIDIA GPU boards. You will also be instrumental in helping the team achieve best-in-class cost and quality metrics.

What you will be doing:

  • Leverage your in-depth experience with high-speed signals to plan and develop new diagnostic tests and debug procedures for next gen products.

  • Use your knowledge of system power-up and handshakes during boot to debug complex interactions between HW, FW and SW on faulty boards.

  • Recommend, drive and ensure compliance with DFx requirements for robust signal integrity performance as related to layout, mechanical components, assembly procedures, etc.

  • Own a product or series of products end-to-end through the entire product lifecycle, ensuring successful production ramps while working through a large matrixed team.

  • Develop and deliver test specs for system level manufacturing screens for all new products to meet the required HW coverage, quality and product requirements for various business units.

  • Collaborate with CM to define product assembly line, number of test stations and number of assembly fixtures, optimized for cost and throughput.

  • Craft creative solutions and workarounds (WARs) through volume data analysis and lab experimentation to solve challenging yield and test problems seen on the production floor.

  • Lead optimization and continuous improvement efforts on the production screen spec definition processes to minimize waste and meet test time, yield, and DPPM requirements.

  • Support customer facing and quality teams during customer escalations to understand the issue and fix gaps identified in coverage.

What we need to see:

  • BS/MS in EE/CE/CS or equivalent experience

  • 8+ years of experience in HW design, diagnostics/validation, or manufacturing test of PCIe IPs, Chips or Systems

  • Proficient in HW interfaces, including PCIe (Gen4+), InfiniBand, Ethernet, I3C/I2C, SPI, USB, etc.

  • In depth understanding of HPC server architecture and Out-of-Band management

  • Strong problem-solving and troubleshooting expertise, including institutionalizing root-cause analysis

  • Experience in defining test and validation specifications for complex HW systems or HPC servers

  • Motivated to continually improve/optimize processes

  • Self-initiative, strong interpersonal skills, and flexibility to adapt to new technologies

Ways to stand out from the crowd:

  • Prior experience in HW board/system electrical design, HW device drivers or HW diagnostics software development

  • Hands-on experience in debugging and triaging HW faults using testing equipment and Linux commands/tools

  • Proficient in Python or Shell scripting for HW testing automation and log parsing

  • Familiar with FPGA implementation, FW secure-boot and encrypted images

  • Operations Research/Industrial/Engineering statistics skills

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers; we have some of the most forward-thinking and hardworking people in the world working for us and, due to unparalleled growth, best-in-class teams are rapidly growing. If you’re creative and autonomous with a real passion for your work, we want to hear from you!

The base salary range is 160,000 USD - 304,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.
