Senior System Reliability Engineer

1 Month ago • 6-8 Years • Research & Development • $140,000 PA - $264,500 PA

Job Summary

Job Description

NVIDIA seeks a Senior System Reliability Engineer to contribute to the reliability of their GPU servers and high-performance computing systems. Responsibilities include establishing and maintaining product reliability standards, participating in design reviews, working with suppliers and partners, defining reliability plans, performing testing and failure analysis, and correlating test results with field performance. This role requires expertise in hardware reliability engineering for electronics and server systems, including graphics cards, servers, racks, and clusters, encompassing the entire product lifecycle. The ideal candidate will have extensive experience with PCIE peripherals, graphics cards, and servers, strong statistical analysis skills, and excellent communication abilities.
Must have:
  • Hardware Reliability Engineering Expertise
  • Experience with PCIE peripherals, graphics cards, servers
  • Strong statistical analysis skills
  • Excellent communication skills
  • Design for Reliability (DfR) methods
  • Failure analysis and recommendations
Good to have:
  • MS or PhD in relevant field
Perks:
  • Competitive salary
  • Generous benefits package

Job Details

NVIDIA has continuously reinvented itself over two decades. Our invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing — with the GPU acting as the brains of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as “the AI computing company.” We're looking to grow our company and build our teams with the most thoughtful people in the world. Join us at the forefront of technological advancement. GPU Servers are one of the fastest-growing segments for NVIDIA and the Artificial Intelligence industry. As the computational power increases with every GPU generation, developing efficient and reliable systems is an imperative. We are looking for a System Reliability Engineer to join NVIDIA's existing Reliability Engineering team, involved in NVIDIA's diverse system product range specifically Graphics and High-Performance Computing printed circuit boards and Data Center Servers.


What you'll be doing:

  • Provide expertise in Hardware Reliability Engineering for Electronics/Server Systems (graphics cards, server, rack, cluster) from Concept to End-of-Life phase.

  • Establish, deliver and maintain product reliability standards and metrics for NVIDIA's new system technologies, using existing tools and processes or developing new as required.

  • Participate in product and engineering design reviews, assess the reliability budget of products/designs, and inspire changes that enhance product reliability.

  • Interface and interact with all pertinent engineering groups, suppliers, and partners ensuring the desired reliability is achieved using Design for Reliability (DfR) methods including FMEA and DoE approaches.

  • Define and implement Reliability Plans & Specifications.

  • Provide reliability predictions, along with test plans and methods to access and drive product reliability to the desired levels.

  • Perform and lead appropriate testing with associated failure analysis and recommendations for improving designs and manufacturing.

  • Develop and present methods of correlating reliability test results with actual field performance.


What we need to see:

  • BS (or equivalent experience) in Engineering, Material Science, Physics, or a related field, MS or PhD preferred.

  • 6+ years in a hardware validation/reliability environment related to PCIE peripherals, graphics cards and servers.

  • Understand power supply, memory, high speed I/O, PCI express, Ethernet and I2C.

  • Hands-on experience in theoretical and practical Reliability concepts as it relates to high-tech electronic enterprise and consumer products.

  • Have a strong command and understanding of statistical concepts/models/analysis and how they relate to product reliability & life analysis.

  • Good verbal and writing skills as well as the ability to communicate at a high level.

  • Self-motivating, independent, and committed to getting things done.

  • Good project management skills and ability to balance multiple simultaneous projects during development and production stages.

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you. Come build the future with us!

The base salary range is 140,000 USD - 264,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

OAO INFO INDIA - Product Lead

OAO INFO INDIA

Pune, Maharashtra, India (On-Site)
3 Months ago
magnopus - Technical Artist II

magnopus

Los Angeles, California, United States (On-Site)
10 Months ago
Sony Interactive Entertainment - Lead AI/ML Engineer (Facial and Motion Generation)

Sony Interactive Entertainment

Tokyo, Japan (On-Site)
1 Month ago
Kwalee - Senior Game Programmer (Creative Marketing)

Kwalee

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Valeo - Electronic System Design

Valeo

Martos, Andalusia, Spain (On-Site)
1 Week ago
NVIDIA - Physical Design Full Chip STA Engineer

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
3 Months ago
NVIDIA - Layout Design Engineer

NVIDIA

Bengaluru, Karnataka, India (Hybrid)
2 Months ago
NVIDIA - Senior Post Silicon Hardware Engineer

NVIDIA

Santa Clara, California, United States (Hybrid)
2 Months ago
NVIDIA - Senior Chip Design Engineer, Formal Verification

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
4 Months ago
Google - Research Scientist, Google Cloud AI

Google

Kirkland, Washington, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Ice fall interactive - Real Time FX Artist

Ice fall interactive

Nelson, British Columbia, Canada (On-Site)
1 Month ago
Inworld AI - Staff C++ Engineer

Inworld AI

Mountain View, California, United States (On-Site)
2 Months ago
Wargaming - Senior Game Designer, Core (Unannounced Project)

Wargaming

Warsaw, Masovian Voivodeship, Poland (Hybrid)
3 Months ago
Ubisoft - Technical Cinematic Designer

Ubisoft

Bordeaux, Nouvelle-Aquitaine, France (On-Site)
2 Months ago
Creasaur - VFX Artists

Creasaur

Ankara, Ankara, Türkiye (On-Site)
2 Weeks ago
Activision - Senior UX Tool Designer

Activision

Warsaw, Masovian Voivodeship, Poland (On-Site)
2 Months ago
bohemia interactive - Senior Multiplayer Programmer

bohemia interactive

Prague, Prague, Czechia (On-Site)
6 Months ago
Ansys - Senior C++ Software Engineer - Semiconductors

Ansys

Chalandri, Greece (On-Site)
1 Month ago
Juego Studios - Unity Developer _Delhi _Onsite

Juego Studios

Delhi, India (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Santa Clara, California, United States

Ion - Lead Python Engineer, New York

Ion

New York, New York, United States (Hybrid)
7 Months ago
Snap Mobile INC - Account Executive

Snap Mobile INC

Orlando, Florida, United States (On-Site)
1 Month ago
Collaborative Robotics - Software Engineer, Build and Deploy

Collaborative Robotics

Santa Clara, California, United States (On-Site)
1 Month ago
AliveCor - Senior Financial Accountant

AliveCor

Mountain View, California, United States (Hybrid)
1 Month ago
bytedance - Software Development Engineer Graduate (SDN Traffic Intelligence & Control) - 2025 Start (PhD)

bytedance

San Jose, California, United States (On-Site)
7 Months ago
Fox Factory - Warehouse Packaging Associate

Fox Factory

Coldwater, Michigan, United States (On-Site)
4 Days ago
whoop - Product Manager II (Connectivity and Embedded Systems)

whoop

Boston, Massachusetts, United States (On-Site)
1 Month ago
Nightfall - Sales Development Representative

Nightfall

San Francisco, California, United States (On-Site)
1 Week ago
bytedance - Video Experience Software Engineer Intern

bytedance

San Jose, California, United States (On-Site)
2 Months ago
Treck - Seasonal Sales Associate

Treck

Edgewater, New Jersey, United States (On-Site)
2 Weeks ago

Get notifed when new similar jobs are uploaded

Research & Development Jobs

bytedance - Research Scientist Graduate (High-Performance Computing (Inference Optimization) - Vision AI Platform)

bytedance

San Jose, California, United States (On-Site)
1 Month ago
DNEG - Video Streaming Engineer - Imaging, Playback and Review Tools

DNEG

London, England, United Kingdom (Remote)
1 Month ago
Riot Games - Senior Animation Artist - VALORANT

Riot Games

Los Angeles, California, United States (On-Site)
2 Months ago
W Beyond   - Embedded C

W Beyond

Bengaluru, Karnataka, India (On-Site)
8 Months ago
Hashlist - ADAS Feature Architect

Hashlist

Pune, Maharashtra, India (Hybrid)
8 Months ago
Google - Senior Machine Learning Physical Design Engineer

Google

Bengaluru, Karnataka, India (On-Site)
1 Month ago
NVIDIA - Software Engineering Intern - 2025

NVIDIA

Shanghai, Shanghai, China (On-Site)
4 Months ago
Tencent - Senior Regional Game Operation Manager

Tencent

London, England, United Kingdom (On-Site)
3 Months ago
The Walt Disney Company - Senior Manager, Software Development

The Walt Disney Company

Burbank, California, United States (On-Site)
1 Month ago
Riot Games - Senior User Researcher

Riot Games

Shanghai, Shanghai, China (On-Site)
10 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

Santa Clara, California, United States (On-Site)

Massachusetts, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Texas, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (Hybrid)

Pune, Maharashtra, India (On-Site)

Taipei City, Taiwan (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug