Senior Reliability Engineer

5 Months ago • 8 Years +
Software Development & Engineering

Job Description

NVIDIA seeks a Senior Reliability Engineer to ensure the reliability of complex systems, focusing on critical components and liquid cooling. Responsibilities include defining and executing reliability test plans, analyzing data, driving improvements, and collaborating with design engineers. The ideal candidate possesses a deep understanding of reliability principles, testing methodologies, and experience with DFQR methods and FMEA approaches. This role involves working with engineering groups, suppliers, and partners to achieve desired reliability, identifying reliability threats, and improving procedures.
Good To Have:
  • Master's degree in Engineering
  • Data center industry experience
  • Knowledge of material science and manufacturing processes
Must Have:
  • 8+ years in reliability engineering
  • Experience with critical components and complex systems
  • Liquid cooling systems experience
  • Statistical analysis and Weibull analysis
  • Cross-functional team leadership

The NVIDIA Networking Business Unit is seeking a highly motivated and experienced Reliability Expert to join our team. In this critical role, you will be responsible for ensuring the reliability of our complex systems, focusing specifically on critical components and the liquid cooling system. You will define and execute reliability test plans, analyze data, and drive improvements to enhance the overall robustness and performance of our products. This position requires a deep understanding of reliability principles and testing methodologies, and a passion for identifying and mitigating potential failures.

The NVIDIA Networking division is a leading supplier of innovative end-to-end InfiniBand and Ethernet connectivity solutions and services for servers and storage. We offer market-leading solutions that include adapter cards, switches, cables, and software to support networking technologies. Our products optimize data center performance and deliver industry-leading bandwidth and scalability. We serve a wide range of markets, including high-performance computing, enterprise, data centers, cloud computing, big data, and Web 2.0. We are constantly reinventing ourselves to stay ahead of the market and bring groundbreaking products and services to the industry. Our product line focuses on delivering the most optimized Ethernet solutions for industries such as Media and Entertainment, as well as any other industry that can benefit from our Datastream and TCP/IP acceleration.

What you will be doing:

  • Interface with all pertinent engineering groups, suppliers, and partners to ensure the desired reliability is achieved, using Design for Quality and Reliability (DFQR) methods, including FMEA approaches
  • Collaborate with design engineers to incorporate quality and reliability principles into the design of new products and systems. Provide input on component selection, material selection, and design features to enhance reliability
  • Participate in product and engineering design reviews, assess the reliability budget of products and designs, and inspire changes that improve product reliability
  • Identify reliability threats and assess risks
  • Improve procedures and methodologies
  • Define and implement reliability test plans

What we need to see:

  • Bachelor's degree in Engineering (Mechanical, Electrical, Materials Science, or related field) required; Master's degree preferred.
  • At least 8 years of experience in reliability engineering, with a focus on critical components and complex systems. Experience with liquid cooling systems is highly desirable.
  • A strong technical background and experience driving cross-functional teams
  • Sharp thinking and good decision-making skills

Ways to stand out from the crowd:

  • Strong understanding of reliability principles, including statistical analysis, Weibull analysis, and reliability prediction methods
  • Experience in industries such as data centers.
  • Knowledge of material science and manufacturing processes.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and dedicated people in the world working for us. If you're creative and autonomous, we want to hear from you!

We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, sex, gender, gender expression, sexual orientation, age, marital status, veteran status, or disability status. We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.
