Enterprise Software Test Development Engineer

3 Months ago • 2 Years + • Testing

Job Summary

Job Description

NVIDIA seeks an Enterprise Software Test Development Engineer to join their Enterprise Server QA team. Responsibilities include developing and executing test plans for NVIDIA HGX/DGX platforms (OS, FW, CUDA SW stack), installing and testing various system OS and software stacks (Windows & Linux), driving root cause analysis of reliability and validation test failures, leveraging AI (Language Model) skills for automation, reviewing partner test results, and working in an agile environment. The ideal candidate possesses strong OS/FW experience, reliability testing expertise, and CI/CD/DevOps knowledge.
Must have:
  • 2+ years experience
  • Automation (Python, Shell, Ansible, Jenkins)
  • OS troubleshooting (Linux, Windows)
  • Test plan development (functional, performance, stress)
  • CI/CD automation
  • AI development tools for testing
Good to have:
  • NVIDIA GPU hardware experience
  • x86 server error handling
  • x86/ARM environment
  • Parallel programming (CUDA/OpenCL)
  • FW, BMC/OpenBMC, SBIOS, Network protocol, enterprise storage, Redfish experience

Job Details

NVIDIA is the world leader in GPU Computing. We are passionate about markets include gaming, automotive, vision, HPC, datacenters and networking in addition to our traditional OEM business. NVIDIA is also well positioned as the ‘AI Computing Company’, and NVIDIA GPUs are the brains powering Deep Learning software frameworks, analytics, data centers, and driving autonomous vehicles. We have some of the most experienced and dedicated people in the world working for us. If you are dedicated, forward-thinking, and hard-working technical people across countries sounds exciting, this job is for you.

NVIDIA is looking for an outstanding individual who thrives in a diverse work environment, has outstanding interpersonal skills and possesses a strong sense of engagement and continuous process improvement. This candidate must have enterprise system integration, strong OS/FW experience, reliability testing with various telemetries, test plan development, CI/CD and DevOps experience to join our Enterprise Server QA team.

What you’ll be doing:

  • Responsible for the development and execution of NVIDIA HGX/DGX platform test plan on OS, FW and CUDA SW stack from design doc.

  • Installing and testing various systems OS, system firmware and software stack including Windows & Linux

  • Drive support for root cause analysis on reliability and validation test failures to identify root cause(s) and achieve mitigation.

  • Leverage AI (Language Model) skills to build automation front-end and back-end framework which could interaction with human

  • Review partner and supplier test results and prescribe additional reliability testing on components, systems, and packaging as needed.

  • Work in an agile software development team with very high production quality standards.

  • Manage bug lifecycle and collaborate with inter-groups to drive for solutions.

What we need to see:

  • Bachelor’s Degree (or equivalent experience) in a STEM (Science, Technology, Engineering, Math or Physics) field with 2+ years proven experience; or Master’s Degree.

  • 2+ years of meaningful work experience

  • Proven years of automation experience using Python, Shell Script, Ansible, Jenkins

  • Strong OS (Ubuntu, RedHat, CentOS, SuSE, Fedora, Windows, etc.) trouble-shooting and debugging experience in a bare-metal and KVM/VMWare environment.

  • Experience in using AI development tools for test plans creation, test cases development and test cases automation

  • Ability to write test plans focusing on functional, performance, stress and negative testing.

  • Experience in developing CI/CD automation processes and DevOps contribution with a real passion for automation and Good teamwork with ability to work independently.

Ways to stand out from the crowd:

  • Experience working with NVIDIA GPU hardware is a strong plus.

  • Have implemented error handling for x86 based servers, online and offline health monitoring tools.

  • Experience of developing x86/ARM based environment

  • Background in parallel programming ideally CUDA/OpenCL is a plus

  • Strong experience in FW, BMC/OpenBMC, SBIOS, Network protocol, enterprise storage devices, Redfish - huge plus

We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, sex, gender, gender expression, sexual orientation, age, marital status, veteran status, or disability status. We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.

Similar Jobs

Scale AI - Senior Software Engineer, GenAI Outlier

Scale AI

San Francisco, California, United States (Hybrid)
2 Months ago
Scale AI - Strategic Finance Manager, GenAI

Scale AI

San Francisco, California, United States (On-Site)
2 Months ago
HCL Tech - Sr tec lead teamcenter support tc admin

HCL Tech

New York, United States (On-Site)
1 Month ago
Bungie - Contract Associate Creator Marketing Manager

Bungie

(Hybrid)
6 Months ago
Rebellion - Lead Environment Artist

Rebellion

Oxford, England, United Kingdom (Hybrid)
3 Months ago
NVIDIA - Test Floor Engineer

NVIDIA

Hsinchu, Hsinchu City, Taiwan (On-Site)
5 Months ago
Qualcomm - Staff Engineer - Split Compute Testing

Qualcomm

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Next Level Business Services - Performance Testing (Full Time Only)

Next Level Business Services

Dublin, Ohio, United States (On-Site)
8 Months ago
Next Level Business Services - Workday Integration Tester

Next Level Business Services

Menomonee Falls, Wisconsin, United States (On-Site)
8 Months ago
NVIDIA - Senior Test Engineer

NVIDIA

(Remote)
4 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Illumina - Staff Automation Engineer

Illumina

Foster City, California, United States (On-Site)
2 Weeks ago
Socialpoint - Manager, Events & Corporate Food & Bev

Socialpoint

Barcelona, Catalonia, Spain (On-Site)
1 Month ago
Toast - Retail Account Executive

Toast

California, United States (Hybrid)
2 Weeks ago
PwC - Senior Associate - SAP BPC - GDC

PwC

Kolkata, West Bengal, India (On-Site)
9 Months ago
Spaulding Ridge - Oracle EPM Solution Architect

Spaulding Ridge

Chicago, Illinois, United States (On-Site)
2 Months ago
Apple - Data Visualization / BI Engineer

Apple

Cupertino, California, United States (On-Site)
2 Weeks ago
Wargaming - Tactical Sourcing Supervisor

Wargaming

Belgrade, Serbia (Hybrid)
2 Weeks ago
Ion - Lead Product Manager

Ion

Boston, Massachusetts, United States (Hybrid)
3 Months ago
Adyen - Team Lead - Ops Academy

Adyen

Amsterdam, North Holland, Netherlands (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Taipei City, Taiwan

Qualcomm - Component Supplier Quality Engineer, Staff

Qualcomm

Hsinchu City, Taiwan (On-Site)
1 Month ago
binance - Senior Data Analyst - Compliance

binance

Taipei City, Taiwan (Remote)
1 Year ago
binance - DevSecOps Engineer, Infrastructure Security

binance

Taipei City, Taiwan (Remote)
10 Months ago
binance - Senior Java Engineer - Payment

binance

Taipei City, Taiwan (Remote)
2 Months ago
Coda - Compliance & Partner Enablement Specialist

Coda

Taipei City, Taiwan (Hybrid)
3 Weeks ago
Maersk - People Advisor

Maersk

Taoyuan City, Taiwan (On-Site)
8 Months ago
Canonical - Linux Enablement - Software Engineering Manager

Canonical

Taipei City, Taiwan (On-Site)
1 Month ago
Marsh McLennan - Account Manager - Mercer Marsh Benefits (New Business Focus)

Marsh McLennan

Taipei City, Taiwan (Hybrid)
1 Month ago
appier - Campaign Analyst (US)

appier

Taipei City, Taiwan (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Testing Jobs

luxsoft - Regular Test Specifications Engineer

luxsoft

Egypt (Remote)
1 Month ago
NVIDIA - System Test Design Engineer

NVIDIA

(Remote)
5 Months ago
luxsoft - Test Engineer

luxsoft

Sofia, Sofia City Province, Bulgaria (On-Site)
1 Month ago
31st Union - Senior Test Automation Engineer

31st Union

San Mateo, California, United States (Hybrid)
2 Months ago
Capgemini - Penetration Testing Engineer

Capgemini

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Universally Speaking - Danish Games Tester

Universally Speaking

Madrid, Community Of Madrid, Spain (On-Site)
2 Weeks ago
Capgemini - Payments Testing

Capgemini

Pune, Maharashtra, India (On-Site)
1 Month ago
Unada labs - Manual Tester

Unada labs

Ahmedabad, Gujarat, India (On-Site)
5 Months ago
Capgemini - Testing Engineer (HFC)

Capgemini

Chennai, Tamil Nadu, India (On-Site)
1 Month ago
endava - Senior Java Automation Tester

endava

Bucharest, Bucharest, Romania (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

Santa Clara, California, United States (On-Site)

Massachusetts, United States (On-Site)

Santa Clara, California, United States (On-Site)

Texas, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (Hybrid)

Pune, Maharashtra, India (On-Site)

Taipei City, Taiwan (On-Site)

Beijing, Beijing, China (On-Site)

Santa Clara, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug