Senior Software QA Test Development Engineer

1 Month ago • 5 Years + • Quality Assurance • $136,000 PA - $212,750 PA

Job Summary

Job Description

As a Senior Software QA Test Development Engineer at NVIDIA, you will be responsible for developing and executing test plans for NVIDIA's HGX/DGX/MGX platforms. This includes installing and testing various OS, server firmware, and software stacks; conducting root cause analysis for reliability and validation test failures; building automation frameworks; reviewing partner test results; and collaborating with teams to resolve issues. The role requires strong experience in server and OS troubleshooting, automation (Python, Shell, Ansible, Jenkins), CI/CD, and DevOps. Experience with AI tools/frameworks, and various hardware/software components is also critical. The role involves working within an agile environment maintaining high-quality standards.
Must have:
  • 5+ years experience
  • OS & server automation (Python, Shell)
  • Server & OS troubleshooting
  • CI/CD automation & DevOps
  • Test plan development & execution
  • Root cause analysis
Good to have:
  • NVIDIA GPU hardware experience
  • Virtualization in Linux (KVM, Docker, Kubernetes)
  • Deep learning frameworks (TensorFlow, PyTorch)
  • Parallel programming (CUDA/OpenCL)
  • FW, BMC/OpenBMC, Network protocols
  • GitHub/Gitlab/Gerrit, PXE, SLURM, Kubernetes/Docker
Perks:
  • Equity
  • Benefits

Job Details

NVIDIA is the world leader in GPU Computing. We are passionate about markets include gaming, automotive, vision, HPC, datacenters and networking in addition to our traditional OEM business. NVIDIA is also well positioned as the ‘AI Computing Company’, and NVIDIA GPUs are the brains powering Deep Learning software frameworks, analytics, data centers, and driving autonomous vehicles. We have some of the most experienced and dedicated people in the world working for us. If you are dedicated, forward-thinking, and hard-working technical people across countries sounds exciting, this job is for you. NVIDIA is looking for an outstanding individual who thrives in a diverse work environment, has outstanding interpersonal skills and possesses a strong sense of engagement and continuous process improvement. This candidate must have enterprise server integration, strong OS experience, reliability testing with various telemetries, scale out cluster, test plan development, CI/CD and DevOps experience to join our platform SWQA team.

What you’ll be doing:

  • Responsible for the development and execution of NVIDIA HGX/DGX/MGX platform test plan on servers, OS, FW and CUDA SW stack from design doc.

  • Installing and testing various systems OS, server firmware and SW stack.

  • Drive support for root cause analysis on reliability and validation test failures to identify root cause(s) and achieve mitigation.

  • Build, develop/debug server and OS level automation front-end and back-end framework and tests

  • Review partner and supplier test results and prescribe additional reliability testing on components, servers, and packaging as needed.

  • Work in an agile software development team with very high production quality standards.

  • Manage bug lifecycle and collaborate with inter-groups to drive for solutions.

What we need to see:

  • Bachelor’s Degree (or equivalent experience) in a STEM (Science, Technology, Engineering, Math or Physics) field

  • 5+ years proven experience; or Master’s Degree.

  • Proven years of OS and server level automation experience using Python, SHELL, Ansible, Jenkins, C/C++, Java, JavaScript

  • Strong server and OS(Ubuntu, RedHat, CentOS, SuSE, Fedora, Windows and etc…) trouble-shooting and debugging experience in a bare-metal and KVM/VMWare/Hyper-V environment.

  • Good knowledge and hands-on experience in model testing, AI tools/frameworks (TensorFlow, Pytorch, Cursor and etc…), NLP  and LLM benchmarking

  • Experience in developing CI/CD automation processes and DevOps contribution with a real passion for automation.

  • Strong experience in FW, BMC/OpenBMC, Network protocol, internal/external enterprise storage devices, PCIe buses and devices, IO sub-devices, CPU and memory, ACPI, UEFI spec, Redfish - huge plus

  • Proven years of experience in GitHub/Gitlab/Gerrit, PXE, SLURM, Stack/Kubernetes/Docker) – huge plus

Ways to stand out from the crowd:

  • Experience working with NVIDIA GPU hardware is a strong plus.

  • Good to have solid understanding of virtualization in Linux (KVM, Docker orchestrated with Kubernetes)

  • Background in deep learning frameworks is a plus

  • Background in parallel programming ideally CUDA/OpenCL is a plus

    The base salary range is 136,000 USD - 212,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

    You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

    NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

    Similar Jobs

    Wargaming - UX/UI Designer

    Wargaming

    Prague, Prague, Czechia (On-Site)
    1 Month ago
    Nintendo - Manufacturing Engineer (Bilingual Japanese)

    Nintendo

    Redmond, Washington, United States (Hybrid)
    8 Months ago
    Socialpoint - Senior Game Designer

    Socialpoint

    Barcelona, Catalonia, Spain (On-Site)
    2 Months ago
    Funcom - Senior Game Tester - Dune: Awakening

    Funcom

    Bucharest, Bucharest, Romania (Hybrid)
    1 Month ago
    Nordcurrent - Lead 2D Artist

    Nordcurrent

    Vilnius, Vilnius County, Lithuania (On-Site)
    3 Months ago
    Syniverse - QA Automation Engineer

    Syniverse

    Bengaluru, Karnataka, India (Hybrid)
    5 Months ago
    Lionbridge Games - Technical Test Associate

    Lionbridge Games

    Mexico City, Mexico City, Mexico (On-Site)
    2 Months ago
    Thatgamecompany - Build Engineer (Associate to Mid-Level)

    Thatgamecompany

    Canada (Remote)
    3 Weeks ago
    NVIDIA - Test Floor Engineer

    NVIDIA

    Hsinchu, Hsinchu City, Taiwan (On-Site)
    2 Months ago
    Tesla - Software QA Engineer, IT Application

    Tesla

    North Holland, Netherlands (On-Site)
    2 Months ago

    Get notifed when new similar jobs are uploaded

    Similar Skill Jobs

    Ubisoft - Lead Programmer

    Ubisoft

    Pune, Maharashtra, India (On-Site)
    1 Month ago
    undefined - Game Producer

    London, England, United Kingdom (On-Site)
    4 Weeks ago
    Onward Search - AEM Content Developer

    Onward Search

    Woonsocket, Rhode Island, United States (Remote)
    2 Months ago
    FromSoftware - Graphic Supervisor Specialized in Lighting

    FromSoftware

    Japan (On-Site)
    4 Months ago
    Starkflow - Systems Design & Architecture Engineer

    Starkflow

    United States (On-Site)
    2 Months ago
    Obsidian Entertainment - Senior Producer

    Obsidian Entertainment

    Irvine, California, United States (On-Site)
    1 Month ago
    Rocket Science - Producer (Technical Account Manager)

    Rocket Science

    Wales, United Kingdom (Hybrid)
    3 Weeks ago
    ZiMAD - QA Engineer

    ZiMAD

    (Remote)
    2 Months ago
    Lionbridge Games - Test Lead

    Lionbridge Games

    Mexico City, Mexico City, Mexico (On-Site)
    2 Months ago
    Electronic Arts - Associate Quality Designer

    Electronic Arts

    Bucharest, Bucharest, Romania (On-Site)
    4 Weeks ago

    Get notifed when new similar jobs are uploaded

    Jobs in Santa Clara, California, United States

    Axinous - Sr. Staff ML Engineer

    Axinous

    San Jose, California, United States (Hybrid)
    3 Months ago
    Universal Music - Manager, eCommerce Analytics

    Universal Music

    New York, New York, United States (On-Site)
    1 Month ago
    Next Level Business Services - BigData Architect

    Next Level Business Services

    Bentonville, Arkansas, United States (On-Site)
    5 Months ago
    ByteDance - Software Engineer in ML Engineering Platform

    ByteDance

    San Jose, California, United States (On-Site)
    5 Months ago
    Axon - Manager, Site Reliability Engineering (Observability)

    Axon

    Seattle, Washington, United States (Remote)
    1 Month ago
    The Walt Disney Company - Animal Keeper - Small Mammal / Ectotherm (Seasonal)

    The Walt Disney Company

    Lake Buena Vista, Florida, United States (On-Site)
    2 Months ago
    IGT - Game Mathematician IV - PlaySocial

    IGT

    North Dakota, United States (Remote)
    4 Months ago
    Sleeper - Performance Creative Associate (TikTok Ads)

    Sleeper

    Los Angeles, California, United States (On-Site)
    3 Weeks ago
    Ludeo - Senior C++ Video Engineer

    Ludeo

    Los Angeles, California, United States (On-Site)
    2 Months ago
    Onward Search - DevOps Engineer

    Onward Search

    Irvine, California, United States (Hybrid)
    1 Month ago

    Get notifed when new similar jobs are uploaded

    Quality Assurance Jobs

    Playrix - Lead SDET

    Playrix

    Georgia (Remote)
    5 Months ago
    BigShip - Software Tester

    BigShip

    Dehradun, Uttarakhand, India (On-Site)
    5 Months ago
    Bragg - QA Engineer

    Bragg

    Ljubljana, Ljubljana, Slovenia (Hybrid)
    3 Months ago
    Nordcurrent - Experienced QA Mobile Game Tester

    Nordcurrent

    Warsaw, Masovian Voivodeship, Poland (On-Site)
    2 Months ago
    Logitech - Sr.Test Engineer

    Logitech

    Camas, Washington, United States (Hybrid)
    6 Months ago
    Velotio Technologies - QA Architect

    Velotio Technologies

    Maharashtra, India (Remote)
    3 Weeks ago
    ZiMAD - QA Engineer

    ZiMAD

    (Remote)
    2 Months ago
    Breach - XR Quality Assurance (QA) Lead

    Breach

    Trondheim, Trøndelag, Norway (On-Site)
    5 Months ago
    Irdeto - Senior Software Engineer in Test

    Irdeto

    Noida, Uttar Pradesh, India (Hybrid)
    6 Months ago
    Morgan McKinley - QA Tester (Gyomu-itaku)

    Morgan McKinley

    Tokyo, Japan (On-Site)
    8 Months ago

    Get notifed when new similar jobs are uploaded

    About The Company

    Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.


    Santa Clara, California, United States (On-Site)

    Texas, United States (Remote)

    Santa Clara, California, United States (On-Site)

    Yokne'am Illit, North District, Israel (On-Site)

    United Kingdom (Remote)

    Yokne'am Illit, North District, Israel (On-Site)

    Bengaluru, Karnataka, India (Hybrid)

    Toronto, Ontario, Canada (On-Site)

    View All Jobs

    Get notified when new jobs are added by NVIDIA

    Level Up Your Career in Game Development!

    Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

    Job Common Plug