Senior GPU System Software Engineer
NVIDIA
Job Summary
NVIDIA is seeking a Sr Systems Software Engineer to develop and validate software for next-generation SOC platforms. This role involves solving and debugging complex problems related to Memory, ACPI, Graphics, and Display, collaborating with hardware, firmware, and OS teams. Responsibilities include investigating architecture requirements, debugging hard hangs across system components, utilizing hardware debug tools, and analyzing crash dumps. The engineer will also support validation and bring-up activities for new hardware platforms and provide design feedback.
Must Have
- Develop and validate software for next-generation SOC platform
- Solve and debug Memory, ACPI, Graphics, and Display problems
- Investigate architecture requirements for next-gen SOC systems
- Debug hard hangs across CPU, memory, and I/O components
- Collaborate with hardware, firmware, and OS teams
- Utilize hardware debug tools (e.g., JTAG) and software diagnostics
- Collect and analyze crash dumps, logs, and signal traces
- Support validation and bring-up for new hardware platforms
- Provide feedback for design improvements
- Strong C programming skills
- Solid engineering background and good code methodology
- Good foundation in OS & RTOS concepts
- Develop and maintain kernel-mode drivers for Linux and WDDM-compliant drivers for Windows
- Experience in SOC platform engineering, development, integration, and bring-up
- Excellent problem-solving and cross-team collaboration skills
Good to Have
- SOC System level understanding
- Knowledge of tools like windbg, gdb, kernel tracing utilities
- Familiarity with graphics/display subsystems, DMA, interrupt handling, and memory management
- Proficiency in navigating diverse IPs within SOC platform
- Robust knowledge of memory management and coherency principles
Job Description
NVIDIA's invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as “the AI computing company”. We are looking to grow our company, and grow our teams with the smartest people in the world.
Systems Software supports NVIDIA's new high performance chipset business. They are generalists, with a deep understanding of today's PC architecture, including bus protocols such as PCI, AGP, USB and 1394, and CPU booting, arbitration, and cache management protocols. System engineers program down to the metal and are comfortable around hardware specs. They write software for Networking, Audio, IDE, SMBus and other hardware, and implement support for modern PC standards such as ACPI and OnNow.
What you’ll be doing:
- Developing and validating software for our next-generation SOC platform.
- Solve and debug problems related to Memory, ACPI, Graphics, and Display to support our customers, partners, and users.
- Investigating and scoping out the architecture requirement for the next-generation SOC systems.
- Investigate and debug hard hangs across various system components (CPU, memory, I/O).
- Collaborate closely with hardware, firmware, and OS teams to isolate root causes.
- Utilize hardware debug tools (e.g., JTAG) and software diagnostics.
- Collect and analyse crash dumps, logs, and signal traces.
- Support validation and bring-up activities for new hardware platforms.
- Provide feedback for design improvements.
What we need to see:
- Master’s degree in computer science, Computer Engineering or equivalent experience
- 7+ years of experience
- Strong C programming skills as well as having shown initiative in pursuing independent coding projects
- Solid engineering background and understanding of good code methodology
- Experience writing code to directly control hardware is a plus
- Good foundation and knowledge of OS & RTOS concepts.
- Develop and maintain kernel-mode drivers for Linux and WDDM-compliant drivers for Windows.
- SOC platform engineering, development/integration/bring up
- Excellent problem-solving and cross-team collaboration skills
Ways to stand out from the crowd:
- SOC System level understanding and good knowledge of tools like windbg, gdb, kernel tracing utilities.
- Familiarity with graphics/display subsystems, DMA, interrupt handling, and memory management.
- Proficient in navigating the functionalities of diverse IPs within the SOC platform, backed by a robust knowledge of memory management and coherency principles.
NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most brilliant and talented people on the planet working for us. If you're creative and autonomous, we want to hear from you!