Distinguished Engineer – Data Center System Software Architect

1 Month ago • 20 Years + • Research & Development • $308,000 PA - $471,500 PA

Job Summary

Job Description

NVIDIA seeks a Distinguished Engineer – Data Center System Software Architect to lead the architecture of their DGX and HGX data center systems. This role involves owning the end-to-end system software architecture, encompassing firmware, kernel drivers, operating systems, and user-mode drivers. The architect will be the primary technical contact for major customers, leading technical discussions, defining KPIs, gathering requirements, and resolving complex technical issues. Collaboration with hyperscalers to architect next-generation products and drive the adoption of new technologies are key responsibilities. The ideal candidate possesses deep expertise in server system architecture, system software for accelerators, and system management protocols. Experience leading complex projects and implementing left-shift strategies is essential.
Must have:
  • Deep expertise in scalable server system architecture
  • Extensive experience with complex system software for accelerators
  • Mastery of system firmware, embedded systems, and Linux kernel internals
  • Proficiency in Out-of-Band and In-Band management architectures
  • Extensive knowledge of networking technologies and protocols
Good to have:
  • Knowledge of cloud and cluster level deployment and management systems
  • Participation in standards bodies such as OCP and DMTF
  • Familiarity with NVIDIA HPC programming models and libraries
  • Knowledge of enterprise storage architectures

Job Details

NVIDIA data center systems, such as DGX and HGX, have become core to NVIDIA's rapidly growing enterprise and cloud provider businesses. These platforms bring together the full power of NVIDIA GPUs, NVIDIA NVLink, NVIDIA InfiniBand networking, NVIDIA Grace CPUs, and a fully optimized NVIDIA AI and HPC software stack. We’re looking for a strong technical architect to own the end-to-end architecture of these products, at the system software level.

Including firmware, kernel drivers, operating systems, and user mode drivers. You will work with component leads internally and engage with industry leading cloud service providers on taking these products to market.

What you’ll be doing:

  • Serve as the primary technical point of contact for major customers, leading technological discussions, defining KPIs, gathering requirements, and addressing complex technical queries.

  • As a system software architect, lead technical innovation and strategic collaborations with major hyperscalers to architect next-generation data center products.

  • Align NVIDIA's roadmap with major customers' requirements through direct engagement.

  • Develop and drive adoption of new technologies and protocols.

  • Make critical technical decisions in ambiguous situations, mitigating risks through left-shift strategies.

What we need to see:

  • Deep expertise in scalable and performant server system architecture, focusing on SW/HW interfaces.

  • Extensive experience with complex system software for accelerators (GPUs, DPUs, FPGAs).

  • Mastery of system firmware (SBIOS, OpenBMC), embedded systems, and Linux kernel internals.

  • Proficiency in Out-of-Band and In-Band management architectures, device management protocols (e.g., MCTP, PLDM, SPDM, RDE) and system management protocols (Redfish, IPMI).

  • Extensive knowledge of networking technologies and protocols, including TCP/IP, Ethernet, InfiniBand, as well as advanced switching and routing concepts

  • Experience collaborating with platform security experts to define tradeoffs between security and ease of use.

  • Demonstrated success in leading complex, cross-functional projects to completion, showcasing the ability to influence and achieve results without direct authority in large-scale, collaborative environments. Demonstrable experience in implementing left shift strategy to de-risk program execution.

  • BS or MS degree in Computer Science, Electrical Engineering or related field (or equivalent experience).

  • 20+ years in the area of System architecture and design.

Ways to stand out from the crowd:

  • Knowledge of cloud and cluster level deployment and management systems. Participation and contributions in standards bodies such as OCP and DMTF.

  • Familiarity with NVIDIA HPC programming models and libraries (CUDA, cuDNN, DOCA)

  • Knowledge of enterprise storage architectures and distributed parallel processing paradigms

NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're creative, passionate and self-motivated, we want to hear from you!

NVIDIA’s invention of the GPU in 1999 fueled the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern deep learning — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as “the AI computing company.” We're looking to grow our company and establish teams with the most thoughtful people in the world. Are you ready to change the next generation of computing? Join us at the forefront of technological advancement.

The base salary range is 308,000 USD - 471,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

Tencent - Artificial General Intelligence Research Internship

Tencent

Washington, United States (On-Site)
2 Months ago
Zoox - Staff/Senior Staff Software Engineer, ML Performance Optimization

Zoox

Foster City, California, United States (On-Site)
6 Months ago
NVIDIA - Director of AI Research

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
2 Months ago
NVIDIA - Software Engineer Intern, Autonomous Vehicle - 2025

NVIDIA

Shenzhen, Guangdong Province, China (On-Site)
3 Months ago
ByteDance - Ad Delivery Algorithm Intern - Game

ByteDance

Singapore (On-Site)
1 Month ago
Krafton  - PUBG Mobile Marketing Manager (Korea)

Krafton

Seoul, South Korea (On-Site)
1 Month ago
Framestore - Machine Learning Developer - London Launchpad Internship 2025

Framestore

London, England, United Kingdom (On-Site)
1 Month ago
Samsung Semiconductor - Senior Staff Engineer, SoC Power Architect

Samsung Semiconductor

San Jose, California, United States (Hybrid)
3 Months ago
Google - Senior Software Engineer, Machine Learning, YouTube

Google

San Bruno, California, United States (On-Site)
4 Months ago
NVIDIA - Signal and Power Integrity Engineer (RDSS Intern)

NVIDIA

Taipei City, Taiwan (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

NVIDIA - Principal DGX Cloud Machine Learning Architect

NVIDIA

Canada (On-Site)
2 Months ago
DNEG - Head of Machine Learning

DNEG

London, England, United Kingdom (Remote)
1 Month ago
Meta - Software Engineer (Leadership) - Machine Learning

Meta

Burlingame, California, United States (Remote)
5 Months ago
NVIDIA - Senior Math Libraries Engineers - Python APIs

NVIDIA

Santa Clara, California, United States (Remote)
2 Months ago
Canva - Research Engineering Manager - Image Generation (m/f/x) - Canva Austria

Canva

Vienna, Vienna, Austria (Remote)
5 Months ago
Tencent - IaaS Product Solution Architect

Tencent

(On-Site)
1 Month ago
Dolby Laboratories - AIOps Research Scientist

Dolby Laboratories

Bengaluru, Karnataka, India (Hybrid)
6 Months ago
Krafton  - Applied Research Scientist/Engineer - LM/Agent

Krafton

Seoul, South Korea (On-Site)
1 Month ago
NVIDIA - Mixed Signal Design Engineer - New College Grad 2025

NVIDIA

Santa Clara, California, United States (On-Site)
2 Months ago
NVIDIA - Offensive Hardware Security Researcher

NVIDIA

Canada (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Canada

Rovio Entertainment Corporation - Senior UI/UX Artist

Rovio Entertainment Corporation

Toronto, Ontario, Canada (Hybrid)
3 Months ago
Larian Studios - DevOps Developer Intern

Larian Studios

Quebec, Canada (On-Site)
2 Months ago
NVIDIA - Senior Firmware Engineer - Embedded Controller

NVIDIA

Canada (On-Site)
1 Month ago
Epic Games - Senior QA Engineer

Epic Games

Vancouver, British Columbia, Canada (On-Site)
3 Months ago
Highspot - Principal Frontend Web Engineer

Highspot

Vancouver, British Columbia, Canada (Hybrid)
6 Months ago
Next Level Games - Rendering Engineer

Next Level Games

Vancouver, British Columbia, Canada (Hybrid)
6 Months ago
Epic Games - Artiste UI sénior

Epic Games

Montreal, Quebec, Canada (On-Site)
4 Months ago
NvizzioCreations - Programmeur(euse) Senior - Unreal

NvizzioCreations

Québec City, Quebec, Canada (On-Site)
6 Months ago
PwC - Azure Data Engineer, Manager (Security clearance required)

PwC

Ottawa, Ontario, Canada (On-Site)
5 Months ago
Blazesoft - Investment Analyst

Blazesoft

Vaughan, Ontario, Canada (On-Site)
1 Year ago

Get notifed when new similar jobs are uploaded

Research & Development Jobs

Tesla - Mechanical Design Team Lead

Tesla

Rhineland-Palatinate, Germany (On-Site)
2 Months ago
NVIDIA - System Software Engineer - Base OS (RDSS Intern)

NVIDIA

Taipei City, Taiwan (On-Site)
3 Months ago
Samsung Semiconductor - Principal Engineer, AI/ML Software Compiler

Samsung Semiconductor

San Jose, California, United States (Hybrid)
1 Month ago
Samsung Semiconductor - Senior Engineer, System Software

Samsung Semiconductor

San Jose, California, United States (On-Site)
1 Month ago
NVIDIA - Senior GPU Kernel Performance Lead

NVIDIA

Canada (On-Site)
2 Months ago
Rivos - SOC Electrical Analysis Engineer - Full Time

Rivos

Bengaluru, Karnataka, India (Hybrid)
6 Months ago
ByteDance - Machine Learning Research Scientist, AI for Science

ByteDance

Seattle, Washington, United States (On-Site)
4 Months ago
NVIDIA - DFX Software Engineer (RDSS Intern)

NVIDIA

Hsinchu, Hsinchu City, Taiwan (On-Site)
2 Months ago
NVIDIA - Senior Firmware Engineer - Memory Subsystem

NVIDIA

Canada (On-Site)
2 Months ago
NVIDIA - Senior Software Engineer

NVIDIA

Taipei City, Taiwan (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Massachusetts, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Texas, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug