Distinguished Engineer – Data Center System Software Architect

6 Days ago • 20 Years + • Research & Development • $308,000 PA - $471,500 PA

Job Summary

Job Description

NVIDIA seeks a Distinguished Engineer – Data Center System Software Architect to lead the end-to-end architecture of data center systems (DGX, HGX). This role involves serving as the primary technical contact for major customers, leading technological discussions, defining KPIs, and gathering requirements. The architect will drive technical innovation, collaborating with hyperscalers to design next-generation products, aligning NVIDIA's roadmap with customer needs, and developing new technologies and protocols. Responsibilities include making critical technical decisions, mitigating risks, and leading cross-functional projects. Deep expertise in server system architecture, system software for accelerators, firmware, Linux kernel, networking, and security is essential.
Must have:
  • Deep expertise in scalable server architecture
  • Extensive experience with system software for accelerators
  • Mastery of system firmware, embedded systems, and Linux kernel
  • Proficiency in Out-of-Band and In-Band management
  • Extensive knowledge of networking technologies and protocols
  • Experience leading complex, cross-functional projects
Good to have:
  • Knowledge of cloud and cluster management systems
  • Participation in OCP and DMTF standards bodies
  • Familiarity with NVIDIA HPC programming models
  • Knowledge of enterprise storage architectures

Job Details

NVIDIA data center systems, such as DGX and HGX, have become core to NVIDIA's rapidly growing enterprise and cloud provider businesses. These platforms bring together the full power of NVIDIA GPUs, NVIDIA NVLink, NVIDIA InfiniBand networking, NVIDIA Grace CPUs, and a fully optimized NVIDIA AI and HPC software stack. We’re looking for a strong technical architect to own the end-to-end architecture of these products, at the system software level.

Including firmware, kernel drivers, operating systems, and user mode drivers. You will work with component leads internally and engage with industry leading cloud service providers on taking these products to market.

What you’ll be doing:

  • Serve as the primary technical point of contact for major customers, leading technological discussions, defining KPIs, gathering requirements, and addressing complex technical queries.

  • As a system software architect, lead technical innovation and strategic collaborations with major hyperscalers to architect next-generation data center products.

  • Align NVIDIA's roadmap with major customers' requirements through direct engagement.

  • Develop and drive adoption of new technologies and protocols.

  • Make critical technical decisions in ambiguous situations, mitigating risks through left-shift strategies.

What we need to see:

  • Deep expertise in scalable and performant server system architecture, focusing on SW/HW interfaces.

  • Extensive experience with complex system software for accelerators (GPUs, DPUs, FPGAs).

  • Mastery of system firmware (SBIOS, OpenBMC), embedded systems, and Linux kernel internals.

  • Proficiency in Out-of-Band and In-Band management architectures, device management protocols (e.g., MCTP, PLDM, SPDM, RDE) and system management protocols (Redfish, IPMI).

  • Extensive knowledge of networking technologies and protocols, including TCP/IP, Ethernet, InfiniBand, as well as advanced switching and routing concepts

  • Experience collaborating with platform security experts to define tradeoffs between security and ease of use.

  • Demonstrated success in leading complex, cross-functional projects to completion, showcasing the ability to influence and achieve results without direct authority in large-scale, collaborative environments. Demonstrable experience in implementing left shift strategy to de-risk program execution.

  • BS or MS degree in Computer Science, Electrical Engineering or related field (or equivalent experience).

  • 20+ years in the area of System architecture and design.

Ways to stand out from the crowd:

  • Knowledge of cloud and cluster level deployment and management systems. Participation and contributions in standards bodies such as OCP and DMTF.

  • Familiarity with NVIDIA HPC programming models and libraries (CUDA, cuDNN, DOCA)

  • Knowledge of enterprise storage architectures and distributed parallel processing paradigms

NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're creative, passionate and self-motivated, we want to hear from you!

NVIDIA’s invention of the GPU in 1999 fueled the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited modern deep learning — the next era of computing — with the GPU acting as the brain of computers, robots, and self-driving cars that can perceive and understand the world. Today, we are increasingly known as “the AI computing company.” We're looking to grow our company and establish teams with the most thoughtful people in the world. Are you ready to change the next generation of computing? Join us at the forefront of technological advancement.

The base salary range is 308,000 USD - 471,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

ByteDance - Student Researcher (Doubao (Seed) - Foundation Model, Speech & Audio) - 2024 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
Granicus - Data Scientist 4

Granicus

Bengaluru, Karnataka, India (Hybrid)
4 Months ago
ByteDance - Research Scientist Intern in Foundation Models for Science (ByteDance Research) - 2025 Summer/Fall (PhD)

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
NVIDIA - Senior Computer Architect - Deep Learning

NVIDIA

Santa Clara, California, United States (On-Site)
1 Month ago
ByteDance - Research Scientist in Foundation Models for Science - ByteDance Research

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
NVIDIA - Software Engineering Manager - GPU Communications Libraries

NVIDIA

Santa Clara, California, United States (On-Site)
1 Week ago
bosh group india - Circuit Analysis Engineer - Team Lead

bosh group india

Bengaluru, Karnataka, India (On-Site)
2 Weeks ago
Google - Hardware Engineering Intern, 2025

Google

New Taipei City, Taiwan (On-Site)
1 Month ago
NVIDIA - Memory Solutions Engineer

NVIDIA

Bengaluru, Karnataka, India (On-Site)
1 Month ago
NVIDIA - Senior ASIC Front End Infrastructure Engineer

NVIDIA

Santa Clara, California, United States (Hybrid)
1 Week ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Axon - Axon AI - Senior Machine Learning Scientist I

Axon

Seattle, Washington, United States (Remote)
6 Days ago
Microsoft - Research Intern - Algorithms Group: Deep learning

Microsoft

Mountain View, California, United States (On-Site)
1 Month ago
NVIDIA - Senior Timing Methodology Engineer

NVIDIA

Santa Clara, California, United States (On-Site)
1 Month ago
PlayStation Global - Senior Data Scientist

PlayStation Global

Carlsbad, California, United States (Remote)
1 Week ago
Luxoft - Senior/Lead Machine Learning and Image Processing Specialist

Luxoft

Italy, New York, United States (Remote)
3 Months ago
ByteDance - Software Engineer, ML System Architecture

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
NVIDIA - Senior Post Silicon Hardware Engineer

NVIDIA

Santa Clara, California, United States (Hybrid)
1 Month ago
ByteDance - Research Scientist Graduate (Foundation Model - Generative AI) - 2025 Start (PhD)

ByteDance

Seattle, Washington, United States (On-Site)
2 Months ago
ByteDance - Lead Research Scientist, Foundation Model, Music Intelligence

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
Meta - Postdoctoral Researcher, Embodied AI (PhD)

Meta

Seattle, Washington, United States (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Santa Clara, California, United States

ByteDance - Senior Backend Software Engineer - Global E-Commerce Supply Chain Billing & Settlement

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
Activision - Principal Character Concept Artist - Treyarch (Los Angeles)

Activision

Los Angeles, California, United States (On-Site)
3 Months ago
ByteDance - Senior Backend Software Engineer - Global E-Commerce Supply Chain

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
ByteDance - Senior Software Engineer, Traffic Platform

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
Warner Bros Discovery - Staff Product Designer - Video & Ads

Warner Bros Discovery

New York, New York, United States (On-Site)
2 Months ago
Interactive Brokers - Senior Systems Engineer- Microsoft M365/Active Directory

Interactive Brokers

Greenwich, Connecticut, United States (Hybrid)
4 Months ago
Lionsgate Games - Sr. Coordinator, Business & Legal Affairs

Lionsgate Games

Santa Monica, California, United States (On-Site)
1 Month ago
Netflix - Senior Software Engineer - Growth Foundations

Netflix

United States (Remote)
3 Months ago
Axon - Director of Enterprise Sales

Axon

San Francisco, California, United States (Remote)
6 Days ago
Trek - Service Technician (Part-Time)

Trek

Alamo, California, United States (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Research & Development Jobs

NVIDIA - Distinguished Software Architect - Deep Learning and HPC Communications

NVIDIA

Santa Clara, California, United States (On-Site)
3 Weeks ago
Rockstar Games - Senior Production Coordinator, Creator Platform

Rockstar Games

Leeds, England, United Kingdom (On-Site)
5 Months ago
Fluence - Sr. Software Architect (m/f/d)

Fluence

Berlin, Berlin, Germany (On-Site)
3 Months ago
ByteDance - Student Researcher (Foundation Models - Reasoning, Planning & Agent) - Doubao (Seed) - 2025 Start (PhD)

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
ByteDance - Experienced Technical Lead - Edge Cloud Infrastructure - San Jose / Seattle / Boston

ByteDance

Boston, Massachusetts, United States (On-Site)
3 Months ago
Netflix - Senior Researcher - Netflix Experiences

Netflix

Los Gatos, California, United States (On-Site)
3 Months ago
NVIDIA - Senior Physical Design Verification Layout Engineer

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
1 Month ago
Riot Games - Principal Software Engineer, Foundations Developer Experience & Workflows

Riot Games

Los Angeles, California, United States (On-Site)
4 Months ago
NVIDIA - Silicon Solutions Engineer - NPQ

NVIDIA

Bengaluru, Karnataka, India (Hybrid)
1 Month ago
Riot Games - Staff Data Scientist - Anti-Cheat

Riot Games

Dublin, County Dublin, Ireland (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.


Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Shenzhen, Guangdong Province, China (On-Site)

Bengaluru, Karnataka, India (On-Site)

Taipei City, Taiwan (On-Site)

Taipei City, Taiwan (On-Site)

Shanghai, Shanghai, China (On-Site)

Shanghai, Shanghai, China (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug