Senior Software Architect - Data Center Systems

3 Months ago • 10-8 Years • Full Stack Development • $224,000 PA - $425,500 PA

Job Summary

Job Description

NVIDIA's Data Center SW team is seeking a Senior Software Architect to lead software activities for deep learning server platforms. This role involves designing the system architecture for a complex server platform, collaborating with cross-functional teams, working directly with major customers to understand their requirements, developing a roadmap of new technologies, and mentoring engineering teams. The ideal candidate possesses deep experience in designing scalable and performant server systems, particularly at the SW/HW interface, and a strong understanding of HPC or Deep Learning workloads. Responsibilities include driving software solutions from design to production, partnering with application software, libraries, system software, and firmware teams, and working with business partners and vendors to shape products to meet NVIDIA's needs. This position requires excellent communication skills and a BS or MS degree in a related field.
Must have:
  • Deep experience in server system architecture design
  • Understanding of HPC or Deep Learning workloads
  • Expertise in Out of Band and In-band management
  • Experience implementing left shift strategies
  • Excellent communication skills
Good to have:
  • Knowledge of cloud and cluster deployment
  • Familiarity with device management protocols (Redfish, IPMI, etc.)
  • Knowledge of storage and networking technologies

Job Details

We are building innovative server systems for GPU-accelerated applications such as Deep Learning. The Data Center SW team architects and develops the end-to-end software and firmware stack for these systems. We are looking for a Senior Software Architect who has deep expertise in designing server platforms and also understands application use cases in Deep Learning workloads. You will work with world-class engineering teams, product management, Operations, and Customer Support to build systems that will truly delight our customers.

What you’ll be doing:

  • Lead software activities for NVIDIA's deep learning server platforms from design through production, collaborating with teams across the company to deliver software solutions.

  • Drive the system architecture for a complex server platform in a multi-functional environment.

  • Partner across application software, libraries, system software, and firmware teams to design complete software solutions for new server platforms.

  • Work directly with major customers to understand their requirements and work to align their roadmap with NVIDIA’s roadmap.

  • Work with business partners and vendors to shape their products to meet NVIDIA’s needs.

  • Develop a roadmap of new technologies and protocols and drive their design and adoption.

  • Mentor architects and engineering teams to grow them into future leaders.

  • Make key technical decisions for designs involving complex inter-component dependencies.

What we need to see:

  • Deep experience in designing architecture for scalable and performant server systems, particularly at the SW/HW interface.

  • Understanding of HPC or Deep Learning workloads and use of accelerated computing platforms.

  • Expertise in Out of Band and In-band management architectures.

  • Knowledge of server system architecture and implications of architecture decisions on overall performance of end applications.

  • Demonstrable experience in implementing a left shift strategy to de-risk program execution.

  • Excellent written and verbal communication skills.

  • BS or MS degree in Computer Engineering, Computer Science, or a related field, or equivalent experience.

  • 10+ years in the area of System architecture and design.

Ways to stand out from the crowd:

  • Knowledge of cloud and cluster level deployment and management systems.

  • Strong background in device management protocols such as Redfish, IPMI, MCTP, PLDM, and RDE.

  • Knowledge of storage and networking technologies.

NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing, and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions, from artificial intelligence to autonomous cars. NVIDIA is looking for great people like you to help us accelerate the next wave of artificial intelligence.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. Are you creative and autonomous? Do you love a challenge? If so, we want to hear from you. Come, join our Data center server systems team and help build the real-time, cost-effective computing platform driving our success in this exciting and quickly growing field.

The base salary range is 224,000 USD - 425,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.
