Senior Software Engineer - Data Center System Bringup

2 Months ago • 7-8 Years • Full Stack Development • $184,000 PA - $356,500 PA

Job Summary

Job Description

As a Senior Software Engineer - Data Center System Bringup at NVIDIA, you'll lead software and firmware debug and bringup efforts for powerful server systems (HGX, DGX, MGX). You'll collaborate with CSP partners to bring up and stabilize their new server systems, working with matrix teams on firmware and software across the entire stack. Responsibilities include debugging, triaging bugs during CSP server system bringup, working directly with major customers to solve complex technical issues, refining solutions for flawless server integration in large-scale data centers, and collaborating with product and documentation teams. The role requires expertise in system bringup, debugging, firmware/software development, and out-of-band/in-band management architectures.
Must have:
  • System bringup & debugging expertise
  • 7+ years in system software/firmware
  • Firmware/software development skills
  • End-to-end stack & server architecture understanding
  • Out-of-band & in-band management expertise
  • Excellent collaboration & communication skills
Good to have:
  • Redfish, IPMI, PLDM knowledge
  • PCIe, memory management, or networking stack understanding
Perks:
  • Equity
  • Benefits

Job Details

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.

We are building innovative server systems for GPU-accelerated applications, such as Deep Learning. As a Senior Software Engineer - Data Center System Bringup, you will play a crucial role in our innovative team, leading the software and firmware debug and bringup efforts of our powerful server systems like HGX, DGX and MGX. We are looking for Senior Firmware / System Software engineers who will work closely with our CSP partners to bring up and stabilize their new server systems. This is an outstanding chance to collaborate with exceptional professionals on groundbreaking technology that pushes the limits of what can be achieved.

What you'll be doing:

  • Spearhead debug, bringup, and bug triage during the CSP's server system bringup.

  • Collaborate closely with matrix teams to work on firmware and software across the entire stack.

  • Work directly with major customers to solve complex technical issues.

  • Refine and stabilize solutions that ensure flawless integration of server products in large-scale data centers and improve the performance and reliability of our data center systems.

  • Work closely with product experience and documentation teams to ensure true customer delight.

What we need to see:

  • Bachelor’s or Master’s degree in Computer Science, Electrical Engineering, or a related field, or equivalent experience.

  • Demonstrated ability in system bringup, debugging, and handling complex hardware and software issues, with 7+ years of experience in system software or firmware.

  • Outstanding skills in firmware and software development with a deep understanding of end-to-end stack and server system architecture.

  • Expertise in out-of-band and in-band management architectures.

  • Strong ability to collaborate and communicate effectively within a diverse team of engineers.

  • A track record of delivering high-quality, innovative solutions in a fast-paced, dynamic environment.

Ways to stand out from the crowd:

  • Strong background in device management protocols such as Redfish, IPMI, PLDM.

  • In-depth understanding of PCIe, memory management, or the networking stack.

NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High-Performance Computing, and Visualization. Our invention serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars. NVIDIA is looking for great people like you to help us accelerate the next wave of artificial intelligence.

The base salary range is 184,000 USD - 356,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

ByteDance - Backend Software Engineer, AI Applications

ByteDance

San Jose, California, United States (On-Site)
1 Month ago
NVIDIA - Technical Marketing Engineer - AI Platform Software

NVIDIA

Canada (Hybrid)
2 Months ago
neural concept - ML Platform Deployment Engineer

neural concept

(Remote)
3 Weeks ago
NVIDIA - Senior System Software Engineer – DC Platform Software Tools

NVIDIA

Santa Clara, California, United States (On-Site)
2 Months ago
Google - Customer Engineer, Applied and Generative AI, Google Cloud

Google

Singapore, Singapore (On-Site)
1 Month ago
Haptic - Senior Fullstack Developer

Haptic

Paris, Île-de-France, France (Remote)
5 Months ago
Google - Early Career Software Engineer, Black Community Inclusion

Google

State Of Minas Gerais, Brazil (On-Site)
5 Months ago
Ajmera Infotech - Senior React Expert

Ajmera Infotech

Hyderabad, Telangana, India (On-Site)
6 Months ago
Next Level Business Services - Azure Services developer

Next Level Business Services

Redmond, Washington, United States (On-Site)
7 Months ago
Vigaet - Full Stack Developer Internship

Vigaet

(On-Site)
7 Months ago

Similar Skill Jobs

ByteDance - Student Researcher (Doubao (Seed) - Foundation Model - Generative AI)

ByteDance

Seattle, Washington, United States (On-Site)
1 Month ago
ByteDance - Student Researcher (Doubao (Seed) - Foundation Model - Video Generation) - 2025 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
7 Months ago
Rackspace Technology - Principal MLOps Engineer

Rackspace Technology

Toronto, Ontario, Canada (Remote)
2 Months ago
ByteDance - Research Scientist, Foundation Model, Speech Understanding

ByteDance

Seattle, Washington, United States (On-Site)
7 Months ago
NVIDIA - AI Computing Software Development Engineer, TensorRT

NVIDIA

Shanghai, Shanghai, China (On-Site)
4 Months ago
Eleven Labs - Machine Learning Researcher

Eleven Labs

Poland (Remote)
2 Months ago
Canva - Engineering Manager - Design Generation

Canva

Sydney, New South Wales, Australia (Hybrid)
7 Months ago
ByteDance - LLM Software Engineer/Researcher (Applied Machine Learning)

ByteDance

Seattle, Washington, United States (On-Site)
1 Month ago
NVIDIA - Principal Engineer

NVIDIA

(Remote)
3 Months ago

Jobs in Canada

PlayStation Global - Programmeur·euse Senior – Jouabilité/Senior Gameplay Programmer

PlayStation Global

Montreal, Quebec, Canada (On-Site)
5 Months ago
Reddit - Client Account Manager, Mid-Market (Services)

Reddit

Toronto, Ontario, Canada (On-Site)
1 Month ago
Rockstar Games - Associate Animator: Gameplay

Rockstar Games

Oakville, Ontario, Canada (On-Site)
2 Months ago
Ubisoft - Scientifique en données ML Senior _ Groupe Technologique Content Creation

Ubisoft

Montreal, Quebec, Canada (On-Site)
4 Months ago
People Can Fly - Community Manager

People Can Fly

Montreal, Quebec, Canada (Remote)
2 Months ago
Lucky VR - Technical Animator

Lucky VR

Canada (Remote)
4 Months ago
ZeniMax Media - Animateur.trice de créatures / Animator (Creatures)

ZeniMax Media

Montreal, Quebec, Canada (On-Site)
8 Months ago
Airlab Inc - Artificial Intelligence Researcher

Airlab Inc

Quebec, Canada (On-Site)
2 Months ago
Activision - Senior Lead Producer

Activision

Montreal, Quebec, Canada (On-Site)
1 Month ago
Ubisoft - Senior Textures Artist [Prince of Persia Remake]

Ubisoft

Montreal, Quebec, Canada (Hybrid)
3 Weeks ago

Full Stack Development Jobs

Google - Software Engineer III, Infrastructure and Operations

Google

Kraków, Lesser Poland Voivodeship, Poland (On-Site)
1 Month ago
Pocket Worlds - Staff Full-Stack Engineer (Backend Leaning)

Pocket Worlds

Warsaw, Masovian Voivodeship, Poland (On-Site)
2 Months ago
Consilio LLC - Software Developer

Consilio LLC

Bengaluru, Karnataka, India (On-Site)
6 Months ago
Meta - Software Engineering Manager, Product Infrastructure

Meta

Seattle, Washington, United States (Remote)
6 Months ago
ByteDance - Software Engineer Intern (CDN/Edge/Traffic Platform)

ByteDance

San Jose, California, United States (On-Site)
1 Month ago
Microsoft - Technical Support Engineer – Web Technologies

Microsoft

Seoul, South Korea (Remote)
1 Month ago
Nagarro - Principal Engineer, .Net Web

Nagarro

New York, New York, United States (On-Site)
7 Months ago
Google - Software Engineer III, Infrastructure, Google Cloud Security and Privacy

Google

San Francisco, California, United States (On-Site)
7 Months ago
Rebellion - Senior Online Developer - Tech Team

Rebellion

Oxford, England, United Kingdom (Hybrid)
2 Months ago
Meta - Software Engineer, Infrastructure

Meta

Menlo Park, California, United States (Remote)
7 Months ago

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.
