Senior Software Developer, HPC Cluster Management

3 Months ago • 7 Years + • Administrative • $184,000 PA - $356,500 PA

Job Summary

Job Description

NVIDIA seeks experienced Senior Software Developers to contribute to Bright Cluster Manager, a Linux-based cluster management software powering thousands of clusters globally. Responsibilities include developing head/compute node installation and provisioning processes, edge site deployment functionality, hardware integration (GPUs, DPUs, high-speed interconnects), composable infrastructure management features, BIOS/firmware upgrade management, and expanding Bright's usability and scalability for diverse workloads and large-scale deployments. Additional responsibilities include adding support for new Linux distributions and alternative CPU architectures (like ARM), enhancing Ansible collections, and assisting the support team with customer requests.
Must have:
  • 7+ years software development experience
  • Proficient in Python and OOP
  • Strong Linux OS and networking knowledge
  • High-quality code production
  • Experience with concurrent programming
Good to have:
  • Ansible experience
  • High-performance computing & sysadmin experience
  • Knowledge of Kubernetes, AWS, Azure, etc.
  • C++ proficiency
Perks:
  • Equity
  • Benefits

Job Details

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you will be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Join the team and see how you can make a lasting impact on the world!

We have positions available for enthusiastic, hardworking and experienced software developers for working on our hardware integration and bare-metal provisioning related functionality in our Linux-based cluster management software environment. NVIDIA's Bright Cluster Manager is used to power thousands of Linux clusters around the world, varying from a few nodes to several thousands of nodes. Bright clusters can run on-premises, completely in the cloud, or in a hybrid environment.

What you’ll be doing:

  • Development of the head node and compute node installation and provisioning processes.

  • Work on functionality in the area of edge site deployment.

  • Integrating our product with the latest hardware (e.g GPUs, DPUs, accelerators, high-speed interconnects such as Infiniband).

  • Work on features related to composable infrastructure management.

  • Develop new features for our BIOS and firmware upgrade management.

  • Develop functionality that makes Bright clusters usable for a wider range of workloads, and increases scalability to allow clusters to scale to huge number of nodes.

  • Adding support for new Linux distributions.

  • Improving support for alternative CPU architectures such as ARM.

  • Work on adding features to our Ansible collections for Cluster Installation and Management.

  • Assist our support team with customer support requests in the above mentioned features and help our customers to use our product more efficiently.


What we need to see:

  • Degree in Computer Science or related field (or equivalent experience).

  • 7+ years of experience in software development and/or related roles.

  • Our software is based on Linux. You should be very familiar with the Linux operating system and in particular with networking concepts in Linux. In addition, good practical knowledge about the most common software that is installed as part of a typical Linux installation is required.

  • You are proficient in Python and intimately familiar with object oriented software design, design patterns, and concurrent programming techniques.

  • Emphasis on high quality of work and in producing clean code.

  • Eager to learn and use new technologies.

Ways to stand out from the crowd:

  • Experience with Ansible.

  • Experience with high-performance computing and system administration.

  • Knowledge of Kubernetes, AWS, Azure, GCE, OpenStack, Jenkins and distributed programming.

  • Proficiency in C++.

The base salary range is 184,000 USD - 356,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

RoofStack - Backend Developer

RoofStack

İstanbul, İstanbul, Türkiye (On-Site)
3 Weeks ago
GoTo Group - Principal SRE Engineer (SE5)

GoTo Group

Gurugram, Haryana, India (On-Site)
6 Months ago
Info Stretch - Senior Engineer

Info Stretch

Pune, Maharashtra, India (On-Site)
5 Months ago
ION - Senior C++ Developer, Italy

ION

Collecchio, Emilia-Romagna, Italy (On-Site)
6 Months ago
Enphase Energy - Senior Front-end Design (Drupal)

Enphase Energy

Bengaluru, Karnataka, India (On-Site)
4 Months ago
The Walt Disney Company - Security Host - 12 months contract

The Walt Disney Company

Hong Kong (On-Site)
4 Months ago
Mistplay - Bilingual Office Administrator II

Mistplay

Montreal, Quebec, Canada (On-Site)
6 Days ago
Feld Entertainment - Business Systems Administrator

Feld Entertainment

Ellenton, Florida, United States (On-Site)
5 Months ago
Nagarro - Lead SAP Basis Consultant for SAP RISE

Nagarro

Germany (Remote)
4 Weeks ago
Nintendo - Compensation Analyst

Nintendo

Redmond, Washington, United States (Hybrid)
1 Week ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Accurate - Senior Engineering Manager - Java

Accurate

Hyderabad, Telangana, India (Hybrid)
6 Months ago
Oriserve - Senior DevOps Engineer (4+ Yrs exp)

Oriserve

Noida, Uttar Pradesh, India (On-Site)
6 Months ago
Wargaming - DevOps Engineer (Deployment team)

Wargaming

Nicosia, Nicosia, Cyprus (On-Site)
1 Month ago
Riot Games - Senior Software Engineer - Matchmaking

Riot Games

United States (On-Site)
4 Days ago
The Walt Disney Company - Software Engineer II - ABC News Roku

The Walt Disney Company

New York, New York, United States (On-Site)
1 Week ago
Microsoft - Senior Cloud Network Engineer

Microsoft

(On-Site)
6 Days ago
Nagarro - Staff Engineer, Java Fullstack

Nagarro

Mexico (Remote)
6 Months ago
Hawk Eye Innovations - Senior Data Test Automation Engineer

Hawk Eye Innovations

Basingstoke, England, United Kingdom (Hybrid)
1 Day ago
Actian - C++ Engineer - Pune

Actian

Pune, Maharashtra, India (On-Site)
6 Months ago
The Walt Disney Company - Lead Software Engineer (Roku Engineer)

The Walt Disney Company

Santa Monica, California, United States (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

Jobs in California, United States

Microsoft - Research Intern - Azure Front Door

Microsoft

Redmond, Washington, United States (On-Site)
6 Days ago
Glean - Technical Support Engineer

Glean

Palo Alto, California, United States (On-Site)
5 Months ago
Corsair - Senior Director of Distribution

Corsair

Duluth, Georgia, United States (On-Site)
1 Week ago
Riot Games - Integration Specialist, Enterprise

Riot Games

Los Angeles, California, United States (On-Site)
5 Months ago
Aristocrat Gaming - Field Engineer I

Aristocrat Gaming

Tulsa, Oklahoma, United States (Remote)
3 Weeks ago
Jelly Smack - Join Our Talent Pipeline: Future Account Manager Roles

Jelly Smack

Los Angeles, California, United States (Hybrid)
6 Months ago
Google - Technical Program Manager III, Data Center Operations

Google

Atlanta, Georgia, United States (On-Site)
1 Week ago
Samsung Semiconductor - Customer Field Support Engineer (Contractor)

Samsung Semiconductor

Cedar Rapids, Iowa, United States (On-Site)
4 Months ago
On Location - Travel Consultant, After Hours Support

On Location

North Carolina, United States (Remote)
1 Day ago
ByteDance - Machine Learning Engineer - Pico Perception

ByteDance

San Jose, California, United States (On-Site)
1 Week ago

Get notifed when new similar jobs are uploaded

Administrative Jobs

Nagarro - Senior Cloud Developer/Architect

Nagarro

Germany (Remote)
2 Months ago
OKX - Senior Agent, Customer Service (Italian Speaker)

OKX

Budapest, Hungary (On-Site)
5 Months ago
Rackspace Technology - AWS Support Engineer I

Rackspace Technology

Gurugram, Haryana, India (Hybrid)
4 Weeks ago
Aristocrat Gaming - Order Fulfillment Operator - Production

Aristocrat Gaming

Barcelona, Catalonia, Spain (On-Site)
1 Month ago
EXUSIA - Salesforce Conga Specialist

EXUSIA

Hyderabad, Telangana, India (Remote)
2 Weeks ago
Scientific Games  - Helpdesk Tech I

Scientific Games

Alpharetta, Georgia, United States (On-Site)
1 Month ago
Dun & Bradstreet - Early Talent Network

Dun & Bradstreet

Jacksonville, Florida, United States (On-Site)
6 Months ago
InvenioLSI - SAP HCM Senior Associate Consultant

InvenioLSI

New Delhi, Delhi, India (On-Site)
2 Weeks ago
Techland - Junior Office Assistant

Techland

Warsaw, Masovian Voivodeship, Poland (On-Site)
2 Weeks ago
Scientific Games  - Data Center Technician II

Scientific Games

Middletown, Pennsylvania, United States (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug