Senior Software Architect, AI and HPC

6 Days ago • 8 Years + • Research & Development • $184,000 PA - $356,500 PA

Job Summary

Job Description

NVIDIA seeks a Senior Software Architect, AI and HPC to contribute to the research, development, and deployment of solutions in networking hardware, programming environments, and system software. Responsibilities include creating proofs-of-concept for AI frameworks (PyTorch/NEMO), HPC programming models (MPI, OpenSHMEM, PGAS), and new runtime/hardware features. The role also involves designing and implementing features for AI and HPC communication middleware (NCCL, Open MPI, UCX, UCC, NVSHMEM) and Deep Learning frameworks (TensorFlow/PyTorch), as well as enhancing compiler features to support the NVIDIA networking ecosystem. The ideal candidate possesses extensive experience in parallel programming, network architecture, algorithm design, and system programming, along with strong software development skills.
Must have:
  • Ph.D., Masters, or Bachelors in CS/related field
  • 8+ years parallel programming/network architecture experience
  • Algorithm design, system programming, computer architecture skills
  • Strong programming & software development skills
Good to have:
  • Experience with communication middleware for HPC systems
  • Compiler feature development/optimization experience (Clang/LLVM)
  • Experience implementing communications libraries (MPI, OpenSHMEM, etc.)
  • CUDA programming and NVIDIA GPU experience
Perks:
  • Competitive salary
  • Generous benefits package

Job Details

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people.

Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. The software architecture group at NVIDIA has openings for software architects in the field of AI and high-performance networking and system software. We research, develop, and deploy solutions in networking hardware, programming environments, and system software to make current and future high-end computer systems more performant, scalable, and usable. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.

What you will be doing

  • Creating proofs-of-concept to evaluate and motivate extensions in AI Frameworks (PyTorch/NEMO), HPC programming models (MPI, OpenSHMEM, PGAS), new runtime designs, and new network hardware features.

  • Research, design and implement features for AI and HPC communication middleware (NCCL, Open MPI, UCX, UCC, NVSHMEM), and Deep Learning frameworks such as TensorFlow/Pytorch.

  • Review , design, and implement features to enhance compiler features to support the NVIDIA networking ecosystem.

  • Research, design and develop hardware features relevant to scientific, Deep learning, and data-intensive workloads.

What we need to see

  • Ph.D, Masters, or Bachelors in , in computer science, computer engineering, or a closely related field or equivalent experience.

  • 8+ years of experience in parallel programming models, and/or network architecture.

  • Background in algorithm design, system programming, and computer architecture

  • Strong programming and software development skills

  • Ability and flexibility to work and communicate effectively in a multi-national, multi-time-zone corporate environment

Ways to stand out from the crowd:

  • Deep understanding of technology and passion for what you do

  • Strong collaborative and interpersonal skills, specifically a proven ability to effectively guide and influence within a dynamic matrix environment

  • Background with designing communication middleware for high-performance computing systems, including InfiniBand, DPUs, Ethernet, and Shared Memory

  • Experience developing and implementing features for compilers, optimizations for compilers, particularly Clang/LLVM, and NVIDIA compilers and experience implementing communications libraries, particularly MPI, OpenSHMEM, NCCL, NVSHMEM, UCX, UCC, or PGAS

  • Background with CUDA programming and NVIDIA GPUs and programming models for emerging architectures including hierarchical heterogeneous memory systems and accelerators.

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you.

The base salary range is 184,000 USD - 356,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

Ubisoft - Scientifique en données ML Senior _ Groupe Technologique Content Creation

Ubisoft

Montreal, Quebec, Canada (On-Site)
3 Weeks ago
Coursera - AI Specialist

Coursera

India (Remote)
2 Weeks ago
Samsung Semiconductor - Senior Engineer, AI

Samsung Semiconductor

San Jose, California, United States (Hybrid)
4 Months ago
ByteDance - Software Engineer Large Model System Graduate (Machine Learning Sys-US) - 2024 Start (BS/MS)

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
Evolution - Data Scientist

Evolution

Warsaw, Masovian Voivodeship, Poland (Hybrid)
6 Months ago
NVIDIA - Solution Architect - CSP Cloud

NVIDIA

Beijing, Beijing, China (On-Site)
1 Month ago
bosh group india - Circuit Analysis Engineer - Team Lead

bosh group india

Bengaluru, Karnataka, India (On-Site)
2 Weeks ago
Google - Software Engineering Manager (For Women in Tech Candidates)

Google

State Of Minas Gerais, Brazil (On-Site)
1 Month ago
Samsung Semiconductor - Intern, Logic Pathfinding Lab Advanced DTCO Engineer

Samsung Semiconductor

San Jose, California, United States (On-Site)
3 Weeks ago
NVIDIA - Senior AI Cluster Tools Developer

NVIDIA

Santa Clara, California, United States (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

ByteDance - Research Engineer in Large Model System

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
ASSIST Software - AI Engineer

ASSIST Software

Suceava, Suceava County, Romania (Remote)
3 Months ago
Digitap AI - Senior Data Scientist

Digitap AI

Bengaluru, Karnataka, India (On-Site)
5 Months ago
Ubisoft - Senior Machine Learning Engineer H/F/NB

Ubisoft

Paris, Île-de-France, France (On-Site)
8 Months ago
SatSure - Senior Machine Learning Researcher

SatSure

Bengaluru, Karnataka, India (On-Site)
5 Months ago
Rackspace Technology - Senior MLOPs Engineer (Canada)

Rackspace Technology

Toronto, Ontario, Canada (Remote)
4 Months ago
Attentive - Senior Machine Learning Engineer

Attentive

New York, New York, United States (Hybrid)
4 Months ago
ByteDance - Software Engineer in ML Engineering Platform

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
Paypal - Senior Machine Learning Engineer

Paypal

San Jose, California, United States (Hybrid)
3 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Santa Clara, California, United States

Trek - Sales Associate

Trek

New York, New York, United States (On-Site)
2 Months ago
The Walt Disney Company - Director, Marine Operations

The Walt Disney Company

Celebration, Florida, United States (On-Site)
5 Days ago
SciPlay - Director of Product

SciPlay

Cedar Falls, Iowa, United States (Hybrid)
2 Months ago
Warner Bros Games - Producer

Warner Bros Games

Burbank, California, United States (Hybrid)
1 Month ago
My Fitness Pal - Food Data Curator (Contractor)

My Fitness Pal

United States (Remote)
4 Weeks ago
Zones - Client Solutions Architect

Zones

Texas, United States (Remote)
2 Months ago
Interface AI - Chief Financial Officer

Interface AI

United States (Remote)
6 Days ago
Flow - Senior/Staff Backend Software Engineer

Flow

Miami, Florida, United States (Hybrid)
4 Months ago
Crunchyroll - Director of Engineering, Android

Crunchyroll

Culver City, California, United States (On-Site)
1 Week ago
Alphasense - Marketing Specialist, Influencer and Sponsorship Campaigns

Alphasense

Remote, Oregon, United States (Remote)
3 Weeks ago

Get notifed when new similar jobs are uploaded

Research & Development Jobs

NVIDIA - Senior Chip Design Methodologies Engineer

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
6 Days ago
Keywords Studios (Player Support) - Chercheurs Expérimentés d’Utilisateurs de Jeux

Keywords Studios (Player Support)

Montreal, Quebec, Canada (Hybrid)
3 Months ago
Tenstorrent - Staff Design Verification Engineer - CPU Subsystem

Tenstorrent

Bengaluru, Karnataka, India (On-Site)
5 Months ago
NVIDIA - Senior ASIC Verification Engineer - GPU Memory Subsystem

NVIDIA

Durham, North Carolina, United States (On-Site)
1 Month ago
ByteDance - CPU Application Platform Engineer Intern (Server Platform)

ByteDance

San Jose, California, United States (On-Site)
5 Days ago
ByteDance - Student Researcher (Doubao (Seed) - Machine Learning System) - 2025 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
ByteDance - Research Scientist in Foundation Model, Speech Understanding - 2024 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
Intel Corporation - Pre-Silicon SoC Design and Verification Technologist

Intel Corporation

Folsom, California, United States (Hybrid)
2 Months ago
Meta - Software Engineer, Machine Learning

Meta

Mountain View, California, United States (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.


Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Shenzhen, Guangdong Province, China (On-Site)

Bengaluru, Karnataka, India (On-Site)

Taipei City, Taiwan (On-Site)

Taipei City, Taiwan (On-Site)

Shanghai, Shanghai, China (On-Site)

Shanghai, Shanghai, China (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug