Senior Software Engineer - Automated Parallel Programming

3 Months ago • 4 Years + • Research & Development • Artificial Intelligence • $184,000 PA - $287,500 PA

Job Summary

Job Description

NVIDIA's PyTorch team seeks a Senior Software Engineer to design and build tools for AI practitioners. Responsibilities include crafting code generation systems to accelerate machine learning frameworks, optimizing GPU performance in PyTorch, building production AI solutions, and collaborating with internal and external teams. The role involves working with NVIDIA's nvFuser (parallel programming tool), influencing the software stack (CUDA compiler, Lightning-Thunder Graph Compiler), and advising on future hardware designs. The ideal candidate will have experience with parallel programming (CUDA or similar), C++, large software projects, and strong communication skills. Experience with deep learning compilers, distributed parallelism, and Large Language Models is a plus.
Must have:
  • MS/PhD in CS/related field or equiv. exp.
  • 4+ years C++ experience
  • Parallel programming (CUDA)
  • Large software project experience
  • Excellent communication skills
Good to have:
  • CPU/GPU architecture knowledge
  • Deep learning compiler expertise
  • Optimized distributed parallelism
  • LLM parallelization
  • Heuristic generation (cost models, ML, auto-tuning)
  • Contributions to PyTorch, NumPy, etc.
Perks:
  • Equity
  • Benefits

Job Details

The PyTorch Team @ NVIDIA is hiring passionate parallel programmers. Join us to design and build the tools used by millions of AI practitioners deploying AI applications scalable to thousands of GPUs. Our team is responsible for the continual delivery of best in class experience on NVIDIA's hardware with PyTorch. Join our team and collaborate with many multi-disciplinary engineering teams within NVIDIA and internationally in the PyTorch open source community to deliver our customers the best of NVIDIA software.

In this position you will learn innovative techniques from NVIDIA's domain experts for efficiently programming the world's most sophisticated computer systems. Build these techniques into NVIDIA/Fuser (commonly known as "nvFuser") applying our groundbreaking Parallel Programming Theory, allowing these optimization techniques to be applied to algorithms broadly, automatically, and safely to algorithms written in Numpy and PyTorch. Beyond building nvFuser influence and improve the entire software stack all the way from users to the CUDA compiler, to the Lightning-Thunder Graph Compiler, as well as influence the future design of NVIDIA's hardware platform. Join our ambitious and diverse team who strive to lead the best in AI programming.

What you will be doing:

  • Crafting a code generation system to accelerate portions of a graph collected from a machine learning framework.

  • Partnering with NVIDIA’s hardware and software teams to improve GPU performance in PyTorch.

  • Design, build and support production AI solutions used by enterprise customers and partners.

  • Optimize the performance of influential, modern Deep Learning models coming out of academic and industry research, for NVIDIA GPUs and systems.

  • Collaborating with internal applied researchers to improve their AI tools.

  • Advise design of new hardware generations.

What we need to see:

  • MS or PhD Computer Science, Computer Engineering, Electrical Engineering or a related field (or equivalent experience).

  • Parallel programming experience with writing optimized kernels in the NVIDIA CUDA Programming Language or similar parallel languages

  • 4+ years of experience with C++ programming.

  • Demonstrated experience developing large software projects.

  • We require excellent verbal and written communication skills.

Ways to stand out from the crowd:

  • Proven technical foundation in CPU and GPU architectures, numeric libraries, modular software design.

  • A background in deep learning compilers or compiler infrastructure

  • Expertise with optimized distributed parallelism techniques and it's a bonus if that includes parallelizing Large Language Models!

  • Knowledge of heuristic generation that employs cost models, machine learning, or auto-tuning.

  • Contributions to PyTorch, Numpy, JAX, TensorFlow, OpenAI-Triton, Lightning Thunder, TVM, Halide or similar system.

The base salary range is 184,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

bytedance - Machine Learning Engineer - Inference

bytedance

San Jose, California, United States (On-Site)
2 Months ago
bytedance - Algorithm Engineer - Audio Understanding - Start 2025

bytedance

Singapore (On-Site)
7 Months ago
bytedance - Research Scientist (Machine Learning for Science (AI-for-Science))

bytedance

Seattle, Washington, United States (On-Site)
1 Month ago
C10 Labs - AI Fellow- BioTech and Life Sciences

C10 Labs

Cambridge, Massachusetts, United States (Hybrid)
1 Month ago
NVIDIA - Senior AI-HPC Storage Engineer

NVIDIA

Santa Clara, California, United States (On-Site)
4 Months ago
Riot Games - Senior Technical Producer, League Studios - Build Test Ship

Riot Games

Los Angeles, California, United States (On-Site)
7 Months ago
NVIDIA - System Software Engineer, GPU Tools Development

NVIDIA

Bengaluru, Karnataka, India (Hybrid)
2 Months ago
Google - Lead CPU Design Verification Engineer, Silicon

Google

Austin, Texas, United States (On-Site)
1 Month ago
NVIDIA - Senior System Software Engineer, Robotics Simulation

NVIDIA

Santa Clara, California, United States (Hybrid)
2 Months ago
bytedance - Student Researcher (Doubao (Seed) - Foundation Model - Speech Understanding) - 2025 Start (PhD)

bytedance

San Jose, California, United States (On-Site)
7 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Qualcomm - Senior ML Compiler Engineer

Qualcomm

Hyderabad, Telangana, India (On-Site)
1 Week ago
Reddit - Senior Machine Learning Manager - Ads Retrieval

Reddit

British Columbia, Canada (Remote)
2 Weeks ago
attentive - Staff Machine Learning Engineer

attentive

San Francisco, California, United States (Hybrid)
7 Months ago
Blizzard Entertainment - Senior Data Scientist, Computer Graphics

Blizzard Entertainment

Irvine, California, United States (On-Site)
6 Months ago
bytedance - Software Engineer, Model Inference

bytedance

Seattle, Washington, United States (On-Site)
2 Months ago
AI Fund - ML Engineer

AI Fund

San Francisco, California, United States (On-Site)
1 Month ago
Twitch - Software Development Engineer - Safety ML

Twitch

San Francisco, California, United States (On-Site)
2 Months ago
JDA - Sr. Data Scientist I (ML, Python, Tensorflow)

JDA

Bengaluru, Karnataka, India (On-Site)
4 Days ago
CyberArk - Data Engineer

CyberArk

Israel (Hybrid)
1 Month ago
bytedance - Research Scientist, Foundation Model, Vision

bytedance

Singapore (On-Site)
7 Months ago

Get notifed when new similar jobs are uploaded

Jobs in North Carolina, United States

ZeniMax Media - Senior Narrative Animator

ZeniMax Media

Cockeysville, Maryland, United States (Remote)
2 Months ago
Crunchyroll - People Services Coordinator

Crunchyroll

Dallas, Texas, United States (On-Site)
4 Months ago
bytedance - Tech Lead - Architect / Researcher - DPU

bytedance

San Jose, California, United States (On-Site)
3 Months ago
Univision - Maintenance Engineer

Univision

Los Angeles, California, United States (On-Site)
1 Month ago
Backbone - Staff Accountant

Backbone

Atherton, California, United States (On-Site)
10 Months ago
GoMotive - Account Executive, Enterprise - Great Lakes

GoMotive

United States (Remote)
1 Month ago
Adyen - Enterprise Account Manager, Adyen for Platforms

Adyen

New York, United States (Hybrid)
2 Weeks ago
Spell Brush - Front-End Engineer (Anime)

Spell Brush

San Francisco, California, United States (On-Site)
2 Months ago
Riot Games - Sr. Manager, Publishing Product Management, 2XKO AMER

Riot Games

Los Angeles, California, United States (On-Site)
3 Weeks ago

Get notifed when new similar jobs are uploaded

Research & Development Jobs

NVIDIA - Physical Design Power Optimization Engineer

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
1 Month ago
bytedance - AR Optics Architect - Pico- San Jose

bytedance

San Jose, California, United States (On-Site)
5 Months ago
Google - ASIC Power Architect, Silicon

Google

New Taipei, New Taipei City, Taiwan (On-Site)
1 Month ago
NVIDIA - IO Validation Methodology Design Engineer

NVIDIA

Shanghai, Shanghai, China (On-Site)
2 Months ago
Riot Games - Principal Software Engineer, Gameplay - Teamfight Tactics

Riot Games

Dublin, County Dublin, Ireland (On-Site)
6 Months ago
Google - Silicon RTL Design Engineer, TPU, Google Cloud

Google

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Krafton - Korean-Japanese Interpreter/Translator (Contract)

Krafton

Seoul, South Korea (On-Site)
2 Months ago
bytedance - Research Scientist, Reinforcement Learning

bytedance

Seattle, Washington, United States (On-Site)
7 Months ago
Google - Software Engineer, People with Disabilities

Google

São Paulo, State Of São Paulo, Brazil (On-Site)
7 Months ago
NVIDIA - Senior Digital Design Verification Engineer - Hardware

NVIDIA

Canada (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

Santa Clara, California, United States (On-Site)

Massachusetts, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Texas, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (Hybrid)

Pune, Maharashtra, India (On-Site)

Taipei City, Taiwan (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug