Senior High-Performance LLM Training Engineer

1 Month ago • 5-13 Years • Full Stack Development • Research & Development • Artificial Intelligence • $184,000 PA - $356,500 PA

Job Summary

Job Description

NVIDIA seeks a Senior High-Performance LLM Training Engineer to optimize the efficiency of Large Language Model (LLM) training workloads. Responsibilities include performance analysis, optimization of LLM software stacks (PyTorch, JAX) on thousands of GPUs, contributing to hardware roadmaps, implementing production-quality software, supporting MLPerf Training benchmark submissions, and building automation tools. This role demands expertise in deep learning, neural networks, computer architecture, and GPU architecture, along with programming skills in C++, Python, and CUDA. The engineer will work across the full hardware & software stack to achieve optimal performance for AI training.
Must have:
  • PhD/MS in CS/EE/CE and relevant experience
  • Deep learning & neural network expertise
  • GPU architecture knowledge
  • Performance analysis & tuning
  • C++, Python, CUDA programming
Perks:
  • Highly competitive salary
  • Comprehensive benefits package
  • Collaboration with leading experts
  • Innovative work environment

Job Details

We are now looking for a Senior High-Performance LLM Training Engineer!

NVIDIA is seeking experienced engineers specializing in performance analysis and optimization to improve the efficiency of LLM training workloads, which are shaping the world's most advanced computing systems. This position focuses on optimizing NVIDIA’s LLM software stack in frameworks like PyTorch and JAX for high-performance training on thousands of GPUs, while also helping shape hardware roadmaps for the next generation of GPUs powering the AI revolution.

What you will be doing:

  • Understand, analyze, profile, and optimize AI training workloads on innovative hardware and software platforms.

  • Understand the big picture of training performance on GPUs, prioritizing and then solving problems across all state-of-the-art neural networks.

  • Implement production-quality software in multiple layers of NVIDIA's deep learning platform stack, from drivers to DL frameworks.

  • Build and support NVIDIA submissions to the MLPerf Training benchmark suite.

  • Implement key DL training workloads in NVIDIA's proprietary processor and system simulators to enable future architecture studies.

  • Build tools to automate workload analysis, workload optimization, and other critical workflows.

What we want to see:

  • PhD in Computer Science, Electrical Engineering, or Computer Engineering and 5+ years of meaningful work experience; or MS (or equivalent experience) and 8+ years of meaningful work experience.

  • Strong background in deep learning and neural networks, in particular training.

  • A deep background in computer architecture and familiarity with the fundamentals of GPU architecture.

  • Proven experience in analyzing and tuning application performance, and in processor and system-level performance modeling.

  • Programming skills in C++, Python, and CUDA.

GPU computing is the most productive and pervasive platform for deep learning and AI. It begins with the most advanced GPUs and the systems and software we build on top of them. We integrate and optimize every deep learning framework. We work with the major systems companies and every major cloud service provider to make GPUs available in data centers and in the cloud. We craft computers and software to bring AI to edge devices, such as self-driving cars and autonomous robots. AI has the potential to spur a wave of social progress unmatched since the industrial revolution.

Widely considered to be one of tech's most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. Additionally, this opportunity offers you the ability to collaborate with some of the most forward-thinking and hard-working people in the world, shaping the future of AI in a creative and autonomous work environment that encourages innovation. If you're excited to work across the full hardware & software stack—from GPU architecture to application code—to achieve optimal performance, we want to hear from you!

#LI-Hybrid

The base salary range is 184,000 USD - 356,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.


