Senior High-Performance LLM Training Engineer

1 Month ago • 5-13 Years • Full Stack Development • Research & Development • Artificial Intelligence • $184,000 PA - $356,500 PA

Job Summary

Job Description

NVIDIA seeks a Senior High-Performance LLM Training Engineer to optimize the efficiency of Large Language Model (LLM) training workloads. Responsibilities include performance analysis, optimization of LLM software stacks (PyTorch, JAX) on thousands of GPUs, contributing to hardware roadmaps, implementing production-quality software, supporting MLPerf Training benchmark submissions, and building automation tools. This role demands expertise in deep learning, neural networks, computer architecture, and GPU architecture, along with programming skills in C++, Python, and CUDA. The engineer will work across the full hardware & software stack to achieve optimal performance for AI training.
Must have:
  • PhD/MS in CS/EE/CE and relevant experience
  • Deep learning & neural network expertise
  • GPU architecture knowledge
  • Performance analysis & tuning
  • C++, Python, CUDA programming
Perks:
  • Highly competitive salary
  • Comprehensive benefits package
  • Collaboration with leading experts
  • Innovative work environment

Job Details

We are now looking for a Senior High-Performance LLM Training Engineer!

NVIDIA is seeking experienced engineers specializing in performance analysis and optimization to improve the efficiency of LLM training workloads, which are shaping the world's most advanced computing systems. This position focuses on optimizing NVIDIA’s high-performance LLM software stack in frameworks like PyTorch and JAX for high-performance training on thousands of GPUs, while also helping shape hardware roadmaps for the next generation of GPUs powering the AI revolution.

What you will be doing:

  • Understand, analyze, profile, and optimize AI training workloads on innovative hardware and software platforms.

  • Understand the big picture of training performance on GPUs, prioritizing and then solving problems across all state-of-the-art neural networks.

  • Implement production-quality software in multiple layers of NVIDIA's deep learning platform stack, from drivers to DL frameworks.

  • Build and support NVIDIA submissions to the MLPerf Training benchmark suite.

  • Implement key DL training workloads in NVIDIA's proprietary processor and system simulators to enable future architecture studies.

  • Build tools to automate workload analysis, workload optimization, and other critical workflows.

What we want to see:

  • PhD in Computer Science, Electrical Engineering or Computer Engineering and 5+ years; or MS (or or equivalent experience) and 8+ years of meaningful work experience.

  • Strong background in deep learning and neural networks, in particular training.

  • A deep background in computer architecture and familiarity with the fundamentals of GPU architecture.

  • Proven experience analyzing and tuning application performance & processor and system-level performance modelling.

  • Programming skills in C++, Python, and CUDA.

GPU computing is the most productive and pervasive platform for deep learning and AI. It begins with the most advanced GPUs and the systems and software we build on top of them. We integrate and optimize every deep learning framework. We work with the major systems companies and every major cloud service provider to make GPUs available in data centers and in the cloud. We craft computers and software to bring AI to edge devices, such as self-driving cars and autonomous robots. AI has the potential to spur a wave of social progress unmatched since the industrial revolution.

Widely considered to be one of tech's most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. Additionally, this opportunity offers you the ability to collaborate with some of the most forward-thinking and hard-working people in the world, shaping the future of AI in a creative and autonomous work environment that encourages innovation. If you're excited to work across the full hardware & software stack—from GPU architecture to application code—to achieve optimal performance, we want to hear from you!

#LI-Hybrid

The base salary range is 184,000 USD - 356,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

PwC - IN-Manager_ Advanced Analytics & ML _D&A_Advisory_Gurgaon

PwC

Gurugram, Haryana, India (On-Site)
3 Months ago
Netflix - Machine Learning Engineer

Netflix

United States (Remote)
1 Month ago
G5 Games - 2D HOG Grind Artist

G5 Games

(Remote)
1 Week ago
NVIDIA - Senior Hypervisor and RTOS Engineer

NVIDIA

Santa Clara, California, United States (On-Site)
1 Month ago
G5 Games - 2D UI/UX Artist (match-3 project)

G5 Games

Tbilisi, Tbilisi, Georgia (Remote)
4 Months ago
FIS Global - Lead Engineer – Development- (Java, Angular)

FIS Global

Pune, Maharashtra, India (Hybrid)
4 Months ago
Zuru - Python Backend Software Engineer

Zuru

Modena, Emilia-Romagna, Italy (Hybrid)
4 Months ago
Electronic Arts - Software Engineer II

Electronic Arts

Hyderabad, Telangana, India (Hybrid)
2 Months ago
Nielsen Holdings - STAFF SOFTWARE ENGINEER

Nielsen Holdings

Bengaluru, Karnataka, India (Hybrid)
4 Months ago
CloudHire - Scala API Architect

CloudHire

Bengaluru, Karnataka, India (Remote)
4 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Tango Eye - Sr. Computer Vision Developer

Tango Eye

Chennai, Tamil Nadu, India (On-Site)
6 Months ago
Ello - Tech Lead, Machine Learning

Ello

San Francisco, California, United States (On-Site)
3 Months ago
Microsoft - Senior Machine Learning Research Scientist

Microsoft

Cambridge, England, United Kingdom (On-Site)
1 Month ago
Trendyol - Data Science Team Lead - Dolap

Trendyol

İstanbul, İstanbul, Türkiye (Hybrid)
1 Month ago
Luxoft - Senior ML Engineer

Luxoft

Poland, Ohio, United States (Remote)
1 Month ago
Playrix - Game Designer

Playrix

Armenia (Remote)
3 Months ago
ByteDance - Engineering Manager - Applied Machine Learning Algorithm

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
ByteDance - Machine Learning Engineer - Global Payment - 2025 Start

ByteDance

Singapore (On-Site)
1 Month ago
Optum - Data Scientist

Optum

Noida, Uttar Pradesh, India (On-Site)
4 Months ago
Sinch - Senior Machine Learning Engineer

Sinch

Flanders, Belgium (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Santa Clara, California, United States

Epic Games - Senior UI Artist

Epic Games

Bellevue, Washington, United States (On-Site)
2 Weeks ago
Inkittt - AI Video Producer

Inkittt

San Francisco, California, United States (On-Site)
6 Months ago
Epic Games - Senior Tools Programmer, UEFN

Epic Games

Cary, North Carolina, United States (On-Site)
2 Weeks ago
Meta - Network Production Engineer, Network Infrastructure

Meta

Austin, Texas, United States (On-Site)
3 Months ago
Inworld AI - Senior Unreal Engine Developer - USA

Inworld AI

Mountain View, California, United States (Remote)
3 Months ago
Mattel  Inc  - American Girl Lead Merchandise Handler

Mattel Inc

Tennessee, United States (On-Site)
1 Week ago
Epic Games - Gameplay Systems Engineer Intern

Epic Games

Cary, North Carolina, United States (On-Site)
1 Month ago
Lionsgate Games - Coordinator, Research & Digital Insights

Lionsgate Games

Santa Monica, California, United States (On-Site)
1 Month ago
NVIDIA - Senior System Software Engineer - Tegra

NVIDIA

Santa Clara, California, United States (On-Site)
1 Month ago
Microsoft - Research Intern - Undergraduate

Microsoft

Redmond, Washington, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Full Stack Development Jobs

Nagarro - Associate Principal Engineer, Java Fullstack

Nagarro

India (Remote)
4 Months ago
Magic Media - Python Automation Engineer

Magic Media

State Of Rio De Janeiro, Brazil (Remote)
2 Months ago
Google - Staff Software Engineer, Geo

Google

Seattle, Washington, United States (On-Site)
1 Month ago
Info Stretch - Senior .NET Developer

Info Stretch

Mechanicsburg, Pennsylvania, United States (On-Site)
1 Month ago
Meta - Production Engineering

Meta

Fremont, California, United States (On-Site)
3 Months ago
Gamemano - Sr. Frontend Developer

Gamemano

Noida, Uttar Pradesh, India (On-Site)
5 Months ago
Gaming Innovation Group  - Java Tech Lead

Gaming Innovation Group

Community Of Madrid, Spain (Remote)
1 Day ago
Vigaet - Internship-Backend Developer

Vigaet

Bengaluru, Karnataka, India (On-Site)
3 Months ago
Bazaar Voice - Staff Software Engineer - Full Stack, R6542

Bazaar Voice

Bengaluru, Karnataka, India (Hybrid)
4 Months ago
Nagarro - Principal Engineer, Java Fullstack

Nagarro

Mumbai, Maharashtra, India (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.


Yokne'am Illit, North District, Israel (On-Site)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (On-Site)

United States (Remote)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Bengaluru, Karnataka, India (Hybrid)

Bengaluru, Karnataka, India (Hybrid)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug