High-Performance LLM Training Engineer - New College Grad 2025

1 Month ago • All levels • Research & Development • Full Stack Development • Artificial Intelligence • $104,000 PA - $189,750 PA

Job Summary

Job Description

NVIDIA seeks a High-Performance LLM Training Engineer to optimize LLM training workloads on thousands of GPUs. Responsibilities include analyzing, profiling, and optimizing AI training workloads; implementing production-quality software across NVIDIA's deep learning platform; contributing to the MLPerf Training benchmark; implementing DL workloads in NVIDIA's simulators; and building automation tools. The role requires expertise in deep learning, neural networks, computer architecture, and GPU architecture, along with proficiency in C++, Python, and CUDA. This position is crucial in shaping hardware roadmaps for future GPU generations and improving the efficiency of LLM training.
Must have:
  • Deep learning & neural network expertise
  • Computer architecture & GPU architecture knowledge
  • Performance analysis & tuning experience
  • C++, Python, CUDA programming skills
Perks:
  • Highly competitive salary
  • Comprehensive benefits package
  • Collaboration with leading experts
  • Creative and autonomous work environment

Job Details

We are now looking for a High-Performance LLM Training Engineer!

NVIDIA is seeking experienced engineers specializing in performance analysis and optimization to improve the efficiency of LLM training workloads, which are shaping the world's most advanced computing systems. This position focuses on optimizing NVIDIA’s high-performance LLM software stack in frameworks like PyTorch and JAX for high-performance training on thousands of GPUs, while also helping shape hardware roadmaps for the next generation of GPUs powering the AI revolution.

What you will be doing:

  • Understand, analyze, profile, and optimize AI training workloads on innovative hardware and software platforms.

  • Understand the big picture of training performance on GPUs, prioritizing and then solving problems across all state-of-the-art neural networks.

  • Implement production-quality software in multiple layers of NVIDIA's deep learning platform stack, from drivers to DL frameworks.

  • Build and support NVIDIA submissions to the MLPerf Training benchmark suite.

  • Implement key DL training workloads in NVIDIA's proprietary processor and system simulators to enable future architecture studies.

  • Build tools to automate workload analysis, workload optimization, and other critical workflows.

What we want to see:

  • MS or PhD in Computer Science, Electrical Engineering or Computer Engineering (or equivalent experience).

  • Strong background in deep learning and neural networks, in particular training.

  • A deep background in computer architecture and familiarity with the fundamentals of GPU architecture.

  • Proven experience analyzing and tuning application performance & processor and system-level performance modeling.

  • Programming skills in C++, Python, and CUDA.

GPU computing is the most productive and pervasive platform for deep learning and AI. It begins with the most advanced GPUs and the systems and software we build on top of them. We integrate and optimize every deep learning framework. We work with the major systems companies and every major cloud service provider to make GPUs available in data centers and in the cloud. We craft computers and software to bring AI to edge devices, such as self-driving cars and autonomous robots. AI has the potential to spur a wave of social progress unmatched since the industrial revolution.

Widely considered to be one of tech's most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. Additionally, this opportunity offers you the ability to collaborate with some of the most forward-thinking and hard-working people in the world, shaping the future of AI in a creative and autonomous work environment that encourages innovation. If you're excited to work across the full hardware & software stack—from GPU architecture to application code—to achieve optimal performance, we want to hear from you!

The base salary range is 104,000 USD - 189,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

Canva - Senior Computer Vision Engineer - Photo AI

Canva

London, England, United Kingdom (Remote)
3 Months ago
Canva - Senior Computer Vision Engineer - Photo AI

Canva

Vienna, Vienna, Austria (Remote)
1 Month ago
NVIDIA - Solution Architect - CSP Cloud

NVIDIA

Beijing, Beijing, China (On-Site)
3 Months ago
Playrix - Game Director

Playrix

Georgia (Remote)
6 Months ago
G5 Games - 2D UI/UX Artist (Hidden objects project)

G5 Games

Tbilisi, Tbilisi, Georgia (Remote)
3 Months ago
Regent Craft - Embedded Software Engineering Intern

Regent Craft

North Kingstown, Rhode Island, United States (On-Site)
6 Months ago
Cirrus Logic - Manager, Design Engineering (MMS-64000105)

Cirrus Logic

Edinburgh, Scotland, United Kingdom (Hybrid)
6 Months ago
NVIDIA - Senior System Software Engineer, GPU Firmware

NVIDIA

Bengaluru, Karnataka, India (On-Site)
3 Months ago
NVIDIA - Senior Thermal Design Engineer

NVIDIA

Shanghai, Shanghai, China (On-Site)
1 Month ago
NVIDIA - Senior System Software Engineer

NVIDIA

Pune, Maharashtra, India (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Trendyol - Data Science Team Lead - Dolap

Trendyol

İstanbul, İstanbul, Türkiye (Hybrid)
4 Months ago
G5 Games - 2D Illustrator (HOG project)

G5 Games

Tbilisi, Tbilisi, Georgia (Remote)
1 Month ago
G5 Games - 2D Illustrator (HOG project)

G5 Games

Yerevan, Yerevan, Armenia (Remote)
1 Month ago
ByteDance - Software Engineer, Inference

ByteDance

Seattle, Washington, United States (On-Site)
6 Months ago
Trend Micro - NLP / Prompt Engineer (VicOne_Automotive Security)

Trend Micro

Taipei City, Taiwan (On-Site)
7 Months ago
Playrix - Game Designer

Playrix

Ireland (Remote)
6 Months ago
Playrix - Feature Owner (LiveOps)

Playrix

Cyprus (Remote)
6 Months ago
Canva - Senior Computer Vision Engineer - Photo AI

Canva

Prague, Czechia (Remote)
3 Months ago
Playtika - Experienced Data Scientist

Playtika

Israel (On-Site)
2 Months ago
NVIDIA - Senior ASIC Verification Engineer, Coherent High Speed Interconnect

NVIDIA

Taipei City, Taiwan (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Santa Clara, California, United States

Probably Monsters - Principal Player Combat & Gameplay Designer

Probably Monsters

Bellevue, Washington, United States (On-Site)
5 Months ago
Twitch - Sr. Applied Scientist

Twitch

San Francisco, California, United States (On-Site)
1 Month ago
Epic Games - IAM Director

Epic Games

Cary, North Carolina, United States (On-Site)
2 Months ago
Framestore - FREELANCE: NUKE - CHICAGO

Framestore

Chicago, Illinois, United States (On-Site)
11 Months ago
ByteDance - Site Reliability Engineer (Cloud Native Platform) - Traffic Infrastructure

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
PlayStation Global - Producer (Contract)

PlayStation Global

Los Angeles, California, United States (On-Site)
1 Month ago
Joyride Games - VP Marketing

Joyride Games

Austin, Texas, United States (Remote)
1 Year ago
ION - Senior Linux Systems Administrator - Trumbull, CT

ION

Trumbull, Connecticut, United States (Hybrid)
6 Months ago
Zoox - Senior Technical Program Manager, Milestone Execution

Zoox

Foster City, California, United States (On-Site)
6 Months ago
GungHo Online Entertainment America,  Inc  - Rigger/Character Technical Artist

GungHo Online Entertainment America, Inc

California, United States (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

Research & Development Jobs

ByteDance - Software Engineer, Inference

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
Riot Games - Staff Software Engineer (Game UI) - Teamfight Tactics

Riot Games

Los Angeles, California, United States (On-Site)
2 Months ago
Riot Games - Manager, Insights - Central User Research

Riot Games

Los Angeles, California, United States (On-Site)
1 Month ago
NVIDIA - Manager, Software Engineering

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
2 Months ago
Riot Games - Manager, Software Engineering - Unreal Ecosystem (UnEco)

Riot Games

Los Angeles, California, United States (On-Site)
2 Months ago
Netflix - Research Engineer L4/L5 -LLMs for Search, Recommendations, and Personalization

Netflix

Los Gatos, California, United States (On-Site)
6 Months ago
NVIDIA - Senior System Software Engineer - Autonomous Driving

NVIDIA

Shanghai, Shanghai, China (On-Site)
3 Months ago
Valeo - Site Management Controller

Valeo

Chennai, Tamil Nadu, India (On-Site)
6 Months ago
Krafton  - [Publishing Platform Div.] Sr. Web Front-End Developer (5년 이상)

Krafton

Seoul, South Korea (On-Site)
5 Months ago
Samsung Semiconductor - Staff Engineer, Embedded Firmware

Samsung Semiconductor

San Jose, California, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Massachusetts, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Texas, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Austin, Texas, United States (Remote)

Santa Clara, California, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug