Senior High-Performance LLM Training Engineer

4 Months ago • 8-10 Years • Software Development & Engineering • $184,000 PA - $356,500 PA

Job Summary

Job Description

NVIDIA seeks a Senior High-Performance LLM Training Engineer to optimize LLM training workloads on thousands of GPUs. Responsibilities include performance analysis, optimization of AI training on innovative hardware and software platforms (PyTorch, JAX), implementing production-quality software across NVIDIA's deep learning platform, contributing to the MLPerf Training benchmark, and building tools for workload analysis. The role involves working with cutting-edge neural networks and shaping hardware roadmaps for next-generation GPUs. This position requires deep learning, computer architecture, and programming expertise (C++, Python, CUDA).
Must have:
  • PhD/MS in CS/EE/CE & relevant experience
  • Deep learning & neural network expertise
  • GPU architecture knowledge
  • Performance analysis & tuning skills
  • C++, Python, CUDA programming
Perks:
  • Highly competitive salary
  • Comprehensive benefits package
  • Collaboration with top talent
  • Innovative work environment

Job Details

We are now looking for a Senior High-Performance LLM Training Engineer!

NVIDIA is seeking experienced engineers specializing in performance analysis and optimization to improve the efficiency of LLM training workloads, which are shaping the world's most advanced computing systems. This position focuses on optimizing NVIDIA’s high-performance LLM software stack in frameworks like PyTorch and JAX for high-performance training on thousands of GPUs, while also helping shape hardware roadmaps for the next generation of GPUs powering the AI revolution.

What you will be doing:

  • Understand, analyze, profile, and optimize AI training workloads on innovative hardware and software platforms.

  • Understand the big picture of training performance on GPUs, prioritizing and then solving problems across all state-of-the-art neural networks.

  • Implement production-quality software in multiple layers of NVIDIA's deep learning platform stack, from drivers to DL frameworks.

  • Build and support NVIDIA submissions to the MLPerf Training benchmark suite.

  • Implement key DL training workloads in NVIDIA's proprietary processor and system simulators to enable future architecture studies.

  • Build tools to automate workload analysis, workload optimization, and other critical workflows.

What we want to see:

  • PhD in Computer Science, Electrical Engineering or Computer Engineering and 5+ years; or MS (or equivalent experience) and 8+ years of meaningful work experience.

  • Strong background in deep learning and neural networks, in particular training.

  • A deep background in computer architecture and familiarity with the fundamentals of GPU architecture.

  • Proven experience analyzing and tuning application performance & processor and system-level performance modelling.

  • Programming skills in C++, Python, and CUDA.

GPU computing is the most productive and pervasive platform for deep learning and AI. It begins with the most advanced GPUs and the systems and software we build on top of them. We integrate and optimize every deep learning framework. We work with the major systems companies and every major cloud service provider to make GPUs available in data centers and in the cloud. We craft computers and software to bring AI to edge devices, such as self-driving cars and autonomous robots. AI has the potential to spur a wave of social progress unmatched since the industrial revolution.

Widely considered to be one of tech's most desirable employers, NVIDIA offers highly competitive salaries and a comprehensive benefits package. Additionally, this opportunity offers you the ability to collaborate with some of the most forward-thinking and hard-working people in the world, shaping the future of AI in a creative and autonomous work environment that encourages innovation. If you're excited to work across the full hardware & software stack—from GPU architecture to application code—to achieve optimal performance, we want to hear from you!

#LI-Hybrid

The base salary range is 184,000 USD - 356,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

Rackspace Technology - Marketing Operations Analyst II

Rackspace Technology

Gurugram, Haryana, India (Remote)
4 Months ago
Onward Search - Senior Recruiter – In House

Onward Search

Santa Monica, California, United States (Hybrid)
5 Months ago
kaizen gaming  - Operations Data Specialist

kaizen gaming

São Paulo, Brazil (On-Site)
1 Month ago
Varonis  - Manager of Customer Success

Varonis

United States (On-Site)
3 Months ago
Ously games - Career

Ously games

Frankfurt Am Main, Hessen, Germany (On-Site)
4 Weeks ago
Silicon Labs - Engineering Manager - MCU Software Development

Silicon Labs

Hyderabad, Telangana, India (On-Site)
3 Weeks ago
Survay Monkey - Senior Software Engineer II

Survay Monkey

Bengaluru, Karnataka, India (Hybrid)
2 Months ago
Zeeco, Inc. - Controls Engineer

Zeeco, Inc.

Tulsa, Oklahoma, United States (On-Site)
1 Week ago
Samsung Semiconductor - Senior Engineer, DRAM Applications

Samsung Semiconductor

San Jose, California, United States (On-Site)
2 Months ago
Axon - Technical Support Engineer

Axon

Kassel, Hessen, Germany (On-Site)
4 Weeks ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Playtika - VIP Account Management Team Leader

Playtika

Romania (Hybrid)
9 Months ago
miniclip - Communications & Employer Branding Senior Lead

miniclip

United Kingdom (On-Site)
4 Weeks ago
Hitachi - D365 F&O Functional Consultant (Fin, Ops and T&L)

Hitachi

Pune, Maharashtra, India (On-Site)
9 Months ago
Teradata - Senior Product Manager

Teradata

Pune, Maharashtra, India (On-Site)
9 Months ago
Betson Group - Senior Affiliate Manager

Betson Group

Malta (On-Site)
3 Weeks ago
Paradox Interactive - Chinese Community Ambassador

Paradox Interactive

Stockholm, Stockholm County, Sweden (Remote)
1 Month ago
WebFX - Remote Copywriter: Agriculture/Environment/Eco Living

WebFX

Philippines (Remote)
9 Months ago
Ziff Davis - Product Marketing Specialist - B2B

Ziff Davis

Dublin, County Dublin, Ireland (Remote)
2 Months ago
Autodesk - Partner Principal Solutions Executive | AEC

Autodesk

Bengaluru, Karnataka, India (On-Site)
1 Month ago
eBay - Producer, ebay Live

eBay

Kleinmachnow, Brandenburg, Germany (Hybrid)
1 Week ago

Get notifed when new similar jobs are uploaded

Jobs in Santa Clara, California, United States

Sonar Source - Sales Development Representative

Sonar Source

Austin, Texas, United States (Hybrid)
1 Year ago
Apple - AIML - Machine Learning Engineer, Answers, Knowledge & Intelligence (AKI)

Apple

Santa Clara, California, United States (On-Site)
1 Month ago
Square - Process Analyst

Square

Lisle, Illinois, United States (Hybrid)
1 Week ago
Playstation - IT Coordinator

Playstation

Santa Monica, California, United States (On-Site)
1 Month ago
Qualcomm - Senior Embedded Software Development Engineer

Qualcomm

Boulder, Colorado, United States (On-Site)
1 Month ago
Snap Mobile INC - Account Executive

Snap Mobile INC

St. Cloud, Minnesota, United States (On-Site)
3 Months ago
HCL Tech - Design Lead - Design for Manufacturability (DFM)

HCL Tech

New York, United States (On-Site)
2 Months ago
bytedance - Senior Software Engineer, Multi Cloud CDN

bytedance

Boston, Massachusetts, United States (On-Site)
1 Week ago
Expedia - Senior Manager, Product Management (CRM - Customer Relationship Management Products)

Expedia

Austin, Texas, United States (On-Site)
3 Weeks ago
WebMD - Senior Program Manager (Salesforce CPQ & Go-to-Market Operations)

WebMD

Yardley, Pennsylvania, United States (Hybrid)
2 Months ago

Get notifed when new similar jobs are uploaded

Software Development & Engineering Jobs

Merqube - Quant Analyst (Financial Engineer)

Merqube

Bengaluru, Karnataka, India (Hybrid)
2 Months ago
PwC - Philippines - SAP Finance Senior Consultant

PwC

Makati City, Metro Manila, Philippines (On-Site)
2 Weeks ago
rivos - Thermal/Mechanical Engineer

rivos

Santa Clara, California, United States (Hybrid)
5 Months ago
Regent craft - Senior Electrical Engineer

Regent craft

North Kingstown, Rhode Island, United States (On-Site)
2 Weeks ago
Regent craft - Senior Perception Software Engineer - Sensor Fusion

Regent craft

North Kingstown, Rhode Island, United States (On-Site)
1 Month ago
WebTech Corporation - Method Engineer (Mechanical) - Intern

WebTech Corporation

Wrocław, Lower Silesian Voivodeship, Poland (On-Site)
3 Months ago
kaizen gaming  - Analytics Engineering Team Lead

kaizen gaming

Thessaloniki, Greece (On-Site)
1 Month ago
Adtran - Senior Software Engineer

Adtran

Meiningen, Thuringia, Germany (Hybrid)
2 Weeks ago
bytedance - Software Development Engineer Graduate (SDN Traffic Intelligence & Control) - 2025 Start (PhD)

bytedance

San Jose, California, United States (On-Site)
9 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

Taipei City, Taiwan (On-Site)

Beijing, Beijing, China (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Bengaluru, Karnataka, India (Hybrid)

Yokne'am Illit, North District, Israel (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Dubai, Dubai, United Arab Emirates (On-Site)

Beijing, Beijing, China (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug