Senior GPU Kernel Performance Lead

3 Months ago • 8 Years + • Research & Development • $224,000 PA - $425,500 PA

Job Summary

Job Description

NVIDIA seeks a Senior GPU Kernel Performance Lead to oversee performance validation efforts for high-performance GPU math kernels used in cuDNN, cuBLAS, and TensorRT libraries. Responsibilities include specifying test cases from deep learning workloads, developing analytical performance models, tracking performance throughout the development lifecycle, and providing feedback to kernel developers. The role involves leading a team and requires strong C++ programming skills, experience with performance analysis tools, and a PhD in a related field (or equivalent experience). This position offers the opportunity for hands-on development while primarily focusing on performance leadership and validation.
Must have:
  • PhD in CS/related field or equiv. exp.
  • 8+ years relevant industry experience
  • Strong C++ programming & software design
  • Experience leading performance teams
  • Performance analysis & test design skills
Good to have:
  • Experience with analytical models and cycle-accurate HW simulators
  • Knowledge of performance tools (Nsight, VTune)
  • Programming experience beyond C++ (assembly, MLIR/LLVM, Python, CUDA/OpenCL)
Perks:
  • Equity
  • Benefits

Job Details

We're now looking for a Senior GPU Kernel Performance Lead. Do you enjoy analyzing and reporting on GPU kernel performance? If so, consider applying for the role of Senior GPU Kernel Performance Analysis Lead! Our team delivers high-performance GPU math kernels to NVIDIA’s cuDNN, cuBLAS, and TensorRT libraries to accelerate deep learning models. The team is proud to play an integral part in enabling breakthroughs in domains such as image classification, speech recognition, natural language processing, and large language models. We’re always striving for peak performance and energy efficiency on current and future-generation GPUs.

As a kernel performance analysis lead, you will oversee all efforts pertaining to the performance of our kernels. Join the team that is building the underlying software used across the world to power the revolution in artificial intelligence! To get a sense of the code we write, check out our CUTLASS open-source project showcasing performant matrix multiply on NVIDIA’s Tensor Cores with CUDA. While there will be opportunities for hands-on development, this position is specifically for leading a team that validates the performance of the kernels.

What you’ll be doing:

  • Specify test cases, derived from Deep Learning workloads, to provide adequate directed and use-case coverage across all kernels on both simulation and silicon targets

  • Determine performance theory through the development and use of analytical models

  • Track and report on kernel performance throughout the development lifecycle by using and expanding upon current infrastructure

  • Provide feedback to the kernel developers by identifying performance regressions and opportunities to reach the achievable peak performance

What we need to see:

  • PhD degree in Computer Science, Computer Engineering, Applied Math, or related field (or equivalent experience) with 8+ years of relevant industry experience

  • Demonstrated strong C++ programming and software design skills, including debugging, performance analysis, and test design

  • Experience leading or managing a team relating to the performance of CPUs, GPUs, or other DL accelerators

Ways to stand out from the crowd:

  • Experience with analytical models and cycle-accurate HW simulators

  • Knowledgeable about performance tools like Nsight or VTune

  • Programming experience beyond C++ including assembly, MLIR/LLVM, Python, and CUDA/OpenCL

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. Are you a creative and collaborative software leader seeking new challenges? If so, we want to hear from you! Come, join our DL Architecture team and help build the real-time, cost-effective AI computing platform driving our success in this exciting and quickly growing field.

The base salary range is 224,000 USD - 425,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.
