Senior GPU Kernel Performance Lead

3 Months ago • 8 Years + • Research & Development • $224,000 PA - $425,500 PA

Job Summary

Job Description

NVIDIA seeks a Senior GPU Kernel Performance Lead to oversee performance validation of GPU math kernels for cuDNN, cuBLAS, and TensorRT libraries. Responsibilities include specifying test cases from deep learning workloads, developing analytical performance models, tracking kernel performance throughout the development lifecycle, and providing feedback to developers on performance optimization. The role requires strong C++ programming skills, experience leading performance teams, and a PhD in a related field or equivalent experience. The ideal candidate will have experience with analytical models, cycle-accurate HW simulators, and performance tools like Nsight or VTune. This position involves leading a team, analyzing and improving the performance of AI kernels impacting critical applications in image classification, speech recognition, and large language models.
Must have:
  • PhD in CS/related field or equiv. exp.
  • 8+ years relevant industry experience
  • Strong C++ programming & software design
  • Experience leading performance teams
  • Performance analysis & test design expertise
Good to have:
  • Analytical models & cycle-accurate HW simulators
  • Nsight/VTune experience
  • Assembly, MLIR/LLVM, Python, CUDA/OpenCL experience
Perks:
  • Equity
  • Benefits

Job Details

We're now looking for a Senior GPU Kernel Performance Lead. Do you enjoy analyzing and reporting on GPU kernel performance? If so, consider applying for the role of Senior GPU Kernel Performance Analysis Lead! Our team delivers high-performance GPU math kernels to NVIDIA’s cuDNN, cuBLAS, and TensorRT libraries to accelerate deep learning models. The team is proud to play an integral part in enabling breakthroughs in domains such as image classification, speech recognition, natural language processing,and large language models. We’re always striving for peak performance and energy efficiency on current and future-generation GPUs.

As a kernel performance analysis lead, you will oversee all efforts pertaining to the performance of our kernels. Join the team that is building the underlying software used across the world to power the revolution in artificial intelligence! To get a sense of the code we write, check out our CUTLASS open-source project showcasing performant matrix multiply on NVIDIA’s Tensor Cores with CUDA. While there will be the opportunity for hands-on development, this position specifically is to lead a team for validating the performance of the kernels.

What you’ll be doing:

  • Specify test cases, derived from Deep Learning workloads, to provide adequate directed and use-case coverage across all kernels on both simulation and silicon targets

  • Determine performance theory through the development and use of analytical models

  • Track and report on kernel performance throughout the development lifecycle by using and expanding upon current infrastructure

  • Provide feedback to the kernel developers by identifying performance regressions and opportunities to reach the achievable peak performance

What we need to see:

  • PhD degree in Computer Science, Computer Engineering, Applied Math, or related field (or equivalent experience) with 8+ years of relevant industry experience.

  • Demonstrated strong C++ programming and software design skills, including debugging, performance analysis, and test design

  • Experience leading or managing a team relating to the performance of CPUs, GPUs, or other DL accelerators

Ways to stand out from the crowd:

  • Experience with analytical models and cycle-accurate HW simulators

  • Knowledgeable about performance tools like Nsight or VTune

  • Programming experience beyond C++ including assembly, MLIR/LLVM, Python, and CUDA/OpenCL

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. Are you a creative and collaborative software leader seeking new challenges? If so, we want to hear from you! Come, join our DL Architecture team and help build the real-time, cost-effective AI computing platform driving our success in this exciting and quickly growing field.

The base salary range is 224,000 USD - 425,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

ByteDance - Senior Software Engineer - Serverless Compute Infrastructure

ByteDance

San Jose, California, United States (On-Site)
2 Months ago
ByteDance - Research Engineer Intern

ByteDance

Seattle, Washington, United States (On-Site)
2 Days ago
Meta - Research Scientist Intern, Language and Multimodal Research for MetaAI (PhD)

Meta

Seattle, Washington, United States (On-Site)
5 Months ago
Agara labs - Lead / Staff ML Scientist - NLP

Agara labs

(Remote)
21 Hours ago
ByteDance - Student Researcher (Doubao (Seed) - Foundation Model - MultiModal Generative Model)

ByteDance

San Jose, California, United States (On-Site)
1 Month ago
ByteDance - Student Researcher (Doubao (Seed) - Machine Learning System) - 2025 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
Trackman - Machine Learning Developer with DSP experience (Python/C++)

Trackman

Hørsholm, Denmark (On-Site)
1 Month ago
Rivos - Silicon Verification - Intern

Rivos

Santa Clara, California, United States (On-Site)
6 Months ago
ByteDance - Tech Lead, Software Engineer, Distributed Storage System

ByteDance

Seattle, Washington, United States (On-Site)
2 Weeks ago
Tesla - Electrical Engineering - Motor Design, Tesla Bot Internship

Tesla

Athens, Greece (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

NVIDIA - System Architect

NVIDIA

Santa Clara, California, United States (On-Site)
1 Month ago
NVIDIA - Technical Marketing Engineer - AI Platform Software

NVIDIA

Santa Clara, California, United States (Hybrid)
1 Month ago
NVIDIA - Senior ASIC Verification Engineer - HSIO

NVIDIA

Westford, Massachusetts, United States (On-Site)
3 Months ago
NVIDIA - Mixed Signal Design Engineer (RDSS Intern)

NVIDIA

Hsinchu, Hsinchu City, Taiwan (On-Site)
3 Months ago
NVIDIA - System Software Engineering Manager

NVIDIA

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Agara labs - Engineering Manager

Agara labs

(Remote)
21 Hours ago
Canva - Backend Software Engineer - Gen AI, Design Generation Experience

Canva

Melbourne, Victoria, Australia (Remote)
4 Weeks ago
Google - Student Researcher, BS/MS, Winter/Summer 2025

Google

(On-Site)
5 Months ago
Krafton  - [Deep Learning Div.] DL Strategy & Operations Associate (3년 ~ 8년)

Krafton

Seoul, South Korea (On-Site)
1 Week ago

Get notifed when new similar jobs are uploaded

Jobs in Santa Clara, California, United States

Google - Senior Software Engineer, Security/Privacy

Google

Kirkland, Washington, United States (On-Site)
2 Days ago
In The Pocket - Business Developer Public Sector

In The Pocket

Belgium, Wisconsin, United States (On-Site)
8 Hours ago
Egnyte - Senior Product Marketing Manager, Life Sciences

Egnyte

Raleigh, North Carolina, United States (Remote)
1 Month ago
Scout - Staff Technical Product Manager

Scout

Fremont, California, United States (On-Site)
1 Day ago
Google - Automation and Robotics Manufacturing Test Engineer

Google

Moncks Corner, South Carolina, United States (On-Site)
2 Days ago
IGT - Senior IT Internal Auditor

IGT

Providence, Rhode Island, United States (On-Site)
4 Months ago
ByteDance - Interaction Technology Lead - Smart Wearable Devices- Pico Lab- San Jose

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
Google - Product Engineer, Cloud Compute and Storage

Google

Atlanta, Georgia, United States (On-Site)
2 Weeks ago
Google - Software Engineer III, Google Cloud Compute Infrastructure

Google

Kirkland, Washington, United States (On-Site)
2 Days ago
Sophic Synergistics LLC - Human Factors Specialist Aerospace Focused

Sophic Synergistics LLC

Houston, Texas, United States (On-Site)
9 Months ago

Get notifed when new similar jobs are uploaded

Research & Development Jobs

Oculus VR - Product Design Engineer, Reality Labs

Oculus VR

Sunnyvale, California, United States (On-Site)
9 Months ago
Tesla - Mechanical Assembly Team Lead

Tesla

Rhineland-Palatinate, Germany (On-Site)
2 Months ago
NVIDIA - Research Scientist, Circuits

NVIDIA

Taipei City, Taiwan (On-Site)
3 Months ago
ByteDance - Engineering Manager Machine Learning Infrastructure

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
NVIDIA - Senior Chip Design Engineer

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (Hybrid)
2 Months ago
Google - Staff Software Engineer, Mobile (Android), YouTube

Google

San Bruno, California, United States (On-Site)
2 Days ago
ByteDance - Senior Machine Learning Ops Engineer, ML System

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
Meta - Software Engineer (Technical Leadership)

Meta

Redmond, Washington, United States (On-Site)
5 Months ago
Google - Embedded Software Engineer, Android Pixel Kernel

Google

Warsaw, Masovian Voivodeship, Poland (On-Site)
2 Weeks ago
NVIDIA - ASIC Design and STA Engineer

NVIDIA

Hyderabad, Telangana, India (Hybrid)
2 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Massachusetts, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Texas, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug