GPU Kernel Dev & Perf Analysis Architect

2 Months ago • 4 Years + • Research & Development

Job Summary

Job Description

NVIDIA seeks a GPU Kernel Dev & Perf Analysis Architect to design, develop, and optimize GEMM (General Matrix Multiply) kernels for its new architectures. Responsibilities include implementing and fine-tuning kernels for optimal performance on NVIDIA GPUs, conducting in-depth performance analysis, identifying bottlenecks, optimizing resource utilization, and improving throughput and power efficiency. The role also involves creating and maintaining workloads and micro-benchmark suites, generating performance reports, and collaborating with architecture, software, and product teams. The position directly influences the development of next-generation deep learning hardware and software.
Must have:
  • 4+ years GPU programming/DL optimization experience
  • GEMM kernel development and optimization
  • GPU kernel performance analysis & improvement
  • CUDA programming expertise
  • Experience with performance profiling tools (e.g., NVIDIA Nsight)

Job Details

NVIDIA is developing processor and system architectures that accelerate machine learning, automotive, and high-performance computing (HPC) applications. We are seeking a strong candidate to do GEMM kernel development and performance analysis for NVIDIA's new architectures. Your work will play a critical role in shaping the future of deep learning hardware and software, ensuring optimal performance for next-generation AI applications. This position offers the opportunity to make a meaningful impact in a fast-moving, technology-focused company.

What you'll be doing:

  • Design, develop, and optimize GEMM (General Matrix Multiply) kernels for NVIDIA's new architectures.

  • Implement and fine-tune kernels to achieve optimal performance on NVIDIA GPUs.

  • Conduct in-depth performance analysis of GPU kernels, including GEMM and other critical operations.

  • Identify bottlenecks, optimize resource utilization, and improve throughput and power efficiency.

  • Create and maintain workloads and micro-benchmark suites to evaluate kernel performance across various hardware and software configurations.

  • Generate performance projections, comparisons, and detailed analysis reports for internal and external stakeholders.

  • Collaborate with architecture, software, and product teams to guide the development of next-generation deep learning hardware and software.

What we need to see:

  • 4+ years of industry experience in GPU programming or performance optimization for DL applications.

  • Hands-on experience in developing and optimizing GEMM (General Matrix Multiply) kernels.

  • Demonstrated experience in analyzing and improving the performance of GPU kernels, with measurable results (e.g., performance improvements, efficiency gains).

  • Expertise in CUDA programming for GPU acceleration.

  • Experience with performance profiling tools (e.g., NVIDIA Nsight).

  • Excellent communication skills, both written and verbal.

  • Strong organizational and time management abilities, with the ability to prioritize tasks effectively.

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.
