GPU Kernel Dev & Perf Analysis Architect

18 Minutes ago • 4 Years + • Research & Development

Job Summary

Job Description

NVIDIA seeks a GPU Kernel Dev & Perf Analysis Architect to design, develop, and optimize GEMM (General Matrix Multiply) kernels for new architectures. Responsibilities include implementing and fine-tuning kernels for optimal performance on NVIDIA GPUs, conducting in-depth performance analysis, identifying bottlenecks, optimizing resource utilization, and improving throughput and power efficiency. The role involves creating and maintaining workloads and micro-benchmarks, generating performance reports, and collaborating with architecture, software, and product teams. This position requires 4+ years of experience in GPU programming or performance optimization for DL applications, expertise in CUDA programming, and proficiency with performance profiling tools like NVIDIA Nsight.
Must have:
  • 4+ years GPU programming/DL optimization experience
  • GEMM kernel development and optimization
  • GPU kernel performance analysis and improvement
  • CUDA programming expertise
  • Proficiency with NVIDIA Nsight

Job Details

NVIDIA is developing processor and system architectures that accelerate machine learning, automotive, and high-performance computing (HPC) applications. We are seeking a strong candidate for GEMM kernel development and performance analysis on NVIDIA's new architectures. Your work will play a critical role in shaping the future of deep learning hardware and software, ensuring optimal performance for next-generation AI applications. This position offers the opportunity to make a meaningful impact in a fast-moving, technology-focused company.

What you'll be doing:

  • Design, develop, and optimize GEMM (General Matrix Multiply) kernels for NVIDIA's new architectures.

  • Implement and fine-tune kernels to achieve optimal performance on NVIDIA GPUs.

  • Conduct in-depth performance analysis of GPU kernels, including GEMM and other critical operations.

  • Identify bottlenecks, optimize resource utilization, and improve throughput and power efficiency.

  • Create and maintain workloads and micro-benchmark suites to evaluate kernel performance across various hardware and software configurations.

  • Generate performance projections, comparisons, and detailed analysis reports for internal and external stakeholders.

  • Collaborate with architecture, software, and product teams to guide the development of next-generation deep learning hardware and software.

What we need to see:

  • 4+ years of industry experience in GPU programming or performance optimization for DL applications.

  • Hands-on experience in developing and optimizing GEMM (General Matrix Multiply) kernels.

  • Demonstrated experience in analyzing and improving the performance of GPU kernels, with measurable results (e.g., performance improvements, efficiency gains).

  • Expertise in CUDA programming for GPU acceleration.

  • Experience with performance profiling tools (e.g., NVIDIA Nsight).

  • Excellent communication skills, both written and verbal.

  • Strong organizational and time management abilities, with the ability to prioritize tasks effectively.

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

