Deep Learning Performance Architect

3 Months ago • 5 Years + • Research & Development • Artificial Intelligence

Job Summary

Job Description

NVIDIA seeks a Deep Learning Performance Architect to contribute to AI performance modeling and analysis efforts. Responsibilities include analyzing state-of-the-art deep learning networks (LLMs), identifying performance opportunities, developing analytical models, specifying hardware/software configurations, and collaborating with cross-functional teams to guide the direction of next-gen deep learning hardware and software. The role involves optimizing performance and efficiency for various LLM workloads on current and future inference products. This position requires expertise in deep learning, AI models, and hardware architectures.
Must have:
  • 5+ years of experience
  • Deep learning expertise
  • LLM and AIGC model experience
  • DL framework knowledge (Torch/JAX/TensorFlow/TensorRT)
  • Hardware architecture knowledge
  • Performance modeling and analysis skills
Perks:
  • Competitive salary
  • Generous benefits package

Job Details

NVIDIA is developing processor and system architectures that accelerate deep learning and high-performance computing applications. We are looking for an expert deep learning system performance architect to join our AI performance modelling and analysis efforts. In this position, you will work on DL performance modelling, analysis, and optimization on state-of-the-art hardware architectures for various LLM workloads, contributing to our dynamic, technology-focused company.

What you'll be doing:

  • Analyze state-of-the-art DL networks (LLMs, etc.), and identify and prototype performance opportunities to influence the SW and Architecture teams for NVIDIA's current and next-gen inference products.

  • Develop analytical models for state-of-the-art deep learning networks and algorithms to drive innovation in processor and system architecture design for performance and efficiency.

  • Specify hardware/software configurations and metrics to analyze performance, power, and accuracy in existing and future uni-processor and multiprocessor configurations.

  • Collaborate across the company to guide the direction of next-gen deep learning HW/SW by working with architecture, software, and product teams.

What we need to see:

  • BS, MS or PhD in relevant discipline (CS, EE, Math, etc.) or equivalent experience.

  • 5+ years of work experience.

  • Experience with popular AI models (e.g., LLM and AIGC models).

  • Familiarity with typical deep learning SW frameworks (e.g., Torch/JAX/TensorFlow/TensorRT).

  • Knowledge of and experience with hardware architectures for deep learning applications.

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you! NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

#deeplearning

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.
