Deep Learning Performance Architect

1 Month ago • 5 Years + • Research & Development • Artificial Intelligence

Job Summary

Job Description

NVIDIA seeks a Deep Learning Performance Architect to contribute to AI performance modeling and analysis efforts. Responsibilities include analyzing state-of-the-art deep learning networks (LLMs), identifying performance opportunities, developing analytical models, specifying hardware/software configurations, and collaborating with cross-functional teams to guide the direction of next-gen deep learning hardware and software. The role involves optimizing performance and efficiency for various LLM workloads on current and future inference products. This position requires expertise in deep learning, AI models, and hardware architectures.
Must have:
  • 5+ years experience
  • Deep learning expertise
  • LLM and AIGC model experience
  • DL framework knowledge (Torch/JAX/TensorFlow/TensorRT)
  • Hardware architecture knowledge
  • Performance modeling and analysis skills
Perks:
  • Competitive salary
  • Generous benefits package

Job Details

NVIDIA is developing processor and system architectures that accelerate deep learning and high-performance computing applications. We are looking for an expert deep learning system performance architect to join our AI performance modeling and analysis efforts. In this position, you will work on DL performance modeling, analysis, and optimization on state-of-the-art hardware architectures for various LLM workloads, contributing to a dynamic, technology-focused company.

What you'll be doing:

  • Analyze state-of-the-art DL networks (LLMs, etc.), and identify and prototype performance opportunities to influence the SW and architecture teams for NVIDIA's current and next-gen inference products.

  • Develop analytical models for state-of-the-art deep learning networks and algorithms to drive innovation in processor and system architecture design for performance and efficiency.

  • Specify hardware/software configurations and metrics to analyze performance, power, and accuracy in existing and future uni-processor and multiprocessor configurations.

  • Collaborate across the company to guide the direction of next-gen deep learning HW/SW by working with architecture, software, and product teams.

What we need to see:

  • BS, MS or PhD in relevant discipline (CS, EE, Math, etc.) or equivalent experience.

  • 5+ years of work experience.

  • Experience with popular AI models (e.g., LLM and AIGC models).

  • Familiarity with typical deep learning SW frameworks (e.g., Torch/JAX/TensorFlow/TensorRT).

  • Knowledge of and experience with hardware architectures for deep learning applications.

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you! NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.


About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

