Deep Learning Performance Architect

5 Months ago • 2 Years + • Research Development

Job Summary

Job Description

As a Deep Learning Performance Architect at NVIDIA, you'll benchmark and analyze AI workloads across single and multi-node configurations. You'll develop high-level simulators and debuggers in C++/Python, evaluate performance, power, and area (PPA) trade-offs for hardware and system architecture. Collaboration with architecture and product management teams is crucial for trade-off analysis throughout the project lifecycle. Staying current with deep learning trends and research is essential. Responsibilities include working with modern transformer-based model architectures, benchmarking, projections, workload profiling and clear technical communication.
Must have:
  • MS/PhD in CS, EE, Math
  • 2+ years in parallel computing
  • Strong C, C++, Python skills
  • Architecture analysis & modeling
  • Problem-solving skills
Good to have:
  • Understanding of transformer models
  • Experience with benchmarking methodologies
  • Workload profiling and correlation
  • Communication with non-technical audiences

Job Details

NVIDIA has continuously reinvented itself. Our invention of the GPU sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. Today, research in artificial intelligence is booming worldwide, which calls for highly scalable and massively parallel computation horsepower that NVIDIA GPUs excel.

NVIDIA is a “learning machine” that constantly evolves by adapting to new opportunities that are hard to solve, that only we can address, and that matter to the world. This is our life’s work , to amplify human creativity and intelligence. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join our diverse team and see how you can make a lasting impact on the world!

Intelligent machines powered by Artificial Intelligence computers that can learn, reason and interact with people are no longer science fiction. GPU Deep Learning has provided the foundation for machines to learn, perceive, reason and solve problems. NVIDIA's GPUs run AI algorithms, simulating human intelligence, and act as the brains of computers, robots and self-driving cars that can perceive and understand the world. Increasingly known as “the AI computing company”, NVIDIA wants you. Come, join our Deep Learning Architecture team, where you can help build real-time, cost-effective computing platforms driving our success in this exciting and rapidly growing field!

What you'll be doing:

  • Benchmark and analyze AI workloads in single and multi-node configurations.

  • High level simulator and debugger development in C++/Python.

  • Evaluate PPA (performance, power, area) for hardware features and system-level architectural trade-offs.

  • Work closely with wider architecture teams, architecture and product management to help with trade-off analysis at every stage of the project.

  • Keep abreast with emerging trends and research in deep learning.

What we need to see:

  • MS or PhD in a relevant discipline (CS, EE, Math).

  • 2+ years of experience in parallel computing architectures, interconnect fabrics and deep learning applications.

  • Strong programming skills in C, C++ and Python.

  • Proficiency in architecture analysis and performance modeling.

  • Curious mindset with excellent problem solving skills.

Ways to stand out from the crowd: 

  • Understanding of modern transformer-based model architectures.

  • Experience with benchmarking, projections methodologies, workload profiling and correlation.

  • Ability to simplify and communicate rich technical concepts with non-technical audience.

#LI-Hybrid

Similar Jobs

PwC - Senior Consultant / Senior Consultant (Finance of the Future)

PwC

Warsaw, Masovian Voivodeship, Poland (Hybrid)
1 Month ago
Gupta Media - Media Analyst

Gupta Media

New York, New York, United States (On-Site)
3 Months ago
Hawkeye Innovations - KinaTrax Systems Operator - Baseball Tech

Hawkeye Innovations

Atlanta, Georgia, United States (On-Site)
4 Months ago
Penumbrainc - Electronic Data Interchange (EDI) Analyst

Penumbrainc

United States (Hybrid)
2 Months ago
Activision - Director, Learning & Development

Activision

Los Angeles, California, United States (On-Site)
2 Weeks ago
Make - Senior Process Automation & AI specialist

Make

Prague, Czechia (On-Site)
2 Months ago
Eventbrite - Researcher II (East Coast)

Eventbrite

United States (Remote)
1 Month ago
Lionbridge Games - AI Program Director

Lionbridge Games

(Remote)
5 Months ago
flying wild hog - Lead User Researcher

flying wild hog

Poland (Remote)
5 Months ago
Pika - Research Engineer (Applied Research)

Pika

Palo Alto, California, United States (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Trailer park group - Post Production Manager

Trailer park group

Los Angeles, California, United States (On-Site)
1 Week ago
Social Discovery Group - Head of Product of Premium products

Social Discovery Group

Thailand (Remote)
8 Months ago
HoYoverse - Accountant (GL)

HoYoverse

Singapore (On-Site)
2 Months ago
GHX - Inventory Specialist

GHX

Phoenix, Arizona, United States (On-Site)
3 Months ago
Lionbridge Games - Audio Lead

Lionbridge Games

Berlin, Berlin, Germany (On-Site)
5 Months ago
WebFX - Technical Marketing Analyst

WebFX

(Remote)
3 Months ago
Valeo - Industrial Controlling Trainee

Valeo

Mondovì, Piedmont, Italy (On-Site)
1 Month ago
Side - Espagnol EU/Castilian Spanish - Localisation de jeux | Localization QA Tester

Side

Montreal, Quebec, Canada (On-Site)
1 Week ago
Paytm - Go-To-Market Lead - Deputy General Manager - Offline Merchants QR

Paytm

Ahmedabad, Gujarat, India (On-Site)
2 Months ago
Assystems - CDM Technical Coordinator

Assystems

Bridgwater, England, United Kingdom (On-Site)
9 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Hyderabad, Telangana, India

Capgemini - DC-ACI/Nexus

Capgemini

Bengaluru, Karnataka, India (On-Site)
2 Months ago
Zones - Software, Cloud and Datacenter Solution Architect

Zones

Bengaluru, Karnataka, India (On-Site)
5 Months ago
FICO - DevOps Engineering Enablement-Engineer II

FICO

Bengaluru, Karnataka, India (Hybrid)
1 Month ago
Touch Magix - Junior Engineer - Customer / Technical Support

Touch Magix

India (On-Site)
1 Month ago
Cognite - Vice President Global Academy

Cognite

Bengaluru, Karnataka, India (Hybrid)
1 Month ago
Zscaler - Analyst, Strategic Finance/Investor Relations

Zscaler

India (Remote)
1 Month ago
Nice - Specialist Software Engineer (Java, Microservices)

Nice

Pune, Maharashtra, India (On-Site)
1 Month ago
dun bradstreet - Strategic Account Manager

dun bradstreet

Mumbai, Maharashtra, India (On-Site)
5 Months ago
Scopely - Senior Data Analyst

Scopely

Bengaluru, Karnataka, India (Hybrid)
1 Month ago
NCR Voyix - Software Engineer III / Java Full Stack Developer

NCR Voyix

Chennai, Tamil Nadu, India (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Research Development Jobs

bytedance - Research Scientist Intern (Doubao (Seed) - Machine Learning System) - 2025 Summer (PhD)

bytedance

Seattle, Washington, United States (On-Site)
9 Months ago
Take-Two Interactive - Senior Architect II - AI

Take-Two Interactive

Canada (Remote)
2 Weeks ago
bytedance - Senior Research Engineer / Scientist - Storage for LLM

bytedance

Seattle, Washington, United States (On-Site)
3 Months ago
bytedance - Research Scientist, AI for Infra

bytedance

Seattle, Washington, United States (On-Site)
1 Week ago
FICO - Senior/Lead Research Engineer - AI/ML- Applied AI

FICO

United States (Remote)
3 Weeks ago
Canva - AI Research Lead - Generative AI

Canva

Adelaide, South Australia, Australia (Remote)
1 Month ago
Next Level Business Services - User Experience Researcher

Next Level Business Services

Santa Clara, California, United States (On-Site)
9 Months ago
zoox - Engineering Manager, ML Training Platform

zoox

Foster City, California, United States (Hybrid)
10 Months ago
level ai - Lead Research Engineer - Applied AI

level ai

California, United States (Hybrid)
1 Month ago
Tide - Lead Machine Learning Engineer (MLOps)

Tide

Ukraine (Hybrid)
1 Week ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

Taipei City, Taiwan (On-Site)

Beijing, Beijing, China (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Bengaluru, Karnataka, India (Hybrid)

Yokne'am Illit, North District, Israel (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Dubai, Dubai, United Arab Emirates (On-Site)

Beijing, Beijing, China (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug