Deep Learning Performance Architect

2 Months ago • 2 Years + • Artificial Intelligence

Job Summary

Job Description

As a Deep Learning Performance Architect at NVIDIA, you'll benchmark and analyze AI workloads across single and multi-node configurations. You'll develop high-level simulators and debuggers in C++/Python and evaluate performance, power, and area (PPA) trade-offs for hardware and system architecture. Collaboration with architecture and product management teams is crucial for trade-off analysis throughout the project lifecycle. Staying current with deep learning trends and research is essential. Responsibilities include working with modern transformer-based model architectures, benchmarking, projections, workload profiling, and clear technical communication.
Must have:
  • MS/PhD in CS, EE, Math
  • 2+ years in parallel computing
  • Strong C, C++, Python skills
  • Architecture analysis & modeling
  • Problem-solving skills
Good to have:
  • Understanding of transformer models
  • Experience with benchmarking methodologies
  • Workload profiling and correlation
  • Communication with non-technical audiences

Job Details

NVIDIA has continuously reinvented itself. Our invention of the GPU sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. Today, research in artificial intelligence is booming worldwide, which calls for highly scalable and massively parallel computation horsepower at which NVIDIA GPUs excel.

NVIDIA is a “learning machine” that constantly evolves by adapting to new opportunities that are hard to solve, that only we can address, and that matter to the world. This is our life’s work: to amplify human creativity and intelligence. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join our diverse team and see how you can make a lasting impact on the world!

Intelligent machines powered by Artificial Intelligence computers that can learn, reason and interact with people are no longer science fiction. GPU Deep Learning has provided the foundation for machines to learn, perceive, reason and solve problems. NVIDIA's GPUs run AI algorithms, simulating human intelligence, and act as the brains of computers, robots and self-driving cars that can perceive and understand the world. Increasingly known as “the AI computing company”, NVIDIA wants you. Come, join our Deep Learning Architecture team, where you can help build real-time, cost-effective computing platforms driving our success in this exciting and rapidly growing field!

What you'll be doing:

  • Benchmark and analyze AI workloads in single and multi-node configurations.

  • Develop high-level simulators and debuggers in C++/Python.

  • Evaluate PPA (performance, power, area) for hardware features and system-level architectural trade-offs.

  • Work closely with the wider architecture and product management teams to support trade-off analysis at every stage of the project.

  • Keep abreast of emerging trends and research in deep learning.

What we need to see:

  • MS or PhD in a relevant discipline (CS, EE, Math).

  • 2+ years of experience in parallel computing architectures, interconnect fabrics and deep learning applications.

  • Strong programming skills in C, C++ and Python.

  • Proficiency in architecture analysis and performance modeling.

  • Curious mindset with excellent problem-solving skills.

Ways to stand out from the crowd: 

  • Understanding of modern transformer-based model architectures.

  • Experience with benchmarking and projection methodologies, workload profiling, and correlation.

  • Ability to simplify and communicate rich technical concepts to a non-technical audience.

#LI-Hybrid

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.