Senior Deep Learning Engineer

3 Days ago • 5 Years + • Artificial Intelligence • Research & Development • $148,000 PA - $287,500 PA

Job Summary

Job Description

NVIDIA seeks a Senior Deep Learning Engineer to contribute to next-generation inference optimizations and deliver top performance in AI. Responsibilities include analyzing and exploring techniques to scale test-time compute, optimizing low-latency inference, leveraging cross-stack optimizations, and collaborating with diverse teams. The role requires keeping abreast of generative AI research, prototyping emergent techniques (output refinement, speculation, retrieval), identifying optimization opportunities, and pioneering innovative solutions for high-quality inferencing on NVIDIA GPUs. Successful candidates will collaborate with production teams to integrate advancements into software frameworks.
Must have:
  • Master's degree in relevant field
  • Strong deep learning foundation (generative models, inferencing)
  • 5+ years of experience with modern deep learning frameworks (PyTorch)
  • Growth mindset and pragmatic attitude
Good to have:
  • Published research in deep learning (inference-time compute)
  • Experience with prototyping/deploying test-time compute techniques
  • Cross-team collaboration experience
  • Familiarity with computer architecture and AI algorithms
Perks:
  • Equity
  • Benefits

Job Details

We are now looking for a Senior Deep Learning Engineer! At NVIDIA, we are at the forefront of advancing the capabilities of artificial intelligence. We are seeking an ambitious and forward-thinking senior deep learning engineer to contribute to the development of next-generation inference optimizations and deliver industry-leading performance without compromising model quality. In this role, you will analyze and explore techniques to scale test-time compute and optimize low-latency inference. Your work will leverage cross-stack optimizations at the algorithmic and system level.

As NVIDIA makes significant strides in AI datacenters, our team holds a central role in maximizing the efficiency of our exponentially growing inference deployment needs and establishing a data-driven approach to algorithmic improvements, hardware design and system software development. We collaborate extensively with diverse teams at NVIDIA, ranging from deep learning research and framework development to silicon architecture. Thriving in such a high-impact, interdisciplinary environment necessitates not only technical proficiency but also a growth mindset and a pragmatic attitude, qualities that fuel our collective success in shaping the future of datacenter technology.

What You'll Be Doing:

  • Keeping abreast of the latest advancements in generative AI research.

  • Prototyping and analyzing emergent techniques in the test-time compute space such as output refinement, speculation, and retrieval. Identifying opportunities for algorithmic as well as system optimizations.

  • Pioneering the development of innovative optimizations to enable high-quality inferencing on NVIDIA GPUs.

  • Collaborating closely with production teams to incorporate the latest advancements into cutting-edge software frameworks.

What We Need to See:

  • Master's degree (or equivalent experience) in Computer Science, Artificial Intelligence, Applied Mathematics, or related fields.

  • A strong foundation in deep learning, with a particular emphasis on generative models and inferencing.

  • A track record of at least 5 years of relevant software development experience in modern deep learning frameworks such as PyTorch.

  • Growth mindset and pragmatic attitude.

Ways to Stand Out From the Crowd:

  • Published research or noteworthy contributions to the field of deep learning, particularly in areas such as inference-time compute, conditional compute, and speculative decoding.

  • Experience with prototyping and/or deployment of emergent test-time compute techniques.

  • Experience collaborating across algorithms, software and performance teams to deliver high-quality solutions.

  • Familiarity with computer architecture and how it relates to AI algorithm development.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're creative and autonomous, we want to hear from you!

The base salary range is 148,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

