Senior Software Engineer, Deep Learning Inference, TensorRT

1 Month ago • 3 Years + • Research & Development • $148,000 PA - $287,500 PA

Job Summary

Job Description

NVIDIA seeks a Senior Software Engineer specializing in Deep Learning Inference using TensorRT. Responsibilities include developing TensorRT components using C++ and Python, building graph parsers and optimizers, and collaborating with deep learning experts and GPU architects. The role involves optimizing deep learning models, particularly Large Language Models, for NVIDIA GPUs. Success requires strong C++ and machine learning expertise, proficiency in data structures and algorithms, and excellent communication skills.
Must have:
  • C++11/14/17 expertise
  • Strong ML concepts understanding
  • Computer Architecture knowledge
  • Data Structures & Algorithms
  • 3+ years software development experience
Good to have:
  • System Software development
  • Python proficiency
  • CUDA/OpenCL GPU kernel programming
  • Software performance optimization
  • Compiler development experience
  • TensorRT/PyTorch/TensorFlow experience
Perks:
  • Equity
  • Benefits

Job Details

We are now looking for a Senior Software Engineer for Deep Learning Inference! Would you like to make a big impact in Deep Learning by helping build a state-of-the-art inference framework for accelerating Deep Learning models, especially Large Language Models, on NVIDIA GPUs? We are now welcoming exceptional software engineers to apply to Senior Engineering positions in the Deep Learning Inference TensorRT software team.

What you’ll be doing:

  • Develop components of TensorRT, NVIDIA’s SDK for high-performance deep learning inference.

  • Use C++ and Python to build graph parsers, optimizers, and tools for effective deployment of trained deep learning models.

  • Collaborate with deep learning experts, GPU architects, and DevOps engineers across diverse teams.

What we need to see:

  • A Bachelor's, Master's, or PhD degree, or equivalent experience, in Computer Science, Computer Engineering, Electrical Engineering, or a related field.

  • 3+ years of software development experience.

  • Strong experience with C++11/C++14/C++17.

  • Strong grasp of Machine Learning concepts.

  • Experience and knowledge in Computer Architecture, Data Structures, Algorithms.

  • Excellent communication skills, and an aptitude for collaboration and teamwork.

Ways to stand out from the crowd:

  • Experience developing System Software.

  • Proficiency in Python, as well as a background in GPU kernel programming using CUDA or OpenCL.

  • Experience in software performance benchmarking, profiling, and optimizations.

  • Background in compiler development.

  • Experience in working with TensorRT, PyTorch, TensorFlow, ONNX Runtime or other ML frameworks.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative, autonomous and love a challenge, we want to hear from you. Come, join our TensorRT Workflows team and help build the real-time, cost-effective computing platform driving our success in this exciting and quickly growing field.

#LI-Hybrid

The base salary range is 148,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.
