Outscal Logooutscal logo

Senior Software Engineer, Deep Learning Inference, TensorRT

1 Week ago • 3 Years + • Research & Development • $148,000 PA - $287,500 PA

Job Summary

Job Description

NVIDIA seeks a Senior Software Engineer to contribute to its TensorRT deep learning inference framework. Responsibilities include developing components of TensorRT using C++ and Python, building graph parsers and optimizers, and collaborating with cross-functional teams. The role requires expertise in C++, machine learning, computer architecture, and data structures. Experience with GPU kernel programming (CUDA or OpenCL), software performance optimization, and familiarity with frameworks like PyTorch, TensorFlow, or ONNX Runtime are advantageous. This position focuses on accelerating deep learning models, particularly large language models, on NVIDIA GPUs. The candidate will work on the real-time, cost-effective computing platform that drives NVIDIA's success in this growing field.
Must have:
  • C++11/14/17 expertise
  • Machine learning knowledge
  • Computer architecture understanding
  • Data structures and algorithms
  • 3+ years software development experience
Good to have:
  • System software development
  • Python proficiency
  • CUDA/OpenCL experience
  • Performance benchmarking/profiling
  • Compiler development
  • TensorRT/PyTorch/TensorFlow experience
Perks:
  • Equity
  • Benefits

Job Details

We are now looking for a Senior Software Engineer for Deep Learning Inference! Would you like to make a big impact in Deep Learning by helping build a state-of-the-art inference framework for accelerating Deep Learning models, especially Large Language Models, on NVIDIA GPUs? We are now welcoming exceptional software engineers to apply to Senior Engineering positions in the Deep Learning Inference TensorRT software team.

What you’ll be doing:

  • Develop components of TensorRT, NVIDIA’s SDK for high-performance deep learning inference.

  • Use C++ and Python to build graph parsers, optimizers, and tools for effective deployment of trained deep learning models.

  • Collaborate with teams of deep learning experts, GPU architects and DevOps engineers across diverse teams.

What we need to see:

  • A Bachelor's, Master's, PhD or equivalent experience in Computer Science, Computer Engineering, Electrical Engineering or related field.

  • 3+ years of software development experience.

  • Strong experience with C++11/C++14/C++17.

  • Strong grasp of Machine Learning concepts.

  • Experience and knowledge in Computer Architecture, Data Structures, Algorithms.

  • Excellent communication skills, and an aptitude for collaboration and teamwork.

Ways to stand out from the crowd:

  • Experience developing System Software.

  • Proficiency in Python as well as Background in GPU kernel programming using CUDA or OpenCL.

  • Experience in software performance benchmarking, profiling, and optimizations.

  • Background in compiler development

  • Experience in working with TensorRT, PyTorch, TensorFlow, ONNX Runtime or other ML frameworks.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative, autonomous and love a challenge, we want to hear from you. Come, join our TensorRT Workflows team and help build the real-time, cost-effective computing platform driving our success in this exciting and quickly growing field.

#LI-Hybrid

The base salary range is 148,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

NVIDIA - Senior Research Engineer for Reinforcement Learning

NVIDIA

Canada (On-Site)
1 Month ago
Trustana - Senior Data Engineer

Trustana

Gurugram, Haryana, India (Hybrid)
5 Months ago
Visa - Senior Manager Data Science - Visa Consulting & Analytics

Visa

Mumbai, Maharashtra, India (On-Site)
5 Months ago
Truecaller - Senior MLOps Engineer

Truecaller

Stockholm, Stockholm County, Sweden (On-Site)
4 Months ago
NVIDIA - Senior Software Developer

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
2 Months ago
NVIDIA - Senior Physical Design Engineer

NVIDIA

Hsinchu, Hsinchu City, Taiwan (On-Site)
2 Months ago
Riot Games - Manager, Software Engineering - Teamfight Tactics, Core Tech

Riot Games

Los Angeles, California, United States (On-Site)
1 Month ago
ByteDance - DevOps Engineer, Applied Machine Learning Engine - 2025 Start

ByteDance

Singapore (On-Site)
4 Months ago
NVIDIA - Design Verification Intern - 2025

NVIDIA

Shenzhen, Guangdong Province, China (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Anavation - AI Specialist

Anavation

Chantilly, Virginia, United States (On-Site)
3 Months ago
NVIDIA - Senior HPC Performance Engineer

NVIDIA

Canada (On-Site)
1 Week ago
ByteDance - Machine Learning Engineer - MLDev

ByteDance

Seattle, Washington, United States (On-Site)
1 Day ago
ByteDance - Research Scientist (Machine Learning for Science (AI-for-Science))

ByteDance

Seattle, Washington, United States (On-Site)
20 Hours ago
NVIDIA - AI Computing Software Development Engineer, TensorRT

NVIDIA

Taipei City, Taiwan (On-Site)
2 Months ago
The Walt Disney Company - Lead Software Engineer, Machine Learning - Ad Platforms

The Walt Disney Company

Seattle, Washington, United States (On-Site)
4 Months ago
Netflix - Research Engineer L4/L5 -LLMs for Search, Recommendations, and Personalization

Netflix

Los Gatos, California, United States (On-Site)
5 Months ago
Google - Software Engineer III, Core Machine Learning, Google Cloud

Google

Mountain View, California, United States (On-Site)
4 Months ago
ByteDance - Research Scientist, Foundation Model, Music Intelligence

ByteDance

San Jose, California, United States (On-Site)
4 Months ago
ByteDance - Software Engineer Intern (Applied Machine Learning-Enterprise) - 2025 Summer/Fall (PhD)

ByteDance

San Jose, California, United States (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Santa Clara, California, United States

Meta - Art Director

Meta

Los Angeles, California, United States (On-Site)
4 Months ago
Epic Games - Senior Concept Artist

Epic Games

United States (On-Site)
2 Months ago
Super - Senior Full-Stack Software Engineer ( Remote! )

Super

Orlando, Florida, United States (Remote)
4 Months ago
Riot Games - Manager, Software Engineering - Infrastructure / Cloud Foundations

Riot Games

Los Angeles, California, United States (On-Site)
2 Months ago
Interactive Brokers - Entry- Level Career Opportunities: Institutional Services

Interactive Brokers

Greenwich, Connecticut, United States (Hybrid)
5 Months ago
undefined - Technical Consultant, West

United States (Remote)
5 Months ago
Canva - Senior Manager, Corporate FP&A

Canva

San Francisco, California, United States (Remote)
3 Weeks ago
ByteDance - Research Scientist Graduate (High-Performance Computing (Inference Optimization) - Vision AI Platform)

ByteDance

Seattle, Washington, United States (On-Site)
21 Hours ago
Trek - Seasonal Sales Associate - Part Time

Trek

Newport News, Virginia, United States (On-Site)
1 Month ago
ByteDance - Principal Product Manager - IaaS AI Infra

ByteDance

Seattle, Washington, United States (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Research & Development Jobs

NVIDIA - Senior Software Engineer - Robot Learning Platform

NVIDIA

Toronto, Ontario, Canada (On-Site)
3 Weeks ago
NVIDIA - Senior SRAM Engineer, Circuit Design

NVIDIA

Santa Clara, California, United States (Hybrid)
1 Month ago
Assystems - Ingénieur MES / AVEVA H/F

Assystems

Carquefou, Pays De La Loire, France (On-Site)
4 Months ago
Rivos - SOC Design Verification - Intern

Rivos

Santa Clara, California, United States (On-Site)
5 Months ago
NVIDIA - ASIC Design Engineer

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
1 Month ago
Tencent - Speech Synthesis Intern

Tencent

(On-Site)
1 Month ago
Magic Leap - Sr Optical Engineer, Software

Magic Leap

Plantation, Florida, United States (Hybrid)
3 Months ago
ByteDance - Wearable Electrical Engineer / Architect

ByteDance

San Jose, California, United States (On-Site)
21 Hours ago
Assystems - Administrateur AVEVA PDMS E3D H/F

Assystems

Marseille, Provence-Alpes-Côte D'Azur, France (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.


Hsinchu, Hsinchu City, Taiwan (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Seoul, South Korea (Hybrid)

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)

Ra'anana, Center District, Israel (On-Site)

Shanghai, Shanghai, China (On-Site)

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)

Be'er Sheva, South District, Israel (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug