Senior Software Engineer, Deep Learning Inference, TensorRT

2 Months ago • 3 Years + • Research & Development • $148,000 PA - $287,500 PA

Job Summary

Job Description

NVIDIA seeks a Senior Software Engineer to contribute to its TensorRT deep learning inference framework. Responsibilities include developing components of TensorRT using C++ and Python, building graph parsers and optimizers, and collaborating with cross-functional teams. The role requires expertise in C++, machine learning, computer architecture, and data structures. Experience with GPU kernel programming (CUDA or OpenCL), software performance optimization, and familiarity with frameworks like PyTorch, TensorFlow, or ONNX Runtime are advantageous. This position focuses on accelerating deep learning models, particularly large language models, on NVIDIA GPUs. The candidate will work on the real-time, cost-effective computing platform that drives NVIDIA's success in this growing field.
Must have:
  • C++11/14/17 expertise
  • Machine learning knowledge
  • Computer architecture understanding
  • Data structures and algorithms
  • 3+ years software development experience
Good to have:
  • System software development
  • Python proficiency
  • CUDA/OpenCL experience
  • Performance benchmarking/profiling
  • Compiler development
  • TensorRT/PyTorch/TensorFlow experience
Perks:
  • Equity
  • Benefits

Job Details

We are now looking for a Senior Software Engineer for Deep Learning Inference! Would you like to make a big impact in Deep Learning by helping build a state-of-the-art inference framework for accelerating Deep Learning models, especially Large Language Models, on NVIDIA GPUs? We are now welcoming exceptional software engineers to apply to Senior Engineering positions in the Deep Learning Inference TensorRT software team.

What you’ll be doing:

  • Develop components of TensorRT, NVIDIA’s SDK for high-performance deep learning inference.

  • Use C++ and Python to build graph parsers, optimizers, and tools for effective deployment of trained deep learning models.

  • Collaborate with teams of deep learning experts, GPU architects and DevOps engineers across diverse teams.

What we need to see:

  • A Bachelor's, Master's, PhD or equivalent experience in Computer Science, Computer Engineering, Electrical Engineering or related field.

  • 3+ years of software development experience.

  • Strong experience with C++11/C++14/C++17.

  • Strong grasp of Machine Learning concepts.

  • Experience and knowledge in Computer Architecture, Data Structures, Algorithms.

  • Excellent communication skills, and an aptitude for collaboration and teamwork.

Ways to stand out from the crowd:

  • Experience developing System Software.

  • Proficiency in Python as well as Background in GPU kernel programming using CUDA or OpenCL.

  • Experience in software performance benchmarking, profiling, and optimizations.

  • Background in compiler development

  • Experience in working with TensorRT, PyTorch, TensorFlow, ONNX Runtime or other ML frameworks.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative, autonomous and love a challenge, we want to hear from you. Come, join our TensorRT Workflows team and help build the real-time, cost-effective computing platform driving our success in this exciting and quickly growing field.

#LI-Hybrid

The base salary range is 148,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

Pluto7 - Data Scientist

Pluto7

Bengaluru, Karnataka, India (On-Site)
9 Months ago
Rackspace Technology - Principal MLOps Engineer

Rackspace Technology

(Remote)
2 Months ago
Amazon Games - Senior Software Developer, Amazon Games AI

Amazon Games

San Diego, California, United States (On-Site)
5 Months ago
ByteDance - Engineering Manager Machine Learning Infrastructure

ByteDance

Seattle, Washington, United States (On-Site)
6 Months ago
Canva - Senior Computer Vision Engineer - Photo AI

Canva

Vienna, Vienna, Austria (Remote)
2 Months ago
Google - Senior Memory Management Unit Architect, Silicon

Google

New Taipei, New Taipei City, Taiwan (On-Site)
1 Month ago
NVIDIA - Senior ASIC Design Engineer

NVIDIA

California, Maryland, United States (Remote)
2 Months ago
NVIDIA - System Software Engineer - Base OS (RDSS Intern)

NVIDIA

Taipei City, Taiwan (On-Site)
4 Months ago
Google - Software Engineer III, Embedded Systems, Pixel

Google

Mountain View, California, United States (On-Site)
1 Month ago
Netflix - Software Engineer 5 - Streaming Algorithms

Netflix

United States (Remote)
7 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

NVIDIA - Senior System Software Engineer - Dynamo and Triton Inference Server

NVIDIA

California, United States (Remote)
2 Months ago
Rackspace Technology - Senior Machine Learning Engineer

Rackspace Technology

Vietnam (Remote)
2 Months ago
Hitachi - Senior AI Data Scientist

Hitachi

Chennai, Tamil Nadu, India (On-Site)
7 Months ago
Netflix - Software Engineer (L5), N-Tech Software Engineering

Netflix

United States (Remote)
7 Months ago
TVH - Data Scientist

TVH

Pune, Maharashtra, India (On-Site)
8 Months ago
ByteDance - Research Scientist in Foundation Model, Speech Understanding - 2024 Start (PhD)

ByteDance

Seattle, Washington, United States (On-Site)
6 Months ago
ByteDance - Senior Machine Learning Engineer

ByteDance

San Jose, California, United States (On-Site)
1 Month ago
Nextbrain - Computer Vision Engineer

Nextbrain

Bengaluru, Karnataka, India (On-Site)
6 Months ago
Canva - Senior Machine Learning Engineer - Specialist Platform and Experience

Canva

Surry Hills, New South Wales, Australia (Remote)
2 Months ago
NVIDIA - Senior HPC Performance Engineer

NVIDIA

Santa Clara, California, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Santa Clara, California, United States

Meta - Software Engineer, Infrastructure

Meta

Atlanta, Georgia, United States (Remote)
6 Months ago
Regent Craft - Embedded Software Engineering Intern

Regent Craft

North Kingstown, Rhode Island, United States (On-Site)
7 Months ago
Meta - Software Engineer, Machine Learning

Meta

Sunnyvale, California, United States (On-Site)
6 Months ago
Google - Technical Program Manager III, Data Center Software Automation, Cloud Systems

Google

Council Bluffs, Iowa, United States (On-Site)
1 Month ago
ByteDance - Research Scientist (Machine Learning for Science (AI-for-Science))

ByteDance

Seattle, Washington, United States (On-Site)
1 Month ago
Intrepid Studios,  Inc  - Technical Artist

Intrepid Studios, Inc

San Diego, California, United States (On-Site)
1 Month ago
Google - Risk and Compliance Senior Associate, Global Business Strategy and Operations

Google

Atlanta, Georgia, United States (On-Site)
1 Month ago
Activate Games - Store Leader (Store Manager)

Activate Games

Pembroke Pines, Florida, United States (On-Site)
1 Month ago
Haptic - Lead Technical Artist

Haptic

Dallas, Texas, United States (Remote)
4 Months ago
Dynamics - Junior Financial Analyst

Dynamics

Springfield, Virginia, United States (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

Research & Development Jobs

Fluence - Controls Software Engineer-II(m/f/d)

Fluence

Erlangen, Bavaria, Germany (Hybrid)
7 Months ago
Google - Senior Software Engineering Manager, Wear OS Platform

Google

Mountain View, California, United States (On-Site)
1 Month ago
Riot Games - Staff Software Engineer (Graphics)

Riot Games

Los Angeles, California, United States (On-Site)
1 Month ago
Google - Staff Software Engineer, Infrastructure, Platforms Infrastructure Engineering

Google

Sunnyvale, California, United States (On-Site)
1 Month ago
Tesla - Constructor

Tesla

Prüm, Rhineland-Palatinate, Germany (On-Site)
3 Months ago
ByteDance - Design Verification Engineer - Multimedia Lab

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
Google - Senior Software Engineer, Machine Learning (Recommendations, Rankings, and Predictions)

Google

Mountain View, California, United States (On-Site)
1 Month ago
Rivos - Data Parallel Accelerator Performance Intern

Rivos

Hsinchu, Hsinchu City, Taiwan (Hybrid)
7 Months ago
Tesla - Constructor

Tesla

Rhineland-Palatinate, Germany (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Massachusetts, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Texas, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (Hybrid)

Pune, Maharashtra, India (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug