Senior Software Engineer, Deep Learning Inference, TensorRT

1 Month ago • 3 Years + • Research & Development • $148,000 PA - $287,500 PA

Job Summary

Job Description

NVIDIA seeks a Senior Software Engineer to contribute to its TensorRT deep learning inference framework. Responsibilities include developing components of TensorRT using C++ and Python, building graph parsers and optimizers, and collaborating with cross-functional teams. The role requires expertise in C++, machine learning, computer architecture, and data structures. Experience with GPU kernel programming (CUDA or OpenCL), software performance optimization, and familiarity with frameworks like PyTorch, TensorFlow, or ONNX Runtime are advantageous. This position focuses on accelerating deep learning models, particularly large language models, on NVIDIA GPUs. The candidate will work on the real-time, cost-effective computing platform that drives NVIDIA's success in this growing field.
Must have:
  • C++11/14/17 expertise
  • Machine learning knowledge
  • Computer architecture understanding
  • Data structures and algorithms
  • 3+ years software development experience
Good to have:
  • System software development
  • Python proficiency
  • CUDA/OpenCL experience
  • Performance benchmarking/profiling
  • Compiler development
  • TensorRT/PyTorch/TensorFlow experience
Perks:
  • Equity
  • Benefits

Job Details

We are now looking for a Senior Software Engineer for Deep Learning Inference! Would you like to make a big impact in Deep Learning by helping build a state-of-the-art inference framework for accelerating Deep Learning models, especially Large Language Models, on NVIDIA GPUs? We are now welcoming exceptional software engineers to apply to Senior Engineering positions in the Deep Learning Inference TensorRT software team.

What you’ll be doing:

  • Develop components of TensorRT, NVIDIA’s SDK for high-performance deep learning inference.

  • Use C++ and Python to build graph parsers, optimizers, and tools for effective deployment of trained deep learning models.

  • Collaborate with teams of deep learning experts, GPU architects and DevOps engineers across diverse teams.

What we need to see:

  • A Bachelor's, Master's, PhD or equivalent experience in Computer Science, Computer Engineering, Electrical Engineering or related field.

  • 3+ years of software development experience.

  • Strong experience with C++11/C++14/C++17.

  • Strong grasp of Machine Learning concepts.

  • Experience and knowledge in Computer Architecture, Data Structures, Algorithms.

  • Excellent communication skills, and an aptitude for collaboration and teamwork.

Ways to stand out from the crowd:

  • Experience developing System Software.

  • Proficiency in Python as well as Background in GPU kernel programming using CUDA or OpenCL.

  • Experience in software performance benchmarking, profiling, and optimizations.

  • Background in compiler development

  • Experience in working with TensorRT, PyTorch, TensorFlow, ONNX Runtime or other ML frameworks.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative, autonomous and love a challenge, we want to hear from you. Come, join our TensorRT Workflows team and help build the real-time, cost-effective computing platform driving our success in this exciting and quickly growing field.

#LI-Hybrid

The base salary range is 148,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

Canva - Machine Learning Engineer Intern

Canva

Auckland, Auckland, New Zealand (Remote)
2 Weeks ago
NVIDIA - Senior Site Reliability Engineer - AI Research Clusters

NVIDIA

Santa Clara, California, United States (Hybrid)
3 Months ago
Google - Software Engineering Manager, Visual Language and Multimodal Modeling

Google

Sydney, New South Wales, Australia (On-Site)
1 Week ago
NVIDIA - Senior Research Engineer for Reinforcement Learning

NVIDIA

Canada (On-Site)
1 Month ago
ByteDance - Research Engineer Graduate (Machine Learning Sys-US) - 2024 Start (PhD)

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago
Google - Low Power Verification Lead, Core IP

Google

Bengaluru, Karnataka, India (On-Site)
1 Week ago
NVIDIA - Principal Autonomous Vehicles Engineer - Mapping and Localization

NVIDIA

Beijing, Beijing, China (On-Site)
1 Month ago
NVIDIA - Senior Firmware Engineer - Embedded Controller

NVIDIA

Santa Clara, California, United States (On-Site)
1 Week ago
Google - Senior Staff Software Engineer, AI/ML GenAI, Google Ads

Google

New York, New York, United States (On-Site)
6 Days ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Microsoft - Research Intern - Microsoft Teams CMD Labs

Microsoft

Redmond, Washington, United States (On-Site)
1 Week ago
Eightfold - Lead Engineer- Backend

Eightfold

Bengaluru, Karnataka, India (Hybrid)
5 Months ago
ByteDance - Software Engineer in ML Engineering Platform

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago
Meta - Postdoctoral Researcher, Embodied AI (PhD)

Meta

Seattle, Washington, United States (On-Site)
5 Months ago
PwC - IN-Senior Associate_ML Engineer_Data and Analytics_Advisory_Bangalore

PwC

Bengaluru, Karnataka, India (On-Site)
6 Months ago
Luxoft - Senior ML Engineer

Luxoft

Poland, Ohio, United States (Remote)
3 Months ago
SmileGate - Game Data Engineer [LOST ARK]

SmileGate

Seongnam-si, Gyeonggi-do, South Korea (On-Site)
3 Months ago
Granicus - Data Scientist 4

Granicus

Bengaluru, Karnataka, India (Hybrid)
6 Months ago
Meta - Software Engineer, Machine Learning

Meta

Burlingame, California, United States (On-Site)
5 Months ago
ByteDance - Video Analysis and Quality Algorithm Intern 2023 Summer/Fall (PHD)

ByteDance

San Diego, California, United States (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Santa Clara, California, United States

Nintendo - Localization Product Specialist III - Spanish

Nintendo

Redmond, Washington, United States (Hybrid)
5 Months ago
Light Speed Studios - Senior Technical Artist

Light Speed Studios

Irvine, California, United States (On-Site)
4 Months ago
SciPlay - Software Engineering Intern

SciPlay

Cedar Falls, Iowa, United States (On-Site)
1 Week ago
Sphere Entertainment Co - Director Retail Strategy

Sphere Entertainment Co

Las Vegas, Nevada, United States (On-Site)
3 Months ago
NVIDIA - Senior RTL Analysis Methodology Engineer

NVIDIA

Santa Clara, California, United States (On-Site)
3 Weeks ago
Onward Search - Graphic Designer/Production Artist

Onward Search

New York, New York, United States (Remote)
1 Day ago
Visa - Sr. Site Reliability Engineer, Product Reliability Engineering - Middleware

Visa

Austin, Texas, United States (Hybrid)
4 Months ago
Universal Music - Director, eCommerce & Artist Services

Universal Music

New York, New York, United States (On-Site)
1 Month ago
Xsolla - Senior Tax Manager – U.S. Tax Compliance

Xsolla

Los Angeles, California, United States (Hybrid)
5 Hours ago
Microsoft - Senior Researcher – Generative AI

Microsoft

Redmond, Washington, United States (On-Site)
4 Days ago

Get notifed when new similar jobs are uploaded

Research & Development Jobs

Google - CPU System Software Engineer, Performance Architect

Google

Austin, Texas, United States (On-Site)
6 Days ago
Rivos - Logic Equivalence Check (LEC) Engineer

Rivos

Hsinchu, Hsinchu City, Taiwan (Hybrid)
6 Months ago
NVIDIA - Physical Design Engineer

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
3 Months ago
Playtika - R&D Group Manager

Playtika

Poland (Hybrid)
1 Week ago
NVIDIA - Senior Chip Design Verification Engineer

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
4 Weeks ago
NVIDIA - Senior Methodology Software Engineer

NVIDIA

Santa Clara, California, United States (On-Site)
1 Week ago
NVIDIA - Senior High-Performance ASIC Timing Engineer

NVIDIA

Santa Clara, California, United States (On-Site)
1 Month ago
NVIDIA - Design Verification Engineer (RDSS Intern)

NVIDIA

Hsinchu, Hsinchu City, Taiwan (On-Site)
3 Months ago
NVIDIA - Senior Power Integrity Engineer

NVIDIA

Canada (Hybrid)
1 Month ago
NVIDIA - Senior Mask Layout Design Engineer

NVIDIA

Taipei City, Taiwan (On-Site)
3 Weeks ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug