Senior Software Engineer, TensorRT-LLM

32 Minutes ago • 2 Years + • Artificial Intelligence • $184,000 PA - $356,500 PA

Job Summary

Job Description

NVIDIA's TensorRT-LLM team seeks a Senior Software Engineer to develop robust, scalable inferencing software for multiple platforms. Responsibilities include performance analysis, optimization, and tuning; staying current with AI advancements and updating TensorRT; and collaborating with cross-functional teams. The ideal candidate possesses a Master's degree (or equivalent experience) in a relevant field, 2+ years of software development experience, excellent C/C++ skills, deep learning framework experience (TensorFlow, PyTorch), and strong communication skills. This role involves crafting high-performance AI inferencing software foundational to NVIDIA's product lines and the broader industry.
Must have:
  • Master's degree in relevant field
  • 2+ years software development experience
  • Excellent C/C++ programming skills
  • Deep learning framework experience (TensorFlow, PyTorch)
  • Strong communication skills
Perks:
  • Equity
  • Benefits

Job Details

We are now looking for a TensorRT-LLM Software Development Engineer!

NVIDIA is hiring software engineers for its TensorRT-LLM team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning-powered AI, enabling breakthroughs in areas like LLMs, ChatGPT, and generative AI that have brought deep learning to the “iPhone moment” for AI. Join the team building the inferencing software that is foundational to product lines within NVIDIA and across the industry! The ability to work on a fast-paced, delivery-focused team is required, and excellent interpersonal skills are a must.

What you'll be doing:

  • Craft and develop robust inferencing software that can be scaled to multiple platforms for functionality and performance

  • Carry out performance analysis, optimization, and tuning

  • Closely follow academic developments in the field of artificial intelligence and update TensorRT with new features

  • Collaborate across the company to guide the direction of machine learning inferencing, working with software, research and product teams

What we need to see:

  • Master's or higher degree in Computer Engineering, Computer Science, Applied Mathematics, or a related computing-focused field (or equivalent experience)

  • 2+ years of relevant software development experience.

  • Excellent C/C++ programming and software design skills, including debugging, performance analysis, and test design.

  • Strong curiosity about artificial intelligence and awareness of the latest developments in deep learning, such as LLMs, generative models, and recommender models

  • Experience working with deep learning frameworks like TensorFlow and PyTorch

  • Proactive and able to work without supervision

  • Excellent written and oral communication skills in English

NVIDIA is widely considered to be one of technology’s most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. Does the idea of contributing to and pushing the boundaries of state-of-the-art AI and Compute systems excite you? Interested in getting exposure to the entire DL SW stack? Come join us and help build the GPU-accelerated DL platform used worldwide.

#LI-Hybrid

The base salary range is 184,000 USD - 356,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

