Senior Software Engineer, TensorRT-LLM

2 Months ago • 2 Years + • Artificial Intelligence • $184,000 PA - $356,500 PA

Job Summary

Job Description

NVIDIA's TensorRT-LLM team seeks a Senior Software Engineer to develop robust, scalable inferencing software for multiple platforms. Responsibilities include performance analysis, optimization, and tuning; staying current with AI advancements and updating TensorRT; and collaborating with cross-functional teams. The ideal candidate possesses a Master's degree (or equivalent experience) in a relevant field, 2+ years of software development experience, excellent C/C++ skills, deep learning framework experience (TensorFlow, PyTorch), and strong communication skills. This role involves crafting high-performance AI inferencing software foundational to NVIDIA's product lines and the broader industry.
Must have:
  • Master's degree in relevant field
  • 2+ years software development experience
  • Excellent C/C++ programming skills
  • Deep learning framework experience (TensorFlow, PyTorch)
  • Strong communication skills
Perks:
  • Equity
  • Benefits

Job Details

We are now looking for a TensorRT-LLM Software Development Engineer!

NVIDIA is hiring software engineers for its TensorRT-LLM team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning-powered AI, enabling breakthroughs in areas like LLM, ChatGPT and Generative AI that have put DL at the “iPhone moment” for AI. Join the team which is building the inferencing software which is foundational to product lines within NVIDIA and across the industry! The ability to work on a fast-paced delivery-focused team is required and excellent interpersonal skills are a must.

What you'll be doing:

  • Craft and develop robust inferencing software that can be scaled to multiple platforms for functionality and performance

  • Performance analysis, optimization and tuning

  • Closely follow academic developments in the field of artificial intelligence and feature update TensorRT

  • Collaborate across the company to guide the direction of machine learning inferencing, working with software, research and product teams

What we need to see:

  • Masters or higher degree in Computer Engineering, Computer Science, Applied Mathematics or related computing focused degree (or equivalent experience)

  • 2+ years of relevant software development experience.

  • Excellent C/C++ programming and software design skills, including debugging, performance analysis, and test design.

  • Strong curiosity about artificial intelligence, awareness of the latest developments in deep learning like LLMs, generative and recommender models

  • Experience working with deep learning frameworks like TensorFlow and PyTorch

  • Proactive and able to work without supervision

  • Excellent written and oral communication skills in English

NVIDIA is widely considered to be one of technology’s most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. Does the idea of contributing to and pushing the boundaries of state-of-the-art AI and Compute systems excite you? Interested in getting exposure to the entire DL SW stack? Come join us and help build the GPU-accelerated DL platform used worldwide.

#LI-Hybrid

The base salary range is 184,000 USD - 356,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

Morningstar - Senior Software Development Engineer, ML Operations

Morningstar

Mumbai, Maharashtra, India (Hybrid)
6 Days ago
Rackspace Technology - Machine Learning Architect (AWS)

Rackspace Technology

(Remote)
1 Month ago
ManyChat - Lead Machine Learning Scientist

ManyChat

Amsterdam, North Holland, Netherlands (Hybrid)
6 Days ago
bytedance - Imaging System Architect

bytedance

San Jose, California, United States (On-Site)
1 Month ago
Rackspace Technology - Senior Machine Learning Engineer

Rackspace Technology

Vietnam (Remote)
2 Months ago
bytedance - Research Engineer Graduate (Vision AI Platform)

bytedance

Seattle, Washington, United States (On-Site)
1 Month ago
Xsolla - Machine Learning Engineer

Xsolla

Montreal, Quebec, Canada (Remote)
1 Month ago
Orion Innovation - Data Engineer-AI,ML

Orion Innovation

Chennai, Tamil Nadu, India (On-Site)
7 Months ago
Genies - Backend Engineer Intern (LLM)

Genies

San Mateo, California, United States (Hybrid)
2 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Spyke Games - Generative AI Engineer

Spyke Games

İstanbul, Türkiye (On-Site)
2 Weeks ago
bytedance - Research Scientist, Foundation Model, Music Intelligence

bytedance

San Jose, California, United States (On-Site)
7 Months ago
bytedance - Software Engineer, Model Inference

bytedance

Seattle, Washington, United States (On-Site)
1 Month ago
Twitch - Sr. Applied Scientist

Twitch

San Francisco, California, United States (On-Site)
2 Months ago
Xepelin - Data Scientist I

Xepelin

Santiago, Santiago Metropolitan Region, Chile (Hybrid)
1 Month ago
Condé Nast - Data Scientist Manager

Condé Nast

New York, United States (On-Site)
6 Days ago
NVIDIA - Senior Site Reliability Engineer - AI Research Clusters

NVIDIA

Santa Clara, California, United States (Hybrid)
4 Months ago
Match Group - Senior ML Software Engineering Team Leader

Match Group

Seoul, South Korea (Hybrid)
7 Months ago
Tekion Corp - Machine Learning Architect

Tekion Corp

Pleasanton, California, United States (On-Site)
1 Month ago
bytedance - Software Engineer, ML System Scheduling

bytedance

Seattle, Washington, United States (On-Site)
7 Months ago

Get notifed when new similar jobs are uploaded

Jobs in California, United States

Qualcomm - IOT Software Engineer

Qualcomm

San Diego, California, United States (On-Site)
3 Weeks ago
Apple - Time Series and Web Analytics Data Scientist

Apple

Cupertino, California, United States (On-Site)
3 Weeks ago
Apple - Software Engineer, Payments

Apple

Cupertino, California, United States (On-Site)
5 Days ago
manticore games - Director / Senior Director of Marketing

manticore games

San Mateo, California, United States (Remote)
1 Month ago
Axon - Field Engineer, ALPR

Axon

Scottsdale, Arizona, United States (Remote)
2 Weeks ago
Next Level Business Services - Visual Analytics Architect

Next Level Business Services

Atlanta, Georgia, United States (On-Site)
9 Years ago
Iron Mountain - Non CDL Local Route Driver / Warehouse Associate

Iron Mountain

Greenville, South Carolina, United States (On-Site)
1 Month ago
Snlo studios - Financial Controller

Snlo studios

San Francisco, California, United States (Remote)
3 Weeks ago
broadcom - Senior Technical Support Engineer

broadcom

Lisle, Illinois, United States (On-Site)
6 Days ago
extreme network - Director, Marketing Partnerships

extreme network

Salem, New Hampshire, United States (Remote)
3 Months ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

DraftKings - Technical Business Analyst

DraftKings

Boston, Massachusetts, United States (On-Site)
1 Month ago
Keywords Studios - AI - Senior Research Associate (Prompts)

Keywords Studios

Silesian Voivodeship, Poland (On-Site)
2 Months ago
NVIDIA - Deep Learning Performance Architect

NVIDIA

Hyderabad, Telangana, India (Hybrid)
3 Months ago
NVIDIA - Applied Research Intern - 2025

NVIDIA

Yerevan, Yerevan, Armenia (On-Site)
3 Months ago
bytedance - Research Scientist Graduate (Foundation Model - Generative AI) - 2025 Start (PhD)

bytedance

San Jose, California, United States (On-Site)
5 Months ago
Social Discovery Group - Senior NLP Engineer

Social Discovery Group

Georgia (Remote)
7 Months ago
The Walt Disney Company - Senior Machine Learning Engineer - Ad Platforms

The Walt Disney Company

San Francisco, California, United States (On-Site)
3 Months ago
bytedance - Student Researcher (Doubao (Seed) - Foundation Model - Speech & Audio) - 2025 Start (PhD)

bytedance

San Jose, California, United States (On-Site)
7 Months ago
Inworld AI - Staff C++ Engineer

Inworld AI

Mountain View, California, United States (On-Site)
2 Months ago
Meta - Research Scientist Intern, Machine Perception for Input and Interaction (PhD)

Meta

Pittsburgh, Pennsylvania, United States (On-Site)
7 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

Santa Clara, California, United States (On-Site)

Massachusetts, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Texas, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (Hybrid)

Pune, Maharashtra, India (On-Site)

Taipei City, Taiwan (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug