Senior Software Engineer, TensorRT-LLM

3 Months ago • 2 Years + • $184,000 PA - $356,500 PA

Job Summary

Job Description

NVIDIA's TensorRT-LLM team seeks a Senior Software Engineer to develop robust, scalable inferencing software for multiple platforms. Responsibilities include performance analysis, optimization, and tuning; staying current with AI advancements and updating TensorRT; and collaborating with cross-functional teams. The ideal candidate possesses a Master's degree (or equivalent experience) in a relevant field, 2+ years of software development experience, excellent C/C++ skills, deep learning framework experience (TensorFlow, PyTorch), and strong communication skills. This role involves crafting high-performance AI inferencing software foundational to NVIDIA's product lines and the broader industry.
Must have:
  • Master's degree in relevant field
  • 2+ years software development experience
  • Excellent C/C++ programming skills
  • Deep learning framework experience (TensorFlow, PyTorch)
  • Strong communication skills
Perks:
  • Equity
  • Benefits

Job Details

We are now looking for a TensorRT-LLM Software Development Engineer!

NVIDIA is hiring software engineers for its TensorRT-LLM team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning-powered AI, enabling breakthroughs in areas like LLM, ChatGPT and Generative AI that have put DL at the “iPhone moment” for AI. Join the team which is building the inferencing software which is foundational to product lines within NVIDIA and across the industry! The ability to work on a fast-paced delivery-focused team is required and excellent interpersonal skills are a must.

What you'll be doing:

  • Craft and develop robust inferencing software that can be scaled to multiple platforms for functionality and performance

  • Performance analysis, optimization and tuning

  • Closely follow academic developments in the field of artificial intelligence and feature update TensorRT

  • Collaborate across the company to guide the direction of machine learning inferencing, working with software, research and product teams

What we need to see:

  • Masters or higher degree in Computer Engineering, Computer Science, Applied Mathematics or related computing focused degree (or equivalent experience)

  • 2+ years of relevant software development experience.

  • Excellent C/C++ programming and software design skills, including debugging, performance analysis, and test design.

  • Strong curiosity about artificial intelligence, awareness of the latest developments in deep learning like LLMs, generative and recommender models

  • Experience working with deep learning frameworks like TensorFlow and PyTorch

  • Proactive and able to work without supervision

  • Excellent written and oral communication skills in English

NVIDIA is widely considered to be one of technology’s most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. Does the idea of contributing to and pushing the boundaries of state-of-the-art AI and Compute systems excite you? Interested in getting exposure to the entire DL SW stack? Come join us and help build the GPU-accelerated DL platform used worldwide.

#LI-Hybrid

The base salary range is 184,000 USD - 356,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

Meta - Research Scientist Intern, Smart Glasses in Wearables AI (PhD)

Meta

Washington, District Of Columbia, United States (On-Site)
8 Months ago
Kwalee - Machine Learning Engineer

Kwalee

Royal Leamington Spa, England, United Kingdom (On-Site)
3 Months ago
Meta - Research Scientist Intern, Machine Perception for Input and Interaction (PhD)

Meta

Redmond, Washington, United States (On-Site)
8 Months ago
Scale AI - Machine Learning Engineer

Scale AI

San Francisco, California, United States (On-Site)
1 Month ago
Qualcomm - Engineer- Python Automation Machine Learning

Qualcomm

Hyderabad, Telangana, India (On-Site)
1 Month ago
Rackspace Technology - Principal MLOps Engineer

Rackspace Technology

Toronto, Ontario, Canada (Remote)
3 Months ago
bytedance - Senior Research Scientist, Foundation Model, Speech Understanding

bytedance

Seattle, Washington, United States (On-Site)
8 Months ago
Google - Customer Engineer II, Cloud AI, Google Cloud

Google

San Francisco, California, United States (On-Site)
2 Months ago
Canva - Senior Computer Vision Engineer - Photo AI

Canva

London, England, United Kingdom (Remote)
5 Months ago
Hedra - Machine Learning Engineer (CUDA)

Hedra

San Francisco, California, United States (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

SingleStore - AI Security Engineer

SingleStore

Hyderabad, Telangana, India (Remote)
1 Month ago
Volksbyte - Senior Software Engineer – AR/VR, AI/ML & Full-Stack

Volksbyte

Dhaka, Dhaka Division, Bangladesh (Remote)
3 Months ago
Scale AI - Machine Learning Engineer, Enterprise GenAI

Scale AI

San Francisco, California, United States (On-Site)
2 Months ago
bytedance - Machine Learning Engineer, Tech Lead - Code AI

bytedance

San Jose, California, United States (On-Site)
3 Months ago
bytedance - DevOps Engineer, Applied Machine Learning Engine - 2025 Start

bytedance

Singapore (On-Site)
8 Months ago
Dentsu Aegis - Data Scientist

Dentsu Aegis

Pune, Maharashtra, India (On-Site)
1 Month ago
Rocket studio - AI (Intern)

Rocket studio

Hanoi, Hanoi, Vietnam (On-Site)
2 Months ago
Tencent - Senior Researcher: Artificial General Intelligence (Natural Language Processing)

Tencent

Washington, United States (On-Site)
4 Months ago
Google - Technical Program Manager III, AI/ML, Cloud AI Systems

Google

Austin, Texas, United States (On-Site)
2 Months ago
Qualcomm - AI SDK Software Engineer

Qualcomm

Chengdu, Sichuan, China (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in California, United States

The Walt Disney Company - Sr Sales Manager - Resorts

The Walt Disney Company

Celebration, Florida, United States (Hybrid)
6 Months ago
ManyChat - Director of Partnerships

ManyChat

Austin, Texas, United States (Hybrid)
1 Month ago
Riot Games - Manager, Software Engineering - Teamfight Tactics, Gameplay

Riot Games

Los Angeles, California, United States (On-Site)
2 Months ago
The Walt Disney Company - Principal Data Engineer, Architect

The Walt Disney Company

Seattle, Washington, United States (On-Site)
2 Months ago
Gamurs - Fantasy Sports Writer

Gamurs

United States (Remote)
2 Months ago
Penumbrainc - Procurement Process Excellence Principal

Penumbrainc

Alameda, California, United States (On-Site)
1 Month ago
Apple - Software Engineer (Accessibility Engineer)

Apple

Sunnyvale, California, United States (On-Site)
1 Month ago
Next Level Business Services - IBM Content Manager

Next Level Business Services

Columbus, Ohio, United States (On-Site)
8 Months ago
Take-Two Interactive - Sr Director, Global Talent Operations

Take-Two Interactive

New York, United States (On-Site)
1 Month ago
Ethos Life - Sales Enablement Associate

Ethos Life

United States (Remote)
2 Months ago

Get notifed when new similar jobs are uploaded

Similar Category Jobs

bytedance - Software Engineer Intern (Doubao (Seed) - Machine Learning System) - 2025 Summer (PhD)

bytedance

Seattle, Washington, United States (On-Site)
8 Months ago
Kokotree - Artificial Intelligence Developers

Kokotree

Wilmington, North Carolina, United States (On-Site)
7 Months ago
bytedance - Research Scientist in Foundation Model, Music Core Machine Learning Graduates - 2024 Start (PhD)

bytedance

San Jose, California, United States (On-Site)
8 Months ago
NVIDIA - Senior Application Software Engineer, Performance

NVIDIA

Shanghai, Shanghai, China (On-Site)
3 Months ago
NVIDIA - AI Algorithms Software Engineer (RDSS Intern)

NVIDIA

Hsinchu, Hsinchu City, Taiwan (On-Site)
5 Months ago
Keywords Studios - AI Prompt & Language Specialist (Italian)

Keywords Studios

Quebec, Canada (Remote)
3 Months ago
bytedance - Student Researcher (Doubao (Seed) - Foundation Model - MultiModal Generative Model)

bytedance

San Jose, California, United States (On-Site)
2 Months ago
Tencent - Large Language Model Algorithm Engineer

Tencent

California, United States (On-Site)
3 Months ago
ClinDCast - GenAI Application Lead

ClinDCast

Austin, Texas, United States (Remote)
11 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

Santa Clara, California, United States (On-Site)

Massachusetts, United States (On-Site)

Santa Clara, California, United States (On-Site)

Texas, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (Hybrid)

Pune, Maharashtra, India (On-Site)

Taipei City, Taiwan (On-Site)

Beijing, Beijing, China (On-Site)

Santa Clara, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug