Senior Software Engineer, TensorRT-LLM

3 Weeks ago • 2 Years + • Artificial Intelligence • $184,000 PA - $356,500 PA

Job Summary

Job Description

NVIDIA's TensorRT-LLM team seeks a Senior Software Engineer to develop robust, scalable inferencing software for multiple platforms. Responsibilities include performance analysis, optimization, and tuning; staying current with AI advancements and updating TensorRT; and collaborating with cross-functional teams. The ideal candidate possesses a Master's degree (or equivalent experience) in a relevant field, 2+ years of software development experience, excellent C/C++ skills, deep learning framework experience (TensorFlow, PyTorch), and strong communication skills. This role involves crafting high-performance AI inferencing software foundational to NVIDIA's product lines and the broader industry.
Must have:
  • Master's degree in relevant field
  • 2+ years software development experience
  • Excellent C/C++ programming skills
  • Deep learning framework experience (TensorFlow, PyTorch)
  • Strong communication skills
Perks:
  • Equity
  • Benefits

Job Details

We are now looking for a TensorRT-LLM Software Development Engineer!

NVIDIA is hiring software engineers for its TensorRT-LLM team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning-powered AI, enabling breakthroughs in areas like LLMs, ChatGPT, and generative AI that mark the “iPhone moment” for AI. Join the team that is building the inferencing software foundational to product lines within NVIDIA and across the industry! The ability to work on a fast-paced, delivery-focused team is required, and excellent interpersonal skills are a must.

What you'll be doing:

  • Craft and develop robust inferencing software that can be scaled to multiple platforms for functionality and performance

  • Performance analysis, optimization and tuning

  • Closely follow academic developments in the field of artificial intelligence and update TensorRT with new features

  • Collaborate across the company to guide the direction of machine learning inferencing, working with software, research and product teams

What we need to see:

  • Master's or higher degree in Computer Engineering, Computer Science, Applied Mathematics, or a related computing-focused field (or equivalent experience)

  • 2+ years of relevant software development experience

  • Excellent C/C++ programming and software design skills, including debugging, performance analysis, and test design

  • Strong curiosity about artificial intelligence and awareness of the latest developments in deep learning, such as LLMs and generative and recommender models

  • Experience working with deep learning frameworks like TensorFlow and PyTorch

  • Proactive and able to work without supervision

  • Excellent written and oral communication skills in English

NVIDIA is widely considered to be one of technology’s most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. Does the idea of contributing to and pushing the boundaries of state-of-the-art AI and Compute systems excite you? Interested in getting exposure to the entire DL SW stack? Come join us and help build the GPU-accelerated DL platform used worldwide.

#LI-Hybrid

The base salary range is 184,000 USD - 356,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.





About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.
