Senior Software Engineer, TensorRT-LLM

4 Weeks ago • 2 Years + • Artificial Intelligence • $184,000 PA - $356,500 PA

Job Summary

Job Description

NVIDIA's TensorRT-LLM team seeks a Senior Software Engineer to develop robust, scalable inferencing software for multiple platforms. Responsibilities include performance analysis, optimization, and tuning; staying current with AI advancements and updating TensorRT; and collaborating with cross-functional teams. The ideal candidate possesses a Master's degree (or equivalent experience) in a relevant field, 2+ years of software development experience, excellent C/C++ skills, deep learning framework experience (TensorFlow, PyTorch), and strong communication skills. This role involves crafting high-performance AI inferencing software foundational to NVIDIA's product lines and the broader industry.
Must have:
  • Master's degree in relevant field
  • 2+ years software development experience
  • Excellent C/C++ programming skills
  • Deep learning framework experience (TensorFlow, PyTorch)
  • Strong communication skills
Perks:
  • Equity
  • Benefits

Job Details

We are now looking for a TensorRT-LLM Software Development Engineer!

NVIDIA is hiring software engineers for its TensorRT-LLM team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning-powered AI, enabling breakthroughs in areas like LLM, ChatGPT and Generative AI that have put DL at the “iPhone moment” for AI. Join the team which is building the inferencing software which is foundational to product lines within NVIDIA and across the industry! The ability to work on a fast-paced delivery-focused team is required and excellent interpersonal skills are a must.

What you'll be doing:

  • Craft and develop robust inferencing software that can be scaled to multiple platforms for functionality and performance

  • Performance analysis, optimization and tuning

  • Closely follow academic developments in the field of artificial intelligence and feature update TensorRT

  • Collaborate across the company to guide the direction of machine learning inferencing, working with software, research and product teams

What we need to see:

  • Masters or higher degree in Computer Engineering, Computer Science, Applied Mathematics or related computing focused degree (or equivalent experience)

  • 2+ years of relevant software development experience.

  • Excellent C/C++ programming and software design skills, including debugging, performance analysis, and test design.

  • Strong curiosity about artificial intelligence, awareness of the latest developments in deep learning like LLMs, generative and recommender models

  • Experience working with deep learning frameworks like TensorFlow and PyTorch

  • Proactive and able to work without supervision

  • Excellent written and oral communication skills in English

NVIDIA is widely considered to be one of technology’s most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. Does the idea of contributing to and pushing the boundaries of state-of-the-art AI and Compute systems excite you? Interested in getting exposure to the entire DL SW stack? Come join us and help build the GPU-accelerated DL platform used worldwide.

#LI-Hybrid

The base salary range is 184,000 USD - 356,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

Canva - Senior Computer Vision Engineer - Photo AI

Canva

Vienna, Vienna, Austria (Remote)
4 Weeks ago
Zscaler - Staff Machine Learning Engineer

Zscaler

Bengaluru, Karnataka, India (Hybrid)
8 Hours ago
Microsoft - Senior Applied Scientist

Microsoft

Bengaluru, Karnataka, India (On-Site)
3 Days ago
Games talent (Staffing and recruiting) - Senior Data Engineer

Games talent (Staffing and recruiting)

(Remote)
22 Hours ago
bosh group india - Senior ML Engineer Lead - Time Series

bosh group india

Bengaluru, Karnataka, India (On-Site)
1 Month ago
NVIDIA - Solution Architect - Auto

NVIDIA

Shanghai, Shanghai, China (On-Site)
2 Months ago
Google - Staff Software Engineer, Embedded Systems

Google

Sunnyvale, California, United States (On-Site)
2 Weeks ago
Google - Software Engineer, AI/ML, Health and Safety Intelligence

Google

Mountain View, California, United States (On-Site)
2 Days ago
Krafton  - Head of Deep Learning PM & Ops Dept.

Krafton

Seoul, South Korea (On-Site)
1 Month ago
Google - Software Engineer III, Generative AI, Google Workspace

Google

Kirkland, Washington, United States (On-Site)
1 Week ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

NVIDIA - Senior Technical Marketing Engineer - AI Infrastructure

NVIDIA

Santa Clara, California, United States (On-Site)
3 Months ago
Amazon Games - Senior Software Developer, Amazon Games AI

Amazon Games

San Diego, California, United States (On-Site)
4 Months ago
NVIDIA - Director, AI Software

NVIDIA

Taipei City, Taiwan (On-Site)
3 Months ago
ByteDance - Video Analysis and Quality Algorithm Intern 2023 Summer/Fall (MS)

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
NVIDIA - Deep Learning Engineer, Datacenters

NVIDIA

Bengaluru, Karnataka, India (Hybrid)
1 Month ago
Corsair - Senior Manager, AI & Data

Corsair

Munich, Bavaria, Germany (On-Site)
1 Month ago
ByteDance - DevOps Engineer - Applied Machine Learning, Engine

ByteDance

San Jose, California, United States (On-Site)
2 Months ago
NVIDIA - Senior System Software Engineer - Triton Inference Server

NVIDIA

California, United States (Remote)
3 Months ago
Canva - Staff Machine Learning Engineer - User Voice

Canva

Sydney, New South Wales, Australia (Remote)
3 Weeks ago
Google - Field Solutions Architect, GenAI, Google Cloud

Google

Tokyo, Japan (On-Site)
2 Weeks ago

Get notifed when new similar jobs are uploaded

Jobs in California, United States

ByteDance - Senior Software Engineer, Unified Datastore

ByteDance

Seattle, Washington, United States (On-Site)
6 Months ago
Google - Data Center Technician III, Global Server Operations

Google

Reno, Nevada, United States (On-Site)
2 Weeks ago
Scout - Engineer, Coordination Integration Planning

Scout

Novi, Michigan, United States (On-Site)
1 Day ago
Trackman - Sales Representative - South Georgia & North Florida

Trackman

Atlanta, Georgia, United States (Hybrid)
1 Month ago
Visa - Director, Sales & Commercial Operations

Visa

Ashburn, Virginia, United States (Hybrid)
9 Hours ago
NVIDIA - Senior ASIC Verification Engineer

NVIDIA

Westford, Massachusetts, United States (On-Site)
1 Month ago
NVIDIA - Senior Physical Design Methodology Engineer, Innovus Flows

NVIDIA

Santa Clara, California, United States (On-Site)
2 Weeks ago
Google - Video Measurement Lead, Integrated Solutions

Google

Seattle, Washington, United States (On-Site)
1 Week ago
Jane Street - Tax Specialist

Jane Street

New York, New York, United States (On-Site)
6 Hours ago
ByteDance - Site Reliability Engineer, Edge Services

ByteDance

Boston, Massachusetts, United States (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Mashgin - Senior Software Engineer, Computer Vision and Deep Learning

Mashgin

Palo Alto, California, United States (Hybrid)
6 Months ago
NVIDIA - Deep Learning Performance Architect

NVIDIA

Pune, Maharashtra, India (Hybrid)
2 Months ago
Google - Customer Engineer, AI/ML, HCLS, Google Cloud

Google

Chicago, Illinois, United States (On-Site)
2 Weeks ago
Microsoft - Member of Technical Staff, High Performance Computing Engineer

Microsoft

Mountain View, California, United States (Hybrid)
2 Weeks ago
Google - Software Engineer, Runtime, AICore, Platforms and Devices

Google

Taipei City, Taiwan (On-Site)
2 Days ago
Google - Software Engineer III, Education Scaled Deployments

Google

Mexico City, Mexico City, Mexico (On-Site)
2 Days ago
AI Fund - Head of Engineering

AI Fund

United States (Remote)
3 Weeks ago
Interface AI - Technical Customer Success Manager

Interface AI

United States (Remote)
2 Months ago
Inworld AI - AI Trainer (Contractor) - Writing & Gaming

Inworld AI

Mountain View, California, United States (Remote)
1 Month ago
Google - Photonic Engineer, Machine Learning Systems, Platforms

Google

Sunnyvale, California, United States (On-Site)
2 Weeks ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Massachusetts, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Texas, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug