AI Computing Software Development Engineer, TensorRT

1 Month ago • 2 Years + • Artificial Intelligence

Job Summary

Job Description

NVIDIA's AI Computing team seeks a TensorRT Software Development Engineer to craft and develop robust inferencing software scalable across multiple platforms. Responsibilities include performance analysis, optimization, and tuning; tracking academic AI advancements and updating TensorRT; providing feedback on architecture and hardware; collaborating across teams to guide machine learning inferencing; and publishing results at scientific conferences. The ideal candidate possesses a Master's degree (or equivalent) in a related field, 2+ years of software development experience, and excellent C/C++ skills. Strong AI curiosity and experience with deep learning frameworks (TensorFlow, PyTorch) are essential.
Must have:
  • Master's degree in relevant field
  • 2+ years software development experience
  • Excellent C/C++ programming skills
  • Deep learning framework experience (TensorFlow, PyTorch)
  • Performance analysis and optimization
  • Strong AI knowledge

Job Details

We are now looking for a TensorRT Software Development Engineer!

NVIDIA is hiring software engineers for its AI Computing team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning-powered AI, enabling breakthroughs in areas like LLM, ChatGPT and GenerativeAI that has put DL at the “iPhone moment” for AI. Join the team which is building the inferencing software which will be used across our product lines! The ability to work on a fast-paced delivery-focused team is required and excellent interpersonal skills are a must.

What you'll be doing:

  • Craft and develop robust inferencing software that can be scaled to multiple platforms for functionality and performance

  • Performance analysis, optimization and tuning

  • Closely follow academic developments in the field of artificial intelligence and feature update TensorRT

  • Provide feedback into the architecture and hardware design and development

  • Collaborate across the company to guide the direction of machine learning inferencing, working with software, research and product teams

  • Publish key results in scientific conferences

What we need to see:

  • Masters or higher degree in Computer Engineering, Computer Science, Applied Mathematics or related computing focused degree (or equivalent experience)

  • 2+ years of relevant software development experience.

  • Excellent C/C++ programming and software design skills, including debugging, performance analysis, and test design.

  • Strong curiosity about artificial intelligence, awareness of the latest developments in deep learning like LLMs, generative and recommender models

  • Experience working with deep learning frameworks like TensorFlow and PyTorch

  • Proactive and able to work without supervision

  • Excellent written and oral communication skills in English

NVIDIA is widely considered to be one of technology’s most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. Does the idea of contributing to and pushing the boundaries of state-of-the-art AI and Compute systems excite you? Interested in getting exposure to the entire DL SW stack? Come join us and help build the GPU-accelerated DL platform used worldwide.

#LI-Hybrid

#deeplearning

Similar Jobs

NVIDIA - AI Computing Architect Intern - 2025

NVIDIA

Shanghai, Shanghai, China (On-Site)
1 Month ago
ByteDance - Software Engineer Intern (Doubao (Seed) - Machine Learning System) - 2025 Summer (PhD)

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
Second Dinner - Director of Data

Second Dinner

United States (Remote)
1 Month ago
ThreeV Technologies,  Inc  - Data Scientist Computer Vision

ThreeV Technologies, Inc

Bengaluru, Karnataka, India (Remote)
4 Months ago
Dolby Laboratories - AIOps Research Scientist

Dolby Laboratories

Bengaluru, Karnataka, India (Hybrid)
4 Months ago
Meta - AI Research Scientist, Language - Generative AI

Meta

New York, New York, United States (On-Site)
3 Months ago
Fun Dog Studios - Artificial Intelligence Engineer

Fun Dog Studios

United States (Remote)
5 Months ago
Microsoft - Research Intern - Interactive Entertainment with Generative AI

Microsoft

Redmond, Washington, United States (On-Site)
1 Month ago
NVIDIA - Deep Learning Performance Architect

NVIDIA

Beijing, Beijing, China (On-Site)
1 Month ago
Microsoft - Research Intern - AI & Productivity

Microsoft

Redmond, Washington, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

WebFX - Full Stack JavaScript Developer (Remote PH)

WebFX

Philippines (Remote)
3 Months ago
ByteDance - Research Engineer Graduate (Machine Learning Sys-US) - 2024 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
LeoVegas - Data Scientist - Sportsbook

LeoVegas

Stockholm, Stockholm County, Sweden (Hybrid)
2 Months ago
Rackspace Technology - MLOps Engineer (AWS / Azure / GCP)

Rackspace Technology

Bengaluru, Karnataka, India (Hybrid)
2 Months ago
Genies - Machine Learning Engineer: 3D Generative AI

Genies

San Mateo, California, United States (Remote)
3 Months ago
Axinous - Sr. Staff ML Engineer

Axinous

San Jose, California, United States (Hybrid)
1 Month ago
ByteDance - Backend Engineer (Model Inference), Machine Learning Systems

ByteDance

Singapore (On-Site)
3 Months ago
ByteDance - Software Engineer Intern (Doubao (Seed) - Machine Learning System) - 2025 Summer (MS)

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
Rackspace Technology - Senior MLOPs Engineer

Rackspace Technology

United States (Remote)
4 Months ago
NVIDIA - Senior Software Engineer - Triton Tools

NVIDIA

California, United States (Remote)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Shanghai, Shanghai, China

Mattel  Inc  - Equipment & Facility Engineer

Mattel Inc

Dongguan, Guangdong Province, China (On-Site)
2 Months ago
Riot Games - Esports Product Manager

Riot Games

Shanghai, Shanghai, China (On-Site)
2 Months ago
NVIDIA - Principal Autonomous Vehicles Engineer - Mapping and Localization

NVIDIA

Shanghai, Shanghai, China (On-Site)
1 Month ago
NVIDIA - Senior Custom SOC IP Verification Engineer

NVIDIA

Shanghai, Shanghai, China (On-Site)
1 Month ago
Tencent - Senior Combat Designer

Tencent

Shanghai, Shanghai, China (On-Site)
1 Month ago
Mattel  Inc  - Accounting Administrator

Mattel Inc

Foshan, Guangdong Province, China (On-Site)
2 Months ago
Virtuos - Senior / Lead Software Engineer

Virtuos

China (On-Site)
1 Month ago
Zengame Technology - Advertising Optimization Specialist

Zengame Technology

Shenzhen, Guangdong Province, China (On-Site)
1 Month ago
Every matrix - Middle Front-end Developer

Every matrix

Changsha, Hunan, China (On-Site)
2 Weeks ago
NVIDIA - GPU Kernel Software Engineering Intern - 2025

NVIDIA

Shanghai, Shanghai, China (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Google - Student Researcher, BS/MS, Winter/Summer 2025

Google

Ann Arbor, Michigan, United States (On-Site)
3 Months ago
Meta - Research Scientist, Computer Vision for Generative AI (PhD)

Meta

New York, New York, United States (On-Site)
3 Months ago
Microsoft - Research Intern - AI Frontiers - Foundation Model Evaluation and Understanding

Microsoft

Redmond, Washington, United States (On-Site)
1 Month ago
Pika - Research Engineer (Foundation Model) New Grad

Pika

Palo Alto, California, United States (On-Site)
2 Months ago
Microsoft - Senior Researcher: Machine Learning – Microsoft Research AI for Science

Microsoft

Cambridge, England, United Kingdom (On-Site)
1 Month ago
AI Fund - General Manager - New Business Unit (College Admissions)

AI Fund

California, United States (Remote)
4 Months ago
Modulate - Senior Machine Learning Engineer

Modulate

Somerville, Massachusetts, United States (Hybrid)
1 Month ago
AjnaLens - Senior Computer Vision Engineer

AjnaLens

Mumbai, Maharashtra, India (On-Site)
5 Months ago
Kenvue - Generative AI TPO

Kenvue

Bengaluru, Karnataka, India (On-Site)
4 Months ago
Google - Software Engineer III, Core Machine Learning, Google Cloud

Google

Sunnyvale, California, United States (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.


Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Shenzhen, Guangdong Province, China (On-Site)

Bengaluru, Karnataka, India (On-Site)

Taipei City, Taiwan (On-Site)

Taipei City, Taiwan (On-Site)

Shanghai, Shanghai, China (On-Site)

Shanghai, Shanghai, China (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug