Senior Software Engineer, Deep Learning Inference

1 Week ago • 5 Years + • Research & Development

Job Summary

Job Description

NVIDIA seeks a Senior Software Engineer passionate about performance optimization and generative AI. Responsibilities include collaborating with research teams to onboard LLMs and VLMs into NVIDIA's open-source AI runtimes, optimizing inference workloads, building robust inference software systems, implementing low-level GPU code, and owning end-to-end inference acceleration features. This role involves working with diverse teams to deliver production-grade products, requiring strong software design principles, proficiency in system and scripting languages, and a deep understanding of machine learning concepts.
Must have:
  • 5+ years software engineering experience
  • Profound knowledge of software design principles
  • Proficiency in system & scripting languages
  • Strong machine learning concepts
  • Excellent communication & teamwork skills
Good to have:
  • Familiarity with NVIDIA's DL software stack (Triton, TensorRT-LLM, Model Optimizer)
  • Experience with performance modeling, profiling, debugging on NVIDIA accelerators
  • Knowledge of LLM quantization, fine-tuning, and caching algorithms
  • GPU kernel programming (CUDA or OpenCL)
  • Experience on large software projects (50+ contributors)

Job Details

NVIDIA has been at the forefront of the deep learning revolution, pioneering innovations that have transformed the entire field. As the leading provider of GPUs and AI computing platforms, NVIDIA has empowered researchers and engineers worldwide to accelerate breakthroughs in artificial intelligence.

We seek a versatile Senior Software Engineer who is passionate about performance optimization and generative AI. Our team builds software solutions that enable efficient inference on the latest and greatest generative AI models. We tackle problems on all levels of the stack—from server-level request batching to GPU kernel fusion—and collaborate with teams across diverse disciplines to push Nvidia's hardware to its full potential.

What you’ll be doing:

  • Cooperate with research teams to onboard new LLMs and VLMs into Nvidia's opensource AI runtimes

  • Optimize inference workloads using sophisticated profiling and simulation tools

  • Build SOLID, extendable inference software systems, and refine robust APIs

  • Implement and debug low-level GPU code to harness the latest HW features

  • Own end-to-end inference acceleration features and work with teams around the world to deliver production-grade products

What we need to see:

  • B.Sc., M.Sc. or equivalent experience in Computer Science or Computer Engineering

  • 5+ years of relevant hands-on software engineering experience

  • Profound knowledge of software design principles

  • Strong proficiency in at least one system and one scripting language

  • Strong grasp of machine learning concepts

  • People person with excellent communication skills that enjoys collaboration and teamwork.

Ways to stand out from the crowd:

  • Familiarity with Nvidia's DL software stack, e.g. Triton Inference Server, TensorRT-LLM, and Model Optimizer

  • Proven track record of performance modeling, profiling, debugging, and development in a performance-critical setting with Nvidia's accelerators.

  • Familiarity with LLM quantization, fine-tunning, and caching algorithms

  • Proficiency in GPU kernel programming (CUDA or OpenCL)

  • Prior experience working on a large software project with 50+ contributors

NVIDIA is widely considered one of the world’s most desirable employers in the technology field. We have some of the most forward-thinking and hardworking people working for us. If you're creative and autonomous, we want to hear from you! We are committed to fostering a diverse work environment and are proud to be an equal-opportunity employer. We highly value diversity in our current and future employees. We do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.

Similar Jobs

Fictiv - Document Controller

Fictiv

Bengaluru, Karnataka, India (On-Site)
8 Hours ago
brightmachines - Principal Software Engineer - Omniverse

brightmachines

San Francisco, California, United States (Hybrid)
4 Months ago
Google - Staff Software Engineer, Machine Learning, Computer Vision, Silicon

Google

Mountain View, California, United States (On-Site)
2 Weeks ago
NVIDIA - Senior AI-HPC Storage Engineer

NVIDIA

Austin, Texas, United States (On-Site)
2 Months ago
Google - Software Engineer III, Full Stack, Learning and Education

Google

Mexico City, Mexico City, Mexico (On-Site)
2 Weeks ago
Google - Senior Software Engineer, CPU Performance Modeling Engineer

Google

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
2 Weeks ago
Google - Software Engineer, People with Disabilities

Google

State Of Minas Gerais, Brazil (On-Site)
4 Months ago
Google - Senior Software Engineering Manager, Wear OS Platform

Google

Mountain View, California, United States (On-Site)
2 Weeks ago
Riot Games - Principal Software Engineer (ML Focused) - League Studio, League Data Central

Riot Games

Los Angeles, California, United States (On-Site)
6 Months ago
Microsoft - Silicon Engineer

Microsoft

Bengaluru, Karnataka, India (On-Site)
1 Week ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Google - Product Manager, Content Discovery

Google

Bengaluru, Karnataka, India (On-Site)
2 Days ago
ByteDance - Research Scientist Graduate (Foundation Model, Video Generation) - 2025 Start (PhD)

ByteDance

Seattle, Washington, United States (On-Site)
6 Months ago
Powerintegration - Senior Product Marketing Manager

Powerintegration

San Jose, California, United States (On-Site)
4 Weeks ago
version 1 - Senior Python Developer

version 1

Málaga, Andalusia, Spain (On-Site)
1 Month ago
OKX - Graduate Hire 2024/25 - Blockchain Engineer

OKX

Hong Kong (On-Site)
6 Months ago
Tekion Corp - Senior Applied Scientist

Tekion Corp

Bengaluru, Karnataka, India (On-Site)
1 Day ago
Microsoft - Principal Software Engineer

Microsoft

Belgrade, Serbia (On-Site)
1 Week ago
GoMotive - Embedded Engineer Telematics

GoMotive

(Remote)
1 Day ago
Google - Site Reliability Engineer, Home and Assistant, Infrastructure

Google

Bengaluru, Karnataka, India (On-Site)
1 Week ago
Google - Software Engineer, PhD, Early Career, Campus, 2025 Start

Google

Mountain View, California, United States (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Ramat Gan, Tel Aviv District, Israel

Playtika - Data Infrastructure Group Manager

Playtika

Israel (On-Site)
2 Weeks ago
Google - SoC and IP Design Engineer

Google

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
1 Week ago
NVIDIA - Senior Physical Design Backend Engineer

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (Hybrid)
2 Months ago
Google - Customer Solutions Engineer

Google

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
2 Weeks ago
Google - Software Engineer III, Onboarding and Discovery, Core

Google

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
2 Days ago
Tesla - Service Advisor

Tesla

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
2 Months ago
NVIDIA - STA Engineer

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
2 Months ago
SciPlay - Data Analyst - Maternity Leave Replacement

SciPlay

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
3 Months ago
Playtika - User Acquisition Lead

Playtika

Israel (On-Site)
6 Months ago
Google - Research Scientist, Reinforcement Learning

Google

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
1 Week ago

Get notifed when new similar jobs are uploaded

Research & Development Jobs

Google - Software Engineering Manager, Android Accessibility

Google

New Taipei, New Taipei City, Taiwan (On-Site)
2 Days ago
Meta - Software Engineer, Machine Learning

Meta

Redmond, Washington, United States (On-Site)
5 Months ago
NVIDIA - Senior Timing Methodology Engineer

NVIDIA

Santa Clara, California, United States (On-Site)
3 Months ago
Luxoft - Regular C++ Software Developer

Luxoft

Chennai, Tamil Nadu, India (On-Site)
5 Months ago
Google - Staff Software Engineer, Google Cloud

Google

(On-Site)
5 Months ago
W Beyond   - Embedded C

W Beyond

Bengaluru, Karnataka, India (On-Site)
7 Months ago
Tesla - Bachelor/Master Thesis Research and Development, Mechanical Engineering

Tesla

Prüm, Rhineland-Palatinate, Germany (On-Site)
2 Months ago
NVIDIA - Senior Photonic Layout Design Engineer

NVIDIA

Santa Clara, California, United States (On-Site)
3 Months ago
Avathon - Software Engineering Manager

Avathon

Bengaluru, Karnataka, India (On-Site)
5 Months ago
Meta - Research Scientist Intern, Photorealistic Telepresence (PhD)

Meta

Redmond, Washington, United States (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Massachusetts, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Texas, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug