Senior Software Engineer, Deep Learning Inference

2 Days ago • 5 Years + • Research & Development

Job Summary

Job Description

NVIDIA seeks a Senior Software Engineer passionate about performance optimization and generative AI. Responsibilities include collaborating with research teams to onboard LLMs and VLMs into NVIDIA's open-source AI runtimes, optimizing inference workloads, building robust inference software systems, implementing low-level GPU code, and owning end-to-end inference acceleration features. This role involves working with diverse teams to deliver production-grade products, requiring strong software design principles, proficiency in system and scripting languages, and a deep understanding of machine learning concepts.
Must have:
  • 5+ years software engineering experience
  • Profound knowledge of software design principles
  • Proficiency in system & scripting languages
  • Strong machine learning concepts
  • Excellent communication & teamwork skills
Good to have:
  • Familiarity with NVIDIA's DL software stack (Triton, TensorRT-LLM, Model Optimizer)
  • Experience with performance modeling, profiling, debugging on NVIDIA accelerators
  • Knowledge of LLM quantization, fine-tuning, and caching algorithms
  • GPU kernel programming (CUDA or OpenCL)
  • Experience on large software projects (50+ contributors)

Job Details

NVIDIA has been at the forefront of the deep learning revolution, pioneering innovations that have transformed the entire field. As the leading provider of GPUs and AI computing platforms, NVIDIA has empowered researchers and engineers worldwide to accelerate breakthroughs in artificial intelligence.

We seek a versatile Senior Software Engineer who is passionate about performance optimization and generative AI. Our team builds software solutions that enable efficient inference on the latest and greatest generative AI models. We tackle problems on all levels of the stack, from server-level request batching to GPU kernel fusion, and collaborate with teams across diverse disciplines to push NVIDIA's hardware to its full potential.

What you’ll be doing:

  • Cooperate with research teams to onboard new LLMs and VLMs into NVIDIA's open-source AI runtimes

  • Optimize inference workloads using sophisticated profiling and simulation tools

  • Build solid, extensible inference software systems and refine robust APIs

  • Implement and debug low-level GPU code to harness the latest hardware features

  • Own end-to-end inference acceleration features and work with teams around the world to deliver production-grade products

What we need to see:

  • B.Sc., M.Sc. or equivalent experience in Computer Science or Computer Engineering

  • 5+ years of relevant hands-on software engineering experience

  • Profound knowledge of software design principles

  • Strong proficiency in at least one system and one scripting language

  • Strong grasp of machine learning concepts

  • People person with excellent communication skills who enjoys collaboration and teamwork

Ways to stand out from the crowd:

  • Familiarity with NVIDIA's DL software stack, e.g. Triton Inference Server, TensorRT-LLM, and Model Optimizer

  • Proven track record of performance modeling, profiling, debugging, and development in a performance-critical setting with NVIDIA's accelerators

  • Familiarity with LLM quantization, fine-tuning, and caching algorithms

  • Proficiency in GPU kernel programming (CUDA or OpenCL)

  • Prior experience working on a large software project with 50+ contributors

NVIDIA is widely considered one of the world’s most desirable employers in the technology field. We have some of the most forward-thinking and hardworking people working for us. If you're creative and autonomous, we want to hear from you! We are committed to fostering a diverse work environment and are proud to be an equal-opportunity employer. We highly value diversity in our current and future employees. We do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.
