Senior System Software Engineer - Dynamo and Triton Inference Server

1 Month ago • 6 Years + • Artificial Intelligence • $184,000 PA - $356,500 PA

Job Summary

Job Description

NVIDIA seeks a Senior System Software Engineer to contribute to the Dynamo and Triton Inference Server. This role involves developing open-source software for AI model inference on GPUs, focusing on building robust, scalable, and high-performance components. Responsibilities include load balancing asynchronous requests, optimizing prediction throughput, integrating open-source technology, and collaborating with team leads on feature prioritization. The ideal candidate will possess strong Rust/Python/C++ skills, experience with distributed systems and ML systems, and familiarity with AI research and efficient implementation strategies.
Must have:
  • 6+ years experience in CS/related field
  • Excellent Rust/Python/C++ skills
  • Experience with high-scale distributed systems
  • Experience with ML systems
  • Performance optimization skills
Good to have:
  • Experience improving AI inference system performance
  • Deep learning algorithm and framework knowledge (PyTorch, TensorFlow, etc.)
  • Experience with Large Language Models
  • Cloud service deployment experience (HTTP REST, gRPC, etc.)
  • Experience with Docker and Kubernetes
Perks:
  • Equity
  • Benefits

Job Details

We are now looking for a Senior System Software Engineer to work on Dynamo & Triton Inference Server! NVIDIA is hiring software engineers for its GPU-accelerated deep learning software team. Academic and commercial groups around the world are using GPUs to power a revolution in AI, enabling breakthroughs in problems from image classification to speech recognition to natural language processing. We are a fast-paced team building back-end services and software to make design and deployment of new AI models easier and accessible to all users.

What you'll be doing:

In this role, you will develop open-source software to serve inference of trained AI models running on GPUs. You will balance a variety of objectives: build robust, scalable, high-performance software components to support our distributed inference workloads; work with team leads to prioritize features and capabilities; load-balance asynchronous requests across available resources; optimize prediction throughput under latency constraints; and integrate the latest open-source technology.

What we need to see:

  • Master's or PhD, or equivalent experience

  • 6+ years in Computer Science, Computer Engineering, or related field

  • Ability to work in a fast-paced, agile team environment

  • Excellent Rust/Python/C++ programming and software design skills, including debugging, performance analysis, and test design.

  • Experience with high-scale distributed systems and ML systems

Ways to stand out from the crowd:

  • Prior work experience improving performance of AI inference systems.

  • Background with deep learning algorithms and frameworks, especially experience with Large Language Models and frameworks such as PyTorch, TensorFlow, TensorRT, and ONNX Runtime.

  • Experience building and deploying cloud services using HTTP REST, gRPC, protobuf, JSON and related technologies.

  • Experience with container technologies such as Docker and container orchestrators such as Kubernetes.

  • Familiarity with the latest AI research and working knowledge of how these systems are efficiently implemented.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most expert and passionate people in the world working for us. Are you creative and autonomous? Do you love a challenge? If so, we want to hear from you. Come help us build the real-time, efficient computing platform driving our success in the multifaceted and quickly growing field of Deep Learning and Artificial Intelligence.

#LI-Hybrid

The base salary range is 184,000 USD - 356,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.
