Jobs Courses Resources Companies Placements

Home >

Jobs >

Senior System Software Engineer - MLOps

NVIDIA

California, United States (Hybrid)

Senior System Software Engineer - MLOps

4 Months ago • 3 Years + • Devops • $148,000 PA - $287,500 PA

Job Summary

Job Description

NVIDIA seeks a Senior System Software Engineer to contribute to the Triton Inference Server. Responsibilities include building infrastructure solutions, defining CI/CD processes, ensuring cross-platform compatibility, collaborating with cross-functional teams, and optimizing the deployment pipeline. The ideal candidate will possess strong software design skills (Bash, Python, CI/CD), experience with cloud platforms (GitHub), distributed systems programming, and a deep understanding of deep learning frameworks. Experience with Docker and Kubernetes is a plus.

Must have:

3+ years experience in relevant field
Excellent Bash, Python, CI/CD skills
Experience with GitHub and cloud platforms
Knowledge of distributed systems
Software design and debugging skills

Good to have:

Experience with Docker and Kubernetes
Experience designing/architecting systems
Contributions to open-source deep learning community
Experience with infrastructure as code

Perks:

Equity
Benefits

15 skills required

15 skills required for this role

Add these skills to join the top 1% applicants for this job

deep-learning

problem-solving

github

agile-development

bash

ci-cd

kubernetes

containers

image-classification

python

docker

design-patterns

bug-tracking

quality-control

cross-functional

Job Details

We are now looking for a Senior System Software Engineer to work on Triton Inference Server! NVIDIA is hiring software engineers for its GPU-accelerated deep learning software team. Academic and commercial groups around the world are using GPUs to power a revolution in deep learning, enabling breakthroughs in problems from image classification to speech recognition to natural language processing. We are a fast-paced team building tools and software to make design and deployment of new deep learning models easier and accessible to more data scientists.

What you'll be doing:

In this role, you will build infrastructure solutions from first principles needed to deliver Triton Inference Server. You will apply software design skills to define the processes and best practices for performing continuous integration, testing, and releasing builds, while ensuring the cross-platform compatibility of Triton Inference Server across a wide range of operating systems and architecture systems. Using your expertise, you will influence how we design our customer facing technology and tools to enable an optimized pipeline for building and deploying our product. Extensive collaboration with cross-functional teams to integrate pipelines from deep learning frameworks and components is essential to ensuring seamless deployment and inference of deep learning models across Triton Inference Server.

What we need to see:

Masters degree or equivalent experience
3+ years of experience in Computer Science, computer architecture, or related field
Ability to work in a fast-paced, agile team environment
Excellent Bash, CI/CD, Python programming and software design skills, including debugging, performance analysis, and test design.
Experience in administering, monitoring, and deploying systems and services on GitHub and cloud platforms. Support other technical teams in monitoring operating efficiencies of the platform, and responding as needs arise.
Knowledge of distributed systems programming.

Ways to stand out from the crowd:

Experience designing or architecting (design patterns, reliability and scaling) of new and existing systems experience.
Experience driving efficiencies in software architecture, creating metrics, implementing infrastructure as code and other automation improvements.
Background deploying cloud-native services using modern technologies such as Docker, and Kubernetes, optimizing software for scalable and efficient deployment in cloud environments.
Experience contributing to a large open-source deep learning community - use of GitHub, bug tracking, branching and merging code, OSS licensing issues handling patches, etc.
Excellent problem solving abilities spanning multiple software (storage systems, kernels and containers) as well as collaborating within an agile team environment to prioritize deep learning-specific features and capabilities within Triton Inference Server, employing advanced troubleshooting and debugging techniques to resolve complex technical issues.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most experienced and hard-working people in the world working for us. Are you creative and autonomous? Do you love a challenge? If so, we want to hear from you. Come help us build the real-time, efficient computing platform driving our success in the dynamic and quickly growing field Deep Learning and Artificial Intelligence.

#LI-Hybrid

The base salary range is 148,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

Manager - Accounts Receivable

AppZen

Pune, Maharashtra, India (Hybrid)

• 4 Months ago

R&D Engineer II

Ansys

Hsinchu County, Taiwan (On-Site)

• 3 Months ago

Quantitative Researcher

Jane Street

London, England, United Kingdom (On-Site)

• 3 Months ago

Senior Site Reliability Engineer, ML System

ByteDance

San Jose, California, United States (On-Site)

• 9 Months ago

Large Language Model Algorithm Engineer - Volcano Ark

ByteDance

Singapore (On-Site)

• 9 Months ago

DevOps Engineer

Playtech

Manchester, England, United Kingdom (On-Site)

• 3 Months ago

Lead Software Engineer

The Walt Disney Company

Burbank, California, United States (On-Site)

• 7 Months ago

Manager, Database Reliability Engineering

The Walt Disney Company

Washington, United States (On-Site)

• 4 Months ago

Senior Systems Engineer

The Walt Disney Company

New York, New York, United States (On-Site)

• 3 Months ago

Executive - Cloud Engineer

Malabar Gold & Diamonds

Sri Vijaya Puram, Andaman And Nicobar Islands, India (On-Site)

• 1 Year ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Senior Data Engineer

Armada

Thiruvananthapuram, Kerala, India (On-Site)

• 10 Months ago

Data Scientist

PhonePe

Bengaluru, Karnataka, India (On-Site)

• 3 Months ago

Data Science Internship/Workstudent

Altagram Group

Germany (On-Site)

• 4 Months ago

Backend Engineer, Applied Machine Learning Platform - 2025 Start

ByteDance

Singapore (On-Site)

• 9 Months ago

Research Scientist Graduate (Foundation Model - Vision and Language)

ByteDance

Seattle, Washington, United States (On-Site)

• 4 Months ago

Staff Software Engineer, ML Understanding

(Remote)

• 3 Months ago

Staff Software Engineer, Generative AI, Google Workspace

Google

Kirkland, Washington, United States (On-Site)

• 3 Months ago

Senior SWQA Test Development Engineer

NVIDIA

Shanghai, Shanghai, China (On-Site)

• 5 Months ago

Machine Learning Research Engineering Manager

Canva

Vienna, Vienna, Austria (On-Site)

• 5 Months ago

Senior Analog Layout Design Engineer

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)

• 4 Months ago

Get notifed when new similar jobs are uploaded

Jobs in California, United States

Director of FP&A

Cognite

Phoenix, Arizona, United States (Hybrid)

• 3 Months ago

Business Manager - Innovation & Partnerships Office

llnl

Livermore, California, United States (On-Site)

• 3 Months ago

AI Policy Lead

Scale AI

Washington, District Of Columbia, United States (On-Site)

• 3 Months ago

Non CDL Local Route Driver / Warehouse Associate

Iron Mountain

Greenville, South Carolina, United States (On-Site)

• 3 Months ago

Senior Battery Thermal Analyst

Wisk

Mountain View, California, United States (Hybrid)

• 4 Months ago

Machine Learning Engineer for Game Technology

PlayStation Global

Aliso Viejo, California, United States (On-Site)

• 10 Months ago

Community Specialist, Channel Retail

Apple

Peachtree City, Georgia, United States (On-Site)

• 3 Months ago

Senior Gameplay Capture Artist

PlayStation Global

California, United States (Remote)

• 8 Months ago

Revenue Accountant, Commercial Controllership

Axon

Boston, Massachusetts, United States (Hybrid)

• 3 Months ago

Sales Team Lead

TransUnion

Boca Raton, Florida, United States (Hybrid)

• 3 Months ago

Get notifed when new similar jobs are uploaded

Devops Jobs

DevOps & Automation Engineer

Sandsoft Games

Riyadh, Riyadh Province, Saudi Arabia (Hybrid)

• 4 Months ago

Infrastructure Engineer, Google Distributed Cloud

Google

Cambridge, Massachusetts, United States (On-Site)

• 3 Months ago

Senior Associate _ Automation Tester_ Emerging Technologies_ Advisory_ Bengaluru

PwC

Bengaluru, Karnataka, India (On-Site)

• 10 Months ago

Senior Automation Engineer

Larian Studios

Warsaw, Masovian Voivodeship, Poland (On-Site)

• 4 Months ago

Orchestrade - Azure infrastructure cloud Senior engineer

Luxoft

(On-Site)

• 8 Months ago

Junior DevOps Engineer

11 bit studios

Warsaw, Masovian Voivodeship, Poland (Hybrid)

• 3 Months ago

Tencent Cloud - Technical Account Manager (South Korea)

Tencent

Seoul, South Korea (On-Site)

• 7 Months ago

DevOps Engineer

Kolibri Games

Berlin, Berlin, Germany (Hybrid)

• 4 Months ago

Site Reliability Engineer

Google

Dublin, County Dublin, Ireland (On-Site)

• 3 Months ago

Senior Python Engineer (Part-Time)

N-iX

Poland (Remote)

• 3 Months ago

Get notifed when new similar jobs are uploaded

About The Company

NVIDIA

136 Active Jobs

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

A global community of game builders. Helping people upskill and land jobs in the best gaming studios.

Company

Key Links

hello@outscal.com

Made in INDIA 💛💙

Senior System Software Engineer - MLOps

Job Summary

Job Description

15 skills required

15 skills required for this role

Job Details

Similar Jobs

Manager - Accounts Receivable

R&D Engineer II

Quantitative Researcher

Senior Site Reliability Engineer, ML System

Large Language Model Algorithm Engineer - Volcano Ark

DevOps Engineer

Lead Software Engineer

Manager, Database Reliability Engineering

Senior Systems Engineer

Executive - Cloud Engineer

Similar Skill Jobs

Senior Data Engineer

Data Scientist

Data Science Internship/Workstudent

Backend Engineer, Applied Machine Learning Platform - 2025 Start

Research Scientist Graduate (Foundation Model - Vision and Language)

Staff Software Engineer, ML Understanding

Staff Software Engineer, Generative AI, Google Workspace

Senior SWQA Test Development Engineer

Machine Learning Research Engineering Manager

Senior Analog Layout Design Engineer

Jobs in California, United States

Director of FP&A

Business Manager - Innovation & Partnerships Office

AI Policy Lead

Non CDL Local Route Driver / Warehouse Associate

Senior Battery Thermal Analyst

Machine Learning Engineer for Game Technology

Community Specialist, Channel Retail

Senior Gameplay Capture Artist

Revenue Accountant, Commercial Controllership

Sales Team Lead

Devops Jobs

DevOps & Automation Engineer

Infrastructure Engineer, Google Distributed Cloud

Senior Associate _ Automation Tester_ Emerging Technologies_ Advisory_ Bengaluru

Senior Automation Engineer

Orchestrade - Azure infrastructure cloud Senior engineer

Junior DevOps Engineer

Tencent Cloud - Technical Account Manager (South Korea)

DevOps Engineer

Site Reliability Engineer

Senior Python Engineer (Part-Time)

About The Company

System Design Power Validation Engineer

OEM Account Manager

System Debug Lead Engineer

Network Site Reliability Engineer

ASIC Engineer

Senior ASIC Design Engineer

Physical Design CAD Team Manager

Senior Data Scientist and System Architect

Solutions Architect for NCP

Senior Networking Architect

Level Up Your Career in Game Development!