Senior DevOps Engineer, Deep Learning Frameworks

3 Months ago • 5 Years + • DevOps • $148,000 PA - $287,500 PA

Job Summary

Job Description

NVIDIA's Deep Learning Optimized Frameworks Group seeks a Senior DevOps Engineer to enhance their high-performing deep learning software stacks (TensorFlow, PyTorch). Responsibilities include automating build, test, integration, and release processes; configuring and maintaining industry-standard tools (Gitlab, Jenkins, Docker, etc.); developing shared utilities; leading best practices; and identifying infrastructure needs. The ideal candidate will have strong experience with CI systems, SCM, build systems, and Python programming, along with a passion for automation.
Must have:
  • 5+ years relevant experience
  • CI/CD automation expertise
  • SCM & build system fluency (Git, CMake, Bazel)
  • Python programming skills
  • Problem-solving & collaboration
Good to have:
  • CUDA & Deep Learning experience
  • Container & cluster tech (Kubernetes, Jenkins)
  • GPU computing systems knowledge
  • Experience with new tech incorporation
  • Contribution to large SW projects
Perks:
  • Competitive salary
  • Comprehensive benefits package
  • Equity

Job Details

NVIDIA's Deep Learning Optimized Frameworks Group is looking for an excellent DevOps Engineer to enable the next wave of NVIDIA’s highest performing deep learning software stacks. Your role spans multiple products such as TensorFlow and PyTorch and is instrumental for streamlining development, build, and releases with modern DevOps tools. Join our technically hardworking team of software engineers and infrastructure authorities to design the systems that enable NVIDIA to stay ahead of the competition as we deliver the world's fastest deep learning frameworks.

What you'll be doing:

  • Automating and optimizing build, test, integrate, and release processes for optimized NVIDIA Deep Learning Frameworks

  • Configuring, maintaining, and building upon deployments of industry-standard tools (e.g. Gitlab, Jenkins, Docker, LXC, HyperV, CMake, Bazel)

  • Developing shared utilities for setting up systems, running tests, and recording results

  • Lead best-practices for building, testing, and releasing software

  • Identifying infrastructure needs and translating them into action

What we need to see:

  • BS or higher degree in computer science (or equivalent experience)

  • 5+ years of relevant experience

  • Strong experience setting up, maintaining, and automating continuous integration systems

  • Fluency in SCM (e.g. Github, Gitlab, Git) and build systems (e.g. Make, CMake, Bazel, Docker)

  • Adept programming skills in Python (or Perl, Shell scripting, like bash, tcsh, sh)

  • Pragmatic approach to solving problems and collaboration

  • Real passion for “it just works” automation and enabling team members

Ways to stand out from the crowd:

  • Experience with CUDA and Deep Learning Software Stack

  • Good knowledge of container and cluster technologies like slurm, kubernetes, jenkins, gitlab-ci, and zabbix

  • Experience with GPU computing systems

  • Track record of identifying useful new technologies and incorporating them into SW development flows

  • Experience as an active contributor to a SW project involving many developers

With highly competitive salaries and a comprehensive benefits package, NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us and, due to unprecedented growth, our special engineering teams are growing fast. If you're a creative and autonomous engineer with a genuine passion for technology, we want to hear from you.

The base salary range is 148,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

Rivos - Member of Technical Staff (91839)

Rivos

Santa Clara, California, United States (Hybrid)
6 Months ago
Genies - Machine Learning Engineer, Character Animation & Motion AI

Genies

Los Angeles, California, United States (On-Site)
2 Months ago
ByteDance - Software Engineer Intern (Doubao (Seed) - Machine Learning System) - 2025 Summer (PhD)

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
NVIDIA - Senior Performance Software Engineer

NVIDIA

Taipei City, Taiwan (On-Site)
3 Months ago
Velotio Technologies - Senior DevOps (Azure) Engineer

Velotio Technologies

Maharashtra, India (Remote)
1 Month ago
VGW - Staff Site Reliability Engineer

VGW

Perth, Western Australia, Australia (On-Site)
2 Months ago
Warner Bros Games - Senior Software Engineering - Cloud Support and Operations

Warner Bros Games

Bengaluru, Karnataka, India (Hybrid)
1 Month ago
Assystems - DevOps Engineer

Assystems

Gurugram, Haryana, India (On-Site)
6 Months ago
Netflix - Distributed Systems Engineer (L5) - Infra Control Planes

Netflix

Los Gatos, California, United States (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

The Walt Disney Company - Sr Machine Learning Engineer

The Walt Disney Company

Santa Monica, California, United States (On-Site)
5 Months ago
ByteDance - Software Engineer, Model Inference

ByteDance

Seattle, Washington, United States (On-Site)
2 Months ago
Electronic Arts - Senior Software Engineer

Electronic Arts

Orlando, Florida, United States (On-Site)
1 Month ago
NVIDIA - Global VAT Advisory and Compliance Manager

NVIDIA

Santa Clara, California, United States (On-Site)
2 Months ago
Epic Games - Machine Learning Engineer

Epic Games

London, England, United Kingdom (On-Site)
1 Month ago
GoTo Group - Senior Data Scientist - Computer Vision - KYC

GoTo Group

Singapore (On-Site)
6 Months ago
NVIDIA - Senior Solution Architect, HPC and AI

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (Hybrid)
2 Months ago
NVIDIA - Senior Solutions Architect, Generative AI - Inference

NVIDIA

California, United States (Remote)
3 Months ago
ByteDance - Senior Research Scientist, Foundation Model, Speech Understanding

ByteDance

San Jose, California, United States (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Santa Clara, California, United States

Twitch - Product Marketing Manager

Twitch

Seattle, Washington, United States (On-Site)
1 Month ago
Flow - Senior/Staff Web Engineer

Flow

New York, New York, United States (Hybrid)
6 Months ago
Lionsgate Games - Director, Financial Planning Systems

Lionsgate Games

Santa Monica, California, United States (On-Site)
2 Months ago
Xsolla - Outsourcing Producer

Xsolla

Los Angeles, California, United States (Remote)
3 Months ago
31st Union - Expert UI Engineer

31st Union

San Mateo, California, United States (On-Site)
2 Months ago
Alphasense - Product Specialist, Financial Services

Alphasense

New York, New York, United States (On-Site)
5 Months ago
ByteDance - Hardware Interaction Industrial Designer

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
The Walt Disney Company - Manager, Software Engineering

The Walt Disney Company

Washington, United States (On-Site)
2 Months ago
Scientific Games  - Helpdesk Technician I

Scientific Games

Alpharetta, Georgia, United States (On-Site)
3 Months ago
Anavation - Deployment Staff Officer

Anavation

Linthicum Heights, Maryland, United States (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

Egnyte - Senior DevOps Engineer - Azure

Egnyte

India (Remote)
2 Months ago
GoTo Group - Senior Software Engineer - Event Platform

GoTo Group

Gurugram, Haryana, India (On-Site)
6 Months ago
Tencent - Tencent Cloud - Senior Cloud Architect (R&D & Solution Design)

Tencent

Singapore (On-Site)
5 Months ago
DraftKings - Manager, System DBA Operations

DraftKings

Sofia, Sofia City Province, Bulgaria (On-Site)
5 Months ago
NVIDIA - Software Manager, Golang Kubernetes

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
3 Months ago
Egnyte - Senior Build Engineer - Python - Jenkins

Egnyte

India (Remote)
4 Months ago
CapSpire - Senior Consultant – Endur Technical

CapSpire

Bengaluru, Karnataka, India (Remote)
5 Months ago
Ubisoft - Monitoring Specialist - Golang Developer

Ubisoft

Saint-Mandé, Île-de-France, France (Hybrid)
2 Months ago
Zeta - Senior Site Reliability Engineer

Zeta

Bengaluru, Karnataka, India (On-Site)
6 Months ago
Larian Studios - Senior Automation Engineer

Larian Studios

Guildford, England, United Kingdom (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Massachusetts, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Texas, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Austin, Texas, United States (Remote)

Santa Clara, California, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug