Senior DevOps Engineer, Deep Learning Frameworks

2 Months ago • 5 Years + • DevOps • $148,000 PA - $287,500 PA

Job Summary

Job Description

NVIDIA's Deep Learning Optimized Frameworks Group seeks a Senior DevOps Engineer to enhance its high-performing deep learning software stacks (TensorFlow, PyTorch). Responsibilities include automating build, test, integration, and release processes; configuring and maintaining industry-standard tools (GitLab, Jenkins, Docker, etc.); developing shared utilities; leading best practices; and identifying infrastructure needs. The ideal candidate will have strong experience with CI systems, SCM, build systems, and Python programming, along with a passion for automation.
Must have:
  • 5+ years relevant experience
  • CI/CD automation expertise
  • SCM & build system fluency (Git, CMake, Bazel)
  • Python programming skills
  • Problem-solving & collaboration
Good to have:
  • CUDA & Deep Learning experience
  • Container & cluster tech (Kubernetes, Jenkins)
  • GPU computing systems knowledge
  • Experience with new tech incorporation
  • Contribution to large SW projects
Perks:
  • Competitive salary
  • Comprehensive benefits package
  • Equity

Job Details

NVIDIA's Deep Learning Optimized Frameworks Group is looking for an excellent DevOps Engineer to enable the next wave of NVIDIA’s highest-performing deep learning software stacks. Your role spans multiple products such as TensorFlow and PyTorch and is instrumental in streamlining development, builds, and releases with modern DevOps tools. Join our hardworking, technically strong team of software engineers and infrastructure experts to design the systems that enable NVIDIA to stay ahead of the competition as we deliver the world's fastest deep learning frameworks.

What you'll be doing:

  • Automating and optimizing build, test, integration, and release processes for optimized NVIDIA Deep Learning Frameworks

  • Configuring, maintaining, and building upon deployments of industry-standard tools (e.g. GitLab, Jenkins, Docker, LXC, Hyper-V, CMake, Bazel)

  • Developing shared utilities for setting up systems, running tests, and recording results

  • Leading best practices for building, testing, and releasing software

  • Identifying infrastructure needs and translating them into action

What we need to see:

  • BS or higher degree in computer science (or equivalent experience)

  • 5+ years of relevant experience

  • Strong experience setting up, maintaining, and automating continuous integration systems

  • Fluency in SCM (e.g. GitHub, GitLab, Git) and build systems (e.g. Make, CMake, Bazel, Docker)

  • Strong programming skills in Python (or Perl, or shell scripting such as bash, tcsh, sh)

  • Pragmatic approach to problem solving and collaboration

  • Real passion for “it just works” automation and enabling team members

Ways to stand out from the crowd:

  • Experience with CUDA and Deep Learning Software Stack

  • Good knowledge of container and cluster technologies like Slurm, Kubernetes, Jenkins, GitLab CI, and Zabbix

  • Experience with GPU computing systems

  • Track record of identifying useful new technologies and incorporating them into SW development flows

  • Experience as an active contributor to a SW project involving many developers

With highly competitive salaries and a comprehensive benefits package, NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us and, due to unprecedented growth, our special engineering teams are growing fast. If you're a creative and autonomous engineer with a genuine passion for technology, we want to hear from you.

The base salary range is 148,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

