Senior DevOps Engineer, Deep Learning Frameworks

2 Months ago • 5 Years + • DevOps • $148,000 PA - $287,500 PA

Job Summary

Job Description

NVIDIA's Deep Learning Optimized Frameworks Group seeks a Senior DevOps Engineer to automate and optimize build, test, integration, and release processes for deep learning frameworks like TensorFlow and PyTorch. Responsibilities include configuring and maintaining industry-standard DevOps tools (Gitlab, Jenkins, Docker, etc.), developing shared utilities, leading best practices for software development, and identifying infrastructure needs. The ideal candidate will have strong experience with CI/CD systems, scripting languages (Python), and a passion for automation. Experience with CUDA and GPU computing is a plus.
Must have:
  • 5+ years relevant experience
  • CI/CD system automation
  • SCM & build system fluency (Git, CMake, Bazel)
  • Python (or Perl/Shell) programming
  • Problem-solving & collaboration
Good to have:
  • CUDA & Deep Learning experience
  • Container & cluster tech (Kubernetes, Slurm)
  • Experience with new tech incorporation
  • Contribution to large SW projects
Perks:
  • Competitive salary
  • Comprehensive benefits package
  • Equity

Job Details

NVIDIA's Deep Learning Optimized Frameworks Group is looking for an excellent DevOps Engineer to enable the next wave of NVIDIA’s highest performing deep learning software stacks. Your role spans multiple products such as TensorFlow and PyTorch and is instrumental for streamlining development, build, and releases with modern DevOps tools. Join our technically hardworking team of software engineers and infrastructure authorities to design the systems that enable NVIDIA to stay ahead of the competition as we deliver the world's fastest deep learning frameworks.

What you'll be doing:

  • Automating and optimizing build, test, integrate, and release processes for optimized NVIDIA Deep Learning Frameworks

  • Configuring, maintaining, and building upon deployments of industry-standard tools (e.g. Gitlab, Jenkins, Docker, LXC, HyperV, CMake, Bazel)

  • Developing shared utilities for setting up systems, running tests, and recording results

  • Lead best-practices for building, testing, and releasing software

  • Identifying infrastructure needs and translating them into action

What we need to see:

  • BS or higher degree in computer science (or equivalent experience)

  • 5+ years of relevant experience

  • Strong experience setting up, maintaining, and automating continuous integration systems

  • Fluency in SCM (e.g. Github, Gitlab, Git) and build systems (e.g. Make, CMake, Bazel, Docker)

  • Adept programming skills in Python (or Perl, Shell scripting, like bash, tcsh, sh)

  • Pragmatic approach to solving problems and collaboration

  • Real passion for “it just works” automation and enabling team members

Ways to stand out from the crowd:

  • Experience with CUDA and Deep Learning Software Stack

  • Good knowledge of container and cluster technologies like slurm, kubernetes, jenkins, gitlab-ci, and zabbix

  • Experience with GPU computing systems

  • Track record of identifying useful new technologies and incorporating them into SW development flows

  • Experience as an active contributor to a SW project involving many developers

With highly competitive salaries and a comprehensive benefits package, NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us and, due to unprecedented growth, our special engineering teams are growing fast. If you're a creative and autonomous engineer with a genuine passion for technology, we want to hear from you.

The base salary range is 148,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

Hedra - Senior Research Engineer

Hedra

New York, New York, United States (On-Site)
1 Month ago
Zoox - Senior/Staff Machine Learning Engineer - Prediction & Behavior ML

Zoox

Boston, Massachusetts, United States (Hybrid)
6 Months ago
ByteDance - Researcher - Large Language Models, Applied Machine Learning

ByteDance

Seattle, Washington, United States (On-Site)
1 Month ago
ByteDance - Senior Software Engineer - Generative AI

ByteDance

San Jose, California, United States (On-Site)
2 Months ago
NVIDIA - Solution Architect - CSP Cloud

NVIDIA

Shanghai, Shanghai, China (On-Site)
2 Months ago
Playgendary - DevOps (Cloud Engineer)

Playgendary

Limassol, Limassol, Cyprus (Remote)
2 Months ago
PwC - IN- Senior Associate_ DevOps_Advisory Corporate_Advisory _Bangalore

PwC

Bengaluru, Karnataka, India (On-Site)
6 Months ago
Saviynt - Senior Principal Software Engineer - Privileged Access Management (PAM)

Saviynt

El Segundo, California, United States (Hybrid)
6 Months ago
Luxoft - Senior Software Support Engineer

Luxoft

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (Remote)
5 Months ago
Scanline VFX - Senior DevOps Engineer

Scanline VFX

Seoul, South Korea (Hybrid)
2 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

NVIDIA - Senior Memory Engineer

NVIDIA

Taipei City, Taiwan (On-Site)
1 Month ago
ByteDance - Software Engineer, Architecture and Infrastructure

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
Kaedim - Machine Learning Engineer

Kaedim

Singapore (On-Site)
9 Months ago
NVIDIA - Senior Server Firmware Bringup Engineer

NVIDIA

Canada (On-Site)
1 Month ago
NVIDIA - Senior Developer Technology Engineer, Public Sector

NVIDIA

California, Maryland, United States (Remote)
1 Month ago
Krafton  - Deep Learning Research Scientist - Core

Krafton

Seoul, South Korea (On-Site)
2 Months ago
NVIDIA - PCB Layout Engineer - New College Graduate

NVIDIA

Shenzhen, Guangdong Province, China (On-Site)
3 Months ago
NVIDIA - Senior Software Engineer - HPC

NVIDIA

Santa Clara, California, United States (On-Site)
3 Months ago
NVIDIA - Research Scientist, Deep Learning and Computer Vision

NVIDIA

Hsinchu, Hsinchu City, Taiwan (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Canada

Framestore - Montreal Launchpad Internship 2025

Framestore

Montreal, Quebec, Canada (Hybrid)
1 Month ago
Ubisoft - Team Lead - Animation

Ubisoft

Toronto, Ontario, Canada (On-Site)
1 Month ago
Amber - Localization Quality Assurance (Norwegian)

Amber

Quebec, Canada (Hybrid)
2 Months ago
Larian Studios - VFX DIRECTOR

Larian Studios

Quebec, Canada (On-Site)
4 Months ago
NVIDIA - Senior System Level Product Engineer

NVIDIA

Canada (Hybrid)
2 Months ago
People Can Fly - Community Manager

People Can Fly

Montreal, Quebec, Canada (Remote)
1 Month ago
Nintendo - Brand Ambassador - Bilingual (French-English)

Nintendo

Montreal, Quebec, Canada (On-Site)
7 Months ago
Bethesda - Animation Programmer

Bethesda

Montreal, Quebec, Canada (On-Site)
9 Months ago
Epic Games - Programmeur de systèmes Gameplay sénior, Relation avec les développeurs

Epic Games

Montreal, Quebec, Canada (On-Site)
3 Months ago
Activate Games - Store Leader (Store Manager)

Activate Games

Toronto, Ontario, Canada (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

ION - Microsoft System Engineer, Italy

ION

Italy (Hybrid)
6 Months ago
Nintendo - Machine Learning Operations Engineer

Nintendo

Redmond, Washington, United States (On-Site)
2 Months ago
PlayStation Global - Sr. Software Engineer - ML/AI DevOps

PlayStation Global

San Mateo, California, United States (On-Site)
1 Month ago
ByteDance - Security Systems Engineer, Fleet Management

ByteDance

Singapore (On-Site)
3 Months ago
Auros Global - Strategy Developer - Asia

Auros Global

Asia, Lima Region, Peru (Remote)
5 Months ago
Moon Active - DevOps Engineer

Moon Active

Warsaw, Masovian Voivodeship, Poland (Hybrid)
7 Months ago
Remedy Entertainment Plc - Senior/Lead DevOps Engineer

Remedy Entertainment Plc

Helsinki, Uusimaa, Finland (Hybrid)
2 Months ago
Rackspace Technology - Software Developer III (Windows PowerShell Automation)

Rackspace Technology

India (Remote)
1 Month ago
Ubisoft - Back-End Golang Developer

Ubisoft

Montreal, Quebec, Canada (On-Site)
1 Month ago
Barracuda Networks  Inc  - Software Engineer

Barracuda Networks Inc

Bengaluru, Karnataka, India (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Massachusetts, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Texas, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug