Senior DevOps Engineer, Deep Learning Frameworks

4 Months ago • 5 Years + • DevOps

Job Summary

Job Description

NVIDIA's Deep Learning Optimized Frameworks Group seeks a Senior DevOps Engineer to enhance its high-performing deep learning software stacks. Responsibilities include automating build, test, integration, and release processes for frameworks like TensorFlow and PyTorch; configuring and maintaining industry-standard DevOps tools (GitLab, Jenkins, Docker, etc.); developing shared utilities; leading best practices; and identifying infrastructure needs. The ideal candidate will possess strong experience in CI/CD, SCM, and build systems, along with programming skills in Python (or similar).
Must have:
  • 5+ years relevant experience
  • CI/CD system automation
  • SCM & build systems expertise (Git, CMake, etc.)
  • Python (or Perl/Shell scripting)
  • Problem-solving & collaboration
Good to have:
  • CUDA & Deep Learning Software Stack experience
  • Container & cluster tech (Kubernetes, Jenkins, etc.)
  • GPU computing systems knowledge
  • Experience with new tech incorporation
Perks:
  • Highly competitive salaries
  • Extensive benefits package
  • Diverse and inclusive work environment

Job Details

NVIDIA's Deep Learning Optimized Frameworks Group is looking for an excellent DevOps Engineer to enable the next wave of NVIDIA’s highest performing deep learning software stacks. Your role spans multiple products such as TensorFlow and PyTorch and is instrumental in streamlining development, builds, and releases with modern DevOps tools. Join our technically strong, hardworking team of software engineers and infrastructure experts to design the systems that enable NVIDIA to stay ahead of the competition as we deliver the world's fastest deep learning frameworks.

What you'll be doing:

  • Automating and optimizing build, test, integration, and release processes for optimized NVIDIA Deep Learning Frameworks

  • Configuring, maintaining, and building upon deployments of industry-standard tools (e.g. GitLab, Jenkins, Docker, LXC, Hyper-V, CMake, Bazel)

  • Developing shared utilities for setting up systems, running tests, and recording results

  • Leading best practices for building, testing, and releasing software

  • Identifying infrastructure needs and translating them into action

What we need to see:

  • BS or higher degree in computer science (or equivalent experience)

  • 5+ years of relevant experience

  • Strong experience setting up, maintaining, and automating continuous integration systems

  • Fluency in SCM (e.g. GitHub, GitLab, Git) and build systems (e.g. Make, CMake, Bazel, Docker)

  • Strong programming skills in Python (or Perl or shell scripting, such as bash, tcsh, or sh)

  • Pragmatic approach to problem-solving and collaboration

  • Real passion for “it just works” automation and enabling team members

Ways to stand out from the crowd:

  • Experience with CUDA and Deep Learning Software Stack

  • Good knowledge of container and cluster technologies like Slurm, Kubernetes, Jenkins, GitLab CI, and Zabbix

  • Experience with GPU computing systems

  • Track record of identifying useful new technologies and incorporating them into SW development flows

  • Experience as an active contributor to a SW project involving many developers

NVIDIA is at the forefront of breakthroughs in Artificial Intelligence, High-Performance Computing, and Visualization. Our teams are composed of driven, innovative professionals dedicated to pushing the boundaries of technology. We offer highly competitive salaries, an extensive benefits package, and a work environment that promotes diversity, inclusion, and flexibility. As an equal opportunity employer, we are committed to fostering a supportive and empowering workplace for all.

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.
