Senior DevOps Engineer, Deep Learning Frameworks

2 Months ago • 5 Years + • DevOps

Job Summary

Job Description

NVIDIA's Deep Learning Optimized Frameworks Group seeks a Senior DevOps Engineer to enhance their high-performing deep learning software stacks. Responsibilities include automating build, test, integration, and release processes for frameworks like TensorFlow and PyTorch; configuring and maintaining industry-standard DevOps tools (Gitlab, Jenkins, Docker, etc.); developing shared utilities; leading best practices; and identifying infrastructure needs. The ideal candidate will possess strong experience in CI/CD, SCM, and build systems, along with programming skills in Python (or similar).
Must have:
  • 5+ years relevant experience
  • CI/CD system automation
  • SCM & build systems expertise (Git, CMake, etc.)
  • Python (or Perl/Shell scripting)
  • Problem-solving & collaboration
Good to have:
  • CUDA & Deep Learning Software Stack experience
  • Container & cluster tech (Kubernetes, Jenkins, etc.)
  • GPU computing systems knowledge
  • Experience with new tech incorporation
Perks:
  • Highly competitive salaries
  • Extensive benefits package
  • Diverse and inclusive work environment

Job Details

NVIDIA's Deep Learning Optimized Frameworks Group is looking for an excellent DevOps Engineer to enable the next wave of NVIDIA’s highest performing deep learning software stacks. Your role spans multiple products such as TensorFlow and PyTorch and is instrumental for streamlining development, build, and releases with modern DevOps tools. Join our technically hardworking team of software engineers and infrastructure authorities to design the systems that enable NVIDIA to stay ahead of the competition as we deliver the world's fastest deep learning frameworks.

What you'll be doing:

  • Automating and optimizing build, test, integrate, and release processes for optimized NVIDIA Deep Learning Frameworks

  • Configuring, maintaining, and building upon deployments of industry-standard tools (e.g. Gitlab, Jenkins, Docker, LXC, HyperV, CMake, Bazel)

  • Developing shared utilities for setting up systems, running tests, and recording results

  • Lead best-practices for building, testing, and releasing software

  • Identifying infrastructure needs and translating them into action

What we need to see:

  • BS or higher degree in computer science (or equivalent experience)

  • 5+ years of relevant experience

  • Strong experience setting up, maintaining, and automating continuous integration systems

  • Fluency in SCM (e.g. Github, Gitlab, Git) and build systems (e.g. Make, CMake, Bazel, Docker)

  • Adept programming skills in Python (or Perl, Shell scripting, like bash, tcsh, sh)

  • Pragmatic approach to solving problems and collaboration

  • Real passion for “it just works” automation and enabling team members

Ways to stand out from the crowd:

  • Experience with CUDA and Deep Learning Software Stack

  • Good knowledge of container and cluster technologies like slurm, kubernetes, jenkins, gitlab-ci, and zabbix

  • Experience with GPU computing systems

  • Track record of identifying useful new technologies and incorporating them into SW development flows

  • Experience as an active contributor to a SW project involving many developers

NVIDIA is at the forefront of breakthroughs in Artificial Intelligence, High-Performance Computing, and Visualization. Our teams are composed of driven, innovative professionals dedicated to pushing the boundaries of technology. We offer highly competitive salaries, an extensive benefits package, and a work environment that promotes diversity, inclusion, and flexibility. As an equal opportunity employer, we are committed to fostering a supportive and empowering workplace for all.

Similar Jobs

NVIDIA - Mixed Signal Design Engineer

NVIDIA

Santa Clara, California, United States (On-Site)
1 Month ago
Tencent - Data Science Intern

Tencent

(On-Site)
1 Month ago
GoTo Group - Senior Data Scientist  (Singapore)

GoTo Group

Singapore (On-Site)
5 Months ago
Airlab Inc  - Junior Programmer Artificial Intelligence

Airlab Inc

Quebec, Canada (On-Site)
5 Days ago
Canva - Engineering Manager - Design Generation

Canva

Sydney, New South Wales, Australia (Hybrid)
5 Months ago
Sinch - Site Reliability Engineer (Expert-level)

Sinch

France (Remote)
1 Week ago
N-iX - Senior (Lead) DevOps Engineer

N-iX

(Hybrid)
1 Week ago
Netflix - Solutions Support Engineer (L5) - Observability

Netflix

Warsaw, Masovian Voivodeship, Poland (Hybrid)
3 Months ago
ByteDance - Cloud Site Reliability Engineer

ByteDance

Seattle, Washington, United States (On-Site)
6 Days ago
Zazz - Data Engineer

Zazz

(Remote)
2 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

PlayStation Global - Machine Learning Engineer for Game Technology

PlayStation Global

London, England, United Kingdom (On-Site)
8 Months ago
ByteDance - Senior Machine Learning Engineer

ByteDance

San Jose, California, United States (On-Site)
1 Month ago
NVIDIA - Senior Post Silicon Hardware Engineer

NVIDIA

Canada (Hybrid)
3 Weeks ago
NVIDIA - Principal Software Architect, GPU Networking Research

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
2 Months ago
PwC - IN-Senior Associate_ML Engineer_Data &Analytics_Advisory_Bangalore

PwC

Bengaluru, Karnataka, India (On-Site)
5 Months ago
Corsair - Senior Manager, AI & Data

Corsair

Munich, Bavaria, Germany (On-Site)
1 Week ago
NVIDIA - Senior Circuit Characterization Engineer

NVIDIA

Canada (Hybrid)
3 Weeks ago
ByteDance - High-Performance Computing Research Scientist (Inference Optimization)

ByteDance

San Jose, California, United States (On-Site)
6 Days ago
NVIDIA - Account Leader, Automotive

NVIDIA

(On-Site)
1 Month ago
NVIDIA - Manager, Developer Technology, Data Compression

NVIDIA

California, United States (Hybrid)
2 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Warsaw, Masovian Voivodeship, Poland

Testronic - LQA Game Tester with Korean

Testronic

Poland (Remote)
1 Day ago
CD PROJEKT RED - Lead VFX Artist

CD PROJEKT RED

Warsaw, Masovian Voivodeship, Poland (On-Site)
1 Month ago
Lucid Reality Labs - QA Lead

Lucid Reality Labs

Poland (Hybrid)
1 Month ago
Netflix - Software Engineer (L5) - Developer Enablement Tooling

Netflix

Warsaw, Masovian Voivodeship, Poland (Hybrid)
5 Months ago
Techland - Animation Director

Techland

Warsaw, Masovian Voivodeship, Poland (On-Site)
3 Months ago
PwC - Application to PwC

PwC

Warsaw, Masovian Voivodeship, Poland (Hybrid)
6 Months ago
Fool's Theory - Administration Specialist

Fool's Theory

Poland (On-Site)
2 Months ago
Social Discovery Group - Digital Experience Designer

Social Discovery Group

Poland (Remote)
1 Month ago
ComeOn Group - Frontend Developer

ComeOn Group

Silesian Voivodeship, Poland (Hybrid)
1 Week ago
CD PROJEKT RED - Intern in Compensation & Benefits Team

CD PROJEKT RED

Warsaw, Masovian Voivodeship, Poland (On-Site)
2 Weeks ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

Oriserve - Senior DevOps Engineer (4+ Yrs exp)

Oriserve

Noida, Uttar Pradesh, India (On-Site)
5 Months ago
Rackspace Technology - Site Reliability Engineer / Observability Engineer

Rackspace Technology

Giza, Giza Governorate, Egypt (Remote)
3 Months ago
Rackspace Technology - Cloud Engineer IV (Java Dev Google Cloud Practice Engineer)

Rackspace Technology

Gurugram, Haryana, India (Remote)
6 Days ago
ByteDance - Senior Software Engineer, Cloud Infrastructure

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
Velotio Technologies - Senior Software Engineer (Golang)

Velotio Technologies

Maharashtra, India (Remote)
2 Weeks ago
Dream Sports - SDE - 1 - DevOps

Dream Sports

Mumbai, Maharashtra, India (On-Site)
5 Months ago
Nielsen Holdings - Senior Software Engineer (Java/Scala, Spark, Kubernetes, AWS)

Nielsen Holdings

Gurugram, Haryana, India (Hybrid)
5 Months ago
ION - Senior DevSecOps Engineer, Italy

ION

Collecchio, Emilia-Romagna, Italy (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.


Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Ra'anana, Center District, Israel (On-Site)

Ra'anana, Center District, Israel (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug