Senior DevOps Engineer, Deep Learning Frameworks

1 Month ago • 5 Years + • DevOps • $148,000 PA - $287,500 PA

Job Summary

Job Description

NVIDIA's Deep Learning Optimized Frameworks Group seeks a Senior DevOps Engineer to enhance their high-performing deep learning software stacks (TensorFlow, PyTorch). Responsibilities include automating build, test, integration, and release processes; configuring and maintaining industry-standard tools (Gitlab, Jenkins, Docker, etc.); developing shared utilities; leading best practices; and identifying infrastructure needs. The ideal candidate will have strong experience with CI systems, SCM, build systems, and Python programming, along with a passion for automation.
Must have:
  • 5+ years relevant experience
  • CI/CD automation expertise
  • SCM & build system fluency (Git, CMake, Bazel)
  • Python programming skills
  • Problem-solving & collaboration
Good to have:
  • CUDA & Deep Learning experience
  • Container & cluster tech (Kubernetes, Jenkins)
  • GPU computing systems knowledge
  • Experience with new tech incorporation
  • Contribution to large SW projects
Perks:
  • Competitive salary
  • Comprehensive benefits package
  • Equity

Job Details

NVIDIA's Deep Learning Optimized Frameworks Group is looking for an excellent DevOps Engineer to enable the next wave of NVIDIA’s highest performing deep learning software stacks. Your role spans multiple products such as TensorFlow and PyTorch and is instrumental for streamlining development, build, and releases with modern DevOps tools. Join our technically hardworking team of software engineers and infrastructure authorities to design the systems that enable NVIDIA to stay ahead of the competition as we deliver the world's fastest deep learning frameworks.

What you'll be doing:

  • Automating and optimizing build, test, integrate, and release processes for optimized NVIDIA Deep Learning Frameworks

  • Configuring, maintaining, and building upon deployments of industry-standard tools (e.g. Gitlab, Jenkins, Docker, LXC, HyperV, CMake, Bazel)

  • Developing shared utilities for setting up systems, running tests, and recording results

  • Lead best-practices for building, testing, and releasing software

  • Identifying infrastructure needs and translating them into action

What we need to see:

  • BS or higher degree in computer science (or equivalent experience)

  • 5+ years of relevant experience

  • Strong experience setting up, maintaining, and automating continuous integration systems

  • Fluency in SCM (e.g. Github, Gitlab, Git) and build systems (e.g. Make, CMake, Bazel, Docker)

  • Adept programming skills in Python (or Perl, Shell scripting, like bash, tcsh, sh)

  • Pragmatic approach to solving problems and collaboration

  • Real passion for “it just works” automation and enabling team members

Ways to stand out from the crowd:

  • Experience with CUDA and Deep Learning Software Stack

  • Good knowledge of container and cluster technologies like slurm, kubernetes, jenkins, gitlab-ci, and zabbix

  • Experience with GPU computing systems

  • Track record of identifying useful new technologies and incorporating them into SW development flows

  • Experience as an active contributor to a SW project involving many developers

With highly competitive salaries and a comprehensive benefits package, NVIDIA is widely considered to be one of the technology world's most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us and, due to unprecedented growth, our special engineering teams are growing fast. If you're a creative and autonomous engineer with a genuine passion for technology, we want to hear from you.

The base salary range is 148,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

Krafton  - [AI] Game AI Contents Programmer (2년 이상)

Krafton

Seoul, South Korea (On-Site)
3 Months ago
Intel Corporation - AI Frameworks Architect

Intel Corporation

Bengaluru, Karnataka, India (Hybrid)
2 Months ago
Krafton  - Deep Learning Engineer - RL

Krafton

Seoul, South Korea (On-Site)
1 Month ago
Appier - Senior Software Engineer, Data Backend(CrossX)

Appier

Taipei City, Taiwan (On-Site)
3 Months ago
Microsoft - Senior Researcher - Embodied AI/Robotics - Microsoft Research

Microsoft

Redmond, Washington, United States (On-Site)
1 Month ago
Enphase Energy - Staff Devops Engineer

Enphase Energy

Bengaluru, Karnataka, India (On-Site)
3 Months ago
GoTo Group - Senior Software Engineer - Engineering Platform

GoTo Group

Bengaluru, Karnataka, India (On-Site)
3 Months ago
GoTo Group - Lead Software Engineer - Engineering Platform

GoTo Group

Gurugram, Haryana, India (On-Site)
3 Months ago
Ondezx - Elasticsearch

Ondezx

Karnataka, India (Hybrid)
5 Months ago
Patterned Learning Career - Senior Architectural Software Engineer

Patterned Learning Career

(Remote)
1 Week ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Microsoft - Research Intern - Conversational AI

Microsoft

Redmond, Washington, United States (On-Site)
1 Month ago
NVIDIA - Customer Technical Program Manager

NVIDIA

Shenzhen, Guangdong Province, China (On-Site)
1 Month ago
NVIDIA - Developer Relations Manager, EDA

NVIDIA

Santa Clara, California, United States (On-Site)
1 Month ago
Tencent - NLP Research Intern 104493

Tencent

London, England, United Kingdom (On-Site)
1 Month ago
ByteDance - Senior Software Development Engineer, Large Language Models & Generative AI

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
Meta - Postdoctoral Researcher, Embodied AI (PhD)

Meta

Seattle, Washington, United States (On-Site)
3 Months ago
Meta - Software Engineer, Systems ML - SW/HW Co-design

Meta

Menlo Park, California, United States (Remote)
3 Months ago
NVIDIA - ASIC Engineer - PCIe

NVIDIA

Bengaluru, Karnataka, India (On-Site)
1 Week ago
Ubisoft - Principal R&D Scientist on Bots & Behaviors

Ubisoft

Bordeaux, Nouvelle-Aquitaine, France (Hybrid)
4 Weeks ago
ByteDance - Student Researcher (Doubao (Seed) - Foundation Model, Speech & Audio) - 2024 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Santa Clara, California, United States

Netflix - Data Engineer (L5) - Product (Device)

Netflix

United States (Remote)
3 Months ago
Rackspace Technology - Professional Services Delivery Director

Rackspace Technology

United States (Remote)
1 Month ago
Rivos - Silicon Logic Formal Verification - Full Time

Rivos

Santa Clara, California, United States (Hybrid)
4 Months ago
Pipeworks - Senior Environment Artist

Pipeworks

Eugene, Oregon, United States (Remote)
2 Weeks ago
ByteDance - Commerce Data Analyst

ByteDance

San Jose, California, United States (On-Site)
5 Days ago
Epic Games - Texture Artist

Epic Games

Cary, North Carolina, United States (On-Site)
2 Weeks ago
Netflix - Manager, Communications - Consumer Products & Live Experiences

Netflix

Los Angeles, California, United States (On-Site)
1 Month ago
The Walt Disney Company - Exhibition Designer (PH)

The Walt Disney Company

Los Angeles, California, United States (On-Site)
2 Months ago
ION - Technical Consultant - Endur

ION

New York, New York, United States (On-Site)
4 Months ago
NVIDIA - Senior Emulation Power Engineer

NVIDIA

Santa Clara, California, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

Ubisoft - DevOps Linux Administrator

Ubisoft

Saint-Mandé, Île-de-France, France (Hybrid)
3 Weeks ago
UniVoxx - Kamailio (VOIP) Engineer

UniVoxx

Ahmedabad, Gujarat, India (On-Site)
5 Months ago
Microsoft - Software Engineer - Storage

Microsoft

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Demonware - Platform Engineering Co-op

Demonware

Vancouver, British Columbia, Canada (Hybrid)
3 Weeks ago
Anavation - Cloud Engineer

Anavation

Reston, Virginia, United States (On-Site)
2 Months ago
NVIDIA - Senior AI-HPC Storage Engineer

NVIDIA

Santa Clara, California, United States (On-Site)
1 Month ago
Trend Micro - Sr. Engineer

Trend Micro

Taipei City, Taiwan (On-Site)
4 Months ago
Dambuster Studios - Lead Build Engineer

Dambuster Studios

Nottingham, England, United Kingdom (Hybrid)
2 Weeks ago
Logifuture - Senior DevOps Engineer

Logifuture

Belgrade, Serbia (Remote)
4 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.


Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Shenzhen, Guangdong Province, China (On-Site)

Bengaluru, Karnataka, India (On-Site)

Taipei City, Taiwan (On-Site)

Taipei City, Taiwan (On-Site)

Shanghai, Shanghai, China (On-Site)

Shanghai, Shanghai, China (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug