Senior DevOps Engineer, Deep Learning Frameworks

6 Months ago • 5 Years + • Devops

Job Summary

Job Description

NVIDIA's Deep Learning Optimized Frameworks Group seeks a Senior DevOps Engineer to enhance its high-performing deep learning software stacks. Responsibilities include automating build, test, integration, and release processes for frameworks like TensorFlow and PyTorch; configuring and maintaining industry-standard DevOps tools (GitLab, Jenkins, Docker, etc.); developing shared utilities; leading best practices; and identifying infrastructure needs. The ideal candidate will have strong experience in CI/CD, SCM, and build systems, along with programming skills in Python (or similar).
Must have:
  • 5+ years relevant experience
  • CI/CD system automation
  • SCM & build systems expertise (Git, CMake, etc.)
  • Python (or Perl/Shell scripting)
  • Problem-solving & collaboration
Good to have:
  • CUDA & Deep Learning Software Stack experience
  • Container & cluster tech (Kubernetes, Jenkins, etc.)
  • GPU computing systems knowledge
  • Experience incorporating new technologies
Perks:
  • Highly competitive salaries
  • Extensive benefits package
  • Diverse and inclusive work environment

Job Details

NVIDIA's Deep Learning Optimized Frameworks Group is looking for an excellent DevOps Engineer to enable the next wave of NVIDIA’s highest performing deep learning software stacks. Your role spans multiple products such as TensorFlow and PyTorch and is instrumental in streamlining development, build, and releases with modern DevOps tools. Join our technically strong, hardworking team of software engineers and infrastructure experts to design the systems that enable NVIDIA to stay ahead of the competition as we deliver the world's fastest deep learning frameworks.

What you'll be doing:

  • Automating and optimizing build, test, integration, and release processes for optimized NVIDIA Deep Learning Frameworks

  • Configuring, maintaining, and building upon deployments of industry-standard tools (e.g. GitLab, Jenkins, Docker, LXC, Hyper-V, CMake, Bazel)

  • Developing shared utilities for setting up systems, running tests, and recording results

  • Leading best practices for building, testing, and releasing software

  • Identifying infrastructure needs and translating them into action

What we need to see:

  • BS or higher degree in computer science (or equivalent experience)

  • 5+ years of relevant experience

  • Strong experience setting up, maintaining, and automating continuous integration systems

  • Fluency in SCM (e.g. GitHub, GitLab, Git) and build systems (e.g. Make, CMake, Bazel, Docker)

  • Strong programming skills in Python (or Perl or shell scripting, e.g. bash, tcsh, sh)

  • Pragmatic approach to problem solving and collaboration

  • Real passion for “it just works” automation and enabling team members

Ways to stand out from the crowd:

  • Experience with CUDA and Deep Learning Software Stack

  • Good knowledge of container and cluster technologies like Slurm, Kubernetes, Jenkins, GitLab CI, and Zabbix

  • Experience with GPU computing systems

  • Track record of identifying useful new technologies and incorporating them into SW development flows

  • Experience as an active contributor to a SW project involving many developers

NVIDIA is at the forefront of breakthroughs in Artificial Intelligence, High-Performance Computing, and Visualization. Our teams are composed of driven, innovative professionals dedicated to pushing the boundaries of technology. We offer highly competitive salaries, an extensive benefits package, and a work environment that promotes diversity, inclusion, and flexibility. As an equal opportunity employer, we are committed to fostering a supportive and empowering workplace for all.


About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.
