Senior DevOps Engineer - Accelerated Computing

6 Days ago • 6 Years + • DevOps • $184,000 PA - $356,500 PA

Job Summary

Job Description

The Senior DevOps Engineer - Accelerated Computing at NVIDIA's CUDA Math Libraries team will manage numerous builds and tests across various architectures and operating systems. This role involves collecting and analyzing data, building infrastructure and tools, and collaborating to improve efficiency. Responsibilities include deploying, configuring, and debugging distributed systems, working with C++ code and build processes (GNU Make, CMake, etc.), using source control (Git), and containerization. The ideal candidate has 6+ years of relevant experience, strong Linux proficiency, and expertise in scripting languages like Python or Perl. Experience with HPC systems and Agile methodologies is a plus.
Must have:
  • 6+ years relevant experience
  • Proficient with Linux
  • Expert in scripting (Python, Perl, etc.)
  • Deploying/debugging distributed systems
  • C++ build process knowledge (Make, CMake)
  • Source control (Git), container familiarity
Good to have:
  • HPC hardware/software experience
  • System admin experience on multi-user Linux
  • GPU-accelerated systems background
  • Agile experience
Perks:
  • Equity
  • Benefits

Job Details

We are the CUDA Math Libraries team at NVIDIA -- consistently named one of America's Best Places to Work by Glassdoor. We are looking for a Senior DevOps Engineer to join our team, although Site Reliability Engineer, Build and Release Engineer, Continuous Integration Engineer.. all can be valid titles for this role. Our team builds software that finds its way into AI applications, self-driving cars, and some of the world's fastest supercomputers solving challenges in science, medicine, and engineering. We're looking for someone with strong integrity, reliability, persistence, problem-solving ability, and skills in Linux, scripting, debugging, and troubleshooting.

What you will be doing:

  • Running a lot of builds and tests on a lot of architectures, operating systems, and devices.

  • Collecting a lot of data and working collaboratively to brainstorm and build infrastructure and tools to make sense of it all.

  • Building relationships that allow us to work together as a team, not a group.

  • Working in a highly dynamic environment where we have to think on our feet.

What we need to see:

  • 6+ years of relevant industry experience.

  • Proficient with Linux.

  • Bachelors degree in a related area of study or equivalent experience.

  • Expert with scripting in one or more of Python, Perl, shell, Groovy, etc..

  • Strong background with deploying, configuring, and debugging distributed systems.

  • You should be familiar with the software build process (read compiling C++ code with GNU Make, CMake, Visual Studio, MSBuild, etc.).

  • Background with some form of source control management (SCM), preferably git.

  • Familiar with containers.

Ways to stand out from the crowd:

  • Experience with HPC hardware systems such as compute clusters and HPC software performance benchmarking on such systems.

  • System administrator level experience with multi-user Linux servers.

  • Background with GPU accelerated systems.

  • Experience working in an environment where Agile processes and methodologies are used.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most experienced and hard-working people in the world working for us. Are you creative and autonomous? Do you love a challenge? If so, we want to hear from you. Come help us build the real-time, efficient computing platform driving our success in the dynamic and quickly growing field Deep Learning and Artificial Intelligence.

#LI-Hybrid

The base salary range is 184,000 USD - 356,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

ByteDance - Site Reliability Engineer, Traffic Infrastructure

ByteDance

Singapore (On-Site)
5 Months ago
Luxoft - Murex Technical Developer - Lead

Luxoft

Toronto, Ontario, Canada (On-Site)
5 Months ago
Microsoft - Member of Technical Staff – Windows Engineer

Microsoft

(Hybrid)
1 Week ago
Sinch - System Engineer

Sinch

Noida, Uttar Pradesh, India (On-Site)
4 Weeks ago
Intrepid Studios,  Inc  - Associate Software Engineer

Intrepid Studios, Inc

(Remote)
2 Months ago
Scanline VFX - Release DevOps Engineer

Scanline VFX

Vancouver, British Columbia, Canada (Hybrid)
3 Weeks ago
Revolgy - Cloud Developer

Revolgy

United Kingdom (Remote)
1 Week ago
Google - Program Manager, Google Distributed Cloud

Google

Wrocław, Lower Silesian Voivodeship, Poland (On-Site)
6 Days ago
PwC - AWS DataOps Engineer

PwC

Bengaluru, Karnataka, India (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Tesla - Construction Site Manager - MEP

Tesla

Brandenburg, Germany (On-Site)
2 Months ago
ByteDance - Cloud Technical Support

ByteDance

Singapore (On-Site)
1 Month ago
Patreon - Site Reliability Engineer

Patreon

New York, New York, United States (Remote)
4 Weeks ago
Tencent - SRE Intern

Tencent

(On-Site)
1 Month ago
NVIDIA - Senior SWQA Test Development Engineer

NVIDIA

Shanghai, Shanghai, China (On-Site)
1 Month ago
Rackspace Technology - DEVOP Engineer (AWS Terraform)-PSDE III

Rackspace Technology

India (Remote)
5 Months ago
ByteDance - Site Reliability Engineer, Edge Services

ByteDance

San Jose, California, United States (On-Site)
1 Week ago
Saviynt - Technical Lead, Professional Services - NA

Saviynt

Bengaluru, Karnataka, India (Hybrid)
6 Months ago
ByteDance - Site Reliability Engineer, Edge Services

ByteDance

Boston, Massachusetts, United States (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Westford, Massachusetts, United States

Epic Games - Senior Procurement Operations Analyst

Epic Games

New York, New York, United States (On-Site)
1 Week ago
ByteDance - Research Scientist, Data Management and Security

ByteDance

San Jose, California, United States (On-Site)
1 Month ago
Nintendo - Receiving Agent

Nintendo

New York, New York, United States (On-Site)
2 Months ago
Nintendo - Sr UX Writer

Nintendo

Redmond, Washington, United States (Hybrid)
4 Months ago
WebFX - Jr. Online Creative Designer

WebFX

Harrisburg, Pennsylvania, United States (On-Site)
6 Months ago
Forescout Technologies  Inc  - Professional Services Engineer

Forescout Technologies Inc

United States (Hybrid)
5 Months ago
Onward Search - Retention Marketing Manager

Onward Search

New York, United States (Hybrid)
1 Week ago
Devrev - Enterprise Account Executive- New York

Devrev

United States (Remote)
4 Months ago
Flow - Senior/Staff Backend Software Engineer

Flow

New York, New York, United States (Hybrid)
6 Months ago
Google - Software Engineer III, Infrastructure, Platforms Infrastructure Engineering

Google

Sunnyvale, California, United States (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

Omnissa - Member of technical staff (C++,iOS)

Omnissa

Bengaluru, Karnataka, India (Hybrid)
6 Months ago
Microsoft - Technical Support Engineer - Azure Identity

Microsoft

(On-Site)
1 Week ago
E-Hireo - Cloud Engineer

E-Hireo

Bengaluru, Karnataka, India (On-Site)
6 Months ago
Warner Bros Games - Staff Software Engineer - AWS Architecture (Observability Team)

Warner Bros Games

Bengaluru, Karnataka, India (Hybrid)
4 Months ago
Fortis Games - Senior DevOps Engineer

Fortis Games

Brazil (On-Site)
3 Months ago
Rackspace Technology - Cloud Database Engineer I/II

Rackspace Technology

Gurugram, Haryana, India (Remote)
1 Week ago
Microsoft - Technical Program Manager - Azure Core - Cloud Buildout Infrastructure & Lifecycle

Microsoft

Bucharest, Bucharest, Romania (On-Site)
1 Day ago
Nagarro - Senior Cloud Consultant

Nagarro

Germany (Remote)
1 Month ago
Canva - Senior Platform Engineer - Workload Integration

Canva

Surry Hills, New South Wales, Australia (Remote)
2 Months ago
Scanline VFX - Senior DevOps Engineer

Scanline VFX

Seoul, South Korea (Hybrid)
2 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug