Senior DevOps Engineer - Accelerated Computing

NVIDIA

Job Summary

NVIDIA is seeking a Senior DevOps Engineer to build and maintain infrastructure and tools for accelerated computing, supporting AI applications, self-driving cars, and supercomputers. The role involves running builds and tests across various architectures, collecting and analyzing data, and collaborating to develop robust software solutions. Candidates need strong Linux, scripting, debugging, and troubleshooting skills, with a focus on distributed systems and software build processes.

Must Have

  • 6+ years of relevant industry experience
  • Proficient with Linux
  • Bachelor's degree in a related area of study or equivalent experience
  • Expert with scripting in Python, Perl, shell, Groovy, etc.
  • Strong background with deploying, configuring, and debugging distributed systems
  • Familiar with the software build process (compiling C++ code with GNU Make, CMake, Visual Studio, MSBuild, etc.)
  • Background with some form of source control management (SCM), preferably git
  • Familiar with containers

Good to Have

  • Experience with HPC hardware systems such as compute clusters and HPC software performance benchmarking on such systems
  • System administrator level experience with multi-user Linux servers
  • Background with GPU accelerated systems
  • Experience working in an environment where Agile processes and methodologies are used

Job Description

Job Requisition ID

JR2002534

Job Category

Engineering

Time Type

Full time

We are looking for a Senior DevOps Engineer to join our team, although Site Reliability Engineer, Build and Release Engineer, Continuous Integration Engineer.. all can be valid titles for this role. Our team builds software that finds its way into AI applications, self-driving cars, and some of the world's fastest supercomputers solving challenges in science, medicine, and engineering. We're looking for someone with strong integrity, reliability, persistence, problem-solving ability, and skills in Linux, scripting, debugging, and troubleshooting.

What you will be doing:

  • Running a lot of builds and tests on a lot of architectures, operating systems, and devices.
  • Collecting a lot of data and working collaboratively to brainstorm and build infrastructure and tools to make sense of it all.
  • Building relationships that allow us to work together as a team, not a group.
  • Working in a highly dynamic environment where we have to think on our feet.

What we need to see:

  • 6+ years of relevant industry experience.
  • Proficient with Linux.
  • Bachelors degree in a related area of study or equivalent experience.
  • Expert with scripting in one or more of Python, Perl, shell, Groovy, etc..
  • Strong background with deploying, configuring, and debugging distributed systems.
  • You should be familiar with the software build process (read compiling C++ code with GNU Make, CMake, Visual Studio, MSBuild, etc.).
  • Background with some form of source control management (SCM), preferably git.
  • Familiar with containers.

Ways to stand out from the crowd:

  • Experience with HPC hardware systems such as compute clusters and HPC software performance benchmarking on such systems.
  • System administrator level experience with multi-user Linux servers.
  • Background with GPU accelerated systems.
  • Experience working in an environment where Agile processes and methodologies are used.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most experienced and hard-working people in the world working for us. Are you creative and autonomous? Do you love a challenge? If so, we want to hear from you. Come help us build the real-time, efficient computing platform driving our success in the dynamic and quickly growing field Deep Learning and Artificial Intelligence.

#LI-Hybrid

Your base salary will be determined based on your location, experience, and the pay of employees in similar positions. For Poland: The base salary range is 292,500 PLN - 507,000 PLN for Level 4, and 375,000 PLN - 650,000 PLN for Level 5.

13 Skills Required For This Role

Problem Solving Github Cpp Game Texts Agile Development Linux Deep Learning Git Python Shell Perl Visual Studio C Make

Similar Jobs