Senior Infrastructure Software Engineer, Deep Learning Libraries

2 Months ago • 3 Years + • DevOps • Full Stack Development • $148,000 PA - $287,500 PA

Job Summary

Job Description

NVIDIA's Deep Learning Libraries Group seeks a Senior Infrastructure Software Engineer to design and develop scalable, modular infrastructure for deep learning libraries (cuDNN, TensorRT, CUDA). Responsibilities include building automation for build, test, and release processes; developing across the software stack (UI to database layers); configuring and maintaining tools like Kubernetes, Jenkins, Docker, and CMake; and developing front-end solutions using HTML, CSS, JavaScript. The role requires expertise in continuous integration systems, SCM, and build systems. The engineer will work to improve development velocity across NVIDIA's AI/DL/Compute Software projects.
Must have:
  • 3+ years relevant experience
  • Strong Python & C/C++ skills
  • CI/CD system automation experience
  • Experience with HTML5, CSS, NodeJS, or React
  • SCM & build system fluency (Git, CMake)
  • Master's degree in CS/CE or equivalent
Good to have:
  • Jenkins automation with Groovy
  • Experience with Kubernetes
  • Unit/integration test framework design
  • Mobile/embedded platform experience
  • Experience with multiple OS (Ubuntu, RedHat, Windows, QNX)
Perks:
  • Equity
  • Benefits

Job Details

We are now looking for a Senior Infrastructure Software Engineer for Deep Learning Libraries!

NVIDIA's Deep Learning Libraries Group is seeking excellent software engineers to enable the next wave of NVIDIA’s highest performing deep learning libraries. The role spans multiple products, including cuDNN, TensorRT, and CUDA kernel libraries. The mission is to design and develop scalable, modular infrastructure that streamlines development, builds, and tests across NVIDIA’s diverse set of platforms, from Drive AGX for autonomous vehicles to DGX servers for datacenters and large language models. Join our technically diverse team of software engineers and infrastructure experts to design the systems that enable NVIDIA to stay ahead of the competition as we deliver the world's fastest deep learning platforms.

What you'll be doing:

  • Designing and developing software for testing and analysis of our codebases

  • Building scalable automation for build, test, integration, and release processes for publicly distributed deep learning libraries

  • Developing throughout the software stack, from the user experience and user interfaces down to the cluster and database layers

  • Configuring, maintaining, and building upon deployments of industry-standard tools (e.g. Kubernetes, Jenkins, Docker, CMake, Gitlab, Jira, etc.)

  • Develop front-end solutions using HTML, CSS, JavaScript, and related web technologies

  • Advancing the state of the art in those industry-standard tools

What we need to see:

  • A Masters Degree in Computer Science or Computer Engineering or equivalent experience.

  • 3+ years of relevant experience

  • Strong programming skills in Python (or similar) and familiarity with C/C++ development

  • Experience setting up, maintaining, and automating continuous integration systems (e.g. Jenkins, GitHub Actions, GitLab pipelines, Azure DevOps)

  • Experience in HTML5, CSS, NodeJS, or React

  • Fluency in SCM (e.g. Git, Perforce) and build systems (e.g. Make, CMake, Bazel)

Ways to stand out from the crowd:

  • Experience designing and developing automation in Jenkins with Groovy (or similar)

  • Background with distributed systems and cluster/cloud computing, especially with Kubernetes

  • Experience designing and developing unit and integration test frameworks

  • Experience with mobile/embedded platforms and multiple operating systems (Ubuntu, RedHat, Windows, QNX, or similar)

  • Track record of identifying useful new technologies and incorporating them into SW development flows

This is an opportunity to have a wide impact at NVIDIA by improving development velocity across our many AI/DL/Compute Software projects. Are you creative, driven, and autonomous? Do you love a challenge? If so, we want to hear from you!

The base salary range is 148,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

NVIDIA - Senior Math Libraries Engineers - Python APIs

NVIDIA

Remote, Oregon, United States (Remote)
2 Months ago
Thatgamecompany - Product Data Scientist

Thatgamecompany

United States (Remote)
1 Month ago
Flying Bark Productions - Rigging & Animation Software Developer

Flying Bark Productions

Sydney, New South Wales, Australia (Hybrid)
1 Month ago
NVIDIA - Senior AI Training Performance Engineer

NVIDIA

Shanghai, Shanghai, China (Hybrid)
3 Months ago
Microsoft - Principal Software Engineer

Microsoft

Vancouver, British Columbia, Canada (On-Site)
3 Weeks ago
Google - Customer Engineer, Application Modernization, Google Cloud

Google

Warsaw, Masovian Voivodeship, Poland (On-Site)
3 Weeks ago
Newrick Network - AWS DevOps Engineer

Newrick Network

Ontario, Canada (Remote)
1 Month ago
Google - Customer Engineer, Data Management, Google Cloud

Google

Riyadh, Riyadh Province, Saudi Arabia (On-Site)
3 Weeks ago
Rackspace Technology - Trainee Cloud Engineer

Rackspace Technology

Dubai, Dubai, United Arab Emirates (Hybrid)
1 Month ago
Argus Labs - Site Reliability Engineer

Argus Labs

Indonesia (Remote)
4 Weeks ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Applike Group - Senior Data Scientist (Recommendation Systems Expert) (f/m/d)

Applike Group

Hamburg, Hamburg, Germany (Hybrid)
6 Months ago
NVIDIA - Senior HPC AI Cluster Engineer

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
3 Months ago
NVIDIA - Manager, Substrate Planning - Chips Supply Planning

NVIDIA

Santa Clara, California, United States (On-Site)
1 Month ago
Kenvue - GenAI ML Engineer, Data Science

Kenvue

Bengaluru, Karnataka, India (Hybrid)
7 Months ago
ByteDance - Machine Learning Engineer Intern (Search-TikTok Recommendation)

ByteDance

Seattle, Washington, United States (On-Site)
4 Weeks ago
Microsoft - Senior Principal Researcher - Deep Learning & AI

Microsoft

New York, New York, United States (On-Site)
3 Weeks ago
NVIDIA - Senior Software Architect, AI Networking

NVIDIA

Santa Clara, California, United States (Remote)
1 Month ago
Enterprise Bot - Data Scientist

Enterprise Bot

Bengaluru, Karnataka, India (On-Site)
6 Months ago
Netflix - ML Software Engineer (L4/L5) - Media Algorithms

Netflix

Los Angeles, California, United States (On-Site)
3 Weeks ago
ByteDance - Ad Delivery Algorithm Intern - Game

ByteDance

Singapore (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Santa Clara, California, United States

GoMotive - Director of Product Management, AI

GoMotive

United States (Remote)
2 Months ago
Google - Senior Verification Engineer, Supply Chain and Operations

Google

Austell, Georgia, United States (On-Site)
3 Weeks ago
Google - Software Engineer III, Generative AI, Google Workspace

Google

Sunnyvale, California, United States (On-Site)
3 Weeks ago
Inworld AI - AI Trainer (Contractor) - Writing & Gaming

Inworld AI

Mountain View, California, United States (Remote)
1 Month ago
Gearbox Software - Technical Director, SDK

Gearbox Software

Frisco, Texas, United States (On-Site)
5 Months ago
ByteDance - Software Engineer Intern

ByteDance

Seattle, Washington, United States (On-Site)
1 Month ago
Onward Search - Java Developer

Onward Search

San Jose, California, United States (Hybrid)
1 Month ago
Netflix - Senior Researcher - Studio Insights, Promotional Media

Netflix

Los Gatos, California, United States (On-Site)
3 Weeks ago
Nagarro - Senior Staff Engineer, Java Fullstack

Nagarro

Jacksonville, Florida, United States (On-Site)
6 Months ago
The Walt Disney Company - Vice President, Local Sales Mid-Atlantic

The Walt Disney Company

Philadelphia, Pennsylvania, United States (On-Site)
4 Weeks ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

Starkflow - Oracle Integration Cloud Consultant

Starkflow

(Remote)
3 Weeks ago
Quizizz - Platform Engineer

Quizizz

Bengaluru, Karnataka, India (On-Site)
3 Months ago
Next Level Business Services - Hadoop AWS Developer

Next Level Business Services

Beaverton, Oregon, United States (On-Site)
6 Months ago
Zeta - Data Reliability Engineer II

Zeta

Hyderabad, Telangana, India (On-Site)
6 Months ago
Microsoft - Senior Software Engineer

Microsoft

(On-Site)
3 Weeks ago
Riot Games - Staff Software Engineer - Infrastructure Reliability

Riot Games

Los Angeles, California, United States (On-Site)
3 Months ago
N-iX - DevOps Engineer

N-iX

Poland (Hybrid)
4 Weeks ago
Ubisoft - Back-End Golang Developer

Ubisoft

Montreal, Quebec, Canada (On-Site)
2 Months ago
Microsoft - Technical Support Engineer - Spark Databricks

Microsoft

Lisbon, Lisbon, Portugal (Hybrid)
3 Weeks ago
Google - Software Engineering Manager, Privacy Sandbox, Cloud Computing

Google

Kraków, Lesser Poland Voivodeship, Poland (On-Site)
3 Weeks ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Massachusetts, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Texas, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug