Senior Deep Learning Systems Software Engineer - AI Infrastructure

1 Month ago • 5 Years + • Full Stack Development

Job Summary

Job Description

NVIDIA seeks a Senior Deep Learning Systems Software Engineer to optimize deep learning workloads on cutting-edge hardware and software. Responsibilities include analyzing, profiling, and optimizing workloads; building automation tools; collaborating with cross-functional teams; identifying and resolving performance bottlenecks; designing performance benchmarks; and providing guidance on cloud application optimization. The ideal candidate will have 5+ years of experience in application performance engineering, experience with large-scale GPU infrastructure, deep learning model architectures (PyTorch), application profiling tools (NVIDIA Nsight, Intel VTune), and strong programming skills (Python, C/C++). The role involves working across the hardware/software stack to achieve peak performance in deep learning training and inference.
Must have:
  • 5+ years application performance engineering experience
  • Large-scale multi-node GPU infrastructure experience
  • Deep learning model architectures & PyTorch expertise
  • Application profiling tools (NVIDIA NSight, Intel VTune)
  • Strong understanding of computer architecture and GPU architecture
  • Proficiency in Python and C/C++
Good to have:
  • CUDA or OpenCL experience
  • NVIDIA server and software ecosystem understanding
  • Experience with large-scale distributed systems
  • Hands-on experience with NVIDIA GPUs, HPC storage, networking, and cloud computing
  • In-depth understanding of storage systems, Linux file systems, and RDMA networking

Job Details

NVIDIA is an industry leader with groundbreaking developments in High-Performance Computing, Artificial Intelligence and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery and powers what were once science fiction inventions from artificial intelligence to autonomous cars. NVIDIA is seeking senior engineers who are mindful of performance analysis and optimization to help us squeeze every last clock cycle out of all facets of Deep Learning such as training and inferencing, one of today's most important workloads in the world. If you are unafraid to work across all layers of the hardware/software stack from GPU architecture to Deep Learning Framework to achieve peak performance, we want to hear from you! This role offers an opportunity to directly impact the hardware and software roadmap in a fast-growing technology company that leads the AI revolution while helping deep learning users around the globe enjoy ever-higher training speeds.

What you'll be doing:

  • Understand, analyze, profile, and optimize deep learning workloads on state-of-the-art hardware and software platforms.

  • Build tools to automate workload analysis, workload optimization, and other critical workflows.

  • Collaborate with cross-functional teams to analyze and optimize cloud application performance on diverse GPU architectures.

  • Identify bottlenecks and inefficiencies in application code and propose optimizations to enhance GPU utilization.

  • Drive end-to-end platform optimization from a hardware level to the application and service levels

  • Design and implement performance benchmarks and testing methodologies to evaluate application performance.

  • Provide guidance and recommendations on optimizing cloud-native applications for speed, scalability, and resource efficiency.

  • Share knowledge and best practices with domain expert teams as they transition applications to distributed environments.

What we need to see:

  • Masters in CS, EE or CSEE or equivalent experience

  • 5+ years of experience in application performance engineering

  • Experience using large scale multi node GPU infrastructure on premise or in CSPs

  • Background in deep learning model architectures and experience with Pytorch and large scale distributed training

  • Experience with application profiling tools such as NVIDIA NSight, Intel VTune etc.

  • Deep understanding of computer architecture, and familiarity with the fundamentals of GPU architecture. Experience with NVIDIA's Infrastructure and software stacks.

  • Proven experience analyzing, modeling and tuning DL application performance.

  • Proficiency in Python and C/C++ for analyzing and optimizing application code

Ways to stand out from the crowd:

  • Strong fundamentals in algorithms and GPU programming experience (CUDA or OpenCL)

  • Understanding of NVIDIA's server and software ecosystem

  • Hands-on experience in performance optimization and benchmarking on large-scale distributed systems

  • Hands-on experience with NVIDIA GPUs, HPC storage, networking, and cloud computing.

  • In-depth understanding storage systems, Linux file systems, RDMA networking

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you.

Similar Jobs

ZeniMax Media - Programmeur.se de serveur senior / Senior Server Programmer

ZeniMax Media

Montreal, Quebec, Canada (On-Site)
• 5 Months ago
Thatgamecompany - Game Engine Engineer (Optimization) - China

Thatgamecompany

Shanghai, Shanghai, China (On-Site)
• 5 Months ago
The Walt Disney Company - Lead Level Designer

The Walt Disney Company

Glendale, California, United States (Remote)
• 1 Week ago
Epic Games - Senior Gameplay Animation Programmer, LEGO Fortnite

Epic Games

Cary, North Carolina, United States (On-Site)
• 1 Month ago
Milestone - Lead Software Engineer

Milestone

United States (Remote)
• 1 Day ago
Xsolla - Software Manager

Xsolla

Los Angeles, California, United States (Hybrid)
• 2 Weeks ago
Meta - Software Engineer, Product

Meta

San Francisco, California, United States (Remote)
• 3 Months ago
PwC - IN_Associate – Crm Technical -Ms Dynamics– Advisory -Hyderabad

PwC

Hyderabad, Telangana, India (On-Site)
• 3 Months ago
Zuru - Senior Node JS Developer

Zuru

Ahmedabad, Gujarat, India (On-Site)
• 4 Months ago
The Mill Adventure - Senior Back-End Developer

The Mill Adventure

St. Julian's, Malta (Remote)
• 4 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Gearbox Software - Technical Director, SDK

Gearbox Software

Frisco, Texas, United States (On-Site)
• 2 Months ago
Nintendo - CONTRACT - Embedded Engineer, Display (NTD)

Nintendo

Redmond, Washington, United States (On-Site)
• 2 Months ago
Epic Games - Senior SDET

Epic Games

(On-Site)
• 1 Week ago
Hasbro - Intern - Software Development Engineer (Summer 2025)

Hasbro

Renton, Washington, United States (On-Site)
• 1 Month ago
The Walt Disney Company - Pipeline Technical Director, Level II

The Walt Disney Company

Vancouver, British Columbia, Canada (Hybrid)
• 3 Months ago
Activision - Expert Engineer, Accounts and Authentication - Demonware (Dublin)

Activision

Dublin, County Dublin, Ireland (On-Site)
• 1 Week ago
Electronic Arts - Security Software Engineer

Electronic Arts

Vancouver, British Columbia, Canada (On-Site)
• 4 Months ago
Ubisoft - Senior ML Programmer

Ubisoft

Montreal, Quebec, Canada (On-Site)
• 1 Month ago
Reversing Labs - Application Security Architect

Reversing Labs

United Kingdom (Remote)
• 3 Weeks ago

Get notifed when new similar jobs are uploaded

Jobs in Bengaluru, Karnataka, India

Qt Group - Software Engineer

Qt Group

Bengaluru, Karnataka, India (On-Site)
• 4 Months ago
Starkflow - Talent Acquisition Specialist

Starkflow

Karnataka, India (Hybrid)
• 1 Week ago
PwC - Associate - AppTech - GDC

PwC

Kolkata, West Bengal, India (On-Site)
• 4 Months ago
Landis+Gyr - Senior Frontend Developer (React Native)

Landis+Gyr

Noida, Uttar Pradesh, India (Hybrid)
• 3 Months ago
PwC - Oracle Finance Functional - Senior Associate

PwC

Hyderabad, Telangana, India (On-Site)
• 4 Months ago
PhonePe - Channel Sales Manager Rural - Referral

PhonePe

Bengaluru, Karnataka, India (On-Site)
• 3 Months ago
GoTo Group - Software Engineer (iOS) - Engineering Platform

GoTo Group

Bengaluru, Karnataka, India (On-Site)
• 4 Months ago
PwC - IN_Associate_Implementation of Compliance_Contract and Compliance_TRS_Bengaluru

PwC

Bengaluru, Karnataka, India (On-Site)
• 4 Months ago
PhonePe - Engineering Manager - Frontend

PhonePe

Bengaluru, Karnataka, India (On-Site)
• 3 Months ago

Get notifed when new similar jobs are uploaded

Full Stack Development Jobs

Google - Software Engineer II, Full Stack, Subscription Services

Google

Bengaluru, Karnataka, India (On-Site)
• 1 Month ago
Next Level Business Services - Xymon Integration Engineer

Next Level Business Services

Mount Laurel Township, New Jersey, United States (On-Site)
• 4 Months ago
Next Level Business Services - JavaScript Developer with Full stack Experience

Next Level Business Services

Dallas, Texas, United States (On-Site)
• 4 Months ago
Blazesoft - Front-End Developer

Blazesoft

Vaughan, Ontario, Canada (On-Site)
• 1 Month ago
Warner Bros Games - Senior Software Engineer

Warner Bros Games

(Hybrid)
• 4 Weeks ago
Google - Staff Software Engineer, Geo

Google

Seattle, Washington, United States (On-Site)
• 1 Month ago
PhonePe - Software Engineer - Test (4-6yrs) - (Pune)

PhonePe

Pune, Maharashtra, India (On-Site)
• 3 Months ago
PwC - Associate_ Java developer _Application  Technology_Advisory_Bangalore

PwC

Bengaluru, Karnataka, India (On-Site)
• 4 Months ago
Crunchyroll - Staff Software Engineer, Content Delivery

Crunchyroll

San Francisco, California, United States (Remote)
• 2 Months ago
IGN - Senior Full Stack Software Engineer

IGN

Los Angeles, California, United States (Hybrid)
• 3 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.


Yokne'am Illit, North District, Israel (On-Site)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (On-Site)

United States (Remote)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Bengaluru, Karnataka, India (Hybrid)

Bengaluru, Karnataka, India (Hybrid)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug