Deep Learning Engineer, Datacenters

14 Minutes ago • 3 Years + • Research & Development

Job Summary

Job Description

NVIDIA's Deep Learning Engineer in Datacenters will help develop software infrastructure to analyze deep learning applications, evolve cost-efficient datacenter architectures for LLMs, and work with experts to develop analysis and profiling tools in Python, bash, and C++. Responsibilities involve analyzing system and software characteristics of DL applications, developing analysis tools, and measuring key performance metrics to estimate efficiency improvements. The role requires collaboration with various teams across NVIDIA, from research to silicon architecture. The ideal candidate will have experience with system software, GPU kernels, or DL frameworks and a strong understanding of system architecture and performance.
Must have:
  • Bachelor's degree in EE/CS (Master's/PhD preferred)
  • 3+ years relevant experience
  • System software/Silicon architecture experience
  • C/C++ and Python programming
  • Deep Learning application analysis
Good to have:
  • CUDA, PyTorch, TensorFlow
  • Containerization (Docker), Slurm
  • Performance monitoring tools (perf, gprof)
  • Performance modeling (CPU, GPU, Memory, Network)
  • Multi-site/functional team experience

Job Details

As NVIDIA makes inroads into the Datacenter business, our team plays a central role in getting the most out of our exponentially growing datacenter deployments as well as establishing a data-driven approach to hardware design and system software development. We collaborate with a broad cross section of teams at NVIDIA ranging from DL research teams to CUDA Kernel and DL Framework development teams, to Silicon Architecture Teams. As our team grows, and as we seek to identify and take advantage of long term opportunities, our skillset needs are expanding as well.

Do you want to influence the development of high-performance Datacenters designed for the future of AI? Do you have an interest in system architecture and performance? In this role you will find how CPU, GPU, networking, and IO relate to deep learning (DL) architectures for Natural Language Processing, Computer Vision, Autonomous Driving and other technologies. Come join our team, and bring your interests to help us optimize our next generation systems and Deep Learning Software Stack.

What you'll be doing:

  • Help develop software infrastructure to characterize and analyze a broad range Deep Learning applications
  • Evolve cost-efficient datacenter architectures tailored to meet the needs of Large Language Models (LLMs).
  • Work with experts to help develop analysis and profiling tools in Python, bash and C++ to measure key performance metrics of DL workloads running on Nvidia systems.
  • Analyze system and software characteristics of DL applications.
  • Develop analysis tools and methodologies to measure key performance metrics and to estimate potential for efficiency improvement.

What we need to see:

  • A Bachelor’s degree in Electrical Engineering or Computer Science with 3 years or more of relevant experience (Masters or PhD degree preferred)
  • Experience in at least one of the following:
    • System Software: Operating Systems (Linux), Compilers, GPU kernels (CUDA), DL Frameworks (PyTorch, TensorFlow).
    • Silicon Architecture and Performance Modeling/Analysis: CPU, GPU, Memory or Network Architecture
  • Experience programming in C/C++ and Python. Exposure to Containerization Platforms (docker) and Datacenter Workload Managers (slurm) is a plus
  • Demonstrated ability to work in virtual environments, and a strong drive to own tasks from beginning to end. Prior experience with such environments will make you stand out.

Ways to stand out from the crowd:

  • Background with system software, Operating system intrinsics, GPU kernels (CUDA), or DL Frameworks (PyTorch, TensorFlow).

  • Experience with silicon performance monitoring or profiling tools (e.g. perf, gprof, nvidia-smi, dcgm).

  • In depth performance modeling experience in any one of CPU, GPU, Memory or Network Architecture

  • Exposure to Containerization Platforms (docker) and Datacenter Workload Managers (slurm).

  • Prior experience with multi-site teams or multi-functional teams.

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people on the planet working for us. If you're creative and autonomous, we want to hear from you!

#LI-Hybrid

Similar Jobs

Huuuge Games - Senior Java Software Engineer

Huuuge Games

Warsaw, Masovian Voivodeship, Poland (Hybrid)
6 Days ago
GoReel - DevOps Lead

GoReel

Romania (Remote)
5 Days ago
ByteDance - Senior/Tech Lead Network Software Development Engineer, Switch - San Jose

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
Wargaming - DevOps Engineer (Deployment team)

Wargaming

Nicosia, Nicosia, Cyprus (On-Site)
1 Week ago
The Walt Disney Company - Senior Pipeline Engineer

The Walt Disney Company

Glendale, California, United States (On-Site)
1 Week ago
ByteDance - Senior Software Development Engineer - Database NoSQL Redis

ByteDance

Seattle, Washington, United States (On-Site)
2 Months ago
Nielsen Holdings - Staff Machine learning Engineer

Nielsen Holdings

Bengaluru, Karnataka, India (Hybrid)
2 Months ago
Power Integrations - Senior Failure Analysis Engineer

Power Integrations

Penang, Malaysia (On-Site)
5 Months ago
Krafton  - Vulnerability Diagnosis Engineer

Krafton

Seoul, South Korea (On-Site)
5 Days ago
Riot Games - VFX Artist II - VALORANT, Premium Content

Riot Games

United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

NVIDIA - Senior Technical Instructor - AI and Data Center Infrastructure

NVIDIA

United Kingdom (Remote)
3 Weeks ago
Every matrix - Application Security Engineer

Every matrix

Bucharest, Bucharest, Romania (Hybrid)
2 Months ago
Onward Search - DevOps Engineer

Onward Search

Irvine, California, United States (Hybrid)
1 Month ago
Luxoft - DevOps Engineer with Azure

Luxoft

Pune, Maharashtra, India (On-Site)
3 Months ago
Google - Senior Cyber Security Consultant, Google Public Sector

Google

Reston, Virginia, United States (On-Site)
3 Months ago
Scopely - Software Engineer

Scopely

Bengaluru, Karnataka, India (Hybrid)
2 Months ago
Aristocrat Gaming - CI/CD Specialist

Aristocrat Gaming

Montreal, Quebec, Canada (Hybrid)
6 Days ago
Ajmera Infotech - DevOps Engineer

Ajmera Infotech

San Jose, California, United States (On-Site)
6 Months ago
Nielsen Holdings - Devops Engineer (026)

Nielsen Holdings

Bengaluru, Karnataka, India (Hybrid)
3 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Bengaluru, Karnataka, India

Velotio Technologies - Cloud Security Engineer

Velotio Technologies

Pune, Maharashtra, India (Remote)
6 Days ago
Passion Gaming - Customer Service Executive (Kannada)

Passion Gaming

Panchkula, Haryana, India (On-Site)
1 Year ago
Abnormal Security - Senior Software Engineer - Fullstack

Abnormal Security

Bengaluru, Karnataka, India (Hybrid)
5 Months ago
Cadence - Principal Solutions Engineer - AE

Cadence

Noida, Uttar Pradesh, India (On-Site)
6 Months ago
Zeta - Specialist  Process & Compliance

Zeta

Bengaluru, Karnataka, India (On-Site)
5 Months ago
Trek - Product Support Analyst (ERP)

Trek

Haryana, India (On-Site)
1 Month ago
InMobiInMobi - Lead HR Generalist

InMobiInMobi

Bengaluru, Karnataka, India (On-Site)
4 Weeks ago
Nielsen Holdings - Senior Devops Engineer-Sports(T)

Nielsen Holdings

Bengaluru, Karnataka, India (Hybrid)
3 Months ago
Zones - Network Administrator L2

Zones

Bengaluru, Karnataka, India (On-Site)
4 Months ago
Logitech - Kinaxis Senior Analyst

Logitech

Mumbai, Maharashtra, India (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

Research & Development Jobs

Google - Student Researcher, PhD, Winter/Summer 2025

Google

(On-Site)
4 Months ago
Vigaet - Internship - Mechanical Engineer

Vigaet

Bengaluru, Karnataka, India (On-Site)
5 Months ago
Rivos - SOC Electrical Analysis Engineer - Full Time

Rivos

Bengaluru, Karnataka, India (Hybrid)
5 Months ago
Tencent - Software Engineering Associate 104534

Tencent

Singapore (On-Site)
3 Months ago
Riot Games - Senior Software Engineer - VALORANT - Foundations Developer Experience & Workflows

Riot Games

Dublin, County Dublin, Ireland (On-Site)
4 Months ago
NVIDIA - ASIC Physical Design Engineer

NVIDIA

Shanghai, Shanghai, China (On-Site)
2 Months ago
NVIDIA - Senior Software Engineer

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
1 Month ago
NVIDIA - Senior Resiliency and Safety Architect

NVIDIA

Canada (On-Site)
3 Weeks ago
NVIDIA - BMC Firmware Developer

NVIDIA

Taipei City, Taiwan (On-Site)
2 Months ago
Samsung Semiconductor - Staff Engineer, Design Verification

Samsung Semiconductor

San Jose, California, United States (On-Site)
6 Days ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.


Pune, Maharashtra, India (On-Site)

Bengaluru, Karnataka, India (Hybrid)

Karnataka, India (Hybrid)

Courbevoie, Île-de-France, France (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug