DGX Cloud Infrastructure Engineering Intern - Fall 2025

13 Minutes ago • Upto 1 Years • Research & Development

Job Summary

Job Description

NVIDIA is seeking a DGX Cloud Infrastructure Engineering Intern for Fall 2025 to contribute to scaling its AI infrastructure. Responsibilities include designing and architecting a platform for automated GPU asset management across cloud providers; developing, testing, and optimizing solutions for datacenter firmware; collaborating with hardware, software, and business teams; defining server-level reliability requirements; driving failure analysis; and ensuring seamless software integration across the entire stack. The ideal candidate possesses a strong programming background, understands distributed systems, and has experience with software testing, deployment, and GPU computing. Strong communication and problem-solving skills are essential.
Must have:
  • Strong programming (C, C++, Python)
  • Distributed systems understanding
  • Software testing & deployment experience
  • GPU computing (CUDA, OpenCL)
  • Deep Learning Frameworks (PyTorch, TensorFlow)
Good to have:
  • Perl
  • OpenACC
  • Caffe
  • HPC (MPI, OpenMP)
  • Performance Modeling & Optimization
Perks:
  • Intern benefits

Job Details

NVIDIA is hiring engineers to scale up its AI Infrastructure. We expect you to have a strong programming background, a deep understanding of distributed systems, familiarity with software testing and deployment, and excellent communication and planning abilities. We also welcome out-of-the-box thinkers who can provide new ideas with strong at execution bias. Expect to be constantly challenged, improving, and evolving for the better. You and other engineers in this team will help advance NVIDIA's capacity to build and deploy leading infrastructure solutions for a broad range of AI-based applications that affect core data science. What are you waiting for if you're creative, passionate about what you do, and love having fun apply today!


What you’ll be doing:

  • We are designing and architecting a comprehensive platform that automates GPU asset provisioning, configuration, and lifecycle management across cloud providers.
  • Design, develop, test, debug, and optimize creative solutions for Datacenter firmware throughout lifecycle.
  • Work closely with hardware, software, infrastructure, and business teams to transform new firmware features from idea to reality.
  • Define server-level reliability, availability, and serviceability requirements in collaboration with various customers like CSPs and deliver fault resilient solution at scale as per customer expectations.
  • Collaborate with hardware, software and firmware teams to drive failure analysis and large scale solution deployment.
  • Work with engineering teams across NVIDIA to ensure your software integrates seamlessly from the hardware all the way up to the AI training applications.

What we need to see: 

  • Currently pursuing a Bachelor's, Master's, or PhD degree within Computer Engineering, Electrical Engineering, Computer Science, or a related field 
  • Course or internship experience related to the following areas required: Computer Architecture, Deep Learning or Machine Learning, GPU computing and Parallel Programming, Performance Modeling, profiling, optimizing, and/or analysis.
  • Prior experience or knowledge required on the following programming skills and technologies: C, C++, Python, Perl, GPU Computing (CUDA, OpenCL, OpenACC), Deep Learning Frameworks (PyTorch, TensorFlow, Caffe), HPC (MPI, OpenMP) 

The hourly rate for our interns is 18 USD - 71 USD. Our internship hourly rates are a standard pay determined based on the position and your location, year in school, degree, and experience.

You will also be eligible for Intern benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

Canva - Senior Machine Learning Engineer - Specialist Platform and Experience

Canva

Melbourne, Victoria, Australia (Remote)
1 Week ago
Balbix - AI/ML Architect

Balbix

Bengaluru, Karnataka, India (On-Site)
5 Months ago
DraftKings - Director of Data Science

DraftKings

Boston, Massachusetts, United States (On-Site)
3 Weeks ago
Canva - Senior Machine Learning Engineer - Photo AI

Canva

Prague, Czechia (Remote)
2 Months ago
NVIDIA - Senior Software Engineer, AI Resiliency

NVIDIA

Santa Clara, California, United States (On-Site)
3 Weeks ago
NVIDIA - Senior Math Libraries Engineer - Dense Linear Algebra

NVIDIA

California, United States (Hybrid)
2 Months ago
NVIDIA - Senior Firmware Engineer

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
1 Month ago
NVIDIA - Backend Engineer, Full Chip Layout

NVIDIA

Iași, Iași County, Romania (Remote)
4 Weeks ago
NVIDIA - Senior Mixed Signal Circuit Design Engineer

NVIDIA

Taipei City, Taiwan (On-Site)
1 Month ago
NVIDIA - GPU Firmware Engineer (RDSS Intern)

NVIDIA

Taipei City, Taiwan (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Rackspace Technology - AI/ML Architect

Rackspace Technology

Vietnam (Remote)
1 Week ago
Meta - Research Scientist Intern, Smart Glasses in Wearables AI (PhD)

Meta

Menlo Park, California, United States (On-Site)
4 Months ago
ByteDance - Software Engineer Intern (Machine Learning Platform) - 2024 Summer (PhD)

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago
Ciklum - Expert Data Scientist

Ciklum

Pune, Maharashtra, India (Hybrid)
5 Months ago
VGW - Senior Machine Learning Engineer

VGW

Sydney, New South Wales, Australia (On-Site)
1 Week ago
The Walt Disney Company - Lead Machine Learning Engineer

The Walt Disney Company

Washington, United States (On-Site)
2 Months ago
NVIDIA - Senior Research Engineer, Foundation Model Training Infrastructure

NVIDIA

Santa Clara, California, United States (On-Site)
2 Months ago
Trustana - Senior Data Engineer

Trustana

Gurugram, Haryana, India (Hybrid)
6 Months ago
Meta - Software Engineer, Machine Learning

Meta

San Francisco, California, United States (On-Site)
4 Months ago
DNEG - Head of Machine Learning

DNEG

London, England, United Kingdom (Remote)
2 Weeks ago

Get notifed when new similar jobs are uploaded

Jobs in Santa Clara, California, United States

Flying Bark Productions - Pipeline TD

Flying Bark Productions

California, United States (Hybrid)
3 Weeks ago
Google - Senior Software Engineer, Performance, Platforms Infrastructure Engineering

Google

Sunnyvale, California, United States (On-Site)
3 Months ago
ByteDance - Software Engineer Intern (Doubao (Seed) - Machine Learning System) - 2025 Summer (PhD)

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago
Next Level Business Services - Java Developer

Next Level Business Services

El Segundo, California, United States (On-Site)
5 Months ago
Trek - Assembler - Seasonal Part Time

Trek

Columbia, Maryland, United States (On-Site)
1 Month ago
ByteDance - Frontend Software Engineer Intern (PDI-CSP-FE-i18n )- 2025 Summer (BS/MS)

ByteDance

Seattle, Washington, United States (On-Site)
2 Months ago
Evolution - Online Game Presenter (Server Alternative) $20-$25/hr.

Evolution

Atlantic City, New Jersey, United States (On-Site)
7 Months ago
Zoox - Senior/Staff Software Engineer - Simulator

Zoox

Seattle, Washington, United States (Hybrid)
5 Months ago
ByteDance - Software Engineer Intern (Payment Risk - Global Payment)

ByteDance

San Jose, California, United States (On-Site)
1 Week ago
ByteDance - Music Product Counsel - Global Legal

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

Research & Development Jobs

Easygo - Software Engineering Manager

Easygo

Melbourne, Victoria, Australia (On-Site)
4 Months ago
NVIDIA - Senior Software Verification Engineer

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (Hybrid)
1 Month ago
Krafton  - PUBG Mobile Marketing Manager (Korea)

Krafton

Seoul, South Korea (On-Site)
1 Week ago
NVIDIA - Solutions Architect

NVIDIA

Taipei City, Taiwan (On-Site)
2 Months ago
NVIDIA - Senior Methodology Software Engineer

NVIDIA

Santa Clara, California, United States (On-Site)
2 Weeks ago
ByteDance - Student Researcher (Doubao (Seed) - Foundation Model - Speech Understanding) - 2025 Start (PhD)

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago
DigitalFish - Research Scientist, Computer Vision

DigitalFish

California, United States (Hybrid)
7 Months ago
Logitech - Firmware Engineering Manager (Gaming & Simulation)

Logitech

Chennai, Tamil Nadu, India (On-Site)
5 Months ago
N-iX - Senior C++ Engineer (High Performance Computing)

N-iX

United Kingdom (Remote)
1 Month ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.


Santa Clara, California, United States (On-Site)

Massachusetts, United States (Remote)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug