Performance Engineer Intern, Deep Learning and HPC

3 Months ago • Upto 1 Years • Research & Development

Job Summary

Job Description

This internship involves benchmarking, profiling, and analyzing the performance of AI workloads (LLM training and inference) and HPC on NVIDIA supercomputers. Responsibilities include creating reports with testing data for various internal teams, setting up and configuring systems, debugging performance issues, developing Python automation scripts, and assisting in developing automated testing tools and processes. The intern will collaborate with multi-functional teams in a dynamic environment where priorities may shift frequently. The role requires experience with Linux, container platforms (Docker/Singularity), and compiling/running software from source code.
Must have:
  • Python scripting
  • Data analysis skills
  • Linux experience
  • Benchmarking experience
  • Problem-solving skills
Good to have:
  • GPU/CPU benchmarking
  • ML/DL frameworks (TensorFlow/PyTorch)
  • AI model development
  • Cloud provisioning tools (Kubernetes, SLURM)
  • Testing automation

Job Details

We are now looking for a Performance Engineer Intern to support our growing investments in perf testing of various company datacenter products and applications.

Today, NVIDIA is tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world, all while striving to deliver the highest possible performance of our products.

You will be part of global Performance Lab team, improving our capacity to expertly and accurately benchmark state-of-the-art datacenter applications and products. We also work to develop new scripts that enhance the team’s ability to gather data through automation and designing efficient processes for testing a wide variety of applications and hardware. The data that we collect drives marketing/sales collaterals as well as engineering studies for current and future products. You will have the opportunity to work with multi-functional teams and in a dynamic environment where multiple projects will be active at once and priorities may shift frequently.

What you’ll be doing:

  • Benchmark, profile, and analyze the performance of AI workloads specifically tailored for large-scale LLM training and inference, as well as High-Performance Computing (HPC) on NVIDIA supercomputers and distributed systems.

  • Aggregate and produce written and visual reports with the testing data for internal sales, marketing, SW, and HW teams

  • Setup and configure systems with appropriate hardware and software to run benchmarks

  • Collaborate with internal teams to debug and improve performance issues

  • Develop Python scripts to automate the testing of various applications

  • Assist with the development of tools and processes that improve our ability to perform automated testing

What we need to see:

  • Currently pursuing a Bachelor's degree (or higher) in Computer Science, Electrical Engineering, or a related field.

  • Experienced in programming and debugging with scripting languages such as Python or Unix shell.

  • Strong data analysis skills and the ability to summarize findings in a written report

  • Hands-on experience with Linux based systems. Familiarity using a container platform such as Docker or Singularity. Experience with compiling and running software from source code.

  • Fast and self-learning capabilities with strong analytical and problem-solving skills.

  • Good English verbal and written interpersonal skills to improve collaboration with coworkers

Ways to stand out from the crowd:

  • Background with GPU/CPU benchmarking

  • Familiar with ML/DL techniques, algorithms and frameworks like TensorFlow or PyTorch.

  • Experience in AI model development, training, evaluation and deployment on Cloud, Cluster or on-premises. Familiar with cloud provisioning and scheduling tools (Kubernetes, SLURM).

  • Exposure to testing automation for various applications.

We have some of the most forward thinking and hardworking people in the world working for us and our best-in-class engineering teams are rapidly growing. We are building a team that will help shape the future of data center computing. If you are passionate about new technologies, care about improving efficiency and quality, and want to be at the forefront of AI & HPC & Gaming, we would love for you to join us.

Similar Jobs

Section 9 Interactive - 3D Artist

Section 9 Interactive

Malmö, Skåne County, Sweden (Hybrid)
1 Year ago
ByteDance - Site Reliability Engineer (Cloud Native Platform) - Traffic Infrastructure

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
ByteDance - Senior Machine Learning Ops Engineer, ML System - Foundation Model

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
Garena - Database Engineer/Senior Engineer

Garena

Singapore (On-Site)
1 Month ago
Garena - Garena - Operation Engineer (Game System Operations Engineer)

Garena

Taipei City, Taiwan (On-Site)
2 Months ago
NVIDIA - Physical Design CAD Engineer

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (Hybrid)
2 Months ago
Qt Group - Software Engineer

Qt Group

Bengaluru, Karnataka, India (On-Site)
7 Months ago
Google - Senior Staff Software Engineer, Google Cloud

Google

Hyderabad, Telangana, India (On-Site)
6 Months ago
NVIDIA - Senior Malware Research Architect

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Truecaller - Senior MLOps Engineer

Truecaller

Stockholm, Stockholm County, Sweden (On-Site)
6 Months ago
Google - Technical Solutions Engineer, Infrastructure Compute

Google

Pune, Maharashtra, India (On-Site)
1 Month ago
Meta - Software Engineer, Machine Learning

Meta

Mountain View, California, United States (On-Site)
6 Months ago
Every matrix - Database Administrator

Every matrix

Bucharest, Bucharest, Romania (Hybrid)
2 Months ago
PwC - IN- Senior Associate_ DevOps_Advisory Corporate_Advisory _Bangalore

PwC

Bengaluru, Karnataka, India (On-Site)
7 Months ago
Axon - Senior Security Engineer

Axon

Scottsdale, Arizona, United States (Hybrid)
5 Months ago
ByteDance - Global Site Reliability Engineer Lead - Security Engineering - San Jose

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
Interactive Brokers - Java Software Engineer

Interactive Brokers

Zug, Zug, Switzerland (On-Site)
7 Months ago
ByteDance - Research Scientist, Foundation Model, Speech & Audio

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
Fluence - Controls Software Engineer II

Fluence

Houston, Texas, United States (Hybrid)
7 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Shanghai, Shanghai, China

Tencent - Global Communications Manager - Games

Tencent

Shenzhen, Guangdong Province, China (On-Site)
8 Months ago
Tencent - 安全技术开发

Tencent

Shenzhen, Guangdong Province, China (On-Site)
5 Months ago
Ourpalm - MMO System Planner

Ourpalm

Beijing, Beijing, China (On-Site)
1 Month ago
Tencent - UGC Operation - PUBG Mobile

Tencent

Shenzhen, Guangdong Province, China (On-Site)
2 Months ago
NVIDIA - Hardware Application Engineer, Ethernet Switch

NVIDIA

Beijing, Beijing, China (Hybrid)
3 Months ago
Virtuos - Operations Director

Virtuos

China (On-Site)
1 Month ago
Tencent - Senior Animation Designer - Global Realistic 3A Action Game

Tencent

Shenzhen, Guangdong Province, China (On-Site)
2 Months ago
undefined - Scenario mode FO

Beijing, Beijing, China (On-Site)
10 Months ago
Riot Games - Game Designer, Level Design - R&D

Riot Games

Shanghai, Shanghai, China (On-Site)
1 Month ago
Google - Account Manager, Gaming, Large Customer Sales

Google

Shanghai, Shanghai, China (On-Site)
3 Weeks ago

Get notifed when new similar jobs are uploaded

Research & Development Jobs

Meta - Research Scientist, Machine Learning (PhD)

Meta

Pittsburgh, Pennsylvania, United States (On-Site)
6 Months ago
ByteDance - Research Engineer Intern (Doubao (Seed) - Machine Learning System) - 2025 Summer (MS)

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
Google - ASIC Design Verification Engineer

Google

Madison, Wisconsin, United States (On-Site)
3 Weeks ago
ByteDance - Site Reliability Engineer, ML System - Foundation Model

ByteDance

Seattle, Washington, United States (On-Site)
2 Months ago
Regent Craft - Propulsion Engineering Intern

Regent Craft

North Kingstown, Rhode Island, United States (On-Site)
7 Months ago
NVIDIA - Design Engineer, Coherent High Speed Interconnect

NVIDIA

Santa Clara, California, United States (On-Site)
1 Month ago
NVIDIA - Stress Simulation Engineer - Test

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
4 Months ago
The Walt Disney Company - Software Engineer, Platform

The Walt Disney Company

California, United States (On-Site)
2 Months ago
ByteDance - Senior Machine Learning Ops Engineer, ML System

ByteDance

Seattle, Washington, United States (On-Site)
6 Months ago
Rockstar Games - Development Support

Rockstar Games

Dundee, Scotland, United Kingdom (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Massachusetts, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Texas, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (Hybrid)

Pune, Maharashtra, India (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug