Performance Engineer Intern, Deep Learning and HPC

2 Months ago • Upto 1 Years • Research & Development

Job Summary

Job Description

This internship involves benchmarking, profiling, and analyzing the performance of AI workloads (LLM training and inference) and HPC on NVIDIA supercomputers and distributed systems. The intern will generate reports, set up systems, debug performance issues, develop Python automation scripts, and assist in developing automated testing tools and processes. Collaboration with internal teams across sales, marketing, software, and hardware is key. The role requires strong data analysis, Linux experience, and familiarity with container platforms like Docker or Singularity. The intern will contribute to improving the efficiency and quality of NVIDIA's data center computing products and applications.
Must have:
  • Python/Unix scripting
  • Data analysis & reporting
  • Linux system experience
  • Container platform familiarity (Docker/Singularity)
  • Software compilation & execution
Good to have:
  • GPU/CPU benchmarking
  • ML/DL techniques (TensorFlow/PyTorch)
  • AI model development & deployment
  • Cloud provisioning (Kubernetes, SLURM)
  • Testing automation

Job Details

We are now looking for a Performance Engineer Intern to support our growing investments in perf testing of various company datacenter products and applications.

Today, NVIDIA is tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world, all while striving to deliver the highest possible performance of our products.

You will be part of global Performance Lab team, improving our capacity to expertly and accurately benchmark state-of-the-art datacenter applications and products. We also work to develop new scripts that enhance the team’s ability to gather data through automation and designing efficient processes for testing a wide variety of applications and hardware. The data that we collect drives marketing/sales collaterals as well as engineering studies for current and future products. You will have the opportunity to work with multi-functional teams and in a dynamic environment where multiple projects will be active at once and priorities may shift frequently.

What you’ll be doing:

  • Benchmark, profile, and analyze the performance of AI workloads specifically tailored for large-scale LLM training and inference, as well as High-Performance Computing (HPC) on NVIDIA supercomputers and distributed systems.

  • Aggregate and produce written and visual reports with the testing data for internal sales, marketing, SW, and HW teams

  • Setup and configure systems with appropriate hardware and software to run benchmarks

  • Collaborate with internal teams to debug and improve performance issues

  • Develop Python scripts to automate the testing of various applications

  • Assist with the development of tools and processes that improve our ability to perform automated testing

What we need to see:

  • Currently pursuing a Bachelor's degree (or higher) in Computer Science, Electrical Engineering, or a related field.

  • Experienced in programming and debugging with scripting languages such as Python or Unix shell.

  • Strong data analysis skills and the ability to summarize findings in a written report

  • Hands-on experience with Linux based systems. Familiarity using a container platform such as Docker or Singularity. Experience with compiling and running software from source code.

  • Fast and self-learning capabilities with strong analytical and problem-solving skills.

  • Good English verbal and written interpersonal skills to improve collaboration with coworkers

Ways to stand out from the crowd:

  • Background with GPU/CPU benchmarking

  • Familiar with ML/DL techniques, algorithms and frameworks like TensorFlow or PyTorch.

  • Experience in AI model development, training, evaluation and deployment on Cloud, Cluster or on-premises. Familiar with cloud provisioning and scheduling tools (Kubernetes, SLURM).

  • Exposure to testing automation for various applications.

We have some of the most forward thinking and hardworking people in the world working for us and our best-in-class engineering teams are rapidly growing. We are building a team that will help shape the future of data center computing. If you are passionate about new technologies, care about improving efficiency and quality, and want to be at the forefront of AI & HPC & Gaming, we would love for you to join us.

Similar Jobs

Easybrain - Senior Data Engineer

Easybrain

Cyprus (On-Site)
8 Months ago
NVIDIA - Senior SRAM Engineer, Circuit Design

NVIDIA

Santa Clara, California, United States (Hybrid)
1 Month ago
NVIDIA - Senior SWQA Test Development Engineer

NVIDIA

Shanghai, Shanghai, China (On-Site)
1 Month ago
CloudLinux - Senior Python Developer for KernelCare

CloudLinux

Tbilisi, Tbilisi, Georgia (Remote)
5 Days ago
Trackman - Team Lead - Radar & High-Speed Electronics

Trackman

(On-Site)
6 Days ago
Rockstar Games - Lead Software Engineer (C++)

Rockstar Games

New York, New York, United States (On-Site)
6 Months ago
Samsung Semiconductor - Staff Engineer, Machine Learning

Samsung Semiconductor

San Jose, California, United States (Hybrid)
2 Weeks ago
NVIDIA - Senior Software Engineer

NVIDIA

Taipei City, Taiwan (On-Site)
9 Hours ago
Riot Games - Principal Software Engineer, Gameplay - Teamfight Tactics

Riot Games

Dublin, County Dublin, Ireland (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

ByteDance - Software Engineer, ML System Scheduling

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
CloudLinux - Lead SDET/QA Automation Engineer

CloudLinux

Masovian Voivodeship, Poland (Remote)
5 Days ago
Next Level Business Services - JAvA Full Stack Developer

Next Level Business Services

New York, New York, United States (On-Site)
5 Months ago
ByteDance - Linux System Engineer

ByteDance

London, England, United Kingdom (On-Site)
6 Days ago
White Hat Gaming  - SRE/DevOps Engineer

White Hat Gaming

(Remote)
1 Month ago
ByteDance - Research Scientist, Multimodality

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
PwC - IN- Senior Associate_ DevOps_Advisory Corporate_Advisory _Bangalore

PwC

Bengaluru, Karnataka, India (On-Site)
6 Months ago
ByteDance - Software Engineer, SRE - Platform Services

ByteDance

Seattle, Washington, United States (On-Site)
6 Days ago
ByteDance - Lead Research Scientist, Foundation Model, Speech & Audio

ByteDance

San Jose, California, United States (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Shanghai, Shanghai, China

Animoca Brands - Game Developer

Animoca Brands

China (Remote)
6 Months ago
Tencent - 安全技术开发

Tencent

Shenzhen, Guangdong Province, China (On-Site)
3 Months ago
NVIDIA - Senior Thermal Design Engineer

NVIDIA

Shanghai, Shanghai, China (On-Site)
3 Weeks ago
Voodoo - Publishing Manager

Voodoo

Shanghai, Shanghai, China (Remote)
2 Months ago
Canva - Quality Engineer - Internationalization

Canva

Wuhan, Hubei, China (Remote)
2 Weeks ago
Virtuos - Senior C&B Specialist

Virtuos

China (On-Site)
1 Week ago
Tencent - NIKKE Game Community Content Operation

Tencent

Shanghai, Shanghai, China (On-Site)
1 Week ago
Virtuos - 3D Environment Trainee

Virtuos

China (On-Site)
1 Week ago
Riot Games - Software Engineer - Platform & Tools (Contractor)

Riot Games

Shanghai, Shanghai, China (On-Site)
5 Months ago
Tencent - Overseas Game Content Creative Designer

Tencent

Shenzhen, Guangdong Province, China (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Research & Development Jobs

NVIDIA - Senior Emulation Power Engineer

NVIDIA

Santa Clara, California, United States (On-Site)
2 Months ago
CloudHire - Sr. Java Application Architect

CloudHire

Karnataka, India (Remote)
1 Week ago
ByteDance - Software Engineer in Large Model System Graduate (Machine Learning Sys-US) - 2024 Start (BS/MS)

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
NVIDIA - Board Design Engineer, LDE

NVIDIA

Shenzhen, Guangdong Province, China (On-Site)
1 Week ago
NVIDIA - Senior Technical Program Manager – Silicon Solutions

NVIDIA

Santa Clara, California, United States (Hybrid)
1 Month ago
NVIDIA - Physical Design Engineer

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
2 Months ago
Krafton  - [PUBG IP Franchise] 게임 제작관리 PM (5년 이상)

Krafton

Seoul, South Korea (On-Site)
5 Months ago
ByteDance - Software Engineer, Model Interference

ByteDance

San Jose, California, United States (On-Site)
2 Months ago
NVIDIA - Senior Solution Engineer, Mission Control

NVIDIA

Durham, North Carolina, United States (On-Site)
5 Days ago
NXP - <2025 Internship Program> Application Engineer

NXP

Taipei City, Taiwan (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.


Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Ra'anana, Center District, Israel (On-Site)

Ra'anana, Center District, Israel (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug