Python Software Engineering Intern, Accelerated LLM Data Applications - Fall 2025

1 Day ago • Up to 1 Year • Research & Development

Job Summary

Job Description

NVIDIA seeks a Python Software Engineering Intern to accelerate data engineering for Large Language Models (LLMs). The intern will develop and optimize Python-based data processing frameworks for GPU-accelerated environments, contributing to RAPIDS and other GPU-accelerated libraries. Responsibilities include designing and implementing components for Retrieval Augmented Generation (RAG) pipelines, benchmarking algorithms, and collaborating with LLM & ML researchers. The ideal candidate possesses strong Python skills, familiarity with LLMs and RAG pipelines, experience with PyData and ML/DL ecosystems, and a passion for optimization and iterative development. The internship involves working with large datasets, optimizing for speed and cost, and improving system accuracy through various techniques.
Must have:
  • Python library development experience
  • Familiarity with LLMs and RAG pipelines
  • Understanding of PyData & ML/DL ecosystems
  • Contributions to open-source projects
Good to have:
  • Experience with production-level data pipelines
  • Experience with software packaging technologies
  • Familiarity with Docker Compose, Kubernetes
  • Knowledge of parallel programming in CUDA C++
Perks:
  • Intern benefits

Job Details

Today, NVIDIA is tapping into the unlimited potential of AI to define the next era of computing: an era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, encouraging environment where everyone is inspired to do their best work. Come join the team and see how we can make a lasting impact on the world.

NVIDIA is seeking a Python Software Engineer Intern to further our efforts to GPU-accelerate data engineering for Large Language Model (LLM) tools and libraries. This role is pivotal in accelerating pre-processing pipelines for high-quality multi-modal dataset curation. The day-to-day focus is on developing efficient, scalable systems for de-duplicating, filtering, and classifying training corpora for foundation model LLMs, as well as ingesting and prepping datasets for use in Retrieval Augmented Generation (RAG) pipelines. Fundamental to these efforts are iterative testing and improvement in system cost, speed, & accuracy through micro-optimization, prompt engineering, fine-tuning, and applying new research. The ideal candidate is happiest releasing early and often, courts user feedback with an ear open to the spirit of related feature requests, and is comfortable objectively evaluating the latest AI models and frameworks with an eye on acceleration potential. Would you like to run your training & test experiments on our supercomputers on thousands of GPUs? Come work with us!

What you'll be doing:

  • Develop and optimize Python-based data processing frameworks, ensuring efficient handling of large datasets in GPU-accelerated environments, vital for LLM training.

  • Contribute to the design and implementation of RAPIDS and other GPU-accelerated libraries, focusing on seamless integration and performance enhancement in the context of LLM training data preparation and RAG pipelines.

  • Lead development and iterative optimization of components for RAG pipelines, ensuring they demonstrate GPU acceleration & the best-performing models for improved total cost of ownership (TCO).

  • Collaborate with teams of LLM & ML researchers in the development of full-stack, GPU-accelerated data preparation pipelines for multimodal models.

  • Implement benchmarking, profiling, and optimization of innovative algorithms in Python in various system architectures, specifically targeting LLM applications.

  • Work closely with diverse teams to understand requirements, build & evaluate POCs, and develop roadmaps for production-level tools and library features within the growing LLM ecosystem.

What we need to see:

  • Pursuing an MS or PhD in Computer Science, Computer Engineering, or a related field.

  • Python library development experience, including CI systems (GitHub Actions), integration testing, benchmarking, & profiling

  • Familiarity with LLMs and RAG pipelines: prompt engineering, LangChain, llama-index

  • Understanding of the PyData & ML/DL ecosystems, including RAPIDS, pandas, NumPy, scikit-learn, XGBoost, Numba, PyTorch

  • Familiarity with distributed programming frameworks like Dask, Apache Spark, or Ray

  • Visible contributions to open-source projects on GitHub

Ways to stand out from the crowd:

  • Active engagement (published papers, conference talks, blogs) in the data science community

  • Experience with production-level data pipelines, especially SQL-based

  • Experience with software packaging technologies: pip, conda, Docker images

  • Familiarity with Docker Compose, Kubernetes, and cloud deployment frameworks

  • Knowledge of parallel programming approaches, especially in CUDA C++

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!

The hourly rate for our interns is 18 USD - 71 USD. Our internship hourly rates are a standard pay determined based on the position and your location, year in school, degree, and experience.

You will also be eligible for Intern benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.
