Deep Learning Software Engineer, Performance Optimization

3 Months ago • 5 Years + • Artificial Intelligence

Job Summary

Job Description

NVIDIA seeks Deep Learning Software Engineers to develop and optimize deep learning solutions for consumer products. Responsibilities include analyzing and optimizing DNN AI algorithms, implementing production-quality software libraries for latency-critical applications on next-generation hardware, pushing performance and efficiency boundaries using techniques like model compression and quantization, collaborating with researchers and engineers on future NVIDIA chip architectures, and assisting customers in bringing AI-powered products to market. This role demands expertise in deep learning toolchains, neural network optimization, and accelerated hardware implementation.
Must have:
  • 5+ years experience in relevant fields
  • Proficiency in C++, Python
  • Deep learning toolchain knowledge
  • DNN optimization and implementation on accelerated hardware
  • Experience with CNN, LLM, ViT architectures
Good to have:
  • Experience with CUDA kernels or low-level libraries
  • Building distributed deep-learning infrastructure
  • Contributions to open-source projects
  • Publications at relevant conferences
  • Achievements in programming competitions

Job Details

We are looking for outstanding Deep Learning Software Engineers to develop and productize NVIDIA's deep learning solutions for use in real-life consumer products. The role offers the opportunity to develop new deep learning architectures and tailor DNNs for NVIDIA hardware platforms. It also involves building a close technical relationship with NVIDIA partners during product development and coordinating with internal architecture and software teams to develop the right solution for partners who rely on our platforms.

What you'll be doing:

  • Analyze, profile and optimize the latest DNN AI algorithms, and implement them as production-quality software libraries for latency-critical use cases on next-generation hardware.

  • Push the boundaries of the state of the art in DNN performance and efficiency, including model compression, quantization and architecture search techniques.

  • Collaborate with researchers and engineers across NVIDIA to improve the architecture of future NVIDIA chips and ensure that they are ready to support the latest advances in AI.

  • Help NVIDIA customers bring ground-breaking products to life on the foundation of NVIDIA AI technology.

What we need to see:

  • University degree, or equivalent knowledge, in Computer Science, Electrical Engineering, Physics or Mathematics.

  • 5+ years of work experience in related fields, such as HPC, numeric computing, machine learning or AI, with responsibility for software optimization.

  • Proficiency in C++, Python, data structures, algorithms, computer architecture and operating system concepts.

  • Knowledge of deep-learning toolchains (PyTorch, TensorFlow, Keras, ONNX, TensorRT, numeric libraries, containers, etc.).

  • Experience with neural network training, pruning and quantization, and with deploying DNN inference in production systems.

  • Experience optimizing and implementing compute algorithms on accelerated hardware, such as SIMD instruction sets, GPUs, FPGAs or DNN ASICs.

  • Familiarity with CNN, LLM and ViT architectures, as well as the latest progress in the field.

  • Experience creating DNN models for solving production problems in any domain, including computer vision, speech recognition, natural language processing, optimization or generative AI.

Ways to stand out from the crowd:

  • Experience implementing DNN inference natively using C++, CUDA kernels or low-level libraries, such as BLAS.

  • Experience with distributed deep-learning infrastructure, HPC or cloud programming.

  • Contributions to open-source projects, including personal projects published as open source (please provide a link to your GitHub repository).

  • Papers published at relevant conferences or in journals (e.g. NeurIPS, ICML, ICLR, CVPR, ICCV, ECCV, SIGGRAPH).

  • Achievements in programming or machine learning competitions, such as Kaggle, HackerRank, TopCoder, etc.

NVIDIA is leading the way in groundbreaking developments in artificial intelligence, high-performance computing and visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables outstanding creativity and discovery, and powers what were once science fiction inventions from artificial intelligence to autonomous cars. NVIDIA is looking for great engineers and scientists to help us accelerate the next wave of artificial intelligence.

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.
