Principal Engineer

1 Month ago • 12 Years + • Artificial Intelligence • $272,000 PA - $431,250 PA

Job Summary

Job Description

NVIDIA seeks a Principal Research Engineer focused on Generative AI inference to develop optimized inferencing technologies. Responsibilities include developing new models and algorithms in speech recognition, speech synthesis, NLP, and deep learning; architecting and implementing features in C++, CUDA, and Python; ensuring seamless software integration across NVIDIA's accelerated serving stack; mentoring junior engineers; and collaborating with internal and external partners. The role involves implementing new algorithms, performance tuning, API definition, and general software engineering tasks.
Must have:
  • 12+ years experience in Deep Learning
  • Excellent C++ and Python skills
  • Strong ML/DNN/NLP/Speech Recognition understanding
  • Experience with PyTorch or TensorFlow
  • Strong communication and mentoring skills
Good to have:
  • Experience with large-scale distributed systems
  • Knowledge of CPU/GPU architecture
  • GPU programming (CUDA)
  • Contributions to major open-source projects
Perks:
  • Competitive salary
  • Generous benefits package
  • Equity

Job Details

We are now looking for a Principal Research Engineer focused on Generative AI inference. Are you excited to change the way people infuse AI into products and services? NVIDIA is at the forefront of generative AI models, from language to images. NVIDIA's provides building blocks to democratize AI and make generative AI easy to develop, integrate, and deploy. Our team is dedicated to developing optimized inferencing technologies to support our growing generative AI needs. We contribute to all steps of the machine learning lifecycle: from conceptualization, to applied research, engineering for optimized inference, and deployment.

As a research engineer on the team, you will interact with internal partners, users, and members of the open-source community to define, analyze, and implement highly optimized algorithms for speech recognition, natural language understanding, image generation and speech synthesis. The scope of these efforts includes a combination of implementing new algorithms, performance tuning and analysis, defining APIs, analyzing functionality coverage, and other general software engineering work.

What you will be doing:

  • Developing new models and algorithms in Speech Recognition, Speech Synthesis, Natural Language Processing and Deep Learning

  • Architecting and implementing features in C++, CUDA, and Python

  • Demonstrate good engineering practices and mentoring other team members to do the same.

  • Working with engineering teams across all of NVIDIA to ensure our software integrates seamlessly up and down the NVIDIA accelerated serving stack.

What we need to see:

  • Understanding of modern techniques in Machine Learning, Deep Neural Networks, Natural Language Processing, or Speech Recognition

  • 12+ years industry experience in Deep Learning frameworks (PyTorch or Tensorflow)

  • Passion for software engineering. We are especially looking for excellent C++ and Python development skills, with meaningful contributions to major open-source projects.

  • Strong communication and interpersonal skills along with the ability to work in a dynamic and distributed team. Your history of mentoring junior engineers and interns is a huge plus.

  • Bachelor's degree or equivalent experience.

  • A desire to constantly grow and learn new things.

  • Strong computer science fundamentals - algorithms and data structures, computational complexity, parallel and distributed computing, system software.

Ways to stand out from a crowd:

  • Experience architecting or developing large-scale distributed systems for deep learning

  • Knowledge of CPU and/or GPU architecture

  • GPU programming (CUDA)

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you.

The base salary range is 272,000 USD - 431,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

ByteDance - Software Engineer, Inference

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago
G5 Games - 2D UI/UX Artist (Hidden objects project)

G5 Games

Yerevan, Yerevan, Armenia (Remote)
2 Months ago
Playrix - SDET (Software Development Engineer in Test)

Playrix

Ireland (Remote)
1 Week ago
Nintendo - Senior Data Scientist

Nintendo

Redmond, Washington, United States (On-Site)
2 Months ago
G5 Games - 2D UI/UX Artist (match-3 project)

G5 Games

Astana, Astana, Kazakhstan (Remote)
5 Months ago
Meta - AI Research Scientist, Language - Generative AI

Meta

Seattle, Washington, United States (On-Site)
4 Months ago
Microsoft - Member of Technical Staff - Software Engineer

Microsoft

Redmond, Washington, United States (On-Site)
1 Week ago
NVIDIA - Principal DGX Cloud Machine Learning Architect

NVIDIA

Canada (On-Site)
1 Month ago
NVIDIA - Solution Architect, Generative AI - Digital Human

NVIDIA

Canada (On-Site)
1 Month ago
ByteDance - Student Researcher (Doubao (Seed) - Foundation Model - Speech & Audio) - 2025 Start (PhD)

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Homa games - Senior MLOps Engineer

Homa games

Île-de-France, France (On-Site)
1 Day ago
Playrix - Feature Owner (LiveOps)

Playrix

Ukraine (Remote)
5 Months ago
Resemble AI - Deep Learning Speech Researcher

Resemble AI

Mountain View, California, United States (On-Site)
7 Months ago
Canva - Senior Computer Vision Engineer - Photo AI

Canva

London, England, United Kingdom (Remote)
2 Months ago
Playrix - Game Director

Playrix

Portugal (Remote)
5 Months ago
Applike Group - Senior Data Scientist (Recommendation Systems Expert) (f/m/d)

Applike Group

Hamburg, Hamburg, Germany (Hybrid)
5 Months ago
Warner Bros Games - Staff Data Scientist

Warner Bros Games

Hyderabad, Telangana, India (Hybrid)
1 Month ago
NVIDIA - Senior AI Training Performance Engineer

NVIDIA

Shanghai, Shanghai, China (Hybrid)
2 Months ago
ZiMAD - Graphic Designer

ZiMAD

(Remote)
1 Month ago
G5 Games - 2D Illustrator (HOG project)

G5 Games

Tbilisi, Tbilisi, Georgia (Remote)
1 Week ago

Get notifed when new similar jobs are uploaded

Jobs in undefined

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Artificial Intelligence Jobs

Keywords Studios (Player Support) - Technical Research Associate - AI

Keywords Studios (Player Support)

(Remote)
6 Days ago
Meta - Software Engineer, Machine Learning

Meta

Sunnyvale, California, United States (On-Site)
4 Months ago
Scale AI - AI Product Manager, Generative AI

Scale AI

San Francisco, California, United States (On-Site)
5 Months ago
NVIDIA - Senior Field Application Engineer

NVIDIA

Durham, North Carolina, United States (On-Site)
2 Months ago
Airlab Inc  - Jr Programmer Artificial Intelligence

Airlab Inc

Montreal, Quebec, Canada (On-Site)
10 Months ago
Krafton  - Applied Research Scientist/Engineer - LLM Game Agent

Krafton

Seoul, South Korea (On-Site)
6 Days ago
NVIDIA - Principal Engineer

NVIDIA

United States (Remote)
1 Month ago
Saama Technologies,  Inc  - NLP Engineer

Saama Technologies, Inc

(Remote)
1 Month ago
Keywords Studios (Player Support) - AI - Technical Research Associate (Prompts)

Keywords Studios (Player Support)

Silesian Voivodeship, Poland (On-Site)
2 Weeks ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.


Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Ra'anana, Center District, Israel (On-Site)

Ra'anana, Center District, Israel (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug