Director of AI Research

2 Months ago • 15 Years + • Artificial Intelligence

Job Summary

Job Description

NVIDIA seeks a Director of AI Research to lead a team developing deep learning algorithms and techniques for efficient large language model (LLM) inference. Responsibilities include designing new architectures for advanced LLMs, adapting foundation AI models to tasks like math and code reasoning, contributing to the Nemo framework, curating datasets, and collaborating with product and hardware teams. The ideal candidate possesses a PhD in Computer Science/Electrical Engineering, 15+ years of machine learning/deep learning experience, and 10+ years of management experience, along with expertise in NLP, deep learning frameworks (PyTorch), and a strong publication record. The role involves building prototypes, collaborating on real-world applications, and pushing the boundaries of generative AI.
Must have:
  • PhD in CS/EE
  • 15+ years ML/DL experience
  • 10+ years management experience
  • NLP & speech processing knowledge
  • Deep learning frameworks (PyTorch)
  • Strong publication record
Perks:
  • Competitive salary
  • Generous benefits package

Job Details

We are looking for Director of AI research team, to work on new deep learning algorithms and techniques for efficient LLM inference 

NVIDIA is searching for world-class researchers in deep learning and natural language processing (NLP) to join our Conversational AI research team. Our team is pushing the boundaries of generative AI by building state-of-the-art large language models (LLM). We  work on new neural architectures to enable LLM with very long context, on applying LLM to solve complicated math and coding problems, and on improvement LLM robustness. If you are passionate about the latest research and technologies revolutionizing generative AI and want to explore creative new paradigms for applied foundation models such as reasoning  agents, this team will be a great fit for you. After building prototypes that demonstrate the promise of your research, you will collaborate with product teams to apply your ideas into industry-leading real-world applications.

What you will do:

  • Lead the team which will work on new deep learning algorithms and techniques for efficient LLM inference 
  • Develop new architectures for advanced large language models
  • Design and adapt foundation AI models to downstream tasks such as math and code reasoning.
  • Contribute these new models to Nemo framework
  • Construct and curate datasets for large-scale machine learning, for learning from human preferences, and for specific domains of applications.
  • Work closely with product and hardware architecture teams to integrate your research and developments into products.

What we need to see:

  • MSc or PhD in  Computer Science/ Electrical Engineering 
  • 15 overall Years of extensive machine learning / deep learning research or work experience, and 10 years of management experience
  • Knowledge of application areas such as natural language processing and speech processing.
  • Excellent programming skills in some rapid prototyping environments such as Python; 
  • Expertise with deep learning frameworks such as PyTorch.
  • A track record of research excellence demonstrated in publications at leading conferences and journals.

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you.

Similar Jobs

Visa - Director, NA Visa Direct Cross Border Bank Account Manager

Visa

San Francisco, California, United States (Hybrid)
4 Weeks ago
Maliyo Games - Game Designer

Maliyo Games

Nigeria (On-Site)
6 Months ago
Riot Games - Senior Researcher, Wild Rift

Riot Games

Shanghai, Shanghai, China (On-Site)
2 Months ago
Epic Games - Senior Technical Designer

Epic Games

Vancouver, British Columbia, Canada (On-Site)
4 Months ago
Highspot - Implementation Manager

Highspot

Hyderabad, Telangana, India (Hybrid)
3 Months ago
Google - Conversational AI Consultant

Google

Karnataka, India (On-Site)
1 Month ago
Google - Senior Technical Program Manager, AI Risk Reporting Lead

Google

Seattle, Washington, United States (On-Site)
1 Month ago
PlayStation Global - Sr. ML Software Engineer

PlayStation Global

United States (Remote)
2 Months ago
ByteDance - Software Engineer Intern (Doubao (Seed) - Machine Learning System) - 2025 Summer (PhD)

ByteDance

Seattle, Washington, United States (On-Site)
7 Months ago
Google - Cloud AI Engineer, Global Services Delivery

Google

Mexico City, Mexico City, Mexico (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Moloco - Staff Machine Learning Engineer

Moloco

Seoul, South Korea (On-Site)
1 Month ago
Doiq - Technical Artist

Doiq

United States (Remote)
5 Months ago
Omnissa - Member of Technical staff - Android

Omnissa

Bengaluru, Karnataka, India (Hybrid)
6 Months ago
ByteDance - Partner Sales Manager - Lark - Malaysia

ByteDance

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (On-Site)
7 Months ago
Tesla - Construction Site Manager - MEP

Tesla

Brandenburg, Germany (On-Site)
3 Months ago
Highspot - Implementation Manager

Highspot

Hyderabad, Telangana, India (Hybrid)
3 Months ago
ByteDance - Partner Sales Manager - Lark

ByteDance

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (On-Site)
1 Month ago
MIQ Digital - Head of Sales, Thailand

MIQ Digital

Thailand (On-Site)
2 Weeks ago
ByteDance - AI Security Researcher - Security - San Jose

ByteDance

San Jose, California, United States (On-Site)
7 Months ago
MyGames - 3D HTML5 Playable Ads Developer

MyGames

(Remote)
3 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Tel Aviv-Yafo, Tel Aviv District, Israel

Google - Senior CPU Design Verification Engineer

Google

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
1 Month ago
NVIDIA - Senior Physical Design Full Chip STA Engineer

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
3 Months ago
Unity - Gaming Business Analyst

Unity

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
1 Month ago
Playtika - Influencer Marketing & Content Manager

Playtika

Israel (On-Site)
6 Months ago
Unity - Senior Full Stack Developer

Unity

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
1 Month ago
NVIDIA - Senior ICT and JTAG Test Engineer

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
4 Months ago
Playtika - Social Media Manager

Playtika

Israel (On-Site)
7 Months ago
Outbrain - Senior Data Scientist in Ad Tech

Outbrain

Netanya, Center District, Israel (Hybrid)
3 Weeks ago
NVIDIA - Senior Product Manager, ASIC Simulation

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Google - Staff Software Engineer, Machine Learning

Google

Mountain View, California, United States (On-Site)
1 Month ago
Meta - Software Engineer, Systems ML - SW/HW Co-design

Meta

New York, New York, United States (Remote)
6 Months ago
Keywords Studios - AI - Technical Research Associate (Prompts)

Keywords Studios

Silesian Voivodeship, Poland (On-Site)
2 Months ago
Microsoft - Member of Technical Staff, Platform Engineer

Microsoft

Redmond, Washington, United States (Hybrid)
1 Month ago
Scale AI - Machine Learning Engineer, International Public Sector

Scale AI

United Kingdom (On-Site)
7 Months ago
Rackspace Technology - Machine Learning Architect (AWS)

Rackspace Technology

(Remote)
4 Months ago
Microsoft - Member of Technical Staff - AI Multimodal

Microsoft

Zürich, Zurich, Switzerland (On-Site)
1 Month ago
Canva - Machine Learning Research Engineering Manager - Image Generation

Canva

Vienna, Vienna, Austria (Remote)
1 Month ago
NVIDIA - AI Digital Human Development Intern - 2025

NVIDIA

(On-Site)
3 Months ago
Mashgin - Senior Software Engineer, Machine Learning and Artificial Intelligence

Mashgin

Palo Alto, California, United States (Hybrid)
7 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

Santa Clara, California, United States (On-Site)

Massachusetts, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Texas, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (Hybrid)

Pune, Maharashtra, India (On-Site)

Taipei City, Taiwan (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug