Director of AI Research

2 Months ago • 15 Years + • Artificial Intelligence

Job Summary

Job Description

NVIDIA's Conversational AI research team seeks a Director of AI Research to lead the development of new deep learning algorithms and techniques for efficient LLM inference. Responsibilities include designing new architectures for advanced LLMs, adapting foundation AI models to downstream tasks (math, code reasoning), contributing to the Nemo framework, curating datasets, and collaborating with product and hardware teams. The ideal candidate will possess a PhD in Computer Science/Electrical Engineering, 15+ years of machine learning/deep learning experience (10+ in management), expertise in NLP and speech processing, proficiency in Python and PyTorch, and a strong publication record. This role involves leading a team to build and deploy cutting-edge AI solutions.
Must have:
  • PhD in CS/EE
  • 15+ years ML/DL experience
  • 10+ years management experience
  • NLP/Speech processing knowledge
  • Python & PyTorch expertise
  • Strong publication record
  • Lead LLM inference research
Perks:
  • Competitive salary
  • Generous benefits package

Job Details

We are looking for Director of AI research team, to work on new deep learning algorithms and techniques for efficient LLM inference 

NVIDIA is searching for world-class researchers in deep learning and natural language processing (NLP) to join our Conversational AI research team. Our team is pushing the boundaries of generative AI by building state-of-the-art large language models (LLM). We  work on new neural architectures to enable LLM with very long context, on applying LLM to solve complicated math and coding problems, and on improvement LLM robustness. If you are passionate about the latest research and technologies revolutionizing generative AI and want to explore creative new paradigms for applied foundation models such as reasoning  agents, this team will be a great fit for you. After building prototypes that demonstrate the promise of your research, you will collaborate with product teams to apply your ideas into industry-leading real-world applications.

What you will do:

  • Lead the team which will work on new deep learning algorithms and techniques for efficient LLM inference 
  • Develop new architectures for advanced large language models
  • Design and adapt foundation AI models to downstream tasks such as math and code reasoning.
  • Contribute these new models to Nemo framework
  • Construct and curate datasets for large-scale machine learning, for learning from human preferences, and for specific domains of applications.
  • Work closely with product and hardware architecture teams to integrate your research and developments into products.

What we need to see:

  • MSc or PhD in  Computer Science/ Electrical Engineering 
  • 15 overall Years of extensive machine learning / deep learning research or work experience, and 10 years of management experience
  • Knowledge of application areas such as natural language processing and speech processing.
  • Excellent programming skills in some rapid prototyping environments such as Python; 
  • Expertise with deep learning frameworks such as PyTorch.
  • A track record of research excellence demonstrated in publications at leading conferences and journals.

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you.

Similar Jobs

Assystems - Geological Engineer

Assystems

Ankara, Ankara, Türkiye (On-Site)
6 Months ago
Riot Games - Researcher III - RDS Central User Research Team

Riot Games

Los Angeles, California, United States (On-Site)
1 Month ago
Quizizz - UX Researcher

Quizizz

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Actian - Client Director (Midwest) - HCLSoftware

Actian

United States (Remote)
6 Months ago
ByteDance - Data Scientist

ByteDance

San Jose, California, United States (On-Site)
1 Month ago
HP - Machine Learning Engineer

HP

Palo Alto, California, United States (On-Site)
7 Months ago
Meta - Research Scientist Intern, Smart Glasses in Wearables AI (PhD)

Meta

Menlo Park, California, United States (On-Site)
5 Months ago
Zoox - Senior/Staff Software Engineer - Simulation Traffic & Behavior Modeling

Zoox

Foster City, California, United States (Hybrid)
6 Months ago
Krafton  - Technical Project Manager, Deep Learning Division

Krafton

Seoul, South Korea (On-Site)
3 Months ago
AI Fund - Head of AI @ Olakai

AI Fund

California, United States (Remote)
6 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Virtuos - Technical Art Lead / Director

Virtuos

Japan (On-Site)
1 Month ago
Simple Viral Games - Game Designer

Simple Viral Games

Bengaluru, Karnataka, India (On-Site)
8 Months ago
Riot Games - Senior User Researcher

Riot Games

Shanghai, Shanghai, China (On-Site)
9 Months ago
ByteDance - Software Engineer, Multi Cloud CDN - San Jose / Seattle / Boston

ByteDance

Seattle, Washington, United States (On-Site)
4 Months ago
Gaming Innovation Group  - Database Engineer (MySQL)

Gaming Innovation Group

Catalonia, Spain (Hybrid)
1 Month ago
DPS Games - Senior Environment Artist (Unannounced Project)

DPS Games

Guildford, England, United Kingdom (On-Site)
7 Months ago
ByteDance - Senior Software Engineer, Multi Cloud CDN - San Jose / Seattle / Boston

ByteDance

Boston, Massachusetts, United States (On-Site)
4 Months ago
GreenWave™ Radios - Tech Lead, Design Verification

GreenWave™ Radios

Bengaluru, Karnataka, India (On-Site)
7 Months ago
The Walt Disney Company - Entertainment Technician - Full Time

The Walt Disney Company

Hong Kong (On-Site)
4 Months ago
Amanotes - Senior Game Data Analyst (Based in HCM)

Amanotes

Ho Chi Minh City, Ho Chi Minh City, Vietnam (On-Site)
11 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Tel Aviv-Yafo, Tel Aviv District, Israel

NVIDIA - Senior System Networking Engineer, InfiniBand

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
3 Months ago
Playtika - Product Security Team Leader

Playtika

Israel (On-Site)
4 Months ago
SuperPlay - Senior User Acquisition Manager

SuperPlay

Tel Aviv District, Israel (On-Site)
3 Weeks ago
NVIDIA - Senior Chip Architect

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
2 Months ago
NVIDIA - Senior Product Manager, ASIC Simulation

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
2 Months ago
PAPAYA - Monetization Manager - Solitaire

PAPAYA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
1 Month ago
NVIDIA - Senior Software QA Automation Engineer

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
2 Months ago
NVIDIA - Senior HPC DevOps Engineer

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
3 Months ago
NVIDIA - Senior Product Manager - SONiC

NVIDIA

Ra'anana, Center District, Israel (Hybrid)
2 Months ago
SuperPlay - Senior Game Economist

SuperPlay

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

NVIDIA - Machine Learning Software Platform Architect

NVIDIA

Canada (On-Site)
2 Months ago
Zoox - Staff/Senior Staff Software Engineer, ML Performance Optimization

Zoox

Seattle, Washington, United States (On-Site)
6 Months ago
Airlab Inc  - Artificial Intelligence Researcher

Airlab Inc

Quebec, Canada (On-Site)
1 Month ago
Omnissa - Staff Engineer (Data Science)

Omnissa

Bengaluru, Karnataka, India (Hybrid)
5 Months ago
Tencent - Large Language Model Algorithm Engineer

Tencent

California, United States (On-Site)
1 Month ago
Canva - Senior Computer Vision Engineer - Photo AI

Canva

London, England, United Kingdom (Remote)
3 Months ago
ByteDance - Student Researcher (Doubao (Seed) - Foundation Model AI Platform) - 2025 Start (PhD)

ByteDance

Seattle, Washington, United States (On-Site)
6 Months ago
Keywords Studios - Research Associate - AI

Keywords Studios

(Remote)
1 Month ago
Zazz - Machine Learning Engineer

Zazz

(Remote)
2 Months ago
Virtuos - Senior Games Tool Engineer (Machine Learning Specialist)

Virtuos

Shanghai, Shanghai, China (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Massachusetts, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Texas, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Austin, Texas, United States (Remote)

Santa Clara, California, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug