Director of AI Research

1 Month ago • 15 Years + • Artificial Intelligence

Job Summary

Job Description

NVIDIA seeks a Director of AI Research to lead a team developing deep learning algorithms and techniques for efficient large language model (LLM) inference. Responsibilities include designing new architectures for advanced LLMs, adapting foundation AI models to tasks like math and code reasoning, contributing to the NeMo framework, curating datasets, and collaborating with product and hardware teams. The ideal candidate has an MSc or PhD in Computer Science/Electrical Engineering, 15+ years of machine learning/deep learning experience, and 10+ years of management experience, along with expertise in NLP, deep learning frameworks (PyTorch), and a strong publication record. The role involves building prototypes, collaborating on real-world applications, and pushing the boundaries of generative AI.
Must have:
  • MSc or PhD in CS/EE
  • 15+ years ML/DL experience
  • 10+ years management experience
  • NLP & speech processing knowledge
  • Deep learning frameworks (PyTorch)
  • Strong publication record
Perks:
  • Competitive salary
  • Generous benefits package

Job Details

We are looking for a Director of AI Research to lead a team working on new deep learning algorithms and techniques for efficient LLM inference.

NVIDIA is searching for world-class researchers in deep learning and natural language processing (NLP) to join our Conversational AI research team. Our team is pushing the boundaries of generative AI by building state-of-the-art large language models (LLMs). We work on new neural architectures that enable LLMs to handle very long contexts, on applying LLMs to complicated math and coding problems, and on improving LLM robustness. If you are passionate about the latest research and technologies revolutionizing generative AI and want to explore creative new paradigms for applied foundation models such as reasoning agents, this team will be a great fit for you. After building prototypes that demonstrate the promise of your research, you will collaborate with product teams to apply your ideas in industry-leading real-world applications.

What you will do:

  • Lead a team working on new deep learning algorithms and techniques for efficient LLM inference.
  • Develop new architectures for advanced large language models.
  • Design and adapt foundation AI models to downstream tasks such as math and code reasoning.
  • Contribute these new models to the NeMo framework.
  • Construct and curate datasets for large-scale machine learning, for learning from human preferences, and for specific application domains.
  • Work closely with product and hardware architecture teams to integrate your research and developments into products.

What we need to see:

  • MSc or PhD in Computer Science/Electrical Engineering.
  • 15+ years of overall machine learning/deep learning research or work experience, and 10+ years of management experience.
  • Knowledge of application areas such as natural language processing and speech processing.
  • Excellent programming skills in a rapid prototyping environment such as Python.
  • Expertise with deep learning frameworks such as PyTorch.
  • A track record of research excellence demonstrated in publications at leading conferences and journals.

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you.

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.