Director of AI Research

1 Month ago • 15 Years + • Artificial Intelligence

Job Summary

Job Description

NVIDIA's Conversational AI research team seeks a Director of AI Research to lead the development of new deep learning algorithms and techniques for efficient LLM inference. Responsibilities include designing new architectures for advanced LLMs, adapting foundation AI models to downstream tasks (math, code reasoning), contributing to the Nemo framework, curating datasets, and collaborating with product and hardware teams. The ideal candidate will possess a PhD in Computer Science/Electrical Engineering, 15+ years of machine learning/deep learning experience (10+ in management), expertise in NLP and speech processing, proficiency in Python and PyTorch, and a strong publication record. This role involves leading a team to build and deploy cutting-edge AI solutions.
Must have:
  • PhD in CS/EE
  • 15+ years ML/DL experience
  • 10+ years management experience
  • NLP/Speech processing knowledge
  • Python & PyTorch expertise
  • Strong publication record
  • Lead LLM inference research
Perks:
  • Competitive salary
  • Generous benefits package

Job Details

We are looking for Director of AI research team, to work on new deep learning algorithms and techniques for efficient LLM inference 

NVIDIA is searching for world-class researchers in deep learning and natural language processing (NLP) to join our Conversational AI research team. Our team is pushing the boundaries of generative AI by building state-of-the-art large language models (LLM). We  work on new neural architectures to enable LLM with very long context, on applying LLM to solve complicated math and coding problems, and on improvement LLM robustness. If you are passionate about the latest research and technologies revolutionizing generative AI and want to explore creative new paradigms for applied foundation models such as reasoning  agents, this team will be a great fit for you. After building prototypes that demonstrate the promise of your research, you will collaborate with product teams to apply your ideas into industry-leading real-world applications.

What you will do:

  • Lead the team which will work on new deep learning algorithms and techniques for efficient LLM inference 
  • Develop new architectures for advanced large language models
  • Design and adapt foundation AI models to downstream tasks such as math and code reasoning.
  • Contribute these new models to Nemo framework
  • Construct and curate datasets for large-scale machine learning, for learning from human preferences, and for specific domains of applications.
  • Work closely with product and hardware architecture teams to integrate your research and developments into products.

What we need to see:

  • MSc or PhD in  Computer Science/ Electrical Engineering 
  • 15 overall Years of extensive machine learning / deep learning research or work experience, and 10 years of management experience
  • Knowledge of application areas such as natural language processing and speech processing.
  • Excellent programming skills in some rapid prototyping environments such as Python; 
  • Expertise with deep learning frameworks such as PyTorch.
  • A track record of research excellence demonstrated in publications at leading conferences and journals.

With competitive salaries and a generous benefits package, we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to unprecedented growth, our exclusive engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you.

Similar Jobs

prizepicks - Data Scientist - Simulation/Sports Modeling

prizepicks

Atlanta, Georgia, United States (Remote)
1 Week ago
ByteDance - Partner Sales Manager, Indonesia, Lark APAC

ByteDance

Jakarta, Jakarta, Indonesia (On-Site)
5 Months ago
ByteDance - Senior Product Solution Manager, Edge Cloud

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago
Omnissa - C++ with macOS internals - Staff Engineer & Member of Technical Staff - II / III

Omnissa

Bengaluru, Karnataka, India (Hybrid)
6 Months ago
Virtuos - Technical Art Lead / Director

Virtuos

Japan (On-Site)
1 Week ago
Interface AI - Senior Technical Recruiter

Interface AI

United States (Remote)
1 Month ago
RoofStack - AI/ML Engineer

RoofStack

İstanbul, İstanbul, Türkiye (On-Site)
1 Month ago
ION - Senior AI Engineer, Italy

ION

Pisa, Tuscany, Italy (On-Site)
5 Months ago
NVIDIA - AI Computing Software Development Engineer, TensorRT

NVIDIA

Shanghai, Shanghai, China (On-Site)
2 Months ago
PlayStation Global - Mid-Career Machine Learning Engineer - Recommendation Systems

PlayStation Global

San Francisco, California, United States (On-Site)
6 Days ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

ByteDance - Strategy Analyst – Strategy & Operations

ByteDance

Seattle, Washington, United States (On-Site)
1 Week ago
Epic Games - Senior Technical Designer

Epic Games

(On-Site)
1 Week ago
Flying Bark Productions - 3D Modeller

Flying Bark Productions

Madrid, Community Of Madrid, Spain (On-Site)
5 Days ago
Epic Games - Senior Technical Designer

Epic Games

Vancouver, British Columbia, Canada (On-Site)
2 Months ago
Riot Games - Manager, Insights - Central User Research

Riot Games

Los Angeles, California, United States (On-Site)
1 Week ago
Riot Games - Manager, Software Engineering - Infrastructure / Cloud Foundations

Riot Games

Los Angeles, California, United States (On-Site)
2 Months ago
Far Out Scout - Senior Back End Engineer

Far Out Scout

Brazil (Remote)
1 Week ago
Amber - Community Manager

Amber

Bucharest, Bucharest, Romania (Hybrid)
6 Days ago
Push Gaming - Game Mathematician

Push Gaming

Malta (Hybrid)
5 Days ago
PwC - ID FY24 - Associate - Strategy & Operations - Talent Pool

PwC

Jakarta, Jakarta, Indonesia (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Tel Aviv-Yafo, Tel Aviv District, Israel

CrazyLabs - Marketing Tech Lead

CrazyLabs

Tel Aviv District, Israel (On-Site)
2 Months ago
SuperPlay - 2D Artist

SuperPlay

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
6 Days ago
NVIDIA - Senior STA Engineer

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
1 Month ago
SuperPlay - Senior Game Economist

SuperPlay

Tel Aviv District, Israel (On-Site)
1 Month ago
NVIDIA - Senior Analog Mixed Signal Design Engineer

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
1 Month ago
NVIDIA - Physical Design CAD Manager

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
1 Month ago
Overwolf - Monetization Manager

Overwolf

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
5 Days ago
NVIDIA - PCB Layout Design Manager

NVIDIA

Ra'anana, Center District, Israel (On-Site)
1 Month ago
PAPAYA - Business Analyst

PAPAYA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
4 Months ago
SuperPlay - 2D Team Lead

SuperPlay

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
7 Months ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

N-iX - AI Engineer

N-iX

Poland (Remote)
1 Week ago
Microsoft - Platform Engineering Manager

Microsoft

Redmond, Washington, United States (Hybrid)
1 Week ago
Zoox - Staff/Senior Staff Software Engineer, ML Performance Optimization

Zoox

Foster City, California, United States (On-Site)
5 Months ago
Rackspace Technology - Practice Manager, Data Science, AI and ML

Rackspace Technology

San Diego, California, United States (Remote)
6 Days ago
NVIDIA - Senior AI-HPC Storage Engineer

NVIDIA

Westford, Massachusetts, United States (On-Site)
1 Month ago
NVIDIA - Software Engineering Manager - Data Processing Libraries

NVIDIA

Warsaw, Masovian Voivodeship, Poland (Remote)
2 Months ago
ByteDance - Student Researcher (Doubao (Seed) - Foundation Model - Speech & Audio) - 2025 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
ByteDance - Student Researcher (Doubao (Seed) - Foundation Model AI Platform) - 2025 Start (PhD)

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago
Glean - Software Engineer, Machine Learning

Glean

Palo Alto, California, United States (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.


Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Ra'anana, Center District, Israel (On-Site)

Ra'anana, Center District, Israel (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug