Solutions Architect, Generative AI

2 Months ago • 5 Years + • Artificial Intelligence • $148,000 PA - $287,500 PA

Job Summary

Job Description

NVIDIA seeks a Solutions Architect to lead LLM development for Agentic AI, enabling professional services partners on NVIDIA's accelerated computing platforms. The role involves building agentic LLM applications, exploring advancements in model training and customization, and enabling partners to build enterprise AI solutions using NVIDIA's AI stack (including NeMo microservices). Responsibilities include collaborating with developers, providing technical guidance, anticipating customer needs, establishing reference architectures, and communicating standard processes. The ideal candidate will have a strong background in deep learning, generative models, and experience building enterprise-grade RAG-based systems. Experience with NVIDIA AI platforms (NeMo, NIMs) and expertise in GPT and Megatron model training are highly valued.
Must have:
  • MSc/PhD in relevant field or equivalent experience
  • 5+ years experience in deploying AI models at scale
  • Experience building enterprise-grade RAG systems
  • Proficiency in Python, C++, and deep learning frameworks
  • Excellent communication and presentation skills
Good to have:
  • Experience with NVIDIA AI platforms (NeMo, NIMs)
  • Expertise in GPT and Megatron model training
  • Understanding of MLOps/LLMOps
  • CUDA programming and performance analysis experience
Perks:
  • Equity
  • Benefits

Job Details

NVIDIA is seeking an outstanding Solutions Architect to lead development of LLM for Agentic AI to join our fast-growing Generative AI team, who are enabling a global network of Professional Services partners on NVIDIA’s full-stack accelerated computing platforms. Our team is dedicated to applying next-generation technologies to solve customer problems. We are looking for an ambitious and forward-thinking engineer to contribute to the develop of AI applications and solving real world problems for enterprise customers using the latest Generative AI models and research, including NLP, RAG, distributed computing and large-scale system design. In this role, you will be a lead AI developer and trusted technical expert on the latest Generative AI frameworks and LLM family of products and work closely with partners and customers to build scalable industry-specific enterprise AI solutions including project scoping to POC to production.

As part of Generative AI enablement team, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world by applying accelerated computing AI and solve category defining systems and AI/ML solutions.

What you will be doing:

  • Building agentic LLM applications and exploring the latest advancements in model training, fine-tuning and customization.

  • Enabling NVIDIA strategic service delivery partners to build enterprise AI solutions using accelerated computing stack including NIMs and NeMo microserviecs.

  • Collaborate with developers and onboard them to NVIDIA AI platforms and services by providing deep technical guidance.

  • Anticipate customer and partners needs and find enablement opportunities to expand adoption and utilization of NVIDIA Generative AI products and platforms.

  • Establishing and building repeatable reference architecture, communicate standard processes and understand solution trade-offs. Share findings and feedback to improve products and services.

 

What we need to see:

  • MSc, PhD in Computer Science, Electrical Engineering, Software Engineer, ML Engineer, or related fields (or equivalent experience).

  • 5+ years of relevant work experience in developing and deploying AI models at scale as a Software Engineer or deep learning engineer.

  • Proven track record of building enterprise-grade RAG based systems using open-source models and orchestration frameworks with strong foundation in deep learning, with a particular emphasis on generative models.

  • Proficiency in Python, C++ programming and Deep Learning frameworks,

  • Excellent communication and presentation skills to effectively collaborate with both internal and external customers.

Ways to stand out from the crowd:

  • Demonstrate expertise and hands-on experience with NVIDIA AI platforms. Some products of interest include natural language processing and Large Language Models (NVIDIA NeMo) and inference at scale (NIMs).

  • Excellent practical knowledge of Generative AI and LLM development. Ability to train GPT and Megatron Models.

  • Understanding of MLOps life cycle management and experience with LLMOps workflows.

  • Experience with CUDA programming and benchmarking and analyzing performance AI Agentic systems.

The base salary range is 148,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

Radical Forge - Backend Engineer

Radical Forge

Middlesbrough, England, United Kingdom (Remote)
1 Month ago
Rockstar Games - AI/Gameplay Programmer

Rockstar Games

Oakville, Ontario, Canada (On-Site)
5 Days ago
Playrix - Senior C++ Software Engineer (Tools)

Playrix

Ukraine (Remote)
5 Months ago
Riot Games - Principal Software Engineer, Foundations Developer Experience & Workflows

Riot Games

Los Angeles, California, United States (On-Site)
5 Months ago
Playrix - Senior C++ Software Engineer (Tools)

Playrix

Armenia (Remote)
5 Months ago
Krafton  - [Global Strategy & BD Div.] Strategy Manager(AI Ethics) (4년 ~ 7년)

Krafton

Seoul, South Korea (On-Site)
3 Months ago
Google DeepMind - Research Scientist, Language

Google DeepMind

Bengaluru, Karnataka, India (Hybrid)
6 Months ago
Inworld AI - Forward Deployed Engineer (AI Gameplay Engineer)

Inworld AI

Mountain View, California, United States (On-Site)
1 Week ago
Inworld AI - Product Manager (Technical)

Inworld AI

Mountain View, California, United States (On-Site)
6 Days ago
Avathon - Data Scientist

Avathon

Bengaluru, Karnataka, India (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

NVIDIA - Solutions Architect, Financial Services

NVIDIA

New York, New York, United States (Remote)
2 Months ago
NVIDIA - Senior Verification Engineer

NVIDIA

Taipei City, Taiwan (On-Site)
3 Weeks ago
The Walt Disney Company - Software Engineer, Platform

The Walt Disney Company

California, United States (On-Site)
1 Week ago
Meta - Software Engineer, Machine Learning

Meta

Fremont, California, United States (Remote)
4 Months ago
ION - Lead Software Engineer, Italy

ION

Rome, Lazio, Italy (On-Site)
5 Months ago
Streamline Media Group  Inc  - Game Programmer (Unreal)

Streamline Media Group Inc

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (On-Site)
4 Months ago
ZeniMax Media - Programmeur.se de build / Build Programmer

ZeniMax Media

Montreal, Quebec, Canada (On-Site)
6 Months ago
Epic Games - Senior Security Engineer - Asset Integrity

Epic Games

Porto Alegre, State Of Rio Grande Do Sul, Brazil (On-Site)
1 Week ago
ByteDance - Research Scientist in LLM Foundation Models (reasoning, planning & agent)

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago
CD PROJEKT RED - Engineering Intern (3C's Gameplay)

CD PROJEKT RED

Warsaw, Masovian Voivodeship, Poland (On-Site)
4 Weeks ago

Get notifed when new similar jobs are uploaded

Jobs in Santa Clara, California, United States

Epic Games - Senior Gameplay Systems Developer, Developer Relations

Epic Games

Cary, North Carolina, United States (On-Site)
2 Months ago
Nintendo - Senior Ambassador - Nintendo San Francisco Store

Nintendo

San Francisco, California, United States (On-Site)
4 Months ago
Universal Music - Senior Staff Accountant

Universal Music

Los Angeles, California, United States (Hybrid)
1 Month ago
Fluence - Commissioning Engineer

Fluence

Alpharetta, Georgia, United States (On-Site)
5 Months ago
Zoox - Senior Software Engineer -  Fail Operational Planning

Zoox

Foster City, California, United States (Hybrid)
5 Months ago
Netflix - Engineering Manager, Identity & Authentication Security

Netflix

United States (Remote)
1 Month ago
holospark - Gameplay Engineer

holospark

Bellevue, Washington, United States (On-Site)
3 Months ago
The Walt Disney Company - Lead Java Software Engineer

The Walt Disney Company

Celebration, Florida, United States (On-Site)
1 Week ago
ByteDance - Senior Software Engineer, Traffic Platform

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
Hedra - Senior Full-Stack Engineer

Hedra

New York, New York, United States (On-Site)
5 Days ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Zoox - Sensor Software Developer

Zoox

Foster City, California, United States (On-Site)
5 Months ago
ByteDance - Research Scientist- Foundation Model, Generative AI

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago
Zazz - Machine Learning Engineer

Zazz

(Remote)
1 Month ago
Scale AI - AI Product Manager, Generative AI

Scale AI

San Francisco, California, United States (On-Site)
5 Months ago
FTF Studios - FTF Senior Programmer

FTF Studios

(Remote)
1 Year ago
Zoox - Senior Software Engineer - Simulaton Scenario Automation

Zoox

Seattle, Washington, United States (Hybrid)
5 Months ago
Hedra - Research Scientist

Hedra

New York, New York, United States (On-Site)
5 Days ago
Discord - Director of Machine Learning, Safety

Discord

San Francisco, California, United States (Remote)
2 Months ago
ByteDance - Product Solution Architect, Volcano ARK (Singapore)

ByteDance

Singapore (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.


Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Ra'anana, Center District, Israel (On-Site)

Ra'anana, Center District, Israel (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug