Solutions Architect, Generative AI

2 Months ago • 5 Years + • Artificial Intelligence • $148,000 PA - $287,500 PA

Job Summary

Job Description

NVIDIA seeks a Solutions Architect to lead LLM development for Agentic AI. This role involves building agentic LLM applications, exploring advancements in model training, and enabling NVIDIA's partners to build enterprise AI solutions using accelerated computing. Responsibilities include collaborating with developers, anticipating customer needs, establishing reference architectures, and sharing feedback to improve products. The ideal candidate possesses a strong background in deep learning, generative models, and enterprise-grade RAG systems, proficiency in Python and C++, and excellent communication skills.
Must have:
  • MSc/PhD in relevant field or equivalent experience
  • 5+ years experience in developing and deploying AI models at scale
  • Experience building enterprise-grade RAG systems
  • Proficiency in Python, C++, deep learning frameworks
  • Excellent communication and presentation skills
Good to have:
  • Experience with NVIDIA AI platforms (NeMo, NIMs)
  • Experience training GPT and Megatron Models
  • Understanding of MLOps/LLMOps
  • CUDA programming experience
Perks:
  • Equity
  • Benefits

Job Details

NVIDIA is seeking an outstanding Solutions Architect to lead development of LLM for Agentic AI to join our fast-growing Generative AI team, who are enabling a global network of Professional Services partners on NVIDIA’s full-stack accelerated computing platforms. Our team is dedicated to applying next-generation technologies to solve customer problems. We are looking for an ambitious and forward-thinking engineer to contribute to the develop of AI applications and solving real world problems for enterprise customers using the latest Generative AI models and research, including NLP, RAG, distributed computing and large-scale system design. In this role, you will be a lead AI developer and trusted technical expert on the latest Generative AI frameworks and LLM family of products and work closely with partners and customers to build scalable industry-specific enterprise AI solutions including project scoping to POC to production.

As part of Generative AI enablement team, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world by applying accelerated computing AI and solve category defining systems and AI/ML solutions.

What you will be doing:

  • Building agentic LLM applications and exploring the latest advancements in model training, fine-tuning and customization.

  • Enabling NVIDIA strategic service delivery partners to build enterprise AI solutions using accelerated computing stack including NIMs and NeMo microserviecs.

  • Collaborate with developers and onboard them to NVIDIA AI platforms and services by providing deep technical guidance.

  • Anticipate customer and partners needs and find enablement opportunities to expand adoption and utilization of NVIDIA Generative AI products and platforms.

  • Establishing and building repeatable reference architecture, communicate standard processes and understand solution trade-offs. Share findings and feedback to improve products and services.

 

What we need to see:

  • MSc, PhD in Computer Science, Electrical Engineering, Software Engineer, ML Engineer, or related fields (or equivalent experience).

  • 5+ years of relevant work experience in developing and deploying AI models at scale as a Software Engineer or deep learning engineer.

  • Proven track record of building enterprise-grade RAG based systems using open-source models and orchestration frameworks with strong foundation in deep learning, with a particular emphasis on generative models.

  • Proficiency in Python, C++ programming and Deep Learning frameworks,

  • Excellent communication and presentation skills to effectively collaborate with both internal and external customers.

Ways to stand out from the crowd:

  • Demonstrate expertise and hands-on experience with NVIDIA AI platforms. Some products of interest include natural language processing and Large Language Models (NVIDIA NeMo) and inference at scale (NIMs).

  • Excellent practical knowledge of Generative AI and LLM development. Ability to train GPT and Megatron Models.

  • Understanding of MLOps life cycle management and experience with LLMOps workflows.

  • Experience with CUDA programming and benchmarking and analyzing performance AI Agentic systems.

The base salary range is 148,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

Activision - Senior Network Programmer

Activision

Warsaw, Masovian Voivodeship, Poland (On-Site)
5 Months ago
ByteDance - Network Software Engineer - Network Systems

ByteDance

Singapore (On-Site)
5 Months ago
NVIDIA - Senior System Validation Engineer

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
3 Months ago
Epic Games - Senior QA Programmer

Epic Games

(On-Site)
3 Months ago
Equivalent Jobs - C++ SOFTWARE ENGINEER (SIMULATOR)

Equivalent Jobs

(Remote)
5 Months ago
ByteDance - Student Researcher (Doubao (Seed) - Machine Learning System) - 2025 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
NVIDIA - Software Engineering Intern, NGC Data Platform - Fall 2025

NVIDIA

Santa Clara, California, United States (On-Site)
3 Weeks ago
ByteDance - Student Researcher (Doubao (Seed) Foundation Model - Video Generation) - 2025 Start (PhD)

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago
Meta - AI Research Scientist, Language - Generative AI

Meta

Redmond, Washington, United States (On-Site)
5 Months ago
NVIDIA - Senior Deep Learning Performance Architect

NVIDIA

Canada (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Scorewarrior - Build & CI Engineer

Scorewarrior

Limassol, Limassol, Cyprus (On-Site)
1 Month ago
Romero Games - Multiplayer Gameplay Programmer

Romero Games

Galway, County Galway, Ireland (Hybrid)
6 Months ago
HoYoverse - Senior Gameplay Programmer AI [CA]

HoYoverse

Montreal, Quebec, Canada (Remote)
11 Months ago
Zoox - Senior/Staff Machine Learning Engineer - Prediction & Behavior ML

Zoox

Foster City, California, United States (Hybrid)
6 Months ago
ByteDance - Software Engineer in Machine Learning Systems

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago
InMobiInMobi - Senior Solutions Engineer

InMobiInMobi

London, England, United Kingdom (On-Site)
5 Months ago
Epic Games - Senior DevOps Programmer

Epic Games

United States (On-Site)
2 Months ago
Keen Software House - Senior Gameplay Programmer

Keen Software House

Prague, Prague, Czechia (Remote)
2 Months ago
Epic Games - Lead Rendering Engineer

Epic Games

(On-Site)
1 Month ago
31st Union - Senior Test Automation Engineer

31st Union

San Mateo, California, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Canada

Ubisoft - Security Analyst – Organizational Resiliency

Ubisoft

Montreal, Quebec, Canada (On-Site)
6 Months ago
Turbulent - UI Artist - Star Citizen

Turbulent

Montreal, Quebec, Canada (On-Site)
3 Weeks ago
NVIDIA - Technical Marketing Engineer

NVIDIA

Canada (On-Site)
2 Months ago
DNEG - Layout Technical Supervisor (FEAT)

DNEG

Montreal, Quebec, Canada (Hybrid)
1 Month ago
Rockstar Games - Technical Artist: Shotgrid Development Support

Rockstar Games

Oakville, Ontario, Canada (On-Site)
4 Weeks ago
Warner Bros Games - Software Developer II

Warner Bros Games

Toronto, Ontario, Canada (Hybrid)
1 Month ago
Luma Pictures - Compositors, Mid to Senior Level

Luma Pictures

Vancouver, British Columbia, Canada (Remote)
8 Months ago
ION - Technical Support Analyst, Toronto - 4363

ION

Toronto, Ontario, Canada (On-Site)
6 Months ago
Salesforce - Prime Named Account Executive, MuleSoft

Salesforce

Montreal, Quebec, Canada (Remote)
1 Month ago
Mistplay - Principal Product Marketing Manager

Mistplay

Montreal, Quebec, Canada (Hybrid)
2 Months ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Zoox - Senior/Staff Software Engineer - Simulator

Zoox

Foster City, California, United States (Hybrid)
6 Months ago
Inworld AI - AI Trainer (Contractor) - Writing & Gaming

Inworld AI

Vancouver, British Columbia, Canada (Remote)
1 Month ago
Snail Games - Software Engineer - AI/Machine Translation

Snail Games

Beverly Hills, California, United States (Remote)
2 Months ago
Krafton  - Head of Deep Learning PM & Ops Dept.

Krafton

Seoul, South Korea (On-Site)
1 Month ago
Meta - Research Scientist Intern, Machine Perception for Input and Interaction (PhD)

Meta

Redmond, Washington, United States (On-Site)
5 Months ago
ByteDance - Research Scientist in Foundation Model, Music Core Machine Learning Graduates - 2024 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
Google - Software Engineer III, Machine Learning, Search

Google

Seattle, Washington, United States (On-Site)
5 Months ago
The Walt Disney Company - Lead Machine Learning Engineer

The Walt Disney Company

Bristol, Connecticut, United States (On-Site)
1 Month ago
FTF Studios - FTF Entry-Level Programmer

FTF Studios

(Remote)
1 Year ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Massachusetts, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Texas, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug