Senior Solutions Architect, Generative AI - Inference

3 Months ago • 5 Years + • Artificial Intelligence • $148,000 PA - $287,500 PA

Job Summary

Job Description

NVIDIA seeks Senior Solutions Architects for Generative AI Inference. Responsibilities include partnering with internal and external teams to build AI solutions using NVIDIA's technology, engaging with developers and researchers, working with key customers, and analyzing deep learning inference performance. This role requires strong expertise in Deep Learning frameworks (PyTorch, TensorFlow), Large Language Models, and GPU technologies. The ideal candidate will possess excellent communication, problem-solving, and collaboration skills and experience deploying/optimizing DL inference in production. The role involves working on exciting projects and proof-of-concepts focused on Generative AI and Large Language Models and some travel.
Must have:
  • 5+ years Deep Learning experience
  • PyTorch/TensorFlow expertise
  • Strong programming & optimization skills
  • LLM & Deep Learning inference knowledge
  • Excellent communication & collaboration
Good to have:
  • NVIDIA GPU & software experience (NeMo, Triton, TensorRT)
  • C/C++ programming skills
  • Parallel programming & distributed computing
  • Experience with large-scale DL training
Perks:
  • Equity
  • Benefits

Job Details

NVIDIA is seeking outstanding AI Solutions Architects to assist and support customers that are building solutions with our newest AI technology. At NVIDIA, our solutions architects work across different teams and enjoy helping customers with the latest Accelerated Computing and Deep Learning software and hardware platforms. We're looking to grow our company, and build our teams with the smartest people in the world. Would you like to join us at the forefront of technological advancement? You will become a trusted technical advisor with our customers and work on exciting projects and proof-of-concepts focused on Generative AI and Large Language Models. You will also collaborate with a diverse set of internal teams on performance analysis and modeling of inference software. You should be comfortable working in a dynamic environment, and have experience with Generative AI, Large Language Models, Deep Learning and GPU technologies. This role is an excellent opportunity to work in an interdisciplinary team with the latest technologies at NVIDIA!


What You Will Be Doing:

  • Partnering with other solution architects, engineering, product and business teams. Understanding their strategies and technical needs and helping define high-value solutions

  • Dynamically engaging with developers, scientific researchers, data scientists, which will give you experience across a range of technical areas

  • Strategically partnering with lighthouse customers and industry-specific solution partners targeting our computing platform

  • Working closely with customers to help them adopt and build solutions using NVIDIA technology

  • Analyze performance and power efficiency of deep learning inference workloads

  • Some travel to conferences and customers may be required


What We Need To See:

  • BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, other Engineering or related fields (or equivalent experience)

  • 5+ years of hands-on experience with Deep Learning frameworks such as PyTorch and TensorFlow

  • Strong fundamentals in programming, optimizations and software design, especially in Python

  • Strong problem-solving and debugging skills

  • Excellent knowledge of theory and practice of Large Language Models and Deep Learning inference

  • Excellent presentation, communication and collaboration skills

  • Desire to be involved in multiple diverse and creative projects


Ways To Stand Out From The Crowd:

  • Experience with NVIDIA GPUs and software libraries, such as NVIDIA NeMo Framework, NVIDIA Triton Inference Server, TensorRT, TensorRT-LLM

  • Excellent C/C++ programming skills, including debugging, profiling, code optimization, performance analysis, and test design

  • Familiarity with parallel programming and distributed computing platforms

  • Prior experience with DL training at scale, deploying or optimizing DL inference in production

The base salary range is 148,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

ByteDance - Research Scientist, Multimodality

ByteDance

Seattle, Washington, United States (On-Site)
6 Months ago
Altagram Group - Data Science Internship/Workstudent

Altagram Group

Germany (On-Site)
1 Month ago
Tesla - Senior Machine Learning, AI Engineer

Tesla

Brandenburg, Germany (On-Site)
2 Months ago
Nintendo - Intern – Machine Learning Software Engineer (NTD)

Nintendo

Redmond, Washington, United States (On-Site)
5 Months ago
Netflix - Machine Learning Intern, Summer 2025

Netflix

Los Gatos, California, United States (On-Site)
3 Months ago
ION - AI Engineer - Graduate Development Program

ION

Pisa, Tuscany, Italy (On-Site)
6 Months ago
Glean - Software Engineer, Machine Learning (India)

Glean

Bengaluru, Karnataka, India (On-Site)
6 Months ago
Interface AI - Senior Account Manager

Interface AI

United States (Remote)
2 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

ByteDance - Software Engineer Intern (Doubao (Seed) - Machine Learning System) - 2025 Summer (PhD)

ByteDance

Seattle, Washington, United States (On-Site)
6 Months ago
ByteDance - Research Scientist in Foundation Model, Music Core Machine Learning Graduates - 2024 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
Arrise Solutions (India)   - Lead ML Engineer

Arrise Solutions (India)

Hyderabad, Telangana, India (On-Site)
7 Months ago
NVIDIA - Senior DevOps Engineer, Deep Learning Frameworks

NVIDIA

Santa Clara, California, United States (On-Site)
3 Months ago
Electronic Arts - Senior Software Engineer

Electronic Arts

Austin, Texas, United States (On-Site)
1 Month ago
Truecaller - Senior ML Engineer

Truecaller

Stockholm, Stockholm County, Sweden (On-Site)
5 Months ago
Krafton  - Applied Research Engineer - Reinforcement Learning

Krafton

Seoul, South Korea (On-Site)
1 Month ago
ByteDance - Machine Learning Engineer-Model Serving Infrastructure (AML-Engine)

ByteDance

Seattle, Washington, United States (On-Site)
6 Months ago
NVIDIA - Senior Solutions Architect, HPC and AI

NVIDIA

Santa Clara, California, United States (Hybrid)
1 Month ago
Rackspace Technology - Senior Machine Learning Engineer

Rackspace Technology

Vietnam (Remote)
2 Months ago

Get notifed when new similar jobs are uploaded

Jobs in California, United States

The Walt Disney Company - Lead Software Engineer (Front End/JavaScript)

The Walt Disney Company

Bristol, Connecticut, United States (On-Site)
5 Months ago
Meta - Software Engineer, Machine Learning

Meta

Mountain View, California, United States (On-Site)
5 Months ago
Patreon - Benefits Manager

Patreon

New York, New York, United States (Hybrid)
1 Month ago
WebMD - Associate Director, Marketing

WebMD

Newark, New Jersey, United States (On-Site)
6 Months ago
NVIDIA - Senior Signal Integrity Design Engineer

NVIDIA

Santa Clara, California, United States (On-Site)
3 Months ago
ByteDance - Software Development Engineer, Network Automation - Seattle

ByteDance

Seattle, Washington, United States (On-Site)
6 Months ago
Patel greene - PD&E Project Manager

Patel greene

Tallahassee, Florida, United States (On-Site)
6 Months ago
CloudHire - Pathology Assistant Reallocation Opportunity

CloudHire

Pennsylvania, United States (On-Site)
1 Month ago
Games For Love - Scholars Program Coordinator

Games For Love

Washington, United States (Remote)
1 Month ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Google - Senior Software Engineer, AI/ML GenAI, Google Cloud AI

Google

Sunnyvale, California, United States (On-Site)
5 Months ago
Google - Senior Software Engineer, Machine Learning, Google Ads

Google

Mountain View, California, United States (On-Site)
5 Months ago
NVIDIA - Senior Field Application Engineer

NVIDIA

Durham, North Carolina, United States (On-Site)
3 Months ago
Canva - GenAI Research Engineering Manager - Image Generation (m/f/x) - Canva Austria

Canva

Vienna, Vienna, Austria (Remote)
5 Months ago
Soul AI - Subject Matter Expert (AI Trainer)

Soul AI

Hyderabad, Telangana, India (On-Site)
7 Months ago
Inworld AI - Staff / Principal Machine Learning Engineer - USA

Inworld AI

Mountain View, California, United States (Remote)
5 Months ago
Google - Senior Software Engineer, Machine Learning, Google Cloud Compute

Google

Sunnyvale, California, United States (On-Site)
5 Months ago
Ubisoft - Senior ML Data Scientist

Ubisoft

Montreal, Quebec, Canada (On-Site)
3 Months ago
Netflix - Software Engineer L4/L5, Training Platform, Machine Learning Platform

Netflix

California, United States (Remote)
3 Months ago
Lionbridge Games - Games Language AI Specialist (Linguist)

Lionbridge Games

Masovian Voivodeship, Poland (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Massachusetts, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Texas, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug