Senior Solutions Architect, Generative AI - Inference

1 Month ago • 5 Years + • Artificial Intelligence • $148,000 PA - $287,500 PA

Job Summary

Job Description

NVIDIA seeks Senior Solutions Architects specializing in Generative AI inference to support customers building solutions using NVIDIA's AI technology. Responsibilities include partnering with internal and external teams, understanding customer needs, defining solutions, engaging with developers and researchers, collaborating on performance analysis and modeling, and assisting customers in adopting NVIDIA technology. The role requires strong experience with deep learning frameworks (PyTorch, TensorFlow), large language models, and GPU technologies. Some travel may be required.
Must have:
  • 5+ years Deep Learning experience
  • PyTorch/TensorFlow expertise
  • Strong programming & optimization skills
  • LLM and Deep Learning inference knowledge
  • Excellent communication & collaboration
Good to have:
  • NVIDIA GPU & software experience (NeMo, Triton, TensorRT)
  • C/C++ programming skills
  • Parallel programming & distributed computing
  • DL training & inference deployment experience
Perks:
  • Equity
  • Benefits

Job Details

NVIDIA is seeking outstanding AI Solutions Architects to assist and support customers that are building solutions with our newest AI technology. At NVIDIA, our solutions architects work across different teams and enjoy helping customers with the latest Accelerated Computing and Deep Learning software and hardware platforms. We're looking to grow our company and build our teams with the smartest people in the world. Would you like to join us at the forefront of technological advancement? You will become a trusted technical advisor to our customers and work on exciting projects and proofs of concept focused on Generative AI and Large Language Models. You will also collaborate with a diverse set of internal teams on performance analysis and modeling of inference software. You should be comfortable working in a dynamic environment and have experience with Generative AI, Large Language Models, Deep Learning, and GPU technologies. This role is an excellent opportunity to work in an interdisciplinary team with the latest technologies at NVIDIA!


What You Will Be Doing:

  • Partnering with other solutions architects and with engineering, product, and business teams to understand their strategies and technical needs and help define high-value solutions

  • Engaging with developers, scientific researchers, and data scientists, which will give you experience across a range of technical areas

  • Strategically partnering with lighthouse customers and industry-specific solution partners targeting our computing platform

  • Working closely with customers to help them adopt and build solutions using NVIDIA technology

  • Analyzing the performance and power efficiency of deep learning inference workloads

  • Some travel to conferences and customers may be required


What We Need To See:

  • BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, other Engineering or related fields (or equivalent experience)

  • 5+ years of hands-on experience with Deep Learning frameworks such as PyTorch and TensorFlow

  • Strong fundamentals in programming, optimizations and software design, especially in Python

  • Strong problem-solving and debugging skills

  • Excellent knowledge of theory and practice of Large Language Models and Deep Learning inference

  • Excellent presentation, communication and collaboration skills

  • Desire to be involved in multiple diverse and creative projects


Ways To Stand Out From The Crowd:

  • Experience with NVIDIA GPUs and software libraries, such as NVIDIA NeMo Framework, NVIDIA Triton Inference Server, TensorRT, TensorRT-LLM

  • Excellent C/C++ programming skills, including debugging, profiling, code optimization, performance analysis, and test design

  • Familiarity with parallel programming and distributed computing platforms

  • Prior experience with DL training at scale, or with deploying or optimizing DL inference in production

The base salary range is 148,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

