Senior Solutions Architect, Generative AI - Inference

2 Months ago • 5 Years + • Artificial Intelligence • $148,000 PA - $287,500 PA

Job Summary

Job Description

NVIDIA seeks Senior Solutions Architects for Generative AI Inference. Responsibilities include partnering with internal and external teams to build AI solutions using NVIDIA's technology, engaging with developers and researchers, working with key customers, and analyzing deep learning inference performance. This role requires strong expertise in Deep Learning frameworks (PyTorch, TensorFlow), Large Language Models, and GPU technologies. The ideal candidate will possess excellent communication, problem-solving, and collaboration skills and experience deploying/optimizing DL inference in production. The role involves working on exciting projects and proof-of-concepts focused on Generative AI and Large Language Models and some travel.
Must have:
  • 5+ years Deep Learning experience
  • PyTorch/TensorFlow expertise
  • Strong programming & optimization skills
  • LLM & Deep Learning inference knowledge
  • Excellent communication & collaboration
Good to have:
  • NVIDIA GPU & software experience (NeMo, Triton, TensorRT)
  • C/C++ programming skills
  • Parallel programming & distributed computing
  • Experience with large-scale DL training
Perks:
  • Equity
  • Benefits

Job Details

NVIDIA is seeking outstanding AI Solutions Architects to assist and support customers that are building solutions with our newest AI technology. At NVIDIA, our solutions architects work across different teams and enjoy helping customers with the latest Accelerated Computing and Deep Learning software and hardware platforms. We're looking to grow our company, and build our teams with the smartest people in the world. Would you like to join us at the forefront of technological advancement? You will become a trusted technical advisor with our customers and work on exciting projects and proof-of-concepts focused on Generative AI and Large Language Models. You will also collaborate with a diverse set of internal teams on performance analysis and modeling of inference software. You should be comfortable working in a dynamic environment, and have experience with Generative AI, Large Language Models, Deep Learning and GPU technologies. This role is an excellent opportunity to work in an interdisciplinary team with the latest technologies at NVIDIA!


What You Will Be Doing:

  • Partnering with other solution architects, engineering, product and business teams. Understanding their strategies and technical needs and helping define high-value solutions

  • Dynamically engaging with developers, scientific researchers, data scientists, which will give you experience across a range of technical areas

  • Strategically partnering with lighthouse customers and industry-specific solution partners targeting our computing platform

  • Working closely with customers to help them adopt and build solutions using NVIDIA technology

  • Analyze performance and power efficiency of deep learning inference workloads

  • Some travel to conferences and customers may be required


What We Need To See:

  • BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, other Engineering or related fields (or equivalent experience)

  • 5+ years of hands-on experience with Deep Learning frameworks such as PyTorch and TensorFlow

  • Strong fundamentals in programming, optimizations and software design, especially in Python

  • Strong problem-solving and debugging skills

  • Excellent knowledge of theory and practice of Large Language Models and Deep Learning inference

  • Excellent presentation, communication and collaboration skills

  • Desire to be involved in multiple diverse and creative projects


Ways To Stand Out From The Crowd:

  • Experience with NVIDIA GPUs and software libraries, such as NVIDIA NeMo Framework, NVIDIA Triton Inference Server, TensorRT, TensorRT-LLM

  • Excellent C/C++ programming skills, including debugging, profiling, code optimization, performance analysis, and test design

  • Familiarity with parallel programming and distributed computing platforms

  • Prior experience with DL training at scale, deploying or optimizing DL inference in production

The base salary range is 148,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

PwC - IN-Senior Associate_ML Engineer_Data and Analytics_Advisory_Bangalore

PwC

Bengaluru, Karnataka, India (On-Site)
5 Months ago
Ubisoft - Senior ML Data Scientist

Ubisoft

Montreal, Quebec, Canada (On-Site)
2 Months ago
Netflix - Machine Learning Software Engineer L4/L5

Netflix

United States (Remote)
2 Months ago
DraftKings - Director of Data Science

DraftKings

Boston, Massachusetts, United States (On-Site)
2 Weeks ago
Ubisoft - Scientifique en données ML Senior _ Groupe Technologique Content Creation

Ubisoft

Montreal, Quebec, Canada (On-Site)
2 Months ago
Google - Software Engineer III, AI/ML

Google

Bengaluru, Karnataka, India (On-Site)
3 Months ago
Dolby Laboratories - AIOps Research Scientist

Dolby Laboratories

Bengaluru, Karnataka, India (Hybrid)
5 Months ago
GoTo Group - Lead Data Scientist - KYC

GoTo Group

Singapore (On-Site)
3 Months ago
Inworld AI - AI Trainer (Contractor) - Writing & Gaming

Inworld AI

Vancouver, British Columbia, Canada (Remote)
1 Week ago
ASSIST Software - Other Positions

ASSIST Software

Suceava, Suceava County, Romania (Remote)
4 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

ByteDance - Machine Learning Engineer Intern (Knowledge Graph) - 2024 Start (PhD)

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago
NVIDIA - Solutions Architect, Financial Services

NVIDIA

New York, New York, United States (Remote)
2 Months ago
Great Learning - Data Scientist

Great Learning

Bengaluru, Karnataka, India (On-Site)
6 Months ago
ByteDance - Research Scientist, Multimodality

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago
Truecaller - Senior MLOps Engineer

Truecaller

Stockholm, Stockholm County, Sweden (On-Site)
4 Months ago
Altagram Group - Data Science Internship/Work Student

Altagram Group

Germany (On-Site)
2 Months ago
ByteDance - Software Engineer Intern (Machine Learning Platform) - 2024 Summer (PhD)

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago
NVIDIA - Software Engineer Intern, Perception - Autonomous Vehicles - 2025

NVIDIA

Shanghai, Shanghai, China (On-Site)
2 Months ago
Lucid Reality Labs - ML/AI Engineer

Lucid Reality Labs

Poland (Remote)
6 Days ago

Get notifed when new similar jobs are uploaded

Jobs in California, United States

PlayStation Global - Senior Program Manager, Ecommerce

PlayStation Global

Carlsbad, California, United States (On-Site)
6 Days ago
Google - Senior Software Engineer, Infrastructure, Core

Google

Sunnyvale, California, United States (On-Site)
3 Months ago
ByteDance - Software Developer (Routing Verification & Emulation)

ByteDance

Seattle, Washington, United States (On-Site)
1 Week ago
AGBO - Senior Rigging Technical Director

AGBO

Los Angeles, California, United States (On-Site)
7 Months ago
Activision - Principal Governance, Risk & Compliance Engineer

Activision

United States (Remote)
1 Week ago
Team Liquid - Freelance Marketing Manager

Team Liquid

Los Angeles, California, United States (Remote)
6 Months ago
Hawk Eye Innovations - College Sports Systems Technician

Hawk Eye Innovations

Nebraska, United States (On-Site)
5 Days ago
Crunchyroll - Implementation Manager

Crunchyroll

Dallas, Texas, United States (Hybrid)
4 Weeks ago
Axon - Senior Technical Program Manager, AI

Axon

Seattle, Washington, United States (Remote)
1 Month ago
Pika - Product Design Lead

Pika

Palo Alto, California, United States (On-Site)
9 Months ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Canva - Senior Machine Learning Engineer - Photo AI

Canva

Vienna, Vienna, Austria (Remote)
2 Months ago
Interface AI - Technical Customer Success Manager

Interface AI

United States (Remote)
1 Month ago
NVIDIA - Senior Solutions Architect - Generative AI

NVIDIA

Bengaluru, Karnataka, India (On-Site)
2 Months ago
ByteDance - Research Scientist Intern (Doubao (Seed) - Machine Learning System) - 2025 Summer (PhD)

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago
NVIDIA - Machine Learning Software Platform Architect

NVIDIA

Shanghai, Shanghai, China (Hybrid)
2 Months ago
A-Team - AI Strategy Lead

A-Team

New York, New York, United States (Hybrid)
11 Hours ago
Lionbridge Games - Language AI Specialist (Test & Tech)

Lionbridge Games

Masovian Voivodeship, Poland (On-Site)
1 Month ago
PlayStation Global - Senior Machine Learning Software Engineer

PlayStation Global

United States (Remote)
6 Days ago
Onward Search - API Developer

Onward Search

North Arlington, New Jersey, United States (Remote)
1 Month ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.


Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Ra'anana, Center District, Israel (On-Site)

Ra'anana, Center District, Israel (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug