Senior Solutions Architect, Generative AI - Inference

2 Months ago • 5 Years + • Artificial Intelligence • $148,000 PA - $287,500 PA

Job Summary

Job Description

NVIDIA seeks Senior Solutions Architects specializing in Generative AI inference to support customers building solutions with NVIDIA's AI technology. This role involves partnering with various internal teams and external customers to define and implement high-value solutions leveraging NVIDIA's accelerated computing and deep learning platforms. Responsibilities include engaging with developers and researchers, collaborating on performance analysis, and working on proof-of-concepts focused on Generative AI and Large Language Models. The ideal candidate possesses strong deep learning expertise (PyTorch, TensorFlow), excellent communication skills, and experience with relevant NVIDIA software libraries (NeMo, Triton Inference Server, TensorRT).
Must have:
  • 5+ years Deep Learning experience
  • Proficiency in PyTorch/TensorFlow
  • Strong programming & optimization skills (Python)
  • Excellent problem-solving & debugging
  • Expertise in LLMs & Deep Learning inference
Good to have:
  • Experience with NVIDIA GPUs and software (NeMo, Triton, TensorRT)
  • C/C++ programming skills
  • Parallel programming & distributed computing experience
  • Experience with large-scale DL training and inference deployment
Perks:
  • Equity
  • Benefits

Job Details

NVIDIA is seeking outstanding AI Solutions Architects to assist and support customers that are building solutions with our newest AI technology. At NVIDIA, our solutions architects work across different teams and enjoy helping customers with the latest Accelerated Computing and Deep Learning software and hardware platforms. We're looking to grow our company, and build our teams with the smartest people in the world. Would you like to join us at the forefront of technological advancement? You will become a trusted technical advisor to our customers and work on exciting projects and proof-of-concepts focused on Generative AI and Large Language Models. You will also collaborate with a diverse set of internal teams on performance analysis and modeling of inference software. You should be comfortable working in a dynamic environment, and have experience with Generative AI, Large Language Models, Deep Learning and GPU technologies. This role is an excellent opportunity to work in an interdisciplinary team with the latest technologies at NVIDIA!


What You Will Be Doing:

  • Partnering with other solution architects, engineering, product and business teams. Understanding their strategies and technical needs and helping define high-value solutions

  • Dynamically engaging with developers, scientific researchers, and data scientists, which will give you experience across a range of technical areas

  • Strategically partnering with lighthouse customers and industry-specific solution partners targeting our computing platform

  • Working closely with customers to help them adopt and build solutions using NVIDIA technology

  • Analyzing the performance and power efficiency of deep learning inference workloads

  • Some travel to conferences and customers may be required


What We Need To See:

  • BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, other Engineering or related fields (or equivalent experience)

  • 5+ years of hands-on experience with Deep Learning frameworks such as PyTorch and TensorFlow

  • Strong fundamentals in programming, optimizations and software design, especially in Python

  • Strong problem-solving and debugging skills

  • Excellent knowledge of theory and practice of Large Language Models and Deep Learning inference

  • Excellent presentation, communication and collaboration skills

  • Desire to be involved in multiple diverse and creative projects


Ways To Stand Out From The Crowd:

  • Experience with NVIDIA GPUs and software libraries, such as NVIDIA NeMo Framework, NVIDIA Triton Inference Server, TensorRT, TensorRT-LLM

  • Excellent C/C++ programming skills, including debugging, profiling, code optimization, performance analysis, and test design

  • Familiarity with parallel programming and distributed computing platforms

  • Prior experience with DL training at scale, deploying or optimizing DL inference in production

The base salary range is 148,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.





About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

