Senior Solutions Architect, Generative AI - Inference

NVIDIA

Texas, United States (Remote)

Posted 4 months ago • 5+ years of experience • $148,000 - $287,500 per year

Job Summary

Job Description

NVIDIA seeks Senior Solutions Architects specializing in Generative AI inference to support customers building solutions with NVIDIA's AI technology. The role involves partnering with internal teams and external customers to define and implement high-value solutions on NVIDIA's accelerated computing and deep learning platforms. Responsibilities include engaging with developers and researchers, collaborating on performance analysis, and working on proof-of-concepts focused on Generative AI and Large Language Models. The ideal candidate has strong deep learning expertise (PyTorch, TensorFlow), excellent communication skills, and experience with relevant NVIDIA software libraries (NeMo, Triton Inference Server, TensorRT).

Must have:

5+ years Deep Learning experience
Proficiency in PyTorch/TensorFlow
Strong programming & optimization skills (Python)
Excellent problem-solving & debugging
Expertise in LLMs & Deep Learning inference

Good to have:

Experience with NVIDIA GPUs and software (NeMo, Triton, TensorRT)
C/C++ programming skills
Parallel programming & distributed computing experience
Experience with large-scale DL training and inference deployment

Perks:

Equity
Benefits

8 skills required

unity

tensorflow

lighthouse

deep-learning

python

pytorch

problem-solving

performance-analysis

Job Details

NVIDIA is seeking outstanding AI Solutions Architects to assist and support customers that are building solutions with our newest AI technology. At NVIDIA, our solutions architects work across different teams and enjoy helping customers with the latest Accelerated Computing and Deep Learning software and hardware platforms. We're looking to grow our company, and build our teams with the smartest people in the world. Would you like to join us at the forefront of technological advancement? You will become a trusted technical advisor with our customers and work on exciting projects and proof-of-concepts focused on Generative AI and Large Language Models. You will also collaborate with a diverse set of internal teams on performance analysis and modeling of inference software. You should be comfortable working in a dynamic environment, and have experience with Generative AI, Large Language Models, Deep Learning and GPU technologies. This role is an excellent opportunity to work in an interdisciplinary team with the latest technologies at NVIDIA!

What You Will Be Doing:

Partnering with other solution architects and with engineering, product, and business teams to understand their strategies and technical needs and help define high-value solutions
Dynamically engaging with developers, scientific researchers, and data scientists, which will give you experience across a range of technical areas
Strategically partnering with lighthouse customers and industry-specific solution partners targeting our computing platform
Working closely with customers to help them adopt and build solutions using NVIDIA technology
Analyzing performance and power efficiency of deep learning inference workloads
Some travel to conferences and customers may be required

What We Need To See:

BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, other Engineering or related fields (or equivalent experience)
5+ years of hands-on experience with Deep Learning frameworks such as PyTorch and TensorFlow
Strong fundamentals in programming, optimizations and software design, especially in Python
Strong problem-solving and debugging skills
Excellent knowledge of theory and practice of Large Language Models and Deep Learning inference
Excellent presentation, communication and collaboration skills
Desire to be involved in multiple diverse and creative projects

Ways To Stand Out From The Crowd:

Experience with NVIDIA GPUs and software libraries, such as NVIDIA NeMo Framework, NVIDIA Triton Inference Server, TensorRT, TensorRT-LLM
Excellent C/C++ programming skills, including debugging, profiling, code optimization, performance analysis, and test design
Familiarity with parallel programming and distributed computing platforms
Prior experience with DL training at scale, deploying or optimizing DL inference in production

The base salary range is 148,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

About The Company

NVIDIA

457 Active Jobs

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.
