Senior Solutions Architect, Generative AI - Inference

1 Month ago • 5 Years + • Artificial Intelligence • $148,000 PA - $287,500 PA

Job Summary

Job Description

NVIDIA seeks Senior Solutions Architects for Generative AI Inference. Responsibilities include partnering with internal and external teams to build AI solutions using NVIDIA's technology, engaging with developers and researchers, working with key customers, and analyzing deep learning inference performance. This role requires strong expertise in Deep Learning frameworks (PyTorch, TensorFlow), Large Language Models, and GPU technologies. The ideal candidate will possess excellent communication, problem-solving, and collaboration skills and experience deploying/optimizing DL inference in production. The role involves working on exciting projects and proof-of-concepts focused on Generative AI and Large Language Models and some travel.
Must have:
  • 5+ years Deep Learning experience
  • PyTorch/TensorFlow expertise
  • Strong programming & optimization skills
  • LLM & Deep Learning inference knowledge
  • Excellent communication & collaboration
Good to have:
  • NVIDIA GPU & software experience (NeMo, Triton, TensorRT)
  • C/C++ programming skills
  • Parallel programming & distributed computing
  • Experience with large-scale DL training
Perks:
  • Equity
  • Benefits

Job Details

NVIDIA is seeking outstanding AI Solutions Architects to assist and support customers that are building solutions with our newest AI technology. At NVIDIA, our solutions architects work across different teams and enjoy helping customers with the latest Accelerated Computing and Deep Learning software and hardware platforms. We're looking to grow our company, and build our teams with the smartest people in the world. Would you like to join us at the forefront of technological advancement? You will become a trusted technical advisor with our customers and work on exciting projects and proof-of-concepts focused on Generative AI and Large Language Models. You will also collaborate with a diverse set of internal teams on performance analysis and modeling of inference software. You should be comfortable working in a dynamic environment, and have experience with Generative AI, Large Language Models, Deep Learning and GPU technologies. This role is an excellent opportunity to work in an interdisciplinary team with the latest technologies at NVIDIA!


What You Will Be Doing:

  • Partnering with other solution architects, engineering, product and business teams. Understanding their strategies and technical needs and helping define high-value solutions

  • Dynamically engaging with developers, scientific researchers, data scientists, which will give you experience across a range of technical areas

  • Strategically partnering with lighthouse customers and industry-specific solution partners targeting our computing platform

  • Working closely with customers to help them adopt and build solutions using NVIDIA technology

  • Analyze performance and power efficiency of deep learning inference workloads

  • Some travel to conferences and customers may be required


What We Need To See:

  • BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, other Engineering or related fields (or equivalent experience)

  • 5+ years of hands-on experience with Deep Learning frameworks such as PyTorch and TensorFlow

  • Strong fundamentals in programming, optimizations and software design, especially in Python

  • Strong problem-solving and debugging skills

  • Excellent knowledge of theory and practice of Large Language Models and Deep Learning inference

  • Excellent presentation, communication and collaboration skills

  • Desire to be involved in multiple diverse and creative projects


Ways To Stand Out From The Crowd:

  • Experience with NVIDIA GPUs and software libraries, such as NVIDIA NeMo Framework, NVIDIA Triton Inference Server, TensorRT, TensorRT-LLM

  • Excellent C/C++ programming skills, including debugging, profiling, code optimization, performance analysis, and test design

  • Familiarity with parallel programming and distributed computing platforms

  • Prior experience with DL training at scale, deploying or optimizing DL inference in production

The base salary range is 148,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

ByteDance - Student Researcher (Doubao (Seed) - Foundation Model - Speech & Audio) - 2025 Start (PhD)

ByteDance

Seattle, Washington, United States (On-Site)
• 3 Months ago
NVIDIA - Senior Video Compression Architect

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
• 1 Month ago
Activision - 2025 US Summer Internship - Data Analytics & Data Science

Activision

Irvine, California, United States (On-Site)
• 4 Weeks ago
NeST Digital - 1730 - Data Scientist

NeST Digital

Bengaluru, Karnataka, India (On-Site)
• 3 Months ago
Paypal - Senior AI Machine Learning Engineer

Paypal

San Jose, California, United States (On-Site)
• 4 Months ago
Meta - Visiting Senior Research Scientist

Meta

Paris, ÃŽle-de-France, France (On-Site)
• 3 Months ago
Meta - Software Engineer, Systems ML - SW/HW Co-design

Meta

Redmond, Washington, United States (On-Site)
• 3 Months ago
Social Discovery Group - Senior NLP Engineer

Social Discovery Group

Serbia (Remote)
• 3 Months ago
Stonewall Collision & Auto Painting - Lead Data Scientist

Stonewall Collision & Auto Painting

Hyderabad, Telangana, India (On-Site)
• 5 Months ago
Interface AI - Technical Delivery Manager

Interface AI

Hyderabad, Telangana, India (Remote)
• 6 Days ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

GlobalLogic - Data Scientist IRC241434

GlobalLogic

Hyderabad, Telangana, India (On-Site)
• 5 Months ago
NVIDIA - Solution Architect - OEM AI Software

NVIDIA

Texas, United States (Remote)
• 3 Weeks ago
Meta - Research Scientist Intern, Smart Glasses in Wearables AI (PhD)

Meta

Washington, District Of Columbia, United States (On-Site)
• 3 Months ago
Epic Games - Research Programmer

Epic Games

Vancouver, British Columbia, Canada (On-Site)
• 1 Month ago
Armada - Senior Data Engineer

Armada

Thiruvananthapuram, Kerala, India (On-Site)
• 4 Months ago
Genies - Machine Learning Engineer: 3D Generative AI

Genies

San Mateo, California, United States (Remote)
• 3 Months ago
Aera Technology - Senior Data Scientist

Aera Technology

Pune, Maharashtra, India (On-Site)
• 4 Months ago
ByteDance - Machine Learning Engineer-Model Serving Infrastructure (AML-Engine)

ByteDance

San Jose, California, United States (On-Site)
• 3 Months ago
Scale AI - Machine Learning Engineer, International Public Sector

Scale AI

United Kingdom (On-Site)
• 4 Months ago

Get notifed when new similar jobs are uploaded

Jobs in California, United States

Visa - Principle Data Scientist, Visa Rules, Compliance and Standards (RCS)

Visa

Austin, Texas, United States (Hybrid)
• 2 Months ago
Tencent - Principal Researcher: Artificial General Intelligence (Audio, Speech and Multimodal Processing)

Tencent

Bellevue, Washington, United States (On-Site)
• 5 Months ago
Nintendo - Engineer, Electro Mechanical (NTD)

Nintendo

Redmond, Washington, United States (On-Site)
• 4 Months ago
The Walt Disney Company - Lead Software Engineer (Roku Engineer)

The Walt Disney Company

New York, New York, United States (On-Site)
• 3 Months ago
Zones - Director, Global Field Services

Zones

Texas, United States (On-Site)
• 1 Month ago
Salt AI - Sr. QA Automation Engineer

Salt AI

Los Angeles, California, United States (Remote)
• 7 Months ago
Nintendo - Security Engineer

Nintendo

Redmond, Washington, United States (Hybrid)
• 2 Months ago
ByteDance - Senior/Tech Lead Software Development Engineer, Network Monitoring & Alerts - San Jose

ByteDance

San Jose, California, United States (On-Site)
• 3 Months ago
Saviynt - Sr. Director (Application Access Governance) -  Governance Risk & Compliance

Saviynt

Atlanta, Georgia, United States (Hybrid)
• 4 Months ago
Trek - Staff Engineer

Trek

Waterloo, Wisconsin, United States (On-Site)
• 3 Weeks ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

NVIDIA - Customer Program Manager

NVIDIA

Taipei City, Taiwan (On-Site)
• 1 Month ago
Zoox - Senior Software Engineer - Perception

Zoox

Foster City, California, United States (Hybrid)
• 4 Months ago
Spell Brush - AI Anime Researcher

Spell Brush

San Francisco, California, United States (On-Site)
• 4 Months ago
Meta - Research Scientist Intern, Smart Glasses in Wearables AI (PhD)

Meta

New York, New York, United States (On-Site)
• 3 Months ago
Google - Senior Software Engineer, Machine Learning, Google Ads

Google

(On-Site)
• 2 Months ago
Google - Software Engineer III, Machine Learning, Google Ads

Google

Mountain View, California, United States (On-Site)
• 3 Months ago
ByteDance - Product Solution Architect, Volcano ARK (Singapore)

ByteDance

Singapore (On-Site)
• 3 Months ago
Social Discovery Group - Senior NLP Engineer

Social Discovery Group

Georgia (Remote)
• 3 Months ago
Tech Mahindra - Computational Linguist

Tech Mahindra

Hyderabad, Telangana, India (On-Site)
• 5 Months ago
Soul AI - Subject Matter Expert (AI Trainer)

Soul AI

Hyderabad, Telangana, India (On-Site)
• 5 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.


Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Shenzhen, Guangdong Province, China (On-Site)

Bengaluru, Karnataka, India (On-Site)

Taipei City, Taiwan (On-Site)

Taipei City, Taiwan (On-Site)

Shanghai, Shanghai, China (On-Site)

Shanghai, Shanghai, China (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug