Senior Solutions Architect, Generative AI - Inference

1 Month ago • 5 Years + • Artificial Intelligence • $148,000 PA - $287,500 PA

Job Summary

Job Description

NVIDIA seeks Senior Solutions Architects for Generative AI Inference. Responsibilities include partnering with internal and external teams to build AI solutions using NVIDIA's technology, engaging with developers and researchers, working with key customers, and analyzing deep learning inference performance. This role requires strong expertise in Deep Learning frameworks (PyTorch, TensorFlow), Large Language Models, and GPU technologies. The ideal candidate will possess excellent communication, problem-solving, and collaboration skills and experience deploying/optimizing DL inference in production. The role involves working on exciting projects and proof-of-concepts focused on Generative AI and Large Language Models and some travel.
Must have:
  • 5+ years Deep Learning experience
  • PyTorch/TensorFlow expertise
  • Strong programming & optimization skills
  • LLM & Deep Learning inference knowledge
  • Excellent communication & collaboration
Good to have:
  • NVIDIA GPU & software experience (NeMo, Triton, TensorRT)
  • C/C++ programming skills
  • Parallel programming & distributed computing
  • Experience with large-scale DL training
Perks:
  • Equity
  • Benefits

Job Details

NVIDIA is seeking outstanding AI Solutions Architects to assist and support customers that are building solutions with our newest AI technology. At NVIDIA, our solutions architects work across different teams and enjoy helping customers with the latest Accelerated Computing and Deep Learning software and hardware platforms. We're looking to grow our company, and build our teams with the smartest people in the world. Would you like to join us at the forefront of technological advancement? You will become a trusted technical advisor with our customers and work on exciting projects and proof-of-concepts focused on Generative AI and Large Language Models. You will also collaborate with a diverse set of internal teams on performance analysis and modeling of inference software. You should be comfortable working in a dynamic environment, and have experience with Generative AI, Large Language Models, Deep Learning and GPU technologies. This role is an excellent opportunity to work in an interdisciplinary team with the latest technologies at NVIDIA!


What You Will Be Doing:

  • Partnering with other solution architects, engineering, product and business teams. Understanding their strategies and technical needs and helping define high-value solutions

  • Dynamically engaging with developers, scientific researchers, data scientists, which will give you experience across a range of technical areas

  • Strategically partnering with lighthouse customers and industry-specific solution partners targeting our computing platform

  • Working closely with customers to help them adopt and build solutions using NVIDIA technology

  • Analyze performance and power efficiency of deep learning inference workloads

  • Some travel to conferences and customers may be required


What We Need To See:

  • BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, other Engineering or related fields (or equivalent experience)

  • 5+ years of hands-on experience with Deep Learning frameworks such as PyTorch and TensorFlow

  • Strong fundamentals in programming, optimizations and software design, especially in Python

  • Strong problem-solving and debugging skills

  • Excellent knowledge of theory and practice of Large Language Models and Deep Learning inference

  • Excellent presentation, communication and collaboration skills

  • Desire to be involved in multiple diverse and creative projects


Ways To Stand Out From The Crowd:

  • Experience with NVIDIA GPUs and software libraries, such as NVIDIA NeMo Framework, NVIDIA Triton Inference Server, TensorRT, TensorRT-LLM

  • Excellent C/C++ programming skills, including debugging, profiling, code optimization, performance analysis, and test design

  • Familiarity with parallel programming and distributed computing platforms

  • Prior experience with DL training at scale, deploying or optimizing DL inference in production

The base salary range is 148,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

Microsoft - Senior Applied Scientist- Content Service

Microsoft

Beijing, Beijing, China (On-Site)
• 1 Month ago
Unity - Principal Applied Research Machine Learning Engineer

Unity

Helsinki, Uusimaa, Finland (On-Site)
• 4 Months ago
Salesforce - Principal Data Scientist

Salesforce

Palo Alto, California, United States (On-Site)
• 3 Months ago
Epic Games - Principal Research Scientist

Epic Games

Montreal, Quebec, Canada (On-Site)
• 1 Month ago
ByteDance - Software Researcher/Engineer - Applied Research Center (Infrastructure+AI)

ByteDance

Seattle, Washington, United States (On-Site)
• 3 Months ago
Autodesk - Machine Learning Developer 3D Geometry/ Multi-Modal

Autodesk

Toronto, Ontario, Canada (On-Site)
• 4 Months ago
Scopely - AI Artist (Portrait Specialist)

Scopely

Bengaluru, Karnataka, India (On-Site)
• 5 Days ago
Hyper Verge - Machine Learning Engineer II

Hyper Verge

Bengaluru, Karnataka, India (On-Site)
• 4 Months ago
ByteDance - Student Researcher (Doubao (Seed) - Foundation Model - Video Generation) - 2025 Start (PhD)

ByteDance

San Jose, California, United States (On-Site)
• 3 Months ago
Sphere Entertainment Co - Director Production Technology Innovation

Sphere Entertainment Co

Burbank, California, United States (On-Site)
• 3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Alpha Sense - Join AlphaSense India Talent Community

Alpha Sense

Pune, Maharashtra, India (On-Site)
• 3 Months ago
ByteDance - Algorithm Intern (Video Codec - Realtime Codec Optimizations - Multimedia Streaming) - 2025 Summer (PhD)

ByteDance

San Diego, California, United States (On-Site)
• 2 Months ago
ByteDance - Machine Learning Engineer Intern (Applied Machine Learning-Algorithm) - 2025 Summer/Fall (MS)

ByteDance

San Jose, California, United States (On-Site)
• 3 Months ago
Unity - Principal Data Engineer

Unity

San Francisco, California, United States (On-Site)
• 7 Months ago
Match Group - Machine Learning Engineer (MG AI)

Match Group

Seoul, South Korea (On-Site)
• 3 Months ago
ByteDance - Senior Machine Learning Engineer - AML Algorithm

ByteDance

Seattle, Washington, United States (On-Site)
• 3 Months ago
Visa - Senior Manager Data Science - Visa Consulting & Analytics

Visa

Mumbai, Maharashtra, India (On-Site)
• 4 Months ago
Rackspace Technology - Data Scientist

Rackspace Technology

Alexandria, Alexandria Governorate, Egypt (Remote)
• 1 Month ago
Microsoft - Research Intern - AI-driven Hardware Design

Microsoft

Vancouver, British Columbia, Canada (On-Site)
• 1 Month ago
Recro - Automatic speech Recognition

Recro

Gurugram, Haryana, India (On-Site)
• 4 Months ago

Get notifed when new similar jobs are uploaded

Jobs in California, United States

Trance Ending Films - Freelance VFX Artist (Smoke Compositing)

Trance Ending Films

California, United States (Remote)
• 5 Months ago
PTW - Character Concept Artist - Talent Pool

PTW

United States (Remote)
• 1 Month ago
Nintendo - Intern – Networking Software Engineer (NTD)

Nintendo

Redmond, Washington, United States (On-Site)
• 2 Months ago
NVIDIA - Technical Marketing Engineer

NVIDIA

Santa Clara, California, United States (On-Site)
• 1 Month ago
Pixar Animation Studios - Software Engineer, Tools Internals (Core)

Pixar Animation Studios

Emeryville, California, United States (Hybrid)
• 2 Days ago
Ziff Davis - Director - Sales & Business Development

Ziff Davis

Drexel Hill, Pennsylvania, United States (Remote)
• 3 Months ago
Nintendo - Senior Engineer, Installer (NTD)

Nintendo

Redmond, Washington, United States (On-Site)
• 6 Months ago
Funko - Manager, Creative Marketing Operations

Funko

Washington, United States (On-Site)
• 1 Month ago
Bally's Interactive - Senior Internal Auditor

Bally's Interactive

Jersey City, New Jersey, United States (Hybrid)
• 1 Month ago
Fluence - Director of Planning, Americas

Fluence

Houston, Texas, United States (Hybrid)
• 4 Months ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

ByteDance - Research Scientist Intern (Doubao (Seed) - Machine Learning System) - 2025 Summer (PhD)

ByteDance

San Jose, California, United States (On-Site)
• 3 Months ago
Microsoft - Machine Learning Engineer II

Microsoft

Bengaluru, Karnataka, India (On-Site)
• 4 Weeks ago
PAPAYA - Data Scientist

PAPAYA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
• 6 Days ago
Thatgamecompany - Machine Learning Engineer

Thatgamecompany

United States (Remote)
• 5 Months ago
Stonewall Collision & Auto Painting - Data Scientist

Stonewall Collision & Auto Painting

Hyderabad, Telangana, India (On-Site)
• 5 Months ago
Flutter Entertainment - Lead Data Scientist

Flutter Entertainment

Hyderabad, Telangana, India (Hybrid)
• 3 Months ago
HP - AI Lab – ML Engineer, Model Optimization

HP

Sant Cugat Del Vallès, Catalonia, Spain (On-Site)
• 5 Months ago
PwC - Risk Services - AI Solution Specialist

PwC

Singapore (On-Site)
• 4 Months ago
CloudHire - Technical Lead / Technical Project Manager

CloudHire

Noida, Uttar Pradesh, India (Hybrid)
• 4 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.


Yokne'am Illit, North District, Israel (On-Site)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (On-Site)

United States (Remote)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Bengaluru, Karnataka, India (Hybrid)

Bengaluru, Karnataka, India (Hybrid)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug