Jobs Courses Resources Companies Placements

Home >

Jobs >

NIM Solution Architect

NVIDIA

Shanghai, China (On-site)

NIM Solution Architect

2 Months ago • 3 Years +

Job Summary

Job Description

As a NIM Solution Architect at NVIDIA, you will drive the implementation and deployment of NVIDIA Inference Microservice (NIM) solutions. Responsibilities include using NIM Factory Pipeline to package optimized models into containers, refining NIM tools for the community, designing agentic AI solutions using NIMs, delivering technical projects and demos, providing client support, collaborating with cross-functional teams, and championing NVIDIA software within the technical community. You'll also support the NVAIE team and contribute to their business in China. This role requires expertise in deploying and optimizing large language models, proficiency in inference frameworks (TensorRT, ONNX Runtime, PyTorch), strong Python/C++ programming, and familiarity with DevOps/MLOps practices.

Must have:

3+ years experience
LLM deployment & optimization
Inference framework proficiency (TensorRT, etc.)
Python/C++ programming skills
DevOps/MLOps experience
Problem-solving & troubleshooting skills

Good to have:

Experience with field LLM projects
TensorRT expertise
AI workflow design experience
Cluster resource management tools
Agile methodologies
CUDA optimization experience
Large-scale HPC/enterprise system design

10 skills required

10 skills required for this role

Add these skills to join the top 1% applicants for this job

ci-cd

github

containers

python

docker

pytorch

git

cuda

agile-development

cross-functional

Job Details

NVIDIA is leading company of AI computing. At NVIDIA, our employees are passionate about AI, HPC , VISUAL, GAMING. Our Solution Architect team is more focusing to bring NVIDIA new technology into difference industries. We help to design the architecture of AI computing platform, analysis the AI and HPC applications to deliver our value to customers. This role will be instrumental in leveraging NVIDIA's cutting-edge technologies to optimize open-source and proprietary large models, create AI workflows, and support our customers in implementing advanced AI solutions.

What you’ll be doing:

Drive the implementation and deployment of NVIDIA Inference Microservice (NIM) solutions
Use NVIDIA NIM Factory Pipeline to package optimized models (including LLM, VLM, Retriever, CV, OCR, etc.) into containers providing standardized API access for on-prem or cloud deployment
Refine NIM tools for the community, help the community to build their performant NIMs
Design and implement agentic AI tailored to customer business scenarios using NIMs
Deliver technical projects, demos and client support tasks as directed by the Solution Architecture Leadership
Provide technical support and guidance to customers, facilitating the adoption and implementation of NVIDIA technologies and products
Collaborate with cross-functional teams to enhance and expand our AI solutions portfolio
Be an internal champion for NVIDIA software and total solutions in technical community
Be an industry thought leader on integrating NVIDIA technology especially inference services into LHA, business partners and whole community
Assist in supporting NVAIE team and driving NVAIE business in China

What we need to see:

3+ years working experience with Bachelor's or Master's degree in Computer Science, Artificial Intelligence, or a related field
Proven experience in deploying and optimizing large language models
Proficiency in at least one inference framework (e.g., TensorRT, ONNX Runtime, PyTorch)
Strong programming skills in Python or C++
Familiarity with main stream inference engines (e.g., vLLM, SGLang)
Experience with DevOps/MLOps such as Docker, Git, and CI/CD practices
Excellent problem-solving skills and ability to troubleshoot complex technical issues
Demonstrated ability to collaborate effectively across diverse, global teams, adapting communication styles while maintaining clear, constructive professional interactions

Ways to stand out from the crowd:

Experience in architectural design for field LLM projects
Expertise in model optimization techniques, particularly using TensorRT
Knowledge of AI workflow design and implementation, experience on cluster resource management tools. Familiarity with agile development methodologies
CUDA optimization experience, extensive experience designing and deploying large scale HPC and enterprise computing systems

Similar Jobs

Staff Software Engineer - Infrastructure Reliability

Riot Games

Los Angeles, California, United States (On-Site)

• 2 Months ago

Staff Technical Architect

Seedify

(Remote)

• 10 Months ago

Senior Machine Learning, AI Engineer

Tesla

Brandenburg, Germany (On-Site)

• 4 Months ago

Senior Engineer, iOS

Alphasense

Helsinki, Uusimaa, Finland (Hybrid)

• 1 Month ago

Controls Software Engineer II

Fluence

Houston, Texas, United States (Hybrid)

• 8 Months ago

Research Engineer Intern

ByteDance

Seattle, Washington, United States (On-Site)

• 2 Months ago

Engineering Manager, AI Developer Technology

NVIDIA

Austin, Texas, United States (On-Site)

• 3 Months ago

ML/AI Engineer

Lucid Reality Labs

Poland (Remote)

• 3 Months ago

Senior AI Data Scientist

Hitachi

Chennai, Tamil Nadu, India (On-Site)

• 8 Months ago

Research Scientist, Deep Learning and Computer Vision

NVIDIA

Hsinchu, Hsinchu City, Taiwan (On-Site)

• 4 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Senior AQA Engineer (Python + Robot)

N-iX

Colombia (Remote)

• 2 Months ago

Lead Software Engineer, Machine Learning - Ad Platforms

The Walt Disney Company

California, United States (On-Site)

• 2 Months ago

Staff Software Engineer - Fullstack

Super

United States (Remote)

• 7 Months ago

Senior Software Developer

Barracuda Networks Inc

Ottawa, Ontario, Canada (Hybrid)

• 4 Months ago

Build Engineer

G- space studios

(Remote)

• 1 Month ago

Software Engineer II

Telastra

Bengaluru, Karnataka, India (On-Site)

• 1 Month ago

Server-Side Engineer (New Title)

Colo pl

Minato City, Tokyo, Japan (On-Site)

• 12 Months ago

Mobile Architect

Capgemini

Mumbai, Maharashtra, India (On-Site)

• 1 Month ago

Cloud Engineer Kubernetes

ION

Rome, Lazio, Italy (Hybrid)

• 8 Months ago

Cloud Engineer (Azure)

Zazz

(Remote)

• 4 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Shanghai, Shanghai, China

Research Associate, Climate

World Resource Institute

Beijing, China (On-Site)

• 1 Month ago

Consultant

Marsh McLennan

Shanghai, China (Hybrid)

• 1 Month ago

Sr. Account Manager, MRT

Haleon

Chengdu, Sichuan, China (On-Site)

• 1 Month ago

Software Test Developer Intern - Spark Rapids, Big Data & Deep Learning - 2025

NVIDIA

Shanghai, Shanghai, China (On-Site)

• 2 Months ago

Senior Environment Artist

Tencent

Shanghai, Shanghai, China (On-Site)

• 4 Months ago

Senior Backend Engineer - China

Thatgamecompany

Shanghai, Shanghai, China (On-Site)

• 3 Months ago

Manufacturing Engineer II

Nordson Corporation

Shanghai, China (On-Site)

• 1 Month ago

Marketing Project Manager - China

Thatgamecompany

Shanghai, Shanghai, China (On-Site)

• 3 Months ago

Senior AI Training Performance Engineer

NVIDIA

Shanghai, Shanghai, China (Hybrid)

• 5 Months ago

Account Strategist, Mid-Market Sales

Google

Guangdong Province, China (On-Site)

• 2 Months ago

Get notifed when new similar jobs are uploaded

Similar Category Jobs

Senior Account Manager

Interface AI

United States (Remote)

• 4 Months ago

AI Strategy Lead

A-Team

New York, New York, United States (Hybrid)

• 3 Months ago

Software Engineer III, AI/ML, Google Cloud AI

Google

Sunnyvale, California, United States (On-Site)

• 2 Months ago

Solutions Architect, Financial Services

NVIDIA

New Jersey, United States (Remote)

• 2 Months ago

Senior Computer Architect - Deep Learning

NVIDIA

Santa Clara, California, United States (On-Site)

• 5 Months ago

Visiting Senior Research Scientist

LLM Software Engineer/Researcher (Applied Machine Learning)

ByteDance

Seattle, Washington, United States (On-Site)

• 3 Months ago

Research Scientist, Multimodal Interaction & World Model - 2025 Start

ByteDance

Singapore (On-Site)

• 7 Months ago

AI Computing Software Development Engineer, TensorRT

NVIDIA

Hsinchu, Hsinchu City, Taiwan (On-Site)

• 4 Months ago

Conversational AI Consultant

Google

Gurugram, Haryana, India (On-Site)

• 2 Months ago

Get notifed when new similar jobs are uploaded

About The Company

NVIDIA

400 Active Jobs

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

A global community of game builders. Helping people upskill and land jobs in the best gaming studios.

Company

Key Links

hello@outscal.com

Made in INDIA 💛💙

NIM Solution Architect

Job Summary

Job Description

10 skills required

10 skills required for this role

Job Details

Similar Jobs

Staff Software Engineer - Infrastructure Reliability

Staff Technical Architect

Senior Machine Learning, AI Engineer

Senior Engineer, iOS

Controls Software Engineer II

Research Engineer Intern

Engineering Manager, AI Developer Technology

ML/AI Engineer

Senior AI Data Scientist

Research Scientist, Deep Learning and Computer Vision

Similar Skill Jobs

Senior AQA Engineer (Python + Robot)

Lead Software Engineer, Machine Learning - Ad Platforms

Staff Software Engineer - Fullstack

Senior Software Developer

Build Engineer

Software Engineer II

Server-Side Engineer (New Title)

Mobile Architect

Cloud Engineer Kubernetes

Cloud Engineer (Azure)

Jobs in Shanghai, Shanghai, China

Research Associate, Climate

Consultant

Sr. Account Manager, MRT

Software Test Developer Intern - Spark Rapids, Big Data & Deep Learning - 2025

Senior Environment Artist

Senior Backend Engineer - China

Manufacturing Engineer II

Marketing Project Manager - China

Senior AI Training Performance Engineer

Account Strategist, Mid-Market Sales

Similar Category Jobs

Senior Account Manager

AI Strategy Lead

Software Engineer III, AI/ML, Google Cloud AI

Solutions Architect, Financial Services

Senior Computer Architect - Deep Learning

Visiting Senior Research Scientist

LLM Software Engineer/Researcher (Applied Machine Learning)

Research Scientist, Multimodal Interaction & World Model - 2025 Start

AI Computing Software Development Engineer, TensorRT

Conversational AI Consultant

About The Company

Solutions Architect, Generative AI

VLSI Physical Design Engineer - New College Grad 2025

Senior Software Engineer, ASIC Verification Tools

Senior ASIC Full Chip Verification Engineer

Principal Engineer - Enterprise Applications

Senior Business System Architect, AI and ML

Senior Product Security Engineer

System Design Power Validation Engineer

OEM Account Manager

System Debug Lead Engineer

Level Up Your Career in Game Development!