Senior Solutions Architect, Generative AI - Inference

2 Months ago • 5 Years + • Artificial Intelligence • $148,000 PA - $287,500 PA

Job Summary

Job Description

NVIDIA seeks Senior Solutions Architects specializing in Generative AI inference to support customers building solutions using NVIDIA's AI technology. Responsibilities include partnering with internal and external teams, understanding customer needs, defining solutions, engaging with developers and researchers, collaborating on performance analysis and modeling, and assisting customers in adopting NVIDIA technology. The role requires strong experience with deep learning frameworks (PyTorch, TensorFlow), large language models, and GPU technologies. Some travel may be required.
Must have:
  • 5+ years Deep Learning experience
  • PyTorch/TensorFlow expertise
  • Strong programming & optimization skills
  • LLM and Deep Learning inference knowledge
  • Excellent communication & collaboration
Good to have:
  • NVIDIA GPU & software experience (NeMo, Triton, TensorRT)
  • C/C++ programming skills
  • Parallel programming & distributed computing
  • DL training & inference deployment experience
Perks:
  • Equity
  • Benefits

Job Details

NVIDIA is seeking outstanding AI Solutions Architects to assist and support customers that are building solutions with our newest AI technology. At NVIDIA, our solutions architects work across different teams and enjoy helping customers with the latest Accelerated Computing and Deep Learning software and hardware platforms. We're looking to grow our company, and build our teams with the smartest people in the world. Would you like to join us at the forefront of technological advancement? You will become a trusted technical advisor with our customers and work on exciting projects and proof-of-concepts focused on Generative AI and Large Language Models. You will also collaborate with a diverse set of internal teams on performance analysis and modeling of inference software. You should be comfortable working in a dynamic environment, and have experience with Generative AI, Large Language Models, Deep Learning and GPU technologies. This role is an excellent opportunity to work in an interdisciplinary team with the latest technologies at NVIDIA!


What You Will Be Doing:

  • Partnering with other solution architects, engineering, product and business teams. Understanding their strategies and technical needs and helping define high-value solutions

  • Dynamically engaging with developers, scientific researchers, data scientists, which will give you experience across a range of technical areas

  • Strategically partnering with lighthouse customers and industry-specific solution partners targeting our computing platform

  • Working closely with customers to help them adopt and build solutions using NVIDIA technology

  • Analyze performance and power efficiency of deep learning inference workloads

  • Some travel to conferences and customers may be required


What We Need To See:

  • BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, other Engineering or related fields (or equivalent experience)

  • 5+ years of hands-on experience with Deep Learning frameworks such as PyTorch and TensorFlow

  • Strong fundamentals in programming, optimizations and software design, especially in Python

  • Strong problem-solving and debugging skills

  • Excellent knowledge of theory and practice of Large Language Models and Deep Learning inference

  • Excellent presentation, communication and collaboration skills

  • Desire to be involved in multiple diverse and creative projects


Ways To Stand Out From The Crowd:

  • Experience with NVIDIA GPUs and software libraries, such as NVIDIA NeMo Framework, NVIDIA Triton Inference Server, TensorRT, TensorRT-LLM

  • Excellent C/C++ programming skills, including debugging, profiling, code optimization, performance analysis, and test design

  • Familiarity with parallel programming and distributed computing platforms

  • Prior experience with DL training at scale, deploying or optimizing DL inference in production

The base salary range is 148,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

Tencent - Research Intern (NLP)

Tencent

Palo Alto, California, United States (On-Site)
2 Months ago
Casumo - AI Engineer

Casumo

(Hybrid)
4 Weeks ago
The Walt Disney Company - Sr Machine Learning Engineer

The Walt Disney Company

Santa Monica, California, United States (On-Site)
5 Months ago
ByteDance - DevOps Engineer - Applied Machine Learning, Engine

ByteDance

San Jose, California, United States (On-Site)
2 Months ago
Meta - Research Scientist Intern, Smart Glasses in Wearables AI (PhD)

Meta

New York, New York, United States (On-Site)
5 Months ago
NVIDIA - Distinguished Engineer, AI Resiliency Lead

NVIDIA

Redmond, Washington, United States (On-Site)
2 Months ago
Microsoft - Principal Product Manager, AI

Microsoft

Redmond, Washington, United States (Hybrid)
1 Month ago
VGW - Machine Learning Engineer

VGW

Sydney, New South Wales, Australia (On-Site)
1 Month ago
Zoox - Software Engineer - Perception & Sensing

Zoox

Foster City, California, United States (Hybrid)
6 Months ago
Chiselon Technologies   - Data Scientist( ML AI )

Chiselon Technologies

Gurugram, Haryana, India (Hybrid)
5 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Granicus - Data Scientist 4

Granicus

Bengaluru, Karnataka, India (Hybrid)
6 Months ago
Amazon Games - Senior Software Developer, Amazon Games AI

Amazon Games

San Diego, California, United States (On-Site)
4 Months ago
Passive Logic - AI Control Theory & Optimization Scientist

Passive Logic

Salt Lake City, Utah, United States (On-Site)
4 Months ago
prizepicks - Engineering Manager — Data Science Engineering

prizepicks

Atlanta, Georgia, United States (Remote)
1 Month ago
Rackspace Technology - Senior Machine Learning Engineer

Rackspace Technology

Vietnam (Remote)
2 Months ago
SmileGate - Game Data Engineer

SmileGate

Seongnam-si, Gyeonggi-do, South Korea (On-Site)
1 Month ago
Ubisoft - Senior ML Data Scientist

Ubisoft

Montreal, Quebec, Canada (On-Site)
3 Months ago
NVIDIA - AI Computing Software Development Engineer, TensorRT

NVIDIA

Shanghai, Shanghai, China (On-Site)
3 Months ago
NVIDIA - Performance Engineer Intern, Deep Learning and HPC

NVIDIA

Shanghai, Shanghai, China (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Washington, United States

Next Level Business Services - SAP HANA Developer

Next Level Business Services

Charlotte, North Carolina, United States (On-Site)
6 Months ago
Match Group - Product Operations Specialist

Match Group

Palo Alto, California, United States (Hybrid)
6 Months ago
ByteDance - Research Scientist, Vision Foundation Model

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
Singularity 6 - QA Application Drop Box

Singularity 6

United States (Hybrid)
5 Months ago
ByteDance - Immersive Video Research Intern (Multimedia Streaming) 2023 Summer/Fall (BS)

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago
Hedra - Senior Full-Stack Engineer

Hedra

New York, New York, United States (On-Site)
1 Month ago
ByteDance - Senior Site Reliability Engineer - Applied Machine Learning

ByteDance

San Jose, California, United States (On-Site)
1 Month ago
ByteDance - Senior Site Reliability Engineer, ML System

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
Rackspace Technology - Inventory Control Specialist I

Rackspace Technology

Richardson, Texas, United States (On-Site)
1 Month ago
IGT - QA Technician III

IGT

West Greenwich, Rhode Island, United States (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Canva - Senior Backend Software Engineer - AI Help Platform

Canva

Sydney, New South Wales, Australia (Remote)
1 Month ago
Microsoft - Member of Technical Staff Platform Engineer

Microsoft

Mountain View, California, United States (Hybrid)
1 Month ago
NVIDIA - Deep Learning Performance Architect

NVIDIA

Hyderabad, Telangana, India (Hybrid)
2 Months ago
prizepicks - Customer Insights Internship

prizepicks

(Remote)
1 Month ago
Meta - AI Research Scientist, Language - Generative AI

Meta

Bellevue, Washington, United States (On-Site)
5 Months ago
Microsoft - Member of Technical Staff, AI Post-Training

Microsoft

London, England, United Kingdom (On-Site)
1 Month ago
My Fitness Pal - Staff Machine Learning Engineer

My Fitness Pal

United States (Remote)
3 Months ago
ByteDance - Software Development Engineer - Large Language Models, AML

ByteDance

San Jose, California, United States (On-Site)
2 Months ago
Balbix - Staff AI Engineer

Balbix

Bengaluru, Karnataka, India (On-Site)
6 Months ago
NVIDIA - Customer Program Manager

NVIDIA

Taipei City, Taiwan (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Massachusetts, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Texas, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug