Outscal Logooutscal logo

Senior Solutions Architect, Generative AI - Inference

1 Month ago • 5 Years + • Artificial Intelligence • $148,000 PA - $287,500 PA

Job Summary

Job Description

NVIDIA seeks Senior Solutions Architects specializing in Generative AI inference to support customers building solutions with their AI technology. This role involves partnering with various internal teams and external customers to define and implement high-value solutions leveraging NVIDIA's accelerated computing and deep learning platforms. Responsibilities include engaging with developers and researchers, collaborating on performance analysis, and working on proof-of-concepts focused on Generative AI and Large Language Models. The ideal candidate possesses strong deep learning expertise (PyTorch, TensorFlow), excellent communication skills, and experience with relevant NVIDIA software libraries (NeMo, Triton Inference Server, TensorRT).
Must have:
  • 5+ years Deep Learning experience
  • Proficiency in PyTorch/TensorFlow
  • Strong programming & optimization skills (Python)
  • Excellent problem-solving & debugging
  • Expertise in LLMs & Deep Learning inference
Good to have:
  • Experience with NVIDIA GPUs and software (NeMo, Triton, TensorRT)
  • C/C++ programming skills
  • Parallel programming & distributed computing experience
  • Experience with large-scale DL training and inference deployment
Perks:
  • Equity
  • Benefits

Job Details

NVIDIA is seeking outstanding AI Solutions Architects to assist and support customers that are building solutions with our newest AI technology. At NVIDIA, our solutions architects work across different teams and enjoy helping customers with the latest Accelerated Computing and Deep Learning software and hardware platforms. We're looking to grow our company, and build our teams with the smartest people in the world. Would you like to join us at the forefront of technological advancement? You will become a trusted technical advisor with our customers and work on exciting projects and proof-of-concepts focused on Generative AI and Large Language Models. You will also collaborate with a diverse set of internal teams on performance analysis and modeling of inference software. You should be comfortable working in a dynamic environment, and have experience with Generative AI, Large Language Models, Deep Learning and GPU technologies. This role is an excellent opportunity to work in an interdisciplinary team with the latest technologies at NVIDIA!


What You Will Be Doing:

  • Partnering with other solution architects, engineering, product and business teams. Understanding their strategies and technical needs and helping define high-value solutions

  • Dynamically engaging with developers, scientific researchers, data scientists, which will give you experience across a range of technical areas

  • Strategically partnering with lighthouse customers and industry-specific solution partners targeting our computing platform

  • Working closely with customers to help them adopt and build solutions using NVIDIA technology

  • Analyze performance and power efficiency of deep learning inference workloads

  • Some travel to conferences and customers may be required


What We Need To See:

  • BS, MS, or PhD in Computer Science, Electrical/Computer Engineering, Physics, Mathematics, other Engineering or related fields (or equivalent experience)

  • 5+ years of hands-on experience with Deep Learning frameworks such as PyTorch and TensorFlow

  • Strong fundamentals in programming, optimizations and software design, especially in Python

  • Strong problem-solving and debugging skills

  • Excellent knowledge of theory and practice of Large Language Models and Deep Learning inference

  • Excellent presentation, communication and collaboration skills

  • Desire to be involved in multiple diverse and creative projects


Ways To Stand Out From The Crowd:

  • Experience with NVIDIA GPUs and software libraries, such as NVIDIA NeMo Framework, NVIDIA Triton Inference Server, TensorRT, TensorRT-LLM

  • Excellent C/C++ programming skills, including debugging, profiling, code optimization, performance analysis, and test design

  • Familiarity with parallel programming and distributed computing platforms

  • Prior experience with DL training at scale, deploying or optimizing DL inference in production

The base salary range is 148,000 USD - 287,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

ByteDance - Research Scientist, Reinforcement Learning

ByteDance

Seattle, Washington, United States (On-Site)
4 Months ago
NVIDIA - Senior Technical Marketing Engineer - AI Infrastructure

NVIDIA

Santa Clara, California, United States (On-Site)
2 Months ago
ByteDance - Algorithm Engineer - Audio Understanding - Start 2025

ByteDance

Singapore (On-Site)
4 Months ago
NVIDIA - Senior Solution Engineer, Mission Control

NVIDIA

Santa Clara, California, United States (On-Site)
8 Hours ago
NVIDIA - Software Engineer Intern - Mapping and Generative AI

NVIDIA

Shanghai, Shanghai, China (On-Site)
2 Months ago
Airlab Inc  - C++ & Python Programmer

Airlab Inc

Quebec, Canada (On-Site)
11 Hours ago
Zoox - Software Engineer - Simulation Workload Orchestration

Zoox

Foster City, California, United States (Hybrid)
5 Months ago
Thumbtack - Staff Software Engineer,  Machine Learning Infrastructure

Thumbtack

Ontario, Canada (Remote)
4 Months ago
ByteDance - Student Researcher (Doubao (Seed) - Foundation Model - MultiModal Generative Model)

ByteDance

San Jose, California, United States (On-Site)
1 Day ago
Google - Software Engineer III, Core Machine Learning, Google Cloud

Google

Mountain View, California, United States (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Riot Games - Senior Data Scientist - Singapore Efficiency Team

Riot Games

Singapore (On-Site)
1 Month ago
ByteDance - Software Engineer, ML System Scheduling

ByteDance

Seattle, Washington, United States (On-Site)
4 Months ago
NVIDIA - AI Algorithms Software Engineer (RDSS Intern)

NVIDIA

Hsinchu, Hsinchu City, Taiwan (On-Site)
2 Months ago
Canva - Senior Computer Vision Engineer - Photo AI

Canva

Vienna, Vienna, Austria (Remote)
1 Month ago
Granicus - Data Scientist 4

Granicus

Bengaluru, Karnataka, India (Hybrid)
5 Months ago
Meta - Research Scientist Intern, Smart Glasses in Wearables AI (PhD)

Meta

New York, New York, United States (On-Site)
4 Months ago
ByteDance - Video Analysis and Quality Algorithm Intern 2023 Summer/Fall (MS)

ByteDance

Seattle, Washington, United States (On-Site)
4 Months ago
Thumbtack - Staff Software Engineer,  Machine Learning Infrastructure

Thumbtack

Ontario, Canada (Remote)
4 Months ago
ByteDance - Research Scientist, Foundation Model, Speech Understanding

ByteDance

San Jose, California, United States (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Texas, United States

Rackspace Technology - Enterprise Sales Executive VI

Rackspace Technology

United States (Remote)
1 Month ago
Pika - Research Scientist

Pika

Palo Alto, California, United States (On-Site)
3 Months ago
PENN Interactive - Data Analyst, Product

PENN Interactive

Philadelphia, Pennsylvania, United States (Hybrid)
5 Days ago
Activision - Senior Narrative Animator

Activision

Los Angeles, California, United States (On-Site)
3 Months ago
Google - Senior Software Developer, Site Reliability Engineering, Google Cloud

Google

Raleigh, North Carolina, United States (On-Site)
4 Months ago
Apex logic - Front-End Developer

Apex logic

United States (Remote)
4 Months ago
Samsung Semiconductor - IT Infrastructure Engineer Contractor

Samsung Semiconductor

San Jose, California, United States (Hybrid)
2 Months ago
Ello - Tech Lead, Machine Learning

Ello

San Francisco, California, United States (On-Site)
5 Days ago
NVIDIA - Senior Infrastructure Software Engineer, Deep Learning Libraries

NVIDIA

Santa Clara, California, United States (On-Site)
2 Weeks ago
Onward Search - Marketing Associate

Onward Search

Westwood, Massachusetts, United States (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Microsoft - Platform Engineering Manager

Microsoft

Redmond, Washington, United States (Hybrid)
1 Day ago
Meta - Software Engineer, Machine Learning

Meta

Fremont, California, United States (Remote)
4 Months ago
Microsoft - Principal Product Manager, AI

Microsoft

Redmond, Washington, United States (Hybrid)
22 Hours ago
DEVOTEAM - Data Driven | MLOps Engineer

DEVOTEAM

Lisbon, Lisbon, Portugal (Remote)
5 Months ago
Inkittt - Director of AI

Inkittt

San Francisco, California, United States (On-Site)
7 Months ago
NVIDIA - Senior ASIC Infrastructure Engineer

NVIDIA

Toronto, Ontario, Canada (Hybrid)
2 Weeks ago
Meetelise - Senior Research Scientist

Meetelise

(Remote)
4 Months ago
Samsung Semiconductor - Intern, Machine Learning Engineer - VLMs

Samsung Semiconductor

San Jose, California, United States (Hybrid)
2 Months ago
Virtuos - R&D Machine Learning Engineer

Virtuos

China (On-Site)
1 Day ago
NVIDIA - Director, AI Software

NVIDIA

Taipei City, Taiwan (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.


Hsinchu, Hsinchu City, Taiwan (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Seoul, South Korea (Hybrid)

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)

Ra'anana, Center District, Israel (On-Site)

Shanghai, Shanghai, China (On-Site)

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)

Be'er Sheva, South District, Israel (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug