Solutions Architect, Generative AI

3 Months ago • 5 Years + • Devops • $148,000 PA - $235,750 PA

Job Summary

Job Description

NVIDIA seeks a Solutions Architect, Generative AI, to develop end-to-end Generative AI solutions for enterprise clients. Responsibilities include leveraging NVIDIA AI SDKs and APIs, designing GPU-accelerated pipelines, optimizing compute resource utilization, and building solutions using ML/DL technologies (language and multimodal models, information retrieval, etc.). The role involves creating reference architectures, improving NVIDIA products, addressing scaling challenges, and sharing expertise through training and product contributions. The ideal candidate will have strong experience in Deep Learning, large-scale Gen AI applications, and cluster orchestration tools.
Must have:
  • 5+ years Deep Learning/ML experience
  • Strong coding skills (Python, C/C++, Bash, Linux)
  • Gen AI application development (information retrieval, model optimization)
  • Experience with Docker, Kubernetes, SLURM
  • AI workload optimization (Ethernet, InfiniBand)
Good to have:
  • NVIDIA AI products (NIM, Nemo Retriever, Nemo Microservices, Nemo Framework)
  • NVIDIA Spectrum-X expertise
  • NVIDIA Collective Communication Library (NCCL) experience
Perks:
  • Equity
  • Benefits

Job Details

Do you want to be part of the team that brings Artificial Intelligence (AI) technology to the field? We are looking for a Solution Architect or Data Scientist to join the NVIDIA AI Enterprise (NVAIE) SA Segment team. We specialize on the newest technology and advances in Machine Learning, Deep Learning, Generative AI, and Cloud. The vision of the NVAIE Segment team is to use our deep expertise to guide and enable the successful adoption at data center scale of NVIDIA AI Enterprise Software!

If you are passionate about Generative AI and how it can be applied to solve real-world problems, we should talk. NVIDIA is the world leader in GPU accelerated computing and AI, and is looking for developers like you to design and build enterprise AI solutions using our newest technology. As a member of the NVAIE Segment Solution Architecture team, you will work closely with customers and partners to tackle hard problems in customizing and deploying Generative AI workloads in production at scale.

What you’ll be doing:

  • A huge part of our work involves developing end-to-end Generative AI solutions for enterprise use cases. We help customers adopt NVIDIA AI SDKs and APIs by offering deep technical expertise and designing GPU-accelerated pipelines that optimize compute resource utilization and improve workload performance.

  • We solve customer problems by building solutions using Machine Learning and Deep Learning technology including language and multimodal models, information retrieval, domain customization, reasoning, inferencing, agentic systems, and other sophisticated Generative AI workloads.

  • As we work with customers across multiple industries, we build the reference architectures needed to deploy and optimize workloads at large scale. With this knowledge, we help improve NVIDIA products and build creative solutions to overcome scaling challenges.

  • We contribute to the wider organization and community by sharing our expert knowledge with others. This can vary from product engineering contributions to building and delivering hands-on training.

Above all, you will be part of the team that helps bring NVIDIA technology to life in the Enterprise! We empower you and give you the tools to achieve this with the backing of all of NVIDIA, including other Solution Architects, Product, Engineering and Research teams. You’ll get to be the face and trusted expert advisor that our customers and partners rely on.

What we need to see:

  • Strong foundational expertise, from a BS, MS, or Ph.D. degree in Engineering, Mathematics, Physics, Computer Science, Data Science, or similar (or equivalent experience).

  • 5+ years experience demonstrating an established track record in Deep Learning and Machine Learning; experience with GPUs as well as expertise in using deep learning frameworks such as TensorFlow or PyTorch.

  • Strong coding development and debugging skills. Including experience with Python, C/C++, Bash, and Linux.

  • Real-world development of large scale Gen AI applications, including but not limited to information retrieval, model pre-training and post-training, model and pipeline evaluation, inference optimization, guard-railing, agents, and reasoning systems.

  • Demonstrated experience with cluster orchestration tools including Docker, Kubernetes, and SLURM across cloud service providers and on premise.

  • Demonstrated expertise in optimizing AI training and inference workloads over high-performance networks, including both Ethernet and InfiniBand fabrics.

  • Ability to learn fast and quickly adapt to change.

  • Clear written and oral communications skills with the ability to effectively collaborate with executives and engineering teams.

Ways to stand out from the crowd:

  • Proven expertise and hands-on experience with NVIDIA AI products including NIM, Nemo Retriever, Nemo Microservices, and Nemo Framework.

  • Expertise on NVIDIA Spectrum-X.

  • Experience with NVIDIA Collective Communication Library (NCCL).

  • Extensive engineering and customer experience on projects with multiple collaborators.

  • Show willingness and ability to dig into unfamiliar territories to solve complex problems relying on experience from previous work.

The base salary range is 148,000 USD - 235,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

bytedance - Software Engineer, Real Time Communication

bytedance

Singapore (On-Site)
9 Months ago
Capgemini - Oracle HCM Cloud Fusion Consultant

Capgemini

India (On-Site)
2 Months ago
Flow - Senior/Staff Platform Engineer/SRE

Flow

Palo Alto, California, United States (Hybrid)
5 Months ago
InMobiInMobi - SDE III - Devops

InMobiInMobi

Bengaluru, Karnataka, India (On-Site)
10 Months ago
2K - Principal Product Manager

2K

Montreal, Quebec, Canada (On-Site)
2 Months ago
Veeam Software - Senior Manager, APJ Cloud and Service Provider

Veeam Software

Singapore, Singapore (On-Site)
3 Months ago
AiDash - Software Development Engineer - III (DevOps)

AiDash

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Cognite - Senior Solution Architect

Cognite

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (Remote)
2 Months ago
Power Integrations - Systems & Infrastructure Applications Engineer

Power Integrations

Pasig, Metro Manila, Philippines (On-Site)
10 Months ago
Postman - Corporate Solutions Engineer - DACH

Postman

Germany (Remote)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Google - Software Engineer III, Full Stack, Google Ads

Google

New York, New York, United States (On-Site)
3 Months ago
NCR Voyix - Software Engineer IV - Java

NCR Voyix

Hyderabad, Telangana, India (On-Site)
2 Months ago
Rackspace Technology - Principal MLOps Engineer

Rackspace Technology

Toronto, Ontario, Canada (Remote)
4 Months ago
Snlo studios - Financial Controller

Snlo studios

San Francisco, California, United States (Remote)
2 Months ago
Next Level Business Services - SFDC Senior  Developer

Next Level Business Services

Parsippany-Troy Hills, New Jersey, United States (On-Site)
9 Months ago
supercell - Senior Software Engineer

supercell

Helsinki, Uusimaa, Finland (On-Site)
2 Months ago
Rippling - Developer Support Specialist

Rippling

United States (Remote)
3 Months ago
endava - Senior Cloud Operations Engineer - AWS

endava

Iași, Iași County, Romania (On-Site)
1 Month ago
Fox Factory - Assembler I

Fox Factory

Jasper, Indiana, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Santa Clara, California, United States

C3 IoT - Senior Director, Strategic Solutions - DHS and Federal Law Enforcement

C3 IoT

Tysons, Virginia, United States (On-Site)
3 Weeks ago
Redhorse Corp - Mid-Level Data Engineer

Redhorse Corp

Falls Church, Virginia, United States (On-Site)
1 Month ago
Hasbro - Sr. Manager, Accounting Controls & Compliance

Hasbro

Pawtucket, Rhode Island, United States (Hybrid)
3 Weeks ago
Marvell - Sr. Engineer, Digital IC Design

Marvell

Santa Clara, California, United States (On-Site)
2 Months ago
Demandbase - Senior Accounts Receivable Manager

Demandbase

United States (Remote)
2 Months ago
Dentsu - Senior Manager, Media Activation

Dentsu

New York, United States (Remote)
2 Months ago
AI Fund - Enterprise Sales Director - US Western Region

AI Fund

United States (On-Site)
9 Months ago
Next Level Business Services - Technical Lead (ASP.NET / Site core)

Next Level Business Services

Philadelphia, Pennsylvania, United States (On-Site)
9 Months ago
Activision - Senior Staff Software Engineer (Data)

Activision

San Francisco, California, United States (On-Site)
1 Month ago
Glean - Enterprise Account Executive

Glean

Palo Alto, California, United States (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Devops Jobs

bytedance - Software Engineer Intern (AIGC Platform - Monetization GenAI)

bytedance

San Jose, California, United States (On-Site)
3 Months ago
bytedance - Site Reliability Engineer (Cloud) - Infrastructure Engineering

bytedance

Singapore (On-Site)
9 Months ago
extreme network - Staff Software Applications Engineer - CloudOps/DevOps - Linux-Kubernetes-AWS/Azure

extreme network

Bengaluru, Karnataka, India (Hybrid)
1 Month ago
endava - Solution Architect

endava

Cluj-Napoca, Cluj County, Romania (On-Site)
1 Month ago
bytedance - GPU/AI Application Platform Engineer Intern (Server Platform)

bytedance

San Jose, California, United States (On-Site)
5 Months ago
bytedance - Software Engineer Intern (On-Device AI - Intelligent Creation-AI Platform)

bytedance

San Jose, California, United States (On-Site)
3 Months ago
Ajmera Infotech - CI/CD Pipeline Engineer

Ajmera Infotech

Ahmedabad, Gujarat, India (On-Site)
3 Weeks ago
GoTo Group - Lead Software Engineer - Engineering Platform

GoTo Group

Gurugram, Haryana, India (On-Site)
8 Months ago
Ansys - Software Developer Cloud Node.js

Ansys

Canonsburg, Pennsylvania, United States (On-Site)
2 Months ago
Roblox - Senior Frontend Software Engineer, Open Platform & AI Enablement

Roblox

San Mateo, California, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

Taipei City, Taiwan (On-Site)

Beijing, Beijing, China (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Bengaluru, Karnataka, India (Hybrid)

Yokne'am Illit, North District, Israel (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Dubai, Dubai, United Arab Emirates (On-Site)

Beijing, Beijing, China (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug