Solutions Architect, Generative AI Agents and Data Processing

1 Month ago • 5 Years + • Artificial Intelligence • $148,000 PA - $235,750 PA

Job Summary

Job Description

NVIDIA seeks a Solutions Architect to join its AI Enterprise team, specializing in Generative AI and data processing. The role involves developing end-to-end ML/DL solutions for enterprise clients using NVIDIA's AI SDKs and APIs, designing GPU-accelerated pipelines, optimizing resource utilization, and improving workload performance. Responsibilities include offering deep technical expertise, providing feedback for software improvement, educating vertical teams, and building communities around NVIDIA AI products. The team focuses on large-scale data streaming with Morpheus, machine learning with RAPIDS, and distributed computing with Dask and Spark-RAPIDS, leveraging tools like NV-Ingest and NeMo Data Curator for data preparation. Experience with agent-based systems for data integration and RAG pipelines is crucial.
Must have:
  • BS/MS/PhD in relevant field
  • 5+ years experience in ML/DL
  • Strong software engineering skills (Python, C/C++, Linux)
  • Deep learning frameworks (TensorFlow/PyTorch)
  • Agentic RAG system development experience
  • Vector database expertise (Pinecone, FAISS, Milvus)
  • Excellent communication and collaboration skills
Good to have:
  • Experience with NVIDIA AI Enterprise Software (Morpheus, RAPIDS, NeMo, NIM)
  • AI infrastructure knowledge (storage, networking)
  • DevOps/MLOps expertise (Kubernetes, Docker, Helm)
  • Experience with large-scale multi-modal datasets
  • Cloud deployment experience (AWS, Azure, GCP)
  • Data pipeline building and optimization for multimodal models
Perks:
  • Equity
  • Benefits

Job Details

Do you want to be part of the team that brings emerging Artificial Intelligence (AI) technology to the field? We are looking for Solutions Architects to join the NVIDIA AI Enterprise (NVAIE) SA Segment Team. We specialize in the newest technology and advances in Machine Learning, Deep Learning, Accelerated Data Analytics, and Cloud. The vision of the NVAIE Segment team is to use our deep expertise to guide and enable the successful adoption at scale of NVIDIA AI Enterprise Software in production!

The Gen AI Data Processing team's mission is to deliver innovative and efficient solutions that help enterprises lower costs through the use of our GPU data processing capabilities. We showcase the power of GPU processing through our knowledge of large-scale data streaming with Morpheus, machine learning with RAPIDS, and distributed computing through Dask and Spark-RAPIDS. We enable accelerated data extraction and curation tools, including NV-Ingest and NeMo Data Curator, to ensure the highest quality data for retrieval and generation using AI models. We use these tools to enhance performance in the context of LLM training data preparation and Retrieval-Augmented Generation (RAG) pipelines. With high-quality datasets and Gen AI models, we also develop and optimize agent-based systems to integrate data from multiple sources and deliver accurate responses for complex queries.

What you’ll be doing:

A huge part of our work involves developing end-to-end Machine Learning and Deep Learning solutions for enterprise use cases. We help customers adopt NVIDIA AI SDKs and APIs by offering deep technical expertise and designing GPU-accelerated data processing pipelines that optimize compute resource utilization and improve workload performance for customers and partners. We feed what we learn from these first-time implementations back into our software products, and we scale knowledge by educating vertical teams and building communities around NVIDIA AI software products!

What we need to see:

  • Strong foundational expertise, from a BS, MS, or Ph.D. degree in Engineering, Mathematics, Physics, Computer Science, Data Science, or similar (or equivalent experience).

  • 5+ years of experience demonstrating an established track record in Deep Learning and Machine Learning. Strong software engineering and debugging skills, including experience with Python, C/C++, and Linux. Experience with GPUs as well as expertise in using deep learning frameworks such as TensorFlow or PyTorch.

  • Real-world development of agentic RAG systems, built with frameworks such as LangGraph, LlamaIndex, CrewAI, etc.

  • Strong background with vector databases (e.g., Pinecone, FAISS, or Milvus) and advanced indexing techniques, including k-nearest neighbors (KNN) and approximate nearest neighbor (ANN) search, to efficiently manage and query high-dimensional data.

  • Ability to multitask effectively in a dynamic environment, as well as clear written and oral communication skills and the ability to collaborate effectively with executives and engineering teams.

Ways to stand out from the crowd:

  • Hands-on experience with NVIDIA AI Enterprise Software (Morpheus, RAPIDS, NeMo and NIM) and AI infrastructure, including storage and networking (InfiniBand or Ethernet) knowledge. Expertise in DevOps/MLOps including Kubernetes, Docker, Helm charts, Jupyter notebooks.

  • Proven experience in curating, collecting, and preprocessing large-scale multi-modal datasets using SOTA models and techniques.

  • Experience with building and taking AI applications into production on cloud environments (e.g., AWS, Azure, GCP) and on-premises infrastructure.

  • Proven ability to build data preparation pipelines for multimodal models, including benchmarking, profiling, and optimization of innovative algorithms.

  • Extremely motivated, highly passionate, and curious about new technologies.

The base salary range is 148,000 USD - 235,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.