Solutions Architect, Generative AI Agents and Data Processing

2 Months ago • 5 Years + • Artificial Intelligence • $148,000 PA - $235,750 PA

Job Summary

Job Description

This role involves developing end-to-end machine learning and deep learning solutions for enterprise use cases, focusing on generative AI agents and data processing. The Solutions Architect will leverage NVIDIA's AI Enterprise software (Morpheus, RAPIDS, NeMo, and NIM) to design GPU-accelerated data pipelines, optimize compute resource utilization, and improve workload performance. Responsibilities include working with customers to adopt NVIDIA AI SDKs and APIs, providing feedback for software improvement, educating teams on NVIDIA AI software, and developing and optimizing agent-based systems for complex query responses. The position requires experience with large-scale data streaming, machine learning, distributed computing, and vector databases.
Must have:
  • Deep Learning & Machine Learning expertise
  • Strong software engineering skills (Python, C/C++, Linux)
  • Experience with deep learning frameworks (TensorFlow, PyTorch)
  • Agentic RAG system development experience
  • Vector database and advanced indexing knowledge
Good to have:
  • Experience with NVIDIA AI Enterprise Software
  • DevOps/MLOps expertise (Kubernetes, Docker)
  • Experience with multi-modal datasets
  • Cloud environment experience (AWS, Azure, GCP)
  • Data pipeline building for multimodal models
Perks:
  • Equity
  • Benefits

Job Details

Do you want to be part of the team that brings emerging Artificial Intelligence (AI) technology to the field? We are looking for Solutions Architects to join the NVIDIA AI Enterprise (NVAIE) SA Segment Team. We specialize in the newest technology and advances in Machine Learning, Deep Learning, Accelerated Data Analytics and Cloud. The vision of the NVAIE Segment team is to use our deep expertise to guide and enable the successful adoption at scale of NVIDIA AI Enterprise Software in production!

The Gen AI Data Processing team's mission is to deliver innovative and efficient solutions that help enterprises lower costs through the use of our GPU data processing capabilities. We showcase the power of GPU processing through our knowledge of large-scale data streaming with Morpheus, machine learning with RAPIDS, and distributed computing through Dask and Spark-RAPIDS. We enable accelerated data extraction and curation tools, including NV-Ingest and NeMo Data Curator, to ensure the highest quality data for retrieval and generation using AI models. We use these tools to enhance performance in the context of LLM training data preparation and Retrieval-Augmented Generation (RAG) pipelines. With high-quality datasets and Gen AI models, we also develop and optimize agent-based systems to integrate data from multiple sources and deliver accurate responses for complex queries.

What you’ll be doing:

A huge part of our work involves developing end-to-end Machine Learning and Deep Learning solutions for enterprise use cases. We help customers adopt NVIDIA AI SDKs and APIs by offering deep technical expertise and designing GPU-accelerated data processing pipelines that optimize compute resource utilization and improve workload performance for customers and partners. We provide feedback from these first-time implementations to improve our software products and scale knowledge by educating vertical teams and building communities on NVIDIA AI software products!

What we need to see:

  • Strong foundational expertise, from a BS, MS, or Ph.D. degree in Engineering, Mathematics, Physics, Computer Science, Data Science, or similar (or equivalent experience).

  • 5+ years of experience demonstrating an established track record in Deep Learning and Machine Learning. Strong software engineering and debugging skills, including experience with Python, C/C++, and Linux. Experience with GPUs as well as expertise in using deep learning frameworks such as TensorFlow or PyTorch.

  • Real-world development of agentic RAG systems, built with frameworks such as LangGraph, LlamaIndex, CrewAI, etc.

  • Strong background with vector databases (e.g., Pinecone, FAISS, or Milvus) and advanced indexing techniques, including k-nearest neighbors (KNN) and approximate nearest neighbor (ANN) search, to efficiently manage and query high-dimensional data.

  • Ability to multitask effectively in a dynamic environment, as well as clear written and oral communication skills, with the ability to collaborate effectively with executives and engineering teams.

Ways to stand out from the crowd:

  • Hands-on experience with NVIDIA AI Enterprise Software (Morpheus, RAPIDS, NeMo and NIM) and AI infrastructure, including storage and networking (InfiniBand or Ethernet) knowledge. Expertise in DevOps/MLOps including Kubernetes, Docker, Helm charts, Jupyter notebooks.

  • Proven experience in collecting, curating, and preprocessing large-scale multi-modal datasets using state-of-the-art (SOTA) models and techniques.

  • Experience with building and taking AI applications into production on cloud environments (e.g., AWS, Azure, GCP) and on-premises infrastructure.

  • Proven ability to build data preparation pipelines for multimodal models, including benchmarking, profiling, and optimization of innovative algorithms.

  • Extremely motivated, highly passionate, and curious about new technologies.

The base salary range is 148,000 USD - 235,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.
