
Solutions Architect, Generative AI Agents and Data Processing

1 Month ago • 5 Years + • Artificial Intelligence • $148,000 PA - $235,750 PA

Job Summary

Job Description

This role involves developing end-to-end machine learning and deep learning solutions for enterprise use cases, focusing on generative AI agents and data processing. The Solutions Architect will leverage NVIDIA's AI Enterprise software (Morpheus, RAPIDS, NeMo, and NIM) to design GPU-accelerated data pipelines, optimize compute resource utilization, and improve workload performance. Responsibilities include working with customers to adopt NVIDIA AI SDKs and APIs, providing feedback for software improvement, educating teams on NVIDIA AI software, and developing and optimizing agent-based systems for complex query responses. The position requires experience with large-scale data streaming, machine learning, distributed computing, and vector databases.
Must have:
  • Deep Learning & Machine Learning expertise
  • Strong software engineering skills (Python, C/C++, Linux)
  • Experience with deep learning frameworks (TensorFlow, PyTorch)
  • Agentic RAG system development experience
  • Vector database and advanced indexing knowledge
Good to have:
  • Experience with NVIDIA AI Enterprise Software
  • DevOps/MLOps expertise (Kubernetes, Docker)
  • Experience with multi-modal datasets
  • Cloud environment experience (AWS, Azure, GCP)
  • Data pipeline building for multimodal models
Perks:
  • Equity
  • Benefits

Job Details

Do you want to be part of the team that brings Artificial Intelligence (AI) emerging technology to the field? We are looking for Solution Architects to join the NVIDIA AI Enterprise (NVAIE) SA Segment Team. We specialize in the newest technology and advances in Machine Learning, Deep Learning, Accelerated Data Analytics and Cloud. The vision of the NVAIE Segment team is to use our deep expertise to guide and enable the successful adoption at scale of NVIDIA AI Enterprise Software in production!

The Gen AI Data Processing team's mission is to deliver innovative and efficient solutions that help enterprises lower costs through the use of our GPU data processing capabilities. We showcase the power of GPU processing through our knowledge of large-scale data streaming with Morpheus, machine learning with RAPIDS, and distributed computing through Dask and Spark-RAPIDS. We enable accelerated data extraction and curation tools, including NV-Ingest and NeMo Data Curator, to ensure the highest quality data for retrieval and generation using AI models. We use these tools to enhance performance in the context of LLM training data preparation and Retrieval-Augmented Generation (RAG) pipelines. With high-quality datasets and Gen AI models, we also develop and optimize agent-based systems to integrate data from multiple sources and deliver accurate responses for complex queries.

What you’ll be doing:

A huge part of our work involves developing end-to-end Machine Learning and Deep Learning solutions for enterprise use cases. We help customers adopt NVIDIA AI SDKs and APIs by offering deep technical expertise and designing GPU-accelerated data processing pipelines that optimize compute resource utilization and improve workload performance for customers and partners. We provide feedback from these first-time implementations to improve our software products and scale knowledge by educating vertical teams and building communities on NVIDIA AI software products!

What we need to see:

  • Strong foundational expertise, from a BS, MS, or Ph.D. degree in Engineering, Mathematics, Physics, Computer Science, Data Science, or similar (or equivalent experience).

  • 5+ years of experience demonstrating an established track record in Deep Learning and Machine Learning. Strong software engineering and debugging skills, including experience with Python, C/C++, and Linux. Experience with GPUs as well as expertise in using deep learning frameworks such as TensorFlow or PyTorch.

  • Real-world development of agentic RAG systems, built with frameworks such as LangGraph, LlamaIndex, or CrewAI.

  • Strong background with vector databases (e.g., Pinecone, FAISS, or Milvus) and advanced indexing techniques, including k-nearest neighbors (KNN) and approximate nearest neighbor (ANN) search, to efficiently manage and query high-dimensional data.

  • Ability to multitask effectively in a dynamic environment, as well as clear written and oral communication skills with the ability to collaborate effectively with executives and engineering teams.

Ways to stand out from the crowd:

  • Hands-on experience with NVIDIA AI Enterprise Software (Morpheus, RAPIDS, NeMo and NIM) and AI infrastructure, including storage and networking (InfiniBand or Ethernet) knowledge. Expertise in DevOps/MLOps including Kubernetes, Docker, Helm charts, Jupyter notebooks.

  • Proven experience in curating, collecting, and preprocessing large-scale multi-modal datasets using SOTA models and techniques.

  • Experience with building and taking AI applications into production on cloud environments (e.g., AWS, Azure, GCP) and on-premises infrastructure.

  • Proven ability to build data preparation pipelines for multimodal models, including benchmarking, profiling, and optimization of innovative algorithms.

  • Extremely motivated, highly passionate, and curious about new technologies.

The base salary range is 148,000 USD - 235,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

