Principal DGX Cloud Machine Learning Architect

1 Month ago • 15+ Years • Artificial Intelligence • $272,000 PA - $425,500 PA

Job Summary

Job Description

NVIDIA seeks a Principal DGX Cloud Machine Learning Architect to optimize generative AI models (VLMs, WFMs, CV models) for DGX Cloud customers. This role focuses on advancing Physical AI, improving model performance and fidelity for applications like classification, auto-labeling, and autonomous vehicle policies. Responsibilities include developing and optimizing generative AI models using NVIDIA's AI software stack, designing evaluation methodologies, collaborating with cross-functional teams, analyzing and improving model performance, and architecting scalable software platforms for seamless user experiences. The ideal candidate will possess extensive experience in deep learning, model engineering, and cloud AI deployment, with a strong background in Python, PyTorch, and Hugging Face.
Must have:
  • 15+ years in deep learning/AI model development
  • Proficiency in model engineering (data curation, fine-tuning)
  • Strong software design & debugging skills
  • Advanced Python & PyTorch expertise
  • Deep understanding of AI system architectures
  • Cloud AI experience (AWS, Azure, GCP)
Good to have:
  • Industry impact in robotics or autonomous systems
  • Open-source contributions to AI frameworks
  • Published research in generative AI
  • Cross-disciplinary team leadership
  • GPU acceleration and performance optimization expertise
Perks:
  • Competitive salary
  • Generous benefits package

Job Details

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.

NVIDIA DGX™ Cloud is a scalable, end-to-end AI platform, co-engineered with leading cloud service providers (CSPs) to deliver groundbreaking performance for developers. We are seeking a Principal DGX Cloud Machine Learning Architect to optimize generative AI models, including Vision Language Models (VLMs), World Foundation Models (WFMs), and Computer Vision (CV) models, for DGX Cloud customers. This role is crucial for advancing Physical AI, the next frontier in AI. Your expertise will improve model performance and fidelity for applications like classification, auto-labeling, and robotic/autonomous vehicle policies. You will help DGX Cloud customers achieve outstanding results at scale. You will work across the entire machine learning stack, from high-level frameworks like PyTorch and Hugging Face to model evaluation, data curation, fine-tuning, and re-architecture. This is an outstanding opportunity for an innovative ML engineer at the intersection of research and engineering to drive impact in a high-performance AI environment.

What You’ll Be Doing:

  • Develop and optimize innovative generative AI models such as VLMs and WFMs using NVIDIA’s AI software stack for Physical AI applications.

  • Design and implement sophisticated evaluation methodologies and automation techniques to streamline model assessments.

  • Collaborate with world-class teams across NVIDIA to refine and improve foundation models for AI-powered solutions.

  • Analyze, profile, and improve model performance to drive efficiency, scalability, and precision.

  • Architect scalable, modular software platforms that improve AI adoption by DGX Cloud customers, ensuring seamless user experiences and broad model support.

What We Need to See:

  • Master’s or PhD in Computer Science, AI, Applied Math, or a related field (or equivalent experience).

  • 15+ years of demonstrated ability in deep learning, AI model development, or research.

  • Proficiency in model engineering, including data curation, fine-tuning, and evaluation.

  • Strong software design and debugging skills, with expertise in performance analysis and test design.

  • Advanced Python and PyTorch proficiency, with experience in ML tools such as Hugging Face.

  • Deep understanding of algorithms, programming fundamentals, and AI system architectures.

  • Excellent written and verbal communication skills, with the ability to work independently and collaboratively in a multifaceted environment.

  • Cloud AI Expertise: Hands-on experience optimizing and deploying AI systems at scale on major cloud platforms (AWS, Azure, GCP), focusing on performance and cost-efficiency.

  • Automation & Scalability: Proven track record of building automated evaluation methodologies and scalable data curation pipelines to continuously boost model performance.

Ways to Stand Out from the Crowd:

  • Industry Impact: A consistent record of transitioning groundbreaking AI research into production, particularly in robotics or autonomous systems.

  • Open-Source Leadership: Active contributions to key AI frameworks (e.g., PyTorch, Hugging Face) or other significant open-source projects.

  • Thought Leadership: Published research, patents, or recognition in generative AI, Physical AI, or related fields that highlight your ability to drive innovation.

  • Cross-Disciplinary Leadership: Experience leading multi-functional teams (engineering, research, product) to deliver coordinated AI solutions.

  • Performance Optimization Mastery: Expertise in GPU acceleration, hardware/software co-design, and deep performance optimization for AI workloads.

With competitive salaries and a generous benefits package (www.nvidiabenefits.com), we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to outstanding growth, our best-in-class engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you!

The base salary range is 272,000 USD - 425,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.


Santa Clara, California, United States (On-Site)