Machine Learning Software Platform Architect

2 Months ago • 5 Years + • Artificial Intelligence

Job Summary

Job Description

NVIDIA seeks a Machine Learning Software Platform Architect to design, develop, and maintain infrastructure for LLM-based applications in chip design. Responsibilities include managing LLMs, developing applications (like QA bots and code generators), collaborating with hardware designers and LLM researchers, optimizing infrastructure for performance and scalability, and ensuring data security. The ideal candidate possesses 5+ years of experience in AI/ML infrastructure, strong Python and web development skills, and a deep understanding of chip design and LLM techniques.
Must have:
  • 5+ years experience in AI/ML infrastructure (LLMs preferred)
  • Proficiency in Python and web development
  • Understanding of chip design & computational challenges
  • Experience with data management and secure storage
  • Excellent problem-solving skills and teamwork
Good to have:
  • Experience with microservices
  • Cloud/distributed infrastructure background
  • Familiarity with React or Vue.js
  • Understanding of SQL & NoSQL databases
Perks:
  • Highly competitive salaries
  • Comprehensive benefits package

Job Details

Widely considered to be one of the technology world’s most desirable employers, NVIDIA is an industry leader with groundbreaking developments in High-Performance Computing, Artificial Intelligence and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery and powers what were once science fiction inventions from artificial intelligence to autonomous cars.

NVIDIA is seeking a highly skilled and experienced Large Language Model (LLM) application infrastructure engineer to join our growing team. Working at the intersection of GPU chip design and AI, you will be responsible for the design, development, and maintenance of the infrastructure around NVIDIA's internal large language model, which is aimed at facilitating chip design.

What you'll be doing:

  • Develop and maintain the infrastructure for managing large language model (LLM) based applications specifically adapted for the chip design and hardware domain.

  • Develop and maintain LLM-based applications that serve hardware engineers, such as an LLM-based QA bot and a code generator.

  • Collaborate with hardware chip designers and LLM research teams to understand the specific needs and challenges of GPU design and ensure the LLM infrastructure is well suited to these needs.

  • Collaborate with LLM research teams to collect and organize training and fine-tuning data for hardware-specific language models.

  • Optimize the infrastructure for performance, scalability, and reliability, and ensure the secure and efficient management of data.

  • Stay updated with the latest industry trends in AI and machine learning, and continuously look for opportunities to apply these advancements to improve the LLM infrastructure.

What we need to see:

  • 5+ years of work experience developing and maintaining AI or machine learning infrastructure, preferably in the context of large language models.

  • BS in computer science or a related field, or equivalent experience.

  • Strong proficiency in Python and web development, and familiarity with LLM-related techniques such as LangChain, vector databases, and prompt engineering.

  • Understanding of chip design and related computational and data challenges.

  • Experience with data management, including document cleaning, transformation, and secure storage.

  • Excellent problem-solving skills and the ability to work effectively in a team.

  • In-depth understanding of Machine Learning / Deep Learning / NLP concepts.

Ways to stand out from the crowd:

  • You have crafted and developed production-quality microservices.

  • Strong technical background in cloud/distributed infrastructure

  • Familiarity with front-end development using React or Vue.js is a strong plus.

  • Strong understanding of SQL and NoSQL data platforms.

NVIDIA offers highly competitive salaries and a comprehensive benefits package. We have some of the most forward-thinking and hardworking people in the world working for us and, due to outstanding growth, our exclusive engineering teams are rapidly growing. Are you creative and passionate about applying Machine Learning to solve remarkably interesting problems? Are you interested in being involved in state-of-the-art development in the field of AI, and do you love a challenge? If so, we want to hear from you!

#LI-Hybrid





About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

