Machine Learning Software Platform Architect

3 Months ago • 5 Years + • Artificial Intelligence • $148,000 PA - $356,500 PA

Job Summary

Job Description

NVIDIA seeks a Machine Learning Software Platform Architect to design, develop, and maintain infrastructure for LLM-based applications within the chip design domain. Responsibilities include developing LLM applications for hardware engineers (e.g., QA bots, code generators), collaborating with chip designers and LLM research teams, optimizing infrastructure for performance and scalability, and ensuring data security. The ideal candidate has strong expertise in AI/ML infrastructure, Python, web development, and LLM techniques (LangChain, vector databases, prompt engineering), along with an understanding of chip design, data management experience, and strong problem-solving skills.
Must have:
  • 5+ years of experience in AI/ML infrastructure
  • Proficiency in Python and web development
  • LLM expertise (LangChain, vector databases)
  • Understanding of chip design and data challenges
  • Data management skills (cleaning, transformation)
  • Excellent problem-solving and teamwork skills
Good to have:
  • Experience with microservices
  • Cloud/distributed infrastructure background
  • React or Vue.js experience
  • SQL & NoSQL database knowledge
Perks:
  • Highly competitive salary
  • Comprehensive benefits package
  • Equity

Job Details

Widely considered to be one of the technology world’s most desirable employers, NVIDIA is an industry leader with groundbreaking developments in High-Performance Computing, Artificial Intelligence and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery, and powers what were once science fiction inventions, from artificial intelligence to autonomous cars.

NVIDIA is seeking a highly skilled and experienced Large Language Model (LLM) based Application Infrastructure Engineer to join our growing team. The successful candidate will work at the intersection of GPU chip design and AI. You will be responsible for the design, development, and maintenance of the infrastructure around NVIDIA's internal large language model aimed at facilitating chip design.

What you'll be doing:

  • Develop and maintain the infrastructure for managing large language model (LLM) based applications specifically adapted for the chip design and hardware domain.

  • Develop and maintain LLM-based applications to serve hardware engineers, such as an LLM-based QA bot or code generator.

  • Collaborate with HW chip designers and LLM research teams to understand the specific needs and challenges of GPU design and ensure the LLM infrastructure is well-suited to these needs.

  • Collaborate with LLM research teams to collect and organize training and fine-tuning data for a hardware-specific language model.

  • Optimize the infrastructure for performance, scalability, and reliability, and ensure the secure and efficient management of data.

  • Stay updated with the latest industry trends in AI and machine learning, and continuously look for opportunities to apply these advancements to improve the LLM infrastructure.

What we need to see:

  • BS in computer science or a related field, or equivalent experience

  • 5+ years of experience

  • Experience in developing and maintaining AI or machine learning infrastructure, preferably in the context of large language models.

  • Strong proficiency in Python and web development, and familiarity with LLM-related techniques, e.g., LangChain, vector databases, and prompt engineering.

  • Understanding of chip design and related computational and data challenges.

  • Experience with data management, including document cleaning, transformation, and secure storage.

  • Excellent problem-solving skills and the ability to work effectively in a team.

  • In-depth understanding of Machine Learning / Deep Learning / NLP concepts.

Ways to stand out from the crowd:

  • Experience crafting and developing production-quality microservices

  • Strong technical background in cloud/distributed infrastructure

  • Familiarity with front-end development using React or Vue.js

  • Strong understanding of SQL and NoSQL data platforms

NVIDIA offers highly competitive salaries and a comprehensive benefits package. We have some of the most forward-thinking and hardworking people in the world working for us and, due to outstanding growth, our exclusive engineering teams are rapidly growing. Are you creative and passionate about applying Machine Learning to solve remarkably interesting problems? Are you interested in being involved in state-of-the-art development in the field of AI, and do you love a challenge? If so, we want to hear from you!

The base salary range is 148,000 USD - 356,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.
