Machine Learning Software Platform Architect

1 Month ago • 5 Years + • Artificial Intelligence • $148,000 PA - $356,500 PA

Job Summary

Job Description

NVIDIA seeks a Machine Learning Software Platform Architect to design, develop, and maintain infrastructure for large language model (LLM) applications in chip design. Responsibilities include developing LLM-based applications (e.g., QA bots, code generators), collaborating with hardware engineers and LLM research teams, optimizing infrastructure for performance and scalability, managing data securely, and staying current with AI trends. The role requires expertise in LLM infrastructure, Python, web development, chip design, and data management.
Must have:
  • 5+ years experience in AI/ML infrastructure
  • Proficiency in Python and web development
  • LLM expertise (Langchain, vector databases)
  • Understanding of chip design and data challenges
  • Data management skills (cleaning, transformation, storage)
Good to have:
  • Microservices development experience
  • Cloud/distributed infrastructure expertise
  • React or Vue.js front-end development
  • SQL & NoSQL database knowledge
Perks:
  • Highly competitive salary
  • Comprehensive benefits package
  • Equity

Job Details

Widely considered to be one of the technology world’s most desirable employers, NVIDIA is an industry leader with groundbreaking developments in High-Performance Computing, Artificial Intelligence and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. Our work opens up new universes to explore, enables amazing creativity and discovery and powers what were once science fiction inventions from artificial intelligence to autonomous cars. NVIDIA is seeking a highly skilled and experienced Large Language Model (LLM) based Application Infrastructure engineer to join our growing team. The successful candidate will work at the intersection of GPU chip design and AI. You will be responsible for the design, development, and maintenance of the infrastructure around Nvidia's internal large language model aimed at facilitating chip design.

What you'll be doing:

  • Develop and maintain the infrastructure for managing large language models (LLMs) based application specifically adapted for the chip design and hardware domain.

  • Develop and maintain LLM based applications to serve hardware engineers, such as LLM based QA bot, code generator etc.

  • Collaborate with HW chip designers and LLM research teams to understand the specific needs and challenges of GPU design and ensure the LLM infrastructure is well-suited to these needs.

  • Collaborate with LLM research teams to collect & organize training / fine-tuning data to train hardware specific language model

  • Optimize the infrastructure for performance, scalability, and reliability, and ensure the secure and efficient management of data.

  • Stay updated with the latest industry trends in AI and machine learning, and continuously look for opportunities to apply these advancements to improve the LLM infrastructure.

What we need to see:

  • BS in computer science or related or equivalent experience

  • 5+ years experience

  • Experience in developing and maintaining AI or machine learning infrastructure, preferably in the context of large language models.

  • Strong proficiency in Python and web development, and familiarity with LLM related techniques e.g., langchain, vector database, prompt engineering, etc.

  • Understanding of chip design and related computational and data challenges.

  • Experience with data management, including doc cleaning, transformation, and secure storage.

  • Excellent problem-solving skills and the ability to work effectively in a team.

  • In depth understanding of Machine Learning / Deep Learning / NLP concepts.

Ways to stand out from the crowd:

  • You crafted & developed production quality microservices

  • Strong technical background in cloud/distributed infrastructure

  • An excellent plus if you are familiar with front-end development using React or Vue.js

  • Strong understanding of SQL & NoSQL Data platforms.

NVIDIA offers highly competitive salaries and a comprehensive benefits package. We have some of the most forward-thinking and hardworking people in the world working for us and, due to outstanding growth, our exclusive engineering teams are rapidly growing. Are you a creative and passionate about applying Machine Learning to solve remarkably interesting problems? Are you interested in being involved in state-of-the-art development in the field of AI & love a challenge? If so, we want to hear from you!

The base salary range is 148,000 USD - 356,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

Similar Jobs

Urbint - Senior Full Stack Developer

Urbint

Bengaluru, Karnataka, India (Hybrid)
5 Months ago
Plarium - Data Architect

Plarium

Herzliya, Tel Aviv District, Israel (On-Site)
1 Month ago
The Walt Disney Company - Sr Software Engineer (JavaScript)

The Walt Disney Company

Washington, United States (On-Site)
2 Months ago
Super - Staff Software Engineer - Full-Stack

Super

Canada (Remote)
6 Days ago
ByteDance - Software Engineer in ML Engineering Platform

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
NVIDIA - Solutions Architect - Cloud Providers and Hyperscale

NVIDIA

California, United States (On-Site)
1 Week ago
Rackspace Technology - Machine Learning Architect (AWS)

Rackspace Technology

(Remote)
2 Months ago
NVIDIA - Principal DGX Cloud Machine Learning Architect

NVIDIA

Santa Clara, California, United States (On-Site)
1 Month ago
NVIDIA - Principal Engineer

NVIDIA

United States (Remote)
1 Month ago
Meta - Research Scientist Intern, Smart Glasses in Wearables AI (PhD)

Meta

Menlo Park, California, United States (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Super - Senior Software Engineer, Payments (Remote!)

Super

Toronto, Ontario, Canada (Remote)
5 Months ago
ByteDance - Backend Engineer, Machine Learning Systems - Singapore

ByteDance

Singapore (On-Site)
5 Months ago
Info Stretch - Sr. .NET Developer

Info Stretch

Indianapolis, Indiana, United States (On-Site)
3 Months ago
Ludeo - Front End Tech Lead

Ludeo

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
3 Months ago
Meta - Software Engineer, Front End

Meta

Singapore (On-Site)
4 Months ago
PwC - .NET Developer (freelance)

PwC

Warsaw, Masovian Voivodeship, Poland (Hybrid)
6 Months ago
Canva - Senior Frontend Engineer - Canva for Education

Canva

Auckland, Auckland, New Zealand (Remote)
6 Days ago
Tesla - Software Developer, IT Application

Tesla

North Holland, Netherlands (On-Site)
1 Month ago
PwC - Senior .NET Developer

PwC

Athens, Greece (Remote)
2 Months ago
Tru India - React Native Developer

Tru India

Sahibzada Ajit Singh Nagar, Punjab, India (On-Site)
7 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Canada

Keywords Studios (Player Support) - IT Support Technician

Keywords Studios (Player Support)

Vancouver, British Columbia, Canada (On-Site)
5 Days ago
People Can Fly - AI Programmer

People Can Fly

Montreal, Quebec, Canada (Remote)
6 Days ago
Scientific Games  - Electrotechnician

Scientific Games

Montreal, Quebec, Canada (On-Site)
1 Month ago
Sago Mini - Unity Game Developer Intern

Sago Mini

Toronto, Ontario, Canada (On-Site)
2 Weeks ago
Ubisoft - Process Analyst - Organizational Transformation

Ubisoft

Montreal, Quebec, Canada (On-Site)
2 Weeks ago
Ubisoft - ServiceNow Developer

Ubisoft

Montreal, Quebec, Canada (On-Site)
3 Days ago
Airlab Inc  - Senior Lead Programmer (Game Industry)

Airlab Inc

Quebec, Canada (On-Site)
6 Days ago
Skybox Labs - Senior Environment Artist - Levels

Skybox Labs

Burnaby, British Columbia, Canada (Hybrid)
6 Days ago
Gamebreaking Studios - Engineering Manager (Unreal Gameplay Focus)

Gamebreaking Studios

Canada (Remote)
5 Months ago
Inworld AI - Staff C++ Developer

Inworld AI

Vancouver, British Columbia, Canada (On-Site)
6 Days ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Token Metrics - Crypto Data Scientist / Machine Learning Engineer  (Remote)

Token Metrics

Tiranë, Tirana County, Albania (Remote)
5 Months ago
Genies - ML Engineering Intern

Genies

Los Angeles, California, United States (Hybrid)
6 Days ago
FTF Studios - FTF Mid-Level Programmer

FTF Studios

(Remote)
1 Year ago
NVIDIA - AI Developer Technology Engineer

NVIDIA

Shanghai, Shanghai, China (On-Site)
21 Hours ago
NVIDIA - Deep Learning Software Engineer, Performance Optimization

NVIDIA

Tokyo, Japan (On-Site)
2 Months ago
Inworld AI - Staff C++ Engineer

Inworld AI

Mountain View, California, United States (On-Site)
6 Days ago
Krafton  - Deep Learning Engineer - LLM Game Agent

Krafton

Seoul, South Korea (On-Site)
1 Month ago
Wargaming - Gen AI Business Development Manager

Wargaming

Berlin, Berlin, Germany (On-Site)
1 Month ago
Omnissa - Staff Engineer (Data Science)

Omnissa

Bengaluru, Karnataka, India (Hybrid)
4 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.


Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Ra'anana, Center District, Israel (On-Site)

Ra'anana, Center District, Israel (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug