Solutions Architect, HPC Systems Engineer

3 Weeks ago • 5 Years + • Network Engineering • $148,000 PA - $235,750 PA

Job Summary

Job Description

NVIDIA seeks an experienced Solutions Architect & HPC Systems Engineer to drive the deployment of AI hardware and software solutions in customer data centers. Responsibilities include working with AI Native, Consumer Internet, and IT Services clients on large-scale GPU server and networking deployments; guiding network design and compute/storage; supporting server/network/cluster deployments (including on-site visits); providing technical expertise on advanced GPU & network systems; guiding product roadmap features based on customer requirements; identifying new project opportunities; collaborating with engineering, product, and sales teams; serving as a customer technical advisor; building product demonstrations; analyzing and debugging performance issues; and ensuring high-performing clusters. The role requires strong systems engineering skills, experience with GPU/network systems, and excellent communication.
Must have:
  • BS/MS/PhD in relevant field or equivalent experience
  • 5+ years of Systems/Solution Engineering experience
  • Expertise in CPU/GPU server architecture, NICs, Linux
  • Networking switch knowledge (Ethernet/InfiniBand)
  • Data center infrastructure knowledge (power/cooling)
  • Effective time management and multitasking
Good to have:
  • External customer-facing background
  • Large cluster deployment experience
  • C/C++, Linux kernel and driver experience
  • Experience with NVIDIA GPU systems/SDKs (CUDA)
  • NVIDIA Networking technologies (NICs, RoCE, InfiniBand)
  • ARM CPU solutions experience
  • Virtualization technology knowledge
Perks:
  • Equity
  • Benefits

Job Details

NVIDIA is looking for an experienced GPU and network systems Solutions Architect & Engineer. Do you want to be part of a team that brings new Artificial Intelligence (AI) hardware and software technologies to production in customer data centers? As part of the NVIDIA SA organization, you will drive the deployment and integration of our end-to-end technology solutions at some of NVIDIA's most strategic technology customers, and offer recommendations to business and engineering teams on our product roadmap.

What you will be doing:

  • Work with NVIDIA AI Native, Consumer Internet, and IT Services customers on large data center GPU server and networking system deployments as a Solutions Architect Engineer. Guide customer discussions on network design and compute/storage, and support the bring-up of server, network, and cluster deployments. You will need to visit customer data centers during the bring-up phase.

  • Demonstrate subject matter expertise in advanced GPU & network systems and be a trusted technical advisor to NVIDIA's strategic customers. Bring customer-specific requirements to product teams to guide product roadmap features.

  • Identify new project opportunities for NVIDIA products and technology solutions in data center and artificial intelligence applications. Work closely with the GPU/Network Systems Engineering, Product Management, and Sales teams.

  • Act as a trusted customer advisor, conducting regular technical meetings with customers on product roadmaps, cluster issue debugging, feature discussions, and introductions to new technology solutions

  • Build custom product demonstrations and POCs for solutions that address critical business needs of our customers

  • Analyze and debug compute/network configuration and performance issues to deliver performant clusters

What we need to see:

  • BS/MS/PhD in Electrical/Computer Engineering, Computer Science, Physics, or other Engineering fields or equivalent experience.

  • This role is for an individual with the motivation and skills to drive the data center engineering process. The ideal candidate has 5+ years of experience in Systems/Solution Engineering (or similar engineering roles)

  • System-level expertise in CPU/GPU server architecture, NICs, Linux, system software, and kernel drivers

  • Experience with networking switches for Ethernet/InfiniBand, and data center infrastructure (power/cooling)

  • Knowledge of DevOps/MLOps technologies such as Docker/containers, Kubernetes

  • Effective time management and the ability to balance multiple tasks

  • Strong verbal and written communication skills, with the ability to share your ideas and code clearly through documents, presentations, etc.

Ways to stand out from the crowd:

  • External customer-facing background

  • Experience with bring-up and deployment of large clusters

  • Systems engineering, coding, and debugging skills including experience with C/C++, Linux kernel and drivers

  • Hands-on experience with NVIDIA GPU systems/SDKs (e.g. CUDA), NVIDIA Networking technologies (e.g. NICs, RoCE, InfiniBand), and/or ARM CPU solutions

  • Familiarity with virtualization technology concepts

We make extensive use of conferencing tools, but occasional (20%) travel is required for on-site visits to customers and industry events. We are open to remote work locations and look forward to having you join our team!

NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. If you're creative and autonomous, we want to hear from you!

The base salary range is 148,000 USD - 235,750 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.
