
Senior Solution Engineer, Mission Control

7 Hours ago • 5 Years + • Artificial Intelligence • Research & Development • $136,000 PA - $264,500 PA

Job Summary

Job Description

NVIDIA seeks a Senior Solution Engineer to join its Mission Control team, focusing on automating AI Factory operations. This role involves direct customer interaction, troubleshooting software issues, collaborating with engineering teams, creating support tools, and driving issue resolution. The ideal candidate will possess strong Linux, containerization (Kubernetes), and programming (Python) skills, expertise in analyzing distributed GPU-accelerated workloads, and excellent communication abilities. Responsibilities include providing direct customer support, working with engineering teams on issue triage, developing and updating tools, and documenting customer interactions. Experience with parallel filesystems (Lustre, GPFS, WekaIO), Jupyter, ML frameworks, Spark, Ceph, and various hardware (GPUs, AI accelerators) is beneficial. Occasional weekend/holiday work may be required.
Must have:
  • 5+ years AI/ML experience
  • Linux expertise
  • Kubernetes experience
  • Python proficiency
  • Excellent communication
  • Problem-solving skills
Good to have:
  • Chatbot experience
  • RAG pipelines
  • Vector databases
  • Distributed training
  • PyTorch/TensorFlow
  • C/C++ development
  • CUDA experience
Perks:
  • Equity
  • Benefits

Job Details

NVIDIA is looking for an engineer who wants the buzz of direct customer interaction and the reward of contributing to software and products. We want the right person to join our team of Solution Engineers working on NVIDIA Mission Control, which automates the operations of AI Factories. We need an expert engineer to triage customer software issues and resolve customer problems. You must have excellent problem-solving abilities and communication experience and be able to work on multiple projects and tasks. You must be strong in Linux, have solid programming skills, and possess experience working with containers and related technologies such as Kubernetes. Experience analyzing the performance of distributed GPU-accelerated workloads is a plus.

What you'll be doing:

  • Provide direct support to our NVIDIA Enterprise customers: answer questions and reproduce or resolve customer issues.

  • Work with engineering teams on customer issues, providing logs, reproduction information, and other triage information.

  • Create/update product and/or support tools.

  • Own and drive customer issues from inception to resolution.

  • Document customer interactions and enhance our knowledge base.

  • Work with the latest hardware (e.g. GPUs, AI accelerators, high-speed interconnects) and software technologies, including parallel filesystems (e.g. Lustre, GPFS, WekaIO), Jupyter, Spark, Kubernetes, Ceph, and various ML frameworks and tools.

  • Occasionally work on weekends and holidays to support customers.

What we need to see:

  • Minimum of a BS in Computer Science, Electrical Engineering, or equivalent experience.

  • At least 5 years of engineering experience with a proven track record in AI/ML-focused projects or enterprise-grade solutions.

  • Expertise analyzing, optimizing, and customizing Linux environments for AI/ML workloads.

  • Strong container orchestration/job scheduling experience on compute clusters, especially with Kubernetes

  • Professional-level communication experience, able to adjust to the technical level of the audience, and stay calm and focused in negative situations.

  • Excellent follow-up and organizational skills, with a love for solving problems.

  • Proficient in Python programming with the ability to develop scripts and build custom tools. Experience with parallel programming or GPU acceleration (e.g., CUDA) is highly desirable.

Ways to stand out from the crowd:

  • Experience with chatbots, RAG pipelines, vector databases, or distributed training or inference workloads

  • Experience developing in GPU-accelerated, cloud, or virtualized environments

  • Experience with containerized solutions or job scheduling (Docker, Kubernetes, and/or Slurm), and/or experience analyzing the software performance of distributed workloads

  • Experience with common deep learning frameworks such as PyTorch or TensorFlow

  • Experience developing with C/C++

The base salary range is 136,000 USD - 264,500 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

You will also be eligible for equity and benefits. NVIDIA accepts applications on an ongoing basis.

NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

