Technical Support Engineer, Linux and HPC Admin

1 Month ago • 5 Years + • Administrative

Job Summary

Job Description

NVIDIA seeks a Technical Support Engineer specializing in Linux and HPC administration for their Base Command Manager (BCM) product. This role involves providing technical support to both internal and external customers utilizing BCM for managing clusters ranging from a few to thousands of nodes. Responsibilities include troubleshooting issues, collaborating with the development team, becoming a subject matter expert, conducting research and development tasks, and promoting best practices. The ideal candidate will have 5+ years of experience in HPC support, strong Linux expertise, and familiarity with parallel filesystems, ML frameworks, and related technologies. The position is remote in New Zealand or Australia.
Must have:
  • 5+ years HPC support experience
  • Strong Linux knowledge
  • Customer-facing experience
  • Research and problem-solving skills
  • Excellent communication skills
Good to have:
  • BCM/Bright Cluster Manager experience
  • Experience with parallel filesystems (Lustre, GPFS, WekaIO)
  • Familiarity with ML frameworks (Spark, Kubernetes)
  • Experience with Ceph

Job Details

NVIDIA has been redefining computer graphics, PC gaming, and accelerated computing for over 25 years. It’s a unique legacy of innovation fueled by great technology—and dynamic people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. NVIDIANS immerse themselves in a diverse, supportive environment that encourages everyone to do their best work. Join the team and see how you can make a lasting impact on the world.

NVIDIA Base Command Manager powers thousands of clusters worldwide, varying from a few to several thousands of nodes, and streamlines cluster provisioning, workload management, and infrastructure monitoring. It provides all the tools you need to deploy and run an AI data center. We take great pride in providing excellent, comprehensive support to our customers! The Technical Support Engineer in this role will significantly impact and contribute to the overall success of both external customers running their clusters with NVIDIA solutions AND internal clusters used for research, operations, and next-generation projects.

What you’ll be doing:

  • Support our internal and external customers using our Linux-based cluster management software product, ensuring everyone receives the help they require to support their clusters.

  • Collaborate with the development team to collect the correct information and escalate issues to the appropriate development team.

  • Become and serve as a subject-matter expert in several areas.

  • Research and development tasks for customers or internal use by our development team.

  • Participate in proactive discussions with internal stakeholders to ensure BCM best practices are widely communicated.

  • Work with the latest hardware (e.g. GPUs, AI accelerators, high-speed interconnects) and software technologies such as parallel filesystems (e.g. Lustre, GPFS, WekaIO), Jupyter, and various ML frameworks and tools, Spark, Kubernetes, and Ceph.

What we need to see:

  • BS degree or equivalent experience in Electrical Engineering or related field.

  • 5 years of relevant, aligned experience providing support in the HPC realm, ideally in a customer-facing role.

  • Proven research skills and interest in assisting customers to achieve their goals.

  • Experience in a technical customer-facing role.

  • Eagerness to learn and become an authority on our product.

  • Excellent written communication skills with the ability to easily convey complex technical information to consumable summaries.

  • In-depth knowledge of Linux.

  • Familiarity with typical Linux installations and their most common software elements.

Ways to stand out from the crowd:

  • Experience with high-performance computing and system administration would be an asset

  • Previous experience as a system admin running BCM/Bright Cluster Manager/Base Command Manager clusters is a definite plus. 

Similar Jobs

NinjaVan - Staff Software Engineer

NinjaVan

Ho Chi Minh City, Ho Chi Minh City, Vietnam (Hybrid)
4 Months ago
ByteDance - Senior Machine Learning Ops Engineer, ML System

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
Playtech - Java Developer

Playtech

Sofia, Sofia City Province, Bulgaria (On-Site)
2 Months ago
Activision - Cloud Engineering Co-op

Activision

Vancouver, British Columbia, Canada (Hybrid)
3 Weeks ago
Buckman - Senior Lead Digital Innovation Engineer - Solution Architect

Buckman

Chennai, Tamil Nadu, India (On-Site)
3 Months ago
Tesla - Automotive Technician

Tesla

Duisburg, North Rhine-Westphalia, Germany (On-Site)
1 Week ago
Samsung Semiconductor - Sr. Administrative Assistant (Contractor)

Samsung Semiconductor

San Jose, California, United States (On-Site)
2 Weeks ago
Tesla - Payroll Specialist

Tesla

Brandenburg, Germany (On-Site)
1 Week ago
ION - Office Assistant - Categorie Protette Law. 68/99

ION

Pisa, Tuscany, Italy (On-Site)
4 Months ago
Hawk Eye Innovations - Football Systems Operator

Hawk Eye Innovations

Netherlands (On-Site)
3 Weeks ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Crunchyroll - Senior Software Engineer

Crunchyroll

(Remote)
1 Month ago
Alphasense - Join AlphaSense India Talent Community

Alphasense

Bengaluru, Karnataka, India (On-Site)
2 Months ago
Patterned Learning Career - Senior Software Engineer, Backend

Patterned Learning Career

(Remote)
1 Week ago
Hawk Eye Innovations - Backend Java Engineer - Contract

Hawk Eye Innovations

Budapest, Hungary (On-Site)
1 Week ago
CloudHire - Senior Full Stack Architect : Angular & NestJS

CloudHire

Hyderabad, Telangana, India (Remote)
4 Months ago
Ubisoft - Golang Developer

Ubisoft

Montreal, Quebec, Canada (Hybrid)
5 Months ago
Make - Senior Software Engineer - Full-Stack - Scenario designer

Make

Prague, Czechia (Hybrid)
3 Months ago
PwC - IN_Senior Associate_Full Stack Developer_Data & Analytics_Advisory_PAN India

PwC

Kolkata, West Bengal, India (On-Site)
4 Months ago
Extreme Network - Staff Backend Developer (Python, Microservices, GenAI - 92890) Ireland

Extreme Network

Shannon, County Clare, Ireland (Remote)
4 Months ago
Sinch - Senior Frontend (Full Stack) Engineer

Sinch

Malmö, Skåne County, Sweden (Hybrid)
4 Months ago

Get notifed when new similar jobs are uploaded

Jobs in New Zealand

CAUSE AND FX - FX Artist

CAUSE AND FX

Auckland, Auckland, New Zealand (Hybrid)
1 Week ago
Canva - Backend Software Engineer - Security Platform Engineering (Open to remote across ANZ)

Canva

Auckland, Auckland, New Zealand (Remote)
2 Months ago
Entain - Broadcast Channels Operator

Entain

Auckland, Auckland, New Zealand (On-Site)
4 Weeks ago
Rocket Werkz - GAME PROGRAMMER (C#)

Rocket Werkz

Auckland, Auckland, New Zealand (On-Site)
6 Months ago
Entain - Customer Service Representative

Entain

Taupō, Waikato, New Zealand (On-Site)
2 Months ago
Blind Squirrel Games - Technical Director

Blind Squirrel Games

Auckland, Auckland, New Zealand (On-Site)
2 Months ago
Rocket Werkz - TECHNICAL ARTIST (UNREAL ENGINE)

Rocket Werkz

Auckland, Auckland, New Zealand (On-Site)
7 Months ago
PikPok - Experienced/Senior Game Data Analyst

PikPok

Wellington, Wellington, New Zealand (On-Site)
1 Week ago
PikPok - Video Designer

PikPok

Wellington, Wellington, New Zealand (On-Site)
1 Month ago
Zuru - Product Design Engineer

Zuru

Auckland, Auckland, New Zealand (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

Administrative Jobs

Tesla - Customer Support Advisor - German (C1), Ukrainian (C1) - Berlin

Tesla

Berlin, Berlin, Germany (On-Site)
4 Days ago
Paytm - MYSQL - Senior Lead DBA

Paytm

Noida, Uttar Pradesh, India (On-Site)
3 Months ago
NetBrain Technologies  Inc  - Senior System Engineer - IT Infrastructure

NetBrain Technologies Inc

Hyderabad, Telangana, India (Hybrid)
4 Months ago
Interface AI - Technical Support Engineer L2

Interface AI

India (Remote)
6 Days ago
ION - Senior Linux Systems Administrator - Somerset, NJ

ION

Clifton, New Jersey, United States (Hybrid)
4 Months ago
Playground Games - Senior Office Administrator - Contract

Playground Games

Royal Leamington Spa, England, United Kingdom (On-Site)
2 Months ago
Wind River Systems - Star Lab - Principal Technologist - Embedded Security Professional Services

Wind River Systems

Washington, District Of Columbia, United States (On-Site)
3 Months ago
ByteDance - Senior Software Engineer - MySQL

ByteDance

Seattle, Washington, United States (On-Site)
2 Weeks ago
1920 - Production Coordinator for Commercials

1920

London, England, United Kingdom (Hybrid)
2 Months ago
Toptracer - Install Technician

Toptracer

Spain (Remote)
2 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.


Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Shenzhen, Guangdong Province, China (On-Site)

Bengaluru, Karnataka, India (On-Site)

Taipei City, Taiwan (On-Site)

Taipei City, Taiwan (On-Site)

Shanghai, Shanghai, China (On-Site)

Shanghai, Shanghai, China (On-Site)

Yokne'am Illit, North District, Israel (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug