Senior System Networking Engineer, InfiniBand

4 Months ago • 8 Years + • Artificial Intelligence • Research & Development

Job Summary

Job Description

NVIDIA seeks a Senior System Networking Engineer specializing in InfiniBand for its HPC/AI E2E Verification team. This role involves designing and implementing innovative architectures for high-performance computing systems, focusing on scalability, performance, and functionality of NVIDIA InfiniBand HPC/AI solutions. Responsibilities include collaborating with cross-functional teams, planning and executing end-to-end test scenarios, analyzing results, generating reports, and driving process improvements. The ideal candidate possesses in-depth InfiniBand, Linux networking, and HPC/AI experience, with a strong analytical and problem-solving aptitude. Experience with AI application benchmarks and distributed job scheduling is highly desirable.
Must have:
  • 8+ years experience in networking
  • InfiniBand XDR/NDR knowledge
  • Linux networking expertise
  • Strong analytical & problem-solving skills
  • HPC/AI architecture understanding
Good to have:
  • AI Application benchmarks experience
  • Distributed job scheduling expertise
  • Nvidia Networking & AI architecture knowledge

Job Details

NVIDIA is looking for an outstanding candidate for a Senior System Networking Engineer role in HPC/AI E2E Verification team. Be a key player to the most exciting computing hardware and software to contribute to the latest breakthroughs in InfiniBand networking technologies and High-performance computing. You will work with the latest InfiniBand based Switches, HCAs, AI servers and Software, together with many researchers, Architects and developers leading differentiated InfiniBand HPC/AI solutions.

What You Will Be Doing:

  • As a Senior System InfiniBand Networking Engineer, you will play a crucial role in crafting and implementing innovative architectures for high-performance computing systems, enabling efficient and scalable computation for AI/ML applications and HPC Benchmarks

  • Collaborating closely with multi-functional teams, including hardware engineers, software developers, and domain experts, to deliver optimized solutions that meet the demanding requirements of HPC/AI workloads

  • Planning, Reviewing, and Executing complexed End-to-End scenarios with strong emphasis on Scalability, Performance, and Functionality of NVIDIA InfiniBand HPC/AI solutions ensuring alignment with Nvidia Networking & AI specifications

  • Analyzing test results and generating detailed reports for stakeholders to facilitate informed decision-making

  • Drive continuous improvement initiatives, identifying opportunities to enhance verification processes and methodologies in the context of Nvidia Networking & AI solutions

What We Need To See:

  • Bachelor's/Master’s degree in electrical engineering, Computer Science, or equivalent experience in Networking/System field

  • 8+ years experience driving large-scale complexed solutions with strong emphasis on networking troubleshooting and Performance analysis

  • In depth experience and understanding of Linux based networking systems

  • Strong analytical and problem-solving skills

  • Excellent communication and interpersonal skills

  • Ability to work effectively in a collaborative, fast-paced environment

Ways To Stand Out From The Crowd:

  • In-depth knowledge of InfiniBand XDR/NDR technology, Nvidia Networking, and AI architectures, protocols, and standards

  • Expertise in High-performance computing and Machine learning

  • Experience in AI Application benchmarks and Distributed job scheduling

We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, sex, gender, gender expression, sexual orientation, age, marital status, veteran status, or disability status. We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.

Similar Jobs

Offworld - DevOps Engineer

Offworld

New Westminster, British Columbia, Canada (On-Site)
1 Month ago
Milestone - Senior QA Engineer

Milestone

Sofia, Sofia City Province, Bulgaria (Hybrid)
1 Month ago
Rockstar Games - Senior Technical Artist: Performance Capture

Rockstar Games

London, England, United Kingdom (On-Site)
1 Month ago
Rush Street Interactive - Senior IT Support Engineer

Rush Street Interactive

Toronto, Ontario, Canada (On-Site)
3 Months ago
Appier - Software Engineer, Site Reliability Engineering

Appier

Taipei City, Taiwan (On-Site)
6 Months ago
ByteDance - Senior Research Scientist, Foundation Model, Speech Understanding

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
Inworld AI - Staff / Principal AI Researcher - USA

Inworld AI

Mountain View, California, United States (Remote)
5 Months ago
Rackspace Technology - Principal MLOps Engineer

Rackspace Technology

San Antonio, Texas, United States (Remote)
1 Month ago
Canva - Machine Learning Research Engineering Manager - Image Generation

Canva

Vienna, Vienna, Austria (Remote)
1 Month ago
Ubisoft - Senior Software Engineer - AI Applications

Ubisoft

Saint-Mandé, Île-de-France, France (Hybrid)
2 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

NVIDIA - Senior Technical Instructor - AI and Data Center Infrastructure

NVIDIA

Ra'anana, Center District, Israel (On-Site)
2 Months ago
Google - Software Engineer, PhD, Early Career, Campus, Embedded Systems and Firmware, 2025 start

Google

Atlanta, Georgia, United States (On-Site)
6 Months ago
Google - Data Center Operations Manager, Global Server Operations

Google

Papillion, Nebraska, United States (On-Site)
1 Month ago
Newrick Network - AWS DevOps Engineer

Newrick Network

Ontario, Canada (Remote)
1 Month ago
SideFX Software - Quality Assurance Specialist - Houdini

SideFX Software

Ontario, Canada (Hybrid)
3 Months ago
Skydio - Senior Software Engineer - Manufacturing Software

Skydio

Bengaluru, Karnataka, India (On-Site)
7 Months ago
Wargaming - Infrastructure Engineer

Wargaming

Nicosia, Nicosia, Cyprus (Hybrid)
1 Month ago
Google - Technical Solutions Engineer, Infrastructure Compute

Google

Pune, Maharashtra, India (On-Site)
1 Month ago
NVIDIA - Senior Technical Instructor - AI and Data Center Infrastructure

NVIDIA

United Kingdom (Remote)
2 Months ago
GoMotive - Technical Lead Manager, Embedded Safety

GoMotive

United States (Remote)
2 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Yokne'am Illit, North District, Israel

Google - Software Engineer III, Research

Google

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
1 Month ago
PAPAYA - Facebook Community Manager

PAPAYA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
1 Month ago
Pazu Games - Monetization Product Manager

Pazu Games

Israel (On-Site)
1 Month ago
NVIDIA - STA Backend Engineer

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
2 Months ago
Plarium - IT Support Administrator

Plarium

Herzliya, Tel Aviv District, Israel (On-Site)
3 Months ago
Varonis  - Python Developer

Varonis

Herzliya, Tel Aviv District, Israel (Hybrid)
5 Months ago
SciPlay - Configuration Manager

SciPlay

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
2 Months ago
Playtika - Marketing Strategy Manager (Temporary Position)

Playtika

Israel (On-Site)
5 Months ago
NVIDIA - Senior Network Engineer

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
4 Months ago
Playtika - Director Of Monetization, VIP & CS

Playtika

Israel (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Google - Senior Software Engineer, Core Machine Learning, Google Cloud

Google

Sunnyvale, California, United States (On-Site)
6 Months ago
Microsoft - Senior Researcher – Generative AI

Microsoft

Redmond, Washington, United States (On-Site)
1 Month ago
Google - Software Engineer, Performance Modeling

Google

Durham, North Carolina, United States (On-Site)
1 Month ago
Light Speed Studios - Senior Researcher, Natural Language Processing

Light Speed Studios

Tokyo, Japan (On-Site)
1 Month ago
Google - Developer Relations Engineer, AI and Compute Enablement

Google

New York, New York, United States (On-Site)
1 Month ago
Zoox - Senior/ Staff Software Engineer - Simulation Workload Orchestration

Zoox

Foster City, California, United States (Hybrid)
7 Months ago
Omnissa - Staff Engineer (Data Science)

Omnissa

Bengaluru, Karnataka, India (Hybrid)
6 Months ago
ByteDance - Software Engineer Intern (Doubao (Seed) - Machine Learning System) - 2025 Summer (PhD)

ByteDance

Seattle, Washington, United States (On-Site)
6 Months ago
Dolby Laboratories - AIOps Research Scientist

Dolby Laboratories

Bengaluru, Karnataka, India (Hybrid)
7 Months ago
Ubisoft - Senior C++ Programmer - Machine Learning

Ubisoft

Montreal, Quebec, Canada (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Massachusetts, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Texas, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug