Senior System Networking Engineer, InfiniBand

1 Month ago • 8 Years + • Artificial Intelligence • Research & Development

Job Summary

Job Description

NVIDIA seeks a Senior System Networking Engineer specializing in InfiniBand for its HPC/AI E2E Verification team. This role involves designing and implementing innovative architectures for high-performance computing systems, focusing on scalability, performance, and functionality of NVIDIA InfiniBand HPC/AI solutions. Responsibilities include collaborating with cross-functional teams, planning and executing end-to-end test scenarios, analyzing results, generating reports, and driving process improvements. The ideal candidate possesses in-depth InfiniBand, Linux networking, and HPC/AI experience, with a strong analytical and problem-solving aptitude. Experience with AI application benchmarks and distributed job scheduling is highly desirable.
Must have:
  • 8+ years of experience in networking
  • InfiniBand XDR/NDR knowledge
  • Linux networking expertise
  • Strong analytical & problem-solving skills
  • HPC/AI architecture understanding
Good to have:
  • AI Application benchmarks experience
  • Distributed job scheduling expertise
  • NVIDIA Networking & AI architecture knowledge

Job Details

NVIDIA is looking for an outstanding candidate for a Senior System Networking Engineer role on the HPC/AI E2E Verification team. Be a key player working with the most exciting computing hardware and software, and contribute to the latest breakthroughs in InfiniBand networking technologies and high-performance computing. You will work with the latest InfiniBand-based switches, HCAs, AI servers, and software, together with many researchers, architects, and developers leading differentiated InfiniBand HPC/AI solutions.

What You Will Be Doing:

  • As a Senior System InfiniBand Networking Engineer, you will play a crucial role in crafting and implementing innovative architectures for high-performance computing systems, enabling efficient and scalable computation for AI/ML applications and HPC Benchmarks

  • Collaborating closely with multi-functional teams, including hardware engineers, software developers, and domain experts, to deliver optimized solutions that meet the demanding requirements of HPC/AI workloads

  • Planning, reviewing, and executing complex end-to-end scenarios with strong emphasis on scalability, performance, and functionality of NVIDIA InfiniBand HPC/AI solutions, ensuring alignment with NVIDIA Networking & AI specifications

  • Analyzing test results and generating detailed reports for stakeholders to facilitate informed decision-making

  • Driving continuous improvement initiatives, identifying opportunities to enhance verification processes and methodologies in the context of NVIDIA Networking & AI solutions

What We Need To See:

  • Bachelor's/Master's degree in Electrical Engineering, Computer Science, or equivalent experience in a networking/systems field

  • 8+ years of experience driving large-scale, complex solutions with strong emphasis on network troubleshooting and performance analysis

  • In-depth experience with and understanding of Linux-based networking systems

  • Strong analytical and problem-solving skills

  • Excellent communication and interpersonal skills

  • Ability to work effectively in a collaborative, fast-paced environment

Ways To Stand Out From The Crowd:

  • In-depth knowledge of InfiniBand XDR/NDR technology, NVIDIA Networking, and AI architectures, protocols, and standards

  • Expertise in high-performance computing and machine learning

  • Experience with AI application benchmarks and distributed job scheduling

We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, sex, gender, gender expression, sexual orientation, age, marital status, veteran status, or disability status. We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.