AI Network System Architect

2 Weeks ago • 2 Years + • Artificial Intelligence

Job Summary

Job Description

NVIDIA seeks a Senior AI Network System Architect to design and develop next-generation networking products for high-performance and ML/AI computing. Responsibilities include investigating emerging technologies in ML/AI, executing workloads, profiling and analyzing bottlenecks, optimizing communication libraries (NCCL, UCX), conceptualizing next-generation networking products, developing simulation models, and collaborating with multi-functional teams. The role requires expertise in ML/AI workloads, distributed training, large-scale network behavior, and simulation environments. Experience with communication libraries, network protocols (InfiniBand, IP, TCP, RoCE), and programming languages (Python, C++) is highly desirable.
Must have:
  • M.Sc./Ph.D. in CS/CE/EE
  • 2+ years experience in computer networks
  • Expertise in ML/AI workloads
  • Understanding of large-scale network behavior
  • Simulation environment development
  • Problem-solving and critical thinking
Good to have:
  • Knowledge of NCCL, UCX, UCC
  • Knowledge of InfiniBand, IP, TCP, RoCE
  • Experience with Python, C++, Docker
  • System engineering expertise
  • Experience with DLRM, LLM, or generative AI

Job Details

Our technology has no boundaries! NVIDIA is building the world’s most groundbreaking and state-of-the-art accelerated computing platforms. Because of our work, scientists, researchers, and engineers can advance their ideas. We pioneered a supercharged form of computing loved by the fastest-paced computer users in the world - scientists, designers, artists, and gamers.

We seek a highly motivated Senior AI Network System Architect to join our team of experts and help shape the future of high-performance and ML / AI computing. Our next-generation Infiniband, NVLink, and Ethernet systems will be at the forefront of connecting and powering the world's most advanced AI clusters. As an AI system architect at NVIDIA, you will have the opportunity to work on some of the most cutting-edge technology and help drive the innovation of our next-generation networks that top researchers and engineers worldwide will use.

What You’ll Be Doing:

  • Investigating emerging technologies and methodologies in ML and AI to discern their interactions with network infrastructure.

  • Executing workloads on AI systems, conducting profiling, and analyzing bottlenecks and possible enhancements.

  • Conducting research and implementing optimizations for communication libraries like NCCL and UCX.

  • Spearheading the conceptualization of next-generation networking products tailored to support and accelerate state-of-the-art ML workloads.

  • Develop models for simulations, analyze simulation results, and develop optimization algorithms.

  • Collaborate with multi-functional teams, including other architecture teams, logic design, system software, firmware, and ML research teams, to ensure the successful execution of the project.

What We Need To See:

  • M.Sc, or Ph. D degree in Computer Science, Computer Engineering, or Electrical Engineering.

  • At least 2+ years of industry or research experience in computer networks.

  • Extensive expertise in ML/AI workloads, particularly in distributed training.

  • Excellent understanding of large-scale network behavior and the effect of distributed computing workloads on the network.

  • Experience in the development of simulation environments.

  • Great problem-solving and critical-thinking skills.

  • Ability to thrive in a fast-paced and dynamic environment is necessary.

  • Work concurrently with multiple groups in the organization.

Ways To Stand Out Of The Crowd:

  • Knowledge of communication libraries such as NCCL, UCX, and UCC.

  • Good knowledge of network protocols - such as InfiniBand, IP, TCP, RoCE, and network topologies.

  • Experience with Python, C++, and dockers.

  • Expertise in system engineering, operations research, and intricate hardware-software integrated systems.

  • Demonstrated experience in DLRM, LLM or other generative AI.

NVIDIA has some of the most forward-thinking and hardworking people in the world working for us, and due to unprecedented growth, our world-class engineering teams are growing fast. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you.

We are committed to fostering a diverse work environment and are proud to be an equal-opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law. We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, perform essential job functions, and receive other benefits and privileges of employment. Please contact us to request accommodation.

Similar Jobs

Tekion Corp - Senior Applied Scientist

Tekion Corp

Bengaluru, Karnataka, India (On-Site)
1 Day ago
Wildlife Studios - Data Engineer

Wildlife Studios

São Paulo, State Of São Paulo, Brazil (On-Site)
3 Months ago
Snowed In Studios - Principle Software Developer

Snowed In Studios

Quebec, Canada (Remote)
1 Month ago
Ubisoft - Senior Gameplay Programmer

Ubisoft

Barcelona, Catalonia, Spain (Hybrid)
2 Weeks ago
Google - Engineering Analyst, Trust and Safety Search

Google

Dublin, County Dublin, Ireland (On-Site)
2 Weeks ago
Universal Music - Universal Music Group 2025 Summer Internship Program: Prompt Engineer

Universal Music

Los Angeles, California, United States (On-Site)
2 Weeks ago
Google - Cloud AI Engineer, Global Services Delivery

Google

Mexico City, Mexico City, Mexico (On-Site)
2 Days ago
Trend Micro - NLP / Prompt Engineer (VicOne_Automotive Security)

Trend Micro

Taipei City, Taiwan (On-Site)
7 Months ago
Meta - Software Engineer, Systems ML - SW/HW Co-design

Meta

New York, New York, United States (Remote)
5 Months ago
Virtuos - Senior Games Tool Engineer (Machine Learning Specialist)

Virtuos

Shanghai, Shanghai, China (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Google - Bluetooth Firmware Engineer

Google

New Taipei, New Taipei City, Taiwan (On-Site)
1 Week ago
Illuminia - Sr. Software Engineer

Illuminia

San Diego, California, United States (On-Site)
1 Day ago
Meta - Software Engineer (Technical Leadership)

Meta

New York, New York, United States (On-Site)
5 Months ago
Google - Senior Software Engineer, Search Ads Auction Mechanisms

Google

Mountain View, California, United States (On-Site)
2 Weeks ago
Google - Software Engineer II, Back End, Core

Google

Mexico City, Mexico City, Mexico (On-Site)
2 Weeks ago
Google - Software Engineer III, AI/ML, YouTube Ads

Google

Mountain View, California, United States (On-Site)
2 Weeks ago
Google - Staff Research Engineer, Applied ML

Google

London, England, United Kingdom (On-Site)
2 Weeks ago
Google - Senior Staff Software Engineer, Google Cloud Compute

Google

Seattle, Washington, United States (On-Site)
2 Weeks ago
Google - Software Engineer III, AI/ML GenAI, Google Workspace

Google

Kirkland, Washington, United States (On-Site)
2 Weeks ago
IO Interactive - Senior Core Programmer

IO Interactive

Brighton And Hove, England, United Kingdom (Hybrid)
4 Weeks ago

Get notifed when new similar jobs are uploaded

Jobs in Yokne'am Illit, North District, Israel

Google - Senior Design Engineer, Networking, Google Cloud

Google

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
2 Weeks ago
Fairmatic - Senior Data Scientist

Fairmatic

Tel Aviv-Yafo, Tel Aviv District, Israel (Hybrid)
3 Weeks ago
Google - CPU Design Manager, Hardware

Google

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
2 Days ago
Playtika - Business Analyst

Playtika

Israel (On-Site)
2 Weeks ago
Playtika - Senior Level Designer - Solitaire Grand Harvest

Playtika

Israel (On-Site)
3 Months ago
NVIDIA - Senior Chip Design Engineer

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (Hybrid)
1 Month ago
NVIDIA - Director, Ethernet Solutions Product Management

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
1 Month ago
NVIDIA - Physical Design Engineer

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
2 Months ago
Plarium - Marketing Data Analyst

Plarium

Herzliya, Tel Aviv District, Israel (On-Site)
2 Months ago
NVIDIA - Senior Networking Security Research Architect

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Artificial Intelligence Jobs

Hedra - Senior Research Engineer

Hedra

San Francisco, California, United States (On-Site)
1 Month ago
The Walt Disney Company - Lead Software Engineer - Applied AI & Machine Learning

The Walt Disney Company

Santa Monica, California, United States (On-Site)
2 Weeks ago
Google - Group Product Manager, Generative AI, Google Cloud AI

Google

Sunnyvale, California, United States (On-Site)
2 Weeks ago
Meta - Software Engineer, Machine Learning

Meta

Pittsburgh, Pennsylvania, United States (On-Site)
5 Months ago
ByteDance - Research Scientist, Vision Foundation Model

ByteDance

San Jose, California, United States (On-Site)
6 Months ago
Microsoft - Member of Technical Staff – Voice & Vision

Microsoft

London, England, United Kingdom (On-Site)
1 Week ago
Meta - AI Research Scientist, Language - Generative AI

Meta

Menlo Park, California, United States (On-Site)
5 Months ago
Trend Micro - Sr. Data Scientist (AI Lab)

Trend Micro

Taipei City, Taiwan (On-Site)
7 Months ago
Krafton  - Technical Project Manager, Deep Learning Division

Krafton

Seoul, South Korea (On-Site)
3 Months ago
ByteDance - Research Scientist/Engineer - Multimodal Interaction & World Model

ByteDance

Singapore (On-Site)
5 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Since its founding in 1993, NVIDIA (NASDAQ: NVDA) has been a pioneer in accelerated computing. The company’s invention of the GPU in 1999 sparked the growth of the PC gaming market, redefined computer graphics, ignited the era of modern AI and is fueling the creation of the metaverse. NVIDIA is now a full-stack computing company with data-center-scale offerings that are reshaping industry.

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Massachusetts, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (On-Site)

Texas, United States (On-Site)

Santa Clara, California, United States (On-Site)

Santa Clara, California, United States (Hybrid)

Santa Clara, California, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by NVIDIA

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug