Network Engineer (HPC/RDMA)

2 Months ago • 5 Years + • Network Engineering

Job Summary

Job Description

TensorWave is seeking a passionate HPC/RDMA Engineer to join their IT team. The role involves designing and implementing innovative networking solutions to support high-performance AI workloads and cloud services. Responsibilities include exploring and integrating new network fabrics to enhance platform performance and scalability, ensuring network reliability, performance, and security for AI projects utilizing AMD and NVIDIA GPU technologies, and troubleshooting complex networking issues. The ideal candidate will have at least 5 years of experience in network engineering, strong knowledge of BGP, Ethernet protocols, RoCEv2, and network security practices, and experience with or interest in new network technologies for AI and cloud computing. This position offers opportunities for growth and creative problem-solving.
Must have:
  • 5+ years in network engineering
  • Focus on HPC/AI networking
  • Knowledge of BGP, Ethernet, RoCEv2
  • Network security practices
  • Familiarity with AMD/NVIDIA GPUs
  • Problem-solving skills
Good to have:
  • Bachelor's degree in CS/IT
  • Interest in new network fabrics
Perks:
  • Stock Options
  • 100% paid Medical, Dental, and Vision insurance
  • Life and Voluntary Supplemental Insurance
  • Short Term Disability Insurance
  • Flexible Spending Account
  • 401(k)
  • Flexible PTO
  • Paid Holidays
  • Parental Leave
  • Mental Health Benefits

Job Details

At TensorWave, we’re leading the charge in AI compute, building a versatile cloud platform that’s driving the next generation of AI innovation. We’re focused on creating a foundation that empowers cutting-edge advancements in intelligent computing, pushing the boundaries of what’s possible in the AI landscape.

About the Role:

We are looking for a HPC/RDMA Engineer with a passion for AI and advanced networking technologies. The ideal candidate will support our vision by developing and managing a networking infrastructure that underpins our innovative AI cloud services. This role involves exploring and integrating new types of network fabrics to enhance our platform's performance and scalability, ensuring optimal operation for our clients' AI projects.

Responsibilities:

  • Collaborate with a dynamic IT team to design and implement innovative networking solutions that meet the demands of high-performance AI workloads.

  • Lead initiatives to explore and integrate new types of network fabrics, enhancing the scalability and efficiency of our AI infrastructure.

  • Ensure network reliability, performance, and security for cloud services, optimizing for both AMD and NVIDIA GPU technologies.

  • Work closely with the AI development team to align networking strategies with the overall goals of TensorWave's cloud platform.

  • Troubleshoot and resolve complex networking issues, providing expert guidance and solutions to maintain high service levels.

Essential Skills & Qualifications:

  • Bachelor’s degree in Computer Science, Information Technology, or related field.

  • At least 5 years of relevant experience in network engineering, with a focus on supporting high-performance computing (HPC) and AI applications.

  • Strong knowledge of BGP, Ethernet protocols, RoCEv2, and network security practices.

  • Experience with or keen interest in exploring new network fabrics and technologies, particularly in the context of AI and cloud computing.

  • Familiarity with AMD and NVIDIA GPU ecosystems and their impact on network performance and configuration.

  • Exceptional problem-solving abilities and a commitment to innovation in networking for AI applications.

We’re looking for resilient, adaptable people to join our team—folks who enjoy collaborating and tackling tough challenges. We’re all about offering real opportunities for growth, letting you dive into complex problems and make a meaningful impact through creative solutions. If you're a driven contributor, we encourage you to explore opportunities to make an impact at TensorWave. Join us as we redefine the possibilities of intelligent computing.

What We Bring:

In addition to a competitive salary, we offer a variety of benefits to support your needs, including:

  • Stock Options

  • 100% paid Medical, Dental, and Vision insurance 

  • Life and Voluntary Supplemental Insurance

  • Short Term Disability Insurance

  • Flexible Spending Account

  • 401(k)

  • Flexible PTO

  • Paid Holidays

  • Parental Leave

  • Mental Health Benefits through Spring Health

Similar Jobs

Epic Games - Senior Software Engineer

Epic Games

Germany (On-Site)
4 Months ago
Google - Software Engineer, Machine Learning, YouTube Ads

Google

Mountain View, California, United States (On-Site)
3 Weeks ago
broadcom - Junior R & D Software Engineer

broadcom

Palo Alto, California, United States (On-Site)
3 Weeks ago
PhonePe - Site Reliability Engineer - Systems

PhonePe

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Poppulo - Engineering Manager

Poppulo

Minneapolis, Minnesota, United States (On-Site)
2 Months ago
NCR Voyix - Network Engineer

NCR Voyix

Tokyo, Japan (On-Site)
2 Months ago
extreme network - Senior/Staff Software Systems Engineer - Golang, Networking/Cloud Technologies

extreme network

Bengaluru, Karnataka, India (Hybrid)
8 Months ago
Temporal Technologies - Senior Engineering Manager - Open Source Server

Temporal Technologies

United States (On-Site)
2 Months ago
Zscaler - Senior Network Engineer

Zscaler

Japan (Hybrid)
2 Months ago
extreme network - Senior/Staff Systems Software Engineer – Python, Go, C++, Networking

extreme network

Ontario, Canada (Hybrid)
4 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Axel springer - Video Editor - Social (m/f/d)

Axel springer

Berlin, Berlin, Germany (On-Site)
3 Weeks ago
Jane Street - Experienced Hire Recruiter, Technology

Jane Street

New York, United States (On-Site)
1 Month ago
binance - Frontend Developer (Big Data)

binance

Taipei City, Taiwan (Remote)
10 Months ago
Scientific Games - Tech Ops Engineer

Scientific Games

Montreal, Quebec, Canada (Remote)
3 Weeks ago
Capgemini - Network Administration-Fortinet Network Security

Capgemini

Mumbai, Maharashtra, India (On-Site)
2 Months ago
Google - Software Engineering Manager (For Women in Tech Candidates)

Google

State Of Minas Gerais, Brazil (On-Site)
7 Months ago
Assystems - Network Administrator - L2

Assystems

Gurugram, Haryana, India (On-Site)
9 Months ago
Saronic Technologies - Systems Software Engineer

Saronic Technologies

Austin, Texas, United States (On-Site)
3 Weeks ago
Addepar - Staff Site Reliability Engineer

Addepar

United States (Remote)
3 Weeks ago

Get notifed when new similar jobs are uploaded

Jobs in Las Vegas, Nevada, United States

Visa - Sr. Analyst, Regulatory Affairs

Visa

Atlanta, Georgia, United States (Hybrid)
2 Months ago
Flying Bark - Production Assistant

Flying Bark

Glendale, California, United States (Hybrid)
3 Weeks ago
Penrose studios - Executive Assistant

Penrose studios

San Francisco, California, United States (On-Site)
3 Months ago
Snap Mobile INC - Account Executive

Snap Mobile INC

Jackson, Mississippi, United States (On-Site)
3 Months ago
Roblox - Senior Software Engineer, Rights & Guidelines

Roblox

San Mateo, California, United States (On-Site)
1 Month ago
Meta - Software Engineer, Systems ML - SW/HW Co-design

Meta

Redmond, Washington, United States (On-Site)
9 Months ago
Pomelo - Security and Compliance Analyst

Pomelo

United States (Remote)
1 Month ago
attentive - Senior Communications Manager

attentive

United States (Remote)
3 Weeks ago
USE Insider - Senior Product Marketing Manager, Analyst Relations

USE Insider

United States (Remote)
3 Months ago
Illumina - Engineer 2 - Process Development

Illumina

San Diego, California, United States (On-Site)
3 Weeks ago

Get notifed when new similar jobs are uploaded

Network Engineering Jobs

Universal Music Group - Network Engineer

Universal Music Group

New York, United States (On-Site)
1 Month ago
Google - Senior Software Engineer, Google Cloud Global Networking

Google

Austin, Texas, United States (On-Site)
3 Months ago
Rockstar Games - Senior Network Programmer

Rockstar Games

Dundee, Scotland, United Kingdom (On-Site)
2 Months ago
Sailpoint - Senior Network Engineer

Sailpoint

Austin, Texas, United States (On-Site)
1 Month ago
Zones - Network Engineer L2

Zones

Bengaluru, Karnataka, India (On-Site)
8 Months ago
dun bradstreet - Network Engineer

dun bradstreet

Warsaw, Masovian Voivodeship, Poland (Hybrid)
6 Months ago
NetBrain - Network Automation Engineer

NetBrain

Hyderabad, Telangana, India (Hybrid)
2 Months ago
Qualcomm - Network Engineer

Qualcomm

Bengaluru, Karnataka, India (On-Site)
2 Months ago
luxsoft - Network Engineer / Backend Developer

luxsoft

Egypt (Remote)
4 Weeks ago

Get notifed when new similar jobs are uploaded

About The Company

Las Vegas, Nevada, United States (On-Site)

Las Vegas, Nevada, United States (On-Site)

Las Vegas, Nevada, United States (On-Site)

Las Vegas, Nevada, United States (On-Site)

Las Vegas, Nevada, United States (On-Site)

Las Vegas, Nevada, United States (On-Site)

Las Vegas, Nevada, United States (On-Site)

Tucson, Arizona, United States (On-Site)

Las Vegas, Nevada, United States (Remote)

Las Vegas, Nevada, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by TensorWave

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug