Network Operations Engineer

TensorWave

5+ Years | Remote | Full Time | 1 day ago

Apply Now

Job Summary

Tensorwave Cloud's mission is to build seamless, secure, reliable, and resilient AI infrastructure at scale, eliminating barriers and challenging the status quo to empower builders and support AI innovation. We are seeking Network Operations Engineers responsible for the day-to-day operation, maintenance, and on-call support of large-scale data center networks supporting AI and GPU workloads.

Must Have

Operate and maintain large Ethernet fabrics
Participate in on-call rotation and incident response
Execute maintenance, upgrades, and hardware replacements
Troubleshoot latency, packet loss, and connectivity issues
Support continuous scale-out growth of production networks
5+ years data center network operations experience
Hands-on experience with large Ethernet fabrics and edge networks
Strong understanding of scale-up vs scale-out architectures
Experience operating production networks at scale
Multi-Vendor Experience (Juniper, Cisco, Arista, Whitebox)
NOS Experience (Junos, IOS/IOS-XE, NX-OS, EOS, SONiC)
Automation or scripting experience in Python, GO, Bash, or equivalent

Good to Have

Experience in 100G+ environments
Exposure to AI, GPU, or HPC

Perks & Benefits

Mission driven company
Competitive Salary
Stock Options
100% paid Medical, Dental, and Vision insurance
Life and Voluntary Supplemental Insurance
Short Term Disability Insurance
Flexible Spending Account
401(k)
Flexible PTO
Paid Holidays
Parental Leave
Mental Health Benefits through Spring Health

Job Description

Our mission at Tensorwave Cloud is to build seamless, secure, reliable, and resilient AI infrastructure at scale, eliminating barriers and challenging the status quo to empower builders and support AI innovation.

About the Role

We are seeking Network Operations Engineers responsible for the day-to-day operation, maintenance, and on-call support of large-scale data center networks supporting AI and GPU workloads.

Core Responsibilities

Operate and maintain large Ethernet fabrics
Participate in on-call rotation and incident response
Execute maintenance, upgrades, and hardware replacements
Troubleshoot latency, packet loss, and connectivity issues
Support continuous scale-out growth of production networks

Required Experience

5+ years data center network operations experience
Hands-on experience with large Ethernet fabrics and edge networks
Strong understanding of scale-up vs scale-out architectures
Experience operating production networks at scale
Multi-Vendor Experience (Juniper, Cisco, Arista, Whitebox)
NOS Experience (Junos, IOS/IOS-XE, NX-OS, EOS, SONiC)
Automation or scripting experience in Python, GO, Bash, or equivalent

Preferred Experience

100G+ environments
AI, GPU, or HPC exposure

We’re looking for resilient, adaptable people to join our team, people who believe in the mission and think at massive scale. The solutions that worked on a handful of devices will not work at Exascale. Be prepared to be pushed daily, to learn a lot, and literally build the future.

What We Bring