Staff Network Engineer

Lambda

Job Summary

Lambda, The Superintelligence Cloud, is seeking a Staff Network Engineer to scale its high-performance cloud network. This role involves contributing to reproducible automation, designing software-defined networks, managing Spine and Leaf networks, and ensuring high availability and predictable performance. The engineer will also deploy and maintain network monitoring tools, working in a hybrid model from San Francisco or Seattle.

Must Have

  • Scale high-performance cloud networks.
  • Automate network configuration reproducibly.
  • Design and develop software-defined networks.
  • Manage Spine and Leaf networks.
  • Ensure high network availability and predictable performance.
  • Deploy and maintain network monitoring and management tools.
  • 15+ years of experience designing and operating production datacenter networks.
  • Led large production-scale networking projects.
  • Expert in CLOS/Spine and Leaf fabrics, EVPN/VXLAN, ECMP, BGP, and fast convergence techniques.
  • Experience with multi-data center, backbone, and hybrid cloud networks.
  • Production experience with at least two switch/router vendors.
  • Experience with Next-Generation Firewalls (NGFW).
  • Experience with load balancers such as F5 or NetScaler.
  • Comfortable on the Linux command line, with an understanding of the Linux networking stack.
  • Strong automation skills (Python, Ansible) and network APIs.

Good to Have

  • Hands-on with HPC/AI networking: RoCEv2 and/or InfiniBand (Congestion Control, VLs, partitions), GPUDirect RDMA concepts.
  • Experience with DWDM technologies and SD-WAN.
  • Understanding of data center power/space/cooling trade-offs and their impact on topology choices.
  • Experience with Observability tools like Datadog, Splunk, Grafana, Prometheus.
  • Experience automating network configuration within public clouds, with tools like Terraform.
  • Led implementation of production-scale SDNs in a cloud context.
  • Deep understanding of the Linux networking stack and its interaction with network virtualization.
  • Experience with SDN ecosystem (e.g. OVS, Neutron, DPDK, Cisco ACI or Nexus Fabric Controller, Arista CVP).

Perks & Benefits

  • Generous cash & equity compensation.
  • Health, dental, and vision coverage for you and your dependents.
  • Wellness and Commuter stipends for select roles.
  • 401k Plan with 2% company match (USA employees).
  • Flexible Paid Time Off Plan that we all actually use.

Job Description

Lambda, The Superintelligence Cloud, builds Gigawatt-scale AI Factories for Training and Inference. Lambda’s mission is to make compute as ubiquitous as electricity and give every person access to artificial intelligence. One person, one GPU.

If you'd like to build the world's best deep learning cloud, join us.

*Note: This position requires presence in our San Francisco or Seattle office location 4 days per week; Lambda’s designated work from home day is currently Tuesday.

What You'll Do

  • Help scale Lambda’s high-performance cloud network
  • Contribute to the reproducible automation of network configuration
  • Contribute to the design and development of software-defined networks
  • Help manage Spine and Leaf networks
  • Ensure high availability of our network through monitoring, failover, and redundancy
  • Ensure VMs/clients have predictable networking performance through the use of QoS and other applicable technologies
  • Help with deploying and maintaining network monitoring and management tools

You

  • Have 15+ years of experience in designing and operating production datacenter networks
  • Have led the implementation of large production-scale networking projects
  • Expert in CLOS/Spine and Leaf fabrics, EVPN/VXLAN, ECMP, BGP, and fast convergence techniques
  • Have experience with multi-data center networks, backbone and hybrid cloud networks
  • Production experience with at least two switch/router vendors (e.g., Arista, Juniper, Cisco, NVIDIA/Mellanox, Cumulus/SONiC)
  • Experience with Next-Generation Firewalls (NGFW) (e.g., FortiGate, Juniper)
  • Experience with load balancers such as F5 or NetScaler
  • Are comfortable on the Linux command line, and have an understanding of the Linux networking stack
  • Strong automation skills (Python, Ansible) and network APIs

Nice To Have

  • Hands-on with HPC/AI networking: RoCEv2 and/or InfiniBand (Congestion Control, VLs, partitions), GPUDirect RDMA concepts.
  • Experience with DWDM technologies and SD-WAN
  • Understanding of data center power/space/cooling trade-offs and their impact on topology choices
  • Experience with Observability tools like Datadog, Splunk, Grafana, Prometheus
  • Experience automating network configuration within public clouds, with tools like Terraform
  • Have led implementation of production-scale SDNs in a cloud context (e.g. helped implement the infrastructure that powers an AWS VPC-like feature)
  • Deep understanding of the Linux networking stack and its interaction with network virtualization
  • Experience with SDN ecosystem (e.g. OVS, Neutron, DPDK, Cisco ACI or Nexus Fabric Controller, Arista CVP)

Salary Range Information

The annual salary range for this position has been set based on market data and other factors. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description.

About Lambda

  • Founded in 2012, ~400 employees (2025) and growing fast
  • We offer generous cash & equity compensation
  • Our investors include Andra Capital, SGW, Andrej Karpathy, ARK Invest, Fincadia Advisors, G Squared, In-Q-Tel (IQT), KHK & Partners, NVIDIA, Pegatron, Supermicro, Wistron, Wiwynn, US Innovative Technology, Gradient Ventures, Mercato Partners, SVB, 1517, Crescent Cove.
  • We are experiencing extremely high demand for our systems, with quarter-over-quarter and year-over-year profitability
  • Our research papers have been accepted into top machine learning and graphics conferences, including NeurIPS, ICCV, SIGGRAPH, and TOG
  • Health, dental, and vision coverage for you and your dependents
  • Wellness and Commuter stipends for select roles
  • 401k Plan with 2% company match (USA employees)
  • Flexible Paid Time Off Plan that we all actually use

A Final Note:

You do not need to match all of the listed expectations to apply for this position. We are committed to building a team with a variety of backgrounds, experiences, and skills.

Equal Opportunity Employer

Lambda is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.
