Senior Site Reliability Engineer - Networking

2 Months ago • 5 Years + • Network Engineering • $41,759,376 PA - $57,180,180 PA

Job Summary

Job Description

Lambda is seeking a Senior Site Reliability Engineer specializing in Networking to scale its high-performance cloud network. The role involves contributing to the automation of network configuration and deployments, implementing and operating Software Defined Networks (SDN), and managing Spine and Leaf networks. The engineer will ensure network availability through observability, failover, and redundancy, and maintain predictable client networking performance. Responsibilities also include deploying and managing network monitoring tools. This position requires a strong background in software development, site reliability engineering, or network reliability engineering with over 5 years of experience, and participation in production-scale networking projects.
Must have:
  • 5+ years of SWE, SRE, or Network Reliability Engineering experience
  • Experience in production-scale networking projects
  • Experience with on-call and incident response management
  • Experience building/maintaining SDN (OpenStack, Neutron, OVN)
  • Comfortable on Linux command line and networking stack
  • Experience with multi-datacenter and hybrid cloud networks
  • Python programming and Ansible experience
  • Experience with CI/CD tools and GitOps
  • Experience with Kubernetes application lifecycle
Good to have:
  • Operated production-scale SDNs in a cloud context
  • Software development experience in C, GO, Python
  • Experience automating network config in public clouds
  • Deep understanding of Linux networking stack and virtualization
  • Understanding of SDN ecosystem (OVS, Neutron, NSX, ACI)
  • Experience with Spine and Leaf (Clos) topology
  • Experience with BGP EVPN VXLAN networks
  • Experience with multi-datacenter networks, SD-WAN, DWDM
  • Experience with Next-Generation Firewalls
Perks:
  • Generous cash & equity compensation
  • Health, dental, and vision coverage
  • Wellness and commuter stipends
  • 401k Plan with 2% company match
  • Flexible Paid Time Off

Job Details

Lambda is the #1 GPU Cloud for ML/AI teams training, fine-tuning and inferencing AI models, where engineers can easily, securely and affordably build, test and deploy AI products at scale. Lambda’s product portfolio includes on-prem GPU systems, hosted GPUs across public & private clouds and managed inference services – servicing government, researchers, startups and Enterprises world-wide.


If you'd like to build the world's best deep learning cloud, join us. 


Engineering at Lambda is responsible for building and scaling our cloud offering. Our scope includes the Lambda website, cloud APIs and systems as well as internal tooling for system deployment, management and maintenance.

What You'll Do

  • Help scale Lambda’s high performance cloud network

  • Contribute to the reproducible automation of network configuration and deployments

  • Contribute to the implementation and operations of Software Defined Networks

  • Help to deploy and manage Spine and Leaf networks

  • Ensure high availability of our network through observability, failover, and redundancy

  • Ensure clients have predictable networking performance through the use of network engineering and other applicable technologies

  • Help with deploying and maintaining network monitoring and management tools

You

  • Have 5+ years of experience being SWE, SRE or Network Reliability Engineering

  • Been part of the implementation of production-scale networking projects

  • Experience being on-call and incident response management

  • Have experience building and maintaining Software Defined Networks (SDN), experience with OpenStack, Neutron, OVN

  • Are comfortable on the Linux command line, and have an understanding of the Linux networking stack

  • Have experience with multi-data center networks and hybrid cloud networks

  • Have Python programming experience and configuration management tools like Ansible

  • Have experience with CI/CD tools for deployment and GIT. Operated network environment with GitOps practices in place.

  • Experience with application lifecycle and deployments on Kubernetes

Nice To Have

  • Operated production-scale SDNs in a cloud context (e.g. helped implement or operate the infrastructure that powers an AWS VPC-like feature)

  • Have Software development experience with C, GO, Python

  • Experience automating network configuration within public clouds, with tools like kubentetes, HELM, Terraform, Ansible

  • Deep understanding of the Linux networking stack and its interaction with network virtualization, SR-IOV and DPDK

  • Understanding of the SDN ecosystem (e.g. OVS, Neutron, VMware NSX, Cisco ACI or Nexus Fabric Controller, Arista CVP)

  • Have experience with Spine and Leaf (Clos) network topology

  • Have experience and understanding of BGP EVPN VXLAN networks

  • Experience with building and maintaining multi-data center networks, SD-WAN, DWDM

  • Experience with Next-Generation Firewalls (NGFW)

Salary Range Information

The annual salary range for this position has been set based on market data and other factors. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description.

About Lambda

  • Founded in 2012, ~350 employees (2024) and growing fast

  • We offer generous cash & equity compensation

  • Our investors include Andra Capital, SGW, Andrej Karpathy, ARK Invest, Fincadia Advisors, G Squared, In-Q-Tel (IQT), KHK & Partners, NVIDIA, Pegatron, Supermicro, Wistron, Wiwynn, US Innovative Technology, Gradient Ventures, Mercato Partners, SVB, 1517, Crescent Cove.

  • We are experiencing extremely high demand for our systems, with quarter over quarter, year over year profitability

  • Our research papers have been accepted into top machine learning and graphics conferences, including NeurIPS, ICCV, SIGGRAPH, and TOG

  • Health, dental, and vision coverage for you and your dependents

  • Wellness and Commuter stipends for select roles

  • 401k Plan with 2% company match (USA employees)

  • Flexible Paid Time Off Plan that we all actually use

A Final Note:

You do not need to match all of the listed expectations to apply for this position. We are committed to building a team with a variety of backgrounds, experiences, and skills.

Equal Opportunity Employer

Lambda is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.

Similar Jobs

Highspot - Software Development Engineer II

Highspot

Hyderabad, Telangana, India (Hybrid)
2 Months ago
playrix  - SDET (Software Development Engineer in Test)

playrix

Ireland (Remote)
4 Months ago
Zazz - Artificial Intelligence Engineer

Zazz

(Remote)
6 Months ago
Thatgamecompany - Full Stack iOS Engineer

Thatgamecompany

Shanghai, Shanghai, China (On-Site)
4 Months ago
Toast - Senior Manager, GTM Engineering

Toast

United States (Remote)
2 Months ago
bytedance - Software Engineer Graduate (RDMA Network - High Speed Network)

bytedance

Seattle, Washington, United States (On-Site)
4 Months ago
Jane Street - Senior Network Engineer

Jane Street

Singapore (On-Site)
3 Months ago
Airbyte - Engineering Talent Network

Airbyte

San Francisco, California, United States (On-Site)
3 Months ago
Google - Software Engineer III, Infrastructure, Google Cloud Global Networking

Google

Sunnyvale, California, United States (On-Site)
4 Months ago
Cygames - Network Engineer

Cygames

Osaka, Osaka, Japan (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Kaedim - Software Engineer

Kaedim

Singapore (On-Site)
1 Year ago
Nice - Senior Software Engineer (Java, Angular)

Nice

Pune, Maharashtra, India (Hybrid)
1 Month ago
Scale AI - Software Engineer (Infrastructure)

Scale AI

Doha, Doha Municipality, Qatar (On-Site)
3 Months ago
Applied materials  - DevOps Support Engineer

Applied materials

Bengaluru, Karnataka, India (On-Site)
3 Months ago
Digital sun games - Unity Programmer

Digital sun games

Valencia, Valencian Community, Spain (On-Site)
3 Months ago
Autodesk - Principal Software Engineer, Data Streaming

Autodesk

San Francisco, California, United States (On-Site)
1 Year ago
Domo - DevOps Engineer - India

Domo

Pune, Maharashtra, India (Hybrid)
3 Weeks ago
Synechron - Senior Tech Support Engineer

Synechron

Singapore (On-Site)
2 Months ago
Kabam - Senior Software Engineer (1 Year Contract)

Kabam

Montreal, Quebec, Canada (Hybrid)
10 Months ago
endava - Senior Python Developer

endava

Kraków, Lesser Poland Voivodeship, Poland (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Jobs in London, England, United Kingdom

deel. - Account Executive, Deel IT, SMB | EMEA

deel.

United Kingdom (Remote)
3 Weeks ago
Behaviour Interactive - Principal Gameplay Programmer - Dead by Daylight | Programmeur·se jouabilité Principal·e - Dead by Daylight

Behaviour Interactive

Middlesbrough, England, United Kingdom (Hybrid)
10 Months ago
GoDaddy - Workday Engineer - SaaS Platform Systems Engineer

GoDaddy

London, England, United Kingdom (Remote)
3 Months ago
King - Global Corporate & Internal Communications Manager

King

London, England, United Kingdom (On-Site)
1 Month ago
TiMi Studio Group - TiMi Europe- Senior business development manager

TiMi Studio Group

London, England, United Kingdom (On-Site)
9 Months ago
fish in bottle  - 3D Game Artist

fish in bottle

England, United Kingdom (Hybrid)
5 Months ago
Level lr - Full Stack Engineer

Level lr

United Kingdom (Remote)
3 Months ago
Salesforce - Principal AI Architect

Salesforce

London, England, United Kingdom (On-Site)
1 Year ago
Cubic corporation - Field Services Technician

Cubic corporation

Kingswinford, England, United Kingdom (On-Site)
1 Year ago
TransUnion - Business Intelligence Analyst

TransUnion

Leeds, England, United Kingdom (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

Network Engineering Jobs

Activision - Senior Network Programmer

Activision

Santa Monica, California, United States (On-Site)
4 Weeks ago
bytedance - Network Engineer, High Performance GPU Network Direction

bytedance

Seattle, Washington, United States (On-Site)
3 Weeks ago
Accenture - S&C Global Network - Strategy - MC - Industry X - Digital Engineering R&D - Consultant

Accenture

Pune, Maharashtra, India (On-Site)
1 Month ago
NVIDIA - Senior Networking Security Research Architect

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
7 Months ago
Daybreak Game Company LLC - Network Engineer

Daybreak Game Company LLC

San Diego, California, United States (Hybrid)
9 Months ago
Jane Street - Senior Network Engineer

Jane Street

Singapore (On-Site)
3 Months ago
playrix  - Senior Node.js Developer (Server)

playrix

Ireland (Remote)
6 Months ago
bytedance - Software Engineer (Payment Network) - Global Payment - Singapore

bytedance

Singapore (On-Site)
9 Months ago
bytedance - Network Engineer Graduate (Tech Infra - IaaS) - 2025 Start (PhD)

bytedance

Seattle, Washington, United States (On-Site)
9 Months ago
Thousand Eyes - Site Reliability Engineering Technical Leader, Network Assurance Data Platform

Thousand Eyes

Bengaluru, Karnataka, India (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Atlanta, Georgia, United States (On-Site)

San Francisco, California, United States (Hybrid)

San Jose, California, United States (Hybrid)

San Francisco, California, United States (Hybrid)

San Francisco, California, United States (Hybrid)

San Jose, California, United States (Hybrid)

San Jose, California, United States (Hybrid)

San Francisco, California, United States (Hybrid)

San Francisco, California, United States (Hybrid)

San Jose, California, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by Lambda

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug