Data Center Operations Engineer

All levels • Data Analysis • $89,000 PA - $134,000 PA

Job Details

We're here to help the smartest minds on the planet build Superintelligence. The labs pushing the edge? They run on Lambda. Our gear trains and serves their models, our infrastructure scales with them, and we move fast to keep up. If you want to work on massive, world-changing AI deployments with people who love action and hard problems, we're the place to be.

If you'd like to build the world's best deep learning cloud, join us.

*Note: This position requires presence in our Atlanta, GA Data Center 5 days per week.

The Operations team plays a critical role in ensuring the seamless end-to-end execution of our AI-IaaS infrastructure and hardware. This team is responsible for sourcing all necessary infrastructure and components, overseeing day-to-day data center operations to maintain optimal performance and uptime, and driving cross-company coordination through the product management organization to align operational capabilities with strategic goals. By managing the full lifecycle from procurement to deployment and operational efficiency, the Operations team ensures that our AI-driven infrastructure is reliable, scalable, and aligned with business priorities.

What You'll Do

  • Ensure new server, storage and network infrastructure is properly racked, labeled, cabled, and configured
  • Document data center layout and network topology in DCIM software
  • Work with supply chain & manufacturing teams to ensure timely deployment of systems and project plans for large-scale deployments
  • Participate in data center capacity and roadmap planning with sales and customer success teams to allocate floorspace
  • Assess current and future state data center requirements based on growth plans and technology trends
  • Manage a parts depot inventory and track equipment through the delivery-store-stage-deploy-handoff process in each of our data centers
  • Work closely with HW Support team to ensure data center infrastructure-related support tickets are resolved
  • Work with RMA team to ensure faulty parts are returned and replacements are ordered
  • Create installation standards and documentation for placement, labeling, and cabling to drive consistency and discoverability across all data centers
  • Serve as a subject-matter expert on data center deployments as part of sales engagement for large-scale deployments in our data centers and at customer sites

You

  • Have experience with critical infrastructure systems supporting data centers, such as power distribution, air flow management, environmental monitoring, capacity planning, DCIM software, structured cabling, and cable management
  • Have strong Linux administration experience
  • Have experience in setting up networking appliances (Ethernet and InfiniBand) across multiple data center locations
  • Are action-oriented and have a strong willingness to learn
  • Are willing to travel for the bring-up of new data center locations

Nice to Have

  • Experience with troubleshooting the following network layers, technologies, and system protocols: TCP/IP, UDP/IP, BGP, OSPF, SNMP, SSL, HTTP, FTP, SSH, Syslog, DHCP, DNS, RDP, NetBIOS, IP routing, Ethernet, switched Ethernet, 802.11x, NFS, and VLANs
  • Experience with working in large-scale distributed data center environments
  • Experience working with auditors to meet all compliance requirements (ISO/SOC)

Salary Range Information

The annual salary range for this position has been set based on market data and other factors. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description.

About Lambda

  • Founded in 2012, ~400 employees (2025) and growing fast
  • We offer generous cash & equity compensation
  • Our investors include Andra Capital, SGW, Andrej Karpathy, ARK Invest, Fincadia Advisors, G Squared, In-Q-Tel (IQT), KHK & Partners, NVIDIA, Pegatron, Supermicro, Wistron, Wiwynn, US Innovative Technology, Gradient Ventures, Mercato Partners, SVB, 1517, Crescent Cove.
  • We are experiencing extremely high demand for our systems, with quarter-over-quarter, year-over-year profitability
  • Our research papers have been accepted into top machine learning and graphics conferences, including NeurIPS, ICCV, SIGGRAPH, and TOG
  • Health, dental, and vision coverage for you and your dependents
  • Wellness and Commuter stipends for select roles
  • 401k Plan with 2% company match (USA employees)
  • Flexible Paid Time Off Plan that we all actually use

A Final Note:

You do not need to match all of the listed expectations to apply for this position. We are committed to building a team with a variety of backgrounds, experiences, and skills.

Equal Opportunity Employer

Lambda is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.
