Engineering Program Manager - Fleet Engineering

Lambda

Job Summary

The Fleet Engineering team is responsible for the logical deployment of cutting-edge NVIDIA GPU clusters and ensuring the reliability of the production fleet. As an Engineering Program Manager, you will coordinate collaboratively across cross-functional teams to deliver new GPU capacity on time and at 100% quality. This involves managing efforts, communicating progress, actively managing risks and prioritization, and working with Product and Infrastructure engineering teams to improve transparency, metrics, automation, and overall efficiency.

Must Have

  • Partner with Fleet Engineering Managers to align on expectations and track deliverables.
  • Identify opportunities for continuous improvement in programs.
  • Execute against tight deadlines while improving process, tooling, and automation.
  • Collaborate with Platform & Infrastructure engineering, Program Management, Product Management, DC Operations, and finance.
  • Lead cross-functional engineering teams to deliver complex infrastructure projects.
  • Demonstrate technical expertise in infrastructure technologies like NVIDIA GPUs, hardware troubleshooting, and automation.
  • Drive risk management and stakeholder communication.
  • Refine project management processes for efficiency and collaboration.
  • 10+ years of infrastructure experience with 5+ years in program management for major projects.
  • Experience leading engineers on complex, cross-functional projects.
  • Comfortable managing cross-functional teams and driving decisions.
  • Experience designing and implementing scalable processes.
  • Bachelor's degree in Computer Science, Engineering, or related technical field.
  • Proven track record of delivering complex technical projects.
  • Exceptional leadership, communication, and interpersonal skills.
  • Ability to manage multiple projects simultaneously in a fast-paced environment.

Good to Have

  • Experience managing hybrid hardware deployment and software engineering projects.
  • Experience in a hyperscaler (CSP), neocloud provider (NCP), or high-performance computing (HPC) production environments.
  • Worked closely with product managers to deliver products to specification.
  • Deep understanding of infrastructure technologies and software development best practices.

Perks & Benefits

  • Generous cash & equity compensation
  • Health, dental, and vision coverage for you and your dependents
  • Wellness and Commuter stipends for select roles
  • 401k Plan with 2% company match (USA employees)
  • Flexible Paid Time Off Plan

Job Description

About the Team

The Fleet Engineering team is responsible for the logical deployment of cutting edge NVIDIA GPU clusters, the reliability of the production fleet, and the tools and processes to support these outcomes.

About the Role

Reporting to the Director of Fleet Engineering, your role as an Engineering Program Manager is to coordinate collaboratively across a set of cross functional teams to ensure we deliver new GPU capacity on time and at 100% quality. You will be responsible for managing and coordinating the efforts of multiple teams, communicating progress and actively managing risks and prioritization. You will work collaboratively with Product and Infrastructure engineering teams to improve transparency, metrics, automation and overall efficiency for the team.

We value diverse backgrounds, experiences, and skills, and we are excited to hear from candidates who can bring unique perspectives to our team. If you do not exactly meet this description but believe you may be a good fit, please still apply and help us understand your readiness for this Manager role. Your application is not a waste of our time.

What You’ll Do

  • Partner with Fleet Engineering Managers to ensure the teams are aligned on expectations, track progress towards deliverables, providing repeatable & scalable programs.
  • Identify opportunities for improvement: ensuring we are capturing the appropriate signals throughout the program and facilitating continuous improvement.
  • Work with Fleet Engineering Deployments on executing against tight deadlines while improving process, tooling, automation.
  • Collaborate closely with a broad set of stakeholders, including Platform & Infrastructure engineering, Program Management, Product Management, DC Operations, and finance
  • Lead cross-functional engineering teams to deliver complex infrastructure projects from concept to deployment. Define scope, goals, and deliverables; plan resources, timelines, risks and ensure execution aligns with organizational objectives.
  • Demonstrate technical expertise in infrastructure technologies, including NVIDIA GPUs, hardware troubleshooting, lab methodologies, and automation tools.
  • Drive risk management and stakeholder communication by proactively identifying issues, driving realtime and inflight tight timeline projects, and providing transparent updates on progress and milestones.
  • Continuously refine project management processes to improve efficiency, collaboration, and cross-functional alignment with product, operations, and security teams. Maintain a customer-focused approach in defining and meeting technical requirements.

You

  • 10+ years of infrastructure experience with 5+ years performing program management for major projects including capital projects or hyperscaler infrastructure deployment
  • Demonstrated experience leading a team of engineers on complex, cross-functional projects in a fast-paced environment.
  • Comfortable managing cross functional teams and driving decisions and communications
  • Experience successfully designing and implementing simple, scalable processes that solve complex problems.
  • Thrive in ambiguous, fast-paced environments, You bring clarity and order to the rest of the team.
  • Bachelor's degree in Computer Science, Engineering, or a related technical field.
  • Proven track record of successfully leading and delivering complex technical projects.
  • Exceptional leadership, communication, and interpersonal skills.
  • Ability to thrive in a fast-paced, high-pressure environment and manage multiple projects simultaneously.

Nice to Have

  • Experience managing hybrid hardware deployment and software engineering projects.
  • Experience in a hyperscaler (CSP), neocloud provider (NCP), or high-performance computing (HPC) production environments.
  • Worked closely with product managers to deliver products to specification.
  • Deep understanding of infrastructure technologies and software development best practices.

Salary Range Information

The annual salary range for this position has been set based on market data and other factors. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description.

About Lambda

  • Founded in 2012, ~400 employees (2025) and growing fast
  • We offer generous cash & equity compensation
  • Our investors include Andra Capital, SGW, Andrej Karpathy, ARK Invest, Fincadia Advisors, G Squared, In-Q-Tel (IQT), KHK & Partners, NVIDIA, Pegatron, Supermicro, Wistron, Wiwynn, US Innovative Technology, Gradient Ventures, Mercato Partners, SVB, 1517, Crescent Cove.
  • We are experiencing extremely high demand for our systems, with quarter over quarter, year over year profitability
  • Our research papers have been accepted into top machine learning and graphics conferences, including NeurIPS, ICCV, SIGGRAPH, and TOG
  • Health, dental, and vision coverage for you and your dependents
  • Wellness and Commuter stipends for select roles
  • 401k Plan with 2% company match (USA employees)
  • Flexible Paid Time Off Plan that we all actually use

A Final Note:

You do not need to match all of the listed expectations to apply for this position. We are committed to building a team with a variety of backgrounds, experiences, and skills.

Equal Opportunity Employer

Lambda is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.

6 Skills Required For This Role

Cross Functional Communication Problem Solving Risk Management Game Texts Machine Learning

Similar Jobs