Senior Software Engineer, Developer Productivity
Lambda
Job Summary
Lambda is seeking a Senior Software Engineer for the Shared Foundations Platform team to enhance engineering productivity. This role involves designing, developing, and operating a shared platform that powers CI/CD workflows, documentation systems, API standards, and shared tooling. You will partner with product and infrastructure teams to deliver reusable code foundations, incorporate service reliability practices, and provide technical leadership on operational practices. The position requires a focus on platform engineering and developer experience, ensuring high availability and maintainable architecture.
Must Have
- Own design, development, and operation of the shared foundations platform.
- Work with engineering teams to refactor common needs into the shared foundation.
- Incorporate reusable service reliability practices like retries, circuit breakers, and observability instrumentation.
- Deliver technical leadership on CI/CD workflows, documentation systems, API standards, and operational practices.
- Build and maintain reusable GitHub Actions workflows, shared libraries, and developer tooling.
- Establish and steward documentation standards, information architecture, and freshness policies.
- Partner with infrastructure, product engineering, and DevOps teams on roadmap priorities.
- Drive API versioning, contract-change management, and cross-team communication patterns.
- Lead working groups on CI/CD, Python/Go tooling, and design system standards.
- Mentor engineers, run design reviews, and help teams adopt bottom-up standards.
- Monitor and improve KPIs including CI workflow adoption, docs freshness, time-to-green for new services, and PR-to-prod velocity.
Good to Have
- Experience designing and operating documentation systems at scale (e.g., Notion, MkDocs, Confluence) with automated freshness tracking.
- Prior work building reusable CI/CD workflows, GitHub Actions, or shared developer tooling.
- Experience establishing API versioning standards, contract-change management, or cross-team integration patterns.
- Familiarity with Python packaging tools (uv, Poetry, pip-tools) and modern dependency management practices.
- Experience leading technical working groups, driving consensus on standards, and managing exception processes.
- Background in developer experience (DevEx) or platform engineering with measurable impact on team productivity metrics.
Perks & Benefits
- Generous cash & equity compensation
- Health, dental, and vision coverage for you and your dependents
- Wellness and Commuter stipends for select roles
- 401k Plan with 2% company match (USA employees)
- Flexible Paid Time Off Plan
Job Description
Lambda, The Superintelligence Cloud, builds Gigawatt-scale AI Factories for Training and Inference. Lambda’s mission is to make compute as ubiquitous as electricity and give every person access to artificial intelligence. One person, one GPU.
If you'd like to build the world's best deep learning cloud, join us.
*Note: This position requires presence in our San Francisco/San Jose/Seattle office location 4 days per week; Lambda’s designated work from home day is currently Tuesday.
About Lambda / About the Team
In 2012, Lambda started with a crew of AI engineers publishing research at top machine-learning conferences. Today we're building the world's top AI computing platform — fast, secure, affordable, and built to scale. The Shared Foundations Platform team builds the developer experience software infrastructure that powers engineering productivity across Lambda — including CI/CD workflows, documentation systems, API standards, and shared tooling that help product and infrastructure teams ship faster and more reliably.
Role Summary
We're hiring a Senior Software Engineer to design and build the shared foundations platform that powers engineering productivity across Lambda. You'll partner with product and infrastructure engineering teams to deliver code foundations used by our services. These foundations will provide crosscutting functionality, enabling industry-standard quality and availability practices and accelerating service delivery. You will also contribute to reliable CI/CD workflows, maintainable documentation systems, and internal developer tooling.
What You’ll Do
- Own design, development, and operation of the shared foundations platform, ensuring high availability, developer productivity, and maintainable architecture.
- Work with engineering teams to identify opportunities to refactor common needs into the shared foundation.
- Incorporate reusable service reliability practices like retries, circuit breakers, and observability instrumentation.
- Deliver technical leadership on CI/CD workflows, documentation systems, API standards, and operational practices across engineering teams.
- Build and maintain reusable GitHub Actions workflows, shared libraries, and developer tooling that accelerate delivery.
- Establish and steward documentation standards, information architecture, and freshness policies to ensure engineering docs remain discoverable and up-to-date.
- Partner with infrastructure, product engineering, and DevOps teams on roadmap priorities, technical tradeoffs, and adoption strategies.
- Drive API versioning, contract-change management, and cross-team communication patterns to reduce integration friction.
- Lead working groups on CI/CD, Python/Go tooling, and design system standards, facilitating consensus and adoption.
- Mentor engineers, run design reviews, and help teams adopt bottom-up standards with clear exception processes.
- Monitor and improve KPIs including CI workflow adoption, docs freshness, time-to-green for new services, and PR-to-prod velocity.
(This role emphasizes platform engineering, developer experience, and cross-functional coordination rather than on-call operations.)
You
- 5+ years total software engineering experience with 2+ years building developer platforms, CI/CD systems, or infrastructure tooling.
- Strong programming experience in Python and/or Go, with expertise in system-level debugging, testing, and building maintainable APIs.
- Demonstrated experience designing and operating production services at scale, with focus on reliability, observability, and developer experience.
- Deep knowledge of CI/CD systems (GitHub Actions, ArgoCD), containerization (Kubernetes), and infrastructure-as-code (Terraform).
- Proven ability to establish technical standards, drive adoption across engineering teams, and mentor other engineers.
- Excellent written and verbal communication skills, with experience facilitating working groups and building consensus on technical decisions.
Nice to Have
- Experience designing and operating documentation systems at scale (e.g., Notion, MkDocs, Confluence) with automated freshness tracking.
- Prior work building reusable CI/CD workflows, GitHub Actions, or shared developer tooling adopted across multiple teams.
- Experience establishing API versioning standards, contract-change management, or cross-team integration patterns.
- Familiarity with Python packaging tools (uv, Poetry, pip-tools) and modern dependency management practices.
- Experience leading technical working groups, driving consensus on standards, and managing exception processes.
- Background in developer experience (DevEx) or platform engineering with measurable impact on team productivity metrics.
Salary Range Information
The annual salary range for this position has been set based on market data and other factors. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description.
About Lambda
- Founded in 2012, ~400 employees (2025) and growing fast
- We offer generous cash & equity compensation
- Our investors include Andra Capital, SGW, Andrej Karpathy, ARK Invest, Fincadia Advisors, G Squared, In-Q-Tel (IQT), KHK & Partners, NVIDIA, Pegatron, Supermicro, Wistron, Wiwynn, US Innovative Technology, Gradient Ventures, Mercato Partners, SVB, 1517, Crescent Cove.
- We are experiencing extremely high demand for our systems, with quarter over quarter, year over year profitability
- Our research papers have been accepted into top machine learning and graphics conferences, including NeurIPS, ICCV, SIGGRAPH, and TOG
- Health, dental, and vision coverage for you and your dependents
- Wellness and Commuter stipends for select roles
- 401k Plan with 2% company match (USA employees)
- Flexible Paid Time Off Plan that we all actually use
A Final Note:
You do not need to match all of the listed expectations to apply for this position. We are committed to building a team with a variety of backgrounds, experiences, and skills.
Equal Opportunity Employer
Lambda is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.