Staff Storage Systems Architect

1 Week ago • 7 Years + • System Design • $349,000 PA - $465,000 PA

Job Summary

Job Description

Lambda is seeking a Staff Storage Systems Architect to design, plan, test, and implement large-scale distributed storage systems for AI workloads. This role is crucial in defining the company's storage infrastructure strategy, ensuring scalability, reliability, and efficiency. The architect will collaborate with engineering teams, infrastructure operations, and product stakeholders to align storage solutions with business objectives. Responsibilities include optimizing distributed storage for AI, developing standards and best practices, evaluating storage technologies, integrating solutions with cloud offerings, planning capacity, and driving operational excellence with high availability and data integrity. The position also involves mentoring storage and infrastructure engineers.
Must have:
  • Architect distributed storage for AI workloads.
  • Develop storage system standards and best practices.
  • Evaluate and benchmark storage technologies for AI.
  • Collaborate with engineering on storage integration.
  • Define storage capacity plans and roadmaps.
  • Ensure high availability and data integrity.
  • Provide technical leadership to engineers.
  • 7+ years designing distributed storage systems.
  • Deep expertise with Ceph, Lustre, or similar.
  • Knowledge of underlying hardware systems.
  • Understanding of storage architectures and optimization.
  • Experience with high-performance storage.
  • Familiarity with object, block, and file protocols.
  • Ability to resolve complex technical issues.
  • Excellent communication skills.
Good to have:
  • Design and manage enterprise and open-source storage.
  • Experience with cloud storage solutions (AWS S3, Azure Blob, GCS).
  • Familiarity with container storage and orchestration.
  • Manage storage in large-scale cloud environments.
  • Exposure to data security and compliance.
Perks:
  • Generous cash & equity compensation
  • Health, dental, and vision coverage
  • Wellness and Commuter stipends
  • 401k Plan with 2% company match
  • Flexible Paid Time Off Plan

Job Details

Lambda is the #1 GPU Cloud for ML/AI teams training, fine-tuning and inferencing AI models, where engineers can easily, securely and affordably build, test and deploy AI products at scale. Lambda’s product portfolio includes on-prem GPU systems, hosted GPUs across public & private clouds and managed inference services – servicing government, researchers, startups and Enterprises world-wide.


If you'd like to build the world's best deep learning cloud, join us. 

*Note: This position requires presence in our San Francisco office location 4 days per week; Lambda’s designated work from home day is currently Tuesday.


Engineering at Lambda is responsible for building and scaling our cloud offering. Our scope includes the Lambda website, cloud APIs and systems as well as internal tooling for system deployment, management and maintenance.

We're looking for a Storage Systems Architect experienced in designing, planning, testing, and implementing large-scale distributed storage systems. You will play a critical role in defining our storage infrastructure strategy, driving technical solutions that ensure scalability, reliability, and efficiency for our growing business and operational needs.

You will collaborate closely with engineering teams, infrastructure operations, and product stakeholders to ensure our storage solutions align with company objectives and technical requirements.

What You'll Do:

  • Architect, design, and implement distributed storage solutions optimized for AI workloads.

  • Drive the development of storage system standards and best practices to ensure consistency and reliability across infrastructure.

  • Evaluate and benchmark storage technologies to meet demanding AI performance requirements.

  • Collaborate with engineering teams to integrate storage solutions with cloud product offerings.

  • Define and maintain storage capacity plans, performance metrics, and scalability roadmaps.

  • Drive operational excellence, ensuring high availability, disaster recovery, and data integrity.

  • Provide mentorship and technical leadership to storage and infrastructure engineers.

About You:

  • Proven experience (7+ years) designing and implementing distributed storage systems.

  • Deep expertise with distributed storage technologies (e.g., Ceph, Lustre, or similar).

  • In-depth knowledge of underlying hardware systems supporting storage deployments.

  • Strong understanding of storage architectures, data replication, redundancy strategies, and performance optimization techniques.

  • Experience working with high-performance and high-availability storage solutions.

  • Familiarity with object, block, and file storage protocols.

  • Ability to identify, analyze, and resolve complex technical issues.

  • Excellent communication skills, capable of clearly articulating technical concepts to diverse stakeholders.

Nice to Have:

  • Expertise in designing, implementing, and managing both enterprise-grade and open-source storage systems.

  • Experience with cloud storage solutions (AWS S3, Azure Blob Storage, Google Cloud Storage).

  • Familiarity with container storage solutions and orchestration.

  • Experience managing storage in large-scale cloud environments.

  • Exposure to data security, compliance, and regulatory requirements (e.g., GDPR, HIPAA).

Salary Range Information

Based on market data and other factors, the annual salary range for this position is $349,000-$465,000. However, a salary higher or lower than this range may be appropriate for a candidate whose qualifications differ meaningfully from those listed in the job description.

About Lambda

  • Founded in 2012, ~350 employees (2024) and growing fast

  • We offer generous cash & equity compensation

  • Our investors include Andra Capital, SGW, Andrej Karpathy, ARK Invest, Fincadia Advisors, G Squared, In-Q-Tel (IQT), KHK & Partners, NVIDIA, Pegatron, Supermicro, Wistron, Wiwynn, US Innovative Technology, Gradient Ventures, Mercato Partners, SVB, 1517, Crescent Cove.

  • We are experiencing extremely high demand for our systems, with quarter over quarter, year over year profitability

  • Our research papers have been accepted into top machine learning and graphics conferences, including NeurIPS, ICCV, SIGGRAPH, and TOG

  • Health, dental, and vision coverage for you and your dependents

  • Wellness and Commuter stipends for select roles

  • 401k Plan with 2% company match (USA employees)

  • Flexible Paid Time Off Plan that we all actually use

A Final Note:

You do not need to match all of the listed expectations to apply for this position. We are committed to building a team with a variety of backgrounds, experiences, and skills.

Equal Opportunity Employer

Lambda is an Equal Opportunity employer. Applicants are considered without regard to race, color, religion, creed, national origin, age, sex, gender, marital status, sexual orientation and identity, genetic information, veteran status, citizenship, or any other factors prohibited by local, state, or federal law.

Similar Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Similar Skill Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Jobs in San Francisco, California, United States

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

System Design Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

About The Company

San Francisco, California, United States (Hybrid)

San Francisco, California, United States (Hybrid)

San Jose, California, United States (Hybrid)

San Jose, California, United States (Hybrid)

San Jose, California, United States (Hybrid)

San Jose, California, United States (Hybrid)

San Jose, California, United States (Hybrid)

San Francisco, California, United States (Hybrid)

San Francisco, California, United States (Hybrid)

San Francisco, California, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by Lambda

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug