Staff Site Reliability Engineer

1 Month ago • 8 Years + • Devops • $145,000 PA - $185,000 PA

Job Summary

Job Description

Aerospike is seeking a Staff Site Reliability Engineer to provide technical leadership within their global SRE organization. This role involves driving reliability, performance, and scalability across hybrid and multi-cloud environments. The engineer will mentor others, design resilient systems, and champion modern SRE practices. Key responsibilities include leading infrastructure efforts, defining reliability standards using SLIs/SLOs, and partnering with product and engineering teams. The role also involves participating in on-call rotations and implementing automation-first approaches using IaC and CI/CD pipelines. The company aims to unleash the power of real-time data with a database built for infinite scale, speed, and sustainability, powering critical applications for innovative organizations.
Must have:
  • 8+ years in SRE, DevOps, or infra engineering
  • Experience with major public cloud (AWS, GCP, Azure)
  • Production experience with Kubernetes
  • Proficiency in IaC tools (Terraform)
  • Expertise in observability tools (Datadog, Prometheus)
  • Programming/scripting in Python, Go, or Bash
  • Experience with incident response
  • Proven ability to mentor engineers
  • Strong communication skills
Good to have:
  • Azure experience is a plus
  • DataDog experience is a plus
  • Familiarity with Aerospike or other distributed databases
  • Kubernetes or cloud certifications
  • Experience managing/optimizing database deployments

Job Details

At Aerospike, we dream big and deliver even bigger. Our mission is to unleash the power of the world’s real-time data with a database built for infinite scale, speed, and sustainability.

We empower companies to tackle seemingly insurmountable challenges and achieve what’s never been done before. That’s why we developed the world’s leading real-time database—powering mission-critical applications for the most innovative, category-disrupting organizations.

Aerospike enables extreme-scale, real-time applications that:

  • Fight fraud in microseconds.
  • Drive dramatic increases in shopping cart size.
  • Power global digital payments.
  • Deliver hyper-personalized user experiences to tens of millions.

Industry leaders like Airtel, Experian, Nielsen, PayPal, Snap, Verizon Media, and Wayfair trust Aerospike as the foundation for their future. They rely on us to act in the moments that matter.

Headquartered in Mountain View, California, with offices in London, Bangalore, and Tel Aviv, Aerospike is the uncontested leader in next-generation, always-on, hyperscale data solutions. Unlike legacy NoSQL systems, our patented Hybrid Memory Architecture unlocks today’s hardware to deliver unimaginable performance and value for the most demanding data workloads—from the edge, to the core, to the cloud.

If you're ready to shape the future of data, join us.

Staff Site Reliability Engineer

As a Staff Site Reliability Engineer at Aerospike, you’ll be a technical leader within our global SRE organization, helping drive reliability, performance, and scalability across our hybrid and multi-cloud environments. You’ll bring deep operational experience and lead by example—mentoring others, designing resilient systems, and championing modern SRE practices across new and legacy platforms.

You’ll play a key role in shaping the direction of our infrastructure initiatives, from Kubernetes-based platforms like AKS and the Aerospike Kubernetes Operator to existing services in AWS and GCP. Your impact will span teams and systems as you solve complex problems, influence architecture, and foster a culture of ownership, resilience, and continuous improvement.

Key Responsibilities

  • Provide technical leadership across multiple systems and environments, proactively identifying risks, shaping architecture decisions, and improving reliability and performance at scale.
  • Lead key infrastructure efforts including Kubernetes platform expansion (AKS, AKO), and application of SRE principles to legacy systems and new cloud offerings.
  • Define, measure, and enforce reliability standards through SLIs/SLOs, observability tooling, and incident response frameworks.
  • Mentor and guide other SREs by leading design sessions, conducting technical deep dives, and reviewing code, configurations, and infrastructure decisions.
  • Partner with product, engineering, and cloud teams to align reliability goals with delivery objectives.
  • Lead root cause analyses and implement systemic fixes for issues spanning multiple platforms or services.
  • Drive automation-first approaches using IaC, CI/CD pipelines, and scripting to reduce toil and increase deployment confidence.
  • Influence cross-functional roadmaps, identifying areas for innovation, technical debt reduction, and long-term scalability.
  • Participate in the global on-call rotation, bringing senior-level calm and clarity during incidents and escalations.

Required Experience

  • 8+ years of experience in SRE, DevOps, or infrastructure engineering, including significant time operating production systems at scale.
  • Deep hands-on experience with at least one major public cloud (AWS, GCP, Azure), and working knowledge of the others; Azure experience is a plus.
  • Production experience with Kubernetes, including operating clusters, Helm, operators, and supporting microservices in real-world environments.
  • Strong proficiency in infrastructure-as-code tools such as Terraform and CI/CD automation platforms.
  • Expertise in observability tools and practices (Datadog, Prometheus, Grafana, ELK, etc.) and using them to define SLIs and SLOs.; DataDog experience is a plus
  • Programming and scripting ability in one or more languages (Python, Go, Bash, etc.).
  • Experience with large-scale incident response and post-incident review practices.
  • Proven ability to mentor other engineers and influence technical strategy across multiple teams.
  • Strong communication skills to articulate complex concepts to technical and non-technical stakeholders.

Preferred Skills and Qualifications

  • Hands-on experience managing and optimizing database deployments and services in production environments, ensuring high availability and performance.
  • Familiarity with Aerospike or other distributed databases is a plus.
  • Kubernetes or cloud certifications (CKA, CKS, AWS/GCP DevOps/Architect) a plus but not require
  • Track record of influencing architectural decisions across teams or domains.

Aerospike is an Equal Opportunity Employer. We are committed to providing an environment free from discrimination on the basis of race, religion, color, sex, gender identity, sexual orientation, age, non-disqualifying physical or mental disability, national origin, veteran status, or any other basis covered by appropriate law.

Join us at Aerospike and be part of a dynamic team that is shaping the future of data management. Salary Range for California Based Applicants: [$145,000 - $185,000] (actual compensation will be determined based on experience, location, and other factors permitted by law).

 

 

Similar Jobs

Ziff Davis - IT Support Engineer

Ziff Davis

Denver, Colorado, United States (Remote)
1 Month ago
Figma - Data Engineer

Figma

United States (Remote)
2 Weeks ago
Zenoti - Lead ETL/Data Engineering

Zenoti

Hyderabad, Telangana, India (On-Site)
1 Week ago
Salesforce - Territory Account Executive - SMB

Salesforce

Mexico City, Mexico (On-Site)
8 Months ago
kuda  - Lead Data Engineer

kuda

Cape Town, Western Cape, South Africa (Remote)
1 Week ago
Synechron - API Automation Engineer (Java/Python)

Synechron

Bengaluru, Karnataka, India (On-Site)
1 Year ago
Ansys - Senior R&D Engineer (Cloud Platform Developer)

Ansys

Canonsburg, Pennsylvania, United States (On-Site)
2 Months ago
Halcyon - Commercial Solutions Architect

Halcyon

(Remote)
3 Weeks ago
Zscaler - Senior Staff Software Development Engineer - API, Cloud

Zscaler

Bengaluru, Karnataka, India (Hybrid)
2 Months ago
Capgemini - Devops

Capgemini

Bengaluru, Karnataka, India (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

CyberArk - Principal Program Manager - AI Excellence

CyberArk

Israel (Hybrid)
2 Months ago
ARVORE Immersive Experiences - Concept Artist

ARVORE Immersive Experiences

State Of São Paulo, Brazil (Remote)
1 Month ago
Roblox - Principal Software Engineer, Virtual Economy Optimization

Roblox

San Mateo, California, United States (On-Site)
1 Month ago
Toast - Product Counsel, Toast Payroll & HR Suite

Toast

Boston, Massachusetts, United States (Hybrid)
1 Month ago
Survay Monkey - Senior Product Designer

Survay Monkey

Ottawa, Ontario, Canada (Remote)
1 Month ago
Mozilla - Staff Software Engineer, Identity and Access Management

Mozilla

Canada (Remote)
1 Month ago
Apple - Inductive Engineering Program Manager

Apple

Cupertino, California, United States (On-Site)
2 Months ago
Alpha Sense - Senior Director of Strategic Initiatives, Global Markets

Alpha Sense

London, England, United Kingdom (Remote)
2 Months ago
WebMD - Director of Affiliate Marketing

WebMD

El Segundo, California, United States (On-Site)
4 Months ago
Palo Alto Networks - Consulting Director - Security Operations - Proactive Services (Unit 42)

Palo Alto Networks

Netherlands (Remote)
2 Months ago

Get notifed when new similar jobs are uploaded

Jobs in United States

GoDaddy - Technical Product Manager - PKI and Cryptographic Compliance

GoDaddy

United States (Remote)
2 Weeks ago
Square - Technical Consultant

Square

Orlando, Florida, United States (Remote)
1 Week ago
Apple - Embedded Software Engineer

Apple

Cupertino, California, United States (On-Site)
1 Month ago
Sierra - Technical Product Marketer

Sierra

San Francisco, California, United States (On-Site)
7 Months ago
Roblox - Software Engineer, Reliability

Roblox

San Mateo, California, United States (On-Site)
2 Weeks ago
GameChanger - Product Design Manager, Monetization

GameChanger

United States (Remote)
4 Months ago
Stord - Customer Experience Associate

Stord

North Haven, Connecticut, United States (On-Site)
3 Weeks ago
Nagarro - Senior Compliance (GSEDD) Analyst

Nagarro

United States (Remote)
1 Month ago
Next Level Business Services - Visual Analytics Architect

Next Level Business Services

Atlanta, Georgia, United States (On-Site)
9 Years ago
Nice - Enterprise Account Executive

Nice

United States (Remote)
1 Month ago

Get notifed when new similar jobs are uploaded

Devops Jobs

C3 IoT - Site Reliability Engineer – Field Operations

C3 IoT

London, England, United Kingdom (On-Site)
3 Weeks ago
Poppulo - Senior DevOps Engineer

Poppulo

Bengaluru, Karnataka, India (Hybrid)
1 Month ago
Addepar - Senior Fullstack Software Engineer - Partner Platform

Addepar

United States (Remote)
1 Month ago
bytedance - Software Engineer - Compute Infrastructure (Orchestration & Scheduling)

bytedance

San Jose, California, United States (On-Site)
3 Months ago
Moonton  - Mid-platform Backend Framework Engineer Urgently Hiring

Moonton

Shanghai, Shanghai, China (On-Site)
1 Month ago
Google - Site Reliability Manager, Platforms and Devices, SRE

Google

Bengaluru, Karnataka, India (On-Site)
3 Months ago
GameChanger - Senior Full Stack Software Engineer, Video Platform

GameChanger

New York, New York, United States (Remote)
4 Months ago
Nagarro - Senior Cloud Developer/Architect

Nagarro

Germany (Remote)
5 Months ago
Tesla - Distributed Systems Engineer, Autobidder Platform

Tesla

North Holland, Netherlands (On-Site)
6 Months ago
Jane Street - Cross-Platform Software Engineer

Jane Street

New York, United States (Hybrid)
2 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Headquartered in Mountain View, California, Aerospike also has a global presence with offices in London, Bangalore, and Tel Aviv. Aerospike does not accept resumes from staffing agencies with which we do not have a written agreement and specific engagement for a particular opening. Our employment activities, inquiries, and offers are managed through our HR/Talent department, and all candidates are presented through this channel only. We do not accept unsolicited resumes.

Bengaluru, Karnataka, India (On-Site)

Mountain View, California, United States (On-Site)

Bengaluru, Karnataka, India (Hybrid)

Bengaluru, Karnataka, India (Hybrid)

Bengaluru, Karnataka, India (Hybrid)

Austin, Texas, United States (On-Site)

Bengaluru, Karnataka, India (Hybrid)

Bengaluru, Karnataka, India (Hybrid)

Mountain View, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by AeroSpike