Senior Site Reliability Engineer

8 Hours ago • 6 Years + • Devops

Job Summary

Job Description

Aerospike is seeking a Senior Site Reliability Engineer to design, build, and optimize a scalable, highly resilient cloud platform. The role involves improving reliability, performance, and automation for seamless service delivery. Responsibilities include developing infrastructure, implementing monitoring systems, and driving continuous improvement for system efficiency and stability. Key tasks include designing and deploying large-scale cloud infrastructure, leading automation initiatives, building monitoring and alerting, managing incident response, enforcing security best practices, collaborating with development teams, participating in on-call rotations, establishing documentation, leading capacity planning, and mentoring junior engineers.
Must have:
  • 6+ years in SRE, DevOps, or related fields
  • Experience with scalable, resilient cloud systems
  • Expertise in a major cloud provider (AWS, GCP, Azure)
  • Proficiency in infrastructure-as-code (Terraform)
  • Experience with CI/CD pipelines
  • Deep understanding of Linux/Unix, networking
  • Proficiency in Python, Bash, or Go scripting
  • Experience with Docker and Kubernetes
  • Hands-on experience with monitoring tools (Prometheus, Grafana)
  • Strong problem-solving skills
  • Experience implementing cloud security best practices
  • Excellent English communication skills
Good to have:
  • Experience managing database deployments
  • Familiarity with Aerospike or NoSQL databases
  • Advanced cloud security knowledge
  • Relevant industry certifications (AWS, GCP)
  • Kubernetes certifications (CKA, CKAD, CKS)
  • Proficiency with configuration management (Ansible)
  • Experience leading collaborative development

Job Details

About Aerospike

Aerospike is the real-time database for mission-critical use cases and workloads, including machine learning, generative, and agentic AI. Aerospike powers millions of transactions per second with millisecond latency, at a fraction of the total cost of ownership compared to other databases.

Global leaders, including Adobe, Airtel, Barclays, Criteo, DBS Bank, Experian, Grab, HDFC Bank, PayPal, Sony Interactive Entertainment, The Trade Desk, and Wayfair, rely on Aerospike for customer 360, fraud detection, real-time bidding, profile stores, recommendation engines, and other use cases

Headquartered in Mountain View, California, Aerospike has a global presence with offices in London, Bangalore, and Tel Aviv.

In Bengaluru we follow hybrid models with mandate two days’ work from office.

Senior Site Reliability Engineer

As a Senior Site Reliability Engineer (SRE) for Aerospike, you will be instrumental in designing, building, and optimizing a scalable, highly resilient cloud platform. You will focus on improving reliability, performance, and automation to ensure seamless delivery and operation of our cloud platform services. Your responsibilities will include developing robust infrastructure, implementing intelligent monitoring systems, and driving continuous improvement initiatives that enhance system efficiency, scalability, and overall platform stability.

Key Responsibilities

  • Designing, deploying, and optimizing large-scale Aerospike cloud platform infrastructure and services across multiple environments
  • Leading the development and enhancement of automation and infrastructure-as-code solutions to improve operational efficiency
  • Building and maintaining monitoring, alerting, and observability implementations to proactively detect and resolve system issues
  • Leading incident response activities, conducting post-mortems, and driving continuous improvement initiatives
  • Designing and enforcing security best practices for cloud infrastructure and access control
  • Collaborating with development teams to ensure reliable service delivery and alignment with SRE best practices
  • Participating in 24/7 on-call rotation, responding to critical incidents and minimizing downtime through proactive mitigation strategies
  • Establishing documentation standards, runbooks, and system configurations for team knowledge sharing
  • Leading capacity planning and performance optimization efforts
  • Mentoring junior engineers and sharing knowledge to build team capabilities

Required Experience

  • 6+ years of experience in Site Reliability Engineering (SRE), DevOps, or related fields, with a focus on building scalable, resilient, and automated cloud-based systems
  • Hands-on experience designing, deploying, and optimizing production-grade, business-critical systems in cloud environments
  • Expertise with at least one major public cloud provider (AWS, Google Cloud, or Azure), including cloud-native services and architectures
  • Strong proficiency in infrastructure-as-code (IaC) tools such as Terraform to enable automated and reproducible infrastructure
  • Experience in CI/CD pipeline design and implementation, enabling seamless, automated software delivery and infrastructure updates
  • Deep understanding of Linux/Unix systems, networking fundamentals, and distributed system architectures
  • Proficiency in scripting and software development using Python, Bash, or Go to build automation, tooling, and infrastructure enhancements
  • Experience with containerization and orchestration technologies such as Docker and Kubernetes for efficient service deployment and scaling
  • Hands-on experience with monitoring, logging, and observability tools (e.g., Prometheus, Grafana, Datadog, Elasticsearch, Kibana) to drive data-driven system improvements
  • Strong problem-solving skills with an engineering-first mindset for improving system reliability, scalability, and performance
  • Experience implementing security best practices for cloud infrastructure, access control, and data protection
  • Excellent English communication skills (verbal and written) to collaborate effectively across teams and document key processes

Preferred Skills and Qualifications

  • Hands-on experience managing and optimizing database deployments and services in production environments, ensuring high availability and performance
  • Familiarity with Aerospike or other distributed NoSQL databases
  • Advanced understanding of security practices and implementation in cloud environments
  • Relevant industry certifications, such as AWS Certified DevOps Engineer, AWS Certified Solutions Architect, Google Professional Cloud DevOps Engineer, or equivalent
  • Kubernetes certifications such as Certified Kubernetes Administrator (CKA), Certified Kubernetes Application Developer (CKAD), or Certified Kubernetes Security Specialist (CKS)
  • Proficiency with configuration management tools (Ansible, Terraform, or similar) in complex environments
  • Experience leading collaborative development practices and advanced version control workflows

Aerospike is an Equal Opportunity Employer. We are committed to providing an environment free from discrimination on the basis of race, religion, color, sex, gender identity, sexual orientation, age, non-disqualifying physical or mental disability, national origin, veteran status, or any other basis covered by appropriate law.

Similar Jobs

Bebopbee - Marketing Motion Designer

Bebopbee

(Remote)
2 Months ago
Sailpoint - Salesforce Developer

Sailpoint

Pune, Maharashtra, India (Remote)
6 Days ago
WaveApps - Technical Project Manager

WaveApps

Toronto, Ontario, Canada (Remote)
1 Week ago
Rockstar Games - Senior Animation R&D Programmer: Retargeting

Rockstar Games

New York, United States (On-Site)
1 Month ago
Illumina - Associate Director, IT SCM

Illumina

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Rockstar Games - Senior DevOps Engineer

Rockstar Games

Edinburgh, Scotland, United Kingdom (On-Site)
2 Months ago
reversing labs  - Principal Infrastructure & Cloud Optimization Engineer

reversing labs

Zagreb, Grad Zagreb, Croatia (Hybrid)
3 Months ago
Rennsportgg - Site Reliability Engineer (f/m/x)

Rennsportgg

Munich, Bavaria, Germany (Remote)
1 Month ago
Penrose studios - Lead Platform Engineer

Penrose studios

San Francisco, California, United States (On-Site)
4 Years ago
appier - Staff/Senior Software Engineer, Machine Learning Platform (Ad Cloud)

appier

Taipei City, Taiwan (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Whatnot - Benefits Lead

Whatnot

Los Angeles, California, United States (Remote)
1 Week ago
Landor - Senior Designer (Packaging Focus)

Landor

Mumbai, Maharashtra, India (On-Site)
1 Month ago
Thales - Manager Design & Development / CTO

Thales

Zürich, Zurich, Switzerland (On-Site)
2 Months ago
Suki - Senior Engineering Manager - Frontend

Suki

Bengaluru, Karnataka, India (Hybrid)
2 Months ago
Interactive Brokers - Global Client Associate - English and Italian/French/Russian speaking

Interactive Brokers

Tallinn, Harju County, Estonia (On-Site)
8 Months ago
Workato - Partner Solutions Architect, AI Solutions

Workato

Palo Alto, California, United States (On-Site)
1 Month ago
Microsoft - Technical Support Engineer - Windows Networking

Microsoft

(Hybrid)
3 Months ago
Diligent Corporation - Compensation Analyst

Diligent Corporation

Budapest, Hungary (Hybrid)
2 Months ago
Microsoft - Principal Data Science Manager

Microsoft

Hyderabad, Telangana, India (On-Site)
3 Months ago
HCL Tech - Dynamics CRM Functional Lead Consultant

HCL Tech

North Carolina, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Bengaluru, Karnataka, India

Applied materials  - Senior Network Security Engineer - Detection & Protection

Applied materials

Bengaluru, Karnataka, India (On-Site)
2 Weeks ago
Assystems - Internal Finance Auditor

Assystems

Gurugram, Haryana, India (On-Site)
8 Months ago
Morning Star - Associate Team Lead

Morning Star

Mumbai, Maharashtra, India (Hybrid)
5 Days ago
Neolytix - Team Lead Full Stack Developer - Voice Transcription Platform

Neolytix

Gurugram, Haryana, India (On-Site)
2 Months ago
Outscal - Growth - Product Manager

Outscal

Delhi, India (On-Site)
8 Months ago
Digicore studios - Graphic Designer

Digicore studios

Pune, Maharashtra, India (On-Site)
6 Months ago
Capgemini - Financial Accounting

Capgemini

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Capgemini - Automation Testing Specialist

Capgemini

Pune, Maharashtra, India (On-Site)
1 Month ago
Accenture - Learning Operations New Associate

Accenture

Mumbai, Maharashtra, India (On-Site)
1 Month ago
extreme network - SR PROGRAMMER - Oracle Fusion Cloud- VBCS/ BI Reports/ OTBI/FRS & SmartView

extreme network

Chennai, Tamil Nadu, India (Hybrid)
9 Months ago

Get notifed when new similar jobs are uploaded

Devops Jobs

bytedance - Software Engineer, Multi-Cloud CDN

bytedance

Boston, Massachusetts, United States (On-Site)
2 Months ago
bounteous - Site Reliability Engineer

bounteous

Montreal, Quebec, Canada (Hybrid)
2 Months ago
Dialpad AI - Solutions Architect

Dialpad AI

Buenos Aires, Buenos Aires, Argentina (On-Site)
1 Week ago
Expedia - Windows Infrastructure Engineer III - Endpoint Management

Expedia

London, England, United Kingdom (On-Site)
1 Month ago
full swing studio - Principal Software Engineer, Platform

full swing studio

Carlsbad, California, United States (Hybrid)
1 Week ago
Toast - Manager, Site Reliability Engineering Tooling

Toast

Dublin, County Dublin, Ireland (Hybrid)
1 Week ago
Synechron - Scrum Master (DevOps Expertise in Cloud Computing and AI/ML Technologies)

Synechron

Pune, Maharashtra, India (On-Site)
1 Month ago
Granicus - Senior DevOps Engineer

Granicus

Bengaluru, Karnataka, India (Hybrid)
5 Months ago
Varonis  - Cloud Security Researcher

Varonis

Herzliya, Tel Aviv District, Israel (On-Site)
9 Months ago
Veeam Software - Senior Staff Platform Engineer

Veeam Software

Prague, Czechia (Remote)
1 Week ago

Get notifed when new similar jobs are uploaded

About The Company

Headquartered in Mountain View, California, Aerospike also has a global presence with offices in London, Bangalore, and Tel Aviv. Aerospike does not accept resumes from staffing agencies with which we do not have a written agreement and specific engagement for a particular opening. Our employment activities, inquiries, and offers are managed through our HR/Talent department, and all candidates are presented through this channel only. We do not accept unsolicited resumes.

Bengaluru, Karnataka, India (Hybrid)

Bengaluru, Karnataka, India (Hybrid)

Bengaluru, Karnataka, India (Hybrid)

Mountain View, California, United States (On-Site)

United States (Remote)

United States (Remote)

New York, United States (On-Site)

Bengaluru, Karnataka, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

View All Jobs

Get notified when new jobs are added by AeroSpike

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug