Senior Site Reliability Engineer

2 Months ago • 6+ Years • DevOps

Job Summary

Job Description

As a Senior Site Reliability Engineer (SRE) for Aerospike Cloud, you will design, build, and optimize scalable and resilient cloud-based Aerospike deployments. You will focus on enhancing reliability, performance, and automation, ensuring the platform efficiently supports multiple cloud product offerings. This role involves developing robust infrastructure, implementing intelligent monitoring, and driving continuous improvements to enhance system efficiency and scalability. You will be responsible for managing large-scale Aerospike deployments, automating infrastructure, building monitoring solutions, implementing security best practices, and participating in incident response. You will also collaborate with development teams and be part of a 24/7 on-call rotation.
Must have:
  • 6+ years of experience in SRE, DevOps, or related fields.
  • Experience designing, deploying, and optimizing cloud-based systems.
  • Expertise with at least one major public cloud provider.
  • Strong proficiency in infrastructure-as-code (IaC) tools.
  • Experience in CI/CD pipeline design and implementation.
  • Deep understanding of Linux/Unix systems and networking.
  • Proficiency in scripting and software development (Python, Bash, or Go).
  • Experience with containerization and orchestration (Docker, Kubernetes).
  • Hands-on experience with monitoring, logging, and observability tools.
  • Strong problem-solving skills and an engineering-first mindset.
  • Experience implementing security best practices for cloud infrastructure.
  • Excellent English communication skills.
Good to have:
  • Hands-on experience managing and optimizing database deployments.
  • Familiarity with Aerospike or other distributed NoSQL databases.
  • Relevant industry certifications (AWS, Google Cloud, Kubernetes).

Job Details

About Aerospike
At Aerospike, we dream big. Our focus is helping companies tackle seemingly insurmountable problems and doing what’s never been done before. That is why we developed the world's leading real-time data platform that powers mission-critical applications at the world's most innovative, category-disrupting companies. Aerospike companies have deployed extreme-scale real-time applications to fight fraud, dramatically increase shopping cart size, enable global digital payments, and deliver hyper-personalized user experiences to tens of millions of customers.
Customers like Airtel, Experian, Nielsen, PayPal, Snap, Verizon Media, and Wayfair rely on Aerospike as the data foundation for the future to help them act in the microsecond moments that matter.
Headquartered in Mountain View, California, Aerospike has a global presence with offices in London, Bangalore, and Tel Aviv.

Senior Site Reliability Engineer

As a Senior Site Reliability Engineer (SRE) for Aerospike Cloud, you will play a key role in designing, building, and optimizing scalable and resilient cloud-based Aerospike deployments. You will focus on enhancing reliability, performance, and automation, ensuring our platform efficiently supports multiple cloud product offerings. Your work will involve developing robust infrastructure, implementing intelligent monitoring, and driving continuous improvements to enhance system efficiency and scalability.

Key Responsibilities

  • Designing, implementing, and managing large-scale Aerospike deployments across multiple cloud environments, ensuring high availability and performance.
  • Developing deep expertise in Aerospike and its cloud deployment patterns, understanding failure scenarios, and designing resilient remediation strategies.
  • Automating infrastructure and service configurations to improve system efficiency, reliability, and scalability.
  • Building and maintaining monitoring, alerting, and observability solutions to proactively detect and resolve issues, ensuring system health.
  • Implementing and enforcing security best practices for cloud infrastructure, access control, and data protection to safeguard deployments.
  • Participating in incident response, post-mortems, and continuous improvement initiatives, driving long-term stability and reliability.
  • Collaborating with development teams to ensure new deployments and updates align with SRE best practices for reliability, performance, and scalability.
  • Being part of a 24/7 on-call rotation, responding to critical incidents and minimizing downtime through proactive mitigation strategies.

Required Experience

  • 6+ years of experience in Site Reliability Engineering (SRE), DevOps, or related fields, with a focus on building scalable, resilient, and automated cloud-based systems.
  • Hands-on experience designing, deploying, and optimizing production-grade, business-critical systems in cloud environments.
  • Expertise with at least one major public cloud provider (AWS, Google Cloud, or Azure), including cloud-native services and architectures.
  • Strong proficiency in infrastructure-as-code (IaC) tools such as Terraform to enable automated and reproducible infrastructure.
  • Experience in CI/CD pipeline design and implementation, enabling seamless, automated software delivery and infrastructure updates.
  • Deep understanding of Linux/Unix systems, networking fundamentals, and distributed system architectures.
  • Proficiency in scripting and software development using Python, Bash, or Go to build automation, tooling, and infrastructure enhancements.
  • Experience with containerization and orchestration technologies such as Docker and Kubernetes for efficient service deployment and scaling.
  • Hands-on experience with monitoring, logging, and observability tools (e.g., Prometheus, Grafana, Datadog, Elasticsearch, Kibana) to drive data-driven system improvements.
  • Strong problem-solving skills with an engineering-first mindset for improving system reliability, scalability, and performance.
  • Experience implementing security best practices for cloud infrastructure, access control, and data protection.
  • Excellent English communication skills (verbal and written) to collaborate effectively across teams and document key processes.

Preferred Skills and Qualifications

  • Hands-on experience managing and optimizing database deployments and services in production environments, ensuring high availability and performance.
  • Familiarity with Aerospike or other distributed NoSQL databases.
  • Relevant industry certifications, such as AWS Certified DevOps Engineer, AWS Certified Solutions Architect, Google Professional Cloud DevOps Engineer, or equivalent.
  • Kubernetes certifications such as Certified Kubernetes Administrator (CKA), Certified Kubernetes Application Developer (CKAD), or Certified Kubernetes Security Specialist (CKS).

Aerospike is an Equal Opportunity Employer. We are committed to providing an environment free from discrimination on the basis of race, religion, color, sex, gender identity, sexual orientation, age, non-disqualifying physical or mental disability, national origin, veteran status, or any other basis covered by appropriate law.

About The Company

Headquartered in Mountain View, California, Aerospike also has a global presence with offices in London, Bangalore, and Tel Aviv. Aerospike does not accept resumes from staffing agencies with which we do not have a written agreement and specific engagement for a particular opening. Our employment activities, inquiries, and offers are managed through our HR/Talent department, and all candidates are presented through this channel only. We do not accept unsolicited resumes.
