Associate Site Reliability Engineer

1 Month ago • 2-3 Years • Devops

Job Summary

Job Description

We are seeking a highly skilled Associate Site Reliability Engineer (SRE) to join our team. As an SRE, you will play a pivotal role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure. You will collaborate closely with development, operations, and other teams to implement and maintain efficient and resilient systems. Our group ensures the health and performance of system and services is optimal using monitoring tools and dashboards. Our goal is to maintain a scalable, fault-tolerant, high-load, distributed system. We are searching for an outstanding SRE expert who is responsible for driving and improving the Incident Management processes and goals for Site Reliability teams, with a focus on triaging and ensuring the reliability, performance, and scalability of CyberArk’s SaaS services and underlying AWS infrastructure.
Must have:
  • 2-3 years of experience as a Site Reliability Engineer
  • Strong proficiency in AWS cloud services
  • Good Logical, Analytical and Problem-solving skills
  • Strong communication skills
  • Ability to work in shifts (24x7)
  • Strong scripting skills (Python, PowerShell, CDK, Shell scripting)
  • Understanding of infrastructure as code tools (Terraform, Ansible)
  • Knowledge of containerization (Docker) and orchestration platforms (Kubernetes)
  • Expertise in CI/CD pipelines and automation tools (Jenkins, GitHub)
  • Exposure to monitoring and alerting tools (CloudWatch, Datadog, ELK, Grafana, Site24x7)
  • Documenting SOP and RCAs
  • Understanding of security best practices and compliance standards
Good to have:
  • AWS Certification
  • Security Certification

Job Details

About the Role:

We are seeking a highly skilled Associate Site Reliability Engineer (SRE) to join our team. As an SRE, you will play a pivotal role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure. You will collaborate closely with development, operations, and other teams to implement and maintain efficient and resilient systems.

We are the SRE Frontline Team of CyberArk. Our group ensures the health and performance of system and services is optimal using monitoring tools and dashboards. Our goal is to maintain a scalable, fault-tolerant, high-load, distributed system. We are searching for an outstanding SRE expert who is responsible for driving and improving the Incident Management processes and goals for Site Reliability teams, with a focus on triaging and ensuring the reliability, performance, and scalability of CyberArk’s SaaS services and underlying AWS infrastructure. This role involves a combination of technical expertise, documentation, and collaboration to meet the organization's reliability and availability goals. 

Responsibilities:

  • Incident Management, Monitoring and Alerting: Drive incident response processes and troubleshoot complex issues, ensuring timely resolution of outages. Establish monitoring, logging, and alerting best practices using tools like Datadog, Site24x7 etc
  • Tooling and Automation: Build essential tooling to improve reliability of systems and automated remediation of issues.  
  • Be a part of the on-call rotation 365x24x7. 
  • SOP Documentation: Create and maintain documentation for infrastructure, processes, and incident management protocols.
  • Understanding of Infrastructure as Code (IaC) tools such as Terraform and Ansible to automate the provisioning, configuration, and deployment processes.
  • Attend all training programs and complete all tasks set by the supervisor and assist other trainees wherever possible. 
  • Cloud Platform Expertise: Hands-on with AWS cloud services, including EC2, S3, VPC, RDS, EKS, ECS, CF and more.
  • CI/CD Pipelines: Fair understanding of CI/CD pipelines using tools like Jenkins.
  • Monitoring and Alerting: Hands-on experience with monitoring and alerting tools like ELK, Datadog, CloudWatch, Grafana etc to proactively identify and resolve issues.
  • Performance Tuning: Continuously optimize system performance, identify bottlenecks, and implement strategies to improve scalability and efficiency.
  • Cost Optimization: Identify and implement strategies to reduce cloud costs while maintaining performance and reliability.
  • Security Best Practices: Adhere to security best practices and implement measures to protect infrastructure and data from vulnerabilities and threats.
  • Collaboration and Communication: Work effectively with cross-functional teams to understand business requirements and provide technical guidance.

#IL-MP01

Required Skills and Experience:

 

  • 2-3 years of experience as a Site Reliability
  • Strong proficiency in AWS cloud services like EC2, S3, VPC, RDS, EKS, ECS, CloudFormation and more. AWS Certification helps.
  • Good Logical, Analytical and Problem-solving skills. 
  • Strong communication skills and Ability to work in shifts (24x7).
  • Strong scripting skills (Python, PowerShell, CDK, Shell scripting).
  • Understanding of infrastructure as code tools (Terraform, Ansible) and AWX Tower for Ansible automation.
  • Knowledge of containerization (Docker) and orchestration platforms (Kubernetes).
  • Expertise in CI/CD pipelines and automation tools (Jenkins, GitHub).
  • Exposure to monitoring and alerting tools (CloudWatch, Datadog, ELK, Grafana, Site24x7).
  • Documenting SOP and RCAs.
  • Understanding of security best practices and compliance standards. Security Certification is a plus.

Similar Jobs

deel. - Senior Backend Engineer, Node.js + AWS

deel.

United Kingdom (Remote)
3 Weeks ago
EMA - DevOps Engineering Lead

EMA

California, United States (Hybrid)
5 Months ago
EveryMatrix - Experienced CRM Data Scientist

EveryMatrix

United Kingdom (Hybrid)
10 Months ago
Motorola solutions - Business Analyst with Cloud and FinOps Experience

Motorola solutions

Bengaluru, Karnataka, India (On-Site)
1 Year ago
Temporal Technologies - Staff Solutions Architect

Temporal Technologies

Seattle, Washington, United States (On-Site)
2 Months ago
Flowable - Senior Cloud Solution Engineer

Flowable

Stuttgart, Baden-Württemberg, Germany (Hybrid)
1 Year ago
Devoteam - Consultant DevOps CI / CD

Devoteam

Cesson-Sévigné, Brittany, France (On-Site)
10 Months ago
1047 games - Infrastructure Engineer

1047 games

(Remote)
1 Month ago
Zazz - Cloud Engineer (AWS)

Zazz

(Remote)
6 Months ago
Tellius - Solutions Engineer

Tellius

(Remote)
3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Alpha Sense - Mid-Market Account Executive, Corporate

Alpha Sense

London, England, United Kingdom (On-Site)
3 Months ago
Egnyte - Software Engineer - AI/ML

Egnyte

Mountain View, California, United States (Hybrid)
7 Months ago
Autodesk - Account Executive, Territory

Autodesk

Texas, United States (Remote)
1 Month ago
deel. - Intern, Deel Lab

deel.

Indonesia (Remote)
3 Weeks ago
dun bradstreet - Senior Principal Data Scientist, AaaS

dun bradstreet

Frankfurt Am Main, Hessen, Germany (Hybrid)
2 Months ago
extreme network - Commercial Counsel

extreme network

California, United States (Remote)
8 Months ago
LMArena - DevOps Engineer, Site Reliability Engineering (SRE)

LMArena

California, United States (Hybrid)
2 Months ago
Matellio - Senior Technical Content Writer

Matellio

Jaipur, Rajasthan, India (On-Site)
3 Months ago
Globalization Partners - Employment Counsel I - North America

Globalization Partners

Canada (Remote)
1 Month ago
Netomi - Lead UX Designer

Netomi

Gurugram, India (Remote)
6 Months ago

Get notifed when new similar jobs are uploaded

Jobs in India

Motorola solutions - Full Stack Engineer (Node.JS with Angular/React)

Motorola solutions

Bengaluru, Karnataka, India (On-Site)
1 Year ago
NCR Atleos - SW Engineer II BI

NCR Atleos

Hyderabad, Telangana, India (On-Site)
3 Months ago
Tide - Senior Data Engineer (DBT/Snowflake)

Tide

Hyderabad, Telangana, India (Remote)
2 Months ago
Tellius - Data Scientist

Tellius

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Nagarro - Senior Analyst, UXD

Nagarro

Mumbai, Maharashtra, India (On-Site)
10 Months ago
Axi - Senior Backend Developer

Axi

India (On-Site)
1 Month ago
Boomi  - Boomi Technical Architect

Boomi

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Buckman - Business Development Manager - South

Buckman

Chennai, Tamil Nadu, India (On-Site)
4 Weeks ago
PhonePe - Manager, Finance

PhonePe

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Demandbase - Accounts Payable Manager

Demandbase

Hyderabad, Telangana, India (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

Devops Jobs

Semgrep - Senior Software Engineer, Infrastructure

Semgrep

San Francisco, California, United States (On-Site)
1 Month ago
Nousresearch - Machine Learning Engineer (Training Infrastructure)

Nousresearch

(On-Site)
1 Month ago
Gusto - Sr Site Reliability Engineer

Gusto

Denver, Colorado, United States (Remote)
3 Weeks ago
Zazz - Cloud Engineer (Azure)

Zazz

(Remote)
6 Months ago
Rippling - Frontend Engineer II - Ads Platform

Rippling

Bengaluru, Karnataka, India (On-Site)
3 Weeks ago
version 1 - Oracle Cloud Infrastructure (OCI) Architect

version 1

Dublin, County Dublin, Ireland (Hybrid)
4 Months ago
Google - Staff Software Engineer, Infrastructure, Core

Google

Sunnyvale, California, United States (On-Site)
4 Months ago
deel. - Senior Backend Engineer, Node.js + AWS

deel.

Romania (Remote)
3 Weeks ago
Rippling - Senior Software Engineer - Global Payroll Platform

Rippling

Bengaluru, Karnataka, India (On-Site)
8 Months ago

Get notifed when new similar jobs are uploaded

About The Company

CyberArk's mission is to secure the world against cyber threats so together we can move fearlessly forward. CyberArk is a global leader in identity security, helping organizations worldwide protect their most valuable assets and critical infrastructure. They offer a comprehensive platform that addresses the evolving challenges of identity-related risks, providing solutions for workforce access, privileged access, customer access, and machine identity security. CyberArk is committed to innovation and providing cutting-edge security solutions that empower their customers to be more secure and efficient.

Israel (Hybrid)

United States (On-Site)

United States (On-Site)

United States (Hybrid)

United States (Hybrid)

View All Jobs

Get notified when new jobs are added by CyberArk