Senior Site Reliability Engineer

2 Months ago • 7 Years + • DevOps • Undisclosed

About the job

Job Description

Seeking a Senior Site Reliability Engineer with 7+ years of experience in building and maintaining reliable cloud infrastructure. Expertise in AWS, Kubernetes, Terraform, and CI/CD is essential. You'll drive incident response, optimize performance, and collaborate with cross-functional teams.
Must have:
  • AWS Cloud Services
  • Kubernetes Expertise
  • Terraform & Ansible
  • CI/CD Pipelines
Good to have:
  • Serverless Architecture
  • AWS Lambda Functions
  • Cloud Development Kit
  • Security Certifications

About the job

Company Description

About CyberArk:

CyberArk (NASDAQ: CYBR), is the global leader in Identity Security. Centered on privileged access management, CyberArk provides the most comprehensive security offering for any identity – human or machine – across business applications, distributed workforces, hybrid cloud workloads and throughout the DevOps lifecycle. The world’s leading organizations trust CyberArk to help secure their most critical assets. To learn more about CyberArk, visit our CyberArk blogs or follow us on Twitter, LinkedIn or Facebook.

Job Description

We are seeking a highly skilled Senior Site Reliability Engineer (SRE) to join our team. As an Sr.SRE, you will play a pivotal role in ensuring the reliability, scalability, and performance of our cloud-based infrastructure.

You will collaborate closely with development, operations, and other teams to implement and maintain efficient and resilient systems.

Responsibilities:


  • Infrastructure Automation: Developing, deploying, and overseeing Infrastructure as Code (IaC) solutions using tools such as Terraform and Ansible to automate the provisioning, configuration, and deployment processes.
  • Cloud Platform Expertise: Deep understanding of AWS cloud services, including EC2, S3, VPC, RDS, EKS, ECS, CF and more. Experience with serverless architecture and AWS Lambda functions is a plus.
  • Containerization and Orchestration: Proficiency in containerization technologies (Docker) and orchestration platforms (Kubernetes) with deploying applications using tools like K8s and Helm.
  • CI/CD Pipelines: Build and maintain robust CI/CD pipelines using tools like Jenkins.
  • Monitoring and Alerting: Implement comprehensive monitoring and alerting solutions using tools like ELK, Datadog, CloudWatch, Grafana to proactively identify and resolve issues.
  • Incident Management: Drive incident response processes, troubleshoot complex issues, and perform Root Cause analysis (RCA) to prevent future occurrences (CAPA).
  • Performance Tuning: Continuously optimize system performance, identify bottlenecks, and implement strategies to improve scalability and efficiency.
  • Cost Optimization: Identify and implement strategies to reduce cloud costs while maintaining performance and reliability.
  • Security Best Practices: Adhere to security best practices and implement measures to protect infrastructure and data from vulnerabilities and threats.
  • Collaboration and Communication: Work effectively with cross-functional teams to understand business requirements and provide technical guidance.
  • SOP Documentation: Create and maintain documentation for infrastructure, processes, and incident management protocols.


Qualifications


  • 7+ years of experience as a DevOps engineer or Site Reliability Engineer
  • B.Tech computer
  • Strong proficiency in AWS cloud services like EC2, S3, VPC, RDS, EKS, ECS, CF and more. AWS Certification helps.
  • 3+ years of experience with serverless architectures using AWS Lambda.
  • Strong scripting skills (Python, PowerShell, CDK, Shell scripting).
  • Knowledge of CDK (Cloud Development Kit) for infrastructure as code.
  • Experience with infrastructure as code tools (Terraform, Ansible) and AWX Tower for Ansible automation.
  • Knowledge of containerization (Docker) and orchestration platforms (Kubernetes).
  • Expertise in CI/CD pipelines and automation tools (Jenkins, GitHub).
  • Exposure to monitoring and alerting tools (CloudWatch, Datadog, ELK, Grafana, NewRelic).
  • Documenting SOP and RCAs.
  • Understanding of security best practices and compliance standards. Security Certification is a plus.


View Full Job Description

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

Hyderabad, Telangana, India (On-Site)

Telangana, India (On-Site)

Hyderabad, Telangana, India (On-Site)

Hyderabad, Telangana, India (On-Site)

View All Jobs

Get notified when new jobs are added by CyberArk

Similar Jobs

PlayStation Global - Senior Machine Learning Engineer - GenAI

PlayStation Global, United States (Remote)

Token Metrics - Tech Lead - Crypto & AI (Colombia- Remote)

Token Metrics, Colombia (Remote)

Zscaler - Senior Backend Engineer

Zscaler, India (Hybrid)

Playtech - Dev Ops Engineer

Playtech, United Kingdom (On-Site)

Wildlife Studios - Senior Site Reliability Engineer

Wildlife Studios, Brazil (On-Site)

Limbic Entertainment - DevOps Lead (m/f/d)

Limbic Entertainment, France (On-Site)

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Alpha Sense - Senior Software Engineer

Alpha Sense, Canada (On-Site)

N-iX - Lead Data Engineer (#2528)

N-iX, United States (Remote)

Pragma - Service Operations Specialist

Pragma, United States (Remote)

Vi - Data Engineer

Vi, Israel (On-Site)

PwC - Cyber Security Architect

PwC, Netherlands (On-Site)

Skyhigh Security - Senior Software Engineer

Skyhigh Security, India (On-Site)

Get notifed when new similar jobs are uploaded

Jobs in Hyderabad, Telangana, India

Get notifed when new similar jobs are uploaded

DevOps Jobs

Barracuda Networks  Inc  - Sr. Salesforce Developer

Barracuda Networks Inc , India (Hybrid)

IDEMIA - Site Reliability Engineer.

IDEMIA, India (Hybrid)

SES Satellites - Senior Engineer, Site Reliability

SES Satellites, India (Hybrid)

Ubisoft - IT Developer - Temporary Contract

Ubisoft, Canada (On-Site)

Nintendo - Sr Manager, Engineering Infrastructure and IT

Nintendo, United States (On-Site)

ARHS - AWS or Azure Cloud Architect

ARHS, Luxembourg (On-Site)

Luxoft - ETL Developer - Python

Luxoft, India (On-Site)

bosh group india - Senior Data Engineer

bosh group india, India (On-Site)

Get notifed when new similar jobs are uploaded