Site Reliability Engineer I / II

1 Year ago • 2-4 Years • Devops

Job Summary

Job Description

Zeta is seeking a Site Reliability Engineer I / II to ensure the reliability of software systems by designing, implementing, and maintaining scalable and reliable infrastructure. Responsibilities include developing automation tools, responding to incidents, capacity planning, performance optimization, implementing Infrastructure as Code (IaC) using tools like Terraform or Ansible, and maintaining monitoring and logging solutions. The role also involves on-call support, security collaboration, disaster recovery planning, and continuous improvement of system performance and reliability.
Must have:
  • Ensuring system reliability through infrastructure design and maintenance.
  • Developing automation tools and scripts for operational tasks.
  • Responding to incidents to minimize downtime.
  • Forecasting future capacity needs for infrastructure.
  • Optimizing software systems for performance.
  • Implementing Infrastructure as Code (IaC) practices.
  • Maintaining monitoring and logging solutions.
  • Participating in on-call rotation for 24/7 availability.
  • Collaborating on infrastructure and application security.
  • Developing disaster recovery plans.
  • Proficiency in Python, Go, Shell, or Bash.
  • Experience with automation tools like Ansible, Puppet, Chef.
  • Experience with containerization (Docker) and orchestration (Kubernetes).
  • Proficiency in cloud platforms (AWS, Azure, GCP).
  • Familiarity with monitoring tools (Prometheus, Grafana, ELK stack).
  • Understanding of networking concepts and protocols.
  • Knowledge of security best practices.
  • Understanding of CI/CD pipelines.
  • Proficient use of version control systems like Git.
Good to have:
  • Knowledge of Infrastructure as Code (IaC) tools like Terraform.
  • Experience working for a product organization.
  • Certifications from cloud service providers (AWS, Google Cloud, Microsoft).

Job Details

All about Zeta Suite : 
Zeta is the world’s first and only Omni Stack for banks and fintechs. We are rethinking payments from core to the edge, led by the vision to augment the purpose of money and banking with technology. A single, modern software stack comprising processing, loans, customizable mobile and web apps, a fraud engine, and rewards for retail banking.
We are a new-age, high-growth startup (& a unicorn!) founded in 2015 by two visionary leaders, Bhavin Turakhia & Ramki Gaddipati, whose entrepreneurial legacy & excellence has put us on top of the global fintech ecosystem. Zeta counts amongst its customers over 10 banks and 25 fintechs across 8 countries - some of our notable clients include Sodexo - a leading issuer of employee benefits & rewards with over 30 million global users, and HDFC Bank - the 14th largest bank by market cap in the world. Learn more about our manifesto & beyond.


Responsibilities:

    • System Reliability: Ensuring the reliability of software systems by designing, implementing, and maintaining scalable and reliable infrastructure.
    • Automation: Developing automation tools and scripts to streamline operational tasks, reduce manual intervention, and improve overall system efficiency.
    • Incident Response and Resolution: Monitoring system performance and responding to incidents promptly to minimize downtime and ensure high availability.
    • Capacity Planning: Analyzing system usage patterns and forecasting future capacity needs to ensure that the infrastructure can handle current and future demands.
    • Performance Optimization: Identifying and addressing performance bottlenecks in software systems through optimization and tuning.
    • Infrastructure as Code (IaC): Implementing infrastructure as code practices, using tools like Terraform or Ansible, to define and manage infrastructure in a version-controlled and automated manner.
    • Monitoring and Logging: Implementing and maintaining monitoring and logging solutions to gain insights into system behavior, troubleshoot issues, and proactively address potential problems.
    • On-Call Support: Participating in an on-call rotation to respond to incidents outside of regular working hours and ensure 24/7 system availability
    • Security: Collaborating with security teams to implement and maintain security best practices in infrastructure and application
    • Disaster Recovery Planning: Developing and maintaining disaster recovery plans to ensure that systems can quickly recover from major outages or failures
    • Continuous Improvement: Continuously analyzing system performance, reliability, and incidents to identify areas for improvement and implementing changes to enhance overall system resilience.

Skills:

    • Programming Languages: Proficiency in one or more programming languages, commonly Python, Go, Shell, Bash.
    • Automation and Scripting: Strong automation skills using tools like Ansible, Puppet, Chef, or custom scripts. Knowledge of Infrastructure as Code (IaC) tools like Terraform
    • Containerization and Orchestration: Experience with containerization technologies like Docker and container orchestration platforms like Kubernetes.
    • Cloud Computing: Proficiency in any of the cloud platforms such as AWS, Azure, or Google Cloud Platform, and knowledge of managing infrastructure in the cloud.
    • Monitoring and Logging: Familiarity with monitoring tools (e.g., Prometheus, Grafana, ELK stack) and logging frameworks to track system performance and troubleshoot issues.
    • Networking: Understanding of networking concepts, protocols, and troubleshooting skills.
    • Security: Knowledge of security best practices, including encryption, access controls, and vulnerability management.
    • Continuous Integration/Continuous Deployment (CI/CD): Understanding and implementation of CI/CD pipelines for automated testing and deployment.
    • Load Balancing: Experience in incident response, troubleshooting, and resolution.
    • Version Control: Proficient use of version control systems like Git.

Experience and Qualifications:

    • 2-4 years of experience in site reliability engineering.
    • B.Tech/M.Tech in computer science, information technology or a related field.
    • Having experience working for a product organization is a plus.
    • Certifications from cloud service providers like AWS Certified DevOps Engineer, Google Cloud Professional DevOps Engineer, or Microsoft Certified is a plus

Life At ZetaAt Zeta, we want you to grow to be the best version of yourself by unlocking the great potential that lies within you. This is why our core philosophy is ‘People Must Grow.’ We recognize your aspirations; act as enablers by bringing you the right opportunities, and let you grow as you chase disruptive goals. 


#LifeAtZeta is adventurous and exhilarating at the same time. You get to work with some of the best minds in the industry and experience a culture that values the diversity of thoughts. If you want to push boundaries, learn continuously and grow to be the best version of yourself,  Zeta is the place to be!  Explore the life at zeta 

Zeta is an equal opportunity employer.  
At Zeta, we are committed to equal employment opportunities regardless of job history, disability, gender identity, religion, race, marital/parental status, or another special status. We are proud to be an equitable workplace that welcomes individuals from all walks of life if they fit the roles and responsibilities.

Similar Jobs

Inspiren - Senior Technical Operations Engineer

Inspiren

New York, United States (Remote)
1 Month ago
Capgemini - Business Advisor - A

Capgemini

Noida, Uttar Pradesh, India (On-Site)
4 Weeks ago
bytedance - Cloud Technical Support Engineer

bytedance

Singapore (On-Site)
5 Months ago
flying wild hog - Junior IT Support

flying wild hog

Warsaw, Masovian Voivodeship, Poland (On-Site)
3 Months ago
Veeam Software - Virtualization Backup Engineer (Italian speaker)

Veeam Software

Poland (Remote)
3 Months ago
HCL Tech - Enterprise solution architect

HCL Tech

New Jersey, United States (On-Site)
2 Months ago
Rackspace Technology - Site Reliability Engineer III

Rackspace Technology

India (Remote)
1 Month ago
Trellix - Principal Engineer – Developer Enablement & CI/CD Strategy

Trellix

Bengaluru, Karnataka, India (On-Site)
3 Months ago
Activision - Staff Platform Solutions Engineer

Activision

New York, United States (On-Site)
2 Months ago
Qualcomm - Engineer, Staff -Devops

Qualcomm

Hyderabad, Telangana, India (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

NVIDIA - Senior Networking Security Research Architect

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
7 Months ago
Qualcomm - Engineer, Senior - Core Platform Boot Loaders

Qualcomm

Hyderabad, Telangana, India (On-Site)
3 Months ago
Axel springer - Senior Systems/DevOps Engineer

Axel springer

Berlin, Berlin, Germany (Hybrid)
3 Months ago
Match Group - Apprenticeship Junior Helpdesk Technician

Match Group

Paris, Île-de-France, France (Hybrid)
1 Month ago
NVIDIA - Senior Hardware Validation Engineer

NVIDIA

Santa Clara, California, United States (On-Site)
5 Months ago
Adtran - Principal Software Engineer

Adtran

Yokne'am Illit, North District, Israel (Hybrid)
3 Months ago
Qualcomm - Sr Engineer- Test

Qualcomm

Hyderabad, Telangana, India (On-Site)
2 Months ago
Tesla - Electrician / Energy Field Service Technician

Tesla

Antwerp, Flanders, Belgium (On-Site)
6 Months ago
Capgemini - Business Advisor

Capgemini

Bengaluru, Karnataka, India (On-Site)
2 Months ago
GoMotive - Designated Support Engineer

GoMotive

Pakistan (Remote)
2 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Hyderabad, Telangana, India

Telastra - Senior Software Engineer

Telastra

Bengaluru, Karnataka, India (On-Site)
1 Month ago
PwC - Associate - Kolkata Y-14 - Technology Consulting

PwC

Kolkata, West Bengal, India (On-Site)
10 Months ago
Synechron - Java Developer (Spring, Hibernate, Couchbase & Cloud-Ready Applications)

Synechron

Chennai, Tamil Nadu, India (On-Site)
2 Months ago
Qualcomm - Engineer, QA_Automation framework/Python

Qualcomm

Hyderabad, Telangana, India (On-Site)
1 Month ago
Glean - Tax Apprentice

Glean

Bengaluru, Karnataka, India (On-Site)
1 Month ago
ShyftLabs - Senior Backend Developer

ShyftLabs

Noida, Uttar Pradesh, India (On-Site)
9 Months ago
frames store - Associate Visual Effects  Supervisor

frames store

Mumbai, Maharashtra, India (On-Site)
8 Months ago
Sprinkler - Technical Project Manager

Sprinkler

Gurugram, Haryana, India (On-Site)
3 Months ago
Neolytix - Team Lead – Accounts Receivable (US Healthcare)

Neolytix

Gurugram, Haryana, India (On-Site)
1 Month ago
Contentstack - Senior Engineer I - DevOps

Contentstack

Chennai, Tamil Nadu, India (Hybrid)
3 Months ago

Get notifed when new similar jobs are uploaded

Devops Jobs

bytedance - Solutions Architect

bytedance

Taguig, Metro Manila, Philippines (On-Site)
6 Months ago
Cadence - DRC, LVS, 3DIC PV Solutions Engineer

Cadence

Cork, County Cork, Ireland (On-Site)
3 Months ago
Apple - AIML - Sr./Staff ML Engineer, Machine Learning Platform & Intelligence

Apple

Seattle, Washington, United States (On-Site)
3 Months ago
Shield AI - Software Engineer, API's & Infrastructure (R2609)

Shield AI

San Diego, California, United States (On-Site)
4 Weeks ago
Square - Senior Azure DevOps Engineer

Square

Groningen, Groningen, Netherlands (Hybrid)
4 Weeks ago
Litmus - Solutions Architect

Litmus

Pune, Maharashtra, India (On-Site)
3 Months ago
Google - Senior Staff Software Engineer, Site Reliability Engineering, Google Cloud

Google

Kirkland, Washington, United States (On-Site)
4 Months ago
Nagarro - System Engineer Infrastructure Services

Nagarro

Germany (Remote)
7 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Bengaluru, Karnataka, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

Hyderabad, Telangana, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

Bengaluru, Karnataka, India (On-Site)

View All Jobs

Get notified when new jobs are added by zeta

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug