Senior Site Reliability Engineer

4 Months ago • Upto 10 Years • DevOps

Job Summary

Job Description

Tanla seeks a Senior Site Reliability Engineer to ensure platform and application availability, scalability, and reliability. Must-haves: Kubernetes, Docker, CI/CD, and experience with monitoring stacks like Grafana. Experience with distributed queuing systems like Redis, Kafka, and databases like Postgres, MySQL, and Click House is preferred.
Must have:
  • Kubernetes Expertise
  • Docker Experience
  • CI/CD Hands-on
  • Grafana Expertise
Good to have:
  • Distributed Queuing
  • Database Skills
  • High Traffic
  • Security Concepts
Perks:
  • Impactful Work
  • Growth Opportunities

Job Details

About the job

About the Role: As a Site Reliability Engineer, you will be responsible for ensuring platform and application availability, scalability, and reliability, while maintaining optimal system uptime.


What you''ll be Responsible for?


  • Build, monitor and maintain highly scalable, large-scale deployments.
  • Installation/deployment of new releases, environments for applications.
  • Proactively monitor systems and applications, develop and maintain monitoring tools and dashboards, and ensure high availability of production environments by identifying performance issues and implementing corrective actions.
  • Incident Management: Lead incident response efforts, diagnose root causes, and implement long-term solutions to prevent recurrence. Ensure effective communication during outages.
  • Collaboration & Coordination: Work closely with cross-functional teams to ensure efficient platform integration, API management, and campaign execution, while providing technical guidance and support as needed.
  • Troubleshooting and Root Cause Analysis: Utilize your expertise to investigate and resolve incidents quickly during crisis situations, performing root cause analysis to prevent recurrence.
  • Ensure high availability of production environments by monitoring performance metrics and implementing corrective actions when necessary.
  • Platform Integration: Manage and oversee the integration of various APIs, ensuring seamless interoperability between systems and third-party services.
  • Support the compliance and security integrity of the environments.
  • Adherence to process compliance & ensuring platform reliability.
  • Experience in monitoring and automations in Prometheus Grafana or ELK or Datadog or Dynatrace or any observability tools
  • Experience with container management and micro-services architectures such as Docker in cloud or on-premises infrastructure.


What You'd have?


  • Kubernetes: Expertise in creation, maintenance, scaling, and upgrades of Production clusters.
  • Docker: Must have experience in writing Docker files complying with Industry standard best practices.
  • CI/CD: Must have hands-on experience with Azure-DevOps/Jenkins in creation & Execution of Pipelines in a multi-target environment.
  • Troubleshooting skills: Expertise in analysis of applications logs to drilldown in identification of the issue with expertise on logging stacks such as ELK, Dynatrace, Splunk
  • Monitoring Stacks: Expertise in using Grafana with skills on building & managing of dashboards on various data sources in Grafana.
  • Programming Skills: Experience in creating & managing of Bash scripts & Ansible with some exposure on Terraform.
  • Environment: Excellent skills and hands-on in Linux environments and able to troubleshoot issues at OS levels.
  • Experience on usage of project management tools such as JIRA
  • Experience in deploying & Managing of Distributed Queuing systems such as Redis, Kafka Rabbit-MQ, IBM-MQ, MSMQ
  • Experience in deploying & managing of Databases in standalone & cluster modes with basic DB Skills on Postgres, MySQL, Click House
  • Prior experience in working on high traffic & highly scalable platforms is an added advantage.
  • Good command on Linux, Networking concepts (TLS/SSL, DNS, Load Balancers, etc.,) and troubleshooting skills in large scale environments
  • Deep understanding of basic security concepts and protocols - authentication, authorization, signing, encryption, SSL/TLS, SSH/SFTP, X509 certificates
  • Good knowledge of ITIL terminology for incident and problem management
  • Track record of excellent interpersonal, analytical, and communication skills.
  • Bachelor of Science in Computer Science or other related discipline.


Why join us?


  • Impactful Work: Play a pivotal role in safeguarding Tanla's assets, data, and reputation in the industry.
  • Tremendous Growth Opportunities: Be part of a rapidly growing company in the telecom and CPaaS space, with opportunities for professional development.
  • Innovative Environment: Work alongside a world-class team in a challenging and fun environment, where innovation is celebrated. Tanla is an equal opportunity employer.


Tanla is an equal opportunity employer. We champion diversity and are committed to creating an inclusive environment for all employees.

www.tanla.com

Similar Jobs

NXP - 24/25 IT Application Support intern

NXP

Bangkok, Bangkok, Thailand (On-Site)
• 4 Months ago
ByteDance - Global Site Reliability Engineer Lead - Security Engineering - San Jose

ByteDance

San Jose, California, United States (On-Site)
• 3 Months ago
Warner Bros Games - Senior Software Developer

Warner Bros Games

Toronto, Ontario, Canada (Hybrid)
• 4 Months ago
Alphasense - Join AlphaSense India Talent Community

Alphasense

Bengaluru, Karnataka, India (On-Site)
• 2 Months ago
Metacore - Backend Programmer

Metacore

Helsinki, Uusimaa, Finland (Hybrid)
• 4 Months ago
Google - Principal Engineer, Rollouts

Google

(On-Site)
• 2 Months ago
Microsoft - Software Engineering

Microsoft

Hyderabad, Telangana, India (On-Site)
• 1 Month ago
Nielsen Holdings - Software Engineer (Java/Scala, Spark, SQL, AWS, Kubernetes)

Nielsen Holdings

Gurugram, Haryana, India (Hybrid)
• 3 Months ago
PwC - IN-Manager_D365 Azure Integration Developer_MS Dynamics– Advisory  - Kolkata

PwC

Kolkata, West Bengal, India (On-Site)
• 3 Months ago
HiLabs - Lead or Senior Data Scientist

HiLabs

Pune, Maharashtra, India (On-Site)
• 3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

The Walt Disney Company - Manager, Software Engineer - Media Engineering

The Walt Disney Company

New York, New York, United States (On-Site)
• 2 Weeks ago
Razer - Lead Site Reliability Engineer

Razer

Shanghai, Shanghai, China (On-Site)
• 4 Months ago
PlayStation Global - Staff Service Reliability Engineer

PlayStation Global

Berlin, Berlin, Germany (On-Site)
• 3 Months ago
Starkflow - Java Architect

Starkflow

Las Vegas, Nevada, United States (On-Site)
• 1 Week ago
Consilio LLC - SR Site Reliability Engineer

Consilio LLC

Bengaluru, Karnataka, India (Hybrid)
• 3 Months ago
Playrix - Senior Golang Developer

Playrix

Ireland (Remote)
• 1 Week ago
DEVOTEAM - Tech Lead DevOps H/F

DEVOTEAM

Levallois-Perret, ĂŽle-de-France, France (Remote)
• 3 Months ago
Alpha Sense - Join AlphaSense India Talent Community

Alpha Sense

Bengaluru, Karnataka, India (On-Site)
• 3 Months ago
Ubisoft - DevOps Linux Administrator

Ubisoft

Saint-Mandé, Île-de-France, France (Hybrid)
• 1 Week ago

Get notifed when new similar jobs are uploaded

Jobs in Hyderabad, Telangana, India

CleverTap - Senior Manager - HR Business Partner

CleverTap

Mumbai, Maharashtra, India (On-Site)
• 4 Months ago
PwC - IN_Senior Manager_SAP FI_ Tax Technology_TRS_Bangalore

PwC

Bengaluru, Karnataka, India (On-Site)
• 1 Month ago
BP - GA, Analyst

BP

Pune, Maharashtra, India (Hybrid)
• 4 Months ago
Viacom18 Media   - SSDE I - Frontend iOS/Apple TV Developer, JioCinema

Viacom18 Media

Bengaluru, Karnataka, India (On-Site)
• 6 Months ago
Assystems - Senior Hydraulic Engineer

Assystems

Gurugram, Haryana, India (On-Site)
• 3 Months ago
Experian - Senior iOS Engineer

Experian

Hyderabad, Telangana, India (Hybrid)
• 4 Months ago
Axinous - Manager - International Payroll

Axinous

Sahibzada Ajit Singh Nagar, Punjab, India (On-Site)
• 2 Months ago
Dream Sports - Manager - Digital Marketing

Dream Sports

Mumbai, Maharashtra, India (On-Site)
• 3 Months ago
Diversified - AV Tier 2 Agent

Diversified

Bengaluru, Karnataka, India (Remote)
• 4 Months ago
Nagarro - Associate Staff Engineer, QA Manual

Nagarro

Bengaluru, Karnataka, India (On-Site)
• 3 Months ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

Gaming Innovation Group  - Infrastructure Engineer

Gaming Innovation Group

Catalonia, Spain (On-Site)
• 3 Months ago
PearlAbyss - Junior System Engineer

PearlAbyss

(On-Site)
• 4 Weeks ago
Electronic Arts - [EA Sports FC] DevOps Engineer

Electronic Arts

Seoul, South Korea (On-Site)
• 2 Months ago
Paytm - Software Engineer - Cloud Automation

Paytm

Toronto, Ontario, Canada (Hybrid)
• 2 Months ago
Microsoft - Senior Service Engineer

Microsoft

(On-Site)
• 3 Weeks ago
Google - Data Cloud Consultant, Professional Services, Google Cloud

Google

Mexico City, Mexico City, Mexico (On-Site)
• 1 Month ago
LeoVegas - Cloud Security Engineer

LeoVegas

Stockholm, Stockholm County, Sweden (Hybrid)
• 3 Months ago
PwC - IN-Associate _ Hybrid Platform Modernization_OneCloud_Advisory_Bangalore

PwC

Bengaluru, Karnataka, India (Hybrid)
• 4 Months ago
Probably Monsters - Build Engineer, Ecosystems (Core Technology)

Probably Monsters

Texas, United States (On-Site)
• 5 Days ago
Razer - Lead Site Reliability Engineer

Razer

Shanghai, Shanghai, China (On-Site)
• 4 Months ago

Get notifed when new similar jobs are uploaded