Site Reliability Engineer (SRE)

2 Months ago • 5 Years + • Devops

Job Summary

Job Description

As an SRE at PayPay, the role involves ensuring high availability and top-level performance of systems to provide users with reliable service. The responsibilities include analyzing technologies, developing monitoring tools, ensuring system stability, developing solutions for performance, integrating telemetry platforms, implementing best practices, documenting knowledge, staying updated on modern technologies, and participating in incident management. The goal is to improve system reliability and scalability, while delivering positive user experiences. Experience in troubleshooting, tuning microservices, software development in languages like Python, Java, or Go with strong fundamentals, is required, and the role requires collaboration within cross-functional teams.
Must have:
  • Troubleshooting microservices architectures on Kubernetes and AWS
  • 5+ years of software development experience in Python, Java, or Go
  • Experience with observability tools and data gathering
  • Database knowledge such as RDS or NoSQL
  • Excellent communication and collaborative skills
Good to have:
  • Container image management and optimization
  • Experience in large distributed system architecture
  • Understanding of IaC and automation tools like Terraform
  • Background in SRE/DevOps concepts and implementation
  • Experience managing monitoring tools such as CloudWatch

Job Details

 

 

About PayPay India

PayPay, a fintech company providing a service enjoyed by over 63 million users (as of April 2024) merely 5 years since its launch in 2018 in Japan. The company is now home to a very diverse team of members from more than 50 countries. We grew to a team of several thousand employees in Japan but are far from over. We are still in the Day 1. Every day, new members join us from all over the world to create new value and deliver it to society.
 
 
 
 

Why India ?

To build our Payment services, we got technical cooperation from Paytm (A large payment service company in India). And based on their customer-first technologies , we created and expanded the smartphone payment service in Japan. Therefore, we have decided to establish a development base in India, because it is a major IT country with many talented engineers, as evidenced by the fact that cutting-edge mobile payments can continue to be generated.

OUR VISION IS UNLIMITED

We dare to believe that we do not need a clear vision to create a future beyond our imagination. PayPay will always stay true to our roots and realise a vision (future) that no one else can imagine by constantly taking risks and challenging ourselves. With this mindset, you will be presented with new and exciting opportunities on a daily basis and have the opportunity to grow and reach new dimensions that you could never have imagined.

Job Description

At PayPay, we’re constantly working on improving our systems and processes to support PayPay’s exponential growth. As an SRE at PayPay, we strive towards ensuring high availability and top-level performance so that our users can have flawless and reliable service exceeding expectations.

Considering PayPay’s growth, we are looking for experienced SREs who can deliver insights into system bottlenecks and ensure system reliability and scalability, while increasing the number of services that our company offers.

We are looking for individuals who can bring informed and unique viewpoints, enjoy collaborating with a cross-functional team and are actively pushing boundaries to develop reliable and scalable solutions and positive user experiences.

Main Responsibilities

  • Analyse current technologies used in the company and develop monitoring and notification tools to improve observability and visibility.
  • Ensure system stability by pre-emptively verifying failure scenarios and implement solutions to reduce MTTR
  • Develop solutions to improve system performance with a focus on high availability, scalability and resilience
  • Integrate telemetry and alerting platforms to track and improve reliability of systems
  • Implement industry best practices for system development, configuration management and system deployment
  • Ensure seamless flow of information between teams by documenting knowledge gained
  • Be up to date on modern technologies and trends to advocate for inclusion within products if they add value
  • Participate in incident management including troubleshooting production issues, driving root cause analysis (RCA) and actively sharing lessons learned to improve system reliability and internal knowledge.

Required Skills and Experiences

  • Experience troubleshooting, tuning high performance microservices architectures running on Kubernetes and AWS in highly available production environments.
  • 5+ years experience in software development in Python, Java, Go, etc with strong fundamentals in data structures, algorithms, problem solving and complexity analysis.
  • During the SRE selection process, you will have a coding challenge.
  • Curious and proactive in finding performance bottlenecks, scalability and resilience problem areas and addressing them.
  • Experience with observability tools and gathering data.
  • Database knowledge such as RDS, NoSQL, distributed TiDB, etc.
  • Excellent communication skills, collaborative and getting things done attitude.
  • Enjoy taking up a challenge and driving it to conclusion.

Preferred Qualifications

  • Container image management and optimization.
  • Experience in large distributed system architecture and capacity planning.
  • Understanding of IaC, automation tools, terraform, cloud formation, etc.
  • Background in SRE/DevOps concepts and implementation.
  • Experience in managing monitoring tools like CloudWatch, VictoriaMetrics, Prometheus and reporting with Snowflake and Sigma.
  • In depth knowledge of web technologies such as CloudFront, Nginx, etc.
  • Experience in designing, implementing or maintaining disaster recovery strategies and multi-region architecture to ensure high availability, resilience, and business continuity across critical systems.
  • Language ability in Japanese and English is a plus (We have a professional translator but it is nice to have language skills).

 

Remarks

*Please note that you cannot apply for PayPay (Japan-based jobs) or other positions in parallel or in duplicate.

PayPay 5 senses


Working Conditions 

Employment Status

  • Full Time

Office Location

  • Gurugram (Wework)

  ※The development center requires you to work in the Gurugram office to establish the strong core team.
   

Similar Jobs

Dynamis Inc - Principle Investigator/Senior Scientist

Dynamis Inc

Huntsville, Alabama, United States (On-Site)
2 Months ago
Riot Games - Principal Game Producer

Riot Games

Shanghai, Shanghai, China (On-Site)
3 Months ago
Zuora - Senior Solution Consultant - Fraud

Zuora

United States (Remote)
2 Weeks ago
The Walt Disney Company - Specialist, Brand & Content Marketing (MY), IM SEA

The Walt Disney Company

Petaling Jaya, Selangor, Malaysia (On-Site)
3 Months ago
Rippling - Senior Staff Software Engineer - Data Products

Rippling

San Francisco, California, United States (On-Site)
7 Months ago
Veeam Software - Senior Manager, APJ Cloud and Service Provider

Veeam Software

Singapore, Singapore (On-Site)
2 Months ago
Rackner - Kubernetes Engineer

Rackner

United States (Remote)
2 Months ago
Crowd Strick - Senior Software Engineer Cloud - Flight Control

Crowd Strick

United States (Remote)
4 Days ago
Scopely - Principal DevOps Engineer - Star Trek Fleet Command

Scopely

Dublin, County Dublin, Ireland (Hybrid)
4 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Dialpad AI - Senior Paid Media Manager, Search

Dialpad AI

San Ramon, California, United States (On-Site)
1 Week ago
dun bradstreet - .NET Software Engineer

dun bradstreet

Warsaw, Masovian Voivodeship, Poland (Hybrid)
3 Months ago
Glean - Technical Support Manager

Glean

Bengaluru, Karnataka, India (On-Site)
1 Week ago
Brainrider - Global Marketing Manager, Brand Solutions

Brainrider

United States (Remote)
1 Week ago
Well - Digital Product Marketing Associate

Well

Boston, Massachusetts, United States (On-Site)
1 Week ago
Roof Stacks - Business Analyst (Card Payment Systems)

Roof Stacks

Istanbul, İstanbul, Türkiye (Hybrid)
3 Weeks ago
NCR Atleos - Payments IT Application Analyst

NCR Atleos

Hyderabad, Telangana, India (Hybrid)
6 Days ago
bytedance - Senior Software Engineer, Edge Cloud Platform

bytedance

Seattle, Washington, United States (On-Site)
7 Months ago
PayPal - Manager, Software Engineering

PayPal

Bengaluru, Karnataka, India (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Gurugram, India

Autodesk - Principal Engineer - Salesforce

Autodesk

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Springer Group - Executive Assistant

Springer Group

Pune, Maharashtra, India (On-Site)
4 Days ago
Nagarro - Principal Engineer, PHP Drupal

Nagarro

India (Remote)
9 Months ago
Axi - Senior Backend Developer

Axi

Bengaluru, Karnataka, India (On-Site)
3 Days ago
Accenture - Order to Cash Operations Analyst

Accenture

Gurugram, India (On-Site)
1 Week ago
Capgemini - AAA Security

Capgemini

Mumbai, Maharashtra, India (On-Site)
2 Months ago
Assystems - Junior Structure CAD

Assystems

Bengaluru, Karnataka, India (On-Site)
9 Months ago
Skydio - Senior Software Engineer - Manufacturing Software

Skydio

Bengaluru, Karnataka, India (On-Site)
8 Months ago
Qualcomm - CPU RTL Design - Staff

Qualcomm

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Qualcomm - Engineer, Senior Staff/Manager-Platform Software Architect

Qualcomm

Hyderabad, Telangana, India (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Devops Jobs

GoTo Group - Principal SRE Engineer (SE5)

GoTo Group

Gurugram, Haryana, India (On-Site)
9 Months ago
Nagarro - Associate Principal Engineer, Cloud

Nagarro

Hyderabad, Telangana, India (On-Site)
9 Months ago
Canonical - Senior Site Reliability / Gitops Engineer

Canonical

(Remote)
2 Months ago
Luxoft - DevOps Engineer with Azure

Luxoft

Pune, Maharashtra, India (On-Site)
7 Months ago
Apple - Accessibility Software Automation Engineer

Apple

Cupertino, California, United States (On-Site)
1 Month ago
Flowable - Devops Architect

Flowable

Spain (Remote)
6 Days ago
Mashgin - Deployment Engineer - Georgia

Mashgin

Atlanta, Georgia, United States (Remote)
9 Months ago
Nice - Cloud Architect

Nice

Ra'anana, Center District, Israel (Hybrid)
1 Month ago
Gloss Genius - Senior Software Engineer, Platform

Gloss Genius

New York, United States (Hybrid)
1 Week ago
Qualcomm - Linux BSP / QNX Platform Staff Engineer

Qualcomm

Bengaluru, Karnataka, India (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

About The Company

PayPay corporation is a fintech company providing a service enjoyed by over 63 million users (as of April, 2024) merely 5 years since its launch in 2018 in Japan. The company is now home to a very diverse team of members from more than 50 countries. We grew to a team of several thousand employees in Japan but are far from over. We are still in the Day 1. PayPay India has been established in Gurugram, India in October 2022 as a first development center of PayPay outside of Japan.

Gurugram, India (On-Site)

Gurugram, India (On-Site)

Gurugram, India (On-Site)

Gurugram, India (On-Site)

Gurugram, India (On-Site)

Gurugram, India (On-Site)

Gurugram, India (On-Site)

Gurugram, India (On-Site)

View All Jobs

Get notified when new jobs are added by Pay2

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug