Site Reliability Engineer - Azure

19 Minutes ago • All levels
Devops

Job Description

This role is for a Site Reliability Engineer focusing on Azure at PhonePe, a leading Indian digital payments company. The engineer will be responsible for improving the reliability, performance, and efficiency of distributed systems and containerized deployments. Key tasks include diagnosing complex systems, automating tasks with scripting languages, troubleshooting across the entire stack, and participating in 24x7 on-call rotations. The position involves designing and maintaining core infrastructure to support millions of users and driving performance, capacity, and high availability initiatives.
Good To Have:
  • Knowledge in Database technologies, specifically in MySQL/NoSQL.
Must Have:
  • Improve reliability and performance of distributed systems and containerized deployments.
  • Diagnose and troubleshoot complex distributed systems handling millions of queries per second.
  • Possess knowledge of Linux cloud services using KVM/QEMU/LVM.
  • Have knowledge of containerization technologies like Docker for deployment and troubleshooting.
  • Understand Azure platform to set up, configure, monitor, and troubleshoot PaaS components.
  • Demonstrate in-depth knowledge in Perl/GoLang/Python for task automation.
  • Have a strong understanding of Linux for command-line driven work.
  • Troubleshoot issues across the entire stack: hardware, software, application, and network.
  • Participate in 24x7 on-call rotations.
  • Design, build, and maintain core infrastructure for scaling to concurrent users.
  • Actively take part in analysis and system improvement plans.
  • Drive performance testing, capacity planning, and high availability practices.
  • Own implementations of new technologies, ensuring proper testing and documentation.
  • Proactively monitor, identify, and solve issues impacting infrastructure.
  • Buddy new team members to get them production ready.
Perks:
  • Medical Insurance
  • Critical Illness Insurance
  • Accidental Insurance
  • Life Insurance
  • Employee Assistance Program
  • Onsite Medical Center
  • Emergency Support System
  • Maternity Benefit
  • Paternity Benefit Program
  • Adoption Assistance Program
  • Day-care Support Program
  • Relocation benefits
  • Transfer Support Policy
  • Travel Policy
  • Employee PF Contribution
  • Flexible PF Contribution
  • Gratuity
  • NPS
  • Leave Encashment
  • Higher Education Assistance
  • Car Lease
  • Salary Advance Policy

Add these skills to join the top 1% applicants for this job

problem-solving
team-player
talent-acquisition
game-texts
performance-testing
mysql
linux
nosql
azure
kvm
docker
python
perl

About the Company:

Headquartered in India, its flagship product, the digital payments app, was launched in Aug 2016. As of April 2025, the company has over 60 Crore (600 Million) registered users and a digital payments acceptance network spread across over 4 Crore (40+ million) merchants. The company also processes over 33 Crore (330+ Million) transactions daily with an Annualized Total Payment Value (TPV) of over INR 150 lakh crore.

The company’s portfolio of businesses includes the distribution of financial products (Insurance, Lending, and Wealth) as well as new consumer tech businesses (Pincode - hyperlocal e-commerce and Indus AppStore Localized App Store for the Android ecosystem) in India, which are aligned with the company’s vision to offer every Indian an equal opportunity to accelerate their progress by unlocking the flow of money and access to services.

Culture:

At the company, we go the extra mile to make sure you can bring your best self to work, Everyday!. And that starts with creating the right environment for you. We empower people and trust them to do the right thing. Here, you own your work from start to finish, right from day one. Our employees solve complex problems and execute quickly; often building frameworks from scratch. If you’re excited by the idea of building platforms that touch millions, ideating with some of the best minds in the country and executing on your dreams with purpose and speed, join us!

We are looking for engineers who are passionate about reliability, performance, and efficiency, and with experience in building tools, services, and automation to manage and improve production services.

  • Systems internals/security, Linux, Network, and Monitoring
  • work to improve the reliability and performance of the next generation of distributed systems and containerized deployments
  • Diagnose and troubleshoot complex distributed systems handling millions of queries per second
  • Knowledge of Linux cloud services using kvm/qemu/lvm.
  • Knowledge of containerization technologies like docker and deployment and troubleshooting of containers
  • Understanding of cloud platform Azure, ability to set up, configure, monitor and troubleshoot various PaaS components like Firewalls, VPN gateways, Load Balancers, Storage accounts, Networks and others
  • In-depth knowledge in Perl/GoLang/Python to automate tasks with minimal intervention.
  • Day-to-day work is heavily command-line driven, which requires a strong understanding of Linux.
  • Troubleshoot issues across the entire stack - hardware, software, application, and network
  • Knowledge in Database technologies, specifically in MySQL/NoSQL is good to have.
  • Participate in 24x7 on-call rotations.
  • Design, build and maintain core infrastructure that enables the company scaling to support hundreds of thousands of concurrent users.
  • Actively take part in the Analysis and System improvement plan.
  • Drive performance testing, capacity planning and high availability practices.
  • Own implementations of new technologies while ensuring proper testing and documentation.
  • Proactively monitor/identify/solve issues which could have a potential impact to our Infrastructure.
  • Natural team player and also have a resourceful attitude.
  • Buddy new team members, and get them production ready.

Full Time Employee Benefits (Not applicable for Intern or Contract Roles)

  • Insurance Benefits - Medical Insurance, Critical Illness Insurance, Accidental Insurance, Life Insurance
  • Wellness Program - Employee Assistance Program, Onsite Medical Center, Emergency Support System
  • Parental Support - Maternity Benefit, Paternity Benefit Program, Adoption Assistance Program, Day-care Support Program
  • Mobility Benefits - Relocation benefits, Transfer Support Policy, Travel Policy
  • Retirement Benefits - Employee PF Contribution, Flexible PF Contribution, Gratuity, NPS, Leave Encashment
  • Other Benefits - Higher Education Assistance, Car Lease, Salary Advance Policy

Our inclusive culture promotes individual expression, creativity, innovation, and achievement and in turn helps us better understand and serve our customers. We see ourselves as a place for intellectual curiosity, ideas and debates, where diverse perspectives lead to deeper understanding and better quality results. The company is an equal opportunity employer and is committed to treating all its employees and job applicants equally; regardless of gender, sexual preference, religion, race, color or disability. If you have a disability or special need that requires assistance or reasonable accommodation, during the application and hiring process, including support for the interview or onboarding process, please fill out this form.

Read more about the company on our blog.

Life at the company

The company in the news

Set alerts for more jobs like Site Reliability Engineer - Azure
Set alerts for new jobs by PhonePe
Set alerts for new Devops jobs in India
Set alerts for new jobs in India
Set alerts for Devops (Remote) jobs

Contact Us
hello@outscal.com
Made in INDIA 💛💙