Cloud Platform Engineer (K8s & NW Specialist)

undefined ago • 6 Years + • Devops

Job Summary

Job Description

We are seeking a highly skilled and experienced Platform Engineer to manage and enhance our entire application delivery platform, from Cloudfront to the underlying EKS clusters and their associated components. The ideal candidate will possess deep expertise across cloud infrastructure, networking, Kubernetes, and service mesh technologies, coupled with strong programming skills. This role involves maintaining the stability, scalability, and performance of our production environment, including day-to-day operations, upgrades, troubleshooting, and developing in-house tools.
Must have:
  • Perform regular upgrades and patching of EKS clusters and associated components.
  • Oversee the health, performance, and scalability of the EKS clusters.
  • Manage and optimize Karpenter (cluster autoscaling) and ArgoCD (GitOps continuous delivery).
  • Implement and manage service mesh solutions (e.g., Istio, Linkerd).
  • Participate in an on-call rotation to provide 24/7 support for critical platform issues.
  • Monitor the platform for potential issues and implement preventative measures.
  • Develop, maintain, and automate in-house tools and scripts using Python or Go.
  • Configure and manage CloudFront distributions, WAF Policies.
  • Develop and maintain documentation for platform architecture, processes, and troubleshooting guides.
  • Proven 6+ Years experience as a Platform Engineer, Site Reliability Engineer (SRE), or similar role.
  • In-depth knowledge and hands-on experience of at least 4 years with Amazon EKS and Kubernetes.
  • Strong understanding and practical experience with Karpenter, ArgoCD, Terraform.
  • Solid grasp of core networking concepts and extensive experience of at least 5 years with AWS networking services.
  • Demonstrable experience with SSL/TLS certificate management.
  • Proficiency in programming languages such as Python or Go for automation scripts and internal tools.
  • Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack).
  • Excellent problem-solving and debugging skills across complex distributed systems.
  • Strong communication and collaboration abilities.
  • Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent practical experience).
Good to have:
  • Prior experience working with service mesh technologies (preferably Istio) in a production environment.
  • Experience building or contributing to Kubernetes Controllers.
  • Experience with multi-cluster Kubernetes architectures.
  • Experience building AZ isolated, DR architectures.

Job Details

We are seeking a highly skilled and experienced Platform Engineer to manage and enhance our entire application delivery platform, from Cloudfront to the underlying EKS clusters and their associated components. The ideal candidate will possess deep expertise across cloud infrastructure, networking, Kubernetes, and service mesh technologies, coupled with strong programming skills. This role involves maintaining the stability, scalability, and performance of our production environment, including day-to-day operations, upgrades, troubleshooting, and developing in-house tools.

Main Responsibilities

  • Perform regular upgrades and patching of EKS clusters and associated components & oversee the health, performance, and scalability of the EKS clusters.
  • Manage and optimize related components such as Karpenter (cluster autoscaling) and ArgoCD (GitOps continuous delivery).
  • Implement and manage service mesh solutions (e.g., Istio, Linkerd) for enhanced traffic management, security, and observability.
  • Participate in an on-call rotation to provide 24/7 support for critical platform issues and monitor the platform for potential issues and implement preventative measures.
  • Develop, maintain, and automate in-house tools and scripts using programming languages like Python or Go to improve platform operations and efficiency.
  • Configure and manage CloudFront distributions, WAF Policies for efficient & secure content delivery & routing.
  • Develop and maintain documentation for platform architecture, processes, and troubleshooting guides.

Tech Stack

  • AWS:
  • VPC, EC2, ECS, EKS, Lambda, Cloudfront, WAF, MWAA, RDS, ElastiCache, DynamoDB, Opensearch, S3, CloudWatch, Cognito, SQS, KMS, Secret Manager, KMS, MSK
  • Terraform, Github Actions, Prometheus, Grafana, Atlantis, ArgoCD, OpenTelemetry

Required Skills and Experiences

  • Proven 6+ Years experience as a Platform Engineer, Site Reliability Engineer (SRE), or similar role with a focus on end-to-end platform ownership.
  • In-depth knowledge and hands-on experience of at least 4 years with Amazon EKS and Kubernetes.
  • Strong understanding and practical experience with Karpenter, ArgoCD, Terraform..
  • Solid grasp of core networking concepts and extensive experience of at least 5 years with AWS networking services (VPC, Security Groups, Network ACLs, CloudFront, WAF, ALB, DNS).
  • Demonstrable experience with SSL/TLS certificate management.
  • Proficiency in programming languages such as Python or Go for developing and maintaining automation scripts and internal tools.
  • Experience with monitoring and logging tools (e.g., Prometheus, Grafana, ELK stack).
  • Excellent problem-solving and debugging skills across complex distributed systems.
  • Strong communication and collaboration abilities.
  • Bachelor's degree in Computer Science, Engineering, or a related field (or equivalent practical experience).

Preferred Qualifications

  • Prior experience working with service mesh technologies (preferably Istio) in a production environment.
  • Experience building or contributing to Kubernetes Controllers.
  • Experience with multi-cluster Kubernetes architectures.
  • Experience building AZ isolated, DR architectures.

Remarks

*Please note that you cannot apply for PayPay (Japan-based jobs) or other positions in parallel or in duplicate.

PayPay 5 senses

to learn what we value at work.

***

Working Conditions

Employment Status

  • Full Time

Office Location

  • Gurugram (Wework)

※The development center requires you to work in the Gurugram office to establish the strong core team.

Similar Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Similar Skill Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Jobs in Gurugram, India

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Devops Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

About The Company

PayPay corporation is a fintech company providing a service enjoyed by over 63 million users (as of April, 2024) merely 5 years since its launch in 2018 in Japan. The company is now home to a very diverse team of members from more than 50 countries. We grew to a team of several thousand employees in Japan but are far from over. We are still in the Day 1. PayPay India has been established in Gurugram, India in October 2022 as a first development center of PayPay outside of Japan.

Gurugram, India (On-Site)

Gurugram, India (On-Site)

Gurugram, India (On-Site)

Gurugram, India (On-Site)

Gurugram, India (On-Site)

Gurugram, India (On-Site)

Gurugram, India (On-Site)

Gurugram, India (On-Site)

View All Jobs

Get notified when new jobs are added by Pay2

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug