Software Engineer - Cloud SRE

7 Minutes ago • 5 Years +
Software Development & Engineering

Job Description

eBay is seeking an experienced Site Reliability Engineer (SRE) to join its Cloud Infrastructure team. This role focuses on enhancing the reliability, scalability, design, development, deployment, and operation of self-service platforms for application lifecycle management. The ideal candidate will have a deep understanding of cloud-native technologies, containers, Kubernetes, and strong programming skills in Go and Python, with at least 5 years of experience in large-scale, distributed systems.
Good To Have:
  • Certifications in Kubernetes, lifecycle management or related fields.
  • Understanding application lifecycle management, CI/CD.
  • Experience in a high-traffic, large-scale environment.
  • Familiarity with additional programming languages or frameworks.
  • Proficiency in Agile development methodologies.
  • Experience in participating in open-source standards and contributing to open-source projects.
Must Have:
  • Collaborate with internal customers and partners to deliver key business outcomes.
  • Ensure cloud products are reliable, scalable, efficient, and compliant with security and operational standards.
  • Enhance observability practices for comprehensive monitoring and alerting.
  • Respond to cloud incidents, perform root cause analysis, and implement corrective actions.
  • Analyze system performance metrics and make recommendations for improvements.
  • Drive improvements in CI/CD processes to increase deployment velocity and reliability.
  • Develop and maintain automation to streamline operations and enhance system reliability.
  • Minimum of 3+ years of programming experience with Go or Python.
  • 5+ years experience implementing large-scale, distributed, high-availability, fault-tolerant systems and infrastructure.
  • Proficiency in delivering products within a multi-functional team environment.
  • Demonstrated expertise in observability tools and practices.
  • Extensive experience with Kubernetes as an SRE, or related cloud infrastructure and cloud-native technologies.
  • Deep understanding of API design and RESTful principles, with experience in building web services at scale.

Add these skills to join the top 1% applicants for this job

performance-analysis
game-texts
agile-development
incident-response
ci-cd
kubernetes
python

We are seeking an experienced Site Reliability Engineer (SRE) to join our dynamic Cloud Infrastructure team at eBay in Dublin, Ireland. This role demands a deep understanding of cloud-native technologies, particularly containers and Kubernetes, along with strong programming skills in languages such as Go and Python. The ideal candidate will have a proven track record of at least 3 years in the field, focusing on enhancing the reliability, scalability, design, development, deployment, and operation of self-service platforms that facilitate the lifecycle management of applications supporting eBay's products and services.

Responsibilities

  • Collaborate with internal customers and partners to deliver key business outcomes.
  • Ensure that cloud products are reliable, scalable, efficient, and compliant with eBay's security and operational standards.
  • Enhance observability practices to ensure comprehensive monitoring and alerting across cloud services.
  • Respond to cloud incidents, perform root cause analysis, and implement corrective actions to prevent future occurrences. Develop and maintain incident response plans.
  • Analyze system performance metrics and make recommendations for improvements. Implement changes to optimize resource utilization and improve application performance.
  • Drive improvements in CI/CD processes to increase deployment velocity and reliability.
  • Develop and maintain automation to streamline operations, reduce manual work, and enhance system reliability.

Requirements

  • Minimum of 3+ years of programming experience with Go or Python.
  • 5+ years of experience in implementing large-scale, distributed, high-availability, fault-tolerant systems and infrastructure in a production environment.
  • Proficiency in delivering products within a multi-functional team environment.
  • Demonstrated expertise in observability tools and practices, ensuring system reliability and performance.
  • Extensive experience with Kubernetes as an SRE, or related cloud infrastructure and cloud-native technologies. Experience in developing with Kubernetes and/or building Kubernetes controllers is highly desirable.
  • Deep understanding of API design and RESTful principles, with experience in building web services at scale.

Preferred Skills:

  • Certifications in Kubernetes, lifecycle management or related fields.
  • Understanding application lifecycle management, CI/CD is a plus.
  • Experience in a high-traffic, large-scale environment.
  • Familiarity with additional programming languages or frameworks.
  • Proficiency in Agile development methodologies.
  • Experience in participating in open-source standards and contributing to open-source projects is a plus.

Set alerts for more jobs like Software Engineer - Cloud SRE
Set alerts for new jobs by eBay
Set alerts for new Software Development & Engineering jobs in India
Set alerts for new jobs in India
Set alerts for Software Development & Engineering (Remote) jobs
Contact Us
hello@outscal.com
Made in INDIA 💛💙