SREII

Electronic Arts

Job Summary

As a Software Engineer II on the Site Reliability Engineering (SRE) team at Electronic Arts, you will contribute to the design, automation, and operation of large-scale, cloud-based systems powering EA’s global gaming platform. This role involves enhancing service reliability, scalability, and performance across various game studios. Key responsibilities include building and operating scalable systems, developing automation scripts, monitoring, participating in incident response, contributing to CI/CD pipelines, and collaborating on reliability and performance engineering. You will also engage in post-incident reviews and continuous improvement efforts, working closely with senior SREs.

Must Have

  • Contribute to the design, automation, and operation of large-scale, cloud-based systems.
  • Support the development, deployment, and maintenance of distributed, cloud-based infrastructure.
  • Develop automation scripts, tools, and workflows to reduce manual effort and improve system reliability.
  • Create and maintain dashboards, alerts, and metrics for system visibility.
  • Participate in on-call rotations and assist in incident response and root cause analysis.
  • Contribute to the design, implementation, and maintenance of CI/CD pipelines.
  • Collaborate with cross-functional teams to identify reliability bottlenecks and implement improvements.
  • Participate in root cause analyses, document learnings, and contribute to preventive measures.
  • Maintain detailed operational documentation and runbooks.
  • 3-5 years of experience in Cloud Computing (AWS preferred), Virtualization, and Containerization.
  • Extensive hands-on experience in container orchestration technologies (EKS, Kubernetes, Docker).
  • Experience supporting production-grade, high-availability systems with defined SLIs/SLOs.
  • Strong Linux/Unix administration and networking fundamentals.
  • Hands-on experience with Infrastructure as Code and automation tools (Terraform, Helm, Ansible, Chef).
  • Proficiency in Python, Golang, Bash, or Java for scripting and automation.
  • Familiarity with monitoring and observability tools (Prometheus, Grafana, Loki, Datadog).
  • Exposure to distributed systems, SQL/NoSQL databases, and CI/CD pipelines.
  • Strong problem-solving, troubleshooting, and collaboration skills.

Perks & Benefits

  • Healthcare coverage
  • Mental well-being support
  • Retirement savings
  • Paid time off
  • Family leaves
  • Complimentary games

Job Description

General Information

Locations: Hyderabad, Telangana, India

Role ID

211510

Worker Type

Regular Employee

Studio/Department

CT - IT

Work Model

Hybrid

Description & Requirements

Electronic Arts creates next-level entertainment experiences that inspire players and fans around the world. Here, everyone is part of the story. Part of a community that connects across the globe. A place where creativity thrives, new perspectives are invited, and ideas matter. A team where everyone makes play happen.

SEII / SRE Engineer

As a Software Engineer II on the Site Reliability Engineering (SRE) team, you will contribute to the design, automation and operation of large-scale, cloud-based systems that power EA’s global gaming platform. You will work closely with senior engineers to enhance service reliability, scalability and performance across multiple game studios and services.

Responsibilities:

  • Build and Operate Scalable Systems: Support the development, deployment, and maintenance of distributed, cloud-based infrastructure leveraging modern open-source technologies (AWS/GCP/Azure, Kubernetes, Terraform, Docker, etc.).
  • Platform Operations and Automation: Develop automation scripts, tools, and workflows to reduce manual effort, improve system reliability, and optimize infrastructure operations (reducing MTTD and MTTR).
  • Monitoring, Alerting & Incident Response: Create and maintain dashboards, alerts, and metrics to improve system visibility and proactively identify issues. Participate in on-call rotations and assist in incident response and root cause analysis.
  • Continuous Integration / Continuous Deployment (CI/CD): Contribute to the design, implementation, and maintenance of CI/CD pipelines to ensure consistent, repeatable, and reliable deployments.
  • Reliability and Performance Engineering: Collaborate with cross-functional teams to identify reliability bottlenecks, define SLIs/SLOs/SLAs, and implement improvements that enhance the stability and performance of production services.
  • Post-Incident Reviews & Documentation: Participate in root cause analyses, document learnings, and contribute to preventive measures to avoid recurrence of production issues. Maintain detailed operational documentation and runbooks.
  • Collaboration & Mentorship: Work closely with senior SREs and software engineers to gain exposure to large-scale systems, adopt best practices, and gradually take ownership of more complex systems and initiatives.
  • Modernization & Continuous Improvement: Contribute to ongoing modernization efforts by identifying areas for improvement in automation, monitoring, and reliability.

Qualifications – Software Engineer II (Site Reliability Engineer)

  • 3–5 years of experience in Cloud Computing (AWS preferred), Virtualization, and Containerization using Kubernetes, Docker, or VMWare. And Extensive hands-on experience in container orchestration technologies, such as EKS, Kubernetes, Docker
  • Experience supporting production-grade, high-availability systems with defined SLIs/SLOs.
  • Strong Linux/Unix administration and networking fundamentals (protocols, load balancing, DNS, firewalls).
  • Hands-on experience with Infrastructure as Code and automation tools such as Terraform, Helm, Ansible, or Chef..
  • Proficiency in Python, Golang, Bash, or Java for scripting and automation.
  • Familiar with monitoring and observability tools like Prometheus, Grafana, Loki, or Datadog.
  • Exposure to distributed systems, SQL/NoSQL databases, and CI/CD pipelines.
  • Strong problem-solving, troubleshooting, and collaboration skills in cross-functional environments.

About Electronic Arts

We’re proud to have an extensive portfolio of games and experiences, locations around the world, and opportunities across EA. We value adaptability, resilience, creativity, and curiosity. From leadership that brings out your potential, to creating space for learning and experimenting, we empower you to do great work and pursue opportunities for growth.

We adopt a holistic approach to our benefits programs, emphasizing physical, emotional, financial, career, and community wellness to support a balanced life. Our packages are tailored to meet local needs and may include healthcare coverage, mental well-being support, retirement savings, paid time off, family leaves, complimentary games, and more. We nurture environments where our teams can always bring their best to what they do.

Electronic Arts is an equal opportunity employer. All employment decisions are made without regard to race, color, national origin, ancestry, sex, gender, gender identity or expression, sexual orientation, age, genetic information, religion, disability, medical condition, pregnancy, marital status, family status, veteran status, or any other characteristic protected by law. We will also consider employment qualified applicants with criminal records in accordance with applicable law. EA also makes workplace accommodations for qualified individuals with disabilities as required by applicable law.

26 Skills Required For This Role

Cross Functional Problem Solving Game Texts Networking Dns Incident Response Linux Aws Nosql Load Balancing Azure Unix Prometheus Ansible Terraform Grafana Chef Helm Vmware Ci Cd Docker Kubernetes Python Sql Bash Java