Software Developer - Online Service Quality

45 Minutes ago • 5 Years +
Programming

Job Description

Electronic Arts is seeking a Site Reliability Engineer (SRE) for the Battlefield Online Service Quality team. This role involves safeguarding the reliability, performance, and scalability of game services, focusing on observability, alerting, 24/7 on-call support, operational security, and cost optimization. The SRE will leverage expertise in Kubernetes, infrastructure as code (Jsonnet, Terraform), and automated certificate management to ensure uninterrupted gaming experiences at scale within a hybrid work model in Montreal.
Good To Have:
  • Direct experience operating massive online game infrastructures or similar high-demand digital platforms.
  • Experience working in a multi-team, distributed development environment supporting large-scale projects.
  • Background in automating operational tasks and continuously improving service delivery pipelines for expansive online systems.
Must Have:
  • Operate and maintain a large-scale online game portfolio, ensuring uptime, secure environments, and seamless player experiences.
  • Design, implement, and manage observability solutions (monitoring, logging, alerting) for vast, distributed systems.
  • Participate in a 24/7 on-call rotation, leading incident response and driving continuous improvement.
  • Track, analyze, and optimize infrastructure and service costs across Battlefield Online’s cloud ecosystem.
  • Automate infrastructure management using Jsonnet, Terraform, and custom tools for efficient scaling and rapid deployment.
  • Collaborate with engineering, operations, and product teams to enhance service quality, reliability, and scalability.
  • Analyze feature designs and propose technical implementation solutions.
Perks:
  • Holistic benefits programs emphasizing physical, emotional, financial, career, and community wellness.
  • Healthcare coverage
  • Mental well-being support
  • Retirement savings
  • Paid time off
  • Family leaves
  • Complimentary games

General Information

Locations: Montreal, Quebec, Canada

Role ID: 210115

Worker Type: Regular Employee

Studio/Department: EA Studios - Motive Montreal

Work Model: Hybrid

Description & Requirements

Electronic Arts creates next-level entertainment experiences that inspire players and fans around the world. Here, everyone is part of the story. Part of a community that connects across the globe. A place where creativity thrives, new perspectives are invited, and ideas matter. A team where everyone makes play happen.

Site Reliability Engineer – Service Quality team, Battlefield Online

Foundational Technology

Motive is a creative studio with offices in Montréal. We believe in the power of diversity and welcome game creators from all backgrounds to collaborate with us as we unlock the potential for the future of Battlefield!

We’re always pushing to be at the forefront of creative entertainment - blending digital art, design, and technology to push boundaries. Our collaborative culture is fueled by passion, driving innovation and making a positive difference for our players and community.

At Motive, your ideas matter. We offer an inclusive space where you can thrive, be yourself, and grow alongside a team dedicated to making a meaningful impact on the world of gaming.

We’re all-in on the future and our most ambitious Battlefield yet. Want to be part of something special? Read on.

The Role

We are seeking an SRE with outstanding capability for running and scaling massive online service infrastructure. You will be at the heart of our operations, safeguarding the reliability, performance, and scalability of our game services. Your expertise in managing distributed, complex environments will be vital as you focus on service observability, alerting, 24/7 on-call support, operational security and cost optimization, all while ensuring uninterrupted gaming experiences at scale.

You will leverage your deep knowledge of Kubernetes, infrastructure as code with Jsonnet and Terraform, along with automated certificate management to deliver robust, high-availability autoscaling environments. The ability to anticipate, identify, and resolve challenges in large systems will be central to your success in this role.

You will work in a small distributed team that collaborates to create solutions for the Battlefield Franchise, using modern technologies and frameworks deployed to cloud-based infrastructure. You will work with multiple existing systems, some developed here at Battlefield Foundational Tech and some developed externally, which will require collaborating with many different teams within EA. You will report to a Development Director.

You will work in a hybrid model, spending 3 days a week at the office in Montreal.

RESPONSIBILITIES:

  • Operate and maintain our large-scale online game portfolio, ensuring exceptional uptime, secure environments and seamless player experiences across global infrastructure.
  • Design, implement, and manage observability solutions—monitoring, logging, and alerting—capable of supporting vast, distributed systems.
  • Participate in a 24/7 on-call rotation, leading incident response for our high-traffic services and driving continuous improvement based on root cause analysis.
  • Track, analyze, and optimize infrastructure and service costs across Battlefield Online’s expansive cloud ecosystem.
  • Automate infrastructure management using Jsonnet, Terraform and custom tools, enabling efficient scaling and rapid deployment of services.
  • Collaborate closely with engineering, operations, and product teams to enhance the quality, reliability, and scalability of our services.
  • Analyze feature designs and propose technical solutions for how they can be implemented.

SKILLS:

  • Strong analytical skills to troubleshoot and solve complex technical challenges.
  • Excellent teamwork and communication skills for working in a cross-functional, globally distributed environment.
  • Linux administration experience for container orchestration platforms.
  • Experience with networking configuration and maintenance in public cloud environments.
  • Experience implementing data and infrastructure security best practices.

REQUIREMENTS:

  • 5+ years of experience in managing distributed, scalable, resilient, high-performing systems.
  • 5+ years of relevant experience with public cloud services (preferably AWS), including design, implementation, and operational support of critical systems.
  • Experience with container workload technologies such as Kubernetes, Helm, and Docker.
  • Experience with several DevOps tools and methodologies, including infrastructure as code and GitOps.
  • Experience with monitoring/observability systems such as Prometheus, Grafana, and Datadog.
  • Experience with continuous integration and delivery, using pipeline automation systems such as Jenkins, GitLab, and GitHub.
  • Experience developing automated solutions with at least one of the following languages: Python, Ruby, Go.
  • Demonstrated expertise in operating system and network security fundamentals for publicly accessible services hosted on Linux servers.
  • Experience implementing network resources in a public cloud context, including DNS, subnetting, route tables, NAT, and firewalls.

NICE TO HAVE:

  • Direct experience operating massive online game infrastructures or similar high-demand digital platforms.
  • Experience working in a multi-team, distributed development environment supporting large-scale projects.
  • Background in automating operational tasks and continuously improving service delivery pipelines for expansive online systems.

About Electronic Arts

We’re proud to have an extensive portfolio of games and experiences, locations around the world, and opportunities across EA. We value adaptability, resilience, creativity, and curiosity. From leadership that brings out your potential, to creating space for learning and experimenting, we empower you to do great work and pursue opportunities for growth.

We adopt a holistic approach to our benefits programs, emphasizing physical, emotional, financial, career, and community wellness to support a balanced life. Our packages are tailored to meet local needs and may include healthcare coverage, mental well-being support, retirement savings, paid time off, family leaves, complimentary games, and more. We nurture environments where our teams can always bring their best to what they do.

Electronic Arts is an equal opportunity employer. All employment decisions are made without regard to race, color, national origin, ancestry, sex, gender, gender identity or expression, sexual orientation, age, genetic information, religion, disability, medical condition, pregnancy, marital status, family status, veteran status, or any other characteristic protected by law. We will also consider for employment qualified applicants with criminal records in accordance with applicable law. EA also makes workplace accommodations for qualified individuals with disabilities as required by applicable law.
