Site Reliability Engineer

1 Month ago • 5-5 Years • DevOps

Job Summary

Job Description

As a Site Reliability Engineer at Escape Velocity, you'll be responsible for the delivery, scalability, and reliability of cloud-hosted game titles. You'll partner with game teams, implementing best practices and optimizing reliability, availability, observability, and cost. Responsibilities include analyzing and improving complex systems, owning projects from conception to completion, diagnosing and resolving operational concerns, and participating in on-call rotations. You'll work with tools like Prometheus, Grafana, Loki, Ansible, Terraform, and Cloudformation, and have experience with backend game engines and CI/CD systems.
Must have:
  • 5+ years SRE/DevOps experience
  • Experience with observability tools (Prometheus, Grafana, Loki)
  • GitOps workflows and version control
  • CI/CD system implementation & maintenance
  • IaC design (Ansible, Terraform, Cloudformation)
  • Backend game engine experience
  • Capacity planning and FinOps
  • Proficiency in Python, Kotlin, JavaScript, or C++
  • Strong Linux skills
  • Public cloud expertise
Perks:
  • Fully remote
  • Excellent benefits

Job Details

Description

What we are looking for:

As a Site Reliability Engineer at Escape Velocity, you will be a game maker, enabling the teams to create new ways to enhance experiences in interactive entertainment.

We are looking for an experienced Senior Site Reliability Engineer that brings a broad set of technical skills and achievements, development, and automation focused mindset to solving problems, who is eager to tackle a few of technology’s greatest challenges and make an impact on millions, of users.

Requirements

What we will do together:

  • Analyze, implement, and improve complex systems responsible for delivering our game to millions of fans
  • Own the delivery, scalability, and reliability of the cloud hosted game title
  • Partner with game team to advise on and implement best practices as we turn ideas into experiences
  • Take ownership of projects, seeing them through to completion in a timely manner while maintaining exacting standards for the quality of execution
  • Engage with product teams to diagnose and resolve operational concerns
  • Form and maintain relationships with internal and external partners to best support our peers and customers
  • Consistently aim to optimize reliability, availability, observability, and cost
  • Participate in on-call rotation that assists with business-critical incidents impacting our partners

What you will bring:

  • 5+ years of experience in a Site Reliability, Devops, or Platform engineering role
  • 5+ years of experience with observability, application monitoring and alerting, telemetry collection and data visualization using common tools (Prometheus, Grafana, Loki)
  • Experience with GitOps workflows and Helix Core / Perforce versioning system
  • Experience implementing and maintaining CI/CD systems - Buildkite, Github or Gitlab runners
  • Expertise in IaC design using combination of Ansible, Terraform and Cloudformation  
  • Experience with backend game engines such as Pragma, GameLift, Agones or others
  • Experience with capacity planning and FinOps
  • Proficient in one or more high-level languages (Ie: Python, Kotlin, JavaScript, C++)
  • Strong Linux Skills
  • Understanding of public cloud services, their use cases, automation, best practices and cost optimization 
  • Ability to analyze desired project outcomes and derive requirements that will take an idea from concept to completion

Benefits

Interested… a bit about Escape Velocity:

Escape Velocity is a team of passionate, talented developers working at a studio that offers excellent benefits and what we believe is a fantastic project. But that make us unique. What does make us special? We value the time, energy, and talent of our players and our colleagues. We want every hour spent developing and playing our games to be an incredibly rewarding and worthwhile use of your time. To that end, we are fully remote and support team members in the world.  We prioritize honesty, even when that means sharing . We strive to give everyone autonomy so they can determine how best to work and play, and we care more about what you bring to the game and studio than how many hours clocking. We believe in constant improvement across all areas – from the game itself to our production design – and are constantly re-evaluating how we spend our time to ensure that not wasting and all always working on the most important things. Finally (and importantly) dedicated to building a culture of play, which not only makes us better game makers, but also strengthens the bonds between us all.

 

How to apply:

Click apply and complete an application along with a Resume / CV. If we would like to move forward with your application, a member of the Recruiting Team will reach out to you and guide you through our process.

Escape Velocity is proud to be an equal opportunity employer, we are committed to hiring, promoting, and compensating employees based on their qualifications and demonstrated ability to perform job responsibilities.

Applications will be considered​ regardless ​of age, disability, gender identity, sexual orientation, religion, belief, race, or any other protected category.

Similar Jobs

Nagarro - Associate Distinguished Engineer

Nagarro

France (Remote)
6 Months ago
Aisera Jobs - Professional Services Engineer

Aisera Jobs

Palo Alto, California, United States (On-Site)
1 Day ago
Scopely - Senior Server Engineer (Platform)

Scopely

Lisbon, Lisbon, Portugal (Hybrid)
2 Months ago
Next Level Business Services - DevOps Engineer

Next Level Business Services

Redmond, Washington, United States (On-Site)
6 Months ago
PlayStation Global - DevOps/Build Engineer

PlayStation Global

Los Angeles, California, United States (On-Site)
3 Weeks ago
Microsoft - Senior Software Engineer

Microsoft

(On-Site)
1 Week ago
Google - Customer Engineer III, Navy

Google

San Diego, California, United States (On-Site)
1 Week ago
SmileGate - SRE Strategy Project Manager

SmileGate

Seongnam-si, Gyeonggi-do, South Korea (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Aristocrat Gaming - Software Engineer

Aristocrat Gaming

Las Vegas, Nevada, United States (Hybrid)
2 Months ago
Canva - Senior Software Engineer - Video Compositor (Fullstack)

Canva

Auckland, Auckland, New Zealand (Remote)
2 Months ago
Every matrix - Middle QA Tester

Every matrix

Bucharest, Bucharest, Romania (Hybrid)
1 Week ago
Ubisoft - Senior Graphic Technical Art Director

Ubisoft

Montreal, Quebec, Canada (On-Site)
1 Month ago
Vercel - Software Engineer, Observability

Vercel

(Remote)
7 Hours ago
Dream11 - SDE 2 - Frontend

Dream11

Mumbai, Maharashtra, India (On-Site)
6 Months ago
Google - Technical Solutions Engineer, Infrastructure Compute

Google

Bengaluru, Karnataka, India (On-Site)
2 Weeks ago
Google - Senior Advertising Solutions Architect

Google

Tokyo, Japan (On-Site)
2 Weeks ago
Netflix - Full Stack Software Engineer 4 - Game Lifecycle Engineering

Netflix

United States (Remote)
2 Weeks ago
Mouser Electronics - Web Developer I

Mouser Electronics

Bengaluru, Karnataka, India (On-Site)
7 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Worldwide

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

DevOps Jobs

Google - Cloud Technical Solutions Engineer, Infrastructure

Google

Tokyo, Japan (On-Site)
1 Week ago
NVIDIA - DevOps Engineering Intern, DGXC Console - Fall 2025

NVIDIA

Washington, United States (On-Site)
2 Weeks ago
NVIDIA - Senior Site Reliability Engineer - AI Research Clusters

NVIDIA

Westford, Massachusetts, United States (Hybrid)
2 Months ago
Google - Infrastructure Engineer, Google Distributed Cloud

Google

Cambridge, Massachusetts, United States (On-Site)
2 Weeks ago
Google - Staff Software Engineer, Site Reliability Engineering

Google

Sydney, New South Wales, Australia (On-Site)
2 Weeks ago
Google - Staff Software Engineer, Site Reliability Engineering

Google

Dublin, County Dublin, Ireland (On-Site)
1 Week ago
Inworld AI - Staff Platform Engineer - USA

Inworld AI

Mountain View, California, United States (On-Site)
5 Months ago
N-iX - Senior Engineer with AWS Greengrass Expertise

N-iX

Ukraine (Remote)
2 Months ago
Google - Customer Engineer, Data Management, Google Cloud

Google

Riyadh, Riyadh Province, Saudi Arabia (On-Site)
1 Week ago
Microsoft - Technical Support Engineer - Azure Monitoring

Microsoft

Taipei City, Taiwan (Hybrid)
2 Weeks ago

Get notifed when new similar jobs are uploaded