Senior Observability Engineer

1 Month ago • 4-8 Years • Backend Development • $186,133 PA - $272,995 PA

Job Summary

Job Description

The Senior Observability Engineer at Epic Games will build and operate the infrastructure supporting their online services used by over half a billion players. Responsibilities include service ownership, developing and shipping new data processing pipelines for telemetry data, automating processes, and collaborating across teams as an observability expert. This role requires experience with large-scale systems in AWS (Kubernetes, Terraform), application monitoring (OpenTelemetry, Prometheus, Grafana, etc.), and working in a fast-paced environment. The team handles company-wide metrics, logging, exception handling, and dashboarding solutions.
Must have:
  • Experience with large-scale AWS systems (Kubernetes)
  • Proficient in Terraform
  • Expertise in application monitoring tools (OpenTelemetry, Prometheus, etc.)
  • Experience in a fast-paced, interrupt-driven environment
  • Service ownership mentality
Perks:
  • 100% premium coverage for medical, dental, vision
  • Long-term disability and life insurance
  • 401k with competitive match
  • Mental well-being program
  • Unlimited PTO and sick time
  • Paid sabbatical after 7 years

Job Details

WHAT MAKES US EPIC?

At the core of Epic’s success are talented, passionate people. Epic prides itself on creating a collaborative, welcoming, and creative environment. Whether it’s building award-winning games or crafting engine technology that enables others to make visually stunning interactive experiences, we’re always innovating.

Being Epic means being a part of a team that continually strives to do right by our community and users. We’re constantly innovating to raise the bar of engine and game development.

ONLINE INFRASTRUCTURE

What We Do

We enable Epic’s online services teams to build, deploy, and manage services that are used by more than half a billion players around the world. Our mission is to provide world-class tools and platforms that improve the experience of our developers and make it easier, faster, and safer to build, operate, and scale their applications. We operate at massive scale as one of the largest cloud computing users in the world.

What You'll Do

Our Observability team is looking for a Senior SRE to help us build and operate the infrastructure our teams rely on to keep our platforms, games, and online services running. Our Observability team works across all of Epic to implement industry best practices and develop new monitoring capabilities. As an SRE on Observability, you will tackle problems that impact how we understand and operate our products at scale. This team is responsible for company-wide metrics, logging, exception handling, and dashboarding solutions. In this role, you will build and operate the systems that process and transport the large volumes of telemetry data generated by services at Epic.

In this role, you will

  • Service Ownership - At Epic we embrace a Service Owner (You build it, you run it) mentality. In this role, you will work together with other members of the Observability team to operate the infrastructure our developers depend on to operate their own services.
  • Develop and Ship - You will work to modernize key portions of our observability infrastructure, building new data processing pipelines for telemetry data and writing software to automate processes and generate new insights.
  • Collaborate - You will work with teams across Epic as an observability subject matter expert to provide guidance on observability best practices.

What we're looking for

  • Experience executing meaningful change in a fast-paced, interrupt-driven environment.
  • A self-starter who approaches challenges creatively and methodically, seeing them through to final resolution.
  • Ability to adapt and be effective in new situations within a highly dynamic environment.
  • Experience working with large-scale systems in AWS, mostly deployed via Kubernetes.
  • Comfortable in a very Terraform-heavy environment, both reviewing PRs and contributing changes yourself.
  • Familiarity with application and service monitoring strategies and technologies, such as OpenTelemetry, Prometheus, Grafana, FluentD, New Relic, Datadog, Sentry, and Sumo Logic.

This role is open to multiple locations across North America (including CA).

EPIC JOB + EPIC BENEFITS = EPIC LIFE

Our intent is to cover all things that are medically necessary and improve the quality of life. We pay 100% of the premiums for both you and your dependents. Our coverage includes Medical, Dental, a Vision HRA, Long Term Disability, Life Insurance & a 401k with competitive match. We also offer a robust mental well-being program through Modern Health, which provides free therapy and coaching for employees & dependents. Throughout the year we celebrate our employees with events and company-wide paid breaks. We offer unlimited PTO and sick time and recognize individuals for 7 years of employment with a paid sabbatical.

Pay Transparency Information

The expected annual base pay range(s) for this position are detailed below. Each base pay range is relevant only for individuals who are residents of or will be expected to work within the specified locale. Compensation varies based on a variety of factors, which include (but aren’t limited to) things such as skills and competencies, qualifications, knowledge, and experience. In addition to base pay, most employees are eligible to participate in Epic’s generous benefit plans and discretionary incentive programs (subject to the terms of those plans or programs).

California Base Pay Range
$186,133 - $272,995 USD

ABOUT US

Epic Games spans 25 countries with 46 studios and 4,500+ employees globally. For over 25 years, we've been making award-winning games and engine technology that empowers others to make visually stunning games and 3D content that bring environments to life like never before. Epic's award-winning Unreal Engine technology not only gives game developers the ability to build high-fidelity, interactive experiences for PC, console, mobile, and VR, it is also a tool being embraced by content creators across a variety of industries such as media and entertainment, automotive, and architectural design. As we continue to build our Engine technology and develop remarkable games, we strive to build teams of world-class talent.

Like what you hear? Come be a part of something Epic!

Epic Games deeply values diverse teams and an inclusive work culture, and we are proud to be an Equal Opportunity employer. Learn more about our Equal Employment Opportunity (EEO) Policy.

Note to Recruitment Agencies: Epic does not accept any unsolicited resumes or approaches from any unauthorized third party (including recruitment or placement agencies) (i.e., a third party with whom we do not have a negotiated and validly executed agreement). We will not pay any fees to any unauthorized third party.

About The Company

Founded in 1991, Epic Games is a leading interactive entertainment company and provider of 3D engine technology. Epic operates Fortnite, one of the world’s largest games with over 350 million accounts and 2.5 billion friend connections. Epic also develops Unreal Engine, which powers the world’s leading games and is adopted across industries such as film and television, architecture, automotive, manufacturing, and simulation. Through Unreal Engine, Epic Games Store, and Epic Online Services, Epic provides an end-to-end digital ecosystem for developers and creators to build, distribute, and operate games and other content. Epic has over 40 offices worldwide with headquarters in Cary, North Carolina.
