Senior Observability Engineer

1 Month ago • 4-8 Years • Backend Development • $186,133 PA - $272,995 PA

Job Summary

Job Description

The Senior Observability Engineer at Epic Games will build and operate the infrastructure supporting Epic’s online services, which are used by more than half a billion players. Responsibilities include service ownership, developing and shipping new data processing pipelines for telemetry data, automating processes, and collaborating across teams as an observability expert. This role requires experience with large-scale systems in AWS (Kubernetes, Terraform), application monitoring (OpenTelemetry, Prometheus, Grafana, etc.), and working in a fast-paced environment. The team handles company-wide metrics, logging, exception handling, and dashboarding solutions.
Must have:
  • Experience with large-scale AWS systems (Kubernetes)
  • Proficient in Terraform
  • Expertise in application monitoring tools (OpenTelemetry, Prometheus, etc.)
  • Experience in a fast-paced, interrupt-driven environment
  • Service ownership mentality
Perks:
  • 100% premium coverage for medical, dental, vision
  • Long-term disability and life insurance
  • 401k with competitive match
  • Mental well-being program
  • Unlimited PTO and sick time
  • Paid sabbatical after 7 years

Job Details

WHAT MAKES US EPIC?

At the core of Epic’s success are talented, passionate people. Epic prides itself on creating a collaborative, welcoming, and creative environment. Whether it’s building award-winning games or crafting engine technology that enables others to make visually stunning interactive experiences, we’re always innovating.

Being Epic means being a part of a team that continually strives to do right by our community and users. We’re constantly innovating to raise the bar of engine and game development.

ONLINE INFRASTRUCTURE

What We Do

We enable Epic’s online services teams to build, deploy, and manage services that are used by more than half a billion players around the world. Our mission is to provide world class tools and platforms to improve the experience of our developers and make it easier, faster, and safer to build, operate, and scale their applications. We operate at massive scale as one of the largest cloud computing users in the world.

What You'll Do

Our Observability team is looking for a Senior SRE to help us build and operate the infrastructure our teams rely on to keep our platforms, games, and online services running. Our Observability team works across all of Epic to implement industry best practices and develop new monitoring capabilities. As an SRE on Observability, you will tackle problems that impact how we understand and operate our products at scale. This team is responsible for company-wide metrics, logging, exception handling, and dashboarding solutions. In this role, you will build and operate the systems that process and transport the large volumes of telemetry data generated by services at Epic.

In this role, you will

  • Service Ownership - At Epic we embrace a Service Owner (You build it, you run it) mentality. In this role, you will work together with other members of the Observability team to operate the infrastructure our developers depend on to operate their own services.
  • Develop and Ship - You will work to modernize key portions of our observability infrastructure, building new data processing pipelines for telemetry data and writing software to automate processes and generate new insights.
  • Collaborate - You will work with teams across Epic as an observability subject matter expert to provide guidance on observability best practices.

What we're looking for

  • Experience executing meaningful change in a fast-paced, interrupt-driven environment.
  • A self-starter who approaches challenges creatively and methodically, seeing them through to final resolution.
  • Ability to adapt and be effective in new situations within a highly dynamic environment.
  • Experience working with large-scale systems in AWS, mostly deployed via Kubernetes.
  • Comfortable in a very Terraform-heavy environment, both reviewing PRs and contributing yourself.
  • Familiarity with application/service monitoring strategies and technologies such as OpenTelemetry, Prometheus, Grafana, Fluentd, New Relic, Datadog, Sentry, and Sumo Logic.

This role is open to multiple locations across North America (including CA).

EPIC JOB + EPIC BENEFITS = EPIC LIFE

Our intent is to cover all things that are medically necessary and improve the quality of life. We pay 100% of the premiums for both you and your dependents. Our coverage includes Medical, Dental, a Vision HRA, Long Term Disability, Life Insurance & a 401k with competitive match. We also offer a robust mental well-being program through Modern Health, which provides free therapy and coaching for employees & dependents. Throughout the year we celebrate our employees with events and company-wide paid breaks. We offer unlimited PTO and sick time and recognize individuals for 7 years of employment with a paid sabbatical.

Pay Transparency Information

The expected annual base pay range(s) for this position are detailed below. Each base pay range is relevant only for individuals who are residents of or will be expected to work within the specified locale. Compensation varies based on a variety of factors, which include (but aren’t limited to) things such as skills and competencies, qualifications, knowledge, and experience. In addition to base pay, most employees are eligible to participate in Epic’s generous benefit plans and discretionary incentive programs (subject to the terms of those plans or programs).

California Base Pay Range
$186,133 - $272,995 USD

ABOUT US

Epic Games spans 25 countries with 46 studios and 4,500+ employees globally. For over 25 years, we've been making award-winning games and engine technology that empowers others to make visually stunning games and 3D content that bring environments to life like never before. Epic's award-winning Unreal Engine technology not only gives game developers the ability to build high-fidelity, interactive experiences for PC, console, mobile, and VR; it is also a tool embraced by content creators across a variety of industries such as media and entertainment, automotive, and architectural design. As we continue to build our Engine technology and develop remarkable games, we strive to build teams of world-class talent.

Like what you hear? Come be a part of something Epic!

Epic Games deeply values diverse teams and an inclusive work culture, and we are proud to be an Equal Opportunity employer. Learn more about our Equal Employment Opportunity (EEO) Policy.

Note to Recruitment Agencies: Epic does not accept any unsolicited resumes or approaches from any unauthorized third party (including recruitment or placement agencies) (i.e., a third party with whom we do not have a negotiated and validly executed agreement). We will not pay any fees to any unauthorized third party.

About The Company

Founded in 1991, Epic Games is a leading interactive entertainment company and provider of 3D engine technology. Epic operates Fortnite, one of the world’s largest games with over 350 million accounts and 2.5 billion friend connections. Epic also develops Unreal Engine, which powers the world’s leading games and is adopted across industries such as film and television, architecture, automotive, manufacturing, and simulation. Through Unreal Engine, Epic Games Store, and Epic Online Services, Epic provides an end-to-end digital ecosystem for developers and creators to build, distribute, and operate games and other content. Epic has over 40 offices worldwide with headquarters in Cary, North Carolina.
