Senior Site Reliability Engineer

2 Months ago • 4 Years + • Devops

Job Summary

Job Description

As a Senior Site Reliability Engineer at Gearbox Entertainment, you will be responsible for ensuring the observability and reliability of their online platform. This involves designing and developing solutions with a focus on cloud-native architectures and automation. You will work with Go, Python, AWS, and Terraform, and actively spend most of your time engineering solutions. This role requires participation in on-call rotations. You'll be a key player in pushing the platform forward and mentoring others, with a strong focus on user experience and continuous improvement.
Must have:
  • 4+ years instrumenting observability stacks in OOP languages, preferably Go.
  • Proficiency in AWS container management, orchestration, and observability features.
  • Experience managing AWS access and security services.
  • Experience in Terraform and/or CloudFormation.
  • 2+ years experience with containers, preferably Docker.
  • Understanding of observability stack management.
  • Comfortable communicator, able to clearly detail designs and implementations.
Good to have:
  • Extensive hands-on experience with OpenTelemetry.
  • Hands-on experience developing and maintaining CI/CD pipelines.
  • Understanding of RESTful and Websocket based APIs.
  • Bachelor's degree in computer science or related field.

Job Details

The Gearbox Entertainment Company is an award-winning creator and distributor of entertainment for people around the world. Gearbox Entertainment develops and publishes products through its subsidiaries, Gearbox Software and Gearbox Publishing. Gearbox Entertainment has become widely known for successful game franchises including Brothers in Arms and Borderlands, as well as acquired properties Duke Nukem and Homeworld. Gearbox’s ambition is to entertain the world and its key driving objectives include the pursuit of happiness for our talent, partners and customers, the prioritization of entertainment and creativity and a measured respect for profitability. For more information, visit www.Gearbox.com.

To further drive our vision of premier stability and rapid feature delivery, we are looking for a Senior Site Reliability Engineer to join our team. As a Senior SRE, you should feel exceptionally comfortable bringing architectural design proposals to the table for consideration among your colleagues on our platform and infrastructure development teams. You will be one of the principal technical designers helping push our cloud-native platform toward the future. You will be responsible for driving the implementation of flexible cloud architectures with an automation-first emphasis; manual user intervention likely makes you uneasy and maybe even a little twitchy. We would expect a successful candidate for this position to be a self-starter with the ability to complete tasks independently. Though you will have access to technical leadership and senior engineers at your disposal, you should feel well acquainted with tackling complex problems without significant oversight. Observability is paramount. If we can't measure it, we can't prove it works; if we can't prove it works, it must be assumed it doesn't work. This is a philosophy you hopefully love (and preferably obsess over). If we can't observe how a new feature is behaving, our SRE team is excited to dive into the application code and make the necessary improvements. Typical Day Tl;dr: You will be deeply immersed in Go and Python observability stacks; plenty of AWS and Terraform sprinkled in as well. This is a very hands-on Senior Engineering role where your days will be filled with building solutions to technical challenges in the observability and availability of our SHiFT online services. You will evangelize for and be obsessed with user experience as it relates to the services you support. You will help manage and orchestrate each of these by leaning heavily on technologies like Go, Terraform, Docker, and Bash. On any given day, you should expect to spend at least 80% of your time actively engineering and developing solutions; the rest will be a mixture of planning, reviewing code from your colleagues, participating in design meetings, documentation, and self-development. This position will eventually require you to carry a company-paid mobile device and participate in 24/7 on-call rotations alongside your engineering colleagues. Don't worry though, our on-call experience doesn't suck. Core Responsibilities:

  • Design, engineer, and develop solutions for ensuring the observability and reliability of our online platform
  • Be a trusted voice in the evangelism of reliability engineering throughout the team with an eagerness for mentoring other developers on the team
  • Help define and oversee short and mid-term project roadmaps for the future of our SRE team
  • Participate in after-hours on-call support rotations

Must Have (the non-negotiable parts):

  • Candidates must have at least 4 years of professional experience instrumenting complex observability stacks in object oriented programming languages, preferably Go.
  • Proficiency in AWS container management, orchestration, and observability features (ECS, Fargate, Aurora, AppConfig, CloudWatch, etc.)
  • Professional Experience managing AWS access and security services (IAM, kms, Secrets Manager, WAFv2, etc.)
  • Professional Experience in Terraform and/or CloudFormation
  • Minimum of 2 years experience with containers in a professional setting, preferably Docker
  • Adept understanding of observability stack management (otel, tracing, monitoring, alerting, structured logging, APM, etc.)
  • Comfortable communicator, able to clearly detail designs and implementations on an individual level and in large group settings

Should Have (some wiggle room):

  • Extensive hands-on experience with OpenTelemetry
  • Hands-on experience developing and maintaining CI/CD pipelines, preferably in git/GitLab
  • Understanding of RESTful and Websocket based APIs
  • Bachelor's degree in computer science, related field, or equivalent training and professional experience

Now you're just showing off:

  • Familiarity with Datadog
  • Familiarity with Atlassian products (OpsGenie, JIRA, Confluence)
  • Experience working with developers in an agile environment
  • Experience in the games industry, preferably launching multiple online-enabled AAAs
  • Knowledge about Gearbox-owned IPs

Gearbox Entertainment believes that all team members should be able to enjoy a work environment free from all forms of discrimination and harassment. We are committed to reflecting the diversity of the world we strive to entertain. As an Equal Opportunity Employer, we provide fair and equal treatment to all team members and applicants. We do not discriminate on the basis of race, color, religion, sex, sexual orientation, gender identity or expression, national origin, disability, genetic information, pregnancy or maternity, veteran status, or any other status protected by applicable national, federal, state or local law.

Similar Jobs

Sagecor - Software Engineer 3

Sagecor

Annapolis Junction, Maryland, United States (On-Site)
3 Weeks ago
Overdare - Lua Engineer

Overdare

Seoul, South Korea (On-Site)
3 Months ago
Ethernovia - Embedded Architecture Software Engineer

Ethernovia

Pune, Maharashtra, India (On-Site)
1 Week ago
Luxoft - Java Team Lead

Luxoft

Mississauga, Ontario, Canada (On-Site)
8 Months ago
Le Collectionist - Lead Data Engineer (H/F/X) - CDI - Paris

Le Collectionist

Paris, Île-de-France, France (On-Site)
11 Months ago
CyberArk - Senior Software Architect-Python

CyberArk

India (On-Site)
2 Months ago
Google - Staff Software Engineer, Google Cloud

Google

Hyderabad, Telangana, India (On-Site)
8 Months ago
Qualcomm - Embedded Platform Dev- Lead Engineer, Senior

Qualcomm

Bengaluru, Karnataka, India (On-Site)
1 Month ago
miniclip - Senior Cloud Engineer - Senior Cloud Engineer I

miniclip

Lisbon, Lisbon, Portugal (On-Site)
2 Months ago
Axon - Sr. Solutions Architect, Fusus

Axon

Atlanta, Georgia, United States (Hybrid)
2 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Game District - Senior Game Developer

Game District

Lahore, Punjab, Pakistan (Remote)
1 Week ago
Qualcomm - Sr Engineer- Camera HWL

Qualcomm

Hyderabad, Telangana, India (On-Site)
1 Week ago
Saviynt - Senior Software Engineer - Privilege Access Management (PAM)

Saviynt

El Segundo, California, United States (Hybrid)
9 Months ago
Motorola solutions - System Test Engineer

Motorola solutions

Cork, County Cork, Ireland (On-Site)
1 Year ago
Trellix - Staff Network Engineer

Trellix

Bengaluru, Karnataka, India (Hybrid)
1 Year ago
Experian - Junior Frontend Software Development Analyst - Affirmative Action for Women

Experian

Blumenau, State Of Santa Catarina, Brazil (On-Site)
1 Week ago
Granicus - Software Engineer 3

Granicus

Costa Rica (Remote)
2 Months ago
Capgemini - Java Architect

Capgemini

Hyderabad, Telangana, India (On-Site)
1 Month ago
GoMotive - Senior Salesforce Developer

GoMotive

India (Remote)
2 Months ago
Trend Micro - Senior Software Development Engineer in Test (Network Endpoint Security)

Trend Micro

Taipei City, Taiwan (On-Site)
2 Weeks ago

Get notifed when new similar jobs are uploaded

Jobs in Frisco, Texas, United States

Inkittt - Fullstack Martech Engineer

Inkittt

San Francisco, California, United States (Hybrid)
3 Months ago
USE Insider - North America Tech Partnership Director

USE Insider

United States (Remote)
6 Days ago
quience - Customer Care, Operations Support Lead, Voice of the Customer

quience

United States (Remote)
3 Months ago
HCL Tech - Senior Technical Lead

HCL Tech

Colorado, United States (On-Site)
2 Months ago
Mark43 - Lead Software Engineer - RMS

Mark43

New York, New York, United States (Remote)
2 Weeks ago
Inspiren - Director of Embedded Systems

Inspiren

United States (Remote)
2 Weeks ago
Ramboll3 - Senior Project Engineer, Civil (Data Center)

Ramboll3

Albany, New York, United States (Remote)
1 Week ago
extreme network - Insights Analyst, Senior Manager, Enterprise Data & Analytics

extreme network

Texas, United States (Remote)
1 Month ago
Ziff Davis - IT Support Engineer

Ziff Davis

Denver, Colorado, United States (Remote)
1 Month ago

Get notifed when new similar jobs are uploaded

Devops Jobs

Epic Games - Senior DevOps Programmer

Epic Games

Porto Alegre, State Of Rio Grande Do Sul, Brazil (On-Site)
4 Months ago
Amber - Bazel Senior Build Engineer (Project Based)

Amber

Bucharest, Bucharest, Romania (Remote)
4 Months ago
Fearless - Software Engineer II (Cloud Solution Architect) Navy NIWC

Fearless

Charleston, South Carolina, United States (On-Site)
6 Days ago
London stock Exchange - Lead Engineer - Solution Architect - Analytics

London stock Exchange

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Minecast - Senior Software Engineer - Storage Platform

Minecast

Bengaluru, Karnataka, India (Hybrid)
2 Weeks ago
Stacklok - Senior Site Reliability Engineer (SRE)

Stacklok

Bellevue, Washington, United States (Hybrid)
3 Weeks ago
WebTech Corporation - Senior Staff Software Architect

WebTech Corporation

Bengaluru, Karnataka, India (On-Site)
2 Months ago
SimpliSafe - Genesys/NICE Solutions Architect

SimpliSafe

India (On-Site)
2 Weeks ago
Ion - Cloud Engineer Kubernetes

Ion

Collecchio, Emilia-Romagna, Italy (Hybrid)
9 Months ago
Anthology  Inc  - Solutions Engineer - Enterprise

Anthology Inc

United States (Remote)
2 Months ago

Get notifed when new similar jobs are uploaded

About The Company

We are an award-winning, creator and distributor of transmedia entertainment. Gearbox Entertainment has become widely known for successful game franchises, as well as acquired properties Duke Nukem and Homeworld, which it distributes across the world.

Frisco, Texas, United States (On-Site)

Frisco, Texas, United States (On-Site)

Frisco, Texas, United States (On-Site)

Frisco, Texas, United States (On-Site)

Frisco, Texas, United States (On-Site)

Frisco, Texas, United States (On-Site)

Frisco, Texas, United States (On-Site)

Frisco, Texas, United States (On-Site)

Frisco, Texas, United States (On-Site)

Frisco, Texas, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Gearbox

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug