Data Reliability Engineer

5 Months ago • Upto 10 Years

Job Summary

Job Description

Bungie seeks a Data Reliability Engineer to design, deploy, and maintain highly available data infrastructure, including Kafka, RabbitMQ, Redis, Elasticsearch, and Graphite. You'll troubleshoot issues, ensure data security, and collaborate with engineering teams on projects and services. Must have experience with Linux, infrastructure automation, and distributed production environments.
Must have:
  • Linux Administration
  • Infrastructure Automation
  • Distributed Environments
  • Troubleshooting Skills
Good to have:
  • Capacity Planning
  • Data Security
  • Time-Series Monitoring
  • Data Observability
Perks:
  • Hybrid Work
  • Bungie-Approved Remote

Job Details

Data Reliability Engineering at Bungie is a core team of the Central Tech area that keeps our games and tooling running at scale. Our team owns the overall scalability, observability and resilience of the databases, data processing platforms and in-memory key-value stores used throughout the Bungie ecosystem. We partner with our engineering teams and business units on projects, services, designs, and processes We are the stewards of architecture and provide tools and services to enable engineering teams to meet their design requirements.

RESPONSIBILITIES

  • Design, deploy, and maintain highly available and scalable data infrastructure components including Kafka, RabbitMQ, Redis, Elasticsearch, and Graphite
  • Perform capacity planning and scalability assessments for data platforms
  • Troubleshoot and resolve issues related to data processing pipelines, message queuing, and performance including participation in on-call rotation
  • Ensure data security, integrity, and compliance with industry best practices and regulatory requirements
  • Document system configurations, procedures, and operational knowledge
  • Advise service owners on industry and company standards and best practices
  • Maintain reliability and performance levels for core data platform infrastructure
  • Data observability strategy and implementation
  • Data ownership strategy and documentation

REQUIRED SKILLS

  • Strong understanding of Linux operating systems and their administration
  • Effective communication skills and ability to collaborate effectively in a team environment
  • Experience with infrastructure automation and configuration management (e.g., Ansible, Terraform…)
  • Excellent troubleshooting skills and the ability to analyze and resolve complex infrastructure resource and application deployment issues
  • Experience working in a distributed production environment
  • Deep understanding of cluster management areas, such as scaling, consistency tuning, replication, and multi-datacenter configuration
  • Familiarity with time-series monitoring systems & tools (e.g., Datadog, Prometheus, Grafana and ELK)
  • Experience designing and implementing logging and metric pipelines

Similar Jobs

ByteDance - Senior Site Reliability Engineer - Data Infrastructure (San Jose)

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
ByteDance - Senior Site Reliability Engineer - Data Infrastructure (Seattle)

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
ByteDance - Site Reliability Engineer Graduate (Product RD and Infrastructure-Global E-Commerce) - 2024 Start (BS/MS)

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
ByteDance - Senior Site Reliability Engineer, ML System

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
ByteDance - Site Reliability Engineer Graduate (AML- Engine) - 2024 Start (BS/MS)

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
ByteDance - Senior Site Reliability Engineer, ML System

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
ByteDance - Site Reliability Engineer, ML System

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
ByteDance - Site Reliability Engineer (Cloud) - Infrastructure Engineering

ByteDance

Singapore (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

ByteDance - Software Engineer, ML System Scheduling

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
ByteDance - Tech Lead - Global E-Commerce Supply Chain

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
ByteDance - Senior Site Reliability Engineer - Data Infrastructure (San Jose)

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
ByteDance - Research Scientist in ML Systems

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
ByteDance - Cloud Network Engineer - Physical Network Infrastructure

ByteDance

Singapore (On-Site)
3 Months ago
ByteDance - Senior Site Reliability Engineer - Data Infrastructure (Seattle)

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
ByteDance - Machine Learning Engineer-Model Serving Infrastructure (AML-Engine)

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
ByteDance - Software Engineer, ML System Architecture

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
ByteDance - Machine Learning Engineer-Model Serving Infrastructure (AML-Engine)

ByteDance

San Jose, California, United States (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Worldwide

paypay - Android Engineer

paypay

(Remote)
3 Months ago
ESL FACEIT Group - EFG - Data Analyst (Analytics Engineer)

ESL FACEIT Group - EFG

(Remote)
3 Months ago
My.Games - FX Artist

My.Games

(Remote)
3 Months ago
Hike - Sr. Business Analyst (Full Time, Remote)

Hike

(Remote)
3 Months ago
Polygon Labs - Technical Account Manager

Polygon Labs

(Remote)
3 Months ago
Playnetic - Technical Product Owner

Playnetic

(Remote)
3 Months ago
growe - Senior .Net Developer

growe

(Remote)
3 Months ago
growe - CRM manager

growe

(Remote)
3 Months ago
growe - Product Manager

growe

(Remote)
3 Months ago
growe - Marketing Analyst

growe

(Remote)
3 Months ago

Get notifed when new similar jobs are uploaded

Similar Category Jobs

Aristocrat Gaming - Affiliate Program Backoffice

Aristocrat Gaming

Sliema, Malta (Hybrid)
3 Months ago
Fortra - Sr. Solutions Engineer_Cybersecurity DP -SE Asia

Fortra

Singapore (On-Site)
3 Months ago
Truecaller - Senior Android Engineer

Truecaller

Stockholm, Stockholm County, Sweden (On-Site)
3 Months ago
paypay - Android Engineer

paypay

(Remote)
3 Months ago
Axinous - Senior Manager, Global CXO Experiences

Axinous

San Jose, California, United States (Hybrid)
3 Months ago
ByteDance - Software Engineer, ML System Scheduling

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
ByteDance - Tech Lead - Global E-Commerce Supply Chain

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
ByteDance - Senior Software Engineer, Cross Platform Application

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
ByteDance - Senior Site Reliability Engineer - Data Infrastructure (San Jose)

ByteDance

San Jose, California, United States (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded