Staff DevOps Engineer

1 Month ago • 5 Years + • DevOps

Job Summary

Job Description

As a Staff DevOps Engineer at Fandom, you'll design, implement, and maintain high-availability systems for a platform with 300+ million monthly users. Responsibilities include improving code integration and deployment processes, leading meetings (planning, stand-ups, RCAs, retrospectives), providing strategic input on product and technical decisions, and developing solutions for operational administration, backup, disaster recovery, and security monitoring. You'll diagnose and resolve performance and reliability issues across the entire stack and mentor other engineers. The role involves on-call duties on a rotational basis and requires experience scaling infrastructure for heavy user loads (10,000,000+ MAU).
Must have:
  • Experience scaling production infrastructure
  • 5+ years DevOps/SysOps experience
  • Docker, Kubernetes experience
  • Configuration management (Chef preferred)
  • Strong Linux systems understanding
  • Proficiency with monitoring systems (Prometheus)
  • Networking protocols expertise
  • Scripting/coding (Go, Python, Perl)
  • Relational database knowledge
Good to have:
  • GCP or AWS experience
  • Prometheus, Consul, Terraform, Vault knowledge
  • ELK stack understanding
  • Strong networking experience (BGP, VPNs)
  • CI/CD pipeline experience
Perks:
  • MacBook Pro
  • Access to online courses
  • Company stock options
  • Company swag
  • Cafeteria benefit program
  • VTO
  • Flexible work hours
  • Employee interest groups

Job Details

About this Role

Fandom is growing! Our Staff DevOps Engineer opportunity is based in Poznan and reports to our Manager of Network Operations.

Our Engineering team is a fan of building web experiences on the world’s largest platform for fans, with 300+ million monthly users!As a Staff DevOps Engineer, you will develop and monitor our scalable infrastructure in a large-scale Linux environment. You will work closely with all development teams to ensure that services are designed with scale, operability, performance, and ease-of-use in mind. Your day-to-day operations will include diagnosing and resolving performance and reliability issues across the entire stack: hardware, kernel, application, and network. You’ll be responsible for driving the entire lifecycle of DevOps projects and feature developments.

You Will...

  • Design, implement, and maintain high availability systems
  • Lead improvement of code integration and deployment processes
  • Contribute to and occasionally lead meetings: planning, stand-ups, RCAs and retrospectives
  • Provide strategy and thought leadership in product and technical decisions
  • Develop and maintain solutions for operational administration, system/data backup, disaster recovery, network management, and security/performance monitoring
  • Continuously evaluate existing systems with industry standards and make recommendations for improvement
  • Mentor other passionate engineers with diverse skill sets in a collaborative team environment
  • Participate in on-call duties on a rotational basis.

You Have...

  • Experience scaling and monitoring production infrastructure for heavy user load, e.g., 10,000,000+ monthly active users.
  • 5+ years of Technical Operations, DevOps, System Operations, Network Operations or Site Reliability experience.
  • Experience with containerization and orchestration technologies such as Docker and Kubernetes.
  • Experience with configuration management tools (preferably Chef, but experience with Puppet, Ansible, or other equivalent tools is acceptable )
  • A strong understanding of Linux systems, both high and low level
  • Proficiency with modern monitoring systems e.g., Prometheus
  • Proficiency with networking protocols (OSI network layers, TCP/IP, routing) 
  • Proficiency in identifying and creating useful systems’ metrics needed to maintain performance and availability.
  • Knowledge of relational databases
  • Programming experience with scripting and coding: go, python, perl etc.

Bonus Points...

  • Experience with cloud providers: Google Cloud Platform (GCP) or Amazon AWS
  • Knowledge of Prometheus, Hashicorp Consul, Terraform, or Vault 
  • Understanding of the ELK stack (elasticsearch, logstash, kibana)
  • Strong Networking experience (BGP, VPNs, network security) – for example, CNA certification
  • Experience building and maintaining continuous integrations and deployment pipelines.

Benefits & Perks

  • MacBook Pro and all the gear you need for work
  • Free access to a multitude of popular online courses and books sponsored by our company
  • Company stock options
  • Company swag packages
  • Cafeteria Benefit Program (including private medical care, gym membership, shopping/wellness bonus, etc.)
  • VTO (Voluntary Time Off) - a day off every quarter for volunteering non-profit
  • Frequent team bonding events
  • Flexible work hours & time-off
  • Employee Interest and Hobby Groups supported by our company
  • Open, energetic and fan-focused, international work environment

About Fandom

Fandom is the world’s largest fan platform where fans immerse themselves in imagined worlds across entertainment and gaming. Reaching more than 350 million unique visitors per month and hosting more than 250,000 wikis, Fandom is the #1 source for in-depth information on pop culture, gaming, TV and film, where fans learn about and celebrate their favorite fandoms. Fandom’s Gaming division manages the online video game retailer Fanatical. Fandom Productions, the content arm of Fandom, enhances the fan experience through curated editorial coverage and branded content from trusted and established publishing brands Gamespot, TV Guide and Metacritic, along with its Emmy-nominated Honest Trailers and the weekly video news program The Loop. For more information follow @getfandom or visit:.

Fandom is an equal opportunity employer. Fandom values diversity, and all employment decisions are made on the basis of job requirements and individual qualifications.

#LI-AM1

Similar Jobs

ION - IT/Cyber Security Analyst

ION

London, England, United Kingdom (On-Site)
6 Months ago
Next Level Business Services - C++ Developer

Next Level Business Services

Milwaukee, Wisconsin, United States (On-Site)
6 Months ago
DNEG - FX - Crowd Artist

DNEG

Mumbai, Maharashtra, India (On-Site)
1 Month ago
NVIDIA - Performance Engineer Intern, Deep Learning and HPC

NVIDIA

Shanghai, Shanghai, China (On-Site)
2 Months ago
NVIDIA - Principal Silicon Circuits System Design Engineer

NVIDIA

Santa Clara, California, United States (Hybrid)
2 Months ago
Sonar Source - Solutions Engineer - Strategic Accounts

Sonar Source

Austin, Texas, United States (Hybrid)
6 Months ago
White Hat Gaming  - Site Reliability Engineer (SRE)

White Hat Gaming

(Remote)
1 Month ago
Nielsen Holdings - Senior Software Engineer - Bigdata ( Java / Scala / Python , Spark, SQL , AWS)

Nielsen Holdings

Mumbai, Maharashtra, India (Hybrid)
6 Months ago
Tencent - Tencent Cloud Technical Account Manager

Tencent

Palo Alto, California, United States (On-Site)
2 Months ago
Hashlist - Senior Data Engineer

Hashlist

Pune, Maharashtra, India (Hybrid)
5 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Nielsen Holdings - Software Engineer ( Java , Python , SQL , AWS / Oracle)

Nielsen Holdings

Bengaluru, Karnataka, India (Hybrid)
6 Months ago
ByteDance - Software Engineer in Machine Learning Systems

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
ByteDance - Senior Software Development Engineer - NoSQL-DocumentDB

ByteDance

Seattle, Washington, United States (On-Site)
5 Months ago
ByteDance - Product Solutions Architect - Enterprise Security

ByteDance

Singapore (On-Site)
5 Months ago
ByteDance - Software Engineer, ML System Architecture

ByteDance

San Jose, California, United States (On-Site)
5 Months ago
The Walt Disney Company - Lead Animator

The Walt Disney Company

Sydney, New South Wales, Australia (Hybrid)
2 Months ago
NVIDIA - Enterprise Software Test Development Engineer

NVIDIA

Taipei City, Taiwan (On-Site)
3 Weeks ago
Electronic Arts - DevOps Engineer II

Electronic Arts

Kuala Lumpur, Wilayah Persekutuan Kuala Lumpur, Malaysia (On-Site)
3 Weeks ago
Egnyte - DevOps Engineer

Egnyte

India (Remote)
2 Months ago
NVIDIA - Senior System Software Engineer

NVIDIA

Canada (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Poland

Keywords Studios - Content Moderator - French (Video Games) - Remote

Keywords Studios

Katowice, Silesian Voivodeship, Poland (Remote)
3 Weeks ago
Playtika - Senior DATA/AI SRE Engineer

Playtika

Poland (On-Site)
5 Months ago
Tripledot Studios - Game Designer

Tripledot Studios

Warsaw, Masovian Voivodeship, Poland (Hybrid)
3 Months ago
Techland - Junior Localization Specialist

Techland

Warsaw, Masovian Voivodeship, Poland (On-Site)
2 Weeks ago
Egnyte - Sr Software Engineer - Java

Egnyte

Poznań, Greater Poland Voivodeship, Poland (On-Site)
4 Months ago
Meta - Production Engineer

Meta

Warsaw, Masovian Voivodeship, Poland (On-Site)
5 Months ago
Bloober Team - Senior Systems Programmer

Bloober Team

Kraków, Lesser Poland Voivodeship, Poland (Hybrid)
1 Month ago
PwC - Starszy Konsultant / Starsza Konsultantka | Aktuariat (obszar Actuarial Tools)

PwC

Warsaw, Masovian Voivodeship, Poland (Hybrid)
6 Months ago
Google - Software Engineer, Early Career, Cloud AI

Google

Warsaw, Masovian Voivodeship, Poland (On-Site)
5 Months ago
11 bit studios - Senior Producer (Frostpunk 2)

11 bit studios

Warsaw, Masovian Voivodeship, Poland (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

N-iX - Senior Data Engineer

N-iX

Kyiv, Kyiv City, Ukraine (Remote)
1 Month ago
ByteDance - Site Reliability Engineer, Traffic Infrastructure

ByteDance

Singapore (On-Site)
5 Months ago
Axinous - Principal Software Engineer (ZDX Platform Engineering)

Axinous

San Jose, California, United States (Hybrid)
4 Months ago
Playdead - DevOps Engineer

Playdead

Copenhagen, Denmark (On-Site)
7 Months ago
Cadence - Senior Cloud Platform Architect

Cadence

San Jose, California, United States (On-Site)
6 Months ago
Nagarro - Senior Engineer, DevOps

Nagarro

India (Remote)
6 Months ago
Anthology  Inc  - DevOps (SRE) Engineer

Anthology Inc

Brno, South Moravian Region, Czechia (On-Site)
6 Months ago
NVIDIA - Senior SRE Software Engineer, Storage and Data

NVIDIA

Shanghai, Shanghai, China (On-Site)
3 Months ago
Aristocrat Gaming - DevOps Engineer

Aristocrat Gaming

Montreal, Quebec, Canada (Hybrid)
1 Month ago
Zeta - Senior Site Reliability Engineer

Zeta

Bengaluru, Karnataka, India (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

About The Company

United States (Remote)

Poznań, Greater Poland Voivodeship, Poland (On-Site)

Poznań, Greater Poland Voivodeship, Poland (On-Site)

New York, New York, United States (Remote)

United States (Remote)

Poznań, Greater Poland Voivodeship, Poland (Remote)

Poznań, Greater Poland Voivodeship, Poland (On-Site)

View All Jobs

Get notified when new jobs are added by Fandom

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug