Principal DevOps Engineer

1 Month ago • 5 Years + • DevOps

Job Summary

Job Description

Fandom seeks a Principal DevOps Engineer to design, implement, and maintain high-availability systems for its large-scale platform (300+ million monthly users). Responsibilities include improving code integration and deployment processes, leading meetings, providing technical strategy, developing operational solutions (backup, recovery, monitoring), and mentoring other engineers. The role requires expertise in scaling infrastructure, containerization (Docker, Kubernetes), configuration management (Chef preferred), Linux systems, monitoring (Prometheus), networking protocols, and scripting (Go, Python, Perl). The role includes on-call duties on a rotating basis.
Must have:
  • Scale production infrastructure for heavy user load
  • Experience with Docker and Kubernetes
  • Strong understanding of Linux systems
  • Proficiency with Prometheus and networking protocols
  • 5+ years DevOps/System Operations experience
Good to have:
  • GCP or AWS experience
  • Knowledge of Prometheus, HashiCorp tools, Terraform, or Vault
  • ELK stack understanding
  • Strong networking experience (BGP, VPNs)
  • CI/CD pipeline experience
Perks:
  • MacBook Pro
  • Free online courses
  • Company stock options
  • Cafeteria Benefit Program
  • VTO
  • Flexible work hours

Job Details

About this Role

Fandom is growing! Our Staff DevOps Engineer opportunity is based in Poznań and reports to our Manager of Network Operations.

Our Engineering team is a fan of building web experiences on the world’s largest platform for fans, with 300+ million monthly users! As a Staff DevOps Engineer, you will develop and monitor our scalable infrastructure in a large-scale Linux environment. You will work closely with all development teams to ensure that services are designed with scale, operability, performance, and ease of use in mind. Your day-to-day operations will include diagnosing and resolving performance and reliability issues across the entire stack: hardware, kernel, application, and network. You’ll be responsible for driving the entire lifecycle of DevOps projects and feature development.

You Will...

  • Design, implement, and maintain high availability systems
  • Lead improvement of code integration and deployment processes
  • Contribute to and occasionally lead meetings: planning, stand-ups, RCAs and retrospectives
  • Provide strategy and thought leadership in product and technical decisions
  • Develop and maintain solutions for operational administration, system/data backup, disaster recovery, network management, and security/performance monitoring
  • Continuously evaluate existing systems against industry standards and make recommendations for improvement
  • Mentor other passionate engineers with diverse skill sets in a collaborative team environment
  • Participate in on-call duties on a rotational basis

You Have...

  • Experience scaling and monitoring production infrastructure for heavy user load, e.g., 10,000,000+ monthly active users
  • 5+ years of Technical Operations, DevOps, System Operations, Network Operations, or Site Reliability experience
  • Experience with containerization and orchestration technologies such as Docker and Kubernetes
  • Experience with configuration management tools (preferably Chef, but experience with Puppet, Ansible, or other equivalent tools is acceptable)
  • A strong understanding of Linux systems, both high and low level
  • Proficiency with modern monitoring systems, e.g., Prometheus
  • Proficiency with networking protocols (OSI network layers, TCP/IP, routing)
  • Proficiency in identifying and creating the system metrics needed to maintain performance and availability
  • Knowledge of relational databases
  • Programming experience with scripting and coding: Go, Python, Perl, etc.

Bonus Points...

  • Experience with cloud providers: Google Cloud Platform (GCP) or Amazon AWS
  • Knowledge of Prometheus, HashiCorp Consul, Terraform, or Vault
  • Understanding of the ELK stack (Elasticsearch, Logstash, Kibana)
  • Strong networking experience (BGP, VPNs, network security) – for example, CNA certification
  • Experience building and maintaining continuous integration and deployment pipelines

Benefits & Perks

  • MacBook Pro and all the gear you need for work
  • Free access to a multitude of popular online courses and books sponsored by our company
  • Company stock options
  • Company swag packages
  • Cafeteria Benefit Program (including private medical care, gym membership, shopping/wellness bonus, etc.)
  • VTO (Voluntary Time Off) - a day off every quarter to volunteer for a non-profit
  • Frequent team bonding events
  • Flexible work hours & time-off
  • Employee Interest and Hobby Groups supported by our company
  • Open, energetic and fan-focused, international work environment

About Fandom

Fandom is the world’s largest fan platform where fans immerse themselves in imagined worlds across entertainment and gaming. Reaching more than 350 million unique visitors per month and hosting more than 250,000 wikis, Fandom is the #1 source for in-depth information on pop culture, gaming, TV and film, where fans learn about and celebrate their favorite fandoms. Fandom’s Gaming division manages the online video game retailer Fanatical. Fandom Productions, the content arm of Fandom, enhances the fan experience through curated editorial coverage and branded content from trusted and established publishing brands Gamespot, TV Guide and Metacritic, along with its Emmy-nominated Honest Trailers and the weekly video news program The Loop. For more information follow @getfandom or visit:.

Fandom is an equal opportunity employer. Fandom values diversity, and all employment decisions are made on the basis of job requirements and individual qualifications.

