Senior Site Reliability Engineer

Job Description

Crytek is seeking an experienced Senior Site Reliability Engineer for Hunt: Showdown's NetOps department in Frankfurt. The role acts as the key liaison between development and network operations, driving operational excellence, leading infrastructure initiatives, and collaborating with production and architecture to ensure highly available, scalable, and efficient systems. The position combines operational and strategic responsibilities and is based on-site in Frankfurt, Germany.
Good To Have:
  • Experience with Zero Trust Networks, WireGuard, Nomad, MAAS, or Foreman.
  • Knowledge of capacity forecasting and cost optimization for large-scale systems.
Must Have:
  • Lead initiatives for reliability, scalability, and performance of live game infrastructure.
  • Serve as subject matter expert and mentor to engineers.
  • Perform daily operation and maintenance of hosted/cloud data-center environments.
  • Install, configure, and patch system and game software.
  • Define, monitor, and improve SLIs/SLOs for 99.9%+ uptime.
  • Own incident response and root cause analysis; create runbooks and playbooks.
  • Evaluate and implement new technologies, driving POCs to production.
  • Maintain accurate, up-to-date documentation for systems, workflows, and processes.
  • Lead capacity planning, scaling strategies, and disaster recovery efforts.
  • Optimize reliability, observability, and cost efficiency of critical infrastructure.
  • Strong Linux administration skills.
  • Experience with containerization and orchestration technologies.
  • Experience with CI/CD pipelines, automated deployment, and infrastructure as code.
  • Solid understanding of network security principles.
  • Hands-on experience with bare-metal and cloud (preferably AWS).
  • Proficient in automation tools like Ansible and Terraform.
  • Skilled with observability tools (OpenTelemetry, Prometheus, Mimir, Grafana).
  • Deep understanding of scalability, profiling, debugging, and performance testing.
  • Strong grasp of web stack fundamentals (REST, HTTP, CDN, caching).
  • Experience setting up monitoring, metrics, and proactive alerting for production systems written in Go, Java, or C++.
  • Proficient scripting in Shell and Python.
  • Excellent communication and documentation skills in English.
Perks:
  • Career path development support.
  • Relocation budget and full coverage of flights to Frankfurt for you and your family.
  • Extensive assistance with visa, work permits, and settling into Germany.
  • Fully furnished company apartment for the first three months in Frankfurt.
  • Free public transport pass for Frankfurt.
  • Membership at the premium gym chain Fitness First.
  • Brand-new, modern office in the heart of Frankfurt.
  • International environment with employees from over 42 different countries.
  • German language courses for you and your family.
  • Exciting company events, including new starter breakfasts, summer and winter parties, and a Gamescom trip.
  • 24 days of vacation per year, increasing by 1 day every 2 years (up to 28 days).
  • Approximately 10 public holidays per year.

Crytek is looking for an experienced Senior Site Reliability Engineer to support Hunt: Showdown’s NetOps department in our Frankfurt Studio.

The person in this position will serve as the key liaison between development teams and the network operations team. They will drive operational excellence, lead infrastructure initiatives, and work closely with production and architecture to ensure systems are highly available, scalable, and efficient. This position includes both operational and strategic responsibilities.

This role is based on-site at our headquarters in Frankfurt, Germany, where you’ll collaborate with world-class developers and benefit from our attractive relocation package.

Responsibilities

  • Lead initiatives to improve reliability, scalability, and performance across our live game infrastructure.
  • Serve as subject matter expert and mentor to junior and mid-level engineers.
  • Perform daily operation and maintenance of hosted/cloud data-center environments.
  • Install, configure, and patch system and game software.
  • Define, monitor, and improve SLIs/SLOs to maintain 99.9%+ uptime.
  • Own incident response and root cause analysis processes; create and maintain runbooks and playbooks.
  • Evaluate and implement new technologies, conducting POCs and driving them to production.
  • Maintain accurate, up-to-date documentation for systems, workflows, and processes.
  • Lead capacity planning, scaling strategies, and disaster recovery efforts.
  • Continuously optimize the reliability, observability, and cost efficiency of critical infrastructure.

Requirements

  • Previous experience as a Site Reliability Engineer, Platform Engineer or similar.
  • Proven experience designing and operating large-scale, high-availability systems.
  • Strong Linux administration skills.
  • Experience with containerization and orchestration technologies.
  • Experience with CI/CD pipelines, automated deployment, and infrastructure as code.
  • Solid understanding of network security principles.
  • Hands-on experience with both bare-metal and cloud (preferably AWS).
  • Proficient in automation tools such as Ansible and Terraform.
  • Skilled with observability tools like OpenTelemetry, Prometheus, Mimir, and Grafana.
  • Deep understanding of scalability, profiling, debugging, and performance testing.
  • Strong grasp of web stack fundamentals (REST, HTTP, CDN, caching).
  • Experience setting up monitoring, metrics, and proactive alerting for production systems written in Go, Java, or C++.
  • Proficient scripting in Shell and Python.
  • Excellent communication and documentation skills in English.
  • Willing to relocate to Frankfurt.

Pluses

  • Experience with Zero Trust Networks, WireGuard, Nomad, MAAS, or Foreman.
  • Knowledge of capacity forecasting and cost optimization for large-scale systems.

What you can expect from us

  • Career Path: Your professional development is important to us, so we have laid out a career plan to help you progress towards your goals and objectives.
  • Relocation Support: We offer a relocation budget and full coverage of flights to Frankfurt for you and your family. You can expect extensive assistance with visas, work permits, and communication with authorities during the relocation process, as well as help settling into Germany (e.g. setting up appointments with banks, government agencies, schools, and landlords, and finding an apartment).
  • Company Apartment: To help you get settled, we provide you with a fully furnished company apartment during your first three months in Frankfurt.
  • Public Transport Pass: Discover Frankfurt by bus, tram, and metro, free of charge.
  • Gym Card: A healthy body is a healthy mind. We offer a membership at the premium gym chain Fitness First in Germany. Work out, join group fitness classes, or relax in the wellness facilities.
  • State-of-the-art Office: We've recently moved into a brand-new, modern office located in the heart of Frankfurt. Our new workspace is designed to inspire creativity and collaboration, with open areas, quiet zones, and top-tier facilities, all just steps away from public transport, restaurants, bars, and cultural hotspots.
  • International Environment: We truly embody diversity at Crytek. With employees from over 42 different countries, we define ourselves by our cultural diversity.
  • German Classes: Understanding the local culture will make your stay abroad more enjoyable, and Crytek supports that by offering German language courses for you and your family.
  • Events: Join us at our exciting company events, including new starter breakfasts, summer and winter parties, our annual trip to Gamescom in Cologne, and many more!
  • Vacation Days: At our Frankfurt office you can enjoy 24 days of vacation per year, and every 2 years you get 1 more (up to a maximum of 28 days). You will also have, on average, 10 public holidays on top of the days you take off.
