Site Reliability Engineer (SRE)

3 Months ago • 5 Years + • Devops

Job Summary

Job Description

As an SRE Engineer at White Hat Gaming, you'll ensure the reliability, availability, and performance of their platform. Responsibilities include automating tasks, monitoring system health, responding to incidents, collaborating with development teams on scalable solutions, and conducting root cause analysis. You'll work with cloud platforms, scripting languages, and monitoring tools. This role requires a minimum of 5 years of experience in deploying, monitoring, and troubleshooting large-scale distributed systems.
Must have:
  • 5+ years experience with large-scale systems
  • Linux administration
  • Unix shell scripting
  • Networking knowledge (TCP/IP, DNS)
  • Cloud infrastructure (AWS)
  • Kubernetes experience
  • Terraform experience
Good to have:
  • Experience with multiple staging/dev environments
  • CI/CD experience (Jenkins, Spinnaker)
  • Open-source network/security monitoring
  • Postgres experience
  • AWS security experience
Perks:
  • Remote and flexible work schedule
  • Annual performance bonus
  • Hardware & software allowance
  • Well-being programs
  • Training and development opportunities
  • Generous time off

Job Details

About White Hat Gaming

White Hat Gaming is a state-of-the-art iGaming platform providing a secure, scalable and flexible modular Casino and Sportsbook Player Account Management solution.

 

We offer operators choice, from our proprietary Player Account Management (PAM) to a full white-label solution. WHG provides market-leading content including Kambi Sportsbook, CRM tools, all payment options, and more than 3000 games from 120 leading games providers.


With over 500 talented colleagues from around the world, we offer a dynamic, collaborative environment where your ideas can flourish alongside industry leaders. Join us and be at the forefront of iGaming innovation!


In Summary:
As an SRE Engineer at White Hat Gaming, you will play a crucial role in ensuring the reliability, availability, and performance of our platform. You will work in a dynamic, collaborative environment, automating tasks, monitoring system health, and responding to incidents to minimize downtime. Your expertise in cloud platforms, scripting languages, and monitoring tools will be essential in designing and implementing scalable solutions. Join us to be at the forefront of iGaming innovation and contribute to our global team's success.

 

Your day to day:

  • Ensure the reliability, availability, and performance of our platform.
  • Automate repetitive tasks to improve efficiency and reduce human error.
  • Monitor system health and respond to incidents to minimize downtime.
  • Collaborate with development teams to design and implement scalable solutions.
  • Conduct root cause analysis of incidents and implement preventive measures.


What we are looking for:

  • A minimum of 5 years of experience deploying, monitoring, and troubleshooting large-scale distributed systems
  • Background in Linux administration
  • Scripting/programming knowledge of at least Unix shell scripting
  • Good networking understanding (TCP/IP, DNS, routing, firewalls, etc.)
  • Good understanding of technologies such as Apache, Nginx, Databases (relational and key-value), DNS servers, etc
  • Understanding of cloud-based infrastructures, such as AWS
  • Experience with systems for automating deployment, scaling, and management of containerized applications, such as Kubernetes
  • Experience with Terraform for infrastructure
  • Quick to learn and fast to adapt to changing environments
  • Excellent communication and documentation skills
  • Excellent troubleshooting and creative problem-solving abilities
  • Excellent communication and organizational skills in English


Nice to have:

  • Experience deploying and supporting multiple staging/dev environments
  • Experience maintaining continuous integration and delivery pipelines with tools such as Jenkins and Spinnaker
  • Experience implementing, operating, and supporting open-source tools for network and security monitoring and management on Linux/Unix platforms
  • Experience with Postgres
  • Experience with security in AWS


How we approach things:

  • Dynamic Medium-Sized Environment: We have a can-do ethos, where innovation is encouraged, and action is valued.
  • Results-Oriented Focus: We prioritize getting things done while supporting each other to reach both collective and individual goals.
  • Global Team: We are truly a global team with people from various countries and cultures contributing to our success.
  • Open Collaboration: Our open-door policy fosters collaboration across all levels and departments, where ideas flow freely.
  • Core Values at Heart: We live by Teamwork, Innovation, Trust, and Integrity in everything we do.


What we offer:

  • A remote and flexible working schedule. 
  • Discretionary annual performance bonus
  • Hardware & Software allowance or work equipment is provided to make sure you have all the right tools to get the job done.
  • Various well-being programmes and initiatives.
  • Training and other learning & development opportunities to support you through your career progression.
  • Generous time off varied based on the country of residence.


Everything about WHG won't fit into a job ad, want to find out more about working with us? Apply to get the conversation started. 

We are an equal opportunities employer and welcome applications from all suitably qualified persons regardless of their race, gender, disability, religion/belief, sexual orientation, or age.

By submitting your application, you agree that we process your data in accordance with our Privacy Policy for the management of your candidature to any of the positions we offer.

Similar Jobs

Ion - Senior Software Developer, Italy

Ion

Italy (Hybrid)
8 Months ago
Zazz - Growth Marketing Analyst

Zazz

India (On-Site)
6 Months ago
WongDoody - UI DESIGNER, HONG KONG

WongDoody

Taipei City, Taiwan (On-Site)
7 Months ago
Liquid nitro games - Recruiter

Liquid nitro games

Hyderabad, Telangana, India (On-Site)
2 Months ago
Netflix - Senior Account Manager

Netflix

Seoul, South Korea (On-Site)
2 Months ago
Ion - Cloud Engineer Kubernetes

Ion

Italy (Hybrid)
8 Months ago
Wind River - Cloud Platform Software Developer – Member of Technical Staff

Wind River

Ottawa, Ontario, Canada (Hybrid)
1 Month ago
techholding - AWS DevOps Engineer

techholding

Mexico (Remote)
2 Months ago
ZeniMax Media - Senior Cloud Architect

ZeniMax Media

Rockville, Maryland, United States (On-Site)
6 Days ago
Perplexity - AI Software Engineer - Evaluation Platform

Perplexity

San Francisco, California, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Netflix - Team Lead, Launch Operations (Product Discovery & Promotion)

Netflix

Manila, Metro Manila, Philippines (On-Site)
2 Months ago
Corsair - Global Sourcing Manager

Corsair

Taipei City, Taiwan (On-Site)
3 Months ago
Tesla - Service Manager

Tesla

Dornbirn, Vorarlberg, Austria (On-Site)
4 Months ago
Zoe - Workplace Technology Lead

Zoe

United Kingdom (Remote)
2 Weeks ago
DraftKings - Lead Software Engineer, Android

DraftKings

Canada (Remote)
2 Months ago
Yodlee - Information Security & Risk Director

Yodlee

Raleigh, North Carolina, United States (Remote)
2 Months ago
PwC - Specialist External Audit

PwC

Monterrey, Nuevo Leon, Mexico (On-Site)
9 Months ago
Illumina - Automation Software Engineer I

Illumina

Singapore, Singapore (On-Site)
1 Month ago
AECOM - Design Lead - Electrical

AECOM

Riyadh, Riyadh Province, Saudi Arabia (On-Site)
1 Week ago
Illumina - Sr Software Technical Product Manager

Illumina

San Diego, California, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Worldwide

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Devops Jobs

NVIDIA - Senior Cloud Service Provider Application Engineer

NVIDIA

Santa Clara, California, United States (On-Site)
2 Months ago
London stock Exchange - Site Reliability Engineering Manager - ForexClear Support

London stock Exchange

London, England, United Kingdom (On-Site)
3 Weeks ago
luxsoft - Senior SAP BTP Platform Engineer

luxsoft

Poland (Remote)
2 Weeks ago
Next Level Business Services - Enovia – Solution Architect

Next Level Business Services

Greenville, South Carolina, United States (On-Site)
8 Months ago
Netflix - Software Engineer L4, Machine Learning Platform (Metaflow)

Netflix

Los Gatos, California, United States (On-Site)
4 Months ago
Apple - Compute SRE

Apple

Seattle, Washington, United States (On-Site)
3 Weeks ago
Gigamon - Cloud and AI Technical Marketing Engineer

Gigamon

Chennai, Tamil Nadu, India (On-Site)
2 Months ago
Tencent - Tencent Cloud-Solution Architect-HK and Macau

Tencent

Hong Kong (On-Site)
7 Months ago
Saviynt - Senior Solutions Engineer

Saviynt

Singapore (Hybrid)
3 Weeks ago
Google - Site Reliability Manager, Platforms and Devices, SRE

Google

Bengaluru, Karnataka, India (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded