Site Reliability Engineer (SRE)

1 Month ago • 5 Years + • DevOps

Job Summary

Job Description

As an SRE Engineer at White Hat Gaming, you'll ensure the reliability, availability, and performance of their platform. Responsibilities include automating tasks, monitoring system health, responding to incidents, collaborating with development teams on scalable solutions, and conducting root cause analysis. You'll work with cloud platforms, scripting languages, and monitoring tools. This role requires a minimum of 5 years of experience in deploying, monitoring, and troubleshooting large-scale distributed systems.
Must have:
  • 5+ years experience with large-scale systems
  • Linux administration
  • Unix shell scripting
  • Networking knowledge (TCP/IP, DNS)
  • Cloud infrastructure (AWS)
  • Kubernetes experience
  • Terraform experience
Good to have:
  • Experience with multiple staging/dev environments
  • CI/CD experience (Jenkins, Spinnaker)
  • Open-source network/security monitoring
  • Postgres experience
  • AWS security experience
Perks:
  • Remote and flexible work schedule
  • Annual performance bonus
  • Hardware & software allowance
  • Well-being programs
  • Training and development opportunities
  • Generous time off

Job Details

About White Hat Gaming

White Hat Gaming is a state-of-the-art iGaming platform providing a secure, scalable and flexible modular Casino and Sportsbook Player Account Management solution.

 

We offer operators choice, from our proprietary Player Account Management (PAM) to a full white-label solution. WHG provides market-leading content including Kambi Sportsbook, CRM tools, all payment options, and more than 3000 games from 120 leading games providers.


With over 500 talented colleagues from around the world, we offer a dynamic, collaborative environment where your ideas can flourish alongside industry leaders. Join us and be at the forefront of iGaming innovation!


In Summary:
As an SRE Engineer at White Hat Gaming, you will play a crucial role in ensuring the reliability, availability, and performance of our platform. You will work in a dynamic, collaborative environment, automating tasks, monitoring system health, and responding to incidents to minimize downtime. Your expertise in cloud platforms, scripting languages, and monitoring tools will be essential in designing and implementing scalable solutions. Join us to be at the forefront of iGaming innovation and contribute to our global team's success.

 

Your day to day:

  • Ensure the reliability, availability, and performance of our platform.
  • Automate repetitive tasks to improve efficiency and reduce human error.
  • Monitor system health and respond to incidents to minimize downtime.
  • Collaborate with development teams to design and implement scalable solutions.
  • Conduct root cause analysis of incidents and implement preventive measures.


What we are looking for:

  • A minimum of 5 years of experience deploying, monitoring, and troubleshooting large-scale distributed systems
  • Background in Linux administration
  • Scripting/programming knowledge of at least Unix shell scripting
  • Good networking understanding (TCP/IP, DNS, routing, firewalls, etc.)
  • Good understanding of technologies such as Apache, Nginx, Databases (relational and key-value), DNS servers, etc
  • Understanding of cloud-based infrastructures, such as AWS
  • Experience with systems for automating deployment, scaling, and management of containerized applications, such as Kubernetes
  • Experience with Terraform for infrastructure
  • Quick to learn and fast to adapt to changing environments
  • Excellent communication and documentation skills
  • Excellent troubleshooting and creative problem-solving abilities
  • Excellent communication and organizational skills in English


Nice to have:

  • Experience deploying and supporting multiple staging/dev environments
  • Experience maintaining continuous integration and delivery pipelines with tools such as Jenkins and Spinnaker
  • Experience implementing, operating, and supporting open-source tools for network and security monitoring and management on Linux/Unix platforms
  • Experience with Postgres
  • Experience with security in AWS


How we approach things:

  • Dynamic Medium-Sized Environment: We have a can-do ethos, where innovation is encouraged, and action is valued.
  • Results-Oriented Focus: We prioritize getting things done while supporting each other to reach both collective and individual goals.
  • Global Team: We are truly a global team with people from various countries and cultures contributing to our success.
  • Open Collaboration: Our open-door policy fosters collaboration across all levels and departments, where ideas flow freely.
  • Core Values at Heart: We live by Teamwork, Innovation, Trust, and Integrity in everything we do.


What we offer:

  • A remote and flexible working schedule. 
  • Discretionary annual performance bonus
  • Hardware & Software allowance or work equipment is provided to make sure you have all the right tools to get the job done.
  • Various well-being programmes and initiatives.
  • Training and other learning & development opportunities to support you through your career progression.
  • Generous time off varied based on the country of residence.


Everything about WHG won't fit into a job ad, want to find out more about working with us? Apply to get the conversation started. 

We are an equal opportunities employer and welcome applications from all suitably qualified persons regardless of their race, gender, disability, religion/belief, sexual orientation, or age.

By submitting your application, you agree that we process your data in accordance with our Privacy Policy for the management of your candidature to any of the positions we offer.

Similar Jobs

Casumo - Engineering Team Lead

Casumo

(Hybrid)
1 Month ago
Bally's Interactive - Treasury Operations and Systems Manager

Bally's Interactive

Malta, New York, United States (On-Site)
2 Weeks ago
MIQ Digital - Senior Account Manager

MIQ Digital

Tokyo, Japan (Hybrid)
8 Hours ago
NVIDIA - Network Security Research Architect

NVIDIA

United Kingdom (Remote)
1 Month ago
Aisera Jobs - Sales Engineer

Aisera Jobs

(Remote)
1 Day ago
Rackspace Technology - DevOps Engineer (AWS Terraform)

Rackspace Technology

India (Remote)
2 Months ago
Hitachi - Senior Offshore Azure Infrastructure - EST Shift

Hitachi

Pune, Maharashtra, India (On-Site)
6 Months ago
The Walt Disney Company - Manager, Software Engineering

The Walt Disney Company

Washington, United States (On-Site)
2 Months ago
Assystems - DevOps Engineer

Assystems

Gurugram, Haryana, India (On-Site)
6 Months ago
Google - Software Engineer III, Infrastructure, Google Cloud

Google

San Francisco, California, United States (On-Site)
2 Weeks ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Monsters - Senior Backend Engineer

Monsters

(Remote)
1 Month ago
LeoVegas - Trader - English Speaking

LeoVegas

Málaga, Andalusia, Spain (On-Site)
2 Months ago
PlayStation Global - Technical Product Manager II

PlayStation Global

San Mateo, California, United States (Hybrid)
1 Month ago
Eleven Labs - Website Designer

Eleven Labs

United States (Remote)
1 Month ago
Google - Senior Mechatronics Engineer, Data Center Automation Services

Google

Moncks Corner, South Carolina, United States (On-Site)
2 Days ago
ByteDance - Hardware Engineering Lab Manager - Pico

ByteDance

San Jose, California, United States (On-Site)
2 Months ago
Zuru - Marketing Manager (FMCG/CPG)

Zuru

Auckland, Auckland, New Zealand (On-Site)
3 Months ago
Western Digital - Network Engineer 3

Western Digital

Bayan Lepas, Penang, Malaysia (On-Site)
2 Days ago
Inworld AI - Staff Platform Engineer

Inworld AI

Vancouver, British Columbia, Canada (On-Site)
7 Hours ago
Google - Product Manager, Wearable Device

Google

New Taipei, New Taipei City, Taiwan (On-Site)
2 Weeks ago

Get notifed when new similar jobs are uploaded

Jobs in Worldwide

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

DevOps Jobs

Argus Labs - Site Reliability Engineer (LATAM)

Argus Labs

(Remote)
1 Month ago
Google - Cloud Technical Solutions Engineer, Workspace

Google

Tokyo, Japan (On-Site)
1 Week ago
Nintendo - DevOps Engineer (Site Reliability)

Nintendo

Redmond, Washington, United States (Hybrid)
2 Weeks ago
Rackspace Technology - Cloud Architect

Rackspace Technology

India (Remote)
1 Month ago
Google - Customer Engineer, SAP, Google Cloud

Google

Addison, Texas, United States (On-Site)
2 Weeks ago
Google - ISV Specialist Partner Engineer IV, Data Management

Google

Fairburn, Georgia, United States (On-Site)
1 Week ago
SparkCognition - Senior DevOps Engineer

SparkCognition

Bengaluru, Karnataka, India (On-Site)
7 Months ago
Google - Senior Software Engineer, Turn-up Site Reliability Engineering

Google

Dublin, County Dublin, Ireland (On-Site)
2 Weeks ago
Interactive Brokers - Senior Platform Engineer - Design

Interactive Brokers

Fort Lauderdale, Florida, United States (Hybrid)
6 Months ago
Info Stretch - Senior Engineer

Info Stretch

Mumbai, Maharashtra, India (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

About The Company

White Hat Gaming is a state-of-the-art iGaming platform providing a secure, scalable and flexible modular Casino and Sportsbook Player Account Management solution.

View All Jobs

Get notified when new jobs are added by White Hat Gaming

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug