Service Reliability Senior Administrator

4 Months ago • 2 Years + • Administrative

About the job

Job Description

Service Reliability Senior Administrator required with 2+ years of experience in incident management, ITIL processes and distributed systems troubleshooting.
Must have:
  • Incident Management
  • ITIL Processes
  • Distributed Systems
  • Troubleshooting Skills
Good to have:
  • Relational Databases
  • CI/CD Pipelines
  • Container Ecosystems
  • AWS Cloud Services
Perks:
  • Full Health Insurance
  • Retirement Benefits
Not hearing back from companies?
Unlock the secrets to a successful job application and accelerate your journey to your next opportunity.

Established in 2020, Riot Singapore Studio has been expanding our footprint in Asia and accelerating our talent growth to develop games that deliver great experiences to our players. Our mission is to “scale Riot’s games to hyper-serve players.”

We partner with our LA headquarters on game development for League of Legends, Teamfight Tactics, VALORANT and 2XKO. The Singapore Studio is seeking talented, passionate craft experts with backgrounds in all areas of game development to build games that make it better to be a player.

We’re focused on working together to promote individual autonomy, ownership, collaboration, and inclusivity, so everyone can be their best while we boldly pursue games.ip, collaboration, and inclusivity, so everyone can be their best while we boldly pursue games.

That's where you come in.

The Riot Operations Center (ROC) manages the 24x7 monitoring and response components of Riot's player-facing services. We are the first line of defense when things go wrong with any of our live services. We leverage technical familiarity with best-practice processes to rapidly remediate incidents. The team helps to create and mentor other Riot teams on best practice in alerting, monitoring, and operational processes.

As a Service Reliability Senior Administrator, you will work closely with the Live Operations team and Riot globally to establish and maintain a high-performing and highly available game service for players. You will monitor and support all aspects of LIVE production environments, development environments, and general system needs. Your technical skills and grasp of system integration will help you diagnose and communicate potential issues to Rioters and the community, improving the quality of the player experience. You will be a craft expert in operational and triaging skills. You can also be involved in projects which help contribute to improving overall service quality in the incident management and observability problem spaces.

Responsibilities:

  • Triage and investigation of live incidents
  • Execute technical return to service actions in a fast-paced, distributed systems environment specifically microservices to quickly restore service and protect player experience
  • Monitor the health of Riot’s distributed services using observability tools, identify gaps with alerting, runbook steps, processes or tools
  • Runbook execution and maintenance to keep documentation up to date
  • Onboarding new team members
  • Provide support, coordination during major launches, events and release deployments
  • Contribute to project work with some guidance to develop automation scripts, utilities and new processes to continuously improve the incident management process
  • Document details of incident response as needed to identify problems and improve overall incident management/response
  • Participate in post-incident RCA meetings as required

Required Qualifications:

  • Computer Science/IT Systems/Information Technology diploma or equivalent
  • 2+ years of Service Reliability Administration or equivalent role (System Analyst, System Administrator/Engineer, Live Operations, Network Administrator, NOC Engineer etc)
  • Experience with incident management and have good understanding of ITIL processes
  • Familiarity with the core concepts of operating systems, networking, SDLC and Agile methodologies
  • Good troubleshooting skills with triaging incidents in a high-capacity, high-availability and highly distributed environment
  • Experience with the following tools/platforms:
    • Monitoring solutions eg: Datadog, NewRelic, Nagios, Elastic Search, Grafana
    • Event management tools eg: BigPanda, Moogsoft
    • ITIL-based Ticketing systems eg: ServiceNow, JIRA

Desired Qualifications:

  • Computer Science/IT Systems/Information Technology degree or equivalent
  • Understand relational databases like MySQL, CI/CD pipelines, especially Jenkins
  • Experience working on deployments in a live environment is a plus
  • Experience working in container-based ecosystems like docker and with a container scheduler like Kubernetes, Amazon EKS/ECS or GKE
  • AWS Cloud Services experience/certification/training or equivalent, Linux+ and Network+, or equivalents
  • Experience building automation scripts/utilities/jobs using either Python, Powershell, JavaScript or Bash
  • Familiarity with Site Reliability Engineering (SRE) principles and best practices

Our Perks:

  • Full health insurance for you, your spouse, and children
  • Open paid time off
  • Retirement benefits with company matching
  • Life insurance, parental leave, plus short-term and long-term disability
  • Play Fund so you can broaden and deepen your knowledge of our players and community through games
  • We will double down on your donations of time and money to non-profits

For this role, you'll find success through craft expertise, a collaborative spirit, and decision-making that prioritizes the delight of players. We will certainly be looking at your past studies and experience, but for this role, we also look for dedicated people with a personal relationship with games. If you embody player empathy and care about the experiences of players, this could be the role for you!

===

Don’t forget to include a resume and cover letter. We receive many applications, but we’ll notice a fun, well-written intro that shows us you Dare to Dream and Execute with Excellence.

View Full Job Description

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

Riot Games is a video game developer, publisher, and esports tournament organizer best known for League of Legends.

Los Angeles, California, United States (On-Site)

Los Angeles, California, United States (On-Site)

Los Angeles, California, United States (On-Site)

Los Angeles, California, United States (On-Site)

Los Angeles, California, United States (On-Site)

Los Angeles, California, United States (On-Site)

Los Angeles, California, United States (On-Site)

Shanghai, Shanghai, China (On-Site)

Los Angeles, California, United States (On-Site)

Los Angeles, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Riot Games

Similar Jobs

LightSpeed Studios - Uncapped Games - Senior User Acquisition Manager

LightSpeed Studios, United States (Remote)

Chimera entertainment - (Senior) Project Lead Gaming industry (f/m/d)

Chimera entertainment, Germany (Hybrid)

Electronic Arts - Producer - EA Sports FC

Electronic Arts, Romania (On-Site)

Ubisoft - Social Media Specialist

Ubisoft, China (On-Site)

realworld one - IT Support / Helpdesk Intern

realworld one, United States (Hybrid)

The Walt Disney Company - Front Desk Agent - Full Time (English/Japanese Speaking), $31.44/Hour

The Walt Disney Company, United States (On-Site)

Avalanche Studios Group - Senior System Administrator

Avalanche Studios Group, Sweden (Hybrid)

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Jam City - Producer

Jam City, Canada (Hybrid)

Fortis Games - Senior Product Manager

Fortis Games, United Kingdom (On-Site)

Outscal - Product Operations (Ed-tech)

Outscal, India (On-Site)

G5 Games - Level Design Director

G5 Games, Kazakhstan (On-Site)

IO Interactive - Senior Live Ops Producer

IO Interactive, Sweden (Hybrid)

PlayStation Global - Senior Technical Product Manager, Partner Experiences

PlayStation Global, United Kingdom (Hybrid)

Supercell - Marketing Lead, Clash Royale

Supercell, Finland (On-Site)

Blizzard Entertainment - Senior Producer, Release Management | Diablo IV

Blizzard Entertainment, United States (Hybrid)

Riot Games - Principal Researcher

Riot Games, United States (On-Site)

Get notifed when new similar jobs are uploaded

Jobs in Singapore

Get notifed when new similar jobs are uploaded

Administrative Jobs

Egnyte - IT Support Specialist

Egnyte, United States (On-Site)

World Relief - Operations Specialist, Regional - 2024751

World Relief, United States (On-Site)

Nasdaq - Technical Analyst, Calypso, Fintech

Nasdaq, Australia (Hybrid)

Ubisoft - Office Administrator (Part-Time)

Ubisoft, Poland (On-Site)

Windriver - Senior Linux Field Application Engineer

Windriver, United States (Remote)

The Walt Disney Company - Desktop Systems Specialist

The Walt Disney Company, United States (Hybrid)

Keywords Studios (Player Support) - Active Directory / Identity Engineer

Keywords Studios (Player Support), United Kingdom (On-Site)

inveniolsi - SAP BTP Senior Consultant

inveniolsi, India (On-Site)

Get notifed when new similar jobs are uploaded