Service Reliability Analyst II (ITIL)

1 Hour ago • 2-4 Years • Operations

Job Summary

Job Description

The Service Reliability Analyst II at Riot Games' Process & Analytics team uses operational data to understand player experience and improve game operational health. Responsibilities include leading technical discussions on service reliability, conducting incident data audits, collecting and reporting system health metrics, performing data analysis to identify and address systemic issues, participating in on-call rotations, assisting with corrective actions for root cause analysis, and developing dashboards and reports. This role requires strong ITIL process knowledge, data analysis skills, and experience with various monitoring and data visualization tools. The ideal candidate possesses a deep understanding of IT infrastructure, software development life cycles, and system ownership within a multi-team environment.
Must have:
  • 2-4 years IT service management experience
  • Proficient in ITIL processes (Incident, Problem, Change, Release)
  • Strong data analysis skills (SQL, JQL, XQuery)
  • Experience with DataDog, NewRelic, Tableau
  • Incident response and root cause analysis
Good to have:
  • SRE experience
  • AWS Certifications
  • Bachelor's degree in CS/IT
  • Advanced data analysis proficiency
Perks:
  • Open paid time off policy
  • Flexible work schedules
  • Medical, dental, and life insurance
  • Parental leave
  • 401k with company match

Job Details

The Process & Analytics team focuses on using operational data to understand the player experience and provide that visibility to Riot. This team strives to collect, audit and use data to improve our games’ operational health; empowering game leadership to make data informed decisions to improve stability.  

As a Service Reliability Analyst II, you will work with teams across Riot to build and execute effective ITIL processes, measurements of service health, and a highly contextual picture of the player experience. Your tenacity and drive for continuous improvement will help you uncover problematic trends and push for their resolution, improving the quality of the player experience. You will be a craft master in operational process and telling compelling visual stories with data. Live Ops can look to you to improve ITIL process, answer tough operational questions through data, and uncover previously unknown anti-patterns harming the player experience. 

Responsibilities:

  • Lead and facilitate weekly technical discussions on service reliability with key product teams, ensuring alignment on operational goals and performance metrics.
  • Conduct thorough audits of incident data in collaboration with service owners to validate accuracy and ensure comprehensive reporting and analysis.
  • Collect, synthesize, and report on system health metrics for Riot's diverse infrastructure, utilizing advanced data collection methods and monitoring tools.
  • Perform in-depth analysis of operational data trends to identify and address systemic issues and optimize service performance.
  • Participate in on-call rotations to provide critical support and ensure rapid response to incidents, minimizing downtime and service disruptions.
  • Assist in tracking and coordinating corrective actions for root cause analysis, ensuring thorough resolution of underlying issues and continuous improvement of operational processes.
  • Develop and maintain dashboards and reports that provide insights into key operational performance metrics, assisting leaders with making data-driven decisions.

Required Qualifications: 

  • 2-4 years of hands-on experience in IT service management, data analysis, or technical operations, with a focus on maintaining and optimizing IT infrastructure.
  • Strong proficiency in incident, problem, change, and release management, with the ability to design and implement process flows using industry-standard methodologies.
  • Solid understanding of software development life cycles (SDLC) and how various components interact within larger ecosystems, ensuring seamless operation and scalability.  
  • Clear awareness of system and service ownership within a multi-team environment, including the effective use of APIs/SDKs and adherence to SLAs.
  • Deep enthusiasm for operations and technology, with a proactive approach to continuous improvement in system reliability and performance.
  • ITIL-based Ticketing Systems: In-depth experience with ServiceNow, JIRA, or similar platforms for tracking and managing IT service processes.
  • Experience with the following tools and technologies:
    • Data Visualization Tools: Advanced skills in Tableau, DataWrapper, and Excel for creating actionable insights from complex datasets.
    • Query Languages: Proficient in JQL, SQL, and XQuery for querying and manipulating data across various platforms.
    • Monitoring Solutions: Expertise in setting up and managing monitoring frameworks using tools like DataDog and NewRelic to ensure system health and performance.
    • Event Management Tools: Skilled in Event Correlation to improve Incident Response with tools such as Datadog, Big Panda or PagerDuty

Desired Qualifications:

  • 2+ years of specialized experience in Service Reliability Engineering (SRE) or equivalent roles such as Technical Release Manager, Process Owner, Live Operations Engineer, or Network Administrator.
  • Bachelor’s degree in Computer Science, IT Systems, Information Technology, or a closely related field, or equivalent professional experience.
  • Advanced data analysis and data insights proficiency, with the ability to derive actionable intelligence from large datasets.
  • Relevant certifications such as AWS Certified Solutions Architect, CompTIA Linux+, or CompTIA Network+, or equivalent credentials, are highly valued.
  • Demonstrated expertise in deploying and managing monitoring solutions such as DataDog and NewRelic to ensure system health and performance within complex environments

For this role, you'll find success through craft expertise, a collaborative spirit, and decision-making that prioritizes your fellow Rioters, who are the customers of your work. Being a dedicated fan of games is not necessary for this position!

 

Our Perks:

Riot has a focus on work/life balance, shown by our open paid time off policy, in addition to other perks such as flexible work schedules. We offer medical, dental, and life insurance, parental leave for you, your spouse/domestic partner and children, and a 401k with company match. Check out our for more information.

Riot Games fosters a player and workplace experience that values teamwork embodied by the and . Our culture embraces differences as a strength, and our values are the guiding principles for how we approach work. We are committed to putting diversity and inclusion (D&I) at the center of everything we do, and promoting a fair and collaborative culture where Rioters treat one another with dignity and respect. We encourage you to read more about our value of and our ongoing work to build the .

 

It’s our policy to provide equal employment opportunity for all applicants and members of Riot Games, Inc. Riot Games makes reasonable accommodations for handicapped and disabled Rioters and does not unlawfully discriminate on the basis of race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, handicap, veteran status, marital status, criminal history, or any other category protected by applicable federal and state law. We consider for employment all qualified applicants, including those with criminal histories, in a manner consistent with applicable federal, state and local law, including the California Fair Chance Act, the City of Los Angeles Fair Chance Initiative for Hiring Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, the San Francisco Fair Chance Ordinance, and the Washington Fair Chance Act.

Per the Los Angeles County Fair Chance Ordinance, the following core duties may create a basis for disqualifying candidates with relevant criminal histories:

  • Safeguarding confidential and sensitive Company data
  • Communication with others, including Rioters and third parties such as vendors, and/or players, including minors
  • Accessing Company assets, secure digital systems, and networks
  • Ensuring a safe interactive environment for players and other Rioters

These duties are directly related to essential operations, safety, trust, and compliance obligations within our organization. Please note that job duties may evolve based on business needs and additional responsibilities may be assigned as necessary to maintain operational efficiency and security. 

Similar Jobs

Sphere Entertainment Co - Senior Manager Data Science

Sphere Entertainment Co

Las Vegas, Nevada, United States (On-Site)
2 Weeks ago
Google - Software Engineer III, Infrastructure, Google TV

Google

San Jose, California, United States (On-Site)
5 Months ago
Oh Bibi - Game Economy Designer

Oh Bibi

Paris, Île-de-France, France (Hybrid)
2 Weeks ago
ByteDance - Financial Risk Strategy Expert - Global Payment

ByteDance

Singapore (On-Site)
5 Months ago
Easygo - Lead Data Analyst - Kick

Easygo

Melbourne, Victoria, Australia (On-Site)
1 Month ago
Voodoo - Operations Manager

Voodoo

Paris, Île-de-France, France (On-Site)
2 Weeks ago
Keywords Studios - Customer Support Team Lead - Remote

Keywords Studios

Suginami City, Tokyo, Japan (Remote)
3 Weeks ago
Microsoft - Director of Technical Support Engineering

Microsoft

(Remote)
1 Day ago
Tesla - Office Coordinator

Tesla

Saint-Ouen-sur-Seine, Île-de-France, France (On-Site)
2 Months ago
Keywords Studios - Player Engagement Operations Manager

Keywords Studios

Pasig, Metro Manila, Philippines (Hybrid)
7 Hours ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

NVIDIA - Solutions Architect, AI and ML

NVIDIA

Redmond, Washington, United States (On-Site)
1 Week ago
Scientific Games  - Supervisor, Logistics

Scientific Games

Duluth, Georgia, United States (On-Site)
2 Weeks ago
PlayStation Global - Senior Business Systems Analyst (Contract)

PlayStation Global

San Mateo, California, United States (On-Site)
2 Months ago
Voodoo - Senior Data Analyst

Voodoo

Paris, Île-de-France, France (Hybrid)
2 Weeks ago
ByteDance - Data Management and Strategy Intern

ByteDance

Taguig, Metro Manila, Philippines (On-Site)
2 Weeks ago
PwC - ETIC, Data Solution Architect - Senior Manager

PwC

Cairo, Cairo Governorate, Egypt (On-Site)
5 Months ago
Gallagher - Data Scientist

Gallagher

Bengaluru, Karnataka, India (On-Site)
5 Months ago
ARHS - Intermediate Application Developer

ARHS

Valletta, Malta (On-Site)
5 Months ago
ClinDCast - GenAI Application Lead

ClinDCast

Austin, Texas, United States (Remote)
8 Months ago
Riot Games - Senior User Researcher

Riot Games

Dublin, County Dublin, Ireland (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Los Angeles, California, United States

Rackspace Technology - Presales Enterprise Architect - Multi Service Line

Rackspace Technology

San Antonio, Texas, United States (Remote)
2 Weeks ago
Universal Music - Senior Manager, Controls Assurance

Universal Music

California, United States (On-Site)
1 Month ago
Valve corporation - Steam Software Engineer

Valve corporation

Bellevue, Washington, United States (On-Site)
5 Months ago
Titmouse - Pipeline Technical Director

Titmouse

Los Angeles, California, United States (On-Site)
1 Month ago
The Walt Disney Company - Sr Software Engineer (Rust Developer)

The Walt Disney Company

Seattle, Washington, United States (On-Site)
5 Months ago
Zoox - Senior/Staff Software Engineer - Simulation Infrastructure

Zoox

Seattle, Washington, United States (Hybrid)
5 Months ago
Evolution - Studio Game Presenter (Server/Waitress Alternative)

Evolution

Trumbull, Connecticut, United States (On-Site)
10 Months ago
ByteDance - Software Developer Graduate (Routing Verification & Emulation)

ByteDance

San Jose, California, United States (On-Site)
2 Weeks ago
The Walt Disney Company - Sr Social Media Manager - Youth Audience Expansion

The Walt Disney Company

Bristol, Connecticut, United States (On-Site)
2 Weeks ago
CharacterAI - MBA Intern, Product Strategy & Operations

CharacterAI

San Francisco, California, United States (On-Site)
1 Week ago

Get notifed when new similar jobs are uploaded

Operations Jobs

Voodoo - Operations Manager

Voodoo

Paris, Île-de-France, France (On-Site)
2 Weeks ago
Paytm - Area Sales Manager- Deputy Manager - Bellary

Paytm

Ballari, Karnataka, India (On-Site)
4 Months ago
Inspired Entertainment - Seasonal Arcade Host

Inspired Entertainment

(On-Site)
2 Weeks ago
Trek - Future Store Manager - Portland Area

Trek

Portland, Oregon, United States (On-Site)
2 Months ago
People Can Fly - Live Operations Technician

People Can Fly

Montreal, Quebec, Canada (Remote)
6 Days ago
Sporty Group - IN Associate - Payment Operations Support

Sporty Group

Mumbai, Maharashtra, India (On-Site)
4 Months ago
DraftKings - Operations Associate

DraftKings

Reynoldsburg, Ohio, United States (On-Site)
3 Months ago
Keywords Studios - Player Support Agent - French/English

Keywords Studios

Silesian Voivodeship, Poland (Hybrid)
2 Weeks ago
Hapag-Lloyd AG - Service Delivery and Project Manager IT Support Services

Hapag-Lloyd AG

Chennai, Tamil Nadu, India (On-Site)
5 Months ago
Bragg - Compliance Operations Manager

Bragg

London, England, United Kingdom (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

About The Company

Riot Games is a video game developer, publisher, and esports tournament organizer best known for League of Legends.

Los Angeles, California, United States (On-Site)

Shanghai, Shanghai, China (On-Site)

Los Angeles, California, United States (On-Site)

Los Angeles, California, United States (On-Site)

Los Angeles, California, United States (On-Site)

Los Angeles, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Riot Games

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug