Systems Engineer II - Infrastructure

4 Hours ago • 5 Years + • DevOps • Undisclosed

About the job

Job Description

As a Systems Engineer II on the Infrastructure Reliability team at Riot Games, you'll manage the production infrastructure supporting all their games. Responsibilities include designing, building, and maintaining cloud infrastructure (AWS), troubleshooting complex issues, advancing monitoring and observability platforms, and ensuring system uptime and SLAs. You'll work with IaC (Terraform), CI/CD pipelines, containerization (Docker, Kubernetes), and scripting languages (Python, Golang). The role demands expertise in troubleshooting, incident response, and automation, along with collaboration and mentoring within the team. You'll be part of an on-call rotation to resolve production issues and contribute to continuous improvement initiatives, ultimately enhancing the player experience.
Must have:
  • 5+ years infrastructure experience
  • AWS expertise (EC2, VPC, S3, IAM)
  • Kubernetes/ECS experience
  • Terraform or similar IaC experience
  • Troubleshooting & incident response skills
  • CI/CD pipeline experience
  • Python/Golang scripting proficiency
  • Docker & Kubernetes expertise
Good to have:
  • Pulumi experience
  • Network engineering expertise
  • Team leadership and mentorship experience
Perks:
  • Open paid time off
  • Flexible work schedules
  • Medical, dental, and life insurance
  • Parental leave
  • 401k with company match

As a Senior Systems Engineer on the Infrastructure Reliability team, you will manage the production infrastructure that supports all our games. You will collaborate closely with cross-functional teams to design, build, and maintain cloud infrastructure that underpins the foundational services we deliver. This role involves working on new infrastructure as well as iterating on existing services to deliver the best possible experience for our players.

We’re looking for hands-on experts who can join an on-call rotation to troubleshoot complex issues while continuously learning and identifying opportunities to improve our services, yourself, and your peers. Your ability to write clean, efficient, and reusable code will be critical in automating processes, scaling systems, and driving continuous improvement.

Responsibilities: 

  • Solve complex challenges, diagnosing and resolving production issues across large-scale, globally distributed systems.
  • Advance our monitoring and observability platforms, driving innovation that keeps our infrastructure visible, actionable, and resilient.
  • Troubleshoot incidents and design resilient solutions to maintain uptime and meet SLAs, continually evolving our infrastructure to improve reliability and adaptability.
  • Expand and optimize our cloud footprint, enhancing the scalability, reliability, and efficiency of our cloud environment.
  • Collaborate with and elevate your team by sharing knowledge, mentoring peers, and fostering a culture of continuous learning and growth.

Required Qualifications:

  • 5+ years of experience in infrastructure delivery, SRE or Operations for large-scale systems.
  • Expertise in the AWS ecosystem, including infrastructure services (e.g., EC2, VPC, S3, IAM), container orchestration with Kubernetes (EKS) or ECS, and automation and monitoring (e.g., CloudWatch, Lambda).
  • Infrastructure as Code (IaC): Hands-on experience with Terraform or similar for infrastructure provisioning and configuration management. Knowledge of Pulumi for cloud infrastructure in code is a plus.
  • Troubleshooting and Incident Response: Skilled at troubleshooting live incidents, with a proactive approach to minimizing downtime and service impact. Familiarity with Root Cause Analysis (RCA) processes to identify, document, and drive long-term solutions to recurring issues.
  • CI/CD Expertise: Proven experience working with CI/CD pipelines, including tools like Jenkins and GitHub Actions, emphasizing deployment reliability and automation.
  • Automation and Scripting: Proficiency in scripting and programming languages like Python and Golang to drive automation, manage deployments, and create tooling.
  • Containerization: Expertise in container management and orchestration with Docker and Kubernetes, and experience designing robust microservices infrastructure. Strong understanding of configuration formats such as JSON and YAML and their application in IaC, Kubernetes manifests, and other deployment files.
  • Adaptability: Ability to quickly adopt and adapt to new technologies, frameworks, and cloud-native tools to solve complex problems.

Desired Qualifications: 

  • Proven experience leading and mentoring a team of engineers, fostering collaboration and technical growth.
  • Solid foundation in network engineering principles and protocols, with hands-on experience in designing and troubleshooting networked systems – a significant advantage.

For this role, you'll find success through craft expertise, a collaborative spirit, and decision-making that prioritizes your fellow Rioters, who are the customers of your work. Being a dedicated fan of games is not necessary for this position!

Our Perks:

Riot focuses on work/life balance, shown by our open paid time off policy and other perks such as flexible work schedules. We offer medical, dental, and life insurance, parental leave for you, your spouse/domestic partner, and children, and a 401k with company match. Check out our for more information.

Riot Games fosters a player and workplace experience that values teamwork embodied by the and . Our culture embraces differences as a strength, and our values are the guiding principles for how we approach work. We are committed to putting diversity and inclusion (D&I) at the center of everything we do, and promoting a fair and collaborative culture where Rioters treat one another with dignity and respect. We encourage you to read more about our value of and our ongoing work to build the .

It’s our policy to provide equal employment opportunity for all applicants and members of Riot Games, Inc. Riot Games makes reasonable accommodations for handicapped and disabled Rioters and does not unlawfully discriminate on the basis of race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, handicap, veteran status, marital status, criminal history, or any other category protected by applicable federal and state law. We consider for employment all qualified applicants, including those with criminal histories, in a manner consistent with applicable federal, state and local law, including the California Fair Chance Act, the City of Los Angeles Fair Chance Initiative for Hiring Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, the San Francisco Fair Chance Ordinance, and the Washington Fair Chance Act.

Per the Los Angeles County Fair Chance Ordinance, the following core duties may create a basis for disqualifying candidates with relevant criminal histories:

  • Safeguarding confidential and sensitive Company data
  • Communication with others, including Rioters and third parties such as vendors, and/or players, including minors
  • Accessing Company assets, secure digital systems, and networks
  • Ensuring a safe interactive environment for players and other Rioters

These duties are directly related to essential operations, safety, trust, and compliance obligations within our organization. Please note that job duties may evolve based on business needs and additional responsibilities may be assigned as necessary to maintain operational efficiency and security. 


About The Company

Riot Games is a video game developer, publisher, and esports tournament organizer best known for League of Legends.

