Staff Software Engineer - Infrastructure Reliability

3 Months ago • 7 Years + • Devops • $192,500 PA - $269,400 PA

Job Summary

Job Description

As a Staff Software Engineer on the Infrastructure Reliability team, you will be essential in ensuring the scalability, availability, and performance of game infrastructure. This role involves coding, automation, and a focus on reliability engineering to create robust systems. You'll work on implementing infrastructure as code, developing self-healing systems, and creating tools for observability and troubleshooting. Responsibilities include automating infrastructure management, managing cloud and on-premises solutions, designing infrastructure, and mentoring junior engineers.
Must have:
  • 7+ years of software engineering experience
  • Expertise in AWS ecosystem and Kubernetes
  • Proficiency in Python and Golang scripting
  • Hands-on experience with Terraform or CloudFormation
  • Experience with CI/CD pipelines (Jenkins, Harness)
  • Expertise in container management with Docker and Kubernetes
  • Familiarity with monitoring and logging tools
  • Ability to quickly adapt to new technologies
  • Experience in guiding delivery goals across teams
Good to have:
  • Experience leading and mentoring a team of engineers
  • Good understanding of CDN, WAF and AWS firewalls
  • Familiarity with databases (SQL and NoSQL) and networking foundations
Perks:
  • Open paid time off
  • Flexible work schedules
  • Medical, dental, and life insurance
  • Parental leave
  • 401k with company match

Job Details

Riot Engineers bring deep knowledge of specific technical areas but also value the chance to work in many broader domains. As a Software Engineer, you’ll also dive into projects that focus on team cohesiveness and cross-team objectives. You’ll lead without authority and provide other engineers with a clear illustration of extraordinary engineering.

As a Staff Software Engineer on the Infrastructure Reliability team, you will be a critical part of our efforts to ensure the scalability, availability, and performance of our game infrastructure. This role demands strong coding skills, a passion for automation, and a focus on reliability engineering to deliver robust and maintainable systems. You will work on implementing infrastructure as code, developing self-healing systems, and creating tools to enhance observability and streamline troubleshooting for core infrastructure services. This role typically combines technical expertise with leadership responsibilities and requires a strong understanding of distributed systems, DevOps practices, and software development.

Responsibilities: 

  • Automation & DevOps: Drive the automation of infrastructure management, deployment pipelines, and system monitoring. Build and maintain CI/CD pipelines to ensure efficient delivery of code to production.
  • Cloud & On-Premises Management: Architect, implement, and manage cloud-based (AWS, GCP) and hybrid infrastructure solutions. Oversee container orchestration using Kubernetes, Docker, or similar technologies.
  • Infrastructure Design & Development: Design and implement infrastructure systems to support large-scale, high-availability services. Develop and maintain tools for infrastructure provisioning, monitoring, and management.
  • Technical Leadership: Mentor and guide junior engineers, fostering a culture of collaboration and technical excellence. Provide thought leadership on infrastructure trends and technologies to influence the organization’s roadmap.

Required Qualifications:

  • 7+ years of experience in software engineering supporting large-scale infrastructure
  • Public Cloud: Expertise in the AWS ecosystem, including serverless services (e.g., Lambda, API Gateway), container orchestration with Kubernetes (EKS), and foundational services (e.g., S3, VPC, EBS, firewalls). Knowledge of GCP is a plus.
  • Automation and Scripting: Proficiency in scripting and programming languages like Python and Golang to drive automation, manage deployments, and create tooling. Knowledge of Java is a plus.
  • Infrastructure as Code (IaC): Extensive hands-on experience with Terraform, CloudFormation, or similar infrastructure provisioning and configuration management tools. Knowledge of Pulumi for defining cloud infrastructure in code is a plus.
  • CI/CD Expertise: Proven experience working with CI/CD pipelines, including tools like Jenkins, Harness, and GitHub Actions, emphasizing deployment reliability and automation.
  • Containerization: Expertise in container management and orchestration with Docker and Kubernetes, and experience designing robust microservices infrastructure. Strong understanding of configuration formats such as JSON and YAML and their application in IaC, Kubernetes manifests, and other deployment files.
  • Tools and Systems: Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack) and a deep understanding of distributed systems, networking, and storage solutions.
  • Adaptability: Ability to quickly adopt and adapt to new technologies, frameworks, and cloud-native tools to solve complex problems.
  • Team Leadership: Proven experience in guiding delivery goals across teams, advocating for best practices, and driving alignment on cross-team projects and initiatives.

Desired Qualifications: 

  • Proven experience leading and mentoring a team of engineers, fostering collaboration and technical growth.
  • Good understanding of CDN, WAF and AWS firewalls.
  • Familiarity with databases (SQL and NoSQL) and networking foundations.

For this role, you'll find success through craft expertise, a collaborative spirit, and decision-making that prioritizes your fellow Rioters, who are the customers of your work. Being a dedicated fan of games is not necessary for this position!

Our Perks:

Riot focuses on work/life balance, shown by our open paid time off policy and other perks such as flexible work schedules. We offer medical, dental, and life insurance, parental leave for you, your spouse/domestic partner, and children, and a 401k with company match. Check out our benefits pages for more information.

Riot Games fosters a player and workplace experience that values teamwork embodied by the Summoner's Code and Community Code. Our culture embraces differences as a strength, and our values are the guiding principles for how we approach work. We are committed to putting diversity and inclusion (D&I) at the center of everything we do, and promoting a fair and collaborative culture where Rioters treat one another with dignity and respect. We encourage you to read more about our value of thriving together and our ongoing work to build the most inclusive company in Gaming.

It’s our policy to provide equal employment opportunity for all applicants and members of Riot Games, Inc. Riot Games makes reasonable accommodations for handicapped and disabled Rioters and does not unlawfully discriminate on the basis of race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, handicap, veteran status, marital status, criminal history, or any other category protected by applicable federal and state law. We consider for employment all qualified applicants, including those with criminal histories, in a manner consistent with applicable federal, state and local law, including the California Fair Chance Act, the City of Los Angeles Fair Chance Initiative for Hiring Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, the San Francisco Fair Chance Ordinance, and the Washington Fair Chance Act.

Per the Los Angeles County Fair Chance Ordinance, the following core duties may create a basis for disqualifying candidates with relevant criminal histories:

  • Safeguarding confidential and sensitive Company data
  • Communication with others, including Rioters and third parties such as vendors, and/or players, including minors
  • Accessing Company assets, secure digital systems, and networks
  • Ensuring a safe interactive environment for players and other Rioters

These duties are directly related to essential operations, safety, trust, and compliance obligations within our organization. Please note that job duties may evolve based on business needs and additional responsibilities may be assigned as necessary to maintain operational efficiency and security. 

