Staff Software Engineer - Infrastructure Reliability

4 Hours ago • 7 Years + • DevOps • Undisclosed

About the job

Job Description

As a Staff Software Engineer on the Infrastructure Reliability team at Riot Games, you'll ensure the scalability, availability, and performance of game infrastructure. This role requires strong coding skills, automation passion, and a focus on reliability engineering. Responsibilities include automating infrastructure management, building CI/CD pipelines, architecting cloud-based (AWS, GCP) and hybrid infrastructure solutions, designing and implementing infrastructure systems, mentoring junior engineers, and providing thought leadership. You'll work with Kubernetes, Docker, Terraform, and other tools to build robust and maintainable systems.
Must have:
  • 7+ years software engineering experience
  • AWS expertise (Lambda, API Gateway, EKS, S3)
  • Automation & scripting (Python, Golang)
  • IaC (Terraform, Cloudformation)
  • CI/CD (Jenkins, Harness, GitHub Actions)
  • Containerization (Docker, Kubernetes)
  • Monitoring & Logging tools (Prometheus, Grafana)
  • Team leadership & mentorship
Good to have:
  • GCP knowledge
  • Java knowledge
  • Pulumi experience
  • CDN, WAF, AWS firewall experience
  • Database (SQL, NoSQL) knowledge
  • Networking foundations
Perks:
  • Open paid time off policy
  • Flexible work schedules
  • Medical, dental, and life insurance
  • Parental leave
  • 401k with company match

Riot Engineers bring deep knowledge of specific technical areas but also value the chance to work in many broader domains. As a Software Engineer, you’ll also dive into projects that focus on team cohesiveness and cross-team objectives. You’ll lead without authority and provide other engineers with a clear illustration of extraordinary engineering.

As a Staff Software Engineer on the Infrastructure Reliability team, you will be a critical part of our efforts to ensure the scalability, availability, and performance of our game infrastructure. This role demands strong coding skills, a passion for automation, and a focus on reliability engineering to deliver robust and maintainable systems. You will work on implementing infrastructure as code, developing self-healing systems, and creating tools to enhance observability and streamline troubleshooting for core infrastructure services. This role typically combines technical expertise with leadership responsibilities and requires a strong understanding of distributed systems, DevOps practices, and software development.

Responsibilities: 

  • Automation & DevOps: Drive the automation of infrastructure management, deployment pipelines, and system monitoring. Build and maintain CI/CD pipelines to ensure efficient delivery of code to production.
  • Cloud & On-Premises Management: Architect, implement, and manage cloud-based (AWS, GCP) and hybrid infrastructure solutions. Oversee container orchestration using Kubernetes, Docker, or similar technologies
  • Infrastructure Design & Development: Design and implement infrastructure systems to support large-scale, high-availability services. Develop and maintain tools for infrastructure provisioning, monitoring, and management.
  • Technical Leadership: Mentor and guide junior engineers, fostering a culture of collaboration and technical excellence. Provide thought leadership on infrastructure trends and technologies to influence the organization’s roadmap.

Required Qualifications:

  • 7+ years of experience in software engineering supporting large-scale infrastructure
  • Expertise in the public cloud:  AWS ecosystem, including serverless services (e.g., Lambda, API Gateway), container orchestration with Kubernetes (EKS), and foundational services (e.g., S3, VPC, EBS, Firewalls). Knowledge of GCP is plus
  • Automation and Scripting: Proficiency in scripting and programming languages like Python and Golang to drive automation, manage deployments, and create tooling. Knowledge of Java is plus
  • Infrastructure as Code (IaC): Extensive hands-on experience with Terraform, Cloudformation or similar infrastructure provisioning and configuration management. Knowledge of Pulumi for cloud infrastructure in code is a plus.
  • CI/CD Expertise: Proven experience working with CI/CD pipelines including tools like Jenkins,Harness and GitHub actions, emphasizing deployment reliability and automation.
  • Containerization: Expertise in container management and orchestration with Docker and Kubernetes, and experience designing robust microservices infrastructure. Strong understanding of configuration formats such as JSON and YAML and their application in IaC, Kubernetes manifests, and other deployment files.
  • Tools and Systems : Familiarity with monitoring and logging tools (e.g., Prometheus, Grafana, ELK Stack) and deep understanding of distributed systems, networking, and storage solutions
  • Adaptability: Ability to quickly adopt and adapt to new technologies, frameworks, and cloud-native tools to solve complex problems.
  • Team Leadership: Proven experience in guiding delivery goals across teams, advocating for best practices, and driving alignment on cross-initiative projects and initiatives.

Desired Qualifications: 

  • Proven experience leading and mentoring a team of engineers, fostering collaboration and technical growth.
  • Good understanding of CDN, WAF and AWS firewalls.
  • Familiarity with databases (SQL and NoSQL) and networking foundations.

For this role, you'll find success through craft expertise, a collaborative spirit, and decision-making that prioritizes your fellow Rioters, who are the customers of your work. Being a dedicated fan of games is not necessary for this position!

Our Perks:

Riot focuses on work/life balance, shown by our open paid time off policy and other perks such as flexible work schedules. We offer medical, dental, and life insurance, parental leave for you, your spouse/domestic partner, and children, and a 401k with company match. Check out our for more information.

Riot Games fosters a player and workplace experience that values teamwork embodied by the and . Our culture embraces differences as a strength, and our values are the guiding principles for how we approach work. We are committed to putting diversity and inclusion (D&I) at the center of everything we do, and promoting a fair and collaborative culture where Rioters treat one another with dignity and respect. We encourage you to read more about our value of and our ongoing work to build the .

It’s our policy to provide equal employment opportunity for all applicants and members of Riot Games, Inc. Riot Games makes reasonable accommodations for handicapped and disabled Rioters and does not unlawfully discriminate on the basis of race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, handicap, veteran status, marital status, criminal history, or any other category protected by applicable federal and state law. We consider for employment all qualified applicants, including those with criminal histories, in a manner consistent with applicable federal, state and local law, including the California Fair Chance Act, the City of Los Angeles Fair Chance Initiative for Hiring Ordinance, the Los Angeles County Fair Chance Ordinance for Employers, the San Francisco Fair Chance Ordinance, and the Washington Fair Chance Act.

Per the Los Angeles County Fair Chance Ordinance, the following core duties may create a basis for disqualifying candidates with relevant criminal histories:

  • Safeguarding confidential and sensitive Company data
  • Communication with others, including Rioters and third parties such as vendors, and/or players, including minors
  • Accessing Company assets, secure digital systems, and networks
  • Ensuring a safe interactive environment for players and other Rioters

These duties are directly related to essential operations, safety, trust, and compliance obligations within our organization. Please note that job duties may evolve based on business needs and additional responsibilities may be assigned as necessary to maintain operational efficiency and security. 

View Full Job Description

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

Riot Games is a video game developer, publisher, and esports tournament organizer best known for League of Legends.

California, United States (On-Site)

Los Angeles, California, United States (On-Site)

Los Angeles, California, United States (On-Site)

Los Angeles, California, United States (On-Site)

Shanghai, Shanghai, China (On-Site)

Berlin, Berlin, Germany (On-Site)

Dublin, County Dublin, Ireland (On-Site)

Los Angeles, California, United States (On-Site)

Los Angeles, California, United States (On-Site)

Los Angeles, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Riot Games

Similar Jobs

Anthology  Inc  - Platform Engineer II

Anthology Inc , Colombia (Remote)

CloudHire - Angular NestJS Developer

CloudHire, India (Remote)

Glean - SRE Manager (India)

Glean, India (On-Site)

NetSPI - Lead DevOps Engineer

NetSPI, India (On-Site)

Microsoft - Software Engineer

Microsoft, India (On-Site)

Next Level Business Services - CI/CD with force.com

Next Level Business Services, United States (On-Site)

Zscaler - Senior Backend Engineer

Zscaler, India (Hybrid)

Luxoft - DevOps Engineer

Luxoft, Switzerland (On-Site)

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Warner Bros Games - Software Engineer II

Warner Bros Games, (Hybrid)

Swiss Re - Senior Site Reliability Engineer

Swiss Re, India (On-Site)

Ubisoft - Machine Learning Deployment Developer

Ubisoft, Canada (On-Site)

bosh group india - Infra Automation Expert

bosh group india, India (On-Site)

Futurum Technology  - DevOps Engineer

Futurum Technology , Poland (On-Site)

Drivetrain - SDE (Automation & Quality Focus)

Drivetrain, India (Remote)

Get notifed when new similar jobs are uploaded

Jobs in Los Angeles, California, United States

Hedra - Lead Full-Stack Engineer

Hedra, United States (On-Site)

The Walt Disney Company - Sr Software Engineer (Front End/JavaScript)

The Walt Disney Company, United States (On-Site)

Nintendo - CONTRACT - Associate Account Administrator

Nintendo, United States (Hybrid)

Notion - Software Engineer, Data Platform

Notion, United States (On-Site)

Valve corporation - Steam Database Administrator

Valve corporation, United States (On-Site)

The Walt Disney Company - Network Operations II (Night Shift)

The Walt Disney Company, United States (On-Site)

IGN - Senior Paid Social Media Manager

IGN, United States (Hybrid)

Netflix - Business Partner, Finance Procurement Operations

Netflix, United States (On-Site)

Power Integrations - Senior Reliability Engineer

Power Integrations, United States (On-Site)

Get notifed when new similar jobs are uploaded

DevOps Jobs

Keywords Studios (Player Support) - DevSecOps Engineer II

Keywords Studios (Player Support), India (On-Site)

Microsoft - Senior Research Data and Service Engineer

Microsoft, United States (On-Site)

ION - Cloud Engineer Kubernetes

ION, Italy (Hybrid)

Playtika - Senior DATA/AI SRE Engineer

Playtika, Poland (On-Site)

Interactive Brokers - Senior Systems Engineer- Microsoft M365/Active Directory

Interactive Brokers, United States (Hybrid)

Nagarro - Senior Engineer, DevOps

Nagarro, India (Remote)

PlayStation Global - Staff Service Reliability Engineer

PlayStation Global, Germany (On-Site)

Get notifed when new similar jobs are uploaded