Senior Site Reliability Engineer

1 Month ago • 3 Years + • Devops

Job Summary

Job Description

As a Senior Site Reliability Engineer (SRE) at Razer, you will join a team focused on building scalable, observable, and resilient systems within an AWS environment. Your responsibilities will include designing and maintaining Infrastructure as Code (IaC) solutions using tools like Terraform and CloudFormation. You will collaborate with various teams to ensure secure and reliable cloud infrastructure. You will also implement monitoring and alerting systems, drive incident response, and provide mentorship to junior engineers. The role involves automating tasks, troubleshooting complex issues, and participating in on-call support, with a shift from 5:00 PM to 2:00 AM (UTC+8).
Must have:
  • 3+ years experience in SRE or related roles.
  • Expertise with AWS Cloud Services.
  • Deep understanding of IaC using Terraform/CloudFormation.
  • Proficiency in a scripting language (Python, etc.).
  • Experience operating and troubleshooting Linux/Windows environments.
  • Experience with monitoring, alerting, and incident management.
  • Experience with Zero Downtime Deployments.
  • Strong understanding of DR.

Job Details

Joining Razer will place you on a global mission to revolutionize the way the world games. Razer is a place to do great work, offering you the opportunity to make an impact globally while working across a global team located across 5 continents. Razer is also a great place to work, providing you the unique, gamer-centric #LifeAtRazer experience that will put you in an accelerated growth, both personally and professionally.

Job Responsibilities :

We are seeking a skilled and driven Senior Site Reliability Engineer (SRE) to join our growing infrastructure and platform engineering team. The ideal candidate will have hands-on experience in Amazon Web Services (AWS), strong troubleshooting capabilities, and a passion for building scalable, observable, and resilient systems using modern Infrastructure as Code (IaC) and automation tools.

REQUIREMENTS:

  • Bachelor’s degree in Computer Science, Software Engineering, Information Technology, or a related field.
  • Minimum 3 years of experience in SRE, DevOps, cloud infrastructure, or system administration roles.
  • Hands-on expertise with AWS Cloud Services, including:
  • Compute & Containerization: EC2, Lambda, ECS, EKS, Auto Scaling
  • Networking: Load Balancers, VPC, Route 53, Security Groups, Firewalls
  • Storage & Databases: RDS, ElastiCache, Athena, S3
  • Messaging: SQS, SES
  • Deep understanding of Infrastructure as Code (IaC) tools such as Terraform and CloudFormation.
  • Proficiency in at least one programming/scripting language: Python, Node.js, Bash, Ruby, or related.
  • Experience operating and troubleshooting across Linux, Windows, and container-based environments.
  • Strong understanding of distributed systems, cloud networking (routers, switches), firewalls, DNS, and HTTP/TLS.
  • Experience implementing monitoring and alerting systems and working with incident management processes.
  • Experience with Zero Downtime Deployments, blue/green or canary deployments.
  • Familiarity with cost optimization and right-sizing AWS resources.
  • Exposure to multi-region, multi-account AWS architecture.
  • Understanding of API gateway, or edge networking (e.g., Akamai, CloudFront).

JOB DESCRIPTION:

  • Design, implement, and maintain Infrastructure as Code (IaC) solutions using Terraform and/or CloudFormation across multi-account AWS environments.
  • Collaborate with developers, architects, and DevOps teams to build scalable, secure, and observable cloud infrastructure.
  • Lead and participate in architecture design sessions, focusing on system reliability, scalability, security, and performance.
  • Implement and manage robust monitoring, alerting, and observability solutions (e.g., CloudWatch, Prometheus, ELK, Datadog).
  • Set and monitor Key Performance Indicators (KPIs) for system uptime, latency, throughput, and overall reliability.
  • Drive incident response processes, including coordination, triaging, resolution, documentation, and post-incident reviews (PIRs).
  • Supervise and mentor junior SREs and infrastructure engineers, fostering knowledge-sharing and team growth.
  • Collaborate across development, operations, and security teams to ensure secure and compliant deployments.
  • Automate manual tasks and workflows through scripting and tooling (Python, Node.js, Bash, Ruby, JSON/YAML).
  • Troubleshoot complex infrastructure issues across Linux, Windows, Docker, and cloud-native environments.
  • Provide IaC and CI/CD best practices to ensure repeatability, scalability, and compliance across all environments.
  • Provide on-call support, participate in incident rotations, and lead technical investigations during outages or degradations.
  • Strong understanding and experience for Disaster Recovery (DR).
  • Support from 5:00PM to 2:00AM (UTC+8) shift to ensure continuous of SRE coverage.
  • Undergo initial familiarization period during regular working hours before transitioning to the designated shift.
  • Provide support and solution handling to incident and tickets assigned.


 

Pre-Requisites :

Are you game?

Similar Jobs

The Walt Disney Company - Senior Software Engineer - Roku/Disney+

The Walt Disney Company

San Francisco, California, United States (On-Site)
2 Months ago
Veeam Software - Virtualization Backup Engineer

Veeam Software

North Sydney, New South Wales, Australia (On-Site)
3 Weeks ago
Aspire - Senior Software Architect

Aspire

Bengaluru, Karnataka, India (Hybrid)
1 Month ago
ISS Stoxx - Senior Software Engineer in .NET/Java and SQL (Oracle)

ISS Stoxx

Mumbai, Maharashtra, India (On-Site)
1 Month ago
FORTUNE - Senior Sales Enablement Strategist, Integrated Marketing

FORTUNE

New York, New York, United States (On-Site)
2 Months ago
Hitachi - Kubernetes Engineer

Hitachi

Pune, Maharashtra, India (On-Site)
8 Months ago
Capgemini - SRE Engineers

Capgemini

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Roblox - Senior Software Engineer, Continuous Integration (CI) Infrastructure

Roblox

San Mateo, California, United States (On-Site)
3 Days ago
Otherside Entertainment - Senior DevOps Engineer

Otherside Entertainment

United States (Remote)
2 Months ago
Google - Customer Solutions Engineer

Google

New York, New York, United States (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

hogarth - Senior Print Project Manager

hogarth

Manila, Metro Manila, Philippines (On-Site)
1 Month ago
Pipeworks - Technical Designer II - Level/Gameplay Implementation (UE5)

Pipeworks

Eugene, Oregon, United States (Remote)
1 Month ago
UXBERT Labs - AR/IoT Development Specialist

UXBERT Labs

Riyadh, Riyadh Province, Saudi Arabia (Hybrid)
5 Months ago
Activision - Producer - Live Ops, Call of Duty

Activision

Santa Monica, California, United States (On-Site)
2 Months ago
Capgemini - SAP BRIM Consultant

Capgemini

Bengaluru, Karnataka, India (On-Site)
3 Weeks ago
FICO - DevOps Engineering Enablement Lead Engineer

FICO

Bengaluru, Karnataka, India (Hybrid)
1 Year ago
Visa - Senior Site Reliability Engineer

Visa

Ashburn, Virginia, United States (Hybrid)
1 Month ago
bytedance - Traffic Access Architectural Engineer - Traffic Infrastructure

bytedance

Singapore (On-Site)
7 Months ago
Microsoft - Software Engineer: Microsoft Software and Systems Academy (MSSA)

Microsoft

Redmond, Washington, United States (On-Site)
2 Months ago
Accenture - Customer Contact Comms Associate

Accenture

Hyderabad, Telangana, India (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia

Intel  - Product Failure Analysis Engineer

Intel

Penang, Malaysia (Hybrid)
1 Month ago
bytedance - Partnership Manager - BytePlus

bytedance

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (On-Site)
2 Months ago
OKX - Senior Analyst, Customer Service Operations

OKX

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (On-Site)
1 Month ago
PwC - Experienced Associate - Forensics Services

PwC

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (On-Site)
9 Months ago
Veeam Software - Senior Inside Sales Representative

Veeam Software

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (On-Site)
1 Month ago
QS Quacquarelli Symonds  - Student Enquiry Officer

QS Quacquarelli Symonds

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (Hybrid)
2 Months ago
luxsoft - Murex Environment Management Consultant

luxsoft

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (On-Site)
1 Month ago
Informa Group - Manager, Media Planning

Informa Group

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (Hybrid)
3 Weeks ago
Xsolla - HR Generalist (Compliance)

Xsolla

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (On-Site)
2 Weeks ago
Headout - Account Executive

Headout

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Devops Jobs

Match Group - Senior ML Platform Engineer

Match Group

New York, New York, United States (Hybrid)
8 Months ago
Synechron - .NET Developer (Cloud, Front-End & Database Proficiency)

Synechron

Pune, Maharashtra, India (On-Site)
2 Weeks ago
London stock Exchange - Lead Engineer - Kubernetes

London stock Exchange

London, England, United Kingdom (On-Site)
1 Month ago
Google - Software Engineering Manager II, Site Reliability Engineering

Google

San Bruno, California, United States (On-Site)
2 Months ago
sunscrappers  - Infrastructure Engineer

sunscrappers

Warsaw, Masovian Voivodeship, Poland (On-Site)
1 Month ago
Journee - Senior Cloud Infrastructure Engineer

Journee

Berlin, Berlin, Germany (Hybrid)
8 Months ago
playrix  - Senior C++ Software Engineer (Build System)

playrix

Almaty, Almaty Region, Kazakhstan (Remote)
7 Months ago
PwC - Senior Associate Azure DevOps

PwC

Hyderabad, Telangana, India (On-Site)
2 Weeks ago
Luxoft - Solution Architect

Luxoft

New Delhi, Delhi, India (Remote)
7 Months ago

Get notifed when new similar jobs are uploaded

About The Company

At Razer, you'll be at the forefront of the most exciting industry in the world — gaming. Evolving forms of gaming require evolving forms of hardware, software and services. That’s where Razer comes in, offering innovative top-of-the-line products and services to allow gamers to fully immerse in the ultimate gaming experience.Getting onboard Razer will place you on a global mission to bring gamers closer to the games they love. Razer is a place to do great work, offering you the opportunity to be a part of a global team across 11 countries. Whether you are a hardcore evangelist who breathe life to the latest and greatest gaming gear or a behind-the-scene hero who runs our global operations, you are assured of a career-changing quest that transcends time zones and culture with one single spell: For Gamers. By Gamers.The journey towards phenomenal-ness won’t come easy. However, we will excel because gamers rely on teamwork. We achieve greatness because we are wicked problem-solvers and tenacious in clinching victories in all that we do. It is the team that makes Razer where it is today and will continue to bring Razer to even greater heights.

Orlando, Florida, United States (On-Site)

Shah Alam, Selangor, Malaysia (On-Site)

Shah Alam, Selangor, Malaysia (On-Site)

Shah Alam, Selangor, Malaysia (On-Site)

State Of São Paulo, Brazil (On-Site)

Paramus, New Jersey, United States (On-Site)

Paramus, New Jersey, United States (On-Site)

Irvine, California, United States (On-Site)

Shah Alam, Selangor, Malaysia (On-Site)

View All Jobs

Get notified when new jobs are added by Razer

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug