Senior Site Reliability Engineer

2 months ago • 3+ years • DevOps

Job Summary

Job Description

As a Senior Site Reliability Engineer (SRE) at Razer, you will join a team focused on building scalable, observable, and resilient systems within an AWS environment. Your responsibilities will include designing and maintaining Infrastructure as Code (IaC) solutions using tools like Terraform and CloudFormation. You will collaborate with various teams to ensure secure and reliable cloud infrastructure. You will also implement monitoring and alerting systems, drive incident response, and provide mentorship to junior engineers. The role involves automating tasks, troubleshooting complex issues, and participating in on-call support, with a shift from 5:00 PM to 2:00 AM (UTC+8).
Must have:
  • 3+ years experience in SRE or related roles.
  • Expertise with AWS Cloud Services.
  • Deep understanding of IaC using Terraform/CloudFormation.
  • Proficiency in a scripting language (Python, etc.).
  • Experience operating and troubleshooting Linux/Windows environments.
  • Experience with monitoring, alerting, and incident management.
  • Experience with Zero Downtime Deployments.
  • Strong understanding of Disaster Recovery (DR).

Job Details

Joining Razer will place you on a global mission to revolutionize the way the world games. Razer is a place to do great work, offering you the opportunity to make a global impact while working with a team spread across 5 continents. Razer is also a great place to work, providing the unique, gamer-centric #LifeAtRazer experience that will accelerate your growth, both personally and professionally.

Job Responsibilities:

We are seeking a skilled and driven Senior Site Reliability Engineer (SRE) to join our growing infrastructure and platform engineering team. The ideal candidate will have hands-on experience in Amazon Web Services (AWS), strong troubleshooting capabilities, and a passion for building scalable, observable, and resilient systems using modern Infrastructure as Code (IaC) and automation tools.

REQUIREMENTS:

  • Bachelor’s degree in Computer Science, Software Engineering, Information Technology, or a related field.
  • Minimum 3 years of experience in SRE, DevOps, cloud infrastructure, or system administration roles.
  • Hands-on expertise with AWS Cloud Services, including:
      • Compute & Containerization: EC2, Lambda, ECS, EKS, Auto Scaling
      • Networking: Load Balancers, VPC, Route 53, Security Groups, Firewalls
      • Storage & Databases: RDS, ElastiCache, Athena, S3
      • Messaging: SQS, SES
  • Deep understanding of Infrastructure as Code (IaC) tools such as Terraform and CloudFormation.
  • Proficiency in at least one programming/scripting language: Python, Node.js, Bash, Ruby, or related.
  • Experience operating and troubleshooting across Linux, Windows, and container-based environments.
  • Strong understanding of distributed systems, cloud networking (routers, switches), firewalls, DNS, and HTTP/TLS.
  • Experience implementing monitoring and alerting systems and working with incident management processes.
  • Experience with zero-downtime deployments, such as blue/green or canary deployments.
  • Familiarity with cost optimization and right-sizing AWS resources.
  • Exposure to multi-region, multi-account AWS architecture.
  • Understanding of API gateways or edge networking (e.g., Akamai, CloudFront).

JOB DESCRIPTION:

  • Design, implement, and maintain Infrastructure as Code (IaC) solutions using Terraform and/or CloudFormation across multi-account AWS environments.
  • Collaborate with developers, architects, and DevOps teams to build scalable, secure, and observable cloud infrastructure.
  • Lead and participate in architecture design sessions, focusing on system reliability, scalability, security, and performance.
  • Implement and manage robust monitoring, alerting, and observability solutions (e.g., CloudWatch, Prometheus, ELK, Datadog).
  • Set and monitor Key Performance Indicators (KPIs) for system uptime, latency, throughput, and overall reliability.
  • Drive incident response processes, including coordination, triaging, resolution, documentation, and post-incident reviews (PIRs).
  • Supervise and mentor junior SREs and infrastructure engineers, fostering knowledge-sharing and team growth.
  • Collaborate across development, operations, and security teams to ensure secure and compliant deployments.
  • Automate manual tasks and workflows through scripting and tooling (Python, Node.js, Bash, Ruby, JSON/YAML).
  • Troubleshoot complex infrastructure issues across Linux, Windows, Docker, and cloud-native environments.
  • Provide IaC and CI/CD best practices to ensure repeatability, scalability, and compliance across all environments.
  • Provide on-call support, participate in incident rotations, and lead technical investigations during outages or degradations.
  • Apply a strong understanding of, and experience with, Disaster Recovery (DR).
  • Work the 5:00 PM to 2:00 AM (UTC+8) shift to ensure continuous SRE coverage.
  • Undergo an initial familiarization period during regular working hours before transitioning to the designated shift.
  • Provide support and resolution for assigned incidents and tickets.


 

Are you game?

About The Company

At Razer, you'll be at the forefront of the most exciting industry in the world: gaming. Evolving forms of gaming require evolving forms of hardware, software and services. That’s where Razer comes in, offering innovative top-of-the-line products and services that allow gamers to fully immerse themselves in the ultimate gaming experience.

Getting onboard Razer will place you on a global mission to bring gamers closer to the games they love. Razer is a place to do great work, offering you the opportunity to be a part of a global team across 11 countries. Whether you are a hardcore evangelist who breathes life into the latest and greatest gaming gear or a behind-the-scenes hero who runs our global operations, you are assured of a career-changing quest that transcends time zones and cultures with one single spell: For Gamers. By Gamers.

The journey towards phenomenal-ness won’t come easy. However, we will excel because gamers rely on teamwork. We achieve greatness because we are wicked problem-solvers and tenacious in clinching victories in all that we do. It is the team that has brought Razer to where it is today and will continue to take Razer to even greater heights.
