Site Reliability Engineer

1 Month ago • 2 Years + • Devops

Job Summary

Job Description

The Site Reliability Engineer (SRE) will join the infrastructure and platform engineering team. The role involves designing, developing, and maintaining Infrastructure as Code (IaC) using tools like Terraform or AWS CloudFormation. Responsibilities include implementing and operating reliable, scalable cloud infrastructure on AWS, leading architecture reviews, developing monitoring and alerting solutions, performing incident management, collaborating with engineering teams on CI/CD pipelines, automating infrastructure operations, maintaining environments, ensuring security compliance, and providing on-call support. The SRE will monitor and maintain service-level objectives (SLOs) and will be required to work a shift from 5:00 PM to 2:00 AM (UTC+8).
Must have:
  • Bachelor’s degree in related field.
  • Minimum 2 years of experience in related roles.
  • Hands-on experience with AWS Cloud services.
  • Proficient in Infrastructure as Code (Terraform, CloudFormation).
  • Experience with CI/CD tools (GitLab CI, Jenkins, etc.).
  • Strong understanding of Linux/Windows system administration.
  • Comfortable with scripting languages (Python, Bash, etc.).
  • Strong grasp of network fundamentals (DNS, HTTP, etc.).
  • Familiar with observability tools and best practices.
Good to have:
  • Experience with containerization and orchestration (Docker, ECS, or Kubernetes is a plus).

Job Details

Joining Razer will place you on a global mission to revolutionize the way the world games. Razer is a place to do great work, offering you the opportunity to make an impact globally while working across a global team located across 5 continents. Razer is also a great place to work, providing you the unique, gamer-centric #LifeAtRazer experience that will put you in an accelerated growth, both personally and professionally.

Job Responsibilities :

We are seeking a skilled and driven Site Reliability Engineer (SRE) to join our growing infrastructure and platform engineering team. The ideal candidate will have hands-on experience in Amazon Web Services (AWS), strong troubleshooting capabilities, and a passion for building scalable, observable, and resilient systems using modern Infrastructure as Code (IaC) and automation tools.

REQUIREMENTS:

  • Bachelor’s degree in Computer Science, Software Engineering, Information Technology, or a related field.
  • Minimum 2 years of experience in SRE, DevOps, Cloud Infrastructure, or Systems Administration roles.
  • Solid hands-on experience with AWS Cloud services including (but not limited to):
  • Compute: EC2, Lambda, ECS, Auto Scaling
  • Networking: VPC, Load Balancers, Route 53
  • Messaging & Storage: SQS, S3, RDS, ElastiCache, SES
  • Monitoring: CloudWatch, X-Ray
  • Proficient in Infrastructure as Code using Terraform and/or CloudFormation.
  • Experience with CI/CD tools (e.g., GitLab CI, Jenkins, CodePipeline, ArgoCD).
  • Strong understanding of Linux and Windows system administration and troubleshooting.
  • Comfortable with one or more scripting/programming languages such as Python, Node.js, Bash, Ruby, or JSON/YAML for automation.
  • Strong grasp of network fundamentals, including DNS, HTTP(S), TLS/SSL, firewalls, and TCP/IP.
  • Experience with containerization and orchestration (Docker, ECS, or Kubernetes is a plus).
  • Familiar with observability tools and incident management best practices.

JOB DESCRIPTION:

  • Design, develop, and maintain Infrastructure as Code (IaC) using tools like Terraform or AWS CloudFormation.
  • Implement and operate reliable, scalable cloud infrastructure primarily on AWS (e.g., EC2, ECS, RDS, S3, Lambda, ElastiCache, SQS, SES, Auto Scaling, Load Balancers).
  • Lead and participate in architecture reviews focusing on reliability, scalability, security, and performance.
  • Develop and manage robust monitoring, alerting, and logging solutions (e.g., CloudWatch, Prometheus, Grafana, ELK, etc.) to detect and resolve issues proactively.
  • Perform incident management, postmortems, root cause analysis, and implement continuous improvement strategies.
  • Collaborate with software engineering teams to improve CI/CD pipelines, deployment automation, and release management.
  • Automate infrastructure operations, reduce manual toil, and improve reliability using scripting (Python, Bash, Node.js, or Ruby).
  • Maintain and troubleshoot environments involving web servers, databases, firewalls, DNS, load balancers, and networking.
  • Ensure systems are compliant with security standards, including patching, hardening, and secure access policies.
  • Provide on-call support, participate in incident rotations.
  • Monitor and maintain service-level objectives (SLOs), SLAs, and error budgets to ensure reliability targets are  met.
  • Support from 5:00PM to 2:00AM (UTC+8) shift to ensure continuous of SRE coverage.
  • Undergo initial familiarization period during regular working hours before transitioning to the designated shift.
  • Provide support and solution handling to incident and tickets assigned.

Pre-Requisites :

Are you game?

Similar Jobs

Sandbox VR - Store Manager

Sandbox VR

Kirkland, Washington, United States (On-Site)
3 Years ago
whoop - Staff Electrical Engineer (NPI)

whoop

Boston, Massachusetts, United States (On-Site)
2 Months ago
VOID Interactive - First Person Animator

VOID Interactive

Ireland (Remote)
3 Months ago
Trend Micro - Fullstack Development Engineer

Trend Micro

Manila, Metro Manila, Philippines (On-Site)
16 Years ago
Palo Alto Networks - Incident Commander

Palo Alto Networks

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Capgemini - Automation Engineer

Capgemini

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Aptive - Software Architect

Aptive

Monterrey, Nuevo Leon, Mexico (On-Site)
1 Month ago
bytedance - Solutions Architect

bytedance

Riyadh, Riyadh Province, Saudi Arabia (On-Site)
3 Months ago
Google - Senior Staff Software Engineer, Infrastructure, Google Cloud

Google

Bengaluru, Karnataka, India (On-Site)
2 Months ago
Loft Orbital - Space Infrastructure Software Engineer

Loft Orbital

San Francisco, California, United States (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

gitlab - Associate Support Engineer (AMER)

gitlab

(Remote)
1 Month ago
CME Group - Staff Network Engineer

CME Group

Belfast, Northern Ireland, United Kingdom (Hybrid)
2 Weeks ago
Scout - Staff Engineer, Functional Safety

Scout

Novi, Michigan, United States (On-Site)
1 Month ago
Canva - Staff Frontend Engineer - Apps API Platform

Canva

Brisbane, Queensland, Australia (Remote)
3 Months ago
Anthology  Inc  - Senior Software Engineer in Support I (Senior Tier-3 Engineer)

Anthology Inc

Bogota, Colombia (Remote)
2 Months ago
Interactive Brokers - Associate - Client Services

Interactive Brokers

Singapore (Hybrid)
1 Month ago
NCS Soft - Mobile Senior QA Tester

NCS Soft

Irvine, California, United States (On-Site)
1 Month ago
Trellix - Senior SDET

Trellix

Bengaluru, Karnataka, India (Hybrid)
3 Weeks ago
Toku - Payroll Operations Specialist

Toku

Mumbai, Maharashtra, India (Remote)
4 Months ago
Synechron - Evaluation Engineer

Synechron

Charlotte, North Carolina, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Kuala Lumpur, Federal Territory of Kuala Lumpur, Malaysia

luxsoft - Developer/Engineer (Control-M, AS400, Mainframe)

luxsoft

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (On-Site)
2 Weeks ago
Luxoft - Senior Software Support Engineer

Luxoft

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (Remote)
7 Months ago
Western Digital - Internship - IT

Western Digital

Bayan Lepas, Penang, Malaysia (On-Site)
2 Weeks ago
Electronic Arts - Junior Software Engineer - Full Stack

Electronic Arts

Kuala Lumpur, Wilayah Persekutuan Kuala Lumpur, Malaysia (On-Site)
3 Weeks ago
London stock Exchange - Lead Research Analyst (Arabic)

London stock Exchange

Penang, Malaysia (Hybrid)
2 Weeks ago
luxsoft - Murex Project Manager

luxsoft

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (On-Site)
1 Month ago
NinjaVan - Junior Executive, Retail (Operations)

NinjaVan

Kuching, Sarawak, Malaysia (On-Site)
3 Weeks ago
NinjaVan - Executive, Retail (Business Development)

NinjaVan

Johor Bahru, Johor, Malaysia (On-Site)
1 Month ago
Coda - Senior/Staff Software Engineer

Coda

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (Hybrid)
1 Year ago
bytedance - Partnership Manager - BytePlus

bytedance

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Devops Jobs

Visa - Sr. Site Reliability Engineer - ServiceNow

Visa

Ashburn, Virginia, United States (Hybrid)
3 Weeks ago
GoDaddy - Full Stack Software Engineer - AWS

GoDaddy

Serbia (Remote)
1 Month ago
Arkose Labs - Principal Software Architect

Arkose Labs

San Mateo, California, United States (Remote)
2 Months ago
Saviynt - Senior Solutions Engineer

Saviynt

Singapore (Hybrid)
3 Weeks ago
Cadence - Software Security Architect

Cadence

San Jose, California, United States (On-Site)
1 Month ago
Virtusa - Cloud DevOps Lead

Virtusa

Andhra Pradesh, India (On-Site)
8 Months ago
Next Level Business Services - Site reliability engineer -SMTP Service Management (Full) Time

Next Level Business Services

Redmond, Washington, United States (On-Site)
8 Months ago
London stock Exchange - Automation Cloud Engineer

London stock Exchange

St. Louis, Missouri, United States (On-Site)
1 Month ago
Epic Games - Automation Engineer

Epic Games

Cary, North Carolina, United States (On-Site)
3 Months ago
bytedance - Senior Software Engineer - Compute Infrastructure (Orchestration & Scheduling)

bytedance

San Jose, California, United States (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

About The Company

At Razer, you'll be at the forefront of the most exciting industry in the world — gaming. Evolving forms of gaming require evolving forms of hardware, software and services. That’s where Razer comes in, offering innovative top-of-the-line products and services to allow gamers to fully immerse in the ultimate gaming experience.Getting onboard Razer will place you on a global mission to bring gamers closer to the games they love. Razer is a place to do great work, offering you the opportunity to be a part of a global team across 11 countries. Whether you are a hardcore evangelist who breathe life to the latest and greatest gaming gear or a behind-the-scene hero who runs our global operations, you are assured of a career-changing quest that transcends time zones and culture with one single spell: For Gamers. By Gamers.The journey towards phenomenal-ness won’t come easy. However, we will excel because gamers rely on teamwork. We achieve greatness because we are wicked problem-solvers and tenacious in clinching victories in all that we do. It is the team that makes Razer where it is today and will continue to bring Razer to even greater heights.

Orlando, Florida, United States (On-Site)

Shah Alam, Selangor, Malaysia (On-Site)

Shah Alam, Selangor, Malaysia (On-Site)

Shah Alam, Selangor, Malaysia (On-Site)

State Of São Paulo, Brazil (On-Site)

Paramus, New Jersey, United States (On-Site)

Paramus, New Jersey, United States (On-Site)

Irvine, California, United States (On-Site)

Shah Alam, Selangor, Malaysia (On-Site)

View All Jobs

Get notified when new jobs are added by Razer

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug