Senior Site Reliability Engineer

7 Months ago • 5 Years + • DevOps

Job Summary

Job Description

Senior Site Reliability Engineer with 5+ years of experience in cloud and on-prem SRE design and implementation. Must have expertise in infrastructure automation, distributed systems, and cloud platforms such as AWS, Azure, and GCP. Strong knowledge of monitoring, logging, and configuration management is essential.
Must have:
  • Infrastructure Automation
  • Distributed Systems
  • Cloud Platforms
  • Monitoring Concepts
Good to have:
  • Containerization Tech
  • Network Experience
  • Elasticsearch
  • Prometheus
Perks:
  • Global IT Team
  • Fast-Paced Environment

Job Details

Responsibilities:

About Tencent Overseas IT:
Tencent Overseas IT has the mission to empower Tencent’s rapid global growth with future-ready, global IT platforms, applications, and services. We are chartered to lead the Overseas IT strategy, architecture, roadmap, and execution. Satisfying our internal/external customers and becoming a world-class global IT team are our top aspirations.


We are seeking a Sr. Site Reliability Engineer with extensive cloud and on-prem SRE design and implementation experience.

Duties and Responsibilities:
This senior role will work closely with our internal IT teams and cloud providers to design the best global SRE architecture and solutions in the cloud. The role will also support studio infrastructure and game publishing infrastructure, along with their evolution to the cloud. Our customers include internal or acquired gaming studios, game publishing services, innovative offices/workplaces, various business groups, and external customers. The work scope includes understanding internal customers' business requirements, collecting technical requirements, developing reference architectures and prototypes based on industry best practices, leading implementation and deployment across global locations, and troubleshooting issues when necessary.

For this SRE job, you will:
• Design, implement, and support the operations and reliability of large-scale cloud-enabled studios, with a focus on performance at scale and on real-time monitoring, logging, analysis, and alerting.
• Maintain services once they go live by measuring and monitoring availability, latency, and overall system health.
• Design and develop robust and scalable products and tools to enhance operational efficiency.
• Scale systems sustainably through mechanisms like automation and evolve systems by pushing for changes that improve reliability and velocity.
• Participate in incident response and troubleshooting efforts to minimize downtime and ensure system reliability.
• Maintain project and product documentation and knowledge.
• Be part of an on-call rotation to support production systems (if needed).


Based in Shanghai, China, this person will work closely with the global IT team and HQ teams.

Whom we are looking for:

  • A quick learner
  • A positive, self-motivated, and passionate person
  • Independent, persistent, and open-minded
  • A great team player who is both dependable and autonomous
  • Customer-oriented and able to work at a very fast pace

Requirements:

  • 5+ years of experience with infrastructure automation and distributed systems design, including designing and developing tools for running large-scale private or public cloud systems in production
  • In-depth knowledge and understanding of monitoring concepts, alerting mechanisms, log monitoring, anomaly detection, and the creation and setup of dashboards
  • In-depth knowledge of and experience with Elasticsearch and Prometheus
  • Expertise in configuration management with tools such as Ansible, Terraform, or Helm
  • Proficiency with programming languages like Python, Golang, and shell scripting to automate tasks
  • Passion for infrastructure and monitoring as code
  • Bachelor’s degree (or higher) in Computer Science, Mathematics, or a related science or engineering field
  • Solid understanding of cloud platforms (e.g., AWS, Azure, GCP) and containerization technologies (e.g., Docker, Kubernetes).
  • Good understanding of and hands-on experience with networking is a plus
  • Bilingual preferred (English, Chinese)

Similar Jobs

Progres - Senior Full-Stack Developer

Progres

Sofia, Sofia City Province, Bulgaria (Hybrid)
3 Months ago
Supercell - Senior Server Engineer

Supercell

Helsinki, Uusimaa, Finland (On-Site)
6 Months ago
Google - Bluetooth Firmware Engineer

Google

New Taipei, New Taipei City, Taiwan (On-Site)
2 Weeks ago
Offworld - DevOps Engineer

Offworld

New Westminster, British Columbia, Canada (On-Site)
1 Month ago
pay2dc - Data Engineer (AWS stack)

pay2dc

Gurugram, India (On-Site)
1 Day ago
ION - Microsoft System Engineer, Italy

ION

Italy (Hybrid)
6 Months ago
The Walt Disney Company - Database Engineer II

The Walt Disney Company

Bristol, Connecticut, United States (On-Site)
2 Weeks ago
Luxoft - Senior Software Support Engineer

Luxoft

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (Remote)
5 Months ago
Scanline VFX - Release DevOps Engineer

Scanline VFX

Montreal, Quebec, Canada (Hybrid)
4 Weeks ago
NVIDIA - Senior DevOps Engineer - Accelerated Computing

NVIDIA

Westford, Massachusetts, United States (Hybrid)
1 Week ago

Similar Skill Jobs

ByteDance - Software Development Engineer - Cloud Native Databases

ByteDance

Seattle, Washington, United States (On-Site)
3 Months ago
NVIDIA - Senior Functional Test Engineer

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
3 Months ago
ByteDance - Network Data Operations Engineer

ByteDance

Singapore (On-Site)
6 Months ago
Luxoft - Orchestrade - Azure infrastructure cloud Regular engineer

Luxoft

Poland, Ohio, United States (Remote)
5 Months ago
Fractal - DevOps - Lead

Fractal

Mumbai, Maharashtra, India (On-Site)
5 Months ago
Google - Technical Solutions Engineer, Infrastructure Compute

Google

Bengaluru, Karnataka, India (On-Site)
2 Weeks ago
NVIDIA - Senior Verification Engineer

NVIDIA

Hsinchu, Hsinchu City, Taiwan (On-Site)
1 Month ago
Beta Craft - Full Stack Developer

Beta Craft

Pune, Maharashtra, India (On-Site)
1 Month ago
ION - Principal Technical Consultant - Endur

ION

Berlin, Berlin, Germany (On-Site)
6 Months ago
ByteDance - Security Operation Engineer, Security Assurance

ByteDance

Singapore (On-Site)
2 Months ago

Jobs in Shanghai, Shanghai, China

Haleon - Innovation Marketing Manager

Haleon

Shanghai, China (On-Site)
20 Hours ago
TiMi Studio Group - Senior 3D Character Designer

TiMi Studio Group

Shenzhen, Guangdong Province, China (On-Site)
1 Week ago
Ourpalm - Card Game Operator

Ourpalm

Guangzhou, Guangdong Province, China (On-Site)
2 Weeks ago
TiMi Studio Group - Client Development Engineer for 3A Stylized Realistic Shooting Game

TiMi Studio Group

Shenzhen, Guangdong Province, China (On-Site)
3 Weeks ago
Kaiying Network - Senior 3D Level Designer

Kaiying Network

Shanghai, China (On-Site)
2 Weeks ago
Tencent - NIKKE Korean-Language Project Manager

Tencent

Shenzhen, Guangdong Province, China (On-Site)
4 Months ago
Epic Games - Senior FX Artist

Epic Games

Shanghai, Shanghai, China (On-Site)
3 Months ago
NVIDIA - Senior SRE Software Engineer, Storage and Data

NVIDIA

Shanghai, Shanghai, China (On-Site)
3 Months ago
NVIDIA - Senior Solutions Architect, Omniverse Platform

NVIDIA

Beijing, Beijing, China (On-Site)
1 Month ago
Yodo1 - Finance Intern, Chinese Speaking

Yodo1

Beijing, Beijing, China (Remote)
10 Months ago

DevOps Jobs

Canva - Senior Software Engineer -Cloud Platform- - Remote across ANZ

Canva

Sydney, New South Wales, Australia (Remote)
5 Months ago
Ubisoft - Storage Architect

Ubisoft

Montreal, Quebec, Canada (On-Site)
4 Months ago
Google - Systems Development Engineer III

Google

Reston, Virginia, United States (On-Site)
2 Weeks ago
N-iX - Lead DevOps Engineer

N-iX

Ukraine (Remote)
2 Weeks ago
The Walt Disney Company - Senior Pipeline Engineer

The Walt Disney Company

Glendale, California, United States (On-Site)
1 Month ago
Brillio - Enterprise Architect, Azure - R01535036

Brillio

Bengaluru, Karnataka, India (Hybrid)
6 Months ago
N-iX - Senior DevOps Engineer

N-iX

Argentina (Remote)
1 Month ago
Google - Customer Engineer, Data Analytics, ISV, Google Cloud

Google

San Francisco, California, United States (On-Site)
2 Days ago
Egnyte - DevOps Engineer

Egnyte

India (Remote)
2 Months ago
PwC - ETIC, Cloud Solution Architect (Multi-Cloud, DevOps Focus) - Senior Manager

PwC

Cairo, Cairo Governorate, Egypt (On-Site)
6 Months ago

About The Company

Tencent is a world-leading internet and technology company that develops innovative products and services to improve the quality of life of people around the world.


Founded in 1998 with its headquarters in Shenzhen, China, Tencent's guiding principle is to use technology for good. Our communication and social services connect more than one billion people around the world, helping them to keep in touch with friends and family, access transportation, pay for daily necessities, and even be entertained.


Tencent also publishes some of the world's most popular video games and other high-quality digital content, enriching interactive entertainment experiences for people around the globe.


Tencent also offers a range of services such as cloud computing, advertising, FinTech, and other enterprise services to support our clients' digital transformation and business growth.


Tencent has been listed on the Stock Exchange of Hong Kong since 2004.
