Senior Site Reliability Engineer

4 Months ago • 5 Years + • DevOps

Job Summary

Job Description

Senior Site Reliability Engineer with 5+ years of experience in cloud and on-prem SRE design and implementation. Must have expertise in infrastructure automation, distributed systems, and cloud platforms such as AWS, Azure, and GCP. Strong knowledge of monitoring, logging, and configuration management is essential.
Must have:
  • Infrastructure Automation
  • Distributed Systems
  • Cloud Platforms
  • Monitoring Concepts
Good to have:
  • Containerization Tech
  • Network Experience
  • Elasticsearch
  • Prometheus
Perks:
  • Global IT Team
  • Fast-Paced Environment

Job Details

Responsibilities:

About Tencent Overseas IT:
Tencent Overseas IT has the mission to empower Tencent’s rapid global growth with future-ready, global IT platforms, applications, and services. We are chartered to lead the Overseas IT strategy, architecture, roadmap, and execution. Satisfying our internal/external customers and becoming a world-class global IT team are our top aspirations.


We are seeking a Sr. Site Reliability Engineer with extensive cloud and on-prem SRE design and implementation experience.

Duties and Responsibilities:
This senior role will work closely with our internal IT and cloud providers to design the best global SRE architecture and solutions in the cloud. The role will also support the studio’s infrastructure and game publishing infrastructure and their evolution to the cloud. Our customers include internal or acquired gaming studios, game publishing services, innovative offices/workplaces, various business groups, and external customers. The work scope includes understanding internal customers’ business requirements, collecting technical requirements, developing reference architectures and prototypes based on industry best practices, leading implementation and deployment for global locations, and troubleshooting issues when necessary.

For this SRE job, you will:
• Design, implement, and support the operation and reliability of large-scale, cloud-enabled studios, with a focus on performance at scale, real-time monitoring, logging, analysis, and alerting.
• Maintain services once they go live by measuring and monitoring availability, latency, and overall system health.
• Design and develop robust and scalable products and tools to enhance operational efficiency.
• Scale systems sustainably through mechanisms like automation and evolve systems by pushing for changes that improve reliability and velocity.
• Participate in incident response and troubleshooting efforts to minimize downtime and ensure system reliability.
• Maintain project and product documentation and knowledge.
• Be part of an on-call rotation to support production systems (if needed).


Based in Shanghai, China, this person will work closely with the global IT team and HQ teams.

Whom we are looking for:

  • A quick learner
  • A positive, self-motivated, and passionate person
  • Independent, persistent, and open-minded
  • A great team player who is both dependable and autonomous
  • Customer-oriented and able to work at a very fast pace

Requirements:


  • 5+ years of experience with infrastructure automation and distributed systems design, including designing and developing tools for running large-scale private or public cloud systems in production
  • In-depth knowledge and understanding of monitoring concepts, alert mechanisms, log monitoring, anomaly detection, and the creation and setup of dashboards
  • In-depth knowledge of and experience with Elasticsearch and Prometheus
  • Expertise in configuration management with a framework such as Ansible, Terraform, or Helm
  • Proficiency with programming languages like Python, Golang, and shell scripting to automate tasks
  • Passion for infrastructure and monitoring as code
  • Bachelor’s degree (or higher) in Computer Science, Mathematics, or a related science or engineering major
  • Solid understanding of cloud platforms (e.g., AWS, Azure, GCP) and containerization technologies (e.g., Docker, Kubernetes).
  • Good understanding of and hands-on experience with networking is a plus
  • Bilingual preferred (English, Chinese)





About The Company

Tencent is a world-leading internet and technology company that develops innovative products and services to improve the quality of life of people around the world.


Founded in 1998 with its headquarters in Shenzhen, China, Tencent's guiding principle is to use technology for good. Our communication and social services connect more than one billion people around the world, helping them to keep in touch with friends and family, access transportation, pay for daily necessities, and even be entertained.


Tencent also publishes some of the world's most popular video games and other high-quality digital content, enriching interactive entertainment experiences for people around the globe.


Tencent also offers a range of services such as cloud computing, advertising, FinTech, and other enterprise services to support our clients' digital transformation and business growth.


Tencent has been listed on the Stock Exchange of Hong Kong since 2004.

