Senior Site Reliability Engineer

3 Months ago • 5 Years + • DevOps

About the job

Job Description

Senior Site Reliability Engineer with 5+ years of experience in Cloud and on-prem SRE design and implementation. Must have expertise in infrastructure automation, distributed systems, and cloud platforms like AWS, Azure, GCP. Strong knowledge of monitoring, logging, and configuration management is essential.
Must have:
  • Infrastructure Automation
  • Distributed Systems
  • Cloud Platforms
  • Monitoring Concepts
Good to have:
  • Containerization Tech
  • Network Experience
  • Elastic Search
  • Prometheus
Perks:
  • Global IT Team
  • Fast-Paced Environment
Not hearing back from companies?
Unlock the secrets to a successful job application and accelerate your journey to your next opportunity.

Responsibilities:

About Tencent Overseas IT:
Tencent Overseas IT has the mission to empower Tencent’s rapid global growth with future-ready, global IT platforms, applications, and services. We are chartered to lead the Overseas IT strategy, architecture, roadmap, and execution. Satisfying our internal/external customers and becoming a world-class global IT team are our top aspirations.


We are seeking a Sr. Site Reliability Engineer with extensive cloud and on-prem SRE design and implementation experience.

Duties and Responsibilities:
This senior role will closely work with our internal IT and cloud providers to design the best global SRE architecture and solution in the cloud. This role will also support the studio’s infrastructure, game publishing infrastructure and its evolution to the cloud. Our customers include internal or acquired gaming studios, game publishing services, innovative offices/workplaces, various business groups, and external customers. The work scope will include understanding the internal customers’ business requirements, collecting the technical requirements, developing reference architecture and prototypes based on leading industry best practices, leading implementation, and deployment for global locations, as well as issue troubleshooting when necessary.

For this SRE job, you will:
• Design, implement, and support operational and reliability of large-scale Cloud-enabled studio with a focus on performance at scale, real-time monitoring, logging ,analyzing and alerting
• Maintain services once they go live by measuring and monitoring availability, latency, and overall system health.
• Design and develop robust and scalable products and tools to enhance operational efficiency.
• Scale systems sustainably through mechanisms like automation and evolve systems by pushing for changes that improve reliability and velocity.
• Participate in incident response and troubleshooting efforts to minimize downtime and ensure system reliability.
• Maintain project and product documents and knowledge
• Be part of an on-call rotation to support production systems (if needed)


Based in Shanghai, China, this person will work closely with the global IT team, and HQ teams.

Whom we are looking for:

  • A quick learner
  • A positive, self-motivated, and passionate person
  • Independent, insistent, and open-minded.
  • A great team player and both dependable and autonomous.
  • Customer-oriented and could work at a very fast pace.

Requirements:

Requirements

  • 5+ years of experience with Infrastructure automation, distributed systems design, experience with design, develop tools for running large-scale private or public cloud systems in Production
  • In-depth knowledge and understanding of monitoring concepts, alert mechanisms, log monitoring, anomaly detections, creation, and setup of dashboards.
  • In-depth knowledge and experience with Elastic Search, Prometheus
  • Expertise in configuration management with a framework such as Ansible, Terraform, Helm
  • Proficiency with programming languages like Python, Golang, and shell scripting to automate tasks
  • Passion for infrastructure and monitoring as code
  • Bachelor’s degree (or higher), Computer Science, Mathematics, or related science or engineering major
  • Solid understanding of cloud platforms (e.g., AWS, Azure, GCP) and containerization technologies (e.g., Docker, Kubernetes).
  • Good understanding and hands on experience in network is plus
  • Bilingual preferred (English, Chinese)
View Full Job Description

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

Tencent is a world-leading internet and technology company that develops innovative products and services to improve the quality of life of people around the world.


Founded in 1998 with its headquarters in Shenzhen, China, Tencent's guiding principle is to use technology for good. Our communication and social services connect more than one billion people around the world, helping them to keep in touch with friends and family, access transportation, pay for daily necessities, and even be entertained.


Tencent also publishes some of the world's most popular video games and other high-quality digital content, enriching interactive entertainment experiences for people around the globe.


Tencent also offers a range of services such as cloud computing, advertising, FinTech, and other enterprise services to support our clients' digital transformation and business growth.


Tencent has been listed on the Stock Exchange of Hong Kong since 2004.

Palo Alto, California, United States (On-Site)

Osaka, Osaka, Japan (On-Site)

Osaka, Osaka, Japan (On-Site)

Amsterdam, North Holland, Netherlands (On-Site)

Beijing, Beijing, China (On-Site)

Palo Alto, California, United States (On-Site)

Los Angeles, California, United States (On-Site)

Frankfurt, Hessen, Germany (On-Site)

London, England, United Kingdom (On-Site)

View All Jobs

Get notified when new jobs are added by Tencent

Similar Jobs

Rambus - SMTS CAD Engineering

Rambus, India (Hybrid)

Intel Corporation - SoC Front-End Pre-Si Intern

Intel Corporation, (Remote)

IGT - Cloud Operations Engineer II

IGT, United States (On-Site)

Paypal - Lead Principal ML Engineer, AI Solutions

Paypal, United States (On-Site)

Allvue Systems - Sr Manager, Platform Engineer

Allvue Systems, India (On-Site)

Limit Break - Senior Site Reliability Engineer

Limit Break, Japan (On-Site)

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Marvell India - Performance Engineer

Marvell India, India (On-Site)

Rivos - Silicon DFT - Full time

Rivos, Taiwan (Hybrid)

 Sagecor Solutions - Application Developer 2 (NRN - 009)

Sagecor Solutions, United States (On-Site)

Warner Bros Discovery - Sr. Manager, Integrations

Warner Bros Discovery, Mexico (On-Site)

Infineon Technologies - Senior Staff SOC Engineer

Infineon Technologies, India (Hybrid)

Paytm - DBA - Senior DBA

Paytm, India (Hybrid)

Luxoft - Team Lead Devops

Luxoft, Ukraine (Remote)

Get notifed when new similar jobs are uploaded

Jobs in Shanghai, Shanghai, China

Intel Corporation - Cost Engineer

Intel Corporation, China (On-Site)

Virtuos - Prop Artist

Virtuos, China (On-Site)

Ubisoft - Senior Gameplay Programmer

Ubisoft, China (On-Site)

Intel Corporation - DMTM CMOS PI Engineer

Intel Corporation, China (On-Site)

Intel Corporation - Site Security System Specialist

Intel Corporation, China (On-Site)

Maersk Careers - Sales Enablement Manager

Maersk Careers, China (On-Site)

Cision - Senior Analyst

Cision, China (On-Site)

Intel Corporation - Senior NAND Product Development Technologist

Intel Corporation, China (On-Site)

Publicis Groupe - Senior Art Director

Publicis Groupe, China (On_site)

Get notifed when new similar jobs are uploaded

DevOps Jobs

ASSIST Software - Azure DevOps Engineer

ASSIST Software, (Remote)

Guerrilla - SENIOR INFRASTRUCTURE ENGINEER

Guerrilla, Netherlands (On-Site)

Luxoft - Senior Power Apps Developer

Luxoft, India (Remote)

ARHS - Application Engineer/Administrator

ARHS, Netherlands (On-Site)

Siemens Digital Industries Software - Teamcenter Release Manager

Siemens Digital Industries Software, India (Hybrid)

Jade Global - Release Manager

Jade Global, India (On-Site)

KingsIsle Entertainment - Build and Tools Software Engineer

KingsIsle Entertainment, United States (Hybrid)

Steneral Consulting - Principal Cloud Core Infrastructure Engineer

Steneral Consulting, United States (Hybrid)

Get notifed when new similar jobs are uploaded