Site Reliability Engineer

NetEase Games

Job Summary

Site Reliability Engineering (SRE) at NetEase Interactive Entertainment involves using software engineering to manage systems, automate operations, and improve service availability for games like Eggy Party and Marvel Rivals, as well as internal projects. Responsibilities include designing runtime environments, monitoring operational metrics, collaborating with product teams for technical optimization, and researching cutting-edge open-source technologies to develop business solutions. The role focuses on providing high-quality, efficient operational services at controllable costs, ensuring superior gaming experiences for a thriving global community.

Must Have

  • Manage the operational work of NetEase Interactive Entertainment services, such as Eggy Party, Marvel Rivals, UU Accelerator, Ace Racer, and other online services, as well as internal research projects
  • Design and select basic runtime environments for game servers based on different games' service architecture, performance requirements, and business conditions, providing high-quality and efficient operational services at controllable costs
  • Establish and monitor various operational metrics and customize data analysis standards
  • Collaborate with product departments to identify issues, optimize technical architecture, and enhance user experience based on game and infrastructure conditions
  • Participate in in-depth research on cutting-edge open-source software, virtualization, databases, and web services, and develop technical solutions for business implementation
  • Bachelor's degree or above, majors in computer science, networking, communications, automation, or related fields
  • Familiar with the Linux operating system
  • Knowledgeable about computer network architectures and common network protocols such as TCP/IP and HTTP
  • Proficient in at least one programming language, including but not limited to C/C++, Shell, Python, Golang, Rust, or Java
  • Strong logical thinking, communication, and learning abilities; adept at research and problem-solving
  • Skilled at teamwork, with a strong sense of collective honor, responsibility, and service awareness
  • Proficiency in Chinese is required for this role

Good to Have

  • Passionate about open-source
  • Experience or knowledge in open-source software such as Linux, Nginx, MySQL, K8S, and Istio
  • Open to trying new things, with excellent problem-solving skills and strong technical sensitivity
  • Experience in contributing to open-source communities

Job Description

Job Description:

  • Site Reliability Engineering (SRE) refers to using software engineering methods to manage systems, solve problems, and achieve operational automation to reduce trivial tasks and improve service availability. Responsibilities include but are not limited to:
  • Manage the operational work of NetEase Interactive Entertainment services, such as Eggy Party, Marvel Rivals, UU Accelerator, Ace Racer, and other online services, as well as internal research projects.
  • Design and select basic runtime environments (including servers, virtualization, cloud services, networks, databases, etc.) for game servers based on different games' service architecture, performance requirements, and business conditions, providing high-quality and efficient operational services at controllable costs.
  • Establish and monitor various operational metrics and customize data analysis standards.
  • Collaborate with product departments to identify issues, optimize technical architecture, and enhance user experience based on game and infrastructure conditions.
  • Participate in in-depth research on cutting-edge open-source software, virtualization, databases, and web services, and develop technical solutions for business implementation.

Job Requirements:

  • Bachelor's degree or above, majors in computer science, networking, communications, automation, or related fields are preferred.
  • Familiar with the Linux operating system; knowledgeable about computer network architectures and common network protocols such as TCP/IP and HTTP.
  • Proficient in at least one programming language, including but not limited to C/C++, Shell, Python, Golang, Rust, or Java.
  • Passionate about open-source; experience or knowledge in open-source software such as Linux, Nginx, MySQL, K8S, and Istio is preferred.
  • Strong logical thinking, communication, and learning abilities; adept at research and problem-solving.
  • Skilled at teamwork, with a strong sense of collective honor, responsibility, and service awareness.
  • Open to trying new things, with excellent problem-solving skills and strong technical sensitivity; experience in contributing to open-source communities is a plus.
  • Proficiency in Chinese is required for this role, as daily communication and collaboration with key stakeholders and team members based in China are essential to the responsibilities of the position.

15 Skills Required For This Role

Team Management Problem Solving Data Analytics Cpp Game Texts Mysql User Experience Ux Networking Nginx Linux Rust Marvel Python Shell Java

Similar Jobs