Manager, Site Reliability Engineer
Never forget games
Job Summary
Wildlife is looking for a Site Reliability Engineer Manager to join the Cloud Platform team. This role focuses on providing easy-to-use, highly available systems to engineers, improving infrastructure services through automation, and contributing to technical and business decisions for new services. The manager will also foster team career growth and retention. The ideal candidate is curious, proactive, and a problem-solver, thriving in a fast-growing environment.
Must Have
- Experience managing small teams with infrastructure background
- Coding experience in Go or Python
- University degree in computing or equivalent experience
- Solid understanding of computer concepts
- Experience with cloud computing services (AWS, Google Cloud, Azure)
- Experience with Infrastructure as Code (Terraform, Packer, Ansible, Crossplane)
- Experience managing Kubernetes clusters and developing operators
- Experience automating routine tasks
- Experience with incident management and on-call duties
- Strong written and spoken communication skills in English
- Experience with complex, large-scale, high-available systems
- Experience with monitoring and telemetry
Good to Have
- Player focused, ensuring top-level infrastructure for amazing player experience
- History of projecting and executing automation projects
- Calm and pragmatic in critical situations
- Curious about new technologies and testing new solutions
- Metrics-oriented, making data-driven decisions
- Bar raiser, mentoring peers and spreading knowledge
Job Description
What you'll do
- Be the manager of a cross-functional team, contributing to the team roadmap and growth of its individual contributors;
- Develop, maintain, and optimize infrastructure clusters (e.g., Kubernetes, NATS, ETCD, Postgres, MongoDB, Redis, Elasticsearch), infrastructure services (e.g., Gitlab, Jenkins, Vault, Artifactory, Datadog, Jaeger, etc.), and our APIs and automations to manage them (e.g., Kubernetes Operators, Infrastructure as code, Pipelines, CLIs,);
- Analyze costs of infrastructure services and help define and optimize the budget of our infrastructure and game teams;
- Contribute to improvements on monitoring and observability patterns for infrastructure services;
- Troubleshoot, manage and lead incidents in production;
- Manage and improve the tools and processes related to infrastructure management across the company (Infrastructure-as-code standards, CI/CD design, build of our Internal Developer Platform, etc.);
- Help partner teams to architect and scale their applications and infrastructure with cloud-native best practices.
What you'll need
We expect our Managers to be Technical, dedicating around 50% of their time to working together with the ICs in their day-to-day work and being an active voice and participative on the team technical roadmap.
- Experience managing small teams with infrastructure background;
- Some level of leadership skills, including the areas of people management, communications, project management, talent development, performance management, team effectiveness, agility, hiring, decision making, planning, budgeting, and collaboration;
- Coding experience in at least one programming language. We work mostly with Go and Python;
- University degree in courses related to computing such as Computer Engineering, Computer Science, Information Systems, and Systems Analysis and Development or equivalent Market Experience;
- Solid understanding of computer concepts (operational systems, networking, concurrency, memory management, and algorithm analysis);
- Experience with cloud computing services such as Amazon AWS, Google Cloud, or Microsoft Azure;
- Experience with Infrastructure as Code automations, such as Terraform, Packer, Ansible, Crossplane, etc;
- Experience managing Kubernetes clusters and developing Kubernetes operators;
- Experience automating routine tasks, such as deployments and monitoring setup;
- Experience with incident management and being oncall for productive systems and workloads;
- Strong written and spoken communication skills in English;
- Experience with complex, large-scale, and high-available systems;
- Experience with monitoring and telemetry in applications and infrastructure;
- History of technical leadership and ownership of critical projects, including the mentoring of junior team members.
More about you
- Player focused. We are player-oriented, and infrastructure has a great impact on their experience. You have empathy with our players and focus on ensuring they have an amazing experience. You aim for a top-level infrastructure, guaranteeing the highest availability possible.
- Automation is key to scaling. We look for engineers who have a history of projecting and executing automation projects in order to get rid of any manual and repetitive tasks.
- Calm and pragmatism. When everything seems to be falling apart around you, you have a plan and keep calm.
- Bleeding edge. You are curious and like to study new technologies, test new solutions, and measure the impact brought by changes. We want to ensure we are using the best stack possible.
- Metrics-oriented. We make decisions based on data and metrics. We measure the results of our tasks against the expected outcome. And we ensure our work has delivered the correct impact on our customers. We believe in ownership and in shipping features end to end.
- Bar raiser. You want to elevate your team skills and raise the bar, by mentoring your peers, spreading knowledge, being proactive and a tech lead.
About Wildlife
Wildlife is one of the leading mobile game developers and publishers in the world. We have released more than 60 titles, reaching billions of people around the globe. Here, we create games that will excite, intrigue, and engage our players for years to come!
Equal Opportunity
Wildlife is proud to be an Equal Opportunity and Affirmative Action employer. We do not discriminate based upon race, color, national origin, gender, gender identity, sexual orientation, protected veteran status, disability, age, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state, and local law.
We're committed to providing accommodations for candidates with disabilities in our recruiting process.