Your Mission
The Activision Infrastructure team is looking for a Cloud Engineer to join our team in our offices in Vancouver, Canada. This team supports a variety of cloud workloads, and you will be instrumental in setting standards and supporting key technologies. You will help drive efficiencies for both cost and performance within Azure and the Google Cloud Platform.
We work on building a solid Cloud Platform made up of Kubernetes and a CI/CD pipeline utilizing GitHub, Artifactory, ArgoCD, Vault, and Grafana LGTM. We help internal development teams build successfully in the Cloud by providing templates and guidance for Helm, Crossplane, and security compliance.
Onsite:
This role is an onsite work position, and the home studio for this role is Vancouver, BC.
What you bring to the table
The video game industry, and therefore our business, is fast-paced and will continue to evolve. As such, the duties and responsibilities of this role may be changed as directed by the Company at any time to promote and support our business and relationships with industry partners. Therefore, this role includes, but is not limited to, the following responsibilities:
- Build a solid Cloud Platform for our internal development teams
- Architect and maintain our Artifact Repository in Azure, and its migration from Google Cloud
- Implement and maintain our Observability (Logging, Visualization/Grafana, Metrics) platform in Azure, and its migration from Google Cloud
- Utilize AI to improve the accessibility and usability of Monitoring data
- Implement monitoring of systems and services that identifies service disruptions, and optimize performance and resource utilization
- Identify, diagnose, and resolve technical issues efficiently in a live production environment: drive quick resolutions, help with root cause diagnosis, and suggest improvements to prevent similar issues
- Participate in an on-call support rotation
- Consider security, sustainability, and supportability concerns with all work
Player Profile
Minimum Requirements
Experience
- At least 5 years of experience with at least one scripting language (e.g., Python/Golang)
- Experience designing and managing developer artifacts and workflows using JFrog Artifactory
- Experience engineering large-scale metrics systems (e.g., Thanos, Mimir, VictoriaMetrics)
- Proficient in monitoring tools like Grafana LGTM stack and Prometheus
- Experience with container orchestration systems such as Kubernetes
- Adept with logging tools like ELK, Loki, and Cloud Logging
- Experience with CI/CD platforms like GitHub Actions and ArgoCD
- Proficient in managing reproducible infrastructure, e.g., Infrastructure as Code
Knowledge & Skills
- Good working knowledge of source code control systems such as GitHub
- Solid knowledge of public cloud architecture concepts and practice (GCP, AWS, Azure)
- Knowledge of system and/or application monitoring
Key Attributes
- Clear communication, and the ability to use it to support developers
- Problem-solving skills and the ability to search for and implement appropriate solutions
Extra Points
Experience
- Experience with Web servers/reverse proxies such as nginx and haproxy
- Adept with alerting tools such as PagerDuty
- Familiarity with AI LLMs and using their APIs
- Experience with work/defect tracking and wiki systems such as Jira/Confluence
Knowledge & Skills
- Knowledge of HashiCorp products like Vault, Consul, Terraform or similar products
- Clear understanding of Networking concepts (e.g., Firewalls, VPC, VPN, DNS, etc.)
- Knowledge of the use and maintenance of continuous integration and continuous deployment (CI/CD) systems