Site Reliability Engineer

6 Months ago • 4 Years +
Devops

Job Description

Argus Labs seeks a Site Reliability Engineer (SRE) to enhance user experience for both development and end-users. Responsibilities include designing and building operational infrastructure, spearheading company-wide security, owning backend infrastructure delivery and scalability, and collaborating with the engineering team to ensure reliable products. The ideal candidate will have 4+ years of experience managing software deployment pipelines in a production cloud environment, proficiency in Go, JavaScript, Python, or other object-oriented languages, and strong scripting skills with Bash. The company utilizes World Engine, a high-performance blockchain designed for games.
Good To Have:
  • Game development and game server hosting experience
  • Buildkite experience
  • Secrets management (HashiCorp Vault, Infisical)
  • KMS or equivalent for cryptographic key security
  • Layer 3 network optimization (Cloudflare)
  • Blockchain infrastructure deployment and maintenance
Must Have:
  • 4+ years managing software deployment pipelines
  • Proficient in Go, JavaScript, Python or similar
  • Strong Bash scripting skills
  • Experience with Infrastructure-as-Code (Terraform, Pulumi)
  • CI/CD expertise (GitHub Actions)
  • Production Kubernetes management (Helm)
  • Database infrastructure management
  • Excellent communication and time management
Perks:
  • Flexible PTO
  • 100% employer-covered medical, dental, and vision insurance
  • 401k
  • Up to $1500 desk set-up stipend
  • Company retreats
  • No crunch

Add these skills to join the top 1% applicants for this job

team-management
timeline-management
communication
github
user-experience-ux
smart-contracts
terraform
helm
ci-cd
kubernetes
python
github-actions
bash
javascript
multiplayer

Argus Labs is building the next generation of massively multiplayer online (MMO) games by empowering players with the extensive freedom to build, extend, and influence the game worlds they inhabit. Our approach is centered around World Engine, our state-of-the-art onchain game server framework.

World Engine leverages a novel sharded rollup blockchain architecture, which allows games to use smart contracts for user-generated content (UGC) while scaling to tens of thousands of concurrent users without compromising performance. To date, World Engine is the most performant blockchain designed from the ground up for games and has been used in production for games like Dark Frontier, processing 700K+ player transactions within a week.

Backed by the best

We raised $10 million in seed funding led by Haun Ventures ($1.5B crypto fund led by former a16z GP, Katie Haun) with participation from influential angel investors in tech and gaming such as Elad Gil and Balaji Srinivasan (ex-Coinbase CTO).

Learn about who we are and our technology

Responsibilities

  • Work closely with stakeholders company-wide to provide services that enhance the user experience for the development team, as well as our end-users.

  • Design and build operational infrastructure to support games, automating where possible.

  • Spearhead company-wide security culture and architecture to keep our platform secure.

  • Own delivery, scalability, and reliability of our backend infrastructure.

  • Advise and collaborate with the rest of the engineering team to ensure we are building safe, secure, and reliable products.

Requirements

  • Canada-based candidates only – Must reside in and have the legal right to work in Canada.

  • 4+ years of experience managing software deployment pipelines in a production cloud environment.

  • Proficient in Go, JavaScript, Python, or other object-oriented programming languages.

  • Strong scripting skills with Bash.

  • Hands-on experience with writing and maintaining complex Infrastructure-as-Code (Terraform, Pulumi, etc.).

  • Expertise in CI/CD – Building and maintaining performant pipelines using GitHub Actions.

  • Production Kubernetes management – Deployment best practices with Helm, etc.

  • Database infrastructure management – Setup, maintenance, and migration coordination.

  • Excellent communication and time management skills.

  • Ability to design and implement highly available, reliable systems.

Nice to have

  • Experience in game development and game server hosting, ensuring high-performance and scalable infrastructure.

  • Hands-on expertise with Buildkite for CI/CD automation and pipeline optimization.

  • Knowledge of secrets management systems like HashiCorp Vault, Infisical, and similar tools to safeguard sensitive data.

  • Experience in securing cryptographic keys using KMS or equivalent technologies to enhance security protocols.

  • Proficiency in Layer 3 network optimization, including geo-based routing and traffic management with Cloudflare.

  • Familiarity with deploying and maintaining blockchain infrastructure, such as full nodes, validator nodes, and other blockchain-related services.

Perks & benefits

For full-time employees

  • A note for the game industry veterans: no crunch :-)

  • Flexible PTO (2 weeks required) + holidays

  • 100% employer-covered medical, dental, and vision insurance (US)

  • 401k (US)

  • Up to $1500 desk set-up stipend

  • Company retreats

We’re an equal-opportunity employer. All applicants will be considered for employment without attention to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran, or disability status.

Set alerts for more jobs like Site Reliability Engineer
Set alerts for new jobs by Argus
Set alerts for new Devops jobs in Canada
Set alerts for new jobs in Canada
Set alerts for Devops (Remote) jobs

Contact Us
hello@outscal.com
Made in INDIA 💛💙