Outscal Logooutscal logo

Senior Software Reliability Engineer (Production Health)

1 Day ago • 5 Years + • Frontend Development • DevOps • Backend Development

Job Summary

Job Description

Canva seeks a Senior Software Reliability Engineer (Production Health) to design and implement processes, tools, and automation improving service reliability. Responsibilities involve collaborating with engineering teams to ensure best practices, fostering a reliability-first culture, investigating production incidents, and researching solutions for Canva's distributed cloud infrastructure. This role requires advanced coding proficiency (Python/Java/GoLang), experience with complex web applications, and a strong understanding of observability principles. The successful candidate will guide others in incident management, have disciplined coding practices, and demonstrate excellent communication skills. The role is open to remote work across ANZ.
Must have:
  • Advanced coding (Python/Java/GoLang)
  • 5+ years experience with complex web apps
  • Full-stack troubleshooting expertise
  • Observability principles understanding
  • Incident review & remediation guidance
  • Strong communication & collaboration
Good to have:
  • Java 13 experience
  • Microservice architecture experience (AWS)
  • Experience with Snowflake, Mode Analytics, Looker
Perks:
  • Equity packages
  • Inclusive parental leave
  • Annual Vibe & Thrive allowance
  • Flexible leave options

Job Details

Job Description

Join the team redefining how the world experiences design.

Hey, g'day, mabuhay, kia ora, 你好, hallo, vítejte!

Thanks for stopping by. We know job hunting can be a little time consuming and you're probably keen to find out what's on offer, so we'll get straight to the point.

Where and how you can work

Our flagship campus is in Sydney. We also have a campus in Melbourne and co-working spaces in Brisbane, Perth and Adelaide. But you have choice in where and how you work, we trust our Canvanauts to choose the balance that empowers them and their team to achieve their goals.

What you’d be doing in this role

As Canva scales change continues to be part of our DNA. But we like to think that's all part of the fun. So this will give you the flavour of the type of things you'll be working on when you start, but this will likely evolve.

At the moment, this role is focused on:

  • Designing and implementing processes, tools, automation, and libraries that service teams can use to improve the reliability of the services they own. For instance, adding a new long-awaited feature in our circuit breaker library.
  • Working with product engineering teams to ensure reliability best practices and tools are rolled out in every service across the whole organization. It’s not enough to create a new throttling library; we want to make sure it’s successfully used in every service.
  • Fostering a culture within the Engineering org that puts reliability first and establishes processes and policies that drive reliability within product engineering teams. This includes things like SLAs, error budgets, on-call response, incident resolution, and observability best practices.
  • A deep investigation into production incidents followed up by applying the learning to code. 
  • Researching, developing, and justifying the best choices in the form of design docs for tools and processes that will shape the future of reliability at Canva.
  • Proposing new approaches and solutions to ensure we future-proof Canva’s distributed cloud infrastructure as we scale.
  • Participating in design meetings, hiring interviews, and code reviews.

You're probably a match if

  • You have advanced coding proficiency in Python/ Java/ GoLang and strong Object Oriented Programming fundamentals
  • You have five-plus (5+) years of commercial experience working with developing complex, distributed web applications.
  • You have experience diagnosing and addressing issues across the “full stack”, including front-end code, backend, network / infrastructure and data layer
  • You have solid understanding of observability principles, such as metrics, logs, tracing, synthetic testing, query construction, dashboarding and alerting.
  • You have experience with guiding others in the principles of incident review, investigation and remedial activity.
  • You have disciplined coding practices, experience with code reviews and pull requests, and a creative and conceptual problem-solving approach.
  • You have strong communication and team collaboration skills, both written and verbal. As a reliability engineer, you will need to share the knowledge, communicate and coordinate changes across multiple service teams.

Nice to have; Not required!

  • Our services and libraries are primarily written in Java 13, so experience in Java is a nice to have. Our platform and infrastructure tooling is primarily written in Python, Go and Terraform.
  • Experience working with microservice architectures in large containerised, distributed cloud environments (ideally AWS). We’re hosted on AWS and leverage the tools they provide as much as possible
  • Experience working with data warehouse, analytics and reporting tools such as Snowflake, Mode Analytics and Looker.

About the Group

The Reliability Platform Group is responsible for providing the tools and processes to scale reliability across all Canva services. Our teams work together, and with other groups, to deliver preventive and detective tooling, processes and best practices that uplift Canva’s reliability. We do this by driving operational excellence, reducing the impact of incidents, and providing visibility and accountability across the broader Engineering community.

This role sits within the Production Health team, whose focus is on providing tools and guidance for Canva’s engineering teams to measure and maintain their systems’ reliability. Their key areas of practice include on-call management, service-level management, production readiness and operational review.

What's in it for you?

Achieving our crazy big goals motivates us to work hard - and we do - but you'll experience lots of moments of magic, connectivity and fun woven throughout life at Canva, too. We also offer a range of benefits to set you up for every success in and outside of work.

Here's a taste of what's on offer:

  • Equity packages - we want our success to be yours too
  • Inclusive parental leave policy that supports all parents & carers
  • An annual Vibe & Thrive allowance to support your wellbeing, social connection, office setup & more
  • Flexible leave options that empower you to be a force for good, take time to recharge and supports you personally

Check out lifeatcanva.com for more info.

Other stuff to know

We make hiring decisions based on your experience, skills and passion, as well as how you can enhance Canva and our culture. When you apply, please tell us the pronouns you use and any reasonable adjustments you may need during the interview process.

We celebrate all types of skills and backgrounds at Canva so even if you don’t feel like your skills quite match what’s listed above - we still want to hear from you!

Please note that interviews are conducted virtually.

Similar Jobs

ByteDance - Network Data Operations Engineer

ByteDance

Singapore (On-Site)
4 Months ago
PwC - IN-Senior Associate_ JAVA_Utility Transformation _Advisory_Kolkata

PwC

Kolkata, West Bengal, India (On-Site)
3 Months ago
ByteDance - Software Researcher/Engineer - Applied Research Center (Infrastructure+AI)

ByteDance

Seattle, Washington, United States (On-Site)
4 Months ago
ION - Smalltalk Developer - 708

ION

India (On-Site)
5 Months ago
The Walt Disney Company - Lead Software Engineer (Identity)

The Walt Disney Company

San Francisco, California, United States (On-Site)
4 Months ago
Scale AI - Senior Software Engineer

Scale AI

Argentina (On-Site)
5 Months ago
Rocket Science - Software Engineer - UI

Rocket Science

Wales, United Kingdom (Hybrid)
6 Days ago
Animoca Brands - Frontend Developer

Animoca Brands

Philippines (Remote)
6 Months ago
Aristocrat Gaming - Game Developer

Aristocrat Gaming

Warsaw, Masovian Voivodeship, Poland (Hybrid)
3 Weeks ago
Canva - Senior Frontend Engineer - Canva for Education

Canva

Surry Hills, New South Wales, Australia (Remote)
1 Week ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

The Walt Disney Company - Oracle Retail Expert F/H/NB - CDI

The Walt Disney Company

Île-de-France, France (On-Site)
4 Months ago
Google - Software Engineer III, Full Stack, Google Cloud

Google

Hyderabad, Telangana, India (On-Site)
4 Months ago
N-iX - Senior Java Developer

N-iX

Poland (Remote)
2 Days ago
Skillz - Lead Cloud Engineer

Skillz

San Francisco, California, United States (On-Site)
6 Days ago
ByteDance - Senior Software Development Engineer - Cloud Native Databases

ByteDance

San Jose, California, United States (On-Site)
2 Months ago
Electronic Arts - Technical Artist

Electronic Arts

Kuala Lumpur, Wilayah Persekutuan Kuala Lumpur, Malaysia (On-Site)
2 Months ago
Google - Software Engineer, PhD, Early Career, Campus, Systems and Infrastructure, 2025 Start

Google

Mountain View, California, United States (On-Site)
4 Months ago
Moon Active - Data Platform Engineer

Moon Active

Tel Aviv-Yafo, Tel Aviv District, Israel (Hybrid)
18 Hours ago
ComeOn Group - Frontend Developer

ComeOn Group

Silesian Voivodeship, Poland (Hybrid)
1 Week ago
ByteDance - Backend Software Engineer - Global E-Commerce Supply Chain

ByteDance

San Jose, California, United States (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Sydney, New South Wales, Australia

Altagram Group - Localization Engineer –  APAC Region (m/f/d) - Video Game Localization

Altagram Group

Australia (Remote)
4 Months ago
Salesforce - Enterprise Account Executive Marketing Cloud

Salesforce

Sydney, New South Wales, Australia (On-Site)
5 Months ago
Canva - Head of AI Research

Canva

Sydney, New South Wales, Australia (Remote)
2 Months ago
Immutable - Senior Software Engineer (Passport)

Immutable

Australia (Hybrid)
3 Months ago
Entain - Compliance Analyst (GRC)

Entain

Queensland, Australia (Hybrid)
1 Week ago
Canva - Senior Frontend Engineer - Frontend Core Libraries

Canva

Sydney, New South Wales, Australia (Hybrid)
4 Months ago
Canva - Backend Engineer (Java), Media Platform - Global Content and Discovery

Canva

Surry Hills, New South Wales, Australia (Remote)
1 Day ago
Canva - Senior Software Engineer - Cloud Security & Compliance, remote across ANZ

Canva

Sydney, New South Wales, Australia (Remote)
3 Months ago
Easygo - Senior DevOps Engineer

Easygo

Melbourne, Victoria, Australia (On-Site)
1 Month ago
Entain - Senior QA Engineer I Quality Champion

Entain

Australia (Remote)
1 Day ago

Get notifed when new similar jobs are uploaded

Frontend Development Jobs

Ubisoft - UI Programmer

Ubisoft

Shanghai, Shanghai, China (On-Site)
1 Day ago
Hedra - Frontend Engineer

Hedra

San Francisco, California, United States (On-Site)
1 Day ago
Progress - Senior Software Engineer

Progress

Sofia, Sofia City Province, Bulgaria (Hybrid)
4 Months ago
The Walt Disney Company - Lead Software Engineer (Roku Engineer)

The Walt Disney Company

Bristol, Connecticut, United States (On-Site)
4 Months ago
Epic Games - UI Engineering Director

Epic Games

(On-Site)
2 Days ago
Canva - Senior Machine Learning Engineer - Ecosystem Experiences

Canva

Sydney, New South Wales, Australia (Remote)
1 Week ago
The Walt Disney Company - Sr Software Engineer (webOS/Tizen)

The Walt Disney Company

Charlotte, North Carolina, United States (On-Site)
4 Months ago
Ajmera Infotech - Senior React Developer

Ajmera Infotech

Bengaluru, Karnataka, India (On-Site)
8 Months ago
Booming games - Game Developer - Javascript / HTML5 (f/m/x)

Booming games

Hamburg, Hamburg, Germany (Hybrid)
5 Months ago
The Walt Disney Company - Sr Software Engineer (webOS/Tizen)

The Walt Disney Company

Bristol, Connecticut, United States (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Surry Hills, New South Wales, Australia (Remote)

Sydney, New South Wales, Australia (Remote)

Sydney, New South Wales, Australia (Remote)

Surry Hills, New South Wales, Australia (Remote)

Surry Hills, New South Wales, Australia (Remote)

Sydney, New South Wales, Australia (Remote)

Surry Hills, New South Wales, Australia (Remote)

Sydney, New South Wales, Australia (Remote)

View All Jobs

Get notified when new jobs are added by Canva

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug