Software Reliability Engineer (Observability)

3 Weeks ago • All levels • Frontend Development

Job Summary

Job Description

Canva is seeking a Software Reliability Engineer (Observability) to build and improve its observability platform and tooling. Responsibilities include providing technical leadership, optimizing the tracing platform, improving operational effectiveness, advocating for best practices, and improving the user experience. The ideal candidate will have strong coding skills in Python, Java, or Golang, deep knowledge of computer engineering, experience with AWS and Kubernetes, and familiarity with observability tools like Elasticsearch, Grafana, and Jaeger. The role involves working with a team to maintain tracing libraries and infrastructure, error reporting libraries, and error handling guidelines, ensuring the scalability and reliability of Canva's platform for developers.
Must have:
  • Proficient in Python, Java, or Golang
  • Deep knowledge of Computer Engineering
  • AWS (EC2, EKS, Lambda etc.) experience
  • Kubernetes experience
  • Observability tooling experience (Elasticsearch, Grafana, Jaeger)
  • Experience with highly available distributed systems
Good to have:
  • OpenTelemetry experience
  • Java or TypeScript application code experience
  • Experience building monitoring infrastructure at scale
  • Experience with ClickHouse
  • Experience with data security and PII detection
Perks:
  • Equity packages
  • Inclusive parental leave policy
  • Annual Vibe & Thrive allowance
  • Flexible leave options

Job Details

Job Description

Join the team redefining how the world experiences design.

Hey, g'day, mabuhay, kia ora, 你好, hallo, vítejte!

Thanks for stopping by. We know job hunting can be a little time-consuming and you're probably keen to find out what's on offer, so we'll get straight to the point.

Where and how you can work

Our flagship campus is in Sydney. We also have a campus in Melbourne and co-working spaces in Brisbane, Perth and Adelaide. But you have a choice in where and how you work: we trust our Canvanauts to choose the balance that empowers them and their team to achieve their goals.

What you’d be doing in this role

As Canva scales, change continues to be part of our DNA. But we like to think that's all part of the fun. So this will give you a flavour of the type of things you'll be working on when you start, but this will likely evolve.

At the moment, this role is focused on:

  • Being responsible for building and improving our observability platform and tooling, which is used by all Canva engineers.
  • Providing technical leadership and expertise to drive pragmatic solutions and dive into impactful design decisions.
  • Brainstorming, researching and prototyping to optimize our tracing platform, improve our operational effectiveness and increase reliability.
  • Being proactive in improving the tracing user experience and advocating for best practices.
  • Participating in team ceremonies, knowledge sharing and brainstorming sessions.
  • Becoming an observability champion, evangelising best practices and guiding other Canvanauts in the observability space.
  • Finding ways to improve the use of traces and provide better insights to our engineers.

You're probably a match if

  • You are proficient and happy to code in Python, Java or Golang.
  • You have deep knowledge and understanding of Computer Engineering fundamentals and first principles.
  • You have a solid knowledge of AWS (EC2, EKS, Lambda, SQS, Kinesis, S3) or equivalent.
  • You have experience deploying and running containerized workloads on a platform like Kubernetes.
  • You have experience with observability tooling, with competency in tools like Elasticsearch, Grafana, Sentry, Jaeger or similar.
  • You have experience running highly available and reliable distributed systems, with highly scalable data stores.
  • You are proficient with infrastructure-as-code - we’re a Terraform shop, but strong experience with other IaC tools will do the trick.

Not essential, but helpful experience!

  • You have experience with OpenTelemetry because it underpins a lot of the infrastructure and tooling that the team owns.
  • You have experience writing application code in Java or frontend code in TypeScript, since we also maintain the tracing libraries.
  • You have experience building and running monitoring infrastructure at scale, for example petabyte-scale Elasticsearch clusters or similar databases.
  • You have experience with data handling at scale.
  • You have experience with ClickHouse.
  • You have experience with data security, data obfuscation and PII detection.

About the team

You’ll join The Observability Traces & Exceptions Team, responsible for operational insights inside Canva. Our goal is to provide our development team with world-class tools to view how their services are performing in production. We achieve this by combining industry-leading third-party solutions with our own solutions developed in-house.

We work across the entire stack, maintaining our TypeScript and Java tracing libraries, our tracing infrastructure, error reporting libraries and error handling guidelines, to name just a few. As we scale all of these areas, we require more sophisticated solutions to ensure that Canva developers can continue to grow without compromising on reliability or availability.

What's in it for you?

Achieving our crazy big goals motivates us to work hard - and we do - but you'll experience lots of moments of magic, connectivity and fun woven throughout life at Canva, too. We also offer a range of benefits to set you up for every success in and outside of work.

Here's a taste of what's on offer:

  • Equity packages - we want our success to be yours too
  • Inclusive parental leave policy that supports all parents & carers
  • An annual Vibe & Thrive allowance to support your wellbeing, social connection, office setup & more
  • Flexible leave options that empower you to be a force for good, take time to recharge and support you personally

Check out lifeatcanva.com for more info.

Other stuff to know

We make hiring decisions based on your experience, skills and passion, as well as how you can enhance Canva and our culture. When you apply, please tell us the pronouns you use and any reasonable adjustments you may need during the interview process.

We celebrate all types of skills and backgrounds at Canva, so even if you don’t feel like your skills quite match what’s listed above, we still want to hear from you!

Please note that interviews are conducted virtually.
