Staff Machine Learning Engineer - Dataset & Training Platform

1 Month ago • 5 Years + • Devops

Job Summary

Job Description

Canva is seeking a Staff Machine Learning Engineer to join their Dataset & Training Platform team. This role focuses on architecting foundational AI Platform capabilities, making key technical decisions for model training and deployment. The engineer will lead cross-team initiatives to improve dataset and training capabilities, building and maintaining high-performance distributed data processing and training systems. Responsibilities include driving technical strategy, mentoring other engineers, and solving complex ML stack challenges. The role involves collaborating with stakeholders across engineering and research teams to address diverse technical requirements and influence engineering practices.
Must have:
  • 5+ years of experience building and scaling ML training systems
  • Hands-on experience in distributed training
  • Model lifecycle management experience
  • Large-scale data processing experience
  • Designing and implementing foundational AI/ML infrastructure
  • Setting technical direction and making architectural decisions
  • Strong understanding of distributed computing
  • Experience with Kubernetes
  • Experience with AWS
  • Fluent in Python
  • Deep knowledge of ML frameworks (PyTorch, TensorFlow)
  • Knowledge of modern tools (W&B, Ray, Anyscale)
  • Strong understanding of generative AI systems (LLMs, multimodal models)
  • Ability to balance long-term investments with business needs
  • Experience working with product teams and researchers
  • Track record of growing engineers
  • Experience with infrastructure-as-code
  • Understanding of performance optimization
Good to have:
  • GitOps principles for automation and deployment
Perks:
  • Equity packages
  • Inclusive parental leave policy
  • Annual Vibe & Thrive allowance
  • Flexible leave options

Job Details

Join the team redefining how the world experiences design.

Hey, g'day, mabuhay, kia ora, 你好, hallo, vítejte!
Thanks for stopping by. We know job hunting can be a little time-consuming and you're probably keen to find out what's on offer, so we'll get straight to the point.

Where and how you can work

Our flagship campus is in Sydney. We also have a campus in Melbourne and co-working spaces in Brisbane, Perth and Adelaide. But you have choice in where and how you work — we trust our Canvanauts to choose the balance that empowers them and their team to achieve their goals.

What you'd be doing in this role

As Canva scales, change continues to be part of our DNA. But we like to think that's all part of the fun. So this will give you the flavour of the type of things you'll be working on when you start, but this will likely evolve.

At the moment, this role is focused on:

  • Architecting foundational AI Platform capabilities, making key technical decisions that impact how models are trained and deployed across Canva.
  • Leading cross-team initiatives to consolidate and improve dataset and training capabilities, working with stakeholders across multiple engineering and research teams.
  • Building and maintaining paved roads for high-performance distributed data processing and training systems, optimising for cost efficiency and developer experience.
  • Driving technical strategy discussions, weighing platform stability needs against product velocity requirements.
  • Mentoring other engineers and contributing to growing Canva's AI platform engineering capabilities.
  • Solving complex technical challenges spanning multiple parts of the ML stack.

You're probably a match if:

  • You have over 5 years of experience building and scaling ML training systems, with hands-on experience in distributed training, model lifecycle management, and large-scale data processing.
  • You have a proven track record of designing and implementing foundational AI/ML infrastructure that supports multiple teams and use cases.
  • You have experience setting technical direction, making architectural decisions, and influencing engineering practices across organisations.
  • You possess a strong understanding of distributed computing, container orchestration (Kubernetes), and cloud infrastructure (preferably AWS).
  • You are fluent in Python with deep knowledge of ML frameworks (PyTorch, TensorFlow) and modern tools (W&B, Ray, Anyscale).
  • You have a strong understanding of generative AI systems, including LLMs, multimodal models, and foundation model fine-tuning.
  • You have the ability to balance long-term platform investments with immediate business needs, making pragmatic technical decisions.
  • You have experience working with product teams, research, and other engineering specialties to understand and address diverse technical requirements.
  • You have a track record of growing other engineers and contributing to technical culture and standards.
  • You have experience with infrastructure-as-code and an understanding of performance optimisation.
  • You consider GitOps principles for automation and deployment a plus.

About the team

Canva's GenAI Platform Group is responsible for the delivery of Capabilities and Solutions which support ML and AI initiatives, from early ideation and prototyping, through to scaling to meet the needs of millions of Canva users in production. We empower thousands of engineers and product managers to deliver amazing product features which harness the power of cutting-edge technologies. 

The Dataset & Training Platform team focuses specifically on the foundational Capabilities that power model training, dataset management, and AI/ML workflows. We're building the infrastructure that enables Canva's AI-first future, supporting everything from generative design models to intelligent automation systems that serve millions of users worldwide.

What's in it for you?

Achieving our crazy big goals motivates us to work hard — and we do — but you'll experience lots of moments of magic, connectivity and fun woven throughout life at Canva, too. We also offer a range of benefits to set you up for every success in and outside of work.

Here’s a taste of what’s on offer:
• Equity packages — we want our success to be yours too
• Inclusive parental leave policy that supports all parents & carers
• An annual Vibe & Thrive allowance to support your wellbeing, social connection, office setup & more
• Flexible leave options that empower you to be a force for good, take time to recharge and support you personally

Check out lifeatcanva.com for more info.

Other stuff to know

We make hiring decisions based on your experience, skills and passion, as well as how you can enhance Canva and our culture. When you apply, please tell us the pronouns you use and any reasonable adjustments you may need during the interview process.

We celebrate all types of skills and backgrounds at Canva — so even if you don’t feel like your skills quite match what’s listed above — we still want to hear from you!

We see AI as a powerful amplifier of creativity and technology at Canva. We’re evolving how we assess AI skills in our Technology hiring experience - you’ll tackle interactive, real-time challenges that reflect the kind of work we do. In some interviews, you may also be asked to solve a problem using an AI tool to show how you approach challenges with tech by your side. Your recruitment partner will walk you through what to expect.

Please note that interviews are conducted virtually.

 
