Machine Learning Engineer - ML Platform

11 Hours ago • All levels • DevOps

About the job

Job Description

Canva's GenAI Platform Group seeks a Machine Learning Engineer to lead ML platform projects end-to-end. Responsibilities include implementing changes to model server build processes, improving developer experience, collaborating with teams to optimize services, modernizing AWS infrastructure (migrating to EKS), and platformizing model observability. The ideal candidate possesses a proven track record of delivering wide-ranging changes, a passion for engineering excellence, and expertise in integrating and scaling ML within engineering systems. Deep Python expertise and familiarity with AWS services (ECS, EKS, ECR, Kinesis) are highly desirable. The role involves working within the Production ML team, which focuses on enabling the deployment and serving of various teams' models to production environments.
Must have:
  • Proven track record of delivering wide changes
  • Passionate about engineering excellence, systems architecture
  • Background in integrating and scaling ML within engineering systems
  • Excellent collaboration, communication, and stakeholder management skills
  • Deep expertise in Python development
Good to have:
  • Experience with modern development tools (Bazel or Nix)
  • Experience with containerization, Terraform, and GitOps
  • Familiarity with AWS ecosystem: ECS, EKS, ECR, Kinesis

Job Description

Join the team redefining how the world experiences design.

Hey, g'day, mabuhay, kia ora, 你好, hallo, vítejte!

Thanks for stopping by. We know job hunting can be a little time-consuming and you're probably keen to find out what's on offer, so we'll get straight to the point.

Where and how you can work

Our flagship campus is in Sydney. We also have a campus in Melbourne and co-working spaces in Brisbane, Perth and Adelaide. But you have choice in where and how you work, we trust our Canvanauts to choose the balance that empowers them and their team to achieve their goals.

What you’d be doing in this role

As Canva scales change continues to be part of our DNA. But we like to think that's all part of the fun. So this will give you the flavour of the type of things you'll be working on when you start, but this will likely evolve.

At the moment, this role is focused on:

  • Leading ML platform projects end-to-end from ideation to delivery
  • Implementing changes to our build process for model servers, improving developer experience while also ensuring consistency and traceability from dev through to production.
  • Working closely with other teams to understand how we can improve our existing services and platforms, or introduce new ones, in order to optimize their efficiency.
  • Modernizing our underlying AWS serving infrastructure by migrating to EKS, as well as uplifting our account structure to expose new regions and services.
  • Working to “platformize” new (or immature existing) capabilities such as model observability.
  • Continue exploring opportunities for our platform teams to better leverage each other in order to optimize for delivery of both product and R&D teams.

You're probably a match if

  • Proven track record of delivering wide changes and acquiring buy-in from multiple stakeholders.
  • Passionate about engineering excellence, systems architecture, and meeting non-functional requirements.
  • Background in integrating and scaling ML within engineering systems.
  • Understanding of Computer Science/Engineering fundamentals and first principles covering system design, data structures, architecture, and design patterns
  • Excellent collaboration, communication, and stakeholder management skills

It would be preferable if you have:

  • Deep expertise in Python development is essential, with experience in modern development tools such as Bazel or Nix being a plus.
  • Experience with containerization, Terraform, and GitOps principles for automation and deployment is a plus.
  • Familiarity with the AWS ecosystem: ECS, EKS, ECR, Kinesis.


About the team

Canva's GenAI Platform Group is responsible for the delivery of capabilities which support ML and AI initiatives, from early ideation and prototyping, through to scaling to meet the needs of millions of Canva users in production. We empower thousands of engineers and product managers to deliver amazing product features which harness the power of cutting-edge technologies. 

The Production ML team develops tools, services and processes to enable the deployment and serving of various teams’ models to our production environments. The team also owns and drives change in several engineering areas related to our Python and model/server environments.

View Full Job Description

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

Sydney, New South Wales, Australia (Remote)

Sydney, New South Wales, Australia (Remote)

Sydney, New South Wales, Australia (Remote)

Sydney, New South Wales, Australia (Remote)

Sydney, New South Wales, Australia (Hybrid)

New South Wales, Australia (Remote)

San Francisco, California, United States (Remote)

View All Jobs

Get notified when new jobs are added by Canva

Similar Jobs

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Get notifed when new similar jobs are uploaded

Jobs in Sydney, New South Wales, Australia

Get notifed when new similar jobs are uploaded

DevOps Jobs

Get notifed when new similar jobs are uploaded