Staff Platform Engineer, MLOps

1 Month ago • 7-7 Years • DevOps • $130,000 PA - $160,000 PA

Job Summary

Job Description

As a Staff Platform Engineer (MLOps), you'll design, deploy, and maintain cloud infrastructure for Inworld's AI Engine and Studio. Responsibilities include optimizing the ML model lifecycle using the Inworld AI platform and Nvidia CUDA, implementing CI/CD systems for ML workflows, monitoring models, designing MLOps tools, and facilitating a 'you build it, you run it' culture. You will manage CI/CD pipelines, identify opportunities to enhance engineering speed, conduct root cause analysis, and develop best practices for automation. The role requires expertise in Kubernetes, Terraform/Terragrunt, and at least one major cloud provider.
Must have:
  • 7+ years software engineering experience
  • 5+ years Infrastructure-as-code experience
  • Kubernetes & Helm/Kustomize proficiency
  • CI/CD pipeline creation & maintenance
  • Cloud provider expertise (GCP, Azure, Oracle)
  • Golang, Python, or Bash proficiency
Good to have:
  • Open-source LLM & serving solution familiarity
  • SLURM experience
  • Data pipeline & workflow management tools experience
  • Bare metal GPU experience
Perks:
  • Equity
  • Benefits

Job Details

view open roles

Why Join Inworld

Inworld is the leading provider of AI technology for real-time interactive experiences, with a $500 million valuation and backing from top tier investors including Intel Capital, Microsoft’s M12 fund, Lightspeed Venture Partners, Section 32, BITKRAFT Ventures, Kleiner Perkins, Founders Fund, and First Spark Ventures.

Inworld provides the market’s best framework for building production ready interactive experiences, coupled with dedicated services to optimize specific stages of development – from design and development, to ML pipeline optimization and custom compute infrastructure. We help developers bring their AI engines in-house with a framework optimized for real-time data ingestion, low latency, and massive scale. Inworld powers experiences built by Ubisoft, NVIDIA, Niantic, NetEase Games and LG, among others, and has partnerships with key industry players such as Microsoft Xbox, Epic Games, and Unity. 

Inworld was recognized by CB Insights as one of the 100 most promising AI companies in the world in 2024 and was named among LinkedIn's Top Startups of 2024 in the USA.

About the Role:

As a Staff Platform Engineer (MLOps), you'll work closely with backend and ML Engineering teams to design, deploy, and maintain reliable, high-performance, and secure cloud infrastructure for our AI Engine and Studio. 

 

What you'll do:

  • Develop, manage, and optimize the ML model lifecycle in production using the Inworld AI platform and Nvidia CUDA, implementing CI/CD systems for ML workflows, monitoring models to identify issues and inefficiencies, and designing MLOps tools and frameworks to enhance automation and efficiency.
  • Facilitate a "you build it, you run it" culture by providing the necessary tools and processes for monitoring the reliability, availability, and performance of services.
  • Manage CI/CD pipelines to ensure smooth and efficient code integration and deployment.
  • Identify and implement opportunities to enhance engineering speed and efficiency.
  • Conduct root cause analysis to identify critical issues and develop automated solutions to prevent recurrence.
  • Develop and share best practices to improve automation and efficiency across our engineering teams.

 

Expected experience:

  • 7 years of experience in software engineering.
  • 5 years of experience with infrastructure-as-code.
  • Proficiency in managing Kubernetes clusters and applications, including creating Helm charts/Kustomize manifests for new applications.
  • Experience in creating and maintaining CI/CD pipelines for both applications and infrastructure deployments (using tools like Terraform/Terragrunt, ArgoCD, GitHub Actions, Ansible, etc.).
  • Deep knowledge of at least one major cloud provider (Google Cloud Platform, Microsoft Azure, Oracle Cloud).
  • Proficient in at least one backend programming/scripting languages such as Golang, Python, and Bash.
  • Familiarity with open source LLM and open source serving solution (e.g. vLLM or llama.cpp, kserve, etc) is a plus.
  • Experience with SLURM
  • Experience with data pipeline and workflow management tools
  • Experience with bare metal GPUs (optional).

 

The base salary range for this full-time position is CAD $170,000 - $220,000. In addition to base pay, total compensation includes equity and benefits. Within the range, individual pay is determined by work location, level, and additional factors, including competencies, experience, and business needs. The base pay range is subject to change and may be modified in the future.

Inworld Jobs Privacy

Similar Jobs

Falcon X - Senior Software Engineer - DevOps

Falcon X

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Zitga gaming studio - Backend Developer

Zitga gaming studio

Hanoi, Vietnam (On-Site)
3 Weeks ago
Adyen - Senior Linux Infrastructure Engineer

Adyen

Amsterdam, North Holland, Netherlands (On-Site)
1 Week ago
Better ME - Manual QA Engineer (Backend)

Better ME

Kyiv, Kyiv City, Ukraine (Remote)
2 Weeks ago
sinch  - Senior DevOps Engineer

sinch

Noida, Uttar Pradesh, India (Hybrid)
6 Days ago
Google - Site Reliability Manager, Platforms and Devices, SRE

Google

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Britive - SOFTWARE ENGINEER (CLOUD)

Britive

Bengaluru, Karnataka, India (Remote)
6 Months ago
Match Group - Senior Platform Engineer

Match Group

New York, New York, United States (Hybrid)
7 Months ago
Omnissa - Engineering Manager (C++, Linux/Windows/MacOS internals)

Omnissa

Bengaluru, Karnataka, India (Hybrid)
6 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

SoundCloud - Senior Machine Learning Engineer

SoundCloud

Berlin, Berlin, Germany (On-Site)
2 Weeks ago
RoofStack - Software Architect

RoofStack

Istanbul, İstanbul, Türkiye (On-Site)
2 Months ago
Trend Micro - Sr. Software Developer

Trend Micro

Ottawa, Ontario, Canada (On-Site)
3 Days ago
Crowd Strick - Threat Detection Engineer

Crowd Strick

Tel Aviv-Yafo, Tel Aviv District, Israel (Remote)
4 Days ago
Visa - Staff Systems Engineer - Splunk Administrator - PRE

Visa

Austin, Texas, United States (Hybrid)
7 Months ago
cirrus logic - DevOps Engineer – CI/CD & Software Automation

cirrus logic

Austin, Texas, United States (Hybrid)
1 Month ago
Gaijin Entertainment - Senior Linux Administrator

Gaijin Entertainment

(Remote)
1 Month ago
Jumio - SDE III - (Core)

Jumio

Bengaluru, Karnataka, India (On-Site)
2 Weeks ago
Netflix - Solutions Support Engineer (L5) - Delivery

Netflix

Poland (Remote)
1 Month ago
playrix  - Senior Release Engineer

playrix

Ukraine (Remote)
7 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Vancouver, British Columbia, Canada

Ubisoft - Senior Rigger

Ubisoft

Montreal, Quebec, Canada (Hybrid)
2 Months ago
N-ix - VP of Data Consulting

N-ix

Canada (Remote)
3 Months ago
Bally's Interactive - Senior Data Developer

Bally's Interactive

Toronto, Ontario, Canada (Hybrid)
1 Month ago
WildBrain - Surfacing Tech Artist (Unreal Experienced)

WildBrain

Vancouver, British Columbia, Canada (Hybrid)
1 Month ago
PlayStation Global - Senior UI Programmer

PlayStation Global

Montreal, Quebec, Canada (On-Site)
1 Month ago
Addepar - Sr. Product Designer - Platform Architecture

Addepar

Canada (Remote)
1 Month ago
GoDaddy - Senior Software Engineer - Front End

GoDaddy

British Columbia, Canada (Remote)
1 Week ago
Ansys - Senior R&D Engineer

Ansys

Waterloo, Ontario, Canada (Remote)
2 Weeks ago
nova quark - Senior Product Manager - EA Sports FC

nova quark

Vancouver, British Columbia, Canada (Hybrid)
3 Weeks ago
Rockstar Games - HR Manager

Rockstar Games

Toronto, Ontario, Canada (On-Site)
3 Weeks ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

Omnissa - Senior Member of Technical Staff (C++ Windows)

Omnissa

Chennai, Tamil Nadu, India (On-Site)
7 Months ago
Anthology  Inc  - DevOps (SRE) Engineer

Anthology Inc

Brno, South Moravian Region, Czechia (On-Site)
7 Months ago
warner bros games - Staff Software Engineer - Cloud Support and Operations

warner bros games

Bengaluru, Karnataka, India (Hybrid)
1 Month ago
Rackspace Technology - Senior GCP Cloud Engineer

Rackspace Technology

United States (Remote)
2 Months ago
InMobiInMobi - SDE III - Devops

InMobiInMobi

Bengaluru, Karnataka, India (On-Site)
8 Months ago
Electronic Arts - Senior Software Engineer .NET, Game Creation

Electronic Arts

Orlando, Florida, United States (On-Site)
2 Months ago
DraftKings - Lead Software Engineer

DraftKings

Sofia, Sofia City Province, Bulgaria (Hybrid)
6 Months ago
Microsoft - Senior Build Engineer

Microsoft

Ostergotland, Östergötland County, Sweden (Hybrid)
1 Month ago
velotio technologies  - Senior DevOps Engineer (AWS)

velotio technologies

Pune, Maharashtra, India (Remote)
2 Months ago
Ubisoft - Backend Golang Developer

Ubisoft

Montreal, Quebec, Canada (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Mountain View, California, United States (On-Site)

Vancouver, British Columbia, Canada (On-Site)

Mountain View, California, United States (On-Site)

Mountain View, California, United States (On-Site)

Mountain View, California, United States (On-Site)

Vancouver, British Columbia, Canada (On-Site)

Mountain View, California, United States (On-Site)

Mountain View, California, United States (On-Site)

Mountain View, California, United States (On-Site)

Mountain View, California, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by Inworld AI

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug