Staff Platform Engineer, MLOps

1 Hour ago • 7-7 Years • DevOps • $130,000 PA - $160,000 PA

Job Summary

Job Description

As a Staff Platform Engineer (MLOps), you'll design, deploy, and maintain cloud infrastructure for Inworld's AI Engine and Studio. Responsibilities include optimizing the ML model lifecycle using the Inworld AI platform and Nvidia CUDA, implementing CI/CD systems for ML workflows, monitoring models, designing MLOps tools, and facilitating a 'you build it, you run it' culture. You will manage CI/CD pipelines, identify opportunities to enhance engineering speed, conduct root cause analysis, and develop best practices for automation. The role requires expertise in Kubernetes, Terraform/Terragrunt, and at least one major cloud provider.
Must have:
  • 7+ years software engineering experience
  • 5+ years Infrastructure-as-code experience
  • Kubernetes & Helm/Kustomize proficiency
  • CI/CD pipeline creation & maintenance
  • Cloud provider expertise (GCP, Azure, Oracle)
  • Golang, Python, or Bash proficiency
Good to have:
  • Open-source LLM & serving solution familiarity
  • SLURM experience
  • Data pipeline & workflow management tools experience
  • Bare metal GPU experience
Perks:
  • Equity
  • Benefits

Job Details

Why Join Inworld

Inworld is the leading provider of AI technology for real-time interactive experiences, with a $500 million valuation and backing from top-tier investors including Intel Capital, Microsoft’s M12 fund, Lightspeed Venture Partners, Section 32, BITKRAFT Ventures, Kleiner Perkins, Founders Fund, and First Spark Ventures.

Inworld provides the market’s best framework for building production-ready interactive experiences, coupled with dedicated services to optimize specific stages of development, from design and development to ML pipeline optimization and custom compute infrastructure. We help developers bring their AI engines in-house with a framework optimized for real-time data ingestion, low latency, and massive scale. Inworld powers experiences built by Ubisoft, NVIDIA, Niantic, NetEase Games, and LG, among others, and has partnerships with key industry players such as Microsoft Xbox, Epic Games, and Unity.

Inworld was recognized by CB Insights as one of the 100 most promising AI companies in the world in 2024 and was named among LinkedIn's Top Startups of 2024 in the USA.

About the Role:

As a Staff Platform Engineer (MLOps), you'll work closely with backend and ML Engineering teams to design, deploy, and maintain reliable, high-performance, and secure cloud infrastructure for our AI Engine and Studio. 

 

What you'll do:

  • Develop, manage, and optimize the ML model lifecycle in production using the Inworld AI platform and Nvidia CUDA, implementing CI/CD systems for ML workflows, monitoring models to identify issues and inefficiencies, and designing MLOps tools and frameworks to enhance automation and efficiency.
  • Facilitate a "you build it, you run it" culture by providing the necessary tools and processes for monitoring the reliability, availability, and performance of services.
  • Manage CI/CD pipelines to ensure smooth and efficient code integration and deployment.
  • Identify and implement opportunities to enhance engineering speed and efficiency.
  • Conduct root cause analysis to identify critical issues and develop automated solutions to prevent recurrence.
  • Develop and share best practices to improve automation and efficiency across our engineering teams.

 

Expected experience:

  • 7 years of experience in software engineering.
  • 5 years of experience with infrastructure-as-code.
  • Proficiency in managing Kubernetes clusters and applications, including creating Helm charts/Kustomize manifests for new applications.
  • Experience in creating and maintaining CI/CD pipelines for both applications and infrastructure deployments (using tools like Terraform/Terragrunt, ArgoCD, GitHub Actions, Ansible, etc.).
  • Deep knowledge of at least one major cloud provider (Google Cloud Platform, Microsoft Azure, Oracle Cloud).
  • Proficiency in at least one backend programming or scripting language, such as Golang, Python, or Bash.
  • Familiarity with open-source LLMs and open-source serving solutions (e.g., vLLM, llama.cpp, KServe) is a plus.
  • Experience with SLURM is a plus.
  • Experience with data pipeline and workflow management tools is a plus.
  • Experience with bare metal GPUs (optional).

 

The base salary range for this full-time position is CAD $170,000 - $220,000. In addition to base pay, total compensation includes equity and benefits. Within the range, individual pay is determined by work location, level, and additional factors, including competencies, experience, and business needs. The base pay range is subject to change and may be modified in the future.

About The Company

Inworld powers AI-driven gameplay for video games and immersive experiences. 


Whether you’re looking to unlock novel gameplay, create content at scale, improve player immersion, or future-proof your AI infrastructure, Inworld helps you level up your game development with AI.


Inworld has worked with Xbox, Ubisoft, NVIDIA, NetEase Games, Niantic, LG UPlus, Alpine Electronics, and indie game developers to create AI-driven experiences. We’re backed by top-tier investors including Section 32, Intel Capital, Microsoft’s M12 fund, BITKRAFT Ventures, Kleiner Perkins, Founders Fund, and First Spark Ventures.


The Inworld product suite includes: 


Inworld Engine powers real-time experiences with groundbreaking game mechanics, dynamic NPCs, and worlds that evolve with each action. AI NPCs can learn and adapt, deliver nuanced performances, perceive the world around them, and autonomously initiate actions based on players' decisions. 


Inworld Studio consists of a suite of tools that enhance game design. Using AI to streamline workflows, the Studio enables developers to workshop, draft, and outline storylines, narratives, dialogue, quests, and more.  


Inworld Core is our custom solution for future-proof AI infrastructure, including custom models, training, serving, and security.
