Staff Platform Engineer, MLOps

1 Week ago • 7-7 Years • DevOps • $180,000 PA - $280,000 PA

Job Summary

Job Description

As a Staff Platform Engineer (MLOps), you will design, deploy, and maintain cloud infrastructure for Inworld's AI Engine and Studio. Responsibilities include developing and optimizing the ML model lifecycle using the Inworld AI platform and Nvidia CUDA; implementing CI/CD systems for ML workflows; monitoring models for issues; designing MLOps tools; and fostering a 'you build it, you run it' culture. You will manage CI/CD pipelines, enhance engineering speed and efficiency, conduct root cause analysis, and share best practices. The role requires extensive experience with infrastructure-as-code, Kubernetes, CI/CD pipelines (Terraform/Terragrunt, ArgoCD, etc.), and cloud platforms.
Must have:
  • 7+ years software engineering experience
  • 5+ years infrastructure-as-code experience
  • Kubernetes cluster management proficiency
  • CI/CD pipeline creation and maintenance
  • Cloud provider expertise (GCP, Azure, Oracle)
  • Backend programming (Golang, Python, Bash)
Good to have:
  • Open source LLM & serving solution familiarity
  • SLURM experience
  • Data pipeline & workflow management tools experience
  • Bare metal GPU experience

Job Details

view open roles

Why Join Inworld

Inworld is the leading provider of AI technology for real-time interactive experiences, with a $500 million valuation and backing from top tier investors including Intel Capital, Microsoft’s M12 fund, Lightspeed Venture Partners, Section 32, BITKRAFT Ventures, Kleiner Perkins, Founders Fund, and First Spark Ventures.

Inworld provides the market’s best framework for building production ready interactive experiences, coupled with dedicated services to optimize specific stages of development – from design and development, to ML pipeline optimization and custom compute infrastructure. We help developers bring their AI engines in-house with a framework optimized for real-time data ingestion, low latency, and massive scale. Inworld powers experiences built by Ubisoft, NVIDIA, Niantic, NetEase Games and LG, among others, and has partnerships with key industry players such as Microsoft Xbox, Epic Games, and Unity. 

Inworld was recognized by CB Insights as one of the 100 most promising AI companies in the world in 2024 and was named among LinkedIn's Top Startups of 2024 in the USA.

 

About the Role:

As a Staff Platform Engineer (MLOps), you'll work closely with backend and ML Engineering teams to design, deploy, and maintain reliable, high-performance, and secure cloud infrastructure for our AI Engine and Studio. 

 

What you'll do:

  • Develop, manage, and optimize the ML model lifecycle in production using the Inworld AI platform and Nvidia CUDA, implementing CI/CD systems for ML workflows, monitoring models to identify issues and inefficiencies, and designing MLOps tools and frameworks to enhance automation and efficiency.
  • Facilitate a "you build it, you run it" culture by providing the necessary tools and processes for monitoring the reliability, availability, and performance of services.
  • Manage CI/CD pipelines to ensure smooth and efficient code integration and deployment.
  • Identify and implement opportunities to enhance engineering speed and efficiency.
  • Conduct root cause analysis to identify critical issues and develop automated solutions to prevent recurrence.
  • Develop and share best practices to improve automation and efficiency across our engineering teams.

 

Expected experience:

  • 7 years of experience in software engineering.
  • 5 years of experience with infrastructure-as-code.
  • Proficiency in managing Kubernetes clusters and applications, including creating Helm charts/Kustomize manifests for new applications.
  • Experience in creating and maintaining CI/CD pipelines for both applications and infrastructure deployments (using tools like Terraform/Terragrunt, ArgoCD, GitHub Actions, Ansible, etc.).
  • Deep knowledge of at least one major cloud provider (Google Cloud Platform, Microsoft Azure, Oracle Cloud).
  • Proficient in at least one backend programming/scripting languages such as Golang, Python, and Bash.
  • Familiarity with open source LLM and open source serving solution (e.g. vLLM or llama.cpp, kserve, etc) is a plus.
  • Experience with SLURM
  • Experience with data pipeline and workflow management tools
  • Experience with bare metal GPUs (optional)

 

In-office location: Mountain View, CA, United States. You must be available for hybrid work. 

 

The US base salary range for this full-time position is $180,000 - $280,000. In addition to base pay, total compensation includes equity and benefits. Within the range, individual pay is determined by work location, level, and additional factors, including competencies, experience, and business needs. The base pay range is subject to change and may be modified in the future.

Inworld Jobs Privacy

Similar Jobs

Aristocrat Gaming - Project Manager - AREA Suite of Apps

Aristocrat Gaming

Las Vegas, Nevada, United States (Hybrid)
1 Week ago
NVIDIA - NIM Solution Architect

NVIDIA

Shanghai, Shanghai, China (On-Site)
1 Week ago
Tide - Information Security Risk Lead

Tide

Delhi, India (On-Site)
1 Day ago
PwC - IN_Senior Associate _Java Developer _Data & Analytics _Advisory _PAN India

PwC

Kolkata, West Bengal, India (On-Site)
6 Months ago
BigID - Software Engineer Team Lead

BigID

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
5 Months ago
Netflix - Engineering Manager - Infrastructure Tooling, Data Platform

Netflix

Warsaw, Masovian Voivodeship, Poland (On-Site)
2 Weeks ago
Google - Customer Engineer, Small/Medium Businesses Google Cloud

Google

Mexico City, Mexico City, Mexico (On-Site)
2 Weeks ago
GoTo Group - Senior Software Engineer - Event Platform

GoTo Group

Gurugram, Haryana, India (On-Site)
6 Months ago
NVIDIA - Senior Software Configuration Management Engineer

NVIDIA

Bengaluru, Karnataka, India (Hybrid)
1 Month ago
UXBERT Labs - Senior Solution Architect (IoT/Bluetooth Integration)

UXBERT Labs

Riyadh, Riyadh Province, Saudi Arabia (Hybrid)
3 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Nagarro - Staff Engineer (Cloud Infrastructure)

Nagarro

Gurugram, Haryana, India (On-Site)
6 Months ago
Wargaming - Infrastructure Engineer

Wargaming

Belgrade, Serbia (Hybrid)
1 Week ago
Nielsen Holdings - Scala Developer

Nielsen Holdings

Bengaluru, Karnataka, India (On-Site)
5 Months ago
NCR Atleos - PS Engineer III

NCR Atleos

Hyderabad, Telangana, India (On-Site)
7 Months ago
Trend Micro - DevOps Engineer

Trend Micro

Manila, Metro Manila, Philippines (On-Site)
18 Years ago
GoGuardian - Site Reliability Engineer

GoGuardian

India (Remote)
7 Months ago
The Mill Adventure - Senior DevOps Engineer

The Mill Adventure

St. Julian's, Malta (Remote)
1 Month ago
Ubisoft - Senior Gameplay Programmer

Ubisoft

Barcelona, Catalonia, Spain (Hybrid)
2 Weeks ago
The Walt Disney Company - Staff Software Engineer – Full Stack

The Walt Disney Company

Orlando, Florida, United States (On-Site)
2 Weeks ago
ION - Cloud Engineer Kubernetes

ION

Italy (Hybrid)
6 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Mountain View, California, United States

Google - Lead Group Product Manager, Sports, Search

Google

New York, New York, United States (On-Site)
2 Days ago
Netflix - Sr. Manager, Workplace

Netflix

Los Gatos, California, United States (On-Site)
1 Week ago
ByteDance - Video Experience Software Engineer Intern (Global Streaming Media)

ByteDance

San Jose, California, United States (On-Site)
2 Weeks ago
Google - Technical Program Manager, Google Public Sector

Google

Washington, District Of Columbia, United States (On-Site)
1 Week ago
Scientific Games  - Director, Product Management

Scientific Games

Alpharetta, Georgia, United States (On-Site)
1 Week ago
Meta - Software Engineer, Infrastructure

Meta

Seattle, Washington, United States (Remote)
5 Months ago
Meta - Software Engineer, Android

Meta

Los Angeles, California, United States (On-Site)
2 Weeks ago
Scale AI - Strategic Finance Manager, Engineering/G&A

Scale AI

San Francisco, California, United States (On-Site)
1 Day ago
Nintendo - Contract - Localization Specialist (Portuguese)

Nintendo

Redmond, Washington, United States (Hybrid)
1 Week ago
Google - Senior Software Developer, Site Reliability Engineering, Google Cloud

Google

Sunnyvale, California, United States (On-Site)
2 Weeks ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

Google - Technical Account Manager, Google Cloud Consulting

Google

Sunnyvale, California, United States (On-Site)
2 Weeks ago
ByteDance - Software Engineer - Compute Infrastructure (Orchestration & Scheduling)

ByteDance

Seattle, Washington, United States (On-Site)
2 Weeks ago
Playrix - Senior Release Automation Engineer (Gardenscapes)

Playrix

Ireland (Remote)
3 Months ago
Google - Systems Development Engineer, Deployment, Public Sector

Google

Reston, Virginia, United States (On-Site)
2 Days ago
Google - Customer Engineer, Machine Learning, Google Cloud

Google

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
2 Weeks ago
Trend Micro - Staff/Sr. Cloud Service Engineer (VicOne_ Automotive Security)

Trend Micro

Taipei City, Taiwan (On-Site)
7 Months ago
Rackspace Technology - DEVOP Engineer (AWS Terraform)-PSDE III

Rackspace Technology

India (Remote)
5 Months ago
Google - Engineering Manager, YouTube Developer Infrastructure

Google

Bengaluru, Karnataka, India (On-Site)
2 Weeks ago
Nagarro - Senior Cloud Consultant

Nagarro

Germany (Remote)
1 Month ago

Get notifed when new similar jobs are uploaded

About The Company

Inworld powers AI-driven gameplay for video games and immersive experiences. 


Whether you’re looking to unlock novel gameplay, create content at scale, improve player immersion, or future proof your AI infrastructure, Inworld helps uplevel your game development with AI.


Inworld has worked with Xbox, Ubisoft, NVIDIA, NetEase Games, Niantic, LG UPlus, Alpine Electronics, and indie game developers to create AI-driven experiences. We’re backed by top-tier investors including Section 32, Intel Capital, Microsoft’s M12 fund, BITKRAFT Ventures, Kleiner Perkins, Founders Fund,and First Spark Ventures. 


The Inworld product suite includes: 


Inworld Engine powers real-time experiences with groundbreaking game mechanics, dynamic NPCs, and worlds that evolve with each action. AI NPCs can learn and adapt, deliver nuanced performances, perceive the world around them, and autonomously initiate actions based on players' decisions. 


Inworld Studio consists of a suite of tools that enhance game design. Using AI to streamline workflows, the Studio enables developers to workshop, draft, and outline storylines, narratives, dialogue, quests, and more.  


Inworld Core is our custom solution for future-proof AI infrastructure, including custom models, training, serving and security.

Mountain View, California, United States (On-Site)

Mountain View, California, United States (On-Site)

Vancouver, British Columbia, Canada (On-Site)

Vancouver, British Columbia, Canada (On-Site)

Mountain View, California, United States (On-Site)

Mountain View, California, United States (On-Site)

Mountain View, California, United States (On-Site)

Mountain View, California, United States (Hybrid)

Vancouver, British Columbia, Canada (On-Site)

Mountain View, California, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by Inworld AI

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug