Staff Platform Engineer, MLOps

1 Month ago • 7-7 Years • DevOps • $180,000 PA - $280,000 PA

Job Summary

Job Description

As a Staff Platform Engineer (MLOps), you will design, deploy, and maintain cloud infrastructure for Inworld's AI Engine and Studio. Responsibilities include developing and optimizing the ML model lifecycle using the Inworld AI platform and Nvidia CUDA; implementing CI/CD systems for ML workflows; monitoring models for issues; designing MLOps tools; and fostering a 'you build it, you run it' culture. You will manage CI/CD pipelines, enhance engineering speed and efficiency, conduct root cause analysis, and share best practices. The role requires extensive experience with infrastructure-as-code, Kubernetes, CI/CD pipelines (Terraform/Terragrunt, ArgoCD, etc.), and cloud platforms.
Must have:
  • 7+ years software engineering experience
  • 5+ years infrastructure-as-code experience
  • Kubernetes cluster management proficiency
  • CI/CD pipeline creation and maintenance
  • Cloud provider expertise (GCP, Azure, Oracle)
  • Backend programming (Golang, Python, Bash)
Good to have:
  • Open source LLM & serving solution familiarity
  • SLURM experience
  • Data pipeline & workflow management tools experience
  • Bare metal GPU experience

Job Details

view open roles

Why Join Inworld

Inworld is the leading provider of AI technology for real-time interactive experiences, with a $500 million valuation and backing from top tier investors including Intel Capital, Microsoft’s M12 fund, Lightspeed Venture Partners, Section 32, BITKRAFT Ventures, Kleiner Perkins, Founders Fund, and First Spark Ventures.

Inworld provides the market’s best framework for building production ready interactive experiences, coupled with dedicated services to optimize specific stages of development – from design and development, to ML pipeline optimization and custom compute infrastructure. We help developers bring their AI engines in-house with a framework optimized for real-time data ingestion, low latency, and massive scale. Inworld powers experiences built by Ubisoft, NVIDIA, Niantic, NetEase Games and LG, among others, and has partnerships with key industry players such as Microsoft Xbox, Epic Games, and Unity. 

Inworld was recognized by CB Insights as one of the 100 most promising AI companies in the world in 2024 and was named among LinkedIn's Top Startups of 2024 in the USA.

 

About the Role:

As a Staff Platform Engineer (MLOps), you'll work closely with backend and ML Engineering teams to design, deploy, and maintain reliable, high-performance, and secure cloud infrastructure for our AI Engine and Studio. 

 

What you'll do:

  • Develop, manage, and optimize the ML model lifecycle in production using the Inworld AI platform and Nvidia CUDA, implementing CI/CD systems for ML workflows, monitoring models to identify issues and inefficiencies, and designing MLOps tools and frameworks to enhance automation and efficiency.
  • Facilitate a "you build it, you run it" culture by providing the necessary tools and processes for monitoring the reliability, availability, and performance of services.
  • Manage CI/CD pipelines to ensure smooth and efficient code integration and deployment.
  • Identify and implement opportunities to enhance engineering speed and efficiency.
  • Conduct root cause analysis to identify critical issues and develop automated solutions to prevent recurrence.
  • Develop and share best practices to improve automation and efficiency across our engineering teams.

 

Expected experience:

  • 7 years of experience in software engineering.
  • 5 years of experience with infrastructure-as-code.
  • Proficiency in managing Kubernetes clusters and applications, including creating Helm charts/Kustomize manifests for new applications.
  • Experience in creating and maintaining CI/CD pipelines for both applications and infrastructure deployments (using tools like Terraform/Terragrunt, ArgoCD, GitHub Actions, Ansible, etc.).
  • Deep knowledge of at least one major cloud provider (Google Cloud Platform, Microsoft Azure, Oracle Cloud).
  • Proficient in at least one backend programming/scripting languages such as Golang, Python, and Bash.
  • Familiarity with open source LLM and open source serving solution (e.g. vLLM or llama.cpp, kserve, etc) is a plus.
  • Experience with SLURM
  • Experience with data pipeline and workflow management tools
  • Experience with bare metal GPUs (optional)

 

In-office location: Mountain View, CA, United States. You must be available for hybrid work. 

 

The US base salary range for this full-time position is $180,000 - $280,000. In addition to base pay, total compensation includes equity and benefits. Within the range, individual pay is determined by work location, level, and additional factors, including competencies, experience, and business needs. The base pay range is subject to change and may be modified in the future.

Inworld Jobs Privacy

Similar Jobs

N-ix - Senior Python Engineer (Part-Time)

N-ix

Poland (Remote)
1 Month ago
Canva - Senior Technical Program Manager - Compute and Networking (Core Infra)

Canva

Sydney, New South Wales, Australia (Remote)
1 Month ago
TransUnion - Lead developer, Java full stack

TransUnion

Hyderabad, Telangana, India (Hybrid)
1 Week ago
Canonical - Linux Enablement - Software Engineering Manager

Canonical

Beijing, China (On-Site)
2 Weeks ago
playrix  - Senior C++ Software Engineer (Gameplay)

playrix

Montenegro (Remote)
7 Months ago
metacore - DevOps Advocate

metacore

Helsinki, Uusimaa, Finland (Hybrid)
2 Months ago
Luxoft - Senior Software Support Engineer

Luxoft

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (Remote)
6 Months ago
Ion - Site Reliability Engineer

Ion

Collecchio, Emilia-Romagna, Italy (Hybrid)
7 Months ago
Britive - SENIOR SOFTWARE ENGINEER (CLOUD)

Britive

Bengaluru, Karnataka, India (Remote)
6 Months ago
Rackspace Technology - Azure Cloud Engineer III

Rackspace Technology

Bengaluru, Karnataka, India (Remote)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

truecaller - Senior Staff Data Engineer II

truecaller

Stockholm, Stockholm County, Sweden (On-Site)
2 Weeks ago
GoTo Group - Senior Software Engineer - Event Platform

GoTo Group

Gurugram, Haryana, India (On-Site)
7 Months ago
Ansys - Software R&D Engineer II - Acoustic & Vibration, NVH (f/m)

Ansys

Villeurbanne, Auvergne-Rhône-Alpes, France (On-Site)
2 Weeks ago
easy brain - Build Engineer

easy brain

(Remote)
2 Weeks ago
Social Discovery Ventures - QA Engineer

Social Discovery Ventures

Poland (Remote)
1 Week ago
that game company - Senior Backend Engineer - China

that game company

Shanghai, Shanghai, China (On-Site)
2 Months ago
balbex - Staff DevOps Engineer

balbex

Gurugram, India (On-Site)
2 Months ago
Roof Stacks - React Native Developer

Roof Stacks

Istanbul, İstanbul, Türkiye (On-Site)
1 Month ago
Comscore - Senior QA Automation Engineer

Comscore

Pune, Maharashtra, India (On-Site)
1 Month ago
CD PROJEKT RED - DevOps Engineering Manager

CD PROJEKT RED

Warsaw, Masovian Voivodeship, Poland (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in Mountain View, California, United States

pentair - Warehouse Technician

pentair

Roseville, Minnesota, United States (On-Site)
1 Week ago
Riot Games - Manager, Software Engineering - Payments

Riot Games

Los Angeles, California, United States (On-Site)
1 Month ago
Scientific Games - Sales Account Manager II

Scientific Games

Tulsa, Oklahoma, United States (On-Site)
3 Days ago
PeopleFun - Senior Unity Engineer II

PeopleFun

United States (Remote)
1 Month ago
Motorola solutions - Senior Business Development Manager

Motorola solutions

Linthicum Heights, Maryland, United States (On-Site)
2 Weeks ago
Biofire DX - Microbiology Specialist

Biofire DX

United States (On-Site)
2 Weeks ago
WebFX - Jr. Paid Search Specialist

WebFX

Harrisburg, Pennsylvania, United States (On-Site)
7 Months ago
SBM Management - CSR Lead

SBM Management

Saint Joseph, Missouri, United States (On-Site)
1 Month ago
Mattel Inc - American Girl Hair Stylist

Mattel Inc

Chicago, Illinois, United States (On-Site)
1 Month ago
Nintendo - Partner Marketing Specialist

Nintendo

Redmond, Washington, United States (Hybrid)
11 Months ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

Smilegate - SRE Strategy Project Manager

Smilegate

Seongnam-si, Gyeonggi-do, South Korea (On-Site)
2 Months ago
bytedance - Site Reliability Engineer Intern

bytedance

Seattle, Washington, United States (On-Site)
1 Month ago
Rackspace Technology - Manager, Professional Services Delivery

Rackspace Technology

Gurugram, Haryana, India (Remote)
2 Months ago
Luxoft - Senior Software Support Engineer

Luxoft

Slovakia (Remote)
6 Months ago
NVIDIA - Senior Site Reliability Engineer - AI Research Clusters

NVIDIA

Pune, Maharashtra, India (On-Site)
1 Month ago
Ion - Cloud Engineer Kubernetes

Ion

Milan, Lombardy, Italy (Hybrid)
7 Months ago
Info Stretch - Lead Data Engineer

Info Stretch

Hyderabad, Telangana, India (On-Site)
7 Months ago
Smilegate - SRE Strategy Manager

Smilegate

Seongnam-si, Gyeonggi-do, South Korea (On-Site)
4 Months ago
The Walt Disney Company - Lead Software Engineer - Big Data Infrastructure

The Walt Disney Company

California, United States (On-Site)
2 Months ago
Trend Micro - Cloud Engineer (Golang/Python, Backend Focus) 雲端開發工程師

Trend Micro

Taipei City, Taiwan (On-Site)
7 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Mountain View, California, United States (On-Site)

Vancouver, British Columbia, Canada (On-Site)

Mountain View, California, United States (On-Site)

Mountain View, California, United States (On-Site)

Mountain View, California, United States (On-Site)

Vancouver, British Columbia, Canada (On-Site)

Mountain View, California, United States (On-Site)

Mountain View, California, United States (On-Site)

Mountain View, California, United States (On-Site)

Mountain View, California, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by Inworld AI

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug