Staff MLOps Engineer

7+ years of experience • $180,000 - $280,000 per year
Research Development

Job Description

As a Staff MLOps Engineer at Inworld, you will be instrumental in designing, building, and scaling the infrastructure that powers intelligent AI agents for real-time, immersive applications. This role involves streamlining the entire ML model lifecycle, from training to deployment, and ensuring high performance, reliability, and speed. You will collaborate with ML and backend teams, manage CI/CD pipelines, and provide technical leadership in MLOps best practices to drive engineering efficiency.
Good To Have:
  • Familiarity with open-source LLMs and open-source serving solutions (e.g., vLLM, llama.cpp, KServe).
  • Experience with bare metal GPUs.
  • Desire to work at a fast-growing Series A startup: comfortable with uncertainty, eager to own and scale new products, and willing to embrace an experimental, iterative development process.
Must Have:
  • Build and scale MLOps systems to streamline the end-to-end ML model lifecycle.
  • Design and implement robust model training, evaluation, and release pipelines.
  • Collaborate cross-functionally with ML and backend teams on scalable, secure infrastructure.
  • Facilitate a "you build it, you run it" culture by providing monitoring tools and processes.
  • Manage CI/CD pipelines for smooth and efficient code integration and deployment.
  • Identify and implement opportunities to enhance engineering speed and efficiency.
  • Provide technical leadership in ML engineering best practices and mentor junior engineers.
  • 7+ years of software engineering experience, with 5+ years of infrastructure-as-code.
  • Proficiency in managing Kubernetes clusters and applications, including Helm charts/Kustomize.
  • Experience in creating and maintaining CI/CD pipelines (Terraform, ArgoCD, GitHub Actions, Ansible).
  • Deep knowledge of at least one major cloud provider (Google Cloud Platform, Microsoft Azure, Oracle Cloud).
  • Proficient in at least one backend programming/scripting language (Golang, Python, Bash).
  • Knowledge of SLURM or similar job schedulers for distributed training.
  • Experience with data pipeline and workflow management tools.
  • Available for hybrid work in Mountain View, CA, United States.
Perks:
  • Equity
  • Benefits


About Inworld

At Inworld, we believe the processes of building, scaling, and evolving applications are monsters that consume value before it can reach users. Our mission is to solve evolution and transform static software into AI systems that autonomously evolve to better serve their users. We are building an intelligent runtime to conquer these monsters and make this vision a reality.

We are backed by investors such as Lightspeed, Section 32, Kleiner Perkins, Microsoft’s M12 venture fund, BITKRAFT, Founders Fund, and First Spark Ventures. Our technology is used by category leaders, including NVIDIA, Microsoft Xbox, Niantic, Wishroll, Little Umbrella and Streamlabs, among many others. Inworld has been recognized by CB Insights as one of the 100 most promising AI companies globally and has been named one of LinkedIn's Top 10 Startups in the USA.

About the role

At Inworld, we’re building the AI framework behind the next generation of real-time, immersive applications. As a Staff MLOps Engineer, you’ll design, build and scale the infrastructure that powers intelligent AI agents across massive consumer experiences while ensuring performance, reliability, and speed at every level.

What you’ll do

  • Build and scale MLOps systems to streamline the end-to-end ML model lifecycle on the Inworld AI platform, from training to deployment.
  • Design and implement robust model training, evaluation, and release pipelines.
  • Collaborate cross-functionally with ML and backend teams to design, deploy, and maintain scalable, secure infrastructure for Inworld’s AI Engine and Studio.
  • Facilitate a "you build it, you run it" culture by providing the necessary tools and processes for monitoring the reliability, availability, and performance of services.
  • Manage CI/CD pipelines to ensure smooth and efficient code integration and deployment.
  • Identify and implement opportunities to enhance engineering speed and efficiency.
  • Provide technical leadership in ML engineering best practices, raise the technical bar, and mentor junior engineers in MLOps principles.

Expected experience

  • 7+ years of software engineering experience, with 5+ years of infrastructure-as-code.
  • Proficiency in managing Kubernetes clusters and applications, including creating Helm charts/Kustomize manifests for new applications.
  • Experience in creating and maintaining CI/CD pipelines for both applications and infrastructure deployments (using tools like Terraform/Terragrunt, ArgoCD, GitHub Actions, Ansible, etc.).
  • Deep knowledge of at least one major cloud provider (Google Cloud Platform, Microsoft Azure, Oracle Cloud).
  • Proficiency in at least one backend programming or scripting language, such as Golang, Python, or Bash.
  • Knowledge of SLURM or similar job schedulers for distributed training.
  • Experience with data pipeline and workflow management tools.
  • Familiarity with open-source LLMs and open-source serving solutions (e.g., vLLM, llama.cpp, KServe) is a plus.
  • Experience with bare metal GPUs (optional).
  • Desire to work at a fast-growing Series A startup: comfortable with uncertainty, eager to own and scale new products, and willing to embrace an experimental, iterative development process.

The US base salary range for this full-time position is $180,000 - $280,000. In addition to base pay, total compensation includes equity and benefits. Within the range, individual pay is determined by work location, level, and additional factors, including competencies, experience, and business needs. The base pay range is subject to change and may be modified in the future.
