Senior / Principal Inference Engineer - ML Platform

2 Months ago • 4 Years + • Devops • $273,070 PA - $322,170 PA

Job Summary

Job Description

Roblox is seeking a Senior / Principal Inference Engineer to build the next generation of ML Ecosystem Tooling, focusing on model inference. The role involves setting technical strategy, overseeing development of high-scale, reliable infrastructure for large-scale inference, and optimizing performance across the inference stack. The engineer will stay updated on industry trends, bootstrap and maintain infrastructure components like the Serving Layer, Metadata Store, Model Registry, and Pipeline Orchestrator, and partner with other organizations to enhance the ML@Roblox platform. This position requires a strong background in building complex distributed systems for real-time ML inference serving.
Must have:
  • 4+ years of professional experience
  • System design experience
  • Build scalable, reliable platforms
  • Complex distributed systems
  • Real-time ML inference serving
  • Low latency, high throughput inference
  • Bachelor's degree in CS or related field
  • Support internal partners
  • Fix weaknesses in systems
Good to have:
  • Experience with recommendation systems
  • Familiar with Triton Inference Server
  • Familiar with TensorRT
  • Familiar with KServe

Job Details

Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences– all created by our global community of developers and creators. 

At Roblox, we’re building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device. We’re on a mission to connect a billion people with optimism and civility, and looking for amazing talent to help us get there. 

A career at Roblox means you’ll be working to shape the future of human interaction, solving unique technical challenges at scale, and helping to create safer, more civil shared experiences for everyone.

As a Senior / Principal Inference Engineer on ML Platform you will build the next generation of ML Ecosystem Tooling, specifically around model inference. ML Platform today supports billions of requests per day across our homepage, marketplace, economy, and more. We are looking for accomplished engineers to help build out the next generation of ML platform tooling for high-scale inference in a quickly innovating space.

You Will:

  • Set technical strategy and oversee development of high scale, reliable infrastructure systems for large-scale inference, especially as we scale up both inference qps and model size.
  • Dig into performance bottlenecks all along the inference stack, spanning from model optimizations to infrastructure optimizations.
  • Stay abreast of industry trends in machine learning and infrastructure to ensure the adoption of leading-edge technologies and practices.
  • Bootstrap and maintain infrastructure for ML Platform components—Serving Layer, Metadata Store, Model Registry, and Pipeline Orchestrator.
  • Partner across organizations to build tooling, interfaces, and visualizations that make the ML@Roblox a delight to use.

You Have:

  • 4+ years of professional experience and a tool chest of system design experience upon which to draw to build scalable, reliable platforms for all of Roblox.
  • Experience building complex distributed systems that scale to real-time ML inference serving, ideally for real-time recommendation systems serving millions of QPS.
  • Experience debugging complicated infrastructure-level performance issues to enable low latency, high throughput inference..
  • Bachelor's degree or higher in Computer Science, Computer Engineering, Data Science, or a similar technical field.

You Are:

  • Passionate about supporting and working cross functionally with internal partners (Data Scientists and ML Engineers) to meet and understand their needs.
  • A reliability nut: you love digging into tricky postmortems and identifying and fixing weaknesses in complicated systems.
  • Ideally familiar with ML model inference frameworks like Triton Inference Server, TensorRT, KServe.

For roles that are based at our headquarters in San Mateo, CA: The starting base pay for this position is as shown below. The actual base pay is dependent upon a variety of job-related factors such as professional background, training, work experience, location, business needs and market demand. Therefore, in some circumstances, the actual salary could fall outside of this expected range. This pay range is subject to change and may be modified in the future. All full-time employees are also eligible for equity compensation and for benefits as described on this page.

Annual Salary Range
$273,070$322,170 USD

Roles that are based in our San Mateo, CA Headquarters are in-office Tuesday, Wednesday, and Thursday, with optional in-office on Monday and Friday (unless otherwise noted).

Roblox provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. Roblox also provides reasonable accommodations for all candidates during the interview process.

Similar Jobs

Epic Games - Senior Data Analyst, Unreal Engine & Creator Products

Epic Games

(On-Site)
7 Months ago
Apple - Software Engineer, Audio/Music Engineering

Apple

Culver City, California, United States (On-Site)
1 Month ago
Greenworks Sunrise Global Marketing - Field Service Technician

Greenworks Sunrise Global Marketing

Orlando, Florida, United States (On-Site)
2 Months ago
Illumina - Senior Software Engineer (OS)

Illumina

Singapore (On-Site)
3 Months ago
Capgemini - SAP BRIM Consultant

Capgemini

Bengaluru, Karnataka, India (On-Site)
2 Months ago
Glocomms - Senior Cloud Engineer

Glocomms

Dallas, Texas, United States (On-Site)
1 Month ago
Synechron - Solution Architect

Synechron

Chennai, Tamil Nadu, India (On-Site)
1 Month ago
Brillio - .NET Azure Architect - R01525011

Brillio

Pune, Maharashtra, India (Hybrid)
10 Months ago
Ubisoft - Senior/Expert Online Infrastructure Engineer

Ubisoft

Malmö, Skåne County, Sweden (Hybrid)
1 Month ago
London stock Exchange - Senior DevOps Engineer

London stock Exchange

Colombo, Western Province, Sri Lanka (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Illumina - EHS Specialist, Hazard Communication Specialist

Illumina

Bengaluru, Karnataka, India (On-Site)
1 Year ago
Axel springer - Technical Account Manager

Axel springer

New York, United States (On-Site)
1 Month ago
TransUnion - Senior Manager - Customer Success

TransUnion

Gurugram, Haryana, India (On-Site)
3 Weeks ago
Flexra Software - Senior Backend Engineer

Flexra Software

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
1 Month ago
eBay - Staff Engineer - Video Streaming Expert

eBay

San Jose, California, United States (Hybrid)
2 Months ago
Greenworks Sunrise Global Marketing - Service & Rework Technician

Greenworks Sunrise Global Marketing

Spanish Springs, Nevada, United States (On-Site)
3 Months ago
FlockSafety - Installation Technician

FlockSafety

Atlanta, Georgia, United States (Remote)
3 Weeks ago
Dave Ramsey - Senior Publicist

Dave Ramsey

Franklin, Tennessee, United States (On-Site)
1 Month ago
Kaseya - Senior Engineer - Cloud Ops

Kaseya

Bengaluru, Karnataka, India (On-Site)
10 Months ago
Lorikeet - Solutions Engineer

Lorikeet

London, England, United Kingdom (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in San Mateo, California, United States

CRB workforce  - VMware Cloud Foundation Engineer

CRB workforce

Houston, Texas, United States (Remote)
2 Months ago
Zscaler - Principal Software Engineer (ZDX)- Mac/IOS

Zscaler

San Jose, California, United States (Hybrid)
3 Months ago
singularity 6 - Publishing Application Drop Box

singularity 6

Los Angeles, California, United States (Hybrid)
2 Years ago
Apple - Simulation & Modeling Algorithm Engineer (iPhone)

Apple

Cupertino, California, United States (On-Site)
2 Months ago
Open Systems Technologies - Work Readiness Specialist/GED Remediation Instructor

Open Systems Technologies

Scranton, Pennsylvania, United States (On-Site)
1 Month ago
Pinterest - Senior Technical Program Manager, Data Labeling

Pinterest

San Francisco, California, United States (Remote)
1 Month ago
Apple - Mixed-Signal IC Design Engineer

Apple

San Diego, California, United States (On-Site)
2 Months ago
Illumina - Senior Manager, Government Affairs

Illumina

United States (Remote)
1 Month ago
Nintendo - Receiving Agent

Nintendo

New York, New York, United States (On-Site)
5 Months ago
OKX - Associate General Counsel, Licensing

OKX

Austin, Texas, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Devops Jobs

Axi - DevOps Engineer

Axi

Philippines (On-Site)
3 Weeks ago
Workato - Senior Automation Engineer

Workato

Chennai, Tamil Nadu, India (On-Site)
1 Month ago
Addepar - Principal Software Engineer - Core Platform

Addepar

New York, New York, United States (On-Site)
3 Months ago
Ubisoft - Build Engineer

Ubisoft

Paris, Île-de-France, France (Hybrid)
1 Month ago
AccelData - Senior Platform Engineer

AccelData

Bengaluru, Karnataka, India (On-Site)
10 Months ago
Epic Games - Senior DevOps Programmer

Epic Games

Porto Alegre, State Of Rio Grande Do Sul, Brazil (On-Site)
4 Months ago
AiDash - Software Development Engineer - III DevOps

AiDash

Bengaluru, Karnataka, India (Hybrid)
5 Months ago
Fearless - Software Engineer II (Cloud Transition Planning Consultant) Navy

Fearless

Charleston, South Carolina, United States (On-Site)
1 Month ago
appier - Technical Solution Engineer

appier

Beijing, China (On-Site)
3 Months ago
Palo Alto Networks - Senior Technical Support Engineer, Prisma Cloud - Focused Services

Palo Alto Networks

London, England, United Kingdom (Remote)
1 Month ago

Get notifed when new similar jobs are uploaded

About The Company

San Mateo, California, United States (On-Site)

San Mateo, California, United States (On-Site)

San Mateo, California, United States (Hybrid)

San Mateo, California, United States (Hybrid)

San Mateo, California, United States (Remote)

San Mateo, California, United States (On-Site)

San Mateo, California, United States (Hybrid)

San Mateo, California, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by Roblox

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug