Senior Software Engineer, Compute Fleet Management

2 Months ago • 5 Years + • Software Development & Engineering • $189,110 PA - $233,840 PA

Job Summary

Job Description

As a Senior Software Engineer in Roblox's Compute Group's Fleet Management, you will influence the evolution of their Private Cloud. You will build products to streamline the provisioning of GPUs and Compute resources, significantly improving AI capacity delivery, uptime, and OS security. This role focuses on building customer-centric products by writing Golang controllers for Kubernetes and developing higher-level gRPC APIs to abstract data center complexities from Roblox Builders. Responsibilities include developing and maintaining a fleet-wide machine daemon for hardware/software monitoring, runtime updates, and secure machine access; writing Golang controllers for fleet lifecycle management; handling OS installation, firmware provisioning, and secure recycling; and building a robust framework for HW/OS/Kernel validation and performance tuning.
Must have:
  • 5+ years of industry experience
  • Golang passion and practical experience
  • Excited about infrastructure problem space
  • Prefer autonomous systems over ops
  • Understand documentation importance
  • Strong consideration for production health
  • Customer, team, and quality oriented
  • Detail oriented and organized
Good to have:
  • Experience building reliable, sustainable production systems
  • Solution-oriented innovator
  • Excellent communicator with team-playing abilities
  • Adaptable to fast-paced environments
  • Committed to continual process improvement

Job Details

Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences– all created by our global community of developers and creators. 

At Roblox, we’re building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device. We’re on a mission to connect a billion people with optimism and civility, and looking for amazing talent to help us get there. 

A career at Roblox means you’ll be working to shape the future of human interaction, solving unique technical challenges at scale, and helping to create safer, more civil shared experiences for everyone.

As a Senior Software Engineer in Roblox's Compute Group's Fleet Management, you'll directly influence the evolution of our Private Cloud. You will build products to streamline provisioning of GPUs and Compute resources, significantly improving AI capacity delivery, uptime, and OS security.

While this role exposes you to the low-level components of our cloud and on-premise infrastructure, it is heavily focused on building products with a customer-centric approach. You will focus on delivering exceptional user experiences to Compute’s customers by writing Golang controllers for Kubernetes and developing higher-level gRPC APIs. This work abstracts the complexities of our data centers away from our Roblox Builders. People from diverse technical backgrounds have succeeded in this role, and we value diversity in our team.

You will:

  • Develop and maintain a fleet wide machine daemon for efficient hardware/software monitoring, runtime updates, and secure machine access.
  • Write Golang controllers for Roblox's fleet lifecycle, ensuring smooth functioning at all times.
  • Handle OS installation, firmware provisioning, and secure recycling processes.
  • Build and maintain a robust framework for HW/OS/Kernel validation and performance tuning.
  • Provide abstraction across cloud and on-premise systems, supporting stateful services.

You have:

  • 5+ years of industry experience.
  • Passion for Golang, and practical day to day experience coding in Go
  • No experience required in this specific area of infrastructure, but being excited about the problem space is a must.
  • Prefer building autonomous systems over ops and repetitive tasks.
  • Like and understand the importance of documentation for large scale systems.
  • Strong consideration for production health and experience working on reliable, sustainable production systems.
  • Care deeply about production health and reliable, sustainable production systems.
  • Customer, team, and quality oriented.
  • Like getting things done

You are:

  • A solution-oriented innovator.
  • An excellent communicator with strong team-playing abilities.
  • Detail oriented and highly organized.
  • Adaptable to fast-paced, evolving environments.
  • Committed to continual process improvement.

For roles that are based at our headquarters in San Mateo, CA: The starting base pay for this position is as shown below. The actual base pay is dependent upon a variety of job-related factors such as professional background, training, work experience, location, business needs and market demand. Therefore, in some circumstances, the actual salary could fall outside of this expected range. This pay range is subject to change and may be modified in the future. All full-time employees are also eligible for equity compensation and for benefits as described on this page.

Annual Salary Range
$189,110$233,840 USD

Roles that are based in our San Mateo, CA Headquarters are in-office Tuesday, Wednesday, and Thursday, with optional in-office on Monday and Friday (unless otherwise noted).

Roblox provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. Roblox also provides reasonable accommodations for all candidates during the interview process.

Similar Jobs

Alphawave Semi - Staff Engineer - Physical Design

Alphawave Semi

Bengaluru, Karnataka, India (Hybrid)
4 Weeks ago
Barracuda - Cloud Site Reliability Staff Developer

Barracuda

Ottawa, Ontario, Canada (Hybrid)
4 Months ago
Nagarro - Senior Compliance (GSEDD) Analyst

Nagarro

United States (Remote)
1 Month ago
SoftSwiss - Ruby on Rails Developer

SoftSwiss

Warsaw, Masovian Voivodeship, Poland (Remote)
2 Months ago
Capco - Asset Management Business Analyst

Capco

Geneva, Geneva, Switzerland (On-Site)
3 Months ago
legion - Optimization Engineer

legion

Bucharest, Bucharest, Romania (Hybrid)
2 Months ago
Optiv - Sr. Unix/Linux Engineer

Optiv

Columbia, Maryland, United States (On-Site)
2 Months ago
Palo Alto Networks - Senior Technical Support Engineer, Focused Services

Palo Alto Networks

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Autodesk - Senior Software Engineer

Autodesk

Bengaluru, Karnataka, India (On-Site)
2 Months ago
Apple - Engineering Project Manager

Apple

Sunnyvale, California, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Zscaler - Staff Software Development Engineer

Zscaler

Bengaluru, Karnataka, India (Hybrid)
4 Weeks ago
Varonis  - Technical Support Engineer

Varonis

Morrisville, North Carolina, United States (On-Site)
7 Months ago
P99 soft - Sr. React.js Developer

P99 soft

Hyderabad, Telangana, India (On-Site)
3 Months ago
bounteous - MxML Developer (Murex Back Office)

bounteous

Mexico City, Mexico (On-Site)
2 Months ago
PwC - Senior Associate II - Independence & Ethics

PwC

Karachi, Sindh, Pakistan (On-Site)
10 Months ago
Hawkeye Innovations - Football Tracking Systems Technician

Hawkeye Innovations

Udine, Friuli-Venezia Giulia, Italy (On-Site)
2 Months ago
Saronic Technologies - IT Engineer

Saronic Technologies

Austin, Texas, United States (On-Site)
1 Month ago
cyara - Support Engineer

cyara

United States (Remote)
5 Months ago
Xplor Technologies - Software Engineering Lead (.Net)

Xplor Technologies

Auckland, Auckland, New Zealand (Remote)
1 Month ago
Qualcomm - Engineer- Linux Int

Qualcomm

Hyderabad, Telangana, India (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Jobs in San Mateo, California, United States

DraftKings - Associate Corporate Counsel, Marketing (Bilingual)

DraftKings

New York, United States (On-Site)
2 Months ago
Captions - Community Manager

Captions

New York, United States (On-Site)
2 Months ago
Rackner - Business Development & Capture Manager

Rackner

Washington, District Of Columbia, United States (Remote)
3 Months ago
onwards Search - SEO Analyst

onwards Search

Orlando, Florida, United States (Hybrid)
1 Month ago
dbt Labs - Senior Software Engineer II

dbt Labs

United States (Remote)
1 Month ago
Ansys - Senior Application Engineer

Ansys

Austin, Texas, United States (On-Site)
1 Month ago
Games For Love - Mobile Game Production Mentor

Games For Love

Washington, United States (Remote)
5 Months ago
sony global (Games) - Research Intern on Generative AI for Content Creation

sony global (Games)

New York, United States (Remote)
3 Months ago
Apple - US-Manager

Apple

San Francisco, California, United States (On-Site)
3 Months ago
Palo Alto Networks - Finance Manager, Revenue Forecasting

Palo Alto Networks

Santa Clara, California, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Software Development & Engineering Jobs

WebMD - Lead, Data Engineering

WebMD

Newark, New Jersey, United States (On-Site)
9 Months ago
Thousand Eyes - Adoption Engineer

Thousand Eyes

Mexico City, Mexico (On-Site)
2 Months ago
onwards Search - Senior Software Engineer (UiPath)

onwards Search

Rochester, Minnesota, United States (Remote)
1 Month ago
Marvell - Principal Design Verification Engineer

Marvell

Santa Clara, California, United States (On-Site)
2 Months ago
Venoeer - Hardware Development Engineer

Venoeer

Bengaluru, Karnataka, India (Hybrid)
1 Month ago
Shield AI - Senior Project Engineer (R3398)

Shield AI

Dallas, Texas, United States (On-Site)
1 Month ago
valve software - Thermal Engineer

valve software

Bellevue, Washington, United States (On-Site)
3 Months ago
Western Digital - Technician 3, Engineering

Western Digital

Phra Nakhon Si Ayutthaya, Thailand (On-Site)
1 Month ago
Nasdaq - Lead Software Engineer

Nasdaq

Atlanta, Georgia, United States (Hybrid)
4 Weeks ago
Tesla - Craftsmanship Engineer

Tesla

Berlin, Berlin, Germany (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

About The Company

San Mateo, California, United States (On-Site)

San Mateo, California, United States (On-Site)

San Mateo, California, United States (On-Site)

San Mateo, California, United States (Hybrid)

San Mateo, California, United States (Hybrid)

San Mateo, California, United States (Remote)

San Mateo, California, United States (On-Site)

San Mateo, California, United States (Hybrid)

San Mateo, California, United States (Hybrid)

Seoul, South Korea (On-Site)

View All Jobs

Get notified when new jobs are added by Roblox

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug