Senior Software Engineer, Compute Fleet Management

4 Hours ago • 5 Years + • Software Development & Engineering • $189,110 PA - $233,840 PA

Job Summary

Job Description

As a Senior Software Engineer in Roblox's Compute Group's Fleet Management, you will influence the evolution of its Private Cloud. You will build products to streamline the provisioning of GPUs and Compute resources, significantly improving AI capacity delivery, uptime, and OS security. This role focuses on building customer-centric products by writing Golang controllers for Kubernetes and developing higher-level gRPC APIs to abstract data center complexities from Roblox Builders. Responsibilities include developing and maintaining a fleet-wide machine daemon for hardware/software monitoring, runtime updates, and secure machine access; writing Golang controllers for fleet lifecycle management; handling OS installation, firmware provisioning, and secure recycling; and building a robust framework for HW/OS/Kernel validation and performance tuning.
Must have:
  • 5+ years of industry experience
  • Passion for Golang and practical experience with it
  • Excitement about the infrastructure problem space
  • Preference for building autonomous systems over ops
  • Understanding of the importance of documentation
  • Strong consideration for production health
  • Customer, team, and quality oriented
  • Detail-oriented and organized
Good to have:
  • Experience building reliable, sustainable production systems
  • Solution-oriented innovator
  • Excellent communicator with team-playing abilities
  • Adaptable to fast-paced environments
  • Committed to continual process improvement

Job Details

Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences, all created by our global community of developers and creators.

At Roblox, we’re building the tools and platform that empower our community to bring any experience that they can imagine to life. Our vision is to reimagine the way people come together, from anywhere in the world, and on any device. We’re on a mission to connect a billion people with optimism and civility, and we’re looking for amazing talent to help us get there.

A career at Roblox means you’ll be working to shape the future of human interaction, solving unique technical challenges at scale, and helping to create safer, more civil shared experiences for everyone.

As a Senior Software Engineer in Roblox's Compute Group's Fleet Management, you'll directly influence the evolution of our Private Cloud. You will build products to streamline provisioning of GPUs and Compute resources, significantly improving AI capacity delivery, uptime, and OS security.

While this role exposes you to the low-level components of our cloud and on-premise infrastructure, it is heavily focused on building products with a customer-centric approach. You will focus on delivering exceptional user experiences to Compute’s customers by writing Golang controllers for Kubernetes and developing higher-level gRPC APIs. This work abstracts the complexities of our data centers away from our Roblox Builders. People from diverse technical backgrounds have succeeded in this role, and we value diversity in our team.

You will:

  • Develop and maintain a fleet-wide machine daemon for efficient hardware/software monitoring, runtime updates, and secure machine access.
  • Write Golang controllers for Roblox's fleet lifecycle, ensuring smooth functioning at all times.
  • Handle OS installation, firmware provisioning, and secure recycling processes.
  • Build and maintain a robust framework for HW/OS/Kernel validation and performance tuning.
  • Provide abstraction across cloud and on-premise systems, supporting stateful services.

You have:

  • 5+ years of industry experience.
  • Passion for Golang and practical day-to-day experience coding in Go.
  • Excitement about the problem space; no prior experience in this specific area of infrastructure is required.
  • A preference for building autonomous systems over ops and repetitive tasks.
  • An appreciation for the importance of documentation for large-scale systems.
  • Deep care for production health and experience working on reliable, sustainable production systems.
  • A customer, team, and quality orientation.
  • A drive to get things done.

You are:

  • A solution-oriented innovator.
  • An excellent communicator with strong team-playing abilities.
  • Detail-oriented and highly organized.
  • Adaptable to fast-paced, evolving environments.
  • Committed to continual process improvement.

For roles that are based at our headquarters in San Mateo, CA: The starting base pay for this position is as shown below. The actual base pay is dependent upon a variety of job-related factors such as professional background, training, work experience, location, business needs and market demand. Therefore, in some circumstances, the actual salary could fall outside of this expected range. This pay range is subject to change and may be modified in the future. All full-time employees are also eligible for equity compensation and for benefits as described on this page.

Annual Salary Range
$189,110 - $233,840 USD

Roles that are based in our San Mateo, CA Headquarters are in-office Tuesday, Wednesday, and Thursday, with optional in-office on Monday and Friday (unless otherwise noted).

Roblox provides equal employment opportunities to all employees and applicants for employment and prohibits discrimination and harassment of any type without regard to race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state or local laws. Roblox also provides reasonable accommodations for all candidates during the interview process.
