Engineering Manager, Compute Platform Orchestration

9 Hours ago • 5 Years + • Software Development & Engineering • $217,000 PA - $303,900 PA

Job Summary

Job Description

Reddit is seeking an Engineering Manager for Compute Platform Orchestration to lead and build common software, frameworks, and constructs for the company's infrastructure. The role involves managing a team responsible for the Linux kernel up to Kubernetes autoscalers and multi-cluster orchestration. Responsibilities include leading high-leverage projects, hiring and mentoring engineers, collaborating with various teams, and evolving technical and non-technical abilities. The ideal candidate will have experience in people management of platform or infrastructure engineering teams, with a strong background in cloud infrastructure and compute platforms, including developing and shipping software. A focus on scalability, performance, user experience, and quality is essential.
Must have:
  • 5+ years experience in cloud infrastructure and compute platforms
  • Experience developing and shipping software
  • Strong technical judgment for cloud infrastructure systems
  • Focus on scalability, performance, user experience, and quality
  • High empathy and excellent communication skills
  • 2-4 years experience in people management
Good to have:
  • Experience with distributed systems at web scale
  • Experience in Go, Kubernetes, Argo, Flux, and other CNCF projects
Perks:
  • Comprehensive Healthcare Benefits and Income Replacement Programs
  • 401k Match
  • Family Planning Support
  • Gender-Affirming Care
  • Mental Health & Coaching Benefits
  • Flexible Vacation & Reddit Global Days Off
  • Generous paid Parental Leave
  • Paid Volunteer time off

Job Details

The Infrastructure organization has the mission of building and delivering common software, frameworks, and constructs for the rest of Reddit. Reddit has a complex production serving environment that spans AWS and GCP, with several Kubernetes compute clusters running across both.

Within Infrastructure Foundations, we are looking to expand our engineering management in Compute Platform / Infrastructure. This group is part software engineering (SWE), and its responsibility spans from the Linux kernel up to our Kubernetes autoscalers and multi-cluster orchestration/federation. Developing the platform that runs Reddit requires balancing developer experience, reliability, and safety in supporting one of the world’s largest websites.

Our teams are building and maintaining the complex software needed to rapidly grow and sustain Reddit's cloud compute production environment. Having experience managing high-performance software engineering teams is important for success in this role.  Previous experience with distributed systems at web scale (thousands of nodes, hundreds of systems) is a plus.  

This is a high-impact role where your positive contributions will be amplified through the Reddit technology stack and lead to direct business impact. You will work closely with other infrastructure teams, Developer Experience, Observability, Compute, Transport, and dozens of product teams, such as the ML infra team, that appreciate being able to run large-scale software securely, reliably, efficiently, and scalably.

You Will

  • Lead: Work with the team to select, scope, and drive high-leverage projects that align with Reddit’s goal of scaling our infrastructure to a large multiple of what it is today, abstracting away the technical details of our compute infrastructure behind a developer-facing platform.
  • Build: Hire, onboard, and build out your team to execute on a strategy and create a more efficient, scalable, and user-friendly compute platform.
  • Amplify: Mentor your ICs and be a leader for the team.
  • Collaborate: Work together with a variety of teams across Reddit Engineering.
  • Evolve: Learn and improve your own technical and non-technical abilities.

What we’re looking for

  • 2-4 years of experience in people management of high-performing platform or infrastructure engineering teams, including setting clear goals, driving execution, developing product roadmaps, and supporting career growth.
  • 5+ years of experience with cloud infrastructure and compute platforms.
    • This experience should include ample time developing and shipping software.
  • Strong technical judgment and ability to evaluate the quality of engineering decisions related to cloud infrastructure systems (Kubernetes, AWS, GCE). Accountability for the team's technical output and operational decisions.
  • Strong focus on scalability, performance, user experience, and quality. You are an undying advocate for the user, and you have a deep intuition for how critical infra systems work at scale and how to improve user experience through platforms.
  • High empathy, excellent communication skills, and the ability to find compromise when working across the entire engineering org.
  • Experience in Go, Kubernetes, Argo, Flux, and other CNCF landscape projects is a huge plus.

Benefits:

  • Comprehensive Healthcare Benefits and Income Replacement Programs
  • 401k Match
  • Family Planning Support
  • Gender-Affirming Care
  • Mental Health & Coaching Benefits
  • Flexible Vacation & Reddit Global Days Off
  • Generous paid Parental Leave  
  • Paid Volunteer time off

#LI-remote, #LI-JS5
