Staff Software Engineer, Messaging Infrastructure

12 Hours ago • 7 Years + • $206,700 PA - $289,400 PA

Job Summary

Job Description

The Staff Software Engineer on the Messaging Infrastructure team will be responsible for developing and maintaining the foundational platform for Reddit's infrastructure, focusing on engineering Kafka as a platform and promoting event-driven architectures. This includes automating the management of Kafka clusters on Kubernetes, developing APIs and controllers for Kafka operations at scale, and working on schema registries, capacity management, and developer tooling. The role involves collaborating with a team of engineers, contributing to the technical direction of eventing, and sharing on-call responsibilities. The work directly impacts hundreds of millions of users worldwide. The role is remote.
Must have:
  • 7+ years of experience developing internet-scale software, preferably infrastructure.
  • Experience with asynchronous communication patterns and solutions.
  • Experience developing on top of Kubernetes or similar distributed systems.
  • Strong troubleshooting capabilities surrounding systems and software.
  • Experience engineering large systems and being a self-starter.
  • Excellent communication skills to collaborate with a service-oriented team.
Good to have:
  • Familiarity with Kafka specifically or experience building large-scale infrastructure.
  • Kubernetes controller or operator development experience.
Perks:
  • Comprehensive Healthcare Benefits and Income Replacement Programs
  • 401k Match
  • Family Planning Support
  • Gender-Affirming Care
  • Mental Health & Coaching Benefits
  • Flexible Vacation & Reddit Global Days off
  • Generous paid Parental Leave
  • Paid Volunteer time off

Job Details

Reddit is a community of communities. It’s built on shared interests, passion, and trust and is home to the most open and authentic conversations on the internet. Every day, Reddit users submit, vote, and comment on the topics they care most about. With 100,000+ active communities and approximately 101M+ daily active unique visitors, Reddit is one of the internet’s largest sources of information. For more information, visit redditinc.com.

The Messaging Infrastructure team is looking to hire a Staff Software Engineer that thrives at the intersection of infrastructure and software development. This team’s challenges break into 2 domains, which we consider engineering Kafka as a platform and promoting event driven architectures.

Engineering Kafka as a platform: Higher-level orchestration of Kafka to support our multi-cloud, multi-region, deployments on top of Kubernetes. A subset of current focuses include:

  • Software automation that creates, manages, and destroys stateful Kafka clusters in our stateless Kubernetes environments.
  • APIs and controllers that support operation of kafka clusters at scale atop kubernetes
  • Automation of replication and other cluster management concerns.

Promoting event driven architectures: Reddit is increasingly adopting geographically distributed, eventually consistent architectures. As part of this shift, many systems are shifting to event driven architectures, and the Messaging team is working to make this adoption simpler and faster. Current projects are:

  • Schema Registries and enforcement
  • Capacity management and ACLs through client enforced authentication
  • GitOps and other developer experience tooling for Kafka interaction

As a member of the Messaging team, your work will span these 2 domains, which are rich with challenging infrastructure and software engineering problems. You will work with a team of engineers to develop a vision for asynchronous communication tooling. Your work will directly impact hundreds of millions of users around the world. Join us and help build the future of Reddit!

In your day-to-day, you can expect to:

  • Work collaboratively with a team of software engineers to create and maintain the foundational platform for running Reddit’s infrastructure.
  • Deliver software to improve the availability, scalability, latency, and efficiency of Kafka and other messaging components.
  • Contribute feedback to the technical and strategic direction of eventing at Reddit.
  • Automate critical aspects of the event driven development process
  • Share on-call responsibilities. 

You have:

  • 7+ years of experience developing internet-scale software, preferably in the context of infrastructure.
  • Experience with asynchronous communication patterns and solutions, bonus if familiar with Kafka specifically or experience building large-scale infrastructure.
  • Experience developing on top of Kubernetes or similar distributed systems.
    • Kubernetes controller or operator development experience is a huge plus.
  • Strong troubleshooting capabilities surrounding both systems and software.
  • Experience engineering large systems, tracking work, and being a self-starter on projects.
  • Excellent communication skills to collaborate with a service-oriented team and company.
  • Experience navigating cross-functional migrations
  • Experience mentoring other engineers.

Benefits:

  • Comprehensive Healthcare Benefits and Income Replacement Programs
  • 401k Match
  • Family Planning Support
  • Gender-Affirming Care
  • Mental Health & Coaching Benefits
  • Flexible Vacation & Reddit Global Days off
  • Generous paid Parental Leave  
  • Paid Volunteer time off

#LI-remote, #LI-JS5

Similar Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Similar Skill Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Jobs in San Francisco, California, United States

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Similar Category Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!