Staff Software Engineer, Messaging Infrastructure

2 Months ago • 7 Years + • Devops • $206,700 PA - $289,400 PA

Job Summary

Job Description

The Staff Software Engineer, Messaging Infrastructure role involves developing and maintaining the foundational platform for Reddit's infrastructure. The engineer will work on engineering Kafka as a platform, including automation for cluster management and APIs. They will also promote event-driven architectures, focusing on schema registries, capacity management, and developer experience tooling. The role requires collaborating with a team, improving the availability and efficiency of messaging components, and contributing to the strategic direction of eventing. The responsibilities include on-call duties and directly impact millions of users.
Must have:
  • 7+ years of internet-scale software development experience, infrastructure preferred.
  • Experience with asynchronous communication patterns and solutions, Kafka knowledge is a plus.
  • Experience developing on Kubernetes or similar distributed systems.
  • Strong troubleshooting capabilities for systems and software.
  • Experience engineering large systems and being a self-starter.
  • Excellent communication skills for collaboration.
  • Experience navigating cross-functional migrations.
  • Experience mentoring other engineers.
Good to have:
  • Kubernetes controller or operator development experience
Perks:
  • Comprehensive Healthcare Benefits and Income Replacement Programs
  • 401k Match
  • Family Planning Support
  • Gender-Affirming Care
  • Mental Health & Coaching Benefits
  • Flexible Vacation & Reddit Global Days off
  • Generous paid Parental Leave
  • Paid Volunteer time off

Job Details

Reddit is a community of communities. It’s built on shared interests, passion, and trust and is home to the most open and authentic conversations on the internet. Every day, Reddit users submit, vote, and comment on the topics they care most about. With 100,000+ active communities and approximately 101M+ daily active unique visitors, Reddit is one of the internet’s largest sources of information. For more information, visit redditinc.com.

The Messaging Infrastructure team is looking to hire a Staff Software Engineer that thrives at the intersection of infrastructure and software development. This team’s challenges break into 2 domains, which we consider engineering Kafka as a platform and promoting event driven architectures.

Engineering Kafka as a platform: Higher-level orchestration of Kafka to support our multi-cloud, multi-region, deployments on top of Kubernetes. A subset of current focuses include:

  • Software automation that creates, manages, and destroys stateful Kafka clusters in our stateless Kubernetes environments.
  • APIs and controllers that support operation of kafka clusters at scale atop kubernetes
  • Automation of replication and other cluster management concerns.

Promoting event driven architectures: Reddit is increasingly adopting geographically distributed, eventually consistent architectures. As part of this shift, many systems are shifting to event driven architectures, and the Messaging team is working to make this adoption simpler and faster. Current projects are:

  • Schema Registries and enforcement
  • Capacity management and ACLs through client enforced authentication
  • GitOps and other developer experience tooling for Kafka interaction

As a member of the Messaging team, your work will span these 2 domains, which are rich with challenging infrastructure and software engineering problems. You will work with a team of engineers to develop a vision for asynchronous communication tooling. Your work will directly impact hundreds of millions of users around the world. Join us and help build the future of Reddit!

In your day-to-day, you can expect to:

  • Work collaboratively with a team of software engineers to create and maintain the foundational platform for running Reddit’s infrastructure.
  • Deliver software to improve the availability, scalability, latency, and efficiency of Kafka and other messaging components.
  • Contribute feedback to the technical and strategic direction of eventing at Reddit.
  • Automate critical aspects of the event driven development process
  • Share on-call responsibilities. 

You have:

  • 7+ years of experience developing internet-scale software, preferably in the context of infrastructure.
  • Experience with asynchronous communication patterns and solutions, bonus if familiar with Kafka specifically or experience building large-scale infrastructure.
  • Experience developing on top of Kubernetes or similar distributed systems.
    • Kubernetes controller or operator development experience is a huge plus.
  • Strong troubleshooting capabilities surrounding both systems and software.
  • Experience engineering large systems, tracking work, and being a self-starter on projects.
  • Excellent communication skills to collaborate with a service-oriented team and company.
  • Experience navigating cross-functional migrations
  • Experience mentoring other engineers.

Benefits:

  • Comprehensive Healthcare Benefits and Income Replacement Programs
  • 401k Match
  • Family Planning Support
  • Gender-Affirming Care
  • Mental Health & Coaching Benefits
  • Flexible Vacation & Reddit Global Days off
  • Generous paid Parental Leave  
  • Paid Volunteer time off

#LI-remote, #LI-JS5

Similar Jobs

Toptracer - Lead Backend Engineer

Toptracer

Danderyd, Stockholm County, Sweden (Hybrid)
1 Month ago
Eccentric - WebGL Developer

Eccentric

Mumbai, Maharashtra, India (On-Site)
1 Month ago
Reddit - Engineering Manager - Ad Formats Foundation

Reddit

United States (Remote)
2 Months ago
appier - Senior Backend Developer

appier

Istanbul, İstanbul, Türkiye (Remote)
2 Weeks ago
Microsoft - Member of Technical Staff - Backend Engineer, Product

Microsoft

Mountain View, California, United States (Hybrid)
3 Months ago
bytedance - Site Reliability Engineer - CapCut - San Jose/Seattle

bytedance

San Jose, California, United States (On-Site)
8 Months ago
Argus - Site Reliability Engineer (South East Asia)

Argus

(Remote)
3 Months ago
bytedance - Cloud Site Reliability Engineer

bytedance

Seattle, Washington, United States (On-Site)
3 Months ago
Thales - Senior Technical Lead - DevOps

Thales

Bengaluru, Karnataka, India (Hybrid)
2 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

The Walt Disney Company - Principal Software Engineer - iOS

The Walt Disney Company

San Francisco, California, United States (Hybrid)
3 Months ago
Illumina - Director, Medical Affairs

Illumina

United States (Remote)
2 Months ago
Adyen - Alliances Partner Manager

Adyen

Amsterdam, North Holland, Netherlands (On-Site)
2 Months ago
Postman - Staff Engineer, Developer Tooling

Postman

Boston, Massachusetts, United States (Hybrid)
2 Months ago
YouGov - Senior Data Scientist / AI Engineer

YouGov

United States (Remote)
2 Weeks ago
Marsh McLennan - Digital Content Governance Specialist

Marsh McLennan

Warsaw, Masovian Voivodeship, Poland (Hybrid)
2 Months ago
upwork - Lead Product Designer

upwork

United States (Remote)
2 Weeks ago
Epic Games - Senior Director, Player Growth & Lifecycle Marketing

Epic Games

Cary, North Carolina, United States (On-Site)
2 Months ago
Apple - Channel Strategist

Apple

Sunnyvale, California, United States (On-Site)
2 Weeks ago
Riot Games - Senior Game Product Manager, Metasystem

Riot Games

Shanghai, China (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Jobs in New York, New York, United States

Roblox - Senior/Principal Software Engineer, Mobile UX (Android)

Roblox

San Mateo, California, United States (On-Site)
3 Weeks ago
Salesforce - Associate Account Partner (Public Sector)

Salesforce

Washington, District Of Columbia, United States (Hybrid)
1 Month ago
UPF Industries  - Assembler

UPF Industries

Hazlehurst, Georgia, United States (On-Site)
1 Month ago
Apple - Machine Learning Engineer

Apple

Cupertino, California, United States (On-Site)
2 Weeks ago
AI Fund - Founder-in-Residence/CEO (Realtime Voice-Developer Tools)

AI Fund

United States (Remote)
1 Month ago
Dynamis Inc - Information Systems Business Analyst

Dynamis Inc

Huntsville, Alabama, United States (On-Site)
2 Weeks ago
Nintendo - CONTRACT - Localization Product Specialist III (Spanish)

Nintendo

Redmond, Washington, United States (Hybrid)
8 Months ago
Actian - Senior Partner Marketing Manager

Actian

United States (Remote)
1 Month ago
oportun - Bilingual English/Spanish Member Loyalty Representative

oportun

Dallas, Texas, United States (On-Site)
2 Weeks ago
Critical mass - Art Director

Critical mass

Cupertino, California, United States (On-Site)
2 Weeks ago

Get notifed when new similar jobs are uploaded

Devops Jobs

Saama Technologies,  Inc  - Senior Site Reliability Engineer

Saama Technologies, Inc

Chennai, Tamil Nadu, India (On-Site)
9 Months ago
GHX - Automation Engineer III

GHX

Hyderabad, Telangana, India (On-Site)
2 Months ago
Prophecy - Cloud Engineer

Prophecy

Bengaluru, Karnataka, India (On-Site)
2 Months ago
GoTo Group - Sr. Software Engineer (iOS) - Engineering Platform (2)

GoTo Group

Bengaluru, Karnataka, India (On-Site)
9 Months ago
Ansys - Software Engineer II - Systems Integration

Ansys

Exton, Pennsylvania, United States (On-Site)
1 Month ago
GoTo Group - Lead Software Engineer - Engineering Platform

GoTo Group

Gurugram, Haryana, India (On-Site)
8 Months ago
SparkCognition - DevOps Engineer

SparkCognition

Bengaluru, Karnataka, India (On-Site)
10 Months ago
Ubisoft - Vulnerability DevOps Specialist

Ubisoft

Saint-Mandé, Île-de-France, France (Hybrid)
2 Months ago
bounteous - Junior Cloud/Infrastructure Engineer

bounteous

Chennai, Tamil Nadu, India (Hybrid)
3 Weeks ago
bytedance - Site Reliability Engineer, ML System

bytedance

Seattle, Washington, United States (On-Site)
8 Months ago

Get notifed when new similar jobs are uploaded