Manager, Site Reliability Engineering Tooling

1 Month ago • All levels • Devops

Job Summary

Job Description

Toast is seeking a Manager of Site Reliability Engineering Tooling to lead their platform teams. The SRE team is responsible for overseeing Toast production services, focusing on quality, reliability, and low latency. This involves building automation tooling, developing and evangelizing best practices for scalability and observability, consulting with teams to improve systems, and participating in incident response. The manager will provide technical leadership, hands-on code contributions, and mentor a geographically distributed team. Key responsibilities include driving daily operations, developing the SRE roadmap, influencing architecture decisions, and guiding teams to build reliable systems.
Must have:
  • Manage an SRE team
  • Hands-on coding (Kotlin, Go, Python, Java/JVM)
  • Lead complex engineering projects
  • Build and run distributed systems
  • Understand systems, networking, scaling
  • Cloud infrastructure exposure
Good to have:
  • Mentoring engineers
  • Cross-functional collaboration
  • Scrum environment experience
  • Networking knowledge
  • Cloud architectures
  • SaaS solutions exposure
Perks:
  • Competitive compensation
  • Comprehensive benefits
  • Healthy lifestyle support
  • Flexible work arrangements

Job Details

Toast is driven by building the platform that helps restaurants adapt, take control, and focus on what they do best: creating experiences their guests love. Tremendous business growth has spurred a need for significant investment in Toast's platform teams. The Site Reliability Engineering team at Toast is responsible for overseeing Toast production services, with a commitment to quality, reliability, and low latency — without needing heroics. The team accomplishes this goal by:

  • Building tooling to automate, monitor, and manage deployed services using reliability best practices
  • Developing and evangelizing patterns and best practices to improve the scalability, observability, and reliability of all Toast systems
  • Consulting with teams to improve product scalability, observability, security, and reliability
  • Participating in outage response and root cause analysis for critical systems and infrastructure incidents

As a Manager of the Site Reliability Engineering team, you will provide technical leadership and hands-on code contributions, incorporating reliability best practices for programming and scripting, observability, production triage, incident resolution, and retrospective/root cause analysis to maintain the world-class reliability and uptime of our platform. 

About this roll* (Responsibilities) 

  • Enable a geographically distributed team of talented engineers to continue performing at a high level and help increase the impact of their work
  • Drive day-to-day operations of the team and contribute to the development and prioritization of the SRE roadmap for major initiatives
  • Create and drive strategic organization-wide scalability, observability, and reliability initiatives in collaboration with technical leadership and Product Management
  • Influence architecture decisions for your team and for individual services to optimize resilience and scalability
  • Guide teams to build and maintain systems that are reliable and available for Toast customers
  • Facilitate professional growth by mentoring engineers on your team

Do you have the right ingredients*? (Requirements)

  • Hands-on experience managing an SRE team, including hiring, mentoring, cross functional collaboration
  • Hands-on coding experience with Kotlin, Go, Python, Java/JVM
  • Background in leading complex engineering projects in a Scrum environment
  • Experience in building and running distributed systems
  • Exposure to networking, cloud architectures, and patterns 
  • Deep understanding of systems, networking, and scaling issues
  • Direct exposure to cloud infrastructure and SaaS solutions

**This is a hybrid role requiring in-office presence two days per week**

Our Spread* of Total Rewards
We strive to provide competitive compensation and benefits programs that help to attract, retain, and motivate the best and brightest people in our industry. Our total rewards package goes beyond great earnings potential and provides the means to a healthy lifestyle with the flexibility to meet Toasters’ changing needs. Learn more about our benefits at https://careers.toasttab.com/toast-benefits.

*Bread puns encouraged but not required



 

Diversity, Equity, and Inclusion is Baked into our Recipe for Success

At Toast, our employees are our secret ingredient—when they thrive, we thrive. The restaurant industry is one of the most diverse, and we embrace that diversity with authenticity, inclusivity, respect, and humility. By embedding these principles into our culture and design, we create equitable opportunities for all and raise the bar in delivering exceptional experiences.

We Thrive Together

We embrace a hybrid work model that fosters in-person collaboration while valuing individual needs. Our goal is to build a strong culture of connection as we work together to empower the restaurant community. To learn more about how we work globally and regionally, check out: https://careers.toasttab.com/locations-toast.

Apply today!

Toast is committed to creating an accessible and inclusive hiring process. As part of this commitment, we strive to provide reasonable accommodations for persons with disabilities to enable them to access the hiring process. If you need an accommodation to access the job application or interview process, please contact candidateaccommodations@toasttab.com.

------

For roles in the United States, It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability.

Similar Jobs

Mixpanel - Account Executive, Small and Medium Business

Mixpanel

San Francisco, California, United States (Hybrid)
2 Weeks ago
DevRev - Finance Manager - FP&A

DevRev

London, England, United Kingdom (On-Site)
1 Month ago
Tide - Senior Threat Detection Engineer

Tide

Lithuania (Remote)
2 Months ago
Mark43 - Vice President - Global Controller

Mark43

New York, United States (Hybrid)
2 Weeks ago
Arkose Labs - Security Analyst (Weekend Shift)

Arkose Labs

Brisbane, Queensland, Australia (On-Site)
1 Month ago
Scopely - DevOps Lead

Scopely

Barcelona, Catalonia, Spain (Hybrid)
1 Month ago
DraftKings - Lead Site Reliability Engineer

DraftKings

Boston, Massachusetts, United States (On-Site)
4 Months ago
Sailpoint - Senior Solutions Engineer

Sailpoint

Dallas, Texas, United States (On-Site)
3 Weeks ago
Apple - NLP Solutions Software Engineer

Apple

Cupertino, California, United States (On-Site)
4 Weeks ago
Addepar - Staff Site Reliability Engineer

Addepar

United States (Remote)
1 Week ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Safe security - Enterprise Account Executive

Safe security

United States (Remote)
1 Year ago
Regrello - Principal Software Engineer, Backend Systems

Regrello

United States (Remote)
1 Year ago
deel. - QA Automation Engineer | EMEA

deel.

Croatia (Remote)
1 Week ago
Fi - Product Solution Engineer

Fi

Bengaluru, Karnataka, India (On-Site)
1 Week ago
AccelData - Senior Backend Engineer

AccelData

Bengaluru, Karnataka, India (On-Site)
1 Year ago
Sprinkler - Field Sales Enablement Business Partner (APJ)

Sprinkler

Singapore (On-Site)
2 Months ago
Scopely - Analytics Engineering Manager

Scopely

Barcelona, Catalonia, Spain (Hybrid)
6 Months ago
deel. - Payroll Implementation Manager

deel.

Germany (Remote)
1 Week ago
Motorola solutions - Senior Procurement Category Manager - Software

Motorola solutions

Chicago, Illinois, United States (Hybrid)
2 Months ago
Varonis  - Account Manager

Varonis

Phoenix, Arizona, United States (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Dublin, County Dublin, Ireland

cyara - Associate Customer Success Manager

cyara

Skibbereen, County Cork, Ireland (Hybrid)
10 Months ago
Riot Games - Staff Software Engineer, Full-Stack - 2XKO

Riot Games

Dublin, County Dublin, Ireland (On-Site)
8 Months ago
Larian Studios - VFX Director

Larian Studios

Dublin, County Dublin, Ireland (On-Site)
9 Months ago
Varonis  - R&D Escalation Engineer

Varonis

Cork, County Cork, Ireland (On-Site)
5 Years ago
Whatnot - Customer Experience Team Lead (Weekend)

Whatnot

Dublin, County Dublin, Ireland (On-Site)
2 Months ago
Romero games - Lighting Artist

Romero games

Galway, County Galway, Ireland (Remote)
5 Months ago
playrix  - Generative AI Engineer

playrix

Ireland (Remote)
4 Months ago
Whatnot - Customer Experience Agent (German or Dutch Speaking)

Whatnot

Dublin, County Dublin, Ireland (Remote)
9 Months ago
Alpha Sense - Account Executive, Financial Services

Alpha Sense

Ireland (Remote)
1 Month ago
eBay - Traffic Software Engineer

eBay

Dublin, County Dublin, Ireland (Hybrid)
2 Weeks ago

Get notifed when new similar jobs are uploaded

Devops Jobs

Supabase - Platform Engineer: Compute & Scaling

Supabase

(Remote)
2 Months ago
T systems - SAP Basis Solution Architect

T systems

Pune, Maharashtra, India (On-Site)
3 Weeks ago
Corsair - Senior Software Embedded Architect

Corsair

Landshut, Bavaria, Germany (On-Site)
4 Months ago
GoDaddy - Full Stack Software Engineer -AWS

GoDaddy

Colombia (Remote)
1 Week ago
Fortra - Solutions Engineer

Fortra

United States (Remote)
1 Week ago
C3 IoT - Solution Engineer

C3 IoT

Chicago, Illinois, United States (On-Site)
3 Weeks ago
Cavnue - Senior Platform Infrastructure Engineer

Cavnue

United States (Remote)
2 Months ago
zeta - Sr. Site Reliability Engineer

zeta

Bengaluru, Karnataka, India (On-Site)
9 Months ago
Capgemini - Cloud Solution Architect

Capgemini

Bengaluru, Karnataka, India (On-Site)
2 Months ago
Tencent - Senior Site Reliability Engineer

Tencent

Shanghai, Shanghai, China (On-Site)
10 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Boston, Massachusetts, United States (Remote)

New York, United States (Hybrid)

Dublin, County Dublin, Ireland (Hybrid)

Dublin, County Dublin, Ireland (Hybrid)

United States (Remote)

Ottawa, Ontario, Canada (Hybrid)

London, England, United Kingdom (On-Site)

Tulsa, Oklahoma, United States (Hybrid)

Boston, Massachusetts, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by Toast

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug