SRE, Observability System Administrator

2 Years + • System Admin

Job Summary

Job Description

The Observability System Administrator role at Toast sits within the Observability Enablement & Administration team, part of Site Reliability Engineering, which oversees Toast production services with a commitment to quality, reliability, and low latency. The team sets the overall observability strategy, chooses the right tools and technologies, develops best practices, and provides guidance to other teams, while maintaining and administering the observability platform and log pipelines and governing their cost.
Must have:
  • Participate in observability architecture design, support, and platform management
  • Gather and analyze metrics from operating systems and applications to give development teams observability insights
  • Manage users and roles, monitor platform performance, and ensure security and high availability
  • Automate operational toil for observability-focused administrative tasks
  • Build and support automation for legal and compliance requirements
  • Support end users with training and technical guidance on observability tools and capabilities
  • Maintain accurate documentation of configurations, workflows, and procedures
  • Manage data ingestion and parsing to ensure data integrity and availability
  • Design and manage dashboards, reports, alerts, and visualizations
  • Implement strategies to increase observability system reliability and performance through on-call rotation and process optimization
  • Use observability tools to diagnose application and infrastructure issues and incidents
Good to have:
  • Splunk power user/administrator experience preferred
Perks:
  • competitive compensation and benefits programs
  • healthy lifestyle
  • flexibility to meet Toasters’ changing needs
  • hybrid work model
  • accessible and inclusive hiring process
  • reasonable accommodations for persons with disabilities

Job Details

About this roll\* (Responsibilities)

You will be responsible for the administration, maintenance, and enhancement of our observability platforms, ensuring optimal performance and availability for our critical security and business operations. In this role you will:

  • Participate in observability architecture design, support, and platform management
  • Gather and analyze metrics from operating systems and applications to give development teams observability insights
  • Manage users and roles, monitor platform performance, and ensure security and high availability
  • Automate operational toil for observability-focused administrative tasks
  • Build and support automation for legal and compliance requirements
  • Support end users with training and technical guidance on observability tools and capabilities
  • Maintain accurate documentation of configurations, workflows, and procedures
  • Manage data ingestion and parsing to ensure data integrity and availability
  • Design and manage dashboards, reports, alerts, and visualizations
  • Implement strategies to increase observability system reliability and performance through on-call rotation and process optimization
  • Use observability tools to diagnose application and infrastructure issues and incidents

Do you have the right ingredients\*? (Requirements)

  • Polyglot technologist/generalist with a thirst for learning
  • Understanding of cloud and microservice architecture
  • Experience with tools such as APM, RUM, Synthetics, Splunk, OTEL, log pipelines, SIEM, and Terraform
  • Automation/scripting experience with Go, Python, etc.
  • Splunk power user/administrator experience preferred
  • At least 2 years of industry experience in observability, with a focus on SRE or observability platform management

AI at Toast

At Toast we’re Hungry to Build and Learn. We believe learning new AI tools empowers us to build for our customers faster, more independently, and with higher quality. We provide these tools across all disciplines, from Engineering and Product to Sales and Support, and are inspired by how our Toasters are already driving real value with them. The people who thrive here are those who embrace changes that let us build more for our customers; it’s a core part of our culture.

Our Spread\* of Total Rewards

We strive to provide competitive compensation and benefits programs that help to attract, retain, and motivate the best and brightest people in our industry. Our total rewards package goes beyond great earnings potential and provides the means to a healthy lifestyle with the flexibility to meet Toasters’ changing needs. Learn more about our benefits at https://careers.toasttab.com/toast-benefits.

\*Bread puns encouraged but not required

Diversity, Equity, and Inclusion is Baked into our Recipe for Success

At Toast, our employees are our secret ingredient—when they thrive, we thrive. The restaurant industry is one of the most diverse, and we embrace that diversity with authenticity, inclusivity, respect, and humility. By embedding these principles into our culture and design, we create equitable opportunities for all and raise the bar in delivering exceptional experiences.

We Thrive Together

We embrace a hybrid work model that fosters in-person collaboration while valuing individual needs. Our goal is to build a strong culture of connection as we work together to empower the restaurant community. To learn more about how we work globally and regionally, check out: https://careers.toasttab.com/locations-toast.

Apply today!

Toast is committed to creating an accessible and inclusive hiring process. As part of this commitment, we strive to provide reasonable accommodations for persons with disabilities to enable them to access the hiring process. If you need an accommodation to access the job application or interview process, please contact candidateaccommodations@toasttab.com.

------

For roles in the United States: it is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability.
