Site Reliability Engineer, Process Automation

2 Years + • Devops

Job Summary

Job Description

The Site Reliability Engineer, Process Automation role at Toast fits within the Site Reliability Engineering team responsible for overseeing the Incident and Change Management processes at Toast. As a Site Reliability Engineer, Process Automation, you will provide automation for incident and change management processes, which improve release consistency and enable faster incident response. You will help maintain and improve key organizational processes for Incident and Change Management, which prevent customer impacts through change control, rapid detection, response, root cause analysis, and continuous learning from issues.
Must have:
  • Optimize existing processes, identify areas for improvement, and implement automated solutions to enhance efficiency and reliability of Toast systems
  • Utilize, configure, and support tools such as JIRA, FireHydrant, and Backstage for tracking events, incidents, and changes, and maintain the Service Catalog
  • Enable low-risk, compliant releases with rapid rollback capability to maintain platform reliability
  • Implement automation for risk mitigation strategies to minimize the impact of changes and releases on Toast customers
  • Work closely with leadership, third-party vendors, and relevant stakeholders to drive work to completion
  • At least 2 years of industry engineering experience with a focus on SRE
  • Bachelor’s degree in Computer Science, Engineering, or a related field
  • Working knowledge of complex cloud environments (AWS, GCP, Azure, etc.)
  • Experience scripting automation (Python, Go, etc.)
  • Experience with Infrastructure as Code (Terraform, etc.)
  • Experience participating in Incident Response
  • Strong written and verbal communication skills
  • Strong problem-solving skills and the ability to think strategically and analytically
  • Experience working with a diverse global team across multiple regions and time zones
Good to have:
  • Working knowledge of various best practice frameworks, including ITIL, ITSM, Agile/Scrum, change management, etc.
  • Experience with Incident and Change processes and tools (JIRA, OpsGenie, FireHydrant, DX, etc.)
Perks:
  • Hybrid work model that fosters in-person collaboration while valuing individual needs
  • Accessible and inclusive hiring process
  • Reasonable accommodations for persons with disabilities

Job Details

Engineering

Site Reliability Engineer, Process Automation

The Site Reliability Engineer, Process Automation role at Toast fits within the Site Reliability Engineering team responsible for overseeing the Incident and Change Management processes at Toast.

About this roll\* (Responsibilities)

As a Site Reliability Engineer, Process Automation, you will provide automation for incident and change management processes, which improve release consistency and enable faster incident response. You will help maintain and improve key organizational processes for Incident and Change Management, which prevent customer impacts through change control, rapid detection, response, root cause analysis, and continuous learning from issues.

You will:

  • Optimize existing processes, identify areas for improvement, and implement automated solutions to enhance efficiency and reliability of Toast systems
  • Utilize, configure, and support tools such as JIRA, FireHydrant, and Backstage for tracking events, incidents, and changes, and maintain the Service Catalog
  • Enable low-risk, compliant releases with rapid rollback capability to maintain platform reliability
  • Implement automation for risk mitigation strategies to minimize the impact of changes and releases on Toast customers
  • Work closely with leadership, third-party vendors, and relevant stakeholders to drive work to completion

Do you have the right ingredients\*? (Requirements)

  • At least 2 years of industry engineering experience with a focus on SRE
  • Bachelor’s degree in Computer Science, Engineering, or a related field
  • Working knowledge of complex cloud environments (AWS, GCP, Azure, etc.)
  • Experience scripting automation (Python, Go, etc.)
  • Experience with Infrastructure as Code (Terraform, etc.)
  • Experience participating in Incident Response
  • Strong written and verbal communication skills
  • Strong problem-solving skills and the ability to think strategically and analytically
  • Experience working with a diverse global team across multiple regions and time zones
  • Working knowledge of various best practice frameworks, including ITIL, ITSM, Agile/Scrum, change management, etc., is a plus
  • Experience with Incident and Change processes and tools (JIRA, OpsGenie, FireHydrant, DX, etc.) is a plus

\*Bread puns encouraged but not required.

**Diversity, Equity, and Inclusion is Baked into our Recipe for Success**

At Toast, our employees are our secret ingredient—when they thrive, we thrive. The restaurant industry is one of the most diverse, and we embrace that diversity with authenticity, inclusivity, respect, and humility. By embedding these principles into our culture and design, we create equitable opportunities for all and raise the bar in delivering exceptional experiences.

We Thrive Together

We embrace a hybrid work model that fosters in-person collaboration while valuing individual needs. Our goal is to build a strong culture of connection as we work together to empower the restaurant community. To learn more about how we work globally and regionally, check out: https://careers.toasttab.com/locations-toast.

Apply today!

Toast is committed to creating an accessible and inclusive hiring process. As part of this commitment, we strive to provide reasonable accommodations for persons with disabilities to enable them to access the hiring process. If you need an accommodation to access the job application or interview process, please contact candidateaccommodations@toasttab.com.

------

For roles in the United States: It is unlawful in Massachusetts to require or administer a lie detector test as a condition of employment or continued employment. An employer who violates this law shall be subject to criminal penalties and civil liability.
