Senior Site Reliability Engineer

1 Month ago • 4-8 Years • DevOps

About the job

Job Description

We are hiring for a Senior Site Reliability Engineer within the Reliability Tooling team, you will be responsible for writing and reviewing code, contributing to technical decisions, and mentoring engineers in your squad. We are looking for someone who will be part of an engaging, dynamic and inclusive engineering organisation, grounded in scrum and agile practices, CI/CD, great collaboration and motivated by a commitment to continuous learning and improvement. You will be part of a team that is customer satisfaction focused and will be working on reliability solutions that enable development teams to achieve their service level objectives, by continuous measurement and improvement of reliability signals. As a Senior engineer, we are looked at by our fellow team members as a ‘go to’ individual; you are someone who has a clear understanding of, and can thoroughly elaborate on SRE principles and best practices to a given audience. To be successful in this role you will continuously uphold and improve all the relevant reliability aspects for our services, with an increased focus on SLIs and SLOs, while raising the reliability of a variety of large scale user facing and internal services. Disney Entertainment & ESPN Technology teams are located in New York, San Francisco, Seattle, Bristol US, Manchester UK, Amsterdam, remotely and more!
Must have:
  • proven experience in SRE, DevOps, technical operations, systems engineering, software engineering
  • Passionate and curious about ways to leverage technology while continually learning
  • Skilled in Cloud/PaaS/SaaS Environments (e.g. AWS, Azure, Google Cloud Compute)
  • Efficiently skilled with the use of containers in enterprise production environments (e.g. Docker, Kubernetes, LXC, AWS ECS and EKS)
  • Proficient in one or more of the following languages (Python, Go, Rust, or similar)
Good to have:
  • Comfortable in one or more of the following languages (Python, Java, Scala, Go, Rust, or similar)
  • User Interfaces development experience
  • Proficient, collaborative, & experienced in building reliable, scalable, enterprise systems
  • Ability to identify root-cause sources of instability in a high-traffic, large-scale distributed systems
  • Experience in designing, building, and operating large-scale production systems
  • Configuration management and orchestration (e.g. Terraform, Cloud Formation, Ansible)
  • Experience with continuous integration tools (e.g. Jenkins, Gitlab CI/CD, AWS CodeBuild/Deploy/Pipeline, Azure DevOps, Spinnaker)
  • Knowledge of best practices and IT operations in an always-up, always-available service
  • Experience in SDLC, distributed systems, networking, logistics and operations or capacity planning
Perks:
  • 25 days annual leave
  • Private medical insurance & dental care
  • Free Park Entry
  • Disney Discounts
  • Excellent parental and guardian leave
  • Employee Resource Groups – WOMEN @ Disney, Disney DIVERSITY, Disney PRIDE, ENABLED, and our Mental Health & Wellbeing Group, TRUST

Job Summary:

We are hiring for a Senior Site Reliability Engineer within the Reliability Tooling team, you will be responsible for writing and reviewing code, contributing to technical decisions, and mentoring engineers in your squad. We are looking for someone who will be part of an engaging, dynamic and inclusive engineering organisation, grounded in scrum and agile practices, CI/CD, great collaboration and motivated by a commitment to continuous learning and improvement.

You will be part of a team that is customer satisfaction focused and will be working on reliability solutions that enable development teams to achieve their service level objectives, by continuous measurement and improvement of reliability signals.

As a Senior engineer, we are looked at by our fellow team members as a ‘go to’ individual; you are someone who has a clear understanding of, and can thoroughly elaborate on SRE principles and best practices to a given audience. To be successful in this role you will continuously uphold and improve all the relevant reliability aspects for our services, with an increased focus on SLIs and SLOs, while raising the reliability of a variety of large scale user facing and internal services.

Disney Entertainment & ESPN Technology teams are located in New York, San Francisco, Seattle, Bristol US, Manchester UK, Amsterdam, remotely and more!

What You Will Do

  • Build tools to help your SRE team quickly pinpoint, isolate and resolve issues related to infrastructure, platform services and applications;
  • Use Chaos Engineering principles and methodologies to test what you build under real-world conditions;
  • Deploy and manage innovative modern cloud technologies using infrastructure-as-code, self-healing, and security automation patterns;
  • Develop useful telemetry, alerts, and response to reduce Mean Time To Repair (MTTR);
  • Collaborate and provide technical excellence within and across teams;
  • Consult on standard methodologies and develop tools to enable smooth adoptions of good service reliability practices and methods, e.g. promote sustainable incident response and blameless postmortems
  • Identify areas of improvement in reliability, efficiency, and operations;
  • Write code that improves scalability, performance, maintainability, and security;
  • Mentor SREs in technical and non-technical SRE responsibilities;

What To Bring

  • proven experience in SRE, DevOps, technical operations, systems engineering, software engineering
  • Passionate and curious about ways to leverage technology while continually learning
  • Skilled in Cloud/PaaS/SaaS Environments (e.g. AWS, Azure, Google Cloud Compute)
  • Efficiently skilled with the use of containers in enterprise production environments (e.g. Docker, Kubernetes, LXC, AWS ECS and EKS)
  • Proficient in one or more of the following languages (Python, Go, Rust, or similar)

Preferred Experience

  • Comfortable in one or more of the following languages (Python, Java, Scala, Go, Rust, or similar)
  • User Interfaces development experience
  • Proficient, collaborative, & experienced in building reliable, scalable, enterprise systems
  • Ability to identify root-cause sources of instability in a high-traffic, large-scale distributed systems
  • Experience in designing, building, and operating large-scale production systems
  • Configuration management and orchestration (e.g. Terraform, Cloud Formation, Ansible)
  • Experience with continuous integration tools (e.g. Jenkins, Gitlab CI/CD, AWS CodeBuild/Deploy/Pipeline, Azure DevOps, Spinnaker)
  • Knowledge of best practices and IT operations in an always-up, always-available service;
  • Experience in SDLC, distributed systems, networking, logistics and operations or capacity planning;

The Perks

  • 25 days annual leave.
  • Private medical insurance & dental care.
  • Free Park Entry: You will have the opportunity to enter any of our parks with your family and friends for free.
  • Disney Discounts: you are entitled to discounts on designated Disney products, resort F&B and ticketing.
  • Excellent parental and guardian leave.
  • Employee Resource Groups – WOMEN @ Disney, Disney DIVERSITY, Disney PRIDE, ENABLED, and our Mental Health & Wellbeing Group, TRUST.

The Walt Disney Company Limited is an equal opportunity employer. Applicants will receive consideration for employment without regard to age, race, colour, religion or belief, sex, nationality, ethnic or national origin, sexual orientation, gender reassignment, marital or civil partner status, disability or pregnancy or maternity. Disney fosters a business culture where ideas and decisions from all people help us grow, innovate, create the best stories and be relevant in a rapidly changing world.

View Full Job Description

Add your resume

80%

Upload your resume, increase your shortlisting chances by 80%

About The Company

From classic animated features and exhilarating theme park attractions to cutting edge sports coverage, and the hottest shows on television, The Walt Disney Company has been making magic since 1923, creating unforgettable stories that connect with audiences around the world. And we’re just getting started!

The key to our success…. The Cast, Crew, Imagineers and Employees who honor Disney’s rich legacy by stretching the bounds of imagination to create the never-before-seen, bringing unparalleled entertainment experiences to people of all ages. Begin a career that delivers unparalleled creative content and experiences to audiences around the world and just imagine the stories you could be part of…

What is #LifeAtDisney like? It’s a series of magical moments with cast members and employees developing and telling our stories in the most innovative ways. Whether it’s a day spent as a Disney VoluntEAR, or celebrating the release of a new interactive experience, retail product or movie, our days are filled with the knowledge that we are creating entertainment experiences the whole family can enjoy. Follow @DisneyCareers on Facebook, Twitter and Instagram for a peek behind-the-curtain, and discover how you could connect to a world of stories with Disney!

Burbank, California, United States (On-Site)

Île-de-France, France (On-Site)

London, England, United Kingdom (Hybrid)

Celebration, Florida, United States (On-Site)

Burbank, California, United States (On-Site)

New York, New York, United States (On-Site)

San Antonio, Texas, United States (Remote)

Glendale, California, United States (On-Site)

New York, New York, United States (On-Site)

View All Jobs

Get notified when new jobs are added by The Walt Disney Company

Similar Jobs

dmg - Senior Technical Program Manager

dmg, United States (On-Site)

Reuters News Agency - Cloud Engineer

Reuters News Agency, India (Hybrid)

Social Discovery Group - IT asset and supplier manager (Customer IT Infrastructure)

Social Discovery Group, Georgia (Remote)

Notion - Software Engineer, Connections

Notion, India (On-Site)

Rooter.gg - Software Development Engineer - Backend

Rooter.gg, India (On-Site)

Consilio LLC - Infrastructure Site Reliability Engineer

Consilio LLC, India (On-Site)

zones - Azure Backend Developer

zones, India (On-Site)

Rackspace - Senior AWS DevOps Engineer

Rackspace, Poland (Remote)

GoTo Group - Site Reliability Engineer - EP (SE4)

GoTo Group, India (On-Site)

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

USE Insider - Security Engineer - Red Team

USE Insider, Türkiye (Remote)

Lulalend - Senior Analytics Engineer

Lulalend, South Africa (On-Site)

Travelex - Lead Data Scientist

Travelex, India (Hybrid)

edgemony - Cloud Systems Engineer - DOCEBO

edgemony, Italy (Remote)

Demonware - Expert Software Engineer (Privacy Data)

Demonware, Canada (On-Site)

The Walt Disney Company - Sr ML Engineer

The Walt Disney Company, United States (On-Site)

The Walt Disney Company - Senior Software Engineer

The Walt Disney Company, United States (On-Site)

Zuora - Senior ML Engineer

Zuora, India (Hybrid)

Get notifed when new similar jobs are uploaded

Jobs in England, United Kingdom

Rolls Royce - Security Infrastructure Lead

Rolls Royce, United Kingdom (On-Site)

forescout - Inside Sales Account Manager - (French Speaker)

forescout, United Kingdom (On-Site)

Granicus - Sales Engineer

Granicus, United Kingdom (Remote)

Assystems - Senior Planner

Assystems, United Kingdom (On-Site)

Cloud Imperium Games - Senior Producer

Cloud Imperium Games, United Kingdom (On-Site)

Rank group - Food & Beverage Team Leader

Rank group, United Kingdom (On-Site)

Alpha Sense - Client Solutions Specialist

Alpha Sense, United Kingdom (On-Site)

Salesforce - Account Executive - Nonprofit, Public Sector UK

Salesforce, United Kingdom (On-Site)

Mastercard - PI&R-PRSS

Mastercard, United Kingdom (On-Site)

Lighthouse Games - Senior Producer

Lighthouse Games, United Kingdom (Hybrid)

Get notifed when new similar jobs are uploaded

DevOps Jobs

PublicisGroupe - Senior Associate Technology L1_Net-Web

PublicisGroupe, India (On-Site)

Whoop - Senior Software Engineer (DevOps)

Whoop, United States (On-Site)

ARHS - Data Manager

ARHS, Sweden (On-Site)

Clarivate - Lead Infrastructure Engineer

Clarivate, India (Hybrid)

Arrow Electronics - Senior Cloud AD Engineer

Arrow Electronics, India (On-Site)

Nasdaq - DevOps Engineer

Nasdaq, Canada (On-Site)

Playtech - Dev Ops Engineer

Playtech, United Kingdom (On-Site)

Get notifed when new similar jobs are uploaded