Sr. System Reliability Engineer

18 Minutes ago • 5 Years + • DevOps • $138,900 PA - $186,200 PA

Job Summary

Job Description

The Sr. System Reliability Engineer at Disney will play a crucial role in the IAM SRE team, focusing on elevating SRE practices, onboarding new technologies, and integrating next-generation digital platforms. Responsibilities include collaborating with engineering and production teams, designing and building reliable systems, automating infrastructure and operations, creating monitoring telemetry, and ensuring data security. The role requires strong software development skills (Python, Go, Java, etc.), experience with CI/CD pipelines, cloud hosting (AWS, GCP, Azure), containerization, and DevOps culture. The engineer will also be involved in troubleshooting, providing customer support, and contributing to a high-accountability environment.
Must have:
  • Linux/Windows System Admin
  • Software Development (Python, Go)
  • CI/CD (Jenkins), Git
  • Cloud Hosting (AWS, GCP, Azure)
  • Container Computing (Docker)
  • DevOps Culture
  • Problem-solving & Troubleshooting
Good to have:
  • Experience with Load Balancers
  • Experience in Agile/Scrum
  • Public Key Cryptography (X.509)
  • Web Technologies (Java, Node.js, etc.)
Perks:
  • Bonus
  • Long-term Incentives
  • Full Benefits Package

Job Details

Job Summary:

At Disney, we‘re storytellers. We make the impossible possible. We do this through utilizing and developing cutting-edge technology and pushing the envelope to bring stories to life through our movies, products, interactive games, parks and resorts, and media networks. Now is your chance to join our talented team that delivers unparalleled creative content to audiences around the world.

The Systems Reliability Engineering (SRE) team helps elevate SRE practices at TWDC, promoting and onboarding new technologies, solving complex problems and integrating with next generation digital platforms. Systems Reliability Engineers use a software engineering approach to architect, design, automate, monitor, and build applications at scale. This includes operating and engineering software with close business segment alignment to deliver platforms through efficient, effective and resilient architectures. SREs are talented engineers that are focused on improving quality through a data driven approach: instrumentation, automation, and functional/unit testing.

This position is for a systems reliability engineering (SRE) eager to play an integral role on the IAM SRE engineering team for The Walt Disney Company to help elevate SRE practices, onboard new technologies, solve complex problems and integrate next generation digital platforms.  

As a Disney SRE, you will help create, build and deliver amazing experiences for our guests, fans and businesses. Primary responsibilities include helping existing, new and emerging business teams onboard technologies or platforms to accelerate their businesses.  This will include consultation, designing, building, and supporting development pipelines, automating infrastructure and operations, creating telemetry for monitoring, engineering high reliability and reinforcing best practices to secure our company and guest data.

You will be expected to have some systems administration skills in Linux and Windows platforms, and must have experience with software development (e.g. Python, Go, Java, Node), CI Pipeline tools (e.g. Jenkins), Git source management, cloud hosting (AWS, GCP & Azure), container computing (e.g. Docker, OCI), web technologies and the DevOps team culture. You will work with engineering, creative and production teams in an extremely collaborative and high-energy environment to brainstorm, architect, gather requirements, troubleshoot, and provide stellar customer support.  You are passionate about constantly learning, applying technology to solve complex problems, and is a highly motivated, optimistic, proactive, creative thought leader and project manager. 

As an SRE, you will:

Translate ideas into tangible products that shape experiences by focusing on a systematic approach to automation, resiliency, efficiency, stability, security, performance, and capacity management, as well as documentation and serve as a subject matter expert through internal and external tech talks and conferences.

Make an impact on a transformative team and culture by designing, building, and supporting systems for a large-scale enterprise production environment that hosts a variety of digital workloads and experiences for The Walt Disney Company.
 
Collaborate and serve as a thought partner to work with various Engineering and Production teams to gather requirements, troubleshoot issues, apply a scientific approach to continuous improvement, challenge the status quo, promote a high accountability trust culture and provide stellar customer support. 
 
Support initial discovery, architecture, design, automation, implementation and operationalization, including:

  • Business Engagement and Requirements Gathering

  • Architectural Review, Proof of Concept Work, and Onboarding

  • Project: Build and Operationalize New Systems/Sites/Services/Products

  • Systematic Load Testing, Troubleshooting, Optimization and Tuning

  • Create System and Application Monitors, Trending Metrics and Reports

  • Development: Tools and Automation Frameworks

  • Hosting Platforms and Infrastructure Design and Support

  • Documentation: Creation of Application Infrastructure Design documents, Operational Runbooks, and Knowledge Base Articles

  • Bachelors degree in Computer Science or related field with a minimum of 5 years of related work experience.

Technical Requirements

  • Understand how to install and configure operating systems, specifically with expertise in Linux and Windows Server.

  • Software Development Continuous Integration (CI) Pipeline knowledge (Jenkins)

  • Experience with Source Control Management systems (Git)

  • Experience in public and private cloud hosting services (AWS, Google Cloud, Azure, OpenStack, CloudStack) as well as familiarity with container computing (eg. Docker, Mesos, ECS/Kubernetes, Terraform).

  • Recognized as a subject matter expert on at least one OS and proficient in multiple operating systems, including OS performance monitoring, setup, configuration, tuning, and troubleshooting.

  • Proficient in web or webserver technologies:  Java, Node.js, Tomcat, IIS, Apache/nginx, MySQL, PostgreSQL, etc., including being able to perform basic setup, configuration, and troubleshooting.

  • Understand internet technologies and network protocols, including HTTP, basic load balancing configurations, security zones, VIPs, SNMP, REST and DNS.

  • Proficient in SSL/TLS certificate management and public key cryptography technology, specifically X.509 used for HTTPS.

  • Able to implement existing base standards for new systems and/or applications with mentoring for all of the following:

    • Site monitoring and instrumentation

    • Application monitoring and instrumentation

    • System monitoring and instrumentation

    • Resiliency and performance

  • Able to diagnose simple to complex system problems.

  • Has experience on one or more load balancer platforms (setting up pools, VIPs, layer 7 routing, debugging).

  • Able to author tools and scripts to be used by others to automate repeatable production tasks in standard languages like Bash, Ruby, Python, or Go.

  • Advanced skills in at least one programming language such as Python, PHP, Ruby, Java, Go, Swift or C++ and able to build unit test suites for all software being developed.

  • Experience supporting and/or developing backend tools or services

  • Able to perform and provide in depth analysis on load test runs against a moderately complex system.

  • Demonstrates exceptional troubleshooting methodology, including the ability to author and instruct new methodologies to the SE team.

  • Independently resolve moderately to highly complex system and application incidents.

  • Able to identify and propose system and application fixes for performance bottlenecks.

  • Able to evaluate new application requirements for capacity and run-time best practices.

  • Able to evaluate new system and/or infrastructure solutions for technical feasibility against known requirements and standards.

  • Effective at dealing with change: Able to transition in role or handle a significant modification to workflow or technology with minimal ramp-up time and with very little guidance.

Communication and Leadership Requirements

  • Excellent verbal and written communication to all levels in the organization.

  • Serves as primary point of contact with Manager.

  • Demonstrates curiosity and continuous learning and self-improvement.

  • Ability to lead functional teams in systems integration and design including writing operational specs, architectural diagrams, test plans and requirements management.

  • Communication of ideas and solutions in a clear and organized manner.

  • Clear and effective presentations to groups of people.

  • Effective project management and planning on large-scale projects (familiarity with agile/scrum and water-fall project management a plus).

  • Ability to design and deliver training to other staff.

  • Construction of concise and complete technical documentation.

  • Mentoring of Jr. Staff on technical material.

  • Viewed as a reliable technical resource for others.

  • Detailed understanding of the goals and requirements of the business supported.


The hiring range for this position in Glendale, California is $138,900 to $186,200 per year. The base pay actually offered will take into account internal equity and also may vary depending on the candidate’s geographic region, job-related knowledge, skills, and experience among other factors. A bonus and/or long-term incentive units may be provided as part of the compensation package, in addition to the full range of medical, financial, and/or other benefits, dependent on the level and position offered.

Similar Jobs

Devrev - Finance Manager

Devrev

Chennai, Tamil Nadu, India (On-Site)
2 Months ago
The Walt Disney Company - Manager, Product Integrity

The Walt Disney Company

Minato City, Tokyo, Japan (On-Site)
3 Weeks ago
NVIDIA - Senior Photonic Layout Design Engineer

NVIDIA

Yokne'am Illit, North District, Israel (On-Site)
1 Month ago
Keywords Studios (Player Support) - Product Planner

Keywords Studios (Player Support)

(Remote)
5 Days ago
CCP Games - Infrastructure Engineer

CCP Games

Reykjavík, Reykjavíkurborg, Iceland (On-Site)
3 Weeks ago
PlayStation Global - Info Sys Engineer 3

PlayStation Global

Bellevue, Washington, United States (On-Site)
5 Months ago
Ajmera Infotech - Senior DevOps Engineer - AWS

Ajmera Infotech

Austin, Texas, United States (On-Site)
3 Months ago
Rackspace Technology - DevOps Engineer (AWS Terraform)

Rackspace Technology

India (Remote)
4 Days ago
Hitachi - CE Developers-Jul-2024

Hitachi

Bengaluru, Karnataka, India (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

GIANTS Software - Game Tester

GIANTS Software

Brno, South Moravian Region, Czechia (On-Site)
2 Weeks ago
Tesla - Field Service Technician, Industrial Storage / Supercharging

Tesla

Oslo, Oslo, Norway (On-Site)
1 Week ago
Sony Pictures Entertainment - Sr. Counsel (Project Hire)

Sony Pictures Entertainment

Culver City, California, United States (On-Site)
4 Months ago
NVIDIA - AI Computing Software Development Engineer, TensorRT

NVIDIA

Taipei City, Taiwan (On-Site)
1 Month ago
The Walt Disney Company - Manager-Software Engineering

The Walt Disney Company

Lake Buena Vista, Florida, United States (Hybrid)
1 Week ago
The Walt Disney Company - Software Engineer II

The Walt Disney Company

Morrisville, North Carolina, United States (On-Site)
2 Days ago
SEGA US - First Party & Events Coordinator

SEGA US

Irvine, California, United States (Hybrid)
2 Months ago
AriensCo - Product Manager-Outdoor Power Equipment

AriensCo

Etmadpur, Uttar Pradesh, India (On-Site)
3 Months ago
NVIDIA - GPU ASIC Design Engineer

NVIDIA

Bengaluru, Karnataka, India (On-Site)
1 Week ago
Lucky VR - Technical Animator

Lucky VR

Canada (Remote)
3 Weeks ago

Get notifed when new similar jobs are uploaded

Jobs in Burbank, California, United States

ByteDance - Machine Learning Engineer - Machine Learning Infrastructure

ByteDance

San Jose, California, United States (On-Site)
3 Months ago
ION - Junior Sales and Account Manager - 7990

ION

Jersey City, New Jersey, United States (On-Site)
4 Months ago
Axon - Customer Service Representative (Onsite)

Axon

Scottsdale, Arizona, United States (On-Site)
3 Days ago
Fluence - Project Controller - Energy Storage

Fluence

Houston, Texas, United States (On-Site)
3 Months ago
Penumbra - Life Sciences Counsel

Penumbra

Alameda, California, United States (On-Site)
2 Months ago
Meta - Production Engineer

Meta

New York, New York, United States (Remote)
3 Months ago
Rivos - Silicon Logic Formal Verification - Full Time

Rivos

Portland, Oregon, United States (Hybrid)
4 Months ago
Sphere Entertainment Co - Senior Director Pipeline Engineering

Sphere Entertainment Co

Burbank, California, United States (On-Site)
2 Weeks ago
Google - Senior Software Engineer, Machine Learning, Google Cloud Compute

Google

Sunnyvale, California, United States (On-Site)
3 Months ago
Company3 Method Studios - Facility Technician (7:00am - 3:30pm PT)

Company3 Method Studios

Hollywood, Florida, United States (On-Site)
4 Months ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

Saviynt - Associate Principal Engineer/Senior Engineer - IGA

Saviynt

Bengaluru, Karnataka, India (Hybrid)
4 Months ago
PhonePe - Site Reliability Engineer - Azure

PhonePe

Bengaluru, Karnataka, India (On-Site)
3 Months ago
Sinch - Data Platform Engineer

Sinch

Stockholm, Stockholm County, Sweden (Hybrid)
4 Months ago
Microsoft - Digital Technology Specialists App Innovation ( Spanish Speaker)

Microsoft

Dublin, County Dublin, Ireland (Hybrid)
1 Month ago
Nielsen Holdings - Sr. Data Engineer - (Big Data, Spark, Scala, Python, AWS, RDBMS, SQL)

Nielsen Holdings

Gurugram, Haryana, India (Hybrid)
4 Months ago
SparkCognition - Senior DevOps Engineer

SparkCognition

Bengaluru, Karnataka, India (On-Site)
5 Months ago
PwC - Cloud & IT Transformation Senior Associates

PwC

Makati, Metro Manila, Philippines (On-Site)
4 Months ago
PwC - Power Platform Developer Associate

PwC

Rome, Lazio, Italy (On-Site)
1 Month ago
Rockstar Games - Linux Systems Engineer

Rockstar Games

Dundee, Scotland, United Kingdom (On-Site)
2 Months ago
Nagarro - Staff Engineer

Nagarro

Portugal (Remote)
4 Months ago

Get notifed when new similar jobs are uploaded

About The Company

From classic animated features and exhilarating theme park attractions to cutting edge sports coverage, and the hottest shows on television, The Walt Disney Company has been making magic since 1923, creating unforgettable stories that connect with audiences around the world. And we’re just getting started!

The key to our success…. The Cast, Crew, Imagineers and Employees who honor Disney’s rich legacy by stretching the bounds of imagination to create the never-before-seen, bringing unparalleled entertainment experiences to people of all ages. Begin a career that delivers unparalleled creative content and experiences to audiences around the world and just imagine the stories you could be part of…

What is #LifeAtDisney like? It’s a series of magical moments with cast members and employees developing and telling our stories in the most innovative ways. Whether it’s a day spent as a Disney VoluntEAR, or celebrating the release of a new interactive experience, retail product or movie, our days are filled with the knowledge that we are creating entertainment experiences the whole family can enjoy. Follow @DisneyCareers on Facebook, Twitter and Instagram for a peek behind-the-curtain, and discover how you could connect to a world of stories with Disney!

California, United States (On-Site)

Île-de-France, France (On-Site)

Kissimmee, Florida, United States (On-Site)

New Jersey, United States (On-Site)

Orlando, Florida, United States (On-Site)

Anaheim, California, United States (On-Site)

Glendale, California, United States (On-Site)

Celebration, Florida, United States (On-Site)

Washington, District Of Columbia, United States (Hybrid)

Glendale, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by The Walt Disney Company

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug