Sr System Reliability Engineer

1 Month ago • 5 Years + • DevOps • $138,900 PA - $195,000 PA

Job Summary

Job Description

The Senior System Reliability Engineer at Disney uses a software engineering approach to architect, design, automate, monitor, and build applications at scale. Responsibilities include operating and engineering software, aligning with business segments to deliver resilient platforms. This role requires expertise in Linux and Windows systems administration, CI/CD platforms (GitLab CI, Jenkins), and cloud automation tools (Boto, CloudFormation, Terraform). The SRE will contribute to new technology development, build and support pipelines, create monitoring telemetry, and reinforce security best practices. Collaboration with development teams, troubleshooting, and stellar customer support are crucial. The role involves project leadership, architectural design, and implementation of solutions and systems, including Level 2 & 3 maintenance and support.
Must have:
  • Expert Linux/Windows admin
  • CI/CD (GitLab CI, Jenkins)
  • Cloud automation (Boto, Terraform)
  • Container computing (Docker, Kubernetes)
  • System troubleshooting
  • Scripting (Bash, Python, Go)
  • Software development (Python, Java)
  • Load balancing expertise
Good to have:
  • Agile/Scrum experience
  • Experience with various cloud platforms (AWS, Azure, GCP)
  • Web technologies (Java, Node.js, Tomcat, Apache)
  • Database expertise (MySQL, PostgreSQL)

Job Details

Job Summary:

Systems Reliability Engineers use a software engineering approach to architect, design, automate, monitor, and build applications at scale. This includes operating and engineering software with close business segment alignment to deliver platforms through efficient, effective and resilient architectures. SREs are talented engineers that are focused on improving quality through a data driven approach: instrumentation, automation, and functional/unit testing.

Responsibilities:

  • The SRE will help create, build and deliver new technologies or platforms.  This will include consultation, designing, building, and supporting development pipelines, automating infrastructure and operations, creating telemetry for monitoring, engineering high reliability and reinforcing best practices to secure our company and guest data.

  • Have expert level systems administration skills on both the Linux and Windows platforms

  • Work with CI/CD platforms (Gitlab CI or Jenkins), strong systems development (Go, Python, Ruby, Node) and cloud automation tools (Boto, CloudFormation, Terraform), source control, cloud hosting, container computing, web technologies

  • Maintain expertise on systems, operational excellence and application stability, security, performance, and capacity management, as well as documentation.

  • Work closely with development teams across Disney to brainstorm, architect, gather requirements, troubleshoot, and provide stellar customer support

  • Be prepared to work in an extremely collaborative and high-energy environment. 

  • Lead project/planning efforts, architectural design, engineering, attending meetings w/ various teams.

  • Implement, integrate and configure solutions, tools, infrastructure and systems.

  • Provide systems administration and application support  – Level 2 & 3 maintenance and support

Basic Qualifications:

  • Understand how to install and configure operating systems, specifically with expertise in Linux and Windows Server.

  • Software Development Continuous Integration (CI) Pipeline knowledge (GitLab CI, Github Actions)

  • Experience with Source Control Management systems (Git)

  • Experience in public and private cloud hosting services (AWS, Google Cloud, Azure, OpenStack, CloudStack) as well as familiarity with container computing (eg. Docker, ECS, Kubernetes, Terraform).

  • Experience as a subject matter expert on at least one OS and proficient in multiple operating systems, including OS performance monitoring, setup, configuration, tuning, and troubleshooting.

  • Proficiency in web or web server technologies:  Java, Node.js, Tomcat, IIS, Apache/nginx, MySQL, PostgreSQL, etc., including being able to perform basic setup, configuration, and troubleshooting.

  • Understanding of internet technologies and network protocols, including HTTP, basic load balancing configurations, security zones, VIPs, SNMP, REST and DNS.

  • Ability to implement existing base standards for new systems and/or applications with mentoring for all of the following:

    • Site monitoring and instrumentation

    • Application monitoring and instrumentation

    • System monitoring and instrumentation

    • Resiliency and performance

  • Able to diagnose simple to complex system problems.

  • Has experience on one or more load balancer platforms (setting up pools, VIPs, layer 7 routing, debugging).

  • Able to author tools and scripts to be used by others to automate repeatable production tasks in standard languages like Bash, Ruby, Python, or Go.

  • Advanced skills in at least one programming language such as Python, PHP, Ruby, Java, Go, Swift or C++ and able to build unit test suites for all software being developed.

  • Experience supporting and/or developing backend tools or services

  • Able to perform and provide in depth analysis on load test runs against a moderately complex system.

  • Demonstrates exceptional troubleshooting methodology, including the ability to author and instruct new methodologies to the SRE team.

  • Independently resolve moderately to highly complex system and application incidents.

  • Able to identify and propose system and application fixes for performance bottlenecks.

  • Able to evaluate new application requirements for capacity and run-time best practices.

  • Able to evaluate new system and/or infrastructure solutions for technical feasibility against known requirements and standards.

  • Effective at dealing with change: Able to transition in role or handle a significant modification to workflow or technology with minimal ramp-up time and with very little guidance.

  • Excellent verbal and written communication to all levels in the organization.

  • Serves as primary point of contact with Manager.

  • Demonstrates curiosity and continuous learning and self-improvement.

  • Ability to lead functional teams in systems integration and design including writing operational specs, architectural diagrams, test plans and requirements management.

  • Effective project management and planning on large-scale projects (familiarity with agile/scrum and water-fall project management a plus).

  • Construction of concise and complete technical documentation and the ability to design and deliver training to other staff

  • Detailed understanding of the goals and requirements of the business supported.

Required Education:

Bachelor of Science degree in computer science or related field or equivalent experience in technical operations and software engineering with 5 years of related work experience.

#DISNEYTECH


The hiring range for this position in California is $138,900 - $186,200 per year and in Washington is $145,400 - $195,000 per year. The base pay actually offered will take into account internal equity and also may vary depending on the candidate’s geographic region, job-related knowledge, skills, and experience among other factors. A bonus and/or long-term incentive units may be provided as part of the compensation package, in addition to the full range of medical, financial, and/or other benefits, dependent on the level and position offered.

Similar Jobs

Nagarro - Principal Engineer, Java Fullstack

Nagarro

India (Remote)
7 Months ago
Palo Alto Networks - Sr Staff DevOps Engineer

Palo Alto Networks

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
2 Months ago
Shyft Labs - Lead Software Engineer

Shyft Labs

Noida, Uttar Pradesh, India (Hybrid)
4 Months ago
Adyen - Engineering Manager - Test Enablement IPP (In-Person Payments)

Adyen

Amsterdam, North Holland, Netherlands (On-Site)
3 Weeks ago
Nagarro - Staff Engineer, Java Fullstack

Nagarro

Mumbai, Maharashtra, India (On-Site)
7 Months ago
SmileGate - Group Purchasing System and Internal Web System Operation (Development)

SmileGate

Seongnam-si, Gyeonggi-do, South Korea (On-Site)
2 Months ago
ZeniMax Media - DevOps Engineer

ZeniMax Media

Austin, Texas, United States (Remote)
2 Months ago
Crunchyroll - Staff Site Reliability Engineer

Crunchyroll

Mexico City, Mexico City, Mexico (On-Site)
6 Months ago
ByteDance - Senior Software Engineer - Compute Infrastructure (Orchestration & Scheduling)

ByteDance

Seattle, Washington, United States (On-Site)
1 Month ago
Trend Micro - (Sr.) Software Engineer in Linux

Trend Micro

Taipei City, Taiwan (On-Site)
8 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

GameJobs - Senior DevSecOps Lead

GameJobs

(Remote)
1 Year ago
MURKA - Java Backend Developer

MURKA

(Remote)
2 Months ago
Turbulent - Senior DevOps Engineer

Turbulent

Montreal, Quebec, Canada (On-Site)
2 Months ago
Nielsen Holdings - Scala Developer

Nielsen Holdings

Bengaluru, Karnataka, India (On-Site)
6 Months ago
Xsolla - Technical Project Manager

Xsolla

Kuala Lumpur, Federal Territory Of Kuala Lumpur, Malaysia (On-Site)
7 Months ago
Nagarro - Associate Staff Engineer, NodeJS

Nagarro

India (Remote)
7 Months ago
Zoic Studios - FX Pipeline Technical Director (TD)

Zoic Studios

(Remote)
3 Weeks ago
DEVOTEAM - Test Automation Lead

DEVOTEAM

Morocco (On-Site)
3 Months ago
NVIDIA - CAD Engineer

NVIDIA

Bengaluru, Karnataka, India (On-Site)
2 Months ago
Nagarro - Associate Principal Engineer, Java

Nagarro

India (Remote)
7 Months ago

Get notifed when new similar jobs are uploaded

Jobs in Orlando, Florida, United States

Palo Alto Networks - Assistant Treasurer

Palo Alto Networks

Santa Clara, California, United States (On-Site)
3 Weeks ago
Inkittt - VP of Operations

Inkittt

San Francisco, California, United States (Hybrid)
5 Months ago
Patel greene - Senior PD&E Planner

Patel greene

Sarasota, Florida, United States (On-Site)
7 Months ago
Click Therapeutics - Business Development Director

Click Therapeutics

New York, New York, United States (Hybrid)
1 Month ago
Google - Group Product Manager, Databases & Analytics, Google Cloud

Google

Kirkland, Washington, United States (On-Site)
1 Month ago
Visa - Director, NA Visa Direct Cross Border Bank Account Manager

Visa

San Francisco, California, United States (Hybrid)
1 Month ago
Studio Wildcard - Senior VFX Artist - Remote or On Site

Studio Wildcard

Redmond, Washington, United States (Hybrid)
8 Months ago
Meet Elise - Strategy & Planning - Manager

Meet Elise

Chicago, Illinois, United States (On-Site)
1 Month ago
Rockstar Games - Director, Security Operations

Rockstar Games

New York, New York, United States (On-Site)
7 Months ago
160over90 - Account Manager, Experiential

160over90

Atlanta, Georgia, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

ByteDance - Site Reliability Engineer - Security Engineering - San Jose

ByteDance

San Jose, California, United States (On-Site)
7 Months ago
Google - Software Developer III, Site Reliability Development

Google

Waterloo, Ontario, Canada (On-Site)
1 Month ago
Google - Customer Engineer, Google Cloud

Google

Taipei City, Taiwan (On-Site)
1 Month ago
The Walt Disney Company - Senior Software Engineer

The Walt Disney Company

England, United Kingdom (On-Site)
1 Month ago
Brillio - Enterprise Architect, AWS - R01535258

Brillio

Bengaluru, Karnataka, India (Hybrid)
7 Months ago
Ubisoft - Monitoring Specialist - Golang Developer

Ubisoft

Saint-Mandé, Île-de-France, France (Hybrid)
3 Months ago
Globalization Partners - Principal Solution Architect

Globalization Partners

United States (Remote)
3 Months ago
Glean - Solutions Architect - ANZ / Singapore region customer hours.

Glean

Bengaluru, Karnataka, India (On-Site)
6 Months ago
Egnyte - Sr DevOps Engineer - Azure

Egnyte

India (Remote)
2 Months ago

Get notifed when new similar jobs are uploaded

About The Company

From classic animated features and exhilarating theme park attractions to cutting edge sports coverage, and the hottest shows on television, The Walt Disney Company has been making magic since 1923, creating unforgettable stories that connect with audiences around the world. And we’re just getting started!

The key to our success…. The Cast, Crew, Imagineers and Employees who honor Disney’s rich legacy by stretching the bounds of imagination to create the never-before-seen, bringing unparalleled entertainment experiences to people of all ages. Begin a career that delivers unparalleled creative content and experiences to audiences around the world and just imagine the stories you could be part of…

What is #LifeAtDisney like? It’s a series of magical moments with cast members and employees developing and telling our stories in the most innovative ways. Whether it’s a day spent as a Disney VoluntEAR, or celebrating the release of a new interactive experience, retail product or movie, our days are filled with the knowledge that we are creating entertainment experiences the whole family can enjoy. Follow @DisneyCareers on Facebook, Twitter and Instagram for a peek behind-the-curtain, and discover how you could connect to a world of stories with Disney!

New York, New York, United States (On-Site)

New York, New York, United States (On-Site)

Burbank, California, United States (On-Site)

Celebration, Florida, United States (On-Site)

Buenos Aires, Buenos Aires, Argentina (On-Site)

Seattle, Washington, United States (On-Site)

Santa Monica, California, United States (On-Site)

Glendale, California, United States (On-Site)

Anaheim, California, United States (On-Site)

View All Jobs

Get notified when new jobs are added by The Walt Disney Company

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug