Staff Site Reliability Engineer

7 Months ago • 12 Years + • Devops

Job Summary

Job Description

As a Staff Site Reliability Engineer at Crunchyroll, you'll be responsible for maintaining and enhancing the reliability of the data infrastructure, directly impacting the availability and performance of data services. You'll collaborate with data engineers and software engineers to develop and drive 100% automation, implement best practices for monitoring and alerting, and ensure the smooth operation of large-scale data infrastructures. Responsibilities include standardizing monitoring, implementing IaC, automating key processes, defining operational requirements, developing incident response processes, and optimizing data governance practices. You'll work to eliminate system bottlenecks and ensure the availability and performance of Crunchyroll's data services.
Must have:
  • 12+ years SRE/database experience
  • AWS expertise
  • Proficiency in monitoring tools
  • Programming (Python, Java)
  • Automation frameworks (Terraform)
  • Database internals understanding
  • CI/CD and DataOps experience
  • Data governance & compliance

Job Details

About Crunchyroll

WE HELP EVERYONE BELONG. IT’S OUR PURPOSE.

Founded by fans, Crunchyroll delivers the art and culture of anime to a passionate community. We super-serve over 100 million anime and manga fans across 200+ countries and territories, and help them connect with the stories and characters they crave. Whether that experience is online or in-person, streaming video, theatrical, games, merchandise, events and more, it’s powered by the anime content we all love.

Join our team, and help us shape the future of anime!

Who We Are

We're a cast of characters working to shine a spotlight on anime. Crunchyroll is an international business focused on creating both online and offline experiences for fans through content (licensed, co-produced, originals, distribution), merchandise, events, gaming, news, and more. Visit our About Us pages for more information about our collection of brands.

About the Team

The Site Reliability Engineering (SRE) team is dedicated to ensuring the reliability, scalability, and performance of our data infrastructure. We focus on standardizing and implementing monitoring and alerting across all datastores to track key metrics like errors, latency, and throughput, and to ensure critical systems are covered. Our team also leads efforts to keep databases up-to-date, implements Infrastructure as Code (IaC) for high availability and performance, and automates key processes to enhance operational efficiency. 

We lead and evangelize the principle of 100% automation. Additionally, we define and document operational requirements, develop incident response processes, and automate monitoring and compliance checks to maintain a secure and reliable data environment. By continuously improving load testing and optimizing data governance practices, we support the overall health and efficiency of our data systems.

About the Role

Crunchyroll is growing and changing, presenting unique challenges and opportunities to support millions of anime fans around the world. The Data Engineering team provides seamless help to our internal stakeholders, ensuring an exceptional experience for all Crunchyroll fans.

As a Staff Site Reliability Engineer for the Data Engineering team, you will be responsible for maintaining and enhancing the reliability of our data infrastructure. Your work will directly impact the availability and performance of our data services, enabling the organization to better decisions. You will collaborate closely with data engineers, and software engineers to develop and drive 100% automation, best practices for deep monitoring and alerting. This role will report to our Director of Data Engineering and will be based out of our Mexico City office. 

About You

  • Bachelor's degree in Computer Science, Information Technology, or a related field.
  • 12+ years of experience in site reliability engineering, database operations, or a related role with a focus on data platforms, data stores, data operations.
  • Extensive experience with AWS cloud platform and their data-related services.
  • Proficiency in monitoring tools (e.g., Datadog, CloudWatch, DevOps Guru, DB Performance Insights).
  • Proficiency in one or more programming languages (e.g.  Python, Java)
  • Proficiency in automation frameworks (e.g., Terraform, Cloud Formation).
  • Strong understanding of various performance metrics both at a high level and at a low level like Disk/IO saturation.
  • Experience in identifying and eliminating the bottlenecks in the system.
  • Strong understanding of database internals like types of indexes, schemas, query plans.
  • Strong understanding of database systems (e.g., SQL, NoSQL) and experience in managing large-scale data infrastructures.
  • Strong understanding and hands-on implementation of CI/CD pipelines and DataOps practices.
  • Experience with data governance, compliance, and lifecycle management.
  • Ability to own and execute projects while effectively collaborating with the team to influence and shape the vision of the data engineering organization.

#LifeAtCrunchyroll #LI-Hybrid

About our Values

We want to be everything for someone rather than something for everyone and we do this by living and modeling our values in all that we do. We value

  • Courage. We believe that when we overcome fear, we enable our best selves.

  • Curiosity. We are curious, which is the gateway to empathy, inclusion, and understanding.

  • Service. We serve our community with humility, enabling joy and belonging for others.

  • Kaizen. We have a growth mindset committed to constant forward progress.

Our commitment to diversity and inclusion

Our mission of helping people belong reflects our commitment to diversity & inclusion. It's just the way we do business.

We are an equal opportunity employer and value diversity at Crunchyroll. Pursuant to applicable law, we do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Crunchyroll, LLC is an independently operated joint venture between US-based Sony Pictures Entertainment, and Japan's Aniplex, a subsidiary of Sony Music Entertainment (Japan) Inc., both subsidiaries of Tokyo-based Sony Group Corporation.

Questions about Crunchyroll’s hiring process? Please check out our Hiring FAQs: https://help.crunchyroll.com/hc/en-us/articles/360040471712-Crunchyroll-Hiring-FAQs

Please refer to our Candidate Privacy Policy for more information about how we process your personal information, and your data protection rights: https://tbcdn.talentbrew.com/company/22978/v1_0/docs/spe-jobs-privacy-policy-update-for-crpa-dec-21-22.pdf

Please beware of recent scams to online job seekers. Those applying to our job openings will only be contacted directly from @crunchyroll.com email account.

Similar Jobs

zoox - Senior/Staff System Engineer - Fail Operational

zoox

Foster City, California, United States (Hybrid)
1 Week ago
MiQ - Senior Manager, Events

MiQ

New York, United States (On-Site)
1 Week ago
Intel  - Power and Performance Lead-Client Silicon

Intel

Folsom, California, United States (Hybrid)
1 Month ago
NVIDIA - Senior GPU Kernel Performance Lead

NVIDIA

Canada (On-Site)
4 Months ago
Aptive - Machine Learning Algorithm Engineer ADAS & AD

Aptive

Kraków, Lesser Poland Voivodeship, Poland (Hybrid)
1 Month ago
CyberArk - Senior Backend Software Engineer, Golang, Cloud Native

CyberArk

Santa Clara, California, United States (Hybrid)
1 Month ago
Wind River - Senior Engineer - Cloud

Wind River

Bengaluru, Karnataka, India (On-Site)
1 Week ago
NVIDIA - Senior Software Architect, Accelerated Computing SDN

NVIDIA

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
4 Months ago
Google - Senior Software Engineer, Infrastructure, Google Cloud AI

Google

Sunnyvale, California, United States (On-Site)
2 Months ago
Ion - Cloud Engineer Kubernetes

Ion

Collecchio, Emilia-Romagna, Italy (Hybrid)
8 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Imagine.io - Head - Customer Success

Imagine.io

Austin, Texas, United States (Hybrid)
3 Months ago
Apple - Cellular Systems Performance Analysis Engineer

Apple

San Diego, California, United States (On-Site)
4 Weeks ago
Kyruus Health - Senior Talent Acquisition Partner

Kyruus Health

United States (Remote)
2 Weeks ago
IGG - Regional Operations Manager

IGG

Vancouver, British Columbia, Canada (On-Site)
1 Month ago
Lionbridge Games - Business Development Director, Games

Lionbridge Games

United States (On-Site)
4 Months ago
bytedance - Senior Machine Learning Ops Engineer, ML System - Foundation Model

bytedance

San Jose, California, United States (On-Site)
5 Months ago
Tesla - Associate Operations Manager, Electrode, Battery Cell

Tesla

Brandenburg, Germany (On-Site)
4 Months ago
Whalar - Summer Intern, Client Services (Gaming)

Whalar

New York, United States (On-Site)
3 Weeks ago
Informa Group - Sales Manager - Jewellery (Exhibition)

Informa Group

Bangkok, Thailand (On-Site)
1 Week ago
London stock Exchange - Senior Lead QA Engineer

London stock Exchange

Hyderabad, Telangana, India (On-Site)
3 Weeks ago

Get notifed when new similar jobs are uploaded

Jobs in Mexico City, Mexico City, Mexico

Valeo - Maintenance Technician

Valeo

San Luis Potosi, Mexico (On-Site)
4 Weeks ago
nubank - AML Ops Analyst

nubank

Mexico City, Mexico (On-Site)
1 Month ago
Univision - Accounting Analyst Human Resources

Univision

Mexico (On-Site)
1 Day ago
nissan - Quality Inspector

nissan

Aguascalientes, Aguascalientes, Mexico (On-Site)
2 Weeks ago
Crunchyroll - Software Engineer III, Game Consoles

Crunchyroll

Mexico City, Mexico City, Mexico (Hybrid)
4 Months ago
Netflix - Channel Partner Manager LATAM

Netflix

Mexico City, Mexico City, Mexico (On-Site)
2 Months ago
bytedance - Content Operations Manager (MX) - Vertical & Commercial

bytedance

Mexico City, Mexico City, Mexico (On-Site)
2 Months ago
Fictiv - Manufacturing Data Analysis Intern

Fictiv

Monterrey, Nuevo Leon, Mexico (On-Site)
1 Month ago
FICO - Platform Success Senior Associate Partner

FICO

Mexico City, Mexico (On-Site)
1 Week ago
LTI Mindtree - Senior Oracle Finance Fusion Functional Consultant

LTI Mindtree

Mexico City, Mexico (On-Site)
3 Weeks ago

Get notifed when new similar jobs are uploaded

Devops Jobs

NXP - Senior Principal Software Architect - Platform and RF Software

NXP

Bucharest, Bucharest, Romania (On-Site)
9 Months ago
CD PROJEKT RED - Senior DevOps Engineer

CD PROJEKT RED

Warsaw, Masovian Voivodeship, Poland (On-Site)
1 Month ago
broadcom - Sr. Software engineer in VMware Cloud Foundation (VCF)

broadcom

Sofia, Sofia City Province, Bulgaria (On-Site)
1 Month ago
Nasdaq - Cloud Developer

Nasdaq

St. John's, Newfoundland And Labrador, Canada (Hybrid)
1 Month ago
upwork - Senior Database Automation Engineer (APAC)

upwork

(Remote)
2 Months ago
PhonePe - Site Reliability Engineer 2 - Database

PhonePe

Bengaluru, Karnataka, India (On-Site)
1 Month ago
2K - Build Systems Engineer

2K

Austin, Texas, United States (On-Site)
6 Days ago
luxsoft - Senior DevOps Consultant

luxsoft

Toronto, Ontario, Canada (On-Site)
3 Months ago
Scale AI - AI Infrastructure Engineer, Model Serving Platform

Scale AI

San Francisco, California, United States (On-Site)
2 Months ago
Rackspace Technology - Senior Azure Engineer

Rackspace Technology

Bengaluru, Karnataka, India (Remote)
3 Weeks ago

Get notifed when new similar jobs are uploaded

About The Company

Los Angeles, California, United States (Hybrid)

Dallas, Texas, United States (Hybrid)

Los Angeles, California, United States (Hybrid)

Los Angeles, California, United States (Hybrid)

Dallas, Texas, United States (Hybrid)

San Francisco, California, United States (Hybrid)

Los Angeles, California, United States (Hybrid)

Dallas, Texas, United States (Hybrid)

Dallas, Texas, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by Crunchyroll

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug