Staff Site Reliability Engineer - Data Engineering, Platform

6 Months ago • 12 Years + • DevOps • $191,000 PA - $239,000 PA

Job Summary

Job Description

As a Staff Site Reliability Engineer, you'll maintain and enhance the reliability of Crunchyroll's data infrastructure. This involves standardizing monitoring and alerting across datastores, implementing Infrastructure as Code (IaC), automating key processes, and ensuring 100% automation. You'll collaborate with data and software engineers, develop incident response processes, optimize data governance, and support the overall health and efficiency of data systems. The role requires expertise in AWS, monitoring tools, programming languages (Python, Java), automation frameworks (Terraform, Cloud Formation), database systems (SQL, NoSQL), CI/CD, and DataOps practices. You'll directly impact the availability and performance of data services, enabling better organizational decisions.
Must have:
  • 12+ years SRE/Database experience
  • AWS expertise
  • Monitoring tool proficiency
  • Automation framework skills (Terraform)
  • Database system knowledge (SQL, NoSQL)
  • CI/CD and DataOps experience
  • Python/Java proficiency
Good to have:
  • Data governance experience
  • Compliance and lifecycle management expertise
Perks:
  • Competitive salary & bonus
  • Flexible PTO
  • Comprehensive benefits (medical, dental, vision)
  • 401k matching
  • Commuter benefits
  • Parental support program
  • Pet insurance

Job Details

About Crunchyroll

WE HELP EVERYONE BELONG. IT’S OUR PURPOSE.

Founded by fans, Crunchyroll delivers the art and culture of anime to a passionate community. We super-serve over 100 million anime and manga fans across 200+ countries and territories, and help them connect with the stories and characters they crave. Whether that experience is online or in-person, streaming video, theatrical, games, merchandise, events and more, it’s powered by the anime content we all love.

Join our team, and help us shape the future of anime!

Who We Are

We're a cast of characters working to shine a spotlight on anime. Crunchyroll is an international business focused on creating both online and offline experiences for fans through content (licensed, co-produced, originals, distribution), merchandise, events, gaming, news, and more. Visit our About Us pages for more information about our collection of brands.

About the Team

The Site Reliability Engineering (SRE) team is dedicated to ensuring the reliability, scalability, and performance of our data infrastructure. We focus on standardizing and implementing monitoring and alerting across all datastores to track key metrics like errors, latency, and throughput, and to ensure critical systems are covered. Our team also leads efforts to keep databases up-to-date, implements Infrastructure as Code (IaC) for high availability and performance, and automates key processes to enhance operational efficiency. 

We lead and evangelize the principle of 100% automation. Additionally, we define and document operational requirements, develop incident response processes, and automate monitoring and compliance checks to maintain a secure and reliable data environment. By continuously improving load testing and optimizing data governance practices, we support the overall health and efficiency of our data systems.

About the Role

Crunchyroll is growing and changing, presenting unique challenges and opportunities to support millions of anime fans around the world. The Data Engineering team provides seamless help to our internal stakeholders, ensuring an exceptional experience for all Crunchyroll fans.

As a Staff Site Reliability Engineer for the Data Engineering team, you will be responsible for maintaining and enhancing the reliability of our data infrastructure. Your work will directly impact the availability and performance of our data services, enabling the organization to better decisions. You will collaborate closely with data engineers, and software engineers to develop and drive 100% automation, best practices for deep monitoring and alerting. This role will report to our Director of Data Engineering. While it is preferred for this role to sit in one of our offices, fully remote is also an option in the United States.

About You

  • Bachelor's degree in Computer Science, Information Technology, or a related field.
  • 12+ years of experience in site reliability engineering, database operations, or a related role with a focus on data platforms, data stores, data operations.
  • Extensive experience with AWS cloud platform and their data-related services.
  • Proficiency in monitoring tools (e.g., Datadog, CloudWatch, DevOps Guru, DB Performance Insights).
  • Proficiency in one or more programming languages (e.g.  Python, Java)
  • Proficiency in automation frameworks (e.g., Terraform, Cloud Formation).
  • Strong understanding of various performance metrics both at a high level and at a low level like Disk/IO saturation.
  • Experience in identifying and eliminating the bottlenecks in the system.
  • Strong understanding of database internals like types of indexes, schemas, query plans.
  • Strong understanding of database systems (e.g., SQL, NoSQL) and experience in managing large-scale data infrastructures.
  • Strong understanding and hands-on implementation of CI/CD pipelines and DataOps practices.
  • Experience with data governance, compliance, and lifecycle management.
  • Ability to own and execute projects while effectively collaborating with the team to influence and shape the vision of the data engineering organization.

Why you will love working at Crunchyroll

Not only will you get to work with fun, passionate and inspired colleagues, you will also...

  • Receive a great compensation package including salary plus performance bonus earning potential, paid annually.
  • Enjoy flexible PTO and time off policies allowing you to take the time you need to be your whole self.
  • Appreciate the generous medical, dental, vision, STD, LTD, and life insurance options for you and your family.
  • Take advantage of our health saving account HSA program plus health care and dependent care FSA programs.
  • Love that we offer an employer match on our 401(k) plan.
  • Receive employer paid commuter benefit (for eligible employees)
  • Appreciate the generous support program for new parents
  • Obtain pet insurance and some of our offices are pet friendly! 

#LifeAtCrunchyroll #LI-Remote

The Pay Range for this position is listed. Actual pay will vary based on factors including, but not limited to location, experience, and performance. The range listed is just one component of Crunchyroll’s Total Rewards offerings for employees. Other rewards may include performance bonuses, employer matched retirement savings, time-off programs, and progressive health benefits and perks.
Pay Transparency - San Francisco, CA
$191,000$239,000 USD

About our Values

We want to be everything for someone rather than something for everyone and we do this by living and modeling our values in all that we do. We value

  • Courage. We believe that when we overcome fear, we enable our best selves.

  • Curiosity. We are curious, which is the gateway to empathy, inclusion, and understanding.

  • Service. We serve our community with humility, enabling joy and belonging for others.

  • Kaizen. We have a growth mindset committed to constant forward progress.

Our commitment to diversity and inclusion

Our mission of helping people belong reflects our commitment to diversity & inclusion. It's just the way we do business.

We are an equal opportunity employer and value diversity at Crunchyroll. Pursuant to applicable law, we do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.

Crunchyroll, LLC is an independently operated joint venture between US-based Sony Pictures Entertainment, and Japan's Aniplex, a subsidiary of Sony Music Entertainment (Japan) Inc., both subsidiaries of Tokyo-based Sony Group Corporation.

Questions about Crunchyroll’s hiring process? Please check out our Hiring FAQs: https://help.crunchyroll.com/hc/en-us/articles/360040471712-Crunchyroll-Hiring-FAQs

Please refer to our Candidate Privacy Policy for more information about how we process your personal information, and your data protection rights: https://tbcdn.talentbrew.com/company/22978/v1_0/docs/spe-jobs-privacy-policy-update-for-crpa-dec-21-22.pdf

Please beware of recent scams to online job seekers. Those applying to our job openings will only be contacted directly from @crunchyroll.com email account.

Similar Jobs

Philips - Cloud Developer

Philips

Bengaluru, Karnataka, India (On-Site)
3 Weeks ago
Blockville Digital Assets - AI Technology Specialist for Game Development

Blockville Digital Assets

İstanbul, Türkiye (On-Site)
10 Months ago
buildstsaff - Java Developer

buildstsaff

Alexandria, Virginia, United States (On-Site)
6 Years ago
Qualcomm - Software Security Engineer

Qualcomm

Farnborough, England, United Kingdom (On-Site)
2 Weeks ago
Qualcomm - Automotive ADAS System Test and Integration Engineer Sr.

Qualcomm

Bengaluru, Karnataka, India (On-Site)
2 Weeks ago
bytedance - Senior Software Engineer - Compute Infrastructure (Orchestration & Scheduling)

bytedance

Seattle, Washington, United States (On-Site)
1 Month ago
Saama Technologies,  Inc  - Senior Site Reliability Engineer

Saama Technologies, Inc

Chennai, Tamil Nadu, India (On-Site)
7 Months ago
bytedance - Software Engineer, Cloud Infrastructure

bytedance

San Jose, California, United States (On-Site)
7 Months ago
Epic Games - Senior DevOps Programmer

Epic Games

Canada (On-Site)
1 Month ago
Axon - Senior Privacy Engineer

Axon

Scottsdale, Arizona, United States (Hybrid)
6 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Meta - Research Scientist Intern, Smart Glasses in Wearables AI (PhD)

Meta

Menlo Park, California, United States (On-Site)
6 Months ago
Wrike - Staff Backend Engineer

Wrike

Bengaluru, Karnataka, India (Hybrid)
1 Month ago
Temporal Technologies - Senior Product Manager, SDK & Developer Primitives

Temporal Technologies

(Remote)
2 Weeks ago
tectic studios - Lead Gameplay Programmer

tectic studios

Canada (Remote)
1 Month ago
Nasdaq - Senior DevOps Engineer

Nasdaq

Vilnius, Vilnius County, Lithuania (On-Site)
3 Weeks ago
Zscaler - Staff Application Security Engineer

Zscaler

Bengaluru, Karnataka, India (Hybrid)
2 Weeks ago
bytedance - Backend Software Engineer Graduate (Global E-commerce-US) - 2025 Start (BS/MS)

bytedance

Seattle, Washington, United States (On-Site)
7 Months ago
bytedance - Site Reliability Engineer, Traffic Infrastructure

bytedance

Singapore (On-Site)
7 Months ago
PhonePe - Firmware Engineer

PhonePe

Bengaluru, Karnataka, India (On-Site)
5 Days ago
bytedance - Tech Lead Manager, Infrastructure Platform

bytedance

San Jose, California, United States (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Jobs in San Francisco, California, United States

Gupta - Media Analyst

Gupta

New York, New York, United States (On-Site)
1 Month ago
Crunchyroll - Senior Product Manager - Access and Identity Management

Crunchyroll

Los Angeles, California, United States (On-Site)
2 Months ago
Guardian - RN Clinical Consultant

Guardian

United States (Remote)
1 Month ago
Scale AI - Senior Technical Writer

Scale AI

San Francisco, California, United States (On-Site)
5 Days ago
Go guardian - Commercial Counsel

Go guardian

United States (Remote)
1 Month ago
Google - Senior Staff Software Engineer, Infrastructure, Google Cloud AI

Google

Sunnyvale, California, United States (On-Site)
1 Month ago
Zscaler - Director, Designated Support

Zscaler

Plano, Texas, United States (Hybrid)
1 Week ago
Riot Games - Senior Manager, Game Product Management - Unpublished R&D Product

Riot Games

Los Angeles, California, United States (On-Site)
1 Month ago
lifechruh - Senior Staff Software Engineer

lifechruh

Edmond, Oklahoma, United States (On-Site)
1 Month ago
DataVisor - Sr. Customer Success Manager - Fraud/AML Strategy

DataVisor

New York, United States (Remote)
1 Month ago

Get notifed when new similar jobs are uploaded

DevOps Jobs

The Walt Disney Company - Manager, Database Reliability Engineering

The Walt Disney Company

California, United States (On-Site)
1 Month ago
miniclip - Senior Cloud Engineer

miniclip

Lisbon, Lisbon, Portugal (Hybrid)
1 Month ago
Milestone - Senior Software Engineer

Milestone

Portland, Oregon, United States (Remote)
2 Months ago
Luxoft - Senior Software Support Engineer

Luxoft

Zlínský Kraj, Czechia (Remote)
6 Months ago
Google - Software Engineer III, Site Reliability Engineering

Google

Warsaw, Masovian Voivodeship, Poland (On-Site)
1 Month ago
The Walt Disney Company - Lead Software Engineer - Big Data Infrastructure

The Walt Disney Company

California, United States (On-Site)
2 Months ago
Rackspace Technology - Cloud Practice Engineer III

Rackspace Technology

Jalisco, Mexico (Remote)
1 Month ago
Rackspace Technology - Senior Machine Learning Engineer

Rackspace Technology

Vietnam (Remote)
3 Months ago
PwC - IN-Associate_ Azure DevOps Engineer_OneCloud_Advisory_Bangalore

PwC

Bengaluru, Karnataka, India (On-Site)
6 Months ago
Epic Games - Senior DevOps Programmer

Epic Games

Montreal, Quebec, Canada (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Los Angeles, California, United States (Hybrid)

Dallas, Texas, United States (Hybrid)

Los Angeles, California, United States (Hybrid)

Los Angeles, California, United States (Hybrid)

Dallas, Texas, United States (Hybrid)

San Francisco, California, United States (Hybrid)

Los Angeles, California, United States (Hybrid)

Dallas, Texas, United States (Hybrid)

Dallas, Texas, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by Crunchyroll

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug