DISCO CAST Data Engineer

1 Month ago • 2 Years +

About the job

✦ AI Job Summary

Must have: Data Engineering, ETL Pipelines, SQL Experience, Scripting Languages
Good to have: Linux/Unix, CLI Tools, Typescript/Java, DBT/Stitch

Description

  • 2+ years of data engineering experience
  • Experience with data modeling, warehousing and building ETL pipelines
  • Experience with one or more query language (e.g., SQL, PL/SQL, DDL, MDX, HiveQL, SparkSQL, Scala)
  • Experience with one or more scripting language (e.g., Python, KornShell)
  • Experience in troubleshooting the issues related to data and infrastructure issues.
  • A passion for problem solving
  • Excessive curiosity
  • Exceptional ability to learn quickly and independently
Amazon Music is awash in data! To help make sense of it all, the CAST team within DISCO (Data, Insights, Science & Optimization) team accelerates and facilitates content analytics and provides independence to generate valuable insights in a fast, agile, and accurate way. This domain provides analytical support for the below topics within Amazon Music: Programming / Label Relations / PR / Stations / Livesports / Originals / Case & CAM.

The CAST team enables repeatable, easy, in depth analysis of music customer behaviors. Our goal is to empower all teams at Amazon Music to make data driven decisions and effectively measure their results by providing high quality, high availability data, and democratized data access through self-service tools.

If you love the challenges that come with big data and devops, then this role is for you. We collect billions of events a day, manage petabyte scale data on Redshift and S3, and develop data pipelines using Spark/Scala EMR, SQL based ETL, Airflow and Java services.

We are looking for a talented, curious, enthusiastic, and detail-oriented Data Engineer, who knows how to take on big data challenges in an agile way. Duties include big data design and analysis, data modeling, and development, deployment, and operations of big data pipelines. You'll help build Amazon Music's most important data pipelines and data sets, and expand self-service data knowledge and capabilities through an Amazon Music data university.

The CAST team develops data specifically for a set of key business domains like personalization, playlist programming, merch, artists, and provides and protects a robust self-service core data experience for all internal customers. We deal in AWS technologies like Redshift, S3, EMR, EC2, DynamoDB, Kinesis Firehose, and Lambda.

Key job responsibilities
  • Writing scripts, building microservices in AWS to increase team efficiency
  • Building and managing ETL pipelines in AWS to ingest data from external vendors
  • Assist the BIs on the team in managing our existing environment that consists of Redshift and SQL based pipelines. The activities around these systems will largely be well-defined via standard operation procedures (SOP) and typically involve approving data access requests, subscribing or adding new data to the environment, but there will be cases where creative problem-solving is required
  • SQL data pipeline management (creating or updating existing pipelines) and maintenance tasks on the Redshift cluster
  • Assist the team with the management of our next-generation AWS infrastructure. Tasks includes infrastructure monitoring via CloudWatch alarms, infrastructure maintenance through code changes or enhancements, and troubleshooting/root cause analysis infrastructure issues that arise, and in some cases this resource may also be asked to submit code changes based on infrastructure issues that arise.
About the team
Amazon Music is an immersive audio entertainment service that deepens connections between fans, artists, and creators.From personalized music playlists to exclusive podcasts, concert livestreams to artist merch, we are innovating at some of the most exciting intersections of music and culture.We offer experiences that serve all listeners with our different tiers of service: Prime members get access to all music in shuffle mode,and top ad-free podcasts, included with their membership; customers can upgrade to Music Unlimited for unlimited on-demand access to 100 million songs including millions in HD, Ultra HD, spatial audio and anyone can listen for free by downloading Amazon Music app or via Alexa-enabled devices.Join us for opportunity to influence how Amazon Music engages fans, artists, and creators on a global scale.

  • Experience in Linux/Unix scripting a big plus (e.g., bash scripts)
  • Experience writing CLI tools/utilities in any language
  • Experience with Typescript/Javascript and Java
  • Experience with tools such as DBT, Stitch, Glue, Airflow
  • Knowledge of distributed systems as it pertains to data storage and computing
  • Experience in administering reporting/analytics platforms

About The Company

Amazon is guided by four principles: customer obsession rather than competitor focus, passion for invention, commitment to operational excellence, and long-term thinking. We are driven by the excitement of building technologies, inventing products, and providing services that change lives. We embrace new ways of doing things, make decisions quickly, and are not afraid to fail. We have the scope and capabilities of a large company, and the spirit and heart of a small one.


Together, Amazonians research and develop new technologies from Amazon Web Services to Alexa on behalf of our customers: shoppers, sellers, content creators, and developers around the world.


Our mission is to be Earth's most customer-centric company. Our actions, goals, projects, programs, and inventions begin and end with the customer top of mind.


You'll also hear us say that at Amazon, it's always "Day 1."​ What do we mean? That our approach remains the same as it was on Amazon's very first day - to make smart, fast decisions, stay nimble, invent, and focus on delighting our customers.

Karnataka, India (On-Site)
4 Weeks ago

Maharashtra, India (On-Site)
4 Weeks ago

Tamil Nadu, India (On-Site)
2 Months ago

Similar Jobs

Casumo - Data Engineer

Casumo

Zagreb Hub (On-Site)
1 Day ago
Yggdrasil Gaming Ltd - Data Engineer (preferably B2B Agreement)

Yggdrasil Gaming Ltd

Kraków, Lesser Poland Voivodeship, Poland (On-Site)
1 Day ago
Unity - Senior Data Engineer

Unity

Tel Aviv District, Tel Aviv-Yafo, Israel (On-Site)
2 Days ago
Electronic Arts - Data Engineer - Frostbite Analytics

Electronic Arts

Vancouver, British Columbia, Canada (On-Site)
2 Days ago
Omeda Studios - Data Engineer

Omeda Studios

Europe (Remote)
3 Days ago
Skillz - Lead Data Engineer

Skillz

Las Vegas, Nevada, United States (On-Site)
1 Week ago
Electronic Arts - Data Engineer

Electronic Arts

Hyderabad, Telangana, India (On-Site)
1 Week ago
vi - Data Engineer

vi

Tel Aviv District, Tel Aviv-Yafo, Israel (On-Site)
1 Week ago
Matific - Lead Data Engineer

Matific

Colombo, Western Province, Sri Lanka (On-Site)
1 Week ago
Moon Active - Big Data Engineer

Moon Active

Tel Aviv District, Tel Aviv-Yafo, Israel (On-Site)
1 Week ago

Game Development Courses

Learn the foundations of Game Development and create your very own video game.

Programming MCQs

Check out our comprehensive collection of programming multiple choice questions (MCQs) curated for both aspiring and experienced game developers. Enhance your skills and knowledge with our targeted, expert-level questions.

Try out our Online Compilers

Write, run, compile, and debug your code efficiently with our user-friendly online compilers. Accessible from anywhere, our compliers simplify your coding experience.