Data Engineer - Data Engineering

2 Months ago • 2 Years + • Data Analysis • $163,200 PA - $223,200 PA

Job Summary

Job Description

The Data Engineering team's primary objective for 2024-25 is to establish robust golden data sets to support the company's goal of creating more insight-driven products. Plaid emphasizes data-driven decision-making and needs to scale its data systems while ensuring data accuracy and completeness. The team provides tools and guidance to engineering, product, and business teams, enabling them to explore data efficiently and safely. Data Engineers utilize SQL and Python for building data workflows, employing tools like DBT, Airflow, Redshift, ElasticSearch, Atlanta, and Retool for orchestrating data pipelines. They collaborate with various departments to develop Plaid's data strategy and foster a data-first mindset, operating within an IC-driven engineering culture that encourages bottom-up ideation and team empowerment. The role is designed for engineers motivated by creating consumer and customer impact, team growth, shipping MVPs, and continuous improvement.
Must have:
  • 2+ years of data engineering experience
  • Solve complex data pipeline issues at scale
  • Build data models and pipelines on large datasets
  • Proficient in SQL and modern orchestration tools like DBT, Mode, and Airflow
  • Understand Plaid product and strategy
  • Prioritize data quality and performance
  • Advocate for industry tools and practices
  • Own core SQL and Python data pipelines
  • Deliver well-documented data with defined quality, uptime, and usefulness
Good to have:
  • Experience with performant warehouses and data lakes (Redshift, Snowflake, Databricks)
  • Experience building/maintaining batch and real-time pipelines (Spark, Kafka)
Perks:
  • Equity and/or commission
  • Comprehensive benefit plan (medical, dental, vision, 401(k))

Job Details

The main goal of the DE team in 2024-25 is to build robust golden data sets to power our business goal of creating more insights-based products. Making data-driven decisions is key to Plaid's culture. To support that, we need to scale our data systems while maintaining correct and complete data. We provide tooling and guidance to teams across engineering, product, and business and help them explore our data quickly and safely to get the insights they need, which ultimately helps Plaid serve our customers more effectively. Data Engineers heavily leverage SQL and Python to build data workflows. We use tools like DBT, Airflow, Redshift, ElasticSearch, Atlanta, and Retool to orchestrate data pipelines and define workflows. We work with engineers, product managers, business intelligence, data analysts, and many other teams to build Plaid's data strategy and a data-first mindset. Our engineering culture is IC-driven -- we favor bottom-up ideation and empowerment of our incredibly talented team. We are looking for engineers who are motivated by creating impact for our consumers and customers, growing together as a team, shipping the MVP, and leaving things better than we found them.

You will be in a high-impact role that directly enables business leaders to make faster and more informed business judgements based on the datasets you build. You will have the opportunity to carve out the ownership and scope of internal datasets and visualizations across Plaid, a currently unowned area that we intend to take over and build SLAs on. You will have the opportunity to learn best practices and up-level your technical skills from our strong DE team and from the broader Data Platform team. You will collaborate and build strong cross-functional partnerships with all teams at Plaid, from Engineering to Product to Marketing and Finance.
Responsibilities:
  • Understanding different aspects of the Plaid product and strategy to inform golden dataset choices, design, and data usage principles
  • Keeping data quality and performance top of mind while designing datasets
  • Advocating for adopting industry tools and practices at the right time
  • Owning core SQL and Python data pipelines that power our data lake and data warehouse
  • Delivering well-documented data with defined dataset quality, uptime, and usefulness
Qualifications:
  • You have 2+ years of dedicated data engineering experience, solving complex data pipeline issues at scale
  • You have experience building data models and data pipelines on top of large datasets (on the order of 500TB to petabytes)
  • You value SQL as a flexible and extensible tool and are comfortable with modern SQL data orchestration tools like DBT, Mode, and Airflow
  • [Nice to have] You have experience working with different performant warehouses and data lakes, such as Redshift, Snowflake, and Databricks
  • [Nice to have] You have experience building and maintaining batch and real-time pipelines using technologies like Spark and Kafka
The target base salary for this position ranges from $163,200/year to $223,200/year in Zone 1. The target base salary will vary based on the job's location. 

Our geographic zones are as follows:
Zone 1 - New York City and San Francisco Bay Area 
Zone 2 - Los Angeles, Seattle, Washington D.C.
Zone 3 - Austin, Boston, Denver, Houston, Portland, Sacramento, San Diego
Zone 4 - Raleigh-Durham and all other US cities

Additional compensation in the form(s) of equity and/or commission is dependent on the position offered. Plaid provides a comprehensive benefit plan, including medical, dental, vision, and 401(k). Pay is based on factors such as (but not limited to) the scope and responsibilities of the position, the candidate's work experience and skillset, and location. Pay and benefits are subject to change at any time, consistent with the terms of any applicable compensation or benefit plans.
