Data Engineer - Data Engineering

2 Years + • Data Analysis • $163,200 PA - $223,200 PA

Job Details

We build simple yet innovative consumer products and developer APIs that shape how everybody interacts with money and the financial system.

The main goal of the DE team in 2024-25 is to build robust golden datasets to power our business goal of creating more insights-based products. Making data-driven decisions is key to Plaid's culture. To support that, we need to scale our data systems while maintaining correct and complete data. We provide tooling and guidance to teams across engineering, product, and business, and help them explore our data quickly and safely to get the insights they need, which ultimately helps Plaid serve our customers more effectively. Data Engineers rely heavily on SQL and Python to build data workflows. We use tools like DBT, Airflow, Redshift, ElasticSearch, Atlanta, and Retool to orchestrate data pipelines and define workflows. We work with engineers, product managers, business intelligence, data analysts, and many other teams to build Plaid's data strategy and a data-first mindset. Our engineering culture is IC-driven: we favor bottom-up ideation and empowerment of our incredibly talented team. We are looking for engineers who are motivated by creating impact for our consumers and customers, growing together as a team, shipping the MVP, and leaving things better than we found them.

You will be in a high-impact role that directly enables business leaders to make faster and more informed business judgements based on the datasets you build. You will have the opportunity to carve out the ownership and scope of internal datasets and visualizations across Plaid, a currently unowned area that we intend to take over and build SLAs on. You will also learn best practices from our strong DE team and the broader Data Platform team and up-level your technical skills. You will build strong cross-functional partnerships with all teams at Plaid, from Engineering to Product to Marketing and Finance.

Responsibilities

  • Understanding different aspects of the Plaid product and strategy to inform golden dataset choices, design, and data usage principles.
  • Keeping data quality and performance top of mind while designing datasets.
  • Advocating for adopting industry tools and practices at the right time.
  • Owning core SQL and Python data pipelines that power our data lake and data warehouse.
  • Delivering well-documented data with defined dataset quality, uptime, and usefulness.

Qualifications

  • You have 2+ years of dedicated data engineering experience, solving complex data pipeline issues at scale.
  • You have experience building data models and data pipelines on top of large datasets (on the order of 500TB to petabytes).
  • You value SQL as a flexible and extensible tool and are comfortable with modern SQL data orchestration tools like DBT, Mode, and Airflow.
  • [Nice to have] You have experience working with different performant warehouses and data lakes, such as Redshift, Snowflake, and Databricks.
  • [Nice to have] You have experience building and maintaining batch and real-time pipelines using technologies like Spark and Kafka.

The target base salary for this position ranges from $163,200/year to $223,200/year in Zone 1. The target base salary will vary based on the job's location.

Our geographic zones are as follows:

Zone 1 - New York City and San Francisco Bay Area

Zone 2 - Los Angeles, Seattle, Washington D.C.

Zone 3 - Austin, Boston, Denver, Houston, Portland, Sacramento, San Diego

Zone 4 - Raleigh-Durham and all other US cities

Additional compensation in the form of equity and/or commission depends on the position offered. Plaid provides a comprehensive benefit plan, including medical, dental, vision, and 401(k). Pay is based on factors such as (but not limited to) the scope and responsibilities of the position, the candidate's work experience and skillset, and location. Pay and benefits are subject to change at any time, consistent with the terms of any applicable compensation or benefit plans.
