Senior Software Engineer - Data Infrastructure

12 Minutes ago • 5 Years + • Data Analysis • $180,000 PA - $270,000 PA

Job Summary

Job Description

Plaid is seeking a Senior Software Engineer for Data Infrastructure to scale data systems and ensure correct, complete data. This role involves building data and machine learning infrastructure, providing tooling, and guiding teams to derive insights. The engineer will be a domain expert in Data Warehouse, Data Lakehouse, Spark, Workflow Orchestration, and Streaming technologies, focusing on performance, cost efficiency, and simplifying platform development for other engineers.
Must have:
  • Contribute to the long-term technical roadmap for data-driven and machine learning iteration.
  • Lead key data infrastructure projects, including ML development, offline streaming, ETL pipeline, and data warehouse/lakehouse evolution.
  • Collaborate with stakeholders to define technical roadmaps for backend systems and abstractions.
  • Debug, troubleshoot, and reduce operational burden for the Data Platform.
  • Grow the team through mentorship, leadership, and reviewing technical documents and code.
  • 5+ years of software engineering experience.
  • Extensive hands-on software engineering experience in Data Infrastructure or Platform domain.
  • Deep understanding of ML Infrastructure or Data Infrastructure systems.
  • Strong cross-functional collaboration, communication, and project management skills.
  • Proficiency in coding, testing, and system design.
  • Demonstrated leadership abilities, including mentoring junior engineers.
Good to have:
  • Experience with Databricks
  • Experience with Airflow
  • Experience with AWS EMR
Perks:
  • Medical insurance
  • Dental insurance
  • Vision insurance
  • 401(k)

Job Details

We build simple yet innovative consumer products and developer APIs that shape how everybody interacts with money and the financial system.

Making data driven decisions is key to Plaid's culture. To support that, we need to scale our data systems while maintaining correct and complete data. We provide tooling and guidance to teams across engineering, product, and business and help them explore our data quickly and safely to get the data insights they need, which ultimately helps Plaid serve our customers more effectively. We build the data and machine learning infrastructure to enable Plaid engineers to prototype and iterate on products and features built on top of consumer-permissioned financial data.

Engineers on Data Infrastructure are domain experts in Data Warehouse, Data Lakehouse, Spark, Workflow Orchestration, and Streaming technologies. We scale our existing data pipelines in a performant and cost efficient way while creating the necessary abstractions to make developing on top of this platform extremely simple for other engineers at Plaid.

Responsibilities

  • Contribute towards the long-term technical roadmap for data-driven and machine learning iteration at Plaid
  • Leading key data infrastructure projects such as improving ML development golden paths, implementing offline streaming solutions for data freshness, building net new ETL pipeline infrastructure, and evolving data warehouse or data lakehouse capabilities.
  • Working with stakeholders in other teams and functions to define technical roadmaps for key backend systems and abstractions across Plaid.
  • Debugging, troubleshooting, and reducing operational burden for our Data Platform.
  • Growing the team via mentorship and leadership, reviewing technical documents and code changes.

Qualifications

  • 5+ years of software engineering experience
  • Extensive hands-on software engineering experience, with a strong track record of delivering successful projects within the Data Infrastructure or Platform domain at similar or larger companies.
  • Deep understanding of one of: ML Infrastructure systems, including Feature Stores, Training Infrastructure, Serving Infrastructure, and Model Monitoring OR Data Infrastructure systems, including Data Warehouses, Data Lakehouses, Apache Spark, Streaming Infrastructure, Workflow Orchestration.
  • Strong cross-functional collaboration, communication, and project management skills, with proven ability to coordinate effectively.
  • Proficiency in coding, testing, and system design, ensuring reliable and scalable solutions.
  • Demonstrated leadership abilities, including experience mentoring and guiding junior engineers.
  • [Nice to have] Experience with Databricks, Airflow, AWS EMR

The target base salary for this position ranges from $180,000/year to $270,000/year in Zone 1. The target base salary will vary based on the job's location.

Our geographic zones are as follows:

Zone 1 - New York City and San Francisco Bay Area

Zone 2 - Los Angeles, Seattle, Washington D.C.

Zone 3 - Austin, Boston, Denver, Houston, Portland, Sacramento, San Diego

Zone 4 - Raleigh-Durham and all other US cities

Additional compensation in the form(s) of equity and/or commission are dependent on the position offered. Plaid provides a comprehensive benefit plan, including medical, dental, vision, and 401(k). Pay is based on factors such as (but not limited to) scope and responsibilities of the position, candidate's work experience and skillset, and location. Pay and benefits are subject to change at any time, consistent with the terms of any applicable compensation or benefit plans.

Similar Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Similar Skill Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Jobs in San Francisco, California, United States of America

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Data Analysis Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

About The Company

Raleigh, North Carolina, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

New York, New York, United States (On-Site)

New York, New York, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

San Francisco, California, United States (On-Site)

New York, New York, United States (On-Site)

View All Jobs

Get notified when new jobs are added by Plaid

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug
Contact Us
hello@outscal.com
Made in INDIA 💛💙