Principal Data Platform Engineer
Blinkhealth
Job Summary
Blink Health is seeking a Principal Data Platform Engineer to define and evolve its real-time and batch data platform built on AWS and Databricks. This role involves owning the technical vision for data ingestion, processing, storage, and serving as trusted datasets, metrics, and APIs for products and decisioning systems. The Principal Engineer will be deeply hands-on, setting architectural direction across streaming systems and lakehouse design, partnering with engineering, analytics, and product teams to simplify the platform and establish scalable, reliable foundations for real-time analytics.
Must Have
- Own the end-to-end data platform architecture, spanning streaming ingestion, lakehouse storage, and data/insight serving layers
- Architect real-time streaming systems using AWS Kinesis and Spark Structured Streaming
- Design stream-to-lakehouse convergence patterns
- Build and evolve data, metrics, and feature APIs
- Establish canonical event schemas and data contracts
- Make deep technical decisions across AWS infrastructure (Kinesis, S3, IAM, networking) and Databricks internals (clusters, jobs, Delta Lake, performance tuning)
- Drive platform modernization, retiring legacy tools and patterns
- Set standards for high-performance SQL and Spark workloads
- Lead complex platform initiatives from architecture through production delivery and ongoing reliability
- Provide technical leadership and mentorship, shaping best practices for platform design, data quality, and operability
- Deep expertise in real-time and distributed data systems, including AWS Kinesis and Spark Structured Streaming
- Strong command of Databricks on AWS (Delta Lake, clusters, jobs) and core AWS services (S3, IAM, networking)
- Proven experience designing data-serving architectures and APIs for analytics, metrics, and feature consumption
- Advanced Python and SQL skills for building scalable, high-performance data pipelines
- Demonstrated ability to design idempotent, replayable, and observable data platforms at scale
Perks & Benefits
- Opportunity to deeply impact customers in healthcare
- Work for a fast-growing healthcare company
- Drive impact across millions of new patients
- Relentlessly learning, constantly curious, and aggressively collaborative cross-functional team environment
Job Description
About the Role
We are seeking a Principal Data Platform Engineer to define and evolve our real-time and batch data platform built on AWS and Databricks. This role owns the technical vision for how data is ingested, processed, stored, and served as trusted datasets, metrics, and APIs that power products, decisioning systems, and operational workflows.
As a Principal, you are a technical authority and force multiplier—deeply hands-on while setting architectural direction across streaming systems, lakehouse design, and data-serving layers. You will partner closely with engineering, analytics, and product teams to simplify the platform, eliminate legacy patterns, and establish scalable, reliable foundations for real-time analytics.
What You Will Do
- Own the end-to-end data platform architecture, spanning streaming ingestion, lakehouse storage, and data/insight serving layers
- Architect real-time streaming systems using AWS Kinesis and Spark Structured Streaming to support low-latency use cases
- Design stream-to-lakehouse convergence patterns that unify real-time and historical data with strong correctness guarantees
- Build and evolve data, metrics, and feature APIs that expose curated datasets for downstream applications and analytics
- Establish canonical event schemas and data contracts to support event-driven and API-based consumption
- Make deep technical decisions across AWS infrastructure (Kinesis, S3, IAM, networking) and Databricks internals (clusters, jobs, Delta Lake, performance tuning)
- Drive platform modernization, retiring legacy tools and patterns in favor of simpler, lakehouse-first designs
- Set standards for high-performance SQL and Spark workloads, optimizing for cost, latency, and scale
- Lead complex platform initiatives from architecture through production delivery and ongoing reliability
- Provide technical leadership and mentorship, shaping best practices for platform design, data quality, and operability
Technical Skills
- Deep expertise in real-time and distributed data systems, including AWS Kinesis and Spark Structured Streaming
- Strong command of Databricks on AWS (Delta Lake, clusters, jobs) and core AWS services (S3, IAM, networking)
- Proven experience designing data-serving architectures and APIs for analytics, metrics, and feature consumption
- Advanced Python and SQL skills for building scalable, high-performance data pipelines
- Demonstrated ability to design idempotent, replayable, and observable data platforms at scale
Why Join Us:
It is rare to have a company that both deeply impacts its customers and is able to provide its services across a massive population. At Blink, we have a huge impact on people when they are most vulnerable: at the intersection of their healthcare and finances. We are also the fastest growing healthcare company in the country and are driving that impact across millions of new patients every year. Our business model not only helps people, but drives economics that allow us to build a generational company. We are a relentlessly learning, constantly curious, and aggressively collaborative cross-functional team dedicated to inventing new ways to improve the lives of our customers.
We are an equal opportunity employer and value diversity of all kinds. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.