Staff Software Engineer - Distributed Data Systems

1 Month ago • All levels • Data Analysis

Job Summary

Job Description

As a Staff Software Engineer, you will design, build, and optimize data clusters for scalability, fault tolerance, and high availability. The role focuses on improving the Spark, Hadoop, Kubernetes, Delta Lake, and Druid ecosystems. You will evolve these open-source projects internally and contribute code upstream, and you will educate and grow Adyen’s internal knowledge in these topics, collaborating with platform engineers and users.
Must have:
  • Scaling and tuning large deployments of Spark-on-k8s and Spark-on-Hadoop
  • Hadoop and the HDFS protocol
  • Designing and tuning shuffle-heavy systems on YARN or k8s
  • Experience with lakehouse file formats (Delta, Iceberg, Hudi)
  • Experience with OLAP technologies (ClickHouse, Druid, Pinot, Doris)
  • Open-source contributions in relevant technologies
  • Strong communication skills and ability to work in a team
  • Ability to troubleshoot and resolve issues in production environments
Good to have:
  • Experience with next-generation and multi-modal data formats
  • Building self-service stateful platforms
  • Experience with open-source S3 alternatives
  • Experience with native or accelerated runtimes for Spark

Job Details

This is Adyen

Adyen provides payments, data, and financial products in a single solution for customers like Meta, Uber, H&M, and Microsoft - making us the financial technology platform of choice. At Adyen, everything we do is engineered for ambition. 

For our teams, we create an environment with opportunities for our people to succeed, backed by the culture and support to ensure they are enabled to truly own their careers. We are motivated individuals who tackle unique technical challenges at scale and solve them as a team. Together, we deliver innovative and ethical solutions that help businesses achieve their ambitions faster.

Staff Software Engineer - Distributed Data Systems

Adyen runs a significant footprint of distributed systems, in both scale and variety. This covers distributed compute (Spark, Trino, Flink), distributed databases (Cassandra, Druid), and distributed file/object storage (HDFS, Ceph Object Gateway). These technologies are offered internally as a service to product engineering teams, to support them in building and scaling world-class products.

We’re looking for an expert with deep knowledge of distributed systems, both to improve the operations and scalability of existing offerings and to introduce and mature new ones. The initial focus will be scaling and tuning our Hadoop and Spark infrastructure, in addition to iterating on our OLAP platform, where we currently use Apache Druid. Over time, we expect this expertise to be useful in a wider range of distributed systems internally.

What you’ll do

You’ll be asked both to build new as-a-service offerings and to improve the existing ones. This role is perfect for you if you're passionate about one or more of the following:

  • Design, build, and optimize data clusters to ensure scalability, fault tolerance, and high availability, covering both batch and streaming workloads.
  • Focus on improving the Spark, Hadoop, Kubernetes, Delta Lake, and Druid ecosystems internally.
  • Evolve these open-source projects internally, with the intention of contributing this code upstream.
  • Educate and grow Adyen’s internal knowledge in these topics, covering both peer platform engineers and the platform users.

Who you are

Must have experience in:

  • Scaling and tuning large deployments of Spark-on-k8s and Spark-on-Hadoop
  • Hadoop and the HDFS protocol
  • Designing and tuning shuffle-heavy systems on YARN, or on k8s via remote shuffle services
  • One of the lakehouse file formats (Delta, Iceberg, Hudi)
  • OLAP technologies covering at least one of ClickHouse, Apache Druid, Apache Pinot, or Apache Doris
  • Open-source contributions in one of the must-have technologies, or other common ones (e.g. Kafka, Cassandra, Trino)
  • Team player with strong communication skills
  • Ability to work closely with diverse stakeholders you enable (analysts, data scientists, data engineers, etc.) and depend upon (infrastructure, security, etc).
  • Demonstrated ability to troubleshoot and resolve issues in large-scale, production environments with distributed systems.

Nice-to-have experience in:

  • Next-generation and multi-modal data formats (e.g. LanceDB)
  • Building self-service stateful platforms
  • Open-source S3 alternatives (e.g. Ceph, MinIO)
  • Native or accelerated runtimes for Spark (e.g. Apache DataFusion Comet, Apache Gluten, NVIDIA RAPIDS)

Our Diversity, Equity and Inclusion commitments 

Our unique approach is a product of our diverse perspectives. This diversity of backgrounds and cultures is essential in helping us maintain our momentum. Our business and technical challenges are unique, and we need as many different voices as possible to join us in solving them - voices like yours. No matter who you are or where you’re from, we welcome you to be your true self at Adyen. 

Studies show that women and members of underrepresented communities apply for jobs only if they meet 100% of the qualifications. Does this sound like you? If so, Adyen encourages you to reconsider and apply. We look forward to your application!

What’s next?

Ensuring a smooth and enjoyable candidate experience is critical for us. We aim to get back to you regarding your application within 5 business days. Our interview process tends to take about 4 weeks to complete, but may fluctuate depending on the role. Learn more about our hiring process here. Don’t be afraid to let us know if you need more flexibility.

This role is based out of our Amsterdam office. We are an office-first company and value in-person collaboration; we do not offer remote-only roles.


About The Company

Adyen is a technology company that provides a single platform to accept payments anywhere in the world through any sales channel. Driven by a vision to improve customer experience, streamline processes, and ultimately increase revenue, Adyen enables businesses to process payments across online, mobile, and Point-of-Sale (POS) with over 250 payment methods in 187 transaction currencies. Over 3,500 businesses use the Adyen payment platform, including Facebook, Airbnb, Spotify, Groupon, Evernote, Booking.com, Yelp, Vodafone, Mango, Abercrombie & Fitch, O’Neill, and KLM. Adyen is headquartered in Amsterdam, with offices in San Francisco, São Paulo, Singapore, London, Paris, Berlin, Stockholm, Madrid, and Boston.

