Senior Data Engineer

1 Month ago • 7 Years + • Data Analysis • $100,000 PA - $120,000 PA

Job Summary

Job Description

You will join the team behind an internal AI platform for processing and interacting with unstructured data. The team is currently over 30 people strong and is organized into agile teams, each of which is self-sufficient and handles the creation of features from the idea stage, through analysis, implementation, testing, production deployment, and maintenance. The team is international, and it's located in Krakow, Wroclaw, London and New York. As a Senior Data Engineer, you will design, build, and maintain scalable data pipelines using Python and Azure Data Factory, work with Azure SQL and PostgreSQL, develop and optimize ETL/ELT processes, and use Databricks for large datasets. You will also collaborate with various teams, ensure data quality, manage infrastructure with Terraform, and contribute to CI/CD practices using Azure DevOps.
Must have:
  • Design, build, and maintain scalable data pipelines using Python and Azure Data Factory
  • Work with Azure SQL and PostgreSQL to ingest, transform, and store structured and unstructured data
  • Develop and optimize ETL/ELT processes for high-volume data workflows
  • Use Databricks to process large datasets and build data models for downstream AI/ML components
  • Collaborate with data scientists, backend engineers, and product teams to understand data requirements
  • Ensure data quality, integrity, and security across all stages of the data lifecycle
  • Manage infrastructure as code using Terraform for provisioning and maintaining cloud resources
  • Contribute to CI/CD practices using Azure DevOps for data pipeline deployments and versioning
  • Support analytics and reporting teams by enabling data access via Power BI or similar tools
Good to have:
  • Experience with Power BI or other BI tools for data visualization and reporting
  • Knowledge of Spark and distributed data processing concepts
  • Familiarity with Delta Lake or similar data lakehouse architectures
  • Understanding of data governance, lineage, and cataloging tools (e.g. Azure Purview)
  • Basic knowledge of machine learning workflows or support for data science teams
  • Experience working with APIs for data ingestion or integration
  • Familiarity with containerization tools like Docker or Kubernetes
  • Exposure to monitoring and alerting tools for data pipeline health (e.g. Azure Monitor, Grafana)
  • Knowledge of data security best practices and compliance (e.g. GDPR, data encryption)
  • Prior experience working on AI-related or unstructured data projects

Job Details

Project description

You will join the team behind an internal AI platform for processing and interacting with unstructured data. The team is currently over 30 people strong and is organized into agile teams, each of which is self-sufficient and handles the creation of features from the idea stage, through analysis, implementation, testing, production deployment, and maintenance. The team is international, and it's located in Krakow, Wroclaw, London and New York.

Responsibilities

  • Design, build, and maintain scalable data pipelines using Python and Azure Data Factory
  • Work with Azure SQL and PostgreSQL to ingest, transform, and store structured and unstructured data
  • Develop and optimize ETL/ELT processes for high-volume data workflows
  • Use Databricks to process large datasets and build data models for downstream AI/ML components
  • Collaborate with data scientists, backend engineers, and product teams to understand data requirements
  • Ensure data quality, integrity, and security across all stages of the data lifecycle
  • Manage infrastructure as code using Terraform for provisioning and maintaining cloud resources
  • Contribute to CI/CD practices using Azure DevOps for data pipeline deployments and versioning
  • Support analytics and reporting teams by enabling data access via Power BI or similar tools

Skills

Must have

  • Experience in similar position +7 years
  • Strong programming skills in Python for data processing and scripting
  • Experience with Azure Data Factory (ADF) for building and orchestrating data pipelines
  • Proficiency in working with Azure SQL and PostgreSQL databases
  • Hands-on experience with Databricks for big data processing and transformation
  • Solid understanding of data engineering concepts: ETL/ELT, data modeling, data quality
  • Familiarity with infrastructure as code using Terraform
  • Experience with Azure DevOps for CI/CD pipelines and version control
  • Ability to work with unstructured data and integrate it into structured models
  • Experience in agile development environments and cross-functional teams
  • Good communication skills and ability to work in an international, distributed team

Nice to have

  • Experience with Power BI or other BI tools for data visualization and reporting
  • Knowledge of Spark and distributed data processing concepts
  • Familiarity with Delta Lake or similar data lakehouse architectures
  • Understanding of data governance, lineage, and cataloging tools (e.g. Azure Purview)
  • Basic knowledge of machine learning workflows or support for data science teams
  • Experience working with APIs for data ingestion or integration
  • Familiarity with containerization tools like Docker or Kubernetes
  • Exposure to monitoring and alerting tools for data pipeline health (e.g. Azure Monitor, Grafana)
  • Knowledge of data security best practices and compliance (e.g. GDPR, data encryption)
  • Prior experience working on AI-related or unstructured data projects

Other

Languages

English: C1 Advanced

Seniority

Senior

Similar Jobs

Pinterest - Staff Data Scientist, Engagement Ecosystem

Pinterest

San Francisco, California, United States (Hybrid)
1 Month ago
HHA Exchange - Marketing Operations Manager

HHA Exchange

United States (Hybrid)
2 Months ago
Ubisoft - Gen AI Programmer

Ubisoft

Pune, Maharashtra, India (On-Site)
4 Months ago
Snappr - Senior Product Manager

Snappr

San Francisco, California, United States (Hybrid)
1 Year ago
Progress - Senior Director, Product Marketing - ShareFile + MOVEIt

Progress

United States (Remote)
1 Month ago
Synechron - Lead Big Data Engineer (Java, Spark, Cloud, and Data Pipeline)

Synechron

Pune, Maharashtra, India (On-Site)
1 Year ago
Betson Group - Data & Reporting Analyst

Betson Group

Tbilisi, Tbilisi, Georgia (On-Site)
1 Month ago
paxie games - Data Scientist

paxie games

Istanbul, İstanbul, Türkiye (On-Site)
2 Months ago
Axon - Manager, System & Data Analytics

Axon

Ho Chi Minh City, Vietnam (Hybrid)
1 Month ago
Capgemini - Data Engineer

Capgemini

Mumbai, Maharashtra, India (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Calix - Senior Sales Engineer – Major Accounts

Calix

United States (Remote)
1 Month ago
lifechruh - Senior Quality Engineer

lifechruh

Edmond, Oklahoma, United States (On-Site)
10 Months ago
Moonbug Entertainment - Digital Marketing Manager

Moonbug Entertainment

London, England, United Kingdom (On-Site)
1 Month ago
Cognite - Senior Data Scientist

Cognite

Bengaluru, Karnataka, India (Hybrid)
10 Months ago
Roblox - Senior Software Engineer on the Economy Revenue team

Roblox

San Mateo, California, United States (On-Site)
1 Month ago
Winzo - Web Developer

Winzo

New Delhi, Delhi, India (On-Site)
3 Months ago
Lambda - Storage Protocols Engineering Manager

Lambda

San Francisco, California, United States (Hybrid)
1 Month ago
Moloco - Growth Manager

Moloco

Gurugram, Haryana, India (Hybrid)
3 Months ago
Discord - Senior Program Manager, Safety Core Initiatives

Discord

San Francisco, California, United States (On-Site)
2 Months ago
Gallagher - Web Content Editor

Gallagher

Bengaluru, Karnataka, India (On-Site)
1 Year ago

Get notifed when new similar jobs are uploaded

Jobs in New York, New York, United States

Intel  - 3D and Ray Tracing Architect

Intel

Folsom, California, United States (On-Site)
2 Months ago
upwork - Lead Product Designer

upwork

United States (Remote)
1 Month ago
upwork - Principal Product Manager, Reputation & Trust

upwork

United States (Remote)
2 Months ago
Apple - Software Asset Manager- Hardware Engineering Operations

Apple

Austin, Texas, United States (On-Site)
2 Months ago
Tencent - Business & Investment Analyst

Tencent

Palo Alto, California, United States (On-Site)
2 Months ago
Interactive Brokers - Software Engineer, Mid level

Interactive Brokers

Greenwich, Connecticut, United States (On-Site)
10 Months ago
Cold Iron Studios - Software Engineer - Builds and Systems

Cold Iron Studios

United States (Remote)
1 Month ago
Regent craft - Senior Perception Software Engineer - Sensor Fusion

Regent craft

North Kingstown, Rhode Island, United States (On-Site)
2 Months ago
Avalanche Studios Group - Executive Producer

Avalanche Studios Group

Salt Lake City, Utah, United States (Hybrid)
2 Months ago
Samsung Semiconductor - Manager, IP/Patent

Samsung Semiconductor

San Jose, California, United States (On-Site)
1 Month ago

Get notifed when new similar jobs are uploaded

Data Analysis Jobs

Cineplex - Analyst, Data & Insights

Cineplex

Toronto, Ontario, Canada (Hybrid)
1 Month ago
Square - Tax Data Analytics Internship

Square

Amsterdam, North Holland, Netherlands (On-Site)
1 Month ago
Head Digital Works - Data Scientist - Retention

Head Digital Works

Hyderabad, Telangana, India (On-Site)
1 Year ago
Trend Micro - Data Scientist

Trend Micro

Manila, Metro Manila, Philippines (On-Site)
16 Years ago
Vercel - Data Analyst, Finance

Vercel

San Francisco, California, United States (Hybrid)
1 Month ago
Arkose Labs - Data Analyst (Evening Shift)

Arkose Labs

Brisbane, Queensland, Australia (Hybrid)
1 Month ago
Unity - Staff Data AI Engineer

Unity

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
1 Month ago
FICO - Salesforce Business Analyst/Administrator (CPQ)

FICO

United States (Remote)
3 Months ago
Casumo - Senior Business Analyst

Casumo

Zagreb, Croatia (Hybrid)
5 Months ago
velotio technologies  - Senior Engineer (Data Engineer- Databricks)

velotio technologies

Pune, Maharashtra, India (Remote)
3 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Empower your future with Luxoft: Innovate, thrive and grow in a software-defined world.

Ukraine (On-Site)

Bengaluru, Karnataka, India (On-Site)

Hyderabad, Telangana, India (On-Site)

Bucharest, Romania (On-Site)

Bengaluru, Karnataka, India (On-Site)

Ukraine (Remote)

Paris, Île-de-France, France (On-Site)

Paris, Île-de-France, France (On-Site)

View All Jobs

Get notified when new jobs are added by luxsoft

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug