Data Engineer - Python & Databricks

3 Months ago • 5+ Years • Data Analyst

Job Summary

Job Description

As a Data Engineer Developer, you will design, develop, and maintain data pipelines using Python and Databricks to process large-scale datasets. You'll collaborate with data scientists, analysts, and stakeholders to gather requirements and build efficient, scalable solutions for advanced analytics and reporting. Responsibilities include data pipeline development (batch and real-time), ETL process creation and maintenance, data integration from various sources, collaboration with cross-functional teams, performance optimization, data validation, cloud platform integration (AWS, Azure, or Google Cloud), pipeline automation and scheduling, and comprehensive documentation. The role requires expertise in Python, Databricks, and big data technologies, along with strong data modeling and warehousing skills.
Must have:
  • 5+ years Data Engineering experience with Python expertise
  • Databricks or similar big data platform experience
  • Strong understanding of data pipelines, ETL, and data integration
  • Cloud platform experience (AWS, Azure, or GCP)
  • Proficiency in SQL and relational/non-relational databases
  • Experience with big data technologies (Spark, Kafka, Hadoop)
  • Data modeling, warehousing, and database design knowledge
  • Experience with Git and CI/CD pipelines
Good to have:
  • Delta Lake, Lakehouse architecture experience
  • Machine learning and data science workflow familiarity
  • DevOps or DataOps experience
  • Terraform, Docker, or Kubernetes knowledge
  • Data governance and privacy regulation knowledge

Job Details

Project description

As a Data Engineer Developer, you will design, develop, and maintain data pipelines using Python and Databricks to process large-scale datasets. You will collaborate with data scientists, analysts, and business stakeholders to gather data requirements and build efficient, scalable solutions that enable advanced analytics and reporting.

Responsibilities

Data Pipeline Development: Design, develop, and implement scalable data pipelines using Python and Databricks for batch and real-time data processing.

ETL Processes: Build and maintain ETL (Extract, Transform, Load) processes to gather, transform, and store data from multiple sources.

Data Integration: Integrate structured and unstructured data from various internal and external sources into data lakes or warehouses, ensuring data accuracy and quality.

Collaboration: Work closely with data scientists, analysts, and business teams to understand data needs and deliver efficient solutions.

Performance Optimization: Optimize the performance of data pipelines and workflows to ensure efficient processing of large datasets.

Data Validation: Implement data validation and monitoring mechanisms to ensure data quality, consistency, and reliability.

Cloud Integration: Work with cloud platforms like AWS, Azure, or Google Cloud to build and maintain data storage and processing infrastructure.

Automation & Scheduling: Automate data pipelines and implement scheduling mechanisms to ensure timely and reliable data delivery.

Documentation: Maintain comprehensive documentation for data pipelines, processes, and best practices.

Skills

Must have

5+ years of experience as a Data Engineer with strong expertise in Python.

Bachelor's degree in Computer Science, Data Engineering, or a related field (or equivalent experience).

Hands-on experience with Databricks or similar big data platforms.

Strong understanding of data pipelines, ETL processes, and data integration techniques.

Experience with cloud-based platforms such as AWS, Azure, or Google Cloud, particularly with data lake and object storage services such as S3 or Azure Blob Storage.

Proficiency in SQL and experience with relational and non-relational databases.

Familiarity with big data technologies like Apache Spark, Kafka, or Hadoop.

Strong understanding of data modeling, data warehousing, and database design principles.

Ability to work with large, complex datasets, ensuring data integrity and performance optimization.

Experience with version control tools like Git and with CI/CD pipelines for data engineering.

Excellent problem-solving skills, attention to detail, and the ability to work in a collaborative environment.

Nice to have

Experience with Delta Lake, Lakehouse architecture, or other modern data storage solutions.

Familiarity with machine learning and data science workflows.

Experience with DevOps or DataOps practices.

Knowledge of Terraform, Docker, or Kubernetes for cloud infrastructure automation.

Familiarity with data governance, data privacy regulations (e.g., GDPR, CCPA), and data security best practices.

Other

Languages

English: B2 Upper Intermediate

Seniority

Regular

About The Company

Luxoft, a DXC Technology Company (NYSE: DXC), is a digital strategy and software engineering firm providing bespoke technology solutions that drive business change for customers the world over. Acquired by U.S. company DXC Technology in 2019, Luxoft is a global operation in 44 cities and 21 countries with an international, agile workforce of nearly 18,000 people. It combines a unique blend of engineering excellence and deep industry expertise, helping over 425 global clients innovate in the areas of automotive, financial services, travel and hospitality, healthcare, life sciences, media and telecommunications.

DXC Technology is a leading Fortune 500 IT services company that helps global companies run their mission-critical systems. Together, DXC and Luxoft offer a differentiated customer-value proposition for digital transformation by combining Luxoft’s front-end digital capabilities with DXC’s expertise in IT modernization and integration. Follow our profile for regular updates and insights into technology and business needs.
