Senior PySpark Data Engineer

3 Months ago • 5 Years + • Data Analyst

Job Summary

Job Description

Join a dynamic team in the Middle East working on diverse projects. As a Senior PySpark Data Engineer, you'll clarify requirements, design technical solutions, develop and test code, support QA, optimize processes, collaborate with cross-functional teams, stay updated on industry trends, document solutions, participate in code reviews, and ensure seamless project delivery. The role requires proficiency in Python, PySpark, SQL, ETL processes, data pipelines, and data analysis methodologies. Experience with cloud technologies (Azure preferred) is also beneficial.
Must have:
  • 5+ years of experience as a Senior Data Engineer
  • Big Data Technologies (Hadoop, Spark)
  • Data Security & Governance
  • Python & PySpark expertise
  • Advanced SQL knowledge
  • ETL experience
  • Data Pipelines & cleansing
  • API Integration
  • Git proficiency
  • Cloud Technology exposure
Good to have:
  • Streaming Data Processing (Kafka)
  • Docker
  • Data Modeling & Evaluation
  • Model Training, Deployment & Maintenance
  • Machine Learning experience
  • Applied Mathematics
  • Tableau/Power BI

Job Details

Project description

Join our dynamic team working on exciting projects in the thriving Middle East region. We offer a multitude of opportunities in various domains. Our diverse team comprises skilled professionals, including front-end and back-end developers, data analysts, data scientists, architects, analysts, and project managers. Currently, we are actively seeking a talented Data Engineer with proficiency in Python programming.

Responsibilities

Actively engage in requirements clarification and contribute to sprint planning sessions.

Design and architect technical solutions that align with project objectives.

Develop comprehensive unit and integration tests to ensure the robustness and reliability of the codebase.

Provide valuable support to QA teammates during the acceptance process, addressing and resolving issues promptly.

Continuously assess and refine best practices to optimize development processes and code quality.

Collaborate with cross-functional teams to ensure seamless integration of components and efficient project delivery.

Stay abreast of industry trends, emerging technologies, and best practices to contribute to ongoing process improvement initiatives.

Contribute to documentation efforts, ensuring clear and comprehensive records of technical solutions and best practices.

Actively participate in code reviews, providing constructive feedback and facilitating knowledge sharing within the team.

Skills

Must have

Technical skills:

5+ years of relevant experience in a Senior Data Engineer role

Big Data Technologies: Familiarity with big data technologies such as Hadoop, Apache Spark, or other distributed computing frameworks.

Data Security and Governance: Comprehensive understanding of data security principles and practices to ensure the confidentiality and integrity of sensitive information, coupled with knowledge of data governance frameworks and practices for ensuring data quality, compliance, and proper data management.

Python and PySpark: Demonstrated strong expertise in both Python and PySpark for efficient data processing and analytics.

Advanced SQL Knowledge: Proficient in SQL with the ability to handle complex queries and database operations.

ETL Experience: Prior experience working with Extract, Transform, Load (ETL) processes.

Data Pipelines: Familiarity with data cleansing, data profiling, data lineage, and adherence to best practices in data engineering.

Data Analysis Approaches: Some experience with various data analysis methodologies.

Python Libraries: Familiarity with building reusable Python libraries.

API Integration: Knowledge of integrating data pipelines with various APIs for seamless data exchange between systems.

Version Control: Proficiency in version control systems, such as Git, for tracking changes in code and collaborative development.

Cloud Technology Experience: Prior exposure to cloud technologies, particularly Azure or any leading cloud platform.

Data Visualization: Some exposure to data visualization tools like Tableau, Power BI, or others to create meaningful insights from data.

Collaboration Tools: Familiarity with collaboration tools such as Azure DevOps, Jira, Confluence, or others to enhance teamwork and project documentation.

Educational Background: A degree in computer science, mathematics, statistics, or a related technical discipline.

Financial Markets Knowledge: Familiarity with financial markets, portfolio theory, and risk management is a plus.

Non-technical skills:

Problem-Solving: Strong problem-solving skills to tackle complex data engineering challenges.

Data Storytelling: Ability to convey insights effectively through compelling data storytelling.

Quality Focus: Keen attention to delivering high-quality solutions within specified timelines.

Team Collaboration: Proven ability to work collaboratively within a team, taking a proactive approach to problem resolution and process improvement.

Communication Skills: Excellent communication skills to articulate technical concepts clearly and concisely.

Nice to have

Streaming Data Processing: Exposure to streaming data processing technologies like Apache Kafka for real-time data ingestion and processing.

Containerization: Knowledge of containerization technologies like Docker for creating, deploying, and running applications consistently across various environments.

Data Modeling and Evaluation: Extensive experience in data modeling and the evaluation of large datasets.

Model Training, Deployment, and Maintenance: Background in training, deploying, and maintaining models for effective data-driven decision-making.

Machine Learning: Experience in developing and implementing machine learning algorithms, Natural Language Processing (NLP), and Neural Networks.

Applied Mathematics: Proficiency in applied mathematics, including but not limited to linear algebra, probability, statistics, and distributions.

Other

Languages

English: C1 Advanced

Seniority

Senior

About The Company

Luxoft, a DXC Technology Company (NYSE: DXC), is a digital strategy and software engineering firm providing bespoke technology solutions that drive business change for customers the world over. Acquired by U.S. company DXC Technology in 2019, Luxoft is a global operation in 44 cities and 21 countries with an international, agile workforce of nearly 18,000 people. It combines a unique blend of engineering excellence and deep industry expertise, helping over 425 global clients innovate in the areas of automotive, financial services, travel and hospitality, healthcare, life sciences, media and telecommunications.

DXC Technology is a leading Fortune 500 IT services company that helps global companies run their mission-critical systems. Together, DXC and Luxoft offer a differentiated customer-value proposition for digital transformation by combining Luxoft’s front-end digital capabilities with DXC’s expertise in IT modernization and integration. Follow our profile for regular updates and insights into technology and business needs.
