Middle Data Engineer (Python)

1 Month ago • 2 Years + • Data Analyst

Job Summary

Job Description

This role involves designing and implementing data pipelines using Python and PySpark within an Azure environment. Responsibilities include collecting, cleaning, and transforming data from various sources; building and maintaining data storage and processing systems (databases, data warehouses, data lakes); adhering to data governance policies; collaborating with data analysts and scientists; participating in code reviews and performance tuning; working with Big Data Solution Architects to optimize data ingestion; ensuring solutions meet production-ready standards; and participating in daily project meetings. The project focuses on a large-scale data transformation to improve data accuracy, consistency, and accessibility for reporting, analytics, and machine learning.
Must have:
  • 2+ years big data experience
  • Python & PySpark proficiency
  • 2+ years Azure experience
  • Data querying & manipulation skills
  • Unit & integration testing
  • CI/CD pipeline experience
  • Excellent communication skills
Good to have:
  • Data visualization tools (SSRS, Power BI)
  • Machine learning knowledge
  • Data privacy regulation knowledge (GDPR, CCPA)
Perks:
  • Flexible working format
  • Competitive salary
  • Personalized career growth
  • Professional development tools
  • Education reimbursement
  • Corporate events

Job Details

We are looking for a Middle Big Data Engineer (Python+Azure) to join our team!

Client Overview:
Our client is involved in a large-scale Data Transformation project, with a focus on solidifying the foundation of their data operations. They are aiming to ensure that data is accurate, consistent, and available at critical times to support their business needs. 

Project Objectives:
The project aims to build and maintain robust data pipelines, scalable storage systems, and efficient processing mechanisms using Azure technology. 

The goal is to support the client's data-driven decision-making by ensuring clean, transformed, and readily accessible data for reporting, analytics, and machine learning across the organization.

Responsibilities:

  • Design and implement data pipelines to collect, clean, and transform data from various sources.
  • Build and maintain data storage and processing systems, including databases, data warehouses, and data lakes.
  • Follow data governance policies and procedures.
  • Collaborate with data analysts, data scientists, and other stakeholders to understand and meet their data needs.
  • Participate in code reviews, performance tuning, best-practice discussions, and brainstorming sessions within the team.
  • Work with Big Data Solution Architects to design, prototype, implement, and optimize data ingestion pipelines.
  • Ensure solutions are production-ready in terms of operational, security, and compliance standards.
  • Participate in daily project and agile meetings, providing technical support for issue resolution.
  • Communicate clearly and concisely with the business about item status and blockers.
  • Maintain comprehensive knowledge of the client's data landscape.

Requirements:

  • 2+ years of design & development experience with big data technologies.
  • Proficiency in Python and PySpark.
  • 2+ years of development experience in cloud technologies like Azure.
  • Strong skills in querying and manipulating data from various databases (relational and big data).
  • Experience in writing effective and maintainable unit and integration tests for ingestion pipelines.
  • Familiarity with static analysis and code quality tools, and experience building CI/CD pipelines. 
  • Excellent communication, problem-solving, and leadership skills.
  • Experience working on high-traffic and large-scale software products.

Nice to Have:

  • Experience with data visualization tools (e.g., SSRS, Power BI).
  • Knowledge of machine learning algorithms and their applications in big data.
  • Familiarity with data privacy regulations (e.g., GDPR, CCPA).

We offer:

  • Flexible working format - remote, office-based or flexible
  • A competitive salary and good compensation package
  • Personalized career growth
  • Professional development tools (mentorship program, tech talks and trainings, centers of excellence, and more)
  • Active tech communities with regular knowledge sharing
  • Education reimbursement
  • Memorable anniversary presents
  • Corporate events and team buildings
  • Other location-specific benefits
