Lead Data Engineer

5 Minutes ago • 7 Years +
Data Analysis

Job Description

N-iX is seeking a Lead Data Engineer to join a strategic long-term partnership with a leading North American school transportation provider. This role involves designing complex ETL processes, building and maintaining data pipelines using Python, improving data quality, and optimizing data delivery. The engineer will collaborate with various stakeholders to integrate data from enterprise sources and assist with data-related technical issues, driving digital transformation for millions of users.
Good To Have:
  • Experience in the schema and dimensional data design.
  • Collaboration within a scaled team, using Agile methodology.
  • Decent knowledge of CI/CD (Docker, Cloud formation, Git).
Must Have:
  • Design complex ETL processes of various data sources in the data warehouse.
  • Build new and maintain existing data pipelines using Python to improve efficiency and latency.
  • Improve data quality through anomaly detection by building and working with internal tools to measure data and automatically detect changes.
  • Identify, design, and implement internal process improvements, including re-designing infrastructure for greater scalability, optimizing data delivery, and automating manual processes.
  • Perform data modeling and improve our existing data models for analytics.
  • Collaborate with SMEs, architects, analysts, and others to build solutions that integrate data from many of our enterprise data sources.
  • Partner with stakeholders, including data, design, product, and executive teams, and assist them with data-related technical issues.
  • Proficiency in Python 7+ years.
  • 3-5 years of commercial experience in building and maintaining a Data Lake.
  • Experience leading a Data Lake team of 3-5 Engineers (2 years).
  • Good knowledge of AWS cloud services, including the Glue framework with integration type of projects (2 years).
  • Experience maintaining Apache Kafka.
  • Steady expertise in data processing tools, including Redis, Apache Spark, Apache Iceberg, Athena.
  • Knowledge of job scheduling and orchestration using Airflow.
  • Experience in events streaming.
  • Well-versed in the optimization of ETL processes.
  • Experience of developing high-load backend services on Python.
  • Good understanding of algorithms and data structures.
  • Excellent communication skills, both written and verbal.
Perks:
  • Flexible working format - remote, office-based or flexible.
  • A competitive salary and good compensation package.
  • Personalized career growth.
  • Professional development tools (mentorship program, tech talks and trainings, centers of excellence, and more).
  • Active tech communities with regular knowledge sharing.
  • Education reimbursement.
  • Memorable anniversary presents.
  • Corporate events and team buildings.
  • Other location-specific benefits.

Add these skills to join the top 1% applicants for this job

communication
github
data-structures
game-texts
agile-development
aws
apache-kafka
spark
redis
ci-cd
docker
git
python
algorithms

Our customer is the leading school transportation provider in North America, being the owner of more than a half of all yellow school buses in the United States. Every day, the company completes 5 million student journeys, moving more passengers than all U.S. airlines combined and delivers reliable, quality services for 1,100 school districts.

N-iX has built a successful cooperation with the client delivering a range of complex initiatives. As a result, N-iX has been selected as a strategic long-term partner to drive the digital transformation on an enterprise level, fully remodeling the technology landscape for 55,000 employees and millions of people across North America.

Responsibilities:

  • Design complex ETL processes of various data sources in the data warehouse
  • Build new and maintain existing data pipelines using Python to improve efficiency and latency
  • Improve data quality through anomaly detection by building and working with internal tools to measure data and automatically detect changes
  • Identify, design, and implement internal process improvements, including re-designing infrastructure for greater scalability, optimizing data delivery, and automating manual processes
  • Perform data modeling and improve our existing data models for analytics
  • Collaborate with SMEs, architects, analysts, and others to build solutions that integrate data from many of our enterprise data sources
  • Partner with stakeholders, including data, design, product, and executive teams, and assist them with data-related technical issues

Requirements:

  • Proficiency in Python 7+ years
  • 3-5 years of commercial experience in building and maintaining a Data Lake
  • Experience leading a Data Lake team of 3-5 Engineers (2 years)
  • Good knowledge of AWS cloud services, including the Glue framework with integration type of projects (2 years)
  • Experience maintaining Apache Kafka
  • Steady expertise in data processing tools, including Redis, Apache Spark, Apache Iceberg, Athena.
  • Knowledge of job scheduling and orchestration using Airflow
  • Experience in events streaming
  • Well-versed in the optimization of ETL processes
  • Experience of developing high-load backend services on Python.
  • Good understanding of algorithms and data structures
  • Excellent communication skills, both written and verbal

Nice to have:

  • Experience in the schema and dimensional data design
  • Collaboration within a scaled team, using Agile methodology
  • Decent knowledge of CI/CD (Docker, Cloud formation, Git)

We offer*:

  • Flexible working format - remote, office-based or flexible
  • A competitive salary and good compensation package
  • Personalized career growth
  • Professional development tools (mentorship program, tech talks and trainings, centers of excellence, and more)
  • Active tech communities with regular knowledge sharing
  • Education reimbursement
  • Memorable anniversary presents
  • Corporate events and team buildings
  • Other location-specific benefits

*not applicable for freelancers

Set alerts for more jobs like Lead Data Engineer
Set alerts for new jobs by N-ix
Set alerts for new Data Analysis jobs in Ukraine
Set alerts for new jobs in Ukraine
Set alerts for Data Analysis (Remote) jobs

Contact Us
hello@outscal.com
Made in INDIA 💛💙