Data Engineer

codeninja

5+ Years | On Site | Full Time | 1 day ago

Apply Now

Job Summary

CodeNinja is seeking a Data Engineer to design, implement, and maintain scalable data infrastructure. This role involves building data ingestion, processing, and transformation pipelines, curating high-quality datasets for model training and retrieval systems, and supporting data governance. The engineer will collaborate with AI and ML teams to ensure data readiness for production use cases, contributing to impactful AI and enterprise-scale solutions.

Must Have

5+ years of experience in data engineering with cloud-based ETL/ELT systems
Strong proficiency in SQL and Python
Experience with big data tools such as Apache Spark or equivalent frameworks
Hands-on experience handling large unstructured or semi-structured datasets

Perks & Benefits

Provident Fund
Gym Membership
Leaves as per the company policy
Company-paid trips
Easy Loan Facility for Employees
Yearly increment
Maternity Benefits (Leaves & WFH)
Health Insurance (Maternity covered) – includes spouse and parents (till age 80)

Job Description

About Company

is a global AI and engineering services company helping enterprises build, scale, and operate intelligent systems. With 350+ engineers across four continents and 400+ successful deployments, enables organizations to harness artificial intelligence through Global Capability Centers, Work AI, Physical AI, and AI Labs. Recognized among Pakistan’s fastest-growing AI firms and a multi-award recipient on Clutch, empowers over 250 clients worldwide to innovate, automate, and compete in the intelligence economy.

Role Summary

We are seeking a Data Engineer to build and maintain scalable data infrastructure that supports model training, Retrieval-Augmented Generation (RAG) systems, and enterprise AI workflows.

Key Responsibilities

Design and implement data ingestion, processing, and transformation pipelines.
Prepare, curate, and optimize high-quality datasets for model training and retrieval systems.
Support data governance, quality monitoring, and lineage tracking across AI data pipelines.
Collaborate with AI and ML teams to ensure data readiness for production use cases.

Required Qualifications

5+ years of experience in data engineering with cloud-based ETL/ELT systems.
Strong proficiency in SQL and Python.
Experience with big data tools such as Apache Spark or equivalent frameworks.
Hands-on experience handling large unstructured or semi-structured datasets.

Disclaimer: is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. All aspects of employment including the decision to hire, promote, discipline, or discharge, will be based on merit, competence, & performance. Female and minorities are strongly encouraged and preferred to apply for the role.

Why

Work on impactful AI and enterprise-scale solutions.
Collaborate with global, cross-functional engineering teams.
Grow your skills in a fast-paced, innovation-driven environment.

Benefits

Provident Fund
Gym Membership
Leaves as per the company policy
Company-paid trips
Easy Loan Facility for Employees
Yearly increment
Maternity Benefits (Leaves & WFH)
Health Insurance (Maternity covered) – includes spouse and parents (till age 80)

6 Skills Required For This Role

Cross Functional Data Analytics Game Texts Spark Python Sql

Similar Jobs