Data Engineer
CodeNinja
Job Summary
CodeNinja is seeking a Data Engineer to design, implement, and maintain scalable data infrastructure. This role involves building data ingestion, processing, and transformation pipelines, curating high-quality datasets for model training and retrieval systems, and supporting data governance. The engineer will collaborate with AI and ML teams to ensure data readiness for production use cases, contributing to impactful AI and enterprise-scale solutions.
Must Have
- 5+ years of experience in data engineering with cloud-based ETL/ELT systems
- Strong proficiency in SQL and Python
- Experience with big data tools such as Apache Spark or equivalent frameworks
- Hands-on experience handling large unstructured or semi-structured datasets
Perks & Benefits
- Provident Fund
- Gym Membership
- Leaves as per the company policy
- Company-paid trips
- Easy Loan Facility for Employees
- Yearly increment
- Maternity Benefits (Leaves & WFH)
- Health Insurance (Maternity covered) – includes spouse and parents (till age 80)
Job Description
About the Company
CodeNinja is a global AI and engineering services company helping enterprises build, scale, and operate intelligent systems. With 350+ engineers across four continents and 400+ successful deployments, CodeNinja enables organizations to harness artificial intelligence through Global Capability Centers, Work AI, Physical AI, and AI Labs. Recognized among Pakistan’s fastest-growing AI firms and a multi-award recipient on Clutch, CodeNinja empowers over 250 clients worldwide to innovate, automate, and compete in the intelligence economy.
Role Summary
We are seeking a Data Engineer to build and maintain scalable data infrastructure that supports model training, Retrieval-Augmented Generation (RAG) systems, and enterprise AI workflows.
Key Responsibilities
- Design and implement data ingestion, processing, and transformation pipelines.
- Prepare, curate, and optimize high-quality datasets for model training and retrieval systems.
- Support data governance, quality monitoring, and lineage tracking across AI data pipelines.
- Collaborate with AI and ML teams to ensure data readiness for production use cases.
Required Qualifications
- 5+ years of experience in data engineering with cloud-based ETL/ELT systems.
- Strong proficiency in SQL and Python.
- Experience with big data tools such as Apache Spark or equivalent frameworks.
- Hands-on experience handling large unstructured or semi-structured datasets.
Disclaimer: CodeNinja is an equal opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees. All aspects of employment, including the decision to hire, promote, discipline, or discharge, will be based on merit, competence, and performance. Women and minorities are strongly encouraged to apply for the role.
Why CodeNinja
- Work on impactful AI and enterprise-scale solutions.
- Collaborate with global, cross-functional engineering teams.
- Grow your skills in a fast-paced, innovation-driven environment.