Senior Data Engineer - ML Team
This position for candidates in Gurgaon region only - we do not offer relocation
Role Overview
As a **Machine Learning Data Engineer**, you will be responsible for designing, building, and maintaining scalable data pipelines and ML infrastructure. Collaborating with cross-functional teams, you will ensure the data and ML pipelines support machine learning model development, deployment, and monitoring needs. You will focus on enabling machine learning use cases like data preparation or model serving, while incorporating MLOps best practices to enhance automation, scalability, and model lifecycle management.
Key Responsibilities:
- Design, develop, and maintain scalable data and ML pipelines to support machine learning use cases.
- Collaborate with Data Science, Engineering, and Product teams to define data and ML requirements and technical specifications.
- Implement ETL/ELT processes to prepare data for machine learning models, ensuring data quality and integrity.
- Design and deploy automated model training, validation, and monitoring workflows using MLOps best practices.
- Develop reusable components for data and feature engineering, model training, and model serving.
- Ensure team visibility through metric measurements and insights, focusing on model performance and health.
- Establish best ML model versioning, deployment, and monitoring practices to support continuous integration and deployment (CI/CD).
Qualifications:
- Bachelor's degree in Data Science, Computer Science, Engineering, or a related technical field.
- 5+ years of experience in building data and ML pipelines for machine learning projects.
- Expertise in data modeling, architecture, and SQL.
- Proficiency in programming languages (Python, Scala, Java) and experience with ML libraries/frameworks (e.g., TensorFlow, PyTorch, Scikit-Learn).
- Experience with Big Data frameworks like Hadoop and Spark.
- Proven experience with cloud-based ML and data solutions (e.g., AWS Sagemaker, GCP AI Platform, Azure ML).
- Strong understanding of MLOps concepts, including model lifecycle management, CI/CD, and model monitoring.
- Familiarity with containerization tools (e.g., Docker, Kubernetes) to facilitate scalable ML deployments.
What We Offer:
- Competitive salary plus performance-based bonuses.
- Opportunity to work in an innovative environment at the forefront of data solutions.
- Professional development and career growth opportunities. We are scaling!
- Comprehensive benefits package.
Trustana is an equal-opportunity employer. We celebrate diversity and are committed to creating an inclusive environment for all employees.
About Us:
Trustana is a connected trade solution that enables companies of all sizes to compete. It helps retailers, distributors, and brand owners onboard, manage, and enrich their product data, increasing sales conversion and overall productivity. Over three years, Trustana has supported thousands of customers. Utilizing decentralized technologies, Trustana’s vision is to unlock additional business value for companies globally.
Please see https://www.trustana.com/careers for more information on Trustana!