Data Engineer (ML)

1 Month ago • 3 Years + • Data Analysis

Job Summary

Job Description

We are seeking a talented ML Data Engineer to join our R&D team. In this role, you’ll be responsible for shaping our data infrastructure, empowering data science initiatives, and ensuring the reliability of our data ecosystem. You will build scalable data pipelines, ETLs, and ELTs to unify, normalize, and aggregate data in our data warehouse and feature store. Your role will also involve developing complex preprocessing pipelines for machine learning and advanced quality checks, identifying and mitigating anomalies, drifts, and inconsistencies that could impact downstream analyses and models. Beyond technical proficiency, a strong business intelligence (BI) acumen will be essential for understanding the context and implications of each data point, ensuring we drive data-informed decision-making across the organization.
Must have:
  • Design, build, and optimize complex, scalable, and robust data pipelines.
  • Unify, normalize, and aggregate datasets, ensuring validity, accuracy, consistency.
  • Create sophisticated preprocessing pipelines for machine learning models.
  • Implement advanced data validation techniques, detecting and addressing anomalies and data drift.
  • Understand business context and significance of each data point, translating needs into solutions.
  • Collaborate effectively with data scientists, analysts, and business stakeholders.

Job Details

Description

VI is the market leading Enterprise-AI platform for health, serving the world’s largest health organizations — from Fortune 500 health providers to pharma and consumer brands — helping them maximize acquisition, enrollment, engagement, retention, and health outcomes. Vi offers 3 main product lines: Activate, Engage and Operate.

Backed by $125M+ in R&D, our powerful platform serves over 175 million members daily — and growing. We are based in New York, Austin, Nashville & Tel Aviv.

We are seeking a talented ML Data Engineer to join our R&D team. In this role, you’ll be responsible for shaping our data infrastructure, empowering data science initiatives, and ensuring the reliability of our data ecosystem. You will build scalable data pipelines, ETLs, and ELTs to unify, normalize, and aggregate data in our data warehouse and feature store. Your role will also involve developing complex preprocessing pipelines for machine learning and advanced quality checks, identifying and mitigating anomalies, drifts, and inconsistencies that could impact downstream analyses and models. Beyond technical proficiency, a strong business intelligence (BI) acumen will be essential for understanding the context and implications of each data point, ensuring we drive data-informed decision-making across the organization.

Responsibilities

  • Pipeline Development: Design, build, and optimize complex, scalable, and robust data pipelines to ingest data from multiple sources into our data warehouse and feature store.
  • Data Transformation & Normalization: Work with both structured and unstructured data to unify, normalize, and aggregate datasets, ensuring validity, accuracy, consistency, and readiness for analysis and model development.
  • ML Preprocessing Pipeline: Collaborate with data scientists to create sophisticated preprocessing pipelines, handling the nuances of feature engineering, and preparing data for machine learning models.
  • Data Quality & Integrity: Implement advanced data validation techniques beyond simple checks, focusing on detecting and addressing anomalies, data drift, and ensuring data integrity across various sources.
  • Strong Business Understanding: Bring a business intelligence mindset to understand the context and significance of each data point, working closely with BI stakeholders to translate business needs into technical solutions.
  • Collaboration: Collaborate effectively with data scientists, analysts, and business stakeholders to ensure alignment on data requirements, quality standards, and project goals.

Requirements

  • B.Sc degree in Computer Science or a related technical field or equivalent practical experience
  • 3+ years of experience in a data engineering role
  • Coding experience with Python
  • Experience with SQL databases
  • Experience with data modeling, data warehousing, and building ELT/ETL pipelines
  • Experience with AWS cloud and infrastructure as code
  • Experience building data pipelines with big data frameworks such as Spark, Hadoop, etc.
  • Experience working with BI teams, business context and implications
  • Experience working closely with data scientists / on a data science projects
  • Technologically diverse background and ability/willingness to learn quickly

Similar Jobs

Accenture - Record to Report Ops Specialist

Accenture

Jaipur, Rajasthan, India (On-Site)
2 Months ago
Axon - Supplier Quality Engineer

Axon

Taipei City, Taiwan (On-Site)
1 Month ago
Antarctica Global - Research & Sustainability Analyst

Antarctica Global

Mumbai, Maharashtra, India (Remote)
4 Months ago
Arkose Labs - Staff iOS Engineer

Arkose Labs

Pune, Maharashtra, India (Hybrid)
3 Months ago
Ansys - Senior Technical Support Engineer

Ansys

Canonsburg, Pennsylvania, United States (Remote)
1 Month ago
Trailmix - Senior Data Engineer

Trailmix

London, England, United Kingdom (Hybrid)
1 Month ago
Fanatee - Data Science Intern

Fanatee

(On-Site)
1 Year ago
dun bradstreet - Senior Principal Data Scientist, AaaS

dun bradstreet

Frankfurt Am Main, Hessen, Germany (Hybrid)
2 Months ago
Luxoft - Murex XVA Techno-Functional Business Analyst

Luxoft

Sydney, New South Wales, Australia (On-Site)
9 Months ago
Sandsoft Games - Director of Data Science and Engineering

Sandsoft Games

Riyadh, Riyadh Province, Saudi Arabia (On-Site)
6 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Tesla - Virtual Diagnostic Technician

Tesla

Vienna, Vienna, Austria (On-Site)
6 Months ago
bytedance - Red Team Engineer, Security Assurance

bytedance

Singapore (On-Site)
10 Months ago
Thumbtack - Account Executive

Thumbtack

Philippines (Remote)
7 Months ago
Boomi  - Office Manager / Executive Assistant

Boomi

Tokyo, Japan (On-Site)
1 Month ago
Clearwater Analytics - Client Services Team Lead

Clearwater Analytics

Boise, Idaho, United States (On-Site)
1 Month ago
GoTo Group - Senior Data Analyst

GoTo Group

Jakarta, Indonesia (On-Site)
3 Weeks ago
PwC - Senior Manager, Azure Data Architect, Data Analytics, Advisory

PwC

Bengaluru, Karnataka, India (On-Site)
1 Year ago
Grab - Manager, Corporate Strategy

Grab

Pasig, Metro Manila, Philippines (On-Site)
2 Months ago
Qualcomm - Sr Lead Engineer - DFT

Qualcomm

Bengaluru, Karnataka, India (On-Site)
1 Month ago
Zones - Client Success Manager - 1

Zones

Islamabad, Islamabad Capital Territory, Pakistan (On-Site)
3 Weeks ago

Get notifed when new similar jobs are uploaded

Jobs in Tel Aviv-Yafo, Tel Aviv District, Israel

Varonis  - Cloud Security Research Team Leader

Varonis

Herzliya, Tel Aviv District, Israel (On-Site)
10 Months ago
plarium - Internal Communication Manager

plarium

Herzliya, Tel Aviv District, Israel (Hybrid)
3 Months ago
Pazu Games - Marketing Manager

Pazu Games

Yokne'am Illit, North District, Israel (On-Site)
3 Weeks ago
Unity - Senior Data Product Manager

Unity

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
3 Months ago
Unity - Senior Big Data & ML Engineer

Unity

Tel Aviv-Yafo, Tel Aviv District, Israel (Remote)
5 Months ago
Playtika - Graphic Designer

Playtika

Israel (On-Site)
1 Month ago
SciPlay - Server Engineer

SciPlay

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
1 Year ago
Unity - DevOps Tech Lead

Unity

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)
3 Months ago
Moon Active - Backend Developer

Moon Active

Tel Aviv-Yafo, Tel Aviv District, Israel (Hybrid)
1 Month ago
Varonis  - PySpark Backend Engineer – AI Security

Varonis

Herzliya, Tel Aviv District, Israel (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

Data Analysis Jobs

Blackshark - Senior Software Engineer - Data Plane Team

Blackshark

(Remote)
3 Months ago
HCL Tech - Senior Business Analyst

HCL Tech

California, United States (On-Site)
2 Months ago
Brillio - Principal Data Engineer

Brillio

Bengaluru, Karnataka, India (Hybrid)
5 Months ago
Precisly - FinOps Data Engineer

Precisly

Poland (On-Site)
1 Month ago
PwC - Senior Manager, Azure Data Architect, Data Analytics, Advisory

PwC

Bengaluru, Karnataka, India (On-Site)
1 Year ago
Rocket studio - Data Analyst (Intern)

Rocket studio

Hanoi, Vietnam (On-Site)
3 Months ago
Publicis Groupe - Publicis Media - Dual Study Program in Business Informatics Data Science & AI (m/f/d) - 2026

Publicis Groupe

Düsseldorf, North Rhine-Westphalia, Germany (Hybrid)
1 Month ago
yubo - Data Engineer

yubo

Paris, Île-de-France, France (Hybrid)
1 Month ago
GoTo Group - Data Analyst - Driver Risk

GoTo Group

Jakarta, Indonesia (On-Site)
1 Month ago
fluence - Sr. Business Analyst, Production Planning

fluence

Houston, Texas, United States (Hybrid)
3 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Austin, Texas, United States (Remote)

New York, United States (Remote)

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)

Tel Aviv-Yafo, Tel Aviv District, Israel (On-Site)

View All Jobs

Get notified when new jobs are added by VI

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug