Data Engineer - Consumer Data AI / ML

undefined ago • 4 Years + • Data Analysis • $96,000 PA - $200,000 PA

Job Summary

Job Description

The ideal candidate will have strong AI/ML experience to design, build, and optimize scalable data pipelines and infrastructure that power advanced analytics and machine learning solutions. In this role, you will collaborate closely with data scientists, software engineers, and business stakeholders to prepare and transform large datasets, support end-to-end model development and deployment, and ensure robust, efficient, and secure data flows. You will leverage your expertise in cloud platforms, big data tools, and machine learning frameworks to drive innovation and deliver actionable insights that advance our organization’s AI initiatives and business objectives.
Must have:
  • Design, build, and maintain scalable data pipelines and ETL processes to support machine learning and AI initiatives on Google Cloud Platform (GCP).
  • Implement and optimize data storage solutions using GCP services such as BigQuery, Cloud Storage, and Dataflow.
  • Ensure data quality, integrity, and security throughout the data lifecycle.
  • Collaborate with data scientists, analysts, and business stakeholders to understand data requirements and deliver actionable insights.
  • Monitor, troubleshoot, and maintain the health and performance of cloud-based data infrastructure.
  • Automate manual processes and repetitive tasks to improve efficiency and reduce errors.
  • Apply data governance and compliance best practices to protect sensitive information and meet regulatory standards.
  • Stay current with new GCP features, tools, and best practices to continuously enhance data management capabilities.
  • Document solutions, processes, and architectural decisions to facilitate knowledge sharing and maintainability.
  • BS or MS in Computer Science or a related major, or equivalent experience.
  • 4+ years of software engineering experience, with a strong emphasis on system design and backend development.
  • 2+ years hands-on experience with Google Cloud Platform ecosystem (BigQuery, Dataproc, Composer, Dataflow, Data Catalog, Observability) or AWS equivalent.
  • Proven ability to design, build, and maintain data pipelines that support machine learning and AI model development, training, and deployment.
  • Familiarity with data security, compliance, and governance best practices.
  • Strong problem-solving skills, attention to detail, and ability to work collaboratively with cross-functional teams.
  • Excellent communication skills and ability to tell insightful stories using data and also manage communication within internal teams and stakeholders.
Good to have:
  • Java
  • Python
  • Scala
  • SQL
Perks:
  • flexible hybrid work options
  • healthcare
  • 401k
  • backup childcare
  • education stipends

Job Details

It takes powerful technology to connect our brands and partners with an audience of hundreds of millions of people. Whether you’re looking to write mobile app code, engineer the servers behind our massive ad tech stacks, or develop algorithms to help us process trillions of data points a day, what you do here will have a huge impact on our business—and the world.

Summary:

The ideal candidate will have strong AI/ML experience to design, build, and optimize scalable data pipelines and infrastructure that power advanced analytics and machine learning solutions. In this role, you will collaborate closely with data scientists, software engineers, and business stakeholders to prepare and transform large datasets, support end-to-end model development and deployment, and ensure robust, efficient, and secure data flows. You will leverage your expertise in cloud platforms, big data tools, and machine learning frameworks to drive innovation and deliver actionable insights that advance our organization’s AI initiatives and business objectives.

Responsibilities:

  • Design, build, and maintain scalable data pipelines and ETL processes to support machine learning and AI initiatives on Google Cloud Platform (GCP).
  • Implement and optimize data storage solutions using GCP services such as BigQuery, Cloud Storage, and Dataflow.
  • Ensure data quality, integrity, and security throughout the data lifecycle.
  • Collaborate with data scientists, analysts, and business stakeholders to understand data requirements and deliver actionable insights.
  • Monitor, troubleshoot, and maintain the health and performance of cloud-based data infrastructure.
  • Automate manual processes and repetitive tasks to improve efficiency and reduce errors.
  • Apply data governance and compliance best practices to protect sensitive information and meet regulatory standards.
  • Stay current with new GCP features, tools, and best practices to continuously enhance data management capabilities.
  • Document solutions, processes, and architectural decisions to facilitate knowledge sharing and maintainability.

Qualifications:

  • BS or MS in Computer Science or a related major, or equivalent experience
  • 4+ years of software engineering experience, with a strong emphasis on system design and backend development.
  • 2+ years hands-on experience with Google Cloud Platform ecosystem (BigQuery, Dataproc, Composer, Dataflow, Data Catalog, Observability) or AWS equivalent.
  • Proven ability to design, build, and maintain data pipelines that support machine learning and AI model development, training, and deployment.
  • Familiarity with data security, compliance, and governance best practices.
  • Strong problem-solving skills, attention to detail, and ability to work collaboratively with cross-functional teams.
  • Excellent communication skills and ability to tell insightful stories using data and also manage communication within internal teams and stakeholders.

The material job duties and responsibilities of this role include those listed above as well as adhering to Yahoo policies; exercising sound judgment; working effectively, safely and inclusively with others; exhibiting trustworthiness and meeting expectations; and safeguarding business operations and brand integrity.

At Yahoo, we offer flexible hybrid work options that our employees love! While most roles don’t require regular office attendance, you may occasionally be asked to attend in-person events or team sessions. You’ll always get notice to make arrangements. Your recruiter will let you know if a specific job requires regular attendance at a Yahoo office or facility. If you have any questions about how this applies to the role, just ask the recruiter!

Similar Jobs

Notion - Software Engineer, Mail (Frontend)

Notion

San Francisco, California, United States (On-Site)
2 Months ago
Patreon - Senior Data Scientist

Patreon

San Francisco, California, United States (Hybrid)
4 Months ago
FlockSafety - Traveling Installation Technician

FlockSafety

Utica, New York, United States (Remote)
1 Month ago
Rolls-Royce - PSB Portfolio & Project Planner

Rolls-Royce

Singapore (On-Site)
1 Month ago
Aptive - Tooling Engineer

Aptive

Quimistán, Santa Bárbara Department, Honduras (On-Site)
1 Year ago
Lambda - Data Center Operations Engineer

Lambda

Atlanta, Georgia, United States (On-Site)
1 Month ago
Normalyze - Performance Test - Senior Engineer - Solutions - Data Security - India

Normalyze

Bengaluru, Karnataka, India (Remote)
8 Months ago
P99 soft - Data Engineer

P99 soft

Hyderabad, Telangana, India (On-Site)
3 Months ago
Ion - Internship - Data Science

Ion

Pisa, Tuscany, Italy (On-Site)
10 Months ago
dun bradstreet - Senior Principal Data Scientist, AaaS

dun bradstreet

Frankfurt Am Main, Hessen, Germany (Hybrid)
2 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Postman - People Ops Communication Contractor

Postman

San Francisco, California, United States (Hybrid)
3 Months ago
Kin. - Director of Growth Marketing

Kin.

United States (Remote)
4 Weeks ago
Google - Associate Android Auto Partner Engineer, gReach Program

Google

Seoul, South Korea (On-Site)
3 Months ago
Super.com - Senior Software Engineer, Payments

Super.com

Canada (Remote)
9 Months ago
BioFire - IS Business Analyst Intern

BioFire

St. Louis, Missouri, United States (On-Site)
1 Month ago
TransUnion - Product Marketing Manager – Specialized Risk Group (SRG)

TransUnion

Alpharetta, Georgia, United States (Hybrid)
1 Month ago
HappyFox - Product Manager

HappyFox

Bengaluru, Karnataka, India (On-Site)
1 Year ago
CD PROJEKT RED - IT Director

CD PROJEKT RED

Boston, Massachusetts, United States (On-Site)
2 Months ago
Paytm - Product Operation - Assistant Manager - Lending

Paytm

Noida, Uttar Pradesh, India (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Jobs in United States

Twitch - Senior Security Engineer

Twitch

San Francisco, California, United States (On-Site)
1 Month ago
Playstation - Staff Software Engineer

Playstation

San Mateo, California, United States (On-Site)
1 Month ago
Warner Bros - NetherRealm Studios - Lead Software Engineer

Warner Bros - NetherRealm Studios

Troy, New York, United States (Remote)
2 Months ago
Alpha Sense - Senior Product Manager, AI Workflows

Alpha Sense

New York, United States (On-Site)
2 Months ago
Rackspace Technology - Communications and Content Development Manager

Rackspace Technology

San Antonio, Texas, United States (Hybrid)
1 Month ago
bytedance - Optical Scientist - Display Optics System

bytedance

San Jose, California, United States (On-Site)
5 Months ago
WebFX - Jr. Web Developer

WebFX

Harrisburg, Pennsylvania, United States (On-Site)
9 Months ago
NVIDIA - Senior VLSI Physical Design Integration Engineer

NVIDIA

Massachusetts, United States (On-Site)
4 Months ago
Axon - Pricing Analyst - New Product Introduction

Axon

Atlanta, Georgia, United States (Hybrid)
1 Month ago
Fearless - Program Manager

Fearless

Washington, District Of Columbia, United States (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

Data Analysis Jobs

Electronic Arts - Senior AI Data Scientist

Electronic Arts

Kirkland, Washington, United States (On-Site)
3 Months ago
Scopely - Senior Data Analyst, Ads

Scopely

Spain (Hybrid)
9 Months ago
Apple - Senior / Staff Data Infrastructure Engineer for Lakehouse, Apple Data Platform

Apple

Cupertino, California, United States (On-Site)
1 Month ago
TransUnion - Senior Consultant, Data Science and Analytics

TransUnion

Hong Kong (On-Site)
2 Months ago
Blackshark - Senior Software Engineer - Data Plane Team

Blackshark

(Remote)
3 Months ago
binance - Binance Accelerator Program - Data Analyst

binance

Dubai, Dubai, United Arab Emirates (Remote)
3 Years ago
Tesla - Data Engineer Internship

Tesla

North Holland, Netherlands (On-Site)
6 Months ago
GoTo Group - Data Engineer

GoTo Group

Jakarta, Indonesia (On-Site)
1 Month ago
ten square games - Data Scientist

ten square games

Wrocław, Lower Silesian Voivodeship, Poland (Hybrid)
3 Weeks ago
CommerceIQ - Software Development Engineer II - Data/Platform Team

CommerceIQ

Bengaluru, Karnataka, India (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Yahoo serves as a trusted guide for hundreds of millions of people globally, helping them achieve their goals online through our portfolio of iconic products. For advertisers, Yahoo Advertising offers omnichannel solutions and powerful data to engage with our brands and deliver results.

United States (Hybrid)

United Kingdom (Hybrid)

Hong Kong (Hybrid)

Canada, Kentucky, United States (Hybrid)

View All Jobs

Get notified when new jobs are added by Yahoo

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug