Data Engineer - Consumer Data AI / ML

undefined ago • 4 Years + • Data Analysis • $96,000 PA - $200,000 PA

Job Summary

Job Description

The ideal candidate will have strong AI/ML experience to design, build, and optimize scalable data pipelines and infrastructure that power advanced analytics and machine learning solutions. In this role, you will collaborate closely with data scientists, software engineers, and business stakeholders to prepare and transform large datasets, support end-to-end model development and deployment, and ensure robust, efficient, and secure data flows. You will leverage your expertise in cloud platforms, big data tools, and machine learning frameworks to drive innovation and deliver actionable insights that advance our organization’s AI initiatives and business objectives.
Must have:
  • Design, build, and maintain scalable data pipelines and ETL processes for ML/AI on GCP.
  • Implement and optimize data storage solutions using GCP services like BigQuery, Cloud Storage, Dataflow.
  • Ensure data quality, integrity, and security throughout the data lifecycle.
  • Collaborate with data scientists and stakeholders to understand requirements and deliver insights.
  • Monitor, troubleshoot, and maintain cloud-based data infrastructure performance.
  • Automate manual processes and repetitive tasks to improve efficiency.
  • Apply data governance and compliance best practices for sensitive information.
  • Stay current with new GCP features and best practices for data management.
  • Document solutions, processes, and architectural decisions for knowledge sharing.
  • 4+ years of software engineering experience, strong in system design and backend.
  • 2+ years hands-on experience with Google Cloud Platform ecosystem or AWS equivalent.
  • Proven ability to design, build, and maintain data pipelines for ML/AI model development.
  • Familiarity with data security, compliance, and governance best practices.
Good to have:
  • Fluency with at least one object-oriented programming language from Java, Python, or Scala is highly desirable.
  • SQL proficiency is also valued for database operations.
Perks:
  • Flexible hybrid work options
  • Healthcare
  • 401k
  • Backup childcare
  • Education stipends

Job Details

It takes powerful technology to connect our brands and partners with an audience of hundreds of millions of people. Whether you’re looking to write mobile app code, engineer the servers behind our massive ad tech stacks, or develop algorithms to help us process trillions of data points a day, what you do here will have a huge impact on our business—and the world.

Summary:

The ideal candidate will have strong AI/ML experience to design, build, and optimize scalable data pipelines and infrastructure that power advanced analytics and machine learning solutions. In this role, you will collaborate closely with data scientists, software engineers, and business stakeholders to prepare and transform large datasets, support end-to-end model development and deployment, and ensure robust, efficient, and secure data flows. You will leverage your expertise in cloud platforms, big data tools, and machine learning frameworks to drive innovation and deliver actionable insights that advance our organization’s AI initiatives and business objectives.

Responsibilities:

  • Design, build, and maintain scalable data pipelines and ETL processes to support machine learning and AI initiatives on Google Cloud Platform (GCP).
  • Implement and optimize data storage solutions using GCP services such as BigQuery, Cloud Storage, and Dataflow.
  • Ensure data quality, integrity, and security throughout the data lifecycle.
  • Collaborate with data scientists, analysts, and business stakeholders to understand data requirements and deliver actionable insights.
  • Monitor, troubleshoot, and maintain the health and performance of cloud-based data infrastructure.
  • Automate manual processes and repetitive tasks to improve efficiency and reduce errors.
  • Apply data governance and compliance best practices to protect sensitive information and meet regulatory standards.
  • Stay current with new GCP features, tools, and best practices to continuously enhance data management capabilities.
  • Document solutions, processes, and architectural decisions to facilitate knowledge sharing and maintainability.

Qualifications:

  • BS or MS in Computer Science or a related major, or equivalent experience
  • 4+ years of software engineering experience, with a strong emphasis on system design and backend development.
  • 2+ years hands-on experience with Google Cloud Platform ecosystem (BigQuery, Dataproc, Composer, Dataflow, Data Catalog, Observability) or AWS equivalent.
  • Proven ability to design, build, and maintain data pipelines that support machine learning and AI model development, training, and deployment.
  • Fluency with at least one object-oriented programming language from Java, Python, or Scala is highly desirable, as these skills are critical for developing robust applications and managing data workflows effectively. SQL proficiency is also valued for database operations.
  • Familiarity with data security, compliance, and governance best practices.
  • Strong problem-solving skills, attention to detail, and ability to work collaboratively with cross-functional teams.
  • Excellent communication skills and ability to tell insightful stories using data and also manage communication within internal teams and stakeholders.

The material job duties and responsibilities of this role include those listed above as well as adhering to Yahoo policies; exercising sound judgment; working effectively, safely and inclusively with others; exhibiting trustworthiness and meeting expectations; and safeguarding business operations and brand integrity.

At Yahoo, we offer flexible hybrid work options that our employees love! While most roles don’t require regular office attendance, you may occasionally be asked to attend in-person events or team sessions. You’ll always get notice to make arrangements. Your recruiter will let you know if a specific job requires regular attendance at a Yahoo office or facility. If you have any questions about how this applies to the role, just ask the recruiter!

Yahoo is proud to be an equal opportunity workplace. All qualified applicants will receive consideration for employment without regard to, and will not be discriminated against based on age, race, gender, color, religion, national origin, sexual orientation, gender identity, veteran status, disability or any other protected category. Yahoo will consider for employment qualified applicants with criminal histories in a manner consistent with applicable law. Yahoo is dedicated to providing an accessible environment for all candidates during the application process and for employees during their employment. If you need accessibility assistance and/or a reasonable accommodation due to a disability, please submit a request via the Accommodation Request Form (www.yahooinc.com/careers/contact-us.html) or call +1.866.772.3182. Requests and calls received for non-disability related issues, such as following up on an application, will not receive a response.

We believe that a diverse and inclusive workplace strengthens Yahoo and deepens our relationships. When you support everyone to be their best selves, they spark discovery, innovation and creativity. Among other efforts, our 11 employee resource groups (ERGs) enhance a culture of belonging with programs, events and fellowship that help educate, support and create a workplace where all feel welcome.

The compensation for this position ranges from $96,000.00 - $200,000.00/yr and will vary depending on factors such as your location, skills and experience.The compensation package may also include incentive compensation opportunities in the form of discretionary annual bonus or commissions. Our comprehensive benefits include healthcare, a great 401k, backup childcare, education stipends and much (much) more.

Currently work for Yahoo? Please apply on our internal career site.

About Us

Yahoo serves as a trusted guide for hundreds of millions of people globally, helping them achieve their goals online through our portfolio of iconic products. For advertisers, Yahoo Advertising offers omnichannel solutions and powerful data to engage with our brands and deliver results.

Similar Jobs

Notion - Software Engineer, Mail (Frontend)

Notion

San Francisco, California, United States (On-Site)
2 Months ago
Patreon - Senior Data Scientist

Patreon

San Francisco, California, United States (Hybrid)
4 Months ago
FlockSafety - Traveling Installation Technician

FlockSafety

Utica, New York, United States (Remote)
3 Weeks ago
Rolls-Royce - PSB Portfolio & Project Planner

Rolls-Royce

Singapore (On-Site)
3 Weeks ago
Aptive - Tooling Engineer

Aptive

Quimistán, Santa Bárbara Department, Honduras (On-Site)
1 Year ago
Lambda - Data Center Operations Engineer

Lambda

Atlanta, Georgia, United States (On-Site)
1 Month ago
Normalyze - Performance Test - Senior Engineer - Solutions - Data Security - India

Normalyze

Bengaluru, Karnataka, India (Remote)
8 Months ago
P99 soft - Data Engineer

P99 soft

Hyderabad, Telangana, India (On-Site)
3 Months ago
Ion - Internship - Data Science

Ion

Pisa, Tuscany, Italy (On-Site)
10 Months ago
dun bradstreet - Senior Principal Data Scientist, AaaS

dun bradstreet

Frankfurt Am Main, Hessen, Germany (Hybrid)
2 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

Postman - People Ops Communication Contractor

Postman

San Francisco, California, United States (Hybrid)
3 Months ago
Kin. - Director of Growth Marketing

Kin.

United States (Remote)
3 Weeks ago
Google - Associate Android Auto Partner Engineer, gReach Program

Google

Seoul, South Korea (On-Site)
3 Months ago
Super.com - Senior Software Engineer, Payments

Super.com

Canada (Remote)
8 Months ago
BioFire - IS Business Analyst Intern

BioFire

St. Louis, Missouri, United States (On-Site)
1 Month ago
TransUnion - Product Marketing Manager – Specialized Risk Group (SRG)

TransUnion

Alpharetta, Georgia, United States (Hybrid)
3 Weeks ago
HappyFox - Product Manager

HappyFox

Bengaluru, Karnataka, India (On-Site)
1 Year ago
CD PROJEKT RED - IT Director

CD PROJEKT RED

Boston, Massachusetts, United States (On-Site)
2 Months ago
Paytm - Product Operation - Assistant Manager - Lending

Paytm

Noida, Uttar Pradesh, India (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Jobs in United States

Twitch - Senior Security Engineer

Twitch

San Francisco, California, United States (On-Site)
1 Month ago
Playstation - Staff Software Engineer

Playstation

San Mateo, California, United States (On-Site)
3 Weeks ago
Warner Bros - NetherRealm Studios - Lead Software Engineer

Warner Bros - NetherRealm Studios

Troy, New York, United States (Remote)
2 Months ago
Alpha Sense - Senior Product Manager, AI Workflows

Alpha Sense

New York, United States (On-Site)
2 Months ago
Rackspace Technology - Communications and Content Development Manager

Rackspace Technology

San Antonio, Texas, United States (Hybrid)
1 Month ago
bytedance - Optical Scientist - Display Optics System

bytedance

San Jose, California, United States (On-Site)
5 Months ago
WebFX - Jr. Web Developer

WebFX

Harrisburg, Pennsylvania, United States (On-Site)
9 Months ago
NVIDIA - Senior VLSI Physical Design Integration Engineer

NVIDIA

Massachusetts, United States (On-Site)
3 Months ago
Axon - Pricing Analyst - New Product Introduction

Axon

Atlanta, Georgia, United States (Hybrid)
1 Month ago
Fearless - Program Manager

Fearless

Washington, District Of Columbia, United States (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

Data Analysis Jobs

Electronic Arts - Senior AI Data Scientist

Electronic Arts

Kirkland, Washington, United States (On-Site)
3 Months ago
Scopely - Senior Data Analyst, Ads

Scopely

Spain (Hybrid)
8 Months ago
Apple - Senior / Staff Data Infrastructure Engineer for Lakehouse, Apple Data Platform

Apple

Cupertino, California, United States (On-Site)
1 Month ago
TransUnion - Senior Consultant, Data Science and Analytics

TransUnion

Hong Kong (On-Site)
2 Months ago
Blackshark - Senior Software Engineer - Data Plane Team

Blackshark

(Remote)
2 Months ago
binance - Binance Accelerator Program - Data Analyst

binance

Dubai, Dubai, United Arab Emirates (Remote)
3 Years ago
Tesla - Data Engineer Internship

Tesla

North Holland, Netherlands (On-Site)
5 Months ago
GoTo Group - Data Engineer

GoTo Group

Jakarta, Indonesia (On-Site)
1 Month ago
ten square games - Data Scientist

ten square games

Wrocław, Lower Silesian Voivodeship, Poland (Hybrid)
3 Weeks ago
CommerceIQ - Software Development Engineer II - Data/Platform Team

CommerceIQ

Bengaluru, Karnataka, India (On-Site)
3 Months ago

Get notifed when new similar jobs are uploaded

About The Company

Yahoo serves as a trusted guide for hundreds of millions of people globally, helping them achieve their goals online through our portfolio of iconic products. For advertisers, Yahoo Advertising offers omnichannel solutions and powerful data to engage with our brands and deliver results.

United States (Hybrid)

United States (Hybrid)

United States (Remote)

United States (Hybrid)

United States (Hybrid)

View All Jobs

Get notified when new jobs are added by Yahoo

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug