Principal Data Engineer - Consumer Data AI / ML

1 Month ago • 10 Years + • Data Analysis • $136,125 PA - $283,750 PA

Job Summary

Job Description

Design, build, and optimize scalable data pipelines and infrastructure for advanced analytics and machine learning solutions. Collaborate with data scientists, engineers, and stakeholders to prepare and transform large datasets and to support end-to-end model development and deployment. Leverage expertise in cloud platforms, big data tools, and machine learning frameworks to drive innovation. Responsibilities include designing and maintaining scalable data pipelines and ETL processes on Google Cloud Platform (GCP); implementing and optimizing data storage solutions using GCP services; ensuring data quality, integrity, and security; and collaborating with teams to deliver actionable insights.
Must have:
  • 10+ years of software engineering experience
  • 4+ years of hands-on experience with Google Cloud Platform
  • Proven ability to design, build, and maintain data pipelines for ML/AI
  • Fluency with Java, Python, or Scala
  • SQL proficiency
  • Familiarity with data security and compliance
Good to have:
  • Experience with equivalent AWS services
  • Strong problem-solving skills
  • Excellent communication skills
Perks:
  • Flexible hybrid work options
  • Incentive compensation opportunities (discretionary annual bonus or commissions)
  • Comprehensive benefits package including healthcare
  • 401k
  • Backup childcare
  • Education stipends

Job Details

It takes powerful technology to connect our brands and partners with an audience of hundreds of millions of people. Whether you’re looking to write mobile app code, engineer the servers behind our massive ad tech stacks, or develop algorithms to help us process trillions of data points a day, what you do here will have a huge impact on our business—and the world.

Summary:

The ideal candidate will have strong AI/ML experience to design, build, and optimize scalable data pipelines and infrastructure that power advanced analytics and machine learning solutions. In this role, you will collaborate closely with data scientists, software engineers, and business stakeholders to prepare and transform large datasets, support end-to-end model development and deployment, and ensure robust, efficient, and secure data flows. You will leverage your expertise in cloud platforms, big data tools, and machine learning frameworks to drive innovation and deliver actionable insights that advance our organization’s AI initiatives and business objectives.

Responsibilities:

  • Design, build, and maintain scalable data pipelines and ETL processes to support machine learning and AI initiatives on Google Cloud Platform (GCP).

  • Implement and optimize data storage solutions using GCP services such as BigQuery, Cloud Storage, and Dataflow.

  • Ensure data quality, integrity, and security throughout the data lifecycle.

  • Collaborate with data scientists, analysts, and business stakeholders to understand data requirements and deliver actionable insights.

  • Monitor, troubleshoot, and maintain the health and performance of cloud-based data infrastructure.

  • Automate manual processes and repetitive tasks to improve efficiency and reduce errors.

  • Apply data governance and compliance best practices to protect sensitive information and meet regulatory standards.

  • Stay current with new GCP features, tools, and best practices to continuously enhance data management capabilities.

  • Document solutions, processes, and architectural decisions to facilitate knowledge sharing and maintainability.

Qualifications:

  • BS or MS in Computer Science or a related major, or equivalent experience

  • 10+ years of software engineering experience, with a strong emphasis on system design and backend development.

  • 4+ years of hands-on experience with the Google Cloud Platform ecosystem (BigQuery, Dataproc, Composer, Dataflow, Data Catalog, Observability) or the AWS equivalent.

  • Proven ability to design, build, and maintain data pipelines that support machine learning and AI model development, training, and deployment.

  • Fluency in at least one object-oriented programming language (Java, Python, or Scala) is highly desirable, as it is critical for developing robust applications and managing data workflows effectively. SQL proficiency is also valued for database operations.

  • Familiarity with data security, compliance, and governance best practices.

  • Strong problem-solving skills, attention to detail, and ability to work collaboratively with cross-functional teams.

  • Excellent communication skills, including the ability to tell insightful stories using data and to manage communication with internal teams and stakeholders.

The material job duties and responsibilities of this role include those listed above as well as adhering to Yahoo policies; exercising sound judgment; working effectively, safely and inclusively with others; exhibiting trustworthiness and meeting expectations; and safeguarding business operations and brand integrity.

At Yahoo, we offer flexible hybrid work options that our employees love! While most roles don’t require regular office attendance, you may occasionally be asked to attend in-person events or team sessions. You’ll always get notice to make arrangements. Your recruiter will let you know if a specific job requires regular attendance at a Yahoo office or facility. If you have any questions about how this applies to the role, just ask the recruiter!

Yahoo is proud to be an equal opportunity workplace. All qualified applicants will receive consideration for employment without regard to, and will not be discriminated against based on age, race, gender, color, religion, national origin, sexual orientation, gender identity, veteran status, disability or any other protected category. Yahoo will consider for employment qualified applicants with criminal histories in a manner consistent with applicable law. Yahoo is dedicated to providing an accessible environment for all candidates during the application process and for employees during their employment. If you need accessibility assistance and/or a reasonable accommodation due to a disability, please submit a request via the Accommodation Request Form (www.yahooinc.com/careers/contact-us.html) or call +1.866.772.3182. Requests and calls received for non-disability related issues, such as following up on an application, will not receive a response.

We believe that a diverse and inclusive workplace strengthens Yahoo and deepens our relationships. When you support everyone to be their best selves, they spark discovery, innovation and creativity. Among other efforts, our 11 employee resource groups (ERGs) enhance a culture of belonging with programs, events and fellowship that help educate, support and create a workplace where all feel welcome.

The compensation package may also include incentive compensation opportunities in the form of discretionary annual bonus or commissions. Our comprehensive benefits include healthcare, a great 401k, backup childcare, education stipends and much (much) more.

Currently work for Yahoo? Please apply on our internal career site.

About The Company

Yahoo serves as a trusted guide for hundreds of millions of people globally, helping them achieve their goals online through our portfolio of iconic products. For advertisers, Yahoo Advertising offers omnichannel solutions and powerful data to engage with our brands and deliver results.