Sr. Data Engineer

undefined ago • 6 Years + • Data Analysis • $128,250 PA - $266,875 PA

Job Summary

Job Description

Yahoo Mail is a leading consumer inbox with hundreds of millions of users, offering an organized and fast email experience. The Mail Analytics Engineering team builds mission-critical data systems, pipelines, warehouses, analytics, and ML/AI programs for the Communications business, including Yahoo Mail. This role involves working on data engineering infrastructures, pipelines, and next-generation Machine Learning- and AI-based data infrastructure. You will support new functionalities, mine data for insights, and address technical challenges in efficient query processing, large-scale stream processing, machine learning, and complex business rules within a petabyte-scale data environment.
Must have:
  • Develop new or improve existing data infrastructures for machine learning and deep learning
  • Implement algorithms and systems efficiently with other engineers
  • Take end-to-end ownership of Machine Learning-based distributed data systems
  • Develop complex queries, large volume data pipelines, and analytics applications
  • Develop software programs to solve analytics and data mining problems
  • Interact with stakeholders to understand requirements and deliver data solutions
  • Prototype new metrics or data systems
  • Lead data investigations to troubleshoot data issues
  • Maintain and improve released systems
  • Provide engineering consulting on large and complex warehouse data
  • BS/MS/PhD in Computer Science/Electrical Engineering or related disciplines
  • 6+ years of hands-on experience in data engineering
  • Strong fundamentals in algorithms, distributed computing, data structure, database
  • Fluency with Python, Java, and SQL
  • Self-driven, detail-oriented, teamwork spirit, excellent communication skills
  • Ability to multitask and manage expectations
Good to have:
  • Experience in Hadoop technologies (Map/Reduce, Pig, Hive, HBase, Storm, Spark, Kafka, Oozie)
  • Experience with Google Cloud Platform (BigQuery, Dataproc, Dataflow)
  • Experience with machine learning algorithms, NLP, and/or statistical methods
  • Experience in machine learning, analytics, data mining, or data mart and warehouse
  • Experience with Deep Learning platforms (Tensorflow/Keras/Spark MLlib) and SQL/Unix/Shell
Perks:
  • Flexible hybrid work options
  • Healthcare
  • 401k
  • Backup childcare
  • Education stipends

Job Details

Yahoo Mail is the ultimate consumer inbox with hundreds of millions of users. It’s the best way to access your email and stay organized from a computer, phone or tablet. With its beautiful design and lightning fast speed, Yahoo Mail makes reading, organizing, and sending emails easier than ever.

A Little About Us

Yahoo makes the world’s daily habits inspiring and entertaining. By creating highly personalized experiences for our users, we keep people connected to what matters most to them, across devices and around the world. Yahoo’s vast businesses span across Search, Communications, Media, and many other verticals.

Yahoo generates terabytes of data every day and it is critical to collect, manage and process data at petabyte scale to provide timely and accurate insights to executives, sales, product managers and product developers on all aspects of user interaction.

The Mail Analytics Engineering team at Yahoo is responsible for building mission critical data systems, pipelines, warehouses, analytics systems, and Machine Learning/AI/data mining programs for the Communications business, which includes Yahoo Mail, with 200M monthly active users. We are constantly pushing the envelope of data platforms due to the insane amount of data we need to harness.

A Lot About You

As part of the Mail Analytics Engineering team, you will be working on data engineering infrastructures, pipelines and next generation Machine Learning- and AI-based data infrastructure, supporting new functionalities on existing platforms, and mining data for analytics insights and product features.

Our Big Data footprints are among the largest few in the world, at double-digit petabyte scale. Developing this infrastructure presents many technical challenges in the areas of efficient query processing, large-scale stream processing, machine learning and modeling, as well as satisfying complex business rules.

If you are someone who is passionate about harnessing data at insane scale, enjoys working with new technologies, setting up petabyte data infrastructures and implementing new machine learning solutions and metrics systems, we want to hear from you!

Your Day

  • Develop new or improve existing data infrastructures for data processing machine learning, and deep learning using your core expertise
  • Work with other engineers to implement algorithms and systems in an efficient way
  • Take end to end ownership of Machine Learning-based distributed data systems - from data and training pipelines, to real time data serving engines.
  • Develop complex queries, very large volume data pipelines, and analytics applications
  • Develop complex queries and software programs to solve analytics and data mining problems
  • Interact with data analysts, data scientists, product managers, and software engineers to understand business problems, technical requirements to deliver data solutions
  • Prototype new metrics or data systems
  • Lead data investigations to troubleshoot data issues that arise along the data pipelines
  • Maintenance and improvement of released systems
  • Engineering consulting on large and complex warehouse data

You Must Have

  • BS/MS/PhD in Computer Science/Electrical Engineering, or related engineering disciplines, ideally with specialization in Data Engineering or Machine Learning
  • 6+ years of hands-on experience in relevant fields, including data engineering
  • Strong fundamentals: algorithms, distributed computing, data structure, database
  • Fluency with: Python/Java/SQL
  • Self-driven, challenge-loving, detail oriented, teamwork spirit, excellent communication skills, ability to multitask and manage expectations

Preferred

  • Experience in Hadoop technologies (Map/Reduce, Pig, Hive, HBase, Storm, Spark, Kafka, Oozie).
  • Experience with Google Cloud Platform (BiqQuery, Dataproc, Dataflow, etc.) a big plus
  • Experience with machine learning algorithms, NLP, and/or statistical methods a big plus
  • Experience in any of: machine learning, analytics, data mining, or data mart and warehouse
  • Experience with Deep Learning platforms (Tensorflow/Keras/Spark MLlib) and SQL/Unix/Shell

Similar Jobs

Tesla - Vehicle Preparer/Receptionist

Tesla

Hanover, Lower Saxony, Germany (On-Site)
5 Months ago
WebMD - Executive Director, HCP Omnichannel Content Innovation

WebMD

Newark, New Jersey, United States (On-Site)
9 Months ago
Avalanche Studios Group - Senior Gameplay Programmer

Avalanche Studios Group

Stockholm, Stockholm County, Sweden (Hybrid)
1 Month ago
Evolution  - In- Studio LIVE Game Presenter - Full Benefits, $20 - $25/hr- NO EXPERIENCE NECESSARY

Evolution

Atlantic City, New Jersey, United States (On-Site)
1 Year ago
Dentsu - Senior Salesforce Business Analyst

Dentsu

Bengaluru, Karnataka, India (On-Site)
1 Month ago
bytedance - Risk Data Analytics Business Partner - E-Commerce - Seattle

bytedance

Seattle, Washington, United States (On-Site)
9 Months ago
Ubisoft - Senior ML Data Scientist

Ubisoft

Montreal, Quebec, Canada (On-Site)
6 Months ago
Morning Star - Data Analyst Intern - Fluent in Italian

Morning Star

Madrid, Community Of Madrid, Spain (Hybrid)
3 Weeks ago
Sleeper - Data Scientist

Sleeper

San Francisco, California, United States (Remote)
3 Months ago
Ion - M&A Junior Data Analyst

Ion

Mumbai, Maharashtra, India (On-Site)
8 Months ago

Get notifed when new similar jobs are uploaded

Similar Skill Jobs

eBay - Lead Technical Program Manager

eBay

Portland, Oregon, United States (Hybrid)
2 Months ago
EveryMatrix - HR Administrator

EveryMatrix

Batumi, Adjara, Georgia (On-Site)
3 Months ago
Infosys - Senior .NET Full Stack Developer with React or Angular

Infosys

Alpharetta, Georgia, United States (On-Site)
3 Months ago
Toast - Inside Account Executive

Toast

London, England, United Kingdom (Hybrid)
1 Month ago
PayPal - Senior Manager, Product Growth

PayPal

San Jose, California, United States (On-Site)
4 Weeks ago
Interactive Brokers - Software Developer

Interactive Brokers

Greenwich, Connecticut, United States (Hybrid)
1 Month ago
Sawhorse Productions - Senior Roblox Developer

Sawhorse Productions

Los Angeles, California, United States (Remote)
4 Months ago
HCL Tech - Kotlin Technical Specialist

HCL Tech

Texas, United States (On-Site)
3 Months ago
Electronic Arts - IT Strategy & Partner Director

Electronic Arts

Orlando, Florida, United States (Hybrid)
3 Months ago
Comscore - Office Coordinator - Part Time

Comscore

Amsterdam, North Holland, Netherlands (On-Site)
3 Weeks ago

Get notifed when new similar jobs are uploaded

Jobs in United States

UPF Industries  - Assembler I (1st & 2nd shift)

UPF Industries

Clinton, North Carolina, United States (On-Site)
2 Months ago
sphere entertainment - Benefits Specialist

sphere entertainment

Las Vegas, Nevada, United States (On-Site)
1 Month ago
Granicus - SLED Account Executive - Utilities

Granicus

United States (Remote)
3 Months ago
Nintendo - Senior Ambassador - Nintendo San Francisco

Nintendo

San Francisco, California, United States (On-Site)
9 Months ago
Highspot - Associate Engineer Internship

Highspot

Seattle, Washington, United States (Hybrid)
3 Months ago
Games For Love - Esports Streamer

Games For Love

Washington, United States (Remote)
4 Months ago
Everi - Lead Customer Service Specialist I

Everi

Las Vegas, Nevada, United States (Hybrid)
1 Month ago
Egnyte - Senior Director, Field Marketing

Egnyte

Raleigh, North Carolina, United States (On-Site)
1 Month ago
Figma - Email Marketing Manager

Figma

San Francisco, California, United States (Remote)
1 Month ago
Moloco - Senior Product Manager

Moloco

Redwood City, California, United States (On-Site)
2 Months ago

Get notifed when new similar jobs are uploaded

Data Analysis Jobs

Head Digital Works - Data Scientist

Head Digital Works

Hyderabad, Telangana, India (On-Site)
1 Year ago
Gameopedia - Data Scientist

Gameopedia

Norway (Hybrid)
4 Months ago
Embark Studios - Data Scientist - Games

Embark Studios

Stockholm, Stockholm County, Sweden (On-Site)
3 Months ago
Toast - Software Engineer II - Data Platform Team

Toast

Dublin, County Dublin, Ireland (Hybrid)
1 Month ago
London stock Exchange - Data Scientist

London stock Exchange

Bangkok, Thailand (On-Site)
1 Month ago
CyberArk - Software Engineer (Data Platform)

CyberArk

Santa Clara, California, United States (Hybrid)
1 Month ago
The Workshop - Data Software Engineer

The Workshop

Madrid, Community Of Madrid, Spain (Hybrid)
4 Months ago
endava - Senior Data Engineer

endava

Brașov, Brașov, Romania (On-Site)
2 Months ago
Novoroma - Data Scientist

Novoroma

(Remote)
2 Years ago
HHA Exchange - Data Analyst

HHA Exchange

New York, New York, United States (Hybrid)
1 Month ago

Get notifed when new similar jobs are uploaded

About The Company

Yahoo serves as a trusted guide for hundreds of millions of people globally, helping them achieve their goals online through our portfolio of iconic products. For advertisers, Yahoo Advertising offers omnichannel solutions and powerful data to engage with our brands and deliver results.

United States (Hybrid)

United States (Remote)

United States (Hybrid)

United States (Hybrid)

United States (Hybrid)

United States (Hybrid)

View All Jobs

Get notified when new jobs are added by Yahoo

Level Up Your Career in Game Development!

Transform Your Passion into Profession with Our Comprehensive Courses for Aspiring Game Developers.

Job Common Plug