Principal Machine Learning Engineer - Large Scale Embedding

3 Hours ago • 15 Years + • $276,700 PA - $387,400 PA

Job Summary

Job Description

The Principal Machine Learning Engineer will lead the design and architecture of GNN and transformers based multi-entity embedding generation. They will actively participate in the end-to-end implementation process, including enabling efficient distributed training and serving for such architectures, shaping the future of recommendation systems at Reddit. The role involves defining the technical roadmap, collaborating with cross-functional partners, developing and optimizing large-scale graph-based machine learning pipelines, architecting scalable and efficient recommendation models, collaborating with business units, collaborating with ML infrastructure teams, leading feature engineering efforts, mentoring the team, and contributing to team and product strategy.
Must have:
  • 15+ years of Technical Leadership Experience
  • Expertise in Graph Neural Networks
  • Understanding of graph theory and network science
  • Strong coding skills in Python
  • Experience with ML frameworks like PyTorch Geometric, DGL, TensorFlow, and scikit-learn
  • Understanding of ML infrastructure components
Perks:
  • Comprehensive Healthcare Benefits and Income Replacement Programs
  • 401k Match
  • Family Planning Support
  • Gender-Affirming Care
  • Mental Health & Coaching Benefits
  • Flexible Vacation & Reddit Global Days off
  • Generous paid Parental Leave
  • Paid Volunteer time off

Job Details

Reddit is a community of communities. It’s built on shared interests, passion, and trust and is home to the most open and authentic conversations on the internet. Every day, Reddit users submit, vote, and comment on the topics they care most about. With 100,000+ active communities and approximately 101M+ daily active unique visitors, Reddit is one of the internet’s largest sources of information. For more information, visit redditinc.com.

The LS Embedding team will focus on building highly expressive multi-entity large scale embeddings exploring architectures beyond standard two-tower approaches to enhance our recommendation systems at Reddit. This entails modelling various compound interactions and relationships between users and entities they interact with using Graphs and exploring Graph neural networks and transformers to encode them.

We are looking for a Principal Machine Learning Engineer to lead the design and architecture of GNN and transformers based multi-entity embedding generation actively participating in end-to-end implementation process including enabling efficient distributed training and serving for such architectures shaping the future of recommendation systems at Reddit.

If applying ML / AI in production to improve Reddit Relevance excites you, then you’ve found the right place.

RESPONSIBILITIES:

  • Lead the team that architects and designs GNN and transformers based multi-entity embedding generation.
  • Define the technical roadmap and plan of execution in collaboration with Xfn partners.
  • Develop and optimize large-scale graph-based machine learning pipelines for recommendation systems.
  • Architect scalable and efficient GNN and transformers-based recommendation models that can process complex, interconnected data structures.
  • Collaborate with cross functional business units such as Ads teams leveraging the models for upstream functions and improve relevance metrics.
  • Collaborate with ML Infrastructure teams to enable distributed GPU based training and online serving architecture
  • Lead feature engineering efforts to identify and curate expressive raw data to be used for creating embeddings
  • Be a mentor and cross-functional advocate for the team
  • Contribute towards team and product strategy, operations and execution at Reddit.

QUALIFICATIONS:

  • 15+ years of Technical Leadership Experience
  • Proven ability to lead ML initiatives, mentor engineers, and communicate complex concepts to cross-functional teams.
  • Expertise in Graph Neural Networks, collaborative filtering, knowledge graphs, and deep learning for recommendations.
  • Understanding of graph theory, network science, and representation learning technique
  • Strong coding skills in Python and experience with ML frameworks like PyTorch Geometric (PyG), Deep Graph Library (DGL), TensorFlow, and scikit-learn.
  • Solid understanding of ML infrastructure components and libraries (data parallel, model parallel, pipeline parallel, torch.inductor, model pruning, etc.) enabling efficient distributed training and inference.

Benefits:

  • Comprehensive Healthcare Benefits and Income Replacement Programs
  • 401k Match
  • Family Planning Support
  • Gender-Affirming Care
  • Mental Health & Coaching Benefits
  • Flexible Vacation & Reddit Global Days off
  • Generous paid Parental Leave  
  • Paid Volunteer time off

 

 

Similar Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Similar Skill Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Jobs in Worldwide

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!

Similar Category Jobs

Looks like we're out of matches

Set up an alert and we'll send you similar jobs the moment they appear!