登录查看更多内容

#25 - How does Reddit’s Personalization Model work

Ritvvij Parrikh

Building Stuff

发布日期: 2024年12月27日

Originally published in The Times of India.

In the previous post, we discussed how X or Twitter’s Personalization Model works. This post will attempt to juxtapose Reddit’s personalization model against X’s so that we develop a coherent understanding of personalization models.?

How X & Reddit are similar: X and Reddit both thrive on user-generated, community-driven content and engagement algorithms, enabling real-time discussions and content virality. In fact, Reddit is the only other social network that Apple’s App Store also classifies as a news product.

How X & Reddit are different: Reddit is a network of 100k subreddits centered on shared interests, while X emphasizes individual user accounts providing brief, real-time updates.

Content Depth: Reddit fosters in-depth discussions and analyses; X focuses on short-form content.
User Identity: Reddit users are mostly anonymous, while X often features public figures and influencers.
Feed Design: Reddit’s feed aggregates posts from joined communities; X’s feed highlights individual trends and viral posts.
Interaction Style: Reddit prioritizes shared community dialogue, while X thrives on concise, personal expression.
Reddit’s DAU is 52 million, while X’s is 245 million.

Historically, personalization on Reddit:?

Reddit used to recommend subreddits (communities) to help users discover new niches of interest. However, users complained that these recommendations were distracting on the feed, which is meant to display content directly, not the communities that host the content.?
In contrast, the feed itself was made using posts within communities using simple Analytics and If-Then Business Rules —?Hot, New, Top, Controversial, Rising.

In July 2021, Reddit introduced a personalized feed: Instead of recommending subreddits, they started recommending posts directly in the user’s feed.

This is a critical detail because the primitive now shifted from subreddit to post.

Source: https://www.reddit.com/r/RedditEng/comments/158f8o3/evolving_reddits_feed_architecture/

How it works

With this context out of the way, let’s get into how they’ve built it by placing Reddit’s model in the six stages that we introduced in the previous blog — Twitter’s Personalization Model:

Selecting from the Corpus
Candidate Generation
Filtering
Scoring or Ranking
Re-Ranking or Mixing
Serving

1. Selection from the Corpus

The system starts with all Reddit submissions from the past 24 hours.

2. Candidate Generation

It then uses machine learning to identify posts from subreddits you’ve joined, subreddits similar to those you’ve joined, or subreddits you’ve visited recently. For diversity, it also recommends posts from subreddits that are popular or geographically popular.

领英推荐

A fast-evolving social media landscape, fixing Sonos…

Hindustan Times 7 个月前

Twitter Rebrands to "X", Threads Struggles with User…

Digital Women ? 1 年前

Top Highlights from December 10-16!

BAL 2 个月前

3. Filtering

They remove posts that are:

Spam, deleted, removed, hidden, or promoted
Posts the user has already seen
Posts from subreddits or topics that the user asked we show less of
Posts the user has hidden
Posts from authors the user has blocked

4. Scoring

A ML model assigns a weighted-score to each of the remaining posts by probability of click (CTR), propensity of joining (or leaving) the subreddit, propensity of commenting or upvoting/downvoting the post and watch probability if the post has a video.

Below are some interesting quotes from Reddit blogs:

Multi-task models have become particularly important at Reddit. Users engage with content in many ways, with many content types, and their engagement tells us what content and communities they value.

This type of training also implicitly captures negative feedback - content the user chose not to engage with, downvotes, or communities they unsubscribe from.?

These probabilities can be used to estimate long term measures such as retention.

5. Re-ranking

At this point, Reddit doesn’t blindly always put the posts with the highest score at the top. Instead, they use sampling to inject:

Diverse posts so you can explore new topics
Unpredictable posts so the feed doesn’t become monotonous?
Low-ranking posts so less popular or niche subreddits also have a chance

The feed is curated to avoid showing too many similar posts in a row. Even if several posts have high scores, they might be spaced apart to enhance variety. Posts from different subreddits, topics, and formats (e.g., text, video, link) are interspersed to keep the feed engaging.

Conclusion

I will continue reviewing additional product literature on personalization models employed across various media products, but it is likely that the six stages mentioned above will remain applicable.

- - -

Curious how I’m managing to write? I created a CustomGPT for myself, which serves as my go-to editor and audits my first draft. Here’s the link—give it a spin! It’s free to use. https://chatgpt.com/g/g-hgI62sWPm-mediaflywheels-review-opinion-pieces

Want to republish it? This post was released under CC BY-ND — you can republish it as is with the following credit and backlinks: ‘Originally published by Ritvvij Parrikh on The Times of India. The author retains the copyright and any other ancillary rights to the post.

Media Flywheels

530 位关注者

要查看或添加评论，请登录

Ritvvij Parrikh的更多文章

#30 - The Agentic CMS

2025年2月24日

#30 - The Agentic CMS

With the rise of Generative AI, WordPress—and content management systems (CMSes) in general—are poised for a…
#29 - Thinkin: ‘Local Community Media’ as a ‘Trusted Club’

2025年1月18日

#29 - Thinkin: ‘Local Community Media’ as a ‘Trusted Club’

Originally published in The Times of India. In this ThinkIn, I spent a couple of hours with old-school traders.
#28 - Meditations: Turning Longform into Thought-Provoking Audio Shorts

2025年1月12日

#28 - Meditations: Turning Longform into Thought-Provoking Audio Shorts

Originally published in The Times of India. There’s something fascinating about the way deep thinkers—beat reporters…

1 条评论
#27 - Corporate Strategy to Incentivize Collaboration Across Business Units

2024年12月29日

#27 - Corporate Strategy to Incentivize Collaboration Across Business Units

Originally published in The Times of India. In the previous issue, we discussed how AI-driven media companies can…
#26 - Corporate Strategy to Incentivize Collaboration Across Functions

2024年12月28日

#26 - Corporate Strategy to Incentivize Collaboration Across Functions

Originally published in The Times of India. In earlier Media Flywheels issues, I discussed critical organizational…

3 条评论
#24 - How Bias in Data can Derail Self-Learning AI

2024年12月26日

#24 - How Bias in Data can Derail Self-Learning AI

Originally published in The Times of India. All well-built AI is self-learning in nature, i.

1 条评论
#23 - How does X or Twitter's Personalization Model work

2024年12月24日

#23 - How does X or Twitter's Personalization Model work

Originally published in The Times of India. Every major Big Tech product you use is powered by a recommender model.

1 条评论
#22 - Media’s Wicked Problem

2024年12月22日

#22 - Media’s Wicked Problem

Originally published in The Times of India. This post was originally a talk that I had given as part of WAN IFRA’s “AI…
#21 - Media Was Forced to Diversify Revenue Prematurely

2024年12月21日

#21 - Media Was Forced to Diversify Revenue Prematurely

Originally published in The Times of India. This article is part 7 of a series called ‘Reality Check on Media Strategy’.

5 条评论
#20 - Strategic Control Compromised

2024年12月20日

#20 - Strategic Control Compromised

Originally published in The Times of India. Walmart or DMart operate on razor-thin margins, yet they thrive because…

1 条评论

See all articles

#25 - How does Reddit’s Personalization Model work

Ritvvij Parrikh

Building Stuff

How it works

1. Selection from the Corpus

2. Candidate Generation

领英推荐

3. Filtering

4. Scoring

5. Re-ranking

Conclusion

Media Flywheels

530 位关注者

Ritvvij Parrikh的更多文章

社区洞察

其他会员也浏览了

From BlueSky to Instagram AI

Issue #26 - LinkedIn reaching 930 million users!

Social Media Industry Round-Up #50: New TikTok Messaging Options, Instagram Lives in Feed, Threads Keyword Search & Job Postings on Twitter (Sorry, X)

Is Microsoft Buying TikTok?!

This week: X allows blocked users to see posts, U.S tackles social media’s toll on mental health, and TikTok boosts search engine capabilities for ads

Instagram shares more algorithmic insights!

The Digital Appetizer

TikTok on the Clock:

TikTok strikes back against the US ban…

May 2024 Social Media News

How it works

1. Selection from the Corpus

2. Candidate Generation

领英推荐

3. Filtering

4. Scoring

5. Re-ranking

Conclusion

Media Flywheels

530 位关注者

Ritvvij Parrikh的更多文章

#30 - The Agentic CMS

#29 - Thinkin: ‘Local Community Media’ as a ‘Trusted Club’

#28 - Meditations: Turning Longform into Thought-Provoking Audio Shorts

#27 - Corporate Strategy to Incentivize Collaboration Across Business Units

#26 - Corporate Strategy to Incentivize Collaboration Across Functions

#24 - How Bias in Data can Derail Self-Learning AI

#23 - How does X or Twitter's Personalization Model work

#22 - Media’s Wicked Problem

#21 - Media Was Forced to Diversify Revenue Prematurely

#20 - Strategic Control Compromised

社区洞察

其他会员也浏览了

From BlueSky to Instagram AI

Issue #26 - LinkedIn reaching 930 million users!

Social Media Industry Round-Up #50: New TikTok Messaging Options, Instagram Lives in Feed, Threads Keyword Search & Job Postings on Twitter (Sorry, X)

Is Microsoft Buying TikTok?!

This week: X allows blocked users to see posts, U.S tackles social media’s toll on mental health, and TikTok boosts search engine capabilities for ads

Instagram shares more algorithmic insights!

The Digital Appetizer

TikTok on the Clock:

TikTok strikes back against the US ban…

May 2024 Social Media News