登录查看更多内容

The Anatomy of Large-Scale Recommender Systems

Jo Kristian Bergum

Retrieval Evangelist

发布日期: 2025年1月20日

Modern real-time recommender systems power many of today's most engaging platforms. While TikTok's implementation recently gained attention due to its US operation shutdown, similar architectures drive recommendations across social media, streaming, and e-commerce platforms. Here's what these systems typically look like under the hood.

The defining characteristic of modern recommender systems is their real-time nature. Every user interaction—a scroll, pause, or skip—immediately influences subsequent recommendations. This continuous feedback loop creates systems that adapt to user preferences within single sessions rather than relying on pre-computed, static recommendations. TikTok was probably one of the first online services that nailed the real-time aspect, adjusting the feed quickly based on the real-time context feedback.

Typical Serving Architecture

These systems typically employ a multi-stage serving architecture to handle billions of items and users while maintaining millisecond-level response times at low cost.

System overview of a modern online recommender system

Candidate Retrieval

Two-tower architectures dominate the retrieval phase, with separate neural networks for users and items enabling fast similarity search. This embedding-based approach generates initial candidates by combining real-time signals like trending content and recent uploads. The item embeddings are usually frozen, but the user/context embedding can be adjusted based on real-time feedback. As the illustration above shows, there might be multiple parallel retrieval calls.

领英推荐

OpenAI's Exit, Google’s Gemini Shift, AI-Driven…

The AI Journal 4 个月前

Dash Club 11: Plotly Turns 10, Dash-ChatGPT App…

Plotly 1 年前

OpenAI and Microsoft: Symbiotic or future frenemies?

Constellation Research, Inc. 8 个月前

Cascade Ranking

Retrieved candidates flow through a cascade of increasingly sophisticated models. Light models filter thousands of candidates first, followed by more complex models for final ranking. This staged approach balances computational efficiency with recommendation quality.

System Evolution

As these systems scale, they often follow a consistent pattern: They start with compute-heavy models during the service bootstrap phase and then transition to lighter, more efficient models trained on rich interaction data. This evolution reflects the fundamental trade-off between compute & storage costs and recommendation quality at scale. It comes down to $ per user versus the ad revenue per user.

Traditional caching provides minimal benefit in these systems because user preferences are unique per user. These systems typically produce recommendations per user view, which can drive significant traffic compared to search systems where users must type a query.

IMHO: TikTok's key innovation wasn't just sophisticated models (THE ALGO)—it was building a recommender system that learns from every interaction in real-time. Unlike traditional batch-oriented systems, TikTok's algorithm instantly adapts to each pause, swipe, and skip, creating an addictively responsive experience. This real-time learning approach transformed what users expect from social platforms and set a new standard for recommendation systems.

If you want to dive more into the details of TikTok, ByteDance, the company behind the service, published this paper in 2022:?Monolith: Real-Time Recommendation System With Collisionless Embedding Table.

Andreas Eriksen

Principal Software Engineer @ Vespa.ai

1 个月

This illustration seems to be missing, I found it in your twitter thread though :)

4 次回应

要查看或添加评论，请登录

Jo Kristian Bergum的更多文章

From ML Teams to API Calls: The Illusion of Simplicity

2025年2月6日

From ML Teams to API Calls: The Illusion of Simplicity

What once required dedicated machine learning teams, months or years of data collection, and complex training pipelines…

4 条评论
Why AI Agents Are Forcing Enterprises to Rethink Retrieval Investments

2025年1月27日

Why AI Agents Are Forcing Enterprises to Rethink Retrieval Investments

Enterprise search tools lingered in the background for decades—a minor employee efficiency booster with a low…

2 条评论
Why AI Giants Are Suddenly Obsessed With Enterprise Search

2025年1月13日

Why AI Giants Are Suddenly Obsessed With Enterprise Search

The AI giants have a critical weakness. Their frontier models, trained on vast internet data, fail in enterprise…

6 条评论
2024: The rise and fall of the vector database infrastructure category

2025年1月3日

2024: The rise and fall of the vector database infrastructure category

I've spent the last few years watching embedding technologies transform from Big Tech's "secret sauce" into everyday…

14 条评论
Stop Using Vector Indexes (When You Don't Need Them)

2024年11月11日

Stop Using Vector Indexes (When You Don't Need Them)

Here's an article that might save you thousands of $ per day: Your vector search use case probably doesn't need that…

9 条评论
A Practical Guide to Benchmarking Search Systems

2024年11月8日

A Practical Guide to Benchmarking Search Systems

In my early career days, I overheard a senior engineer saying that "we should deploy these systems well below the knee…

2 条评论
Shrink Your Embeddings: Slashing Costs with MRL and BQL

2024年10月18日

Shrink Your Embeddings: Slashing Costs with MRL and BQL

Let's face it: vector embeddings are fantastic for many tasks, but if you've ever worked with large-scale vector…

2 条评论
Why separating compute from storage is a bad idea for late interaction models like ColPali

2024年10月18日

Why separating compute from storage is a bad idea for late interaction models like ColPali

While late-interaction models offer compelling benefits, naive implementations can lead to severe performance…

2 条评论

See all articles

The Anatomy of Large-Scale Recommender Systems

Jo Kristian Bergum

Retrieval Evangelist

Typical Serving Architecture

Candidate Retrieval

领英推荐

Cascade Ranking

System Evolution

Jo Kristian Bergum的更多文章

社区洞察

其他会员也浏览了

How Azure Innovate & Bizmetric Expertise Supercharge AI-Powered Intelligent App Development

Azure OpenAI Deployment Options and Availability

Microsoft Reaches All-Time High Amid Growing OpenAI-Related Optimism

Revolutionizing Recommender Systems: Beyond the Myopic View (InDepth Review)

Transparency, Privacy, and Fairness in Recommender Systems: Insights from Dr. Dominik Kowald at SFI MediaFutures

Challenges Faced while Building Personalisation Engines

Train and Deploy Google?Cloud's Two Towers Recommender

Why is the TikTok Recommender System so Good?

Unpacking The OpenAI Meltdown

5 challenges you face when building cutting edge class recommender systems.

Typical Serving Architecture

Candidate Retrieval

领英推荐

Cascade Ranking

System Evolution

Jo Kristian Bergum的更多文章

From ML Teams to API Calls: The Illusion of Simplicity

Why AI Agents Are Forcing Enterprises to Rethink Retrieval Investments

Why AI Giants Are Suddenly Obsessed With Enterprise Search

2024: The rise and fall of the vector database infrastructure category

Stop Using Vector Indexes (When You Don't Need Them)

A Practical Guide to Benchmarking Search Systems

Shrink Your Embeddings: Slashing Costs with MRL and BQL

Why separating compute from storage is a bad idea for late interaction models like ColPali

社区洞察

其他会员也浏览了

How Azure Innovate & Bizmetric Expertise Supercharge AI-Powered Intelligent App Development

Azure OpenAI Deployment Options and Availability

Microsoft Reaches All-Time High Amid Growing OpenAI-Related Optimism

Revolutionizing Recommender Systems: Beyond the Myopic View (InDepth Review)

Transparency, Privacy, and Fairness in Recommender Systems: Insights from Dr. Dominik Kowald at SFI MediaFutures

Challenges Faced while Building Personalisation Engines

Train and Deploy Google?Cloud's Two Towers Recommender

Why is the TikTok Recommender System so Good?

Unpacking The OpenAI Meltdown

5 challenges you face when building cutting edge class recommender systems.