How we built a media recommender with ChatGPT and without training data
Yuri Borisov, PhD
Yuri Borisov, PhD | Co-Founder & CEO at NextCreature / Randowise Oü | ChatGPT can do much more with the right software around it
All-in-one recommenders that give good suggestions for movies, tv shows, anime series, books (fiction and nonfiction) are not that common. In fact, we couldn’t find one, so why not create it ourselves?
Spoiler
Here is how the final version looks like: https://TopN.ai
Task definition: why it is hard to build media recommender
In the video below task definition and main technical challenges are briefly presented.
Traditionally, it was impossible to create a recommender without any training data / statistics / logs available. The task becomes even more challenging and training data requirements even more severe if we want to recommend different content types: movies, tv shows, books and so on. Fortunately, there is an alternative approach that does not rely on the training data - to leverage GPT models to extract the data needed.
Solution: how the recommender was actually built
Below is the video that walks you through the entire development process. First 3 minutes are devoted to introducing the concepts and outline the solution, in the remaining 10 minutes the actual development process is shown and explained.
Here are the key points
Analysis of the media recommender
Here is a video with our brief analysis of the recommender built.
It’s important to mention that once the recommender is built - meaning, all data generated and tag similarities calculated - no ChatGPT is needed for the recommender to work.
Our experience as well as the experience of our early users with the recommender is positive. However, not all collections recommended are perfect. In the next section we discuss limitations and future line of work.
Limitations
Below are some limitations of the current version and thoughts on improvements.
Precision
We use the “precision” term here to reflect how well recommended movies, tv shows, books match the specific collection title. For instance, if in the collection “Top 5 Alien Invasion Thrillers with Unique Tactics” we find “Forrest Gump” movie - that is a precision problem.
There are at least two sources of errors that impact precision
领英推荐
Recall
We use the “recall” term here to represent the variety of collections available. For example, if I want to find movies / anime / books about software engineers but no relevant collections show up - it’s a recall problem.
Here are at least two sources that impact recall (coverage) problem
Future work
There are a huge number of cool things that we are looking forward to implementing.?
Here, we restrict our imagination a bit and provide promising directions of work specifically related to recommenders.
More recommenders
If ChatGPT is familiar enough with the specific domain, one could employ the approach shown in the videos to build a recommender for this domain. Important point here is that one will not need any training data to do it!
Here are a list of recommenders that seems valuable and unique:
If you have in mind a recommender worth building, please share the idea in the comments. We plan to implement a number of recommenders in the near future, and we would prefer to implement the recommenders people really want.
Improving media recommender
It seems that relying on Tags for navigation is one the key components in our media recommender. But, what if we remove Tags altogether - what if we want the user to pick any content he/she likes (say, the “Matrix” movie and “Breaking Bad” tv show) and the system will recommend other relevant movies, books, tv shows, anime series. In this case, we remove the concept of Tags and work with “Items to Recommend” directly. It seems that this recommender may be more engaging and fun to interact with.
The good news are:
PS.?
We are in a relatively early stage, any thoughts, ideas, suggestions, feedback will be really helpful!?
And of course check the recommender: https://TopN.ai