Experiments in summarization using LLMs
For many months I have been trying to stay on top of AI news, and it is damn hard! So much is happening at such a fast pace that before I can make sense of what happened last month, 10x more things happen in the current week. As a VC, I need to be on top of my game - informed about startup funding, product launches, new research papers, SOTA models, and new paradigm shifts (SFT, RFT, reasoning, etc. - the whole alphabet soup). And, more interestingly, the novel uses of AI that people are talking about, or the more intellectual debates: AGI, job losses, LLMs for India, AI sovereignty, and so on.
At last count I have subscribed to 60+ newsletters, and I follow some 100+ blogs, Twitter handles, YouTube channels, etc. It has become information overload - just reading takes up so much time that finding time to process the knowledge is hard. (Reading even one paper off arXiv takes at least an hour, if not more.)
So I needed a way out, and I felt the most efficient way was to code my way out of it. (I have been programming since 5th grade - BASIC games on a PC-AT in the 90s.) So I came up with the idea of scraping all these 100+ websites, 60+ newsletters, etc. Before OpenAI's Operator (CUA) and Anthropic's Computer Use came out in the last quarter, I was coding Playwright scripts with GPT-3.5 - and I can tell you it was hard. So I had to simplify, and I hit upon the idea of just subscribing to a lot of RSS feeds (wow, a two-decade-old technology coming in handy). But there was another problem: some of the interesting new stuff is not available on RSS, and most publications force you to come to the main website by putting only a small snippet in their RSS feeds.
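The RSS route is far simpler than browser automation - a minimal sketch with only the standard library, assuming the feed follows the common RSS 2.0 layout (real feeds vary, and Atom uses different element names):

```python
import xml.etree.ElementTree as ET

def parse_rss(xml_text: str) -> list[dict]:
    """Extract title/link/snippet from an RSS 2.0 feed string."""
    root = ET.fromstring(xml_text)
    items = []
    for item in root.iter("item"):
        items.append({
            "title": item.findtext("title", default="").strip(),
            "link": item.findtext("link", default="").strip(),
            # many publications only ship a teaser here, not the full article
            "snippet": item.findtext("description", default="").strip(),
        })
    return items

sample = """<rss version="2.0"><channel>
  <item><title>New SOTA model</title>
        <link>https://example.com/post</link>
        <description>Only a teaser snippet...</description></item>
</channel></rss>"""

for entry in parse_rss(sample):
    print(entry["title"], "->", entry["link"])
```

The snippet-only problem shows up exactly in that `description` field: when it holds a teaser, you still have to fetch the `link` to get the full text.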
So finally, three weeks ago, inspiration struck!
I realized that a lot of the interesting newsletters (Substack, Medium, etc.) are coming into my inbox anyway. Why not just parse them and collect insights from them? I can tell you now that a version producing human-acceptable output is working. But I have stepped into the minefield of 100K-token context windows not being enough, LLM evals, prompt engineering, APIs vs Ollama, GPT vs reasoning models - and who knew markdown handling is also not easy?
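The inbox-parsing step itself is mostly solved by the standard library. A sketch, assuming messages are already fetched (e.g. via IMAP) as raw strings; it walks a MIME message and prefers the text/plain part:

```python
from email import message_from_string
from email.message import Message

def extract_body(raw: str) -> str:
    """Pull the text/plain body out of a raw RFC 822 email message."""
    msg: Message = message_from_string(raw)
    if msg.is_multipart():
        for part in msg.walk():
            if part.get_content_type() == "text/plain":
                payload = part.get_payload(decode=True)
                return payload.decode(
                    part.get_content_charset() or "utf-8", errors="replace")
        return ""  # HTML-only newsletter: needs an HTML-to-text pass instead
    return msg.get_payload()

raw = (
    "From: ai-news@example.com\n"
    "Subject: Weekly AI digest\n"
    "Content-Type: text/plain; charset=utf-8\n"
    "\n"
    "OpenAI shipped something again.\n"
)
print(extract_body(raw))
```

Many newsletters are HTML-only, so the fallback branch is where the real work (and the markdown pain mentioned above) begins.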
I plan to open-source the project. But the most interesting architecture debate I ended up with is the cost vs quality vs coding-speed trade-off between three approaches: a single summarization pass over everything, one generic "golden" prompt, or custom prompts per newsletter.
I can tell you that the first approach is the current MVP - it works OK for all the newsletters I receive in 24 hours, which currently amount to around 70K tokens (~55,000 words across 12 newsletters on average). But the approach goes for a toss if you want to do a weekly review - context length issues. Even within a daily review, some of the interesting things get missed - I have seen that using o1 vs 4o is a dramatic improvement. (But of course, how much are you willing to spend for a daily newsletter of just-OK quality - 10 cents or 50 cents?)
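The weekly-review overflow can at least be detected and worked around by batching before calling the model. A rough sketch: the 4-characters-per-token heuristic is an approximation (a tokenizer like tiktoken gives exact counts), and the greedy packing keeps each newsletter whole:

```python
def estimate_tokens(text: str) -> int:
    # rough heuristic: ~4 characters per token for English prose
    return max(1, len(text) // 4)

def chunk_by_budget(docs: list[str], budget: int = 15_000) -> list[list[str]]:
    """Greedily pack whole newsletters into batches under a token budget."""
    batches, current, used = [], [], 0
    for doc in docs:
        cost = estimate_tokens(doc)
        if current and used + cost > budget:
            batches.append(current)
            current, used = [], 0
        current.append(doc)
        used += cost
    if current:
        batches.append(current)
    return batches

newsletters = ["word " * 5000, "word " * 5000, "word " * 2000]
print([len(b) for b in chunk_by_budget(newsletters, budget=10_000)])  # → [1, 2]
```

Each batch gets summarized separately, and the per-batch summaries are then summarized once more - a standard map-reduce pattern, at the cost of one extra model pass.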
I have tried approach two as well, but it is not working well at all. The challenge is that newsletter contents are varied, and the golden prompt is unable to capture that heterogeneity - so the output is a bland summary. The newsletters contain funding rounds, research papers, product launches, news, etc. I also tried an OpenAI-suggested recipe from their cookbook, but it wasn't good at all - actually quite bad. (I was told the recipe was meant for the turbo-models era!)
So I'm currently working on the tedious third approach of creating custom prompts for each newsletter - and man, that is hard. It involves multiple iterations with GPT (again, o1 is better than 4o) and a bunch of wrangling with Cursor (I want to reduce tokens by doing a bit of parsing of the HTML newsletters).
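The token-trimming pre-parse can start as simply as stripping tags and dropping script/style blocks with the stdlib `HTMLParser` - a sketch only; real newsletters also need boilerplate, footers, and tracking links removed:

```python
from html.parser import HTMLParser

class TextExtractor(HTMLParser):
    """Collect visible text, skipping <script> and <style> contents."""
    SKIP = {"script", "style"}

    def __init__(self):
        super().__init__()
        self.parts = []
        self._skip_depth = 0

    def handle_starttag(self, tag, attrs):
        if tag in self.SKIP:
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in self.SKIP and self._skip_depth:
            self._skip_depth -= 1

    def handle_data(self, data):
        if not self._skip_depth and data.strip():
            self.parts.append(data.strip())

def html_to_text(html: str) -> str:
    parser = TextExtractor()
    parser.feed(html)
    return " ".join(parser.parts)

html = "<html><style>p{color:red}</style><body><p>Funding round: $10M</p></body></html>"
print(html_to_text(html))  # → Funding round: $10M
```

Since HTML markup often outweighs the visible text several times over in marketing emails, this pass alone can cut the token bill substantially before any prompt runs.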
If you have any suggestions feel free to comment.
Solving for Neurological Disorders | 3x Founder | 2x Exit | Love to Discuss about AI, Products, MedTech & Startups
2w - I faced the same problem. Tried doing what you did, but then you miss out on information that falls under the sponsored, tools, or recruitment sections of the newsletter. The solution is to subscribe to the top 5 newsletters only - I know it's daunting, but there is no other choice. It's an 80-20 game where you extract 80% of the info by subscribing to just 20% of the things. For papers there is a section on Hugging Face called papers of the day where you get the top 10 to 20 papers. Subscribe to 4 to 5 online creators. That's it. At the end of the day, you are a human.
Enterprise Solutions Architect and Product Manager | MBA @ ESADE | Ex-Healthtech Startup CTO | Ex-President, ESADE Data Analytics & AI Club | Cloud & Digital Transformation Expertise
3w - Project looks impressive! I can see you've built a robust system with custom prompts and database integration to track individual summaries. Have you experimented with different chunking strategies or multi-stage summarization approaches to further improve quality?
Technologist | Ex - Agriculturalist, Hunter Gatherer, Herbivores Primate
3w - Interesting, and a little overkill, but I can see how much time this would save by sending daily highlights and a weekly review. I have a small setup, but only for YT, and I use 3.5-turbo - it's the cheapest one available and quite good for summarisation tasks, especially if you nail your prompt. For context length I simply process in batches of 15K tokens and keep appending the output until the transcription is fully processed. It's at least better than watching a 3-hour-long video? Would love to hear your take on how automated processing of text and videos online is hurting the creator economy (assuming one no longer opens the webpage or watches the video, for which ads are the main source of revenue).
Building GoMarble | AI-Assisted Human-Led Paid Marketing
3w - For learning I use NotebookLM a lot, and I have recently started exploring this project: https://www.open-notebook.ai/ I would suggest forking this project and building integrations to add custom sources (RSS, etc.). You could also build email integrations to send you a summary. If your objective is to organise your learning sources, it comes built in with the architecture(s) to do so. Some sources would need approach 2, some would need approach 3 (from your diagram).
Working on something similar. A couple of thoughts: Summarization isn't a hard task for an LLM. Gemini has a 2M-token window, so it may not have as hard a time with context-window amnesia. Another way to do this in Claude: 1) prompt it to read a newsletter, strip unnecessary details, and create a distilled JSON/text artifact; 2) that artifact becomes the input to a separate prompt to read and "summarise". You're basically splitting the "read" and "summarise" tasks - fewer context-window hassles, and prompt (1) can be tailored to the newsletter format. Chain them in Make / n8n to automate. Or go hardcore: do (1) from above, but build a vector DB and actually run RAG over the information?
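The read/summarise split described in this comment could be sketched as follows; the LLM call is stubbed out, and both the artifact schema and `build_digest_prompt` are illustrative, not a fixed API:

```python
import json

# Stage 1 would produce one compact artifact per newsletter via a
# format-specific "distill" prompt (LLM call omitted here).
# Illustrative artifact schema:
#   {"source": ..., "date": ..., "items": [{"kind": ..., "headline": ..., "detail": ...}]}

def build_digest_prompt(artifacts: list[dict]) -> str:
    """Stage 2: feed the distilled artifacts, not raw HTML, to the summariser."""
    payload = json.dumps(artifacts, indent=2)
    return (
        "You are compiling a daily AI digest for a VC.\n"
        "Group the items below by kind (funding, papers, launches, news)\n"
        "and write one crisp bullet for each:\n\n" + payload
    )

artifacts = [
    {"source": "Example Letter", "date": "2025-01-10",
     "items": [{"kind": "funding", "headline": "Acme AI raises $20M",
                "detail": "Hypothetical Series A example"}]},
]
prompt = build_digest_prompt(artifacts)
print(len(prompt))
```

Because stage 2 only ever sees the distilled JSON, the per-newsletter heterogeneity is handled in stage 1, and the final prompt stays small and uniform.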