Thoughts behind arxiv-daily


Sharing a few design concepts behind this project and hope they inspire your next idea:


1. One limitation of #generativeai is its ignorance of knowledge after the model training time, e.g., OpenAI's ChatGPT is 2021. While this project is built for a continually growing knowledge base arXiv, I frame the application as digesting knowledge within a day, thus hiding the system’s inability to continually updating the knowledge within language model.


2. #chatgpt can give you inaccurate information (quote Yann LeCun: “spew nonsense” lol). This project thus chooses arXiv, a moderately curated database with presumably higher than web quality data. Plus, the target users are researchers, who are absolutely familiar with their own domains and hardly get misled by ChatGPT’s imprecise/inaccurate statement.?


3. From the application side, the main value I think is offloading daily human routines to computer algorithms, which not necessarily need to be #llms – I actually think Karpathy’s tfidf solution was elegant, while a LLM-powered system could make the experience more humanized. Particularly, this selected HCI task aims at increasing researcher’s productivity but not eliminating any existing human values. The task itself is just not a reasonable human job. As far as I know, not a computer vision scientist or professor would ask his junior fellow/students to scan hundreds of paper every day and make a customized report. This echoes Prof. Erik Brynjolfsson 's advocate: a technology benefit to economy should augment human rather than replacing human.


Let me know if this article prompts you any ideas to build something!

要查看或添加评论,请登录

社区洞察

其他会员也浏览了