????????? ???????? ???????? ????????-?????????? ?????????????????? ?????????? This article explains what #GRPO is, how to fine-tune the model, and how to use it in RAG to build an efficient system that provides accurate, logical, and up-to-date responses. This approach is ideal for complex scenarios where fast data retrieval and smart reasoning are essential for delivering high-quality answers. ?? Detailed Article - https://lnkd.in/gRYuf5j7 Star ?? LanceDB recipes to keep yourself updated - https://lnkd.in/dvmfDFed #grpo #rag #finetuning #vectordb
LanceDB
信息服务
San Francisco,California 6,509 位关注者
Developer-friendly, open source database for multi-modal AI
关于我们
LanceDB is a developer-friendly, open source database for multimodal AI. From hyper scalable vector search to advanced retrieval for RAG, from streaming training data to interactive exploration of large scale AI datasets, LanceDB is the best foundation for your AI application.
- 网站
-
https://lancedb.com
LanceDB的外部链接
- 所属行业
- 信息服务
- 规模
- 11-50 人
- 总部
- San Francisco,California
- 类型
- 私人持股
- 创立
- 2022
地点
-
主要
US,California,San Francisco
LanceDB员工
动态
-
LanceDB转发了
Understand GRPO Equation -> Map it to Code -> Fine Tune a small model on single Colab using LoRA -> Use the model in a RAG setting There was a post that I shared some days ago on the same: https://lnkd.in/gyzZZ_tS Here I said I'd write a blog on the same. So I wrote one and threw some fine tuning code on the same. You can learn more about it here: https://lnkd.in/gf6pMuu4 It uses Google Colab + Hugging Face TRL + LanceDB as the Vector DB for end to end stuff. Not sure if the fine tuned model can be put to production but it'll bring you closer to understanding end to end reasoning models for sure. #GRPO #reasoning #llm #finetuning #RAG #huggingface #colab #vectorDB #lancedb
-
-
?? ???????????????????????? ??????????-?????????? ?????????????? ?????? ?????????????????? This Law Assistant system helps legal professionals and students by retrieving and summarizing legal information. It uses hierarchical multi-agent approach to handle queries about the Indian Penal Code (IPC) and the NDPS Act. This system uses supervising agent, primary agent and sub-agents under each primary agent to find relevant case laws and provide accurate legal summaries quickly. ?? Node app - https://lnkd.in/gp44cqWV Star ?? LanceDB recipes to keep yourself updated - https://lnkd.in/dvmfDFed #agent #hierarchical #multiagent #law #assistant
-
-
?? ???????????? ????????????’?? ???????????? ????????????: ??????????????-?????????????? ?????? ?????? ????????????, ?????????????? ???????? ?????????????????????? What if you could go from months of development to hours? That’s exactly what Second Dinner achieved with LanceDB #Cloud: ? Prototyping new features in ?????????? (instead of months) ? ???????? ???????????? QA with a simple API call ? AI-driven workflows ??-???? ???????? ????????-?????????????????? than alternatives “?????????????? ??????’?? ???????? ?? ????????—????’?? ?????? ?????????? ????????.” – Xiaoyang Y., VP of AI, Data, Security at Second Dinner ?? Read the full case study here: https://lnkd.in/gdsK8FVe Special thanks for Eric Del Priore Tal Puhov Qian Zhu for the case study! #AI #GameDevelopment #MachineLearning #Innovation #LanceDB #SecondDinner
-
-
?? HoneyHive x LanceDB HoneyHive is an AI monitoring platform that helps developers track, manage, and improve AI applications. It offers tools for performance monitoring, dataset management, debugging, and collaboration to ensure AI systems run smoothly. By integrating LanceDB, developers can now easily trace and analyse your retrieval calls in a few simple steps. ?? Colab Notebook - https://lnkd.in/gdU-jNCu Star ?? LanceDB recipes to keep yourself updated - https://lnkd.in/dvmfDFed #monitoring #retrieval #vectordb #honeyhive
-
-
?? ?????? ?????? ?????? ?????? ?????????????????? ???? ?????????? ?? This application combines #PDF chat functionality using LanceDB with advanced #RAG (Retrieval-Augmented Generation) methods and uses the ?????????? ????????-????-???????????? (??????) model for audio output. It enables high-quality text and speech interaction with PDF documents. ?? Code - https://lnkd.in/dS-zQBFv Star ?? LanceDB recipes to keep yourself updated - https://lnkd.in/dvmfDFed #chatbot #pdf #parlertts #llama3 #vectordb
-
-
Understanding and analysing retrieval performance is a complex challenge in building RAG solutions. With HoneyHive's LanceDB integration, you can now easily trace and analyse your retrieval calls in a few simple steps. This insightful blog post by HoneyHive covers how you can do so in great depth. Check it out ??
Modern AI applications face challenges managing vector data and understanding retrieval performance. LanceDB's serverless vector database, combined with HoneyHive’s specialized tracing, helps teams efficiently store multimodal data, track RAG quality, and optimize retrieval performance. Here's practical guide on integrating LanceDB and HoneyHive for building better RAG applications: https://lnkd.in/eCAfbXgH
-
-
????????????? ??????????: ?????????????????????? ?????????????? ???????? ???????????????? Super Agent uses Ollama, LiteLLM (YC W23), AutoGen, LanceDB, and LangChain to build a smart AI tool. This tool combines data storage and conversation skills to manage information effectively and interact intelligently. Here are the key steps: 1. ???????????? ??????????: Use LanceDB to index the PDF and make it searchable. 2. ?????? ???? ??????????????????: Build a Question-Answer system with LangChain for smart, context-aware queries. 3. ?????????????? ????????????: Set up user and assistant agents with Autogen for interactive querying. ?? Colab Notebook - https://lnkd.in/gm9kU3is Star ?? LanceDB recipes to keep yourself updated - https://lnkd.in/dvmfDFed #aiagent #vectordb #superagent #autogen
-
-
?? ??????????????-???????????????????? ???????????? ?????? ?????????????????? ????????? Vanilla RAG is great, but some situations need smaller chunks because larger ones can add unnecessary noise, like conversation history. Using couple-level chunks can work, but important context might be lost from previous or future replies. Bigger chunks could help, but they come with their own issues, like noise and limited chunk numbers. What's the Solution:??????????????? ???????????????????? ???????????? This method adds up window of other chunks with search results making searched resulting in enriching context for better RAG answers. ?? Colab Notebook - https://lnkd.in/gvfR9cR6 ?? Blog - https://lnkd.in/gA8JUmpu Star ?? LanceDB recipes to keep yourself updated - https://lnkd.in/dvmfDFed #context #rag #textgeneration #improvement #blog #vectordb
-
-
??? ???????????????????? ?????????? ???????????? ?????????? ???????? ?? This example uses LanceDB to store frames and titles of 13,000+ videos from the YouTube #8M dataset, selecting 5 random videos from each top category. The #CLIP model was used to embed both frames and titles, enabling embedding, keyword, and SQL searches on the videos. ? Build video search Engine - https://lnkd.in/dEBGmWDf Star ?? LanceDB recipes to keep yourself updated - https://lnkd.in/dvmfDFed #multimodal #clip #openai #vectordb
-