LangChain - Question Answering using Vector Databases and Similarity Search, Evaluation, Agents
Sarthak Pattnaik
Senior Software Engineer at HCLTech | MS Applied Data Analytics at Boston University
The most interesting utility of LangChain is that, once it is integrated with a large language model, one can use it to extract insights from data the model was never trained on. Such use cases bolster the argument in favour of LangChain and provide evidence of its overwhelming utility. LangChain was created in October 2022 by Harrison Chase, and in only a year it has garnered immense popularity and has become one of the most widely used LLM application frameworks today.
Questions and Answers using LangChain
Combining the capabilities of LLMs with documents that contain personal or proprietary information can be of immense help: it lets one collate answers to questions about the information enmeshed in those documents. The issue we face, however, is that LLMs cannot process arbitrarily large documents in a single pass. To make sure that LLMs have the capacity to work over large documents, we use vector embeddings and vector storage. Vector embeddings convert the contents of a document into a format the language model is able to work with, and the similarity between these vectors determines how alike two distinct pieces of content are. Vector databases are repositories of the vector embeddings extracted from documents. Since these documents can be enormous, they are split into chunks and converted to embeddings, after which they are stored in the vector database. There are myriad ways to split a document, and we must choose the appropriate method for our use case. A few commonly used techniques include recursive splitting (splitting based on characters), token splitting (splitting based on token count), and context-aware splitting (a method that keeps related words or sentences together).

With this mechanism in place, when one wants an answer to a question, the question is converted to an embedding and, based on similarity, the 'k' closest chunks are retrieved from the vector store. These chunks are placed into a system prompt, and the system prompt, along with the question, is passed to the large language model to orchestrate an answer. In the default scenario all the retrieved segments are passed in the same context window. When the documents are too large for that, however, we can leverage techniques like Map Reduce, Refine, and Map Rerank, described below.
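The sketch below illustrates this pipeline end to end with the classic LangChain Python API: loading a document, splitting it into chunks, embedding and storing them in a vector database, and answering a question with a RetrievalQA chain. The file name, model name, chunk sizes, and value of k are illustrative assumptions rather than recommendations.

from langchain.document_loaders import PyPDFLoader
from langchain.text_splitter import RecursiveCharacterTextSplitter
from langchain.embeddings import OpenAIEmbeddings
from langchain.vectorstores import Chroma
from langchain.chat_models import ChatOpenAI
from langchain.chains import RetrievalQA

# 1. Load the document and split it into chunks.
docs = PyPDFLoader("proprietary_report.pdf").load()  # hypothetical document
splitter = RecursiveCharacterTextSplitter(chunk_size=1000, chunk_overlap=150)
chunks = splitter.split_documents(docs)

# 2. Convert the chunks to embeddings and store them in a vector database.
embeddings = OpenAIEmbeddings()
vectordb = Chroma.from_documents(chunks, embeddings)

# 3. Pure similarity search: retrieve the k chunks closest to the question.
question = "What were the key findings of the report?"
similar_chunks = vectordb.similarity_search(question, k=4)

# 4. Or let a RetrievalQA chain handle retrieval, prompting, and the LLM call.
llm = ChatOpenAI(model_name="gpt-3.5-turbo", temperature=0)
qa_chain = RetrievalQA.from_chain_type(
    llm=llm,
    chain_type="stuff",  # default: all retrieved chunks go into one context window
    retriever=vectordb.as_retriever(search_kwargs={"k": 4}),
)
print(qa_chain.run(question))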
Map Reduce processes multiple chunks in parallel and curates the final answer by combining the results of the individual LLM calls. It requires a lot of calls, and it treats each document independently, which may not be appropriate in every scenario.
Refine builds upon the answer from the previous document and follows an iterative approach to information retrieval, rather than the parallel approach seen in Map Reduce.
Map Rerank processes the documents in parallel, asks the LLM to assign a relevance score to each answer, and returns the highest-scoring answer as the result. We still use a considerable number of calls in this scenario.
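Switching between these strategies is a matter of changing the chain type. A minimal sketch, reusing the llm, vectordb, and question from the example above (the chain-type strings correspond to Map Reduce, Refine, and Map Rerank respectively):

from langchain.chains import RetrievalQA

for chain_type in ("map_reduce", "refine", "map_rerank"):
    qa = RetrievalQA.from_chain_type(
        llm=llm,
        chain_type=chain_type,
        retriever=vectordb.as_retriever(),
    )
    # Each strategy combines the retrieved chunks differently before answering.
    print(chain_type, "->", qa.run(question))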
LangChain Evaluation
One of the simplest ways to evaluate a LangChain application is to probe the results it generates and check whether they are consistent with the details present in the original documents. However, if we have a plethora of documents, then writing query-result pairs for each of them is an assiduous task. This is where QAGenerateChain helps: it curates a question-answer pair for each document so that we do not have to create them ourselves. Once QAGenerateChain has created the query-result pairs, we can run them through our application and observe whether the output for each pair is consistent with the document. Doing this by hand again becomes tedious, so to ameliorate the painstaking endeavour of manually inspecting each LLM response, we use the model at our disposal to perform the evaluation for us: we incorporate the QAEvalChain functionality in conjunction with the large language model to grade the predicted answers against the generated ones.
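A minimal sketch of that evaluation loop, assuming the llm, chunks, and qa_chain from the retrieval example above; the number of generated examples is arbitrary, and the exact shape of the generated output can differ slightly between LangChain versions:

from langchain.evaluation.qa import QAGenerateChain, QAEvalChain

# 1. Auto-generate a question-answer pair from each of the first few chunks.
gen_chain = QAGenerateChain.from_llm(llm)
raw_pairs = gen_chain.apply_and_parse([{"doc": c.page_content} for c in chunks[:5]])
# Depending on the version, each pair may be nested under a "qa_pairs" key.
examples = [item.get("qa_pairs", item) for item in raw_pairs]

# 2. Run the QA chain over the generated questions to obtain predictions.
predictions = qa_chain.apply(examples)

# 3. Let the LLM grade each prediction against the generated reference answer.
eval_chain = QAEvalChain.from_llm(llm)
graded = eval_chain.evaluate(examples, predictions)
for example, grade in zip(examples, graded):
    print(example["query"], "->", grade)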
LangChain Agents
An underdiscussed functionality of LLMs is their utility in reasoning. An agent in LangChain provides the necessary features to integrate search tools like Wikipedia and DuckDuckGo into its framework, so that the model can peruse the content of these sources to find information relevant to the question posed by the user. LangChain also allows users to create their own user-defined tools and agents.
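Below is a minimal sketch of such an agent using the classic initialize_agent API, combining the built-in Wikipedia and DuckDuckGo search tools with a small user-defined tool; the choice of tools and the reused llm are assumptions for illustration (the "wikipedia" and "ddg-search" tools require the wikipedia and duckduckgo-search packages respectively).

from datetime import date
from langchain.agents import AgentType, initialize_agent, load_tools, tool

@tool
def today(text: str) -> str:
    """Returns today's date; useful for any question about the current date."""
    return str(date.today())

# Built-in search tools plus the custom tool defined above.
tools = load_tools(["wikipedia", "ddg-search"], llm=llm) + [today]

agent = initialize_agent(
    tools,
    llm,
    agent=AgentType.CHAT_ZERO_SHOT_REACT_DESCRIPTION,
    verbose=True,  # print the intermediate reasoning steps the agent takes
)
agent.run("Who created LangChain, and what is today's date?")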