Semantic Search Your Documents
Elias Hamad
Techno-Functional Lead | IT Project Manager | Scrum Master | Python & JS Dabbler | Delivering Software Implementations with Excellence | Passionate about AI and Technology's Role in Society
Introduction
Semantic search is not a new thing; it has been around for a while. Instead of searching for specific keywords, we can query by meaning, and that alone should be intriguing enough for most of us.
You could approximate this with any call to an LLM by pasting a large text into the prompt itself. However, a prompt is still limited to a finite number of characters, and if we want to work through a full PDF worth of content, that approach will not work.
What's this LangChain thing?
So a different approach is available, and it uses what is, for now at least, the most popular LLM development framework: LangChain.
LangChain is a software development framework designed to simplify the creation of applications using large language models (LLMs) such as those from OpenAI, Anthropic, or Hugging Face. The framework offers a suite of tools, components, and interfaces to manage interactions with language models, chain together multiple components, and integrate resources such as APIs, databases, and a wide variety of document types.
There are 7 main components currently provided with LangChain:
- Schema: such as text, documents, and lists of examples.
- Models: such as OpenAI's.
- Prompts: a component used to formulate the input into a structured template and pass it as input to the selected LLM.
- Indexes: basically a way to structure documents so that an LLM can interact with them.
- Memory: gives our interaction with the LLM some context of the conversation.
- Chains: one of my favorites; think of this as a chain of components (LLM, Prompt, Index) put together to accomplish a use case (see the sketch after this list).
- Agents: a component that can analyze the user's input, decide which tools to use, and execute tasks by calling those tools.
Each one could have an article of its own.
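To make the Chains idea concrete, here is a minimal sketch in LangChain JS. It is only an illustration: the import paths follow the 2023-era package layout, the prompt text is my own placeholder, and it assumes an ES module where top-level await is available.

// A minimal chain: prompt template -> LLM, run with one input value.
import { OpenAI } from "langchain/llms/openai";
import { PromptTemplate } from "langchain/prompts";
import { LLMChain } from "langchain/chains";

const model = new OpenAI({ temperature: 0 });
const prompt = PromptTemplate.fromTemplate(
  "Summarize the following text in one sentence:\n\n{text}"
);
const chain = new LLMChain({ llm: model, prompt });

// The chain formats the prompt with the input and sends it to the model.
const result = await chain.call({ text: "LangChain is a framework for building LLM applications..." });
console.log(result.text);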
Use-Case
I have built my own version of a "chat with your document" project.
Many people have already built something similar, mostly using LangChain Python; some used LangChain JS, as did I.
Loading then Chatting
First, we need to load the document into a "vectorstore" DB so that we can query and chat with it later.
In case you are wondering what a vectorstore is, the following is the description provided in the LangChain documentation:
A vector store is a particular type of database optimized for storing documents and their embeddings, and then fetching of the most relevant documents for a particular query, ie. those whose embeddings are most similar to the embedding of the query.
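As a quick illustration, once documents are stored, a similarity search in LangChain JS looks roughly like this. This is only a sketch: it assumes an already-populated vectorStore object, and the query text and the value 4 are placeholders of mine.

// Return the chunks whose embeddings are most similar to the query's embedding.
const relevantDocs = await vectorStore.similaritySearch(
  "How did private investment in AI change in 2022?", // example query
  4 // number of most-similar chunks to return
);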
Back to the document-loading step, which breaks down as follows (a code sketch follows the list):
- We need a document; I am using the "AI-Index-Report_2023" PDF.
- Load the PDF file into a DB.
- Split the document into chunks.
- Create embeddings.
- Each embedding turns one of the text chunks into a vector.
- Create a vectorstore to store the vectors created by the embeddings.
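Here is a rough sketch of that ingestion step in LangChain JS. Treat it as an outline, not the exact code: the import paths follow the 2023-era package layout, the chunk sizes are arbitrary defaults, and PINECONE_INDEX_NAME is my placeholder for wherever you keep your index name.

// Ingestion sketch: load the PDF, chunk it, embed the chunks, store vectors in Pinecone.
import { PDFLoader } from "langchain/document_loaders/fs/pdf";
import { RecursiveCharacterTextSplitter } from "langchain/text_splitter";
import { OpenAIEmbeddings } from "langchain/embeddings/openai";
import { PineconeStore } from "langchain/vectorstores/pinecone";
import { PineconeClient } from "@pinecone-database/pinecone";

async function ingestDocument() {
  // 1. Load the PDF into LangChain Document objects
  const loader = new PDFLoader("AI-Index-Report_2023.pdf");
  const rawDocs = await loader.load();

  // 2. Split the document into overlapping chunks small enough to embed
  const splitter = new RecursiveCharacterTextSplitter({
    chunkSize: 1000,
    chunkOverlap: 200,
  });
  const docs = await splitter.splitDocuments(rawDocs);

  // 3. Connect to Pinecone and pick the target index
  const client = new PineconeClient();
  await client.init({
    apiKey: process.env.PINECONE_API_KEY,
    environment: process.env.PINECONE_ENVIRONMENT,
  });
  const pineconeIndex = client.Index(process.env.PINECONE_INDEX_NAME); // placeholder env var

  // 4. Embed each chunk and store the resulting vectors in the index
  await PineconeStore.fromDocuments(docs, new OpenAIEmbeddings(), {
    pineconeIndex,
  });
}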
Once the loading (ingestion) is done, we can proceed with the semantic-search functionality (chatting with the document), which uses LangChain Chains:
To put the above image in words:
- We need a standalone question.
- Embed it and query the vectorstore, making sure we use the same index we used when loading the document.
- The vectorstore finds the most relevant information and passes it to the LLM.
- The LLM (GPT-3.5 in this case) combines the standalone question with the result coming from the vectorstore and provides the final answer.
Code Overview
I won't get into the details of the code itself here, but you can message me if you want more; I'll be happy to share them.
I wrote one method called queryPineconeStore(question), and I will split it into sections below for you to go through the code:
Initialize the Pinecone client and pass it the Pinecone API key to get the client ready to use.
// Create Pinecone client and index
const client = new PineconeClient();
await client.init({
  apiKey: process.env.PINECONE_API_KEY,
  environment: process.env.PINECONE_ENVIRONMENT,
});
Embeddings are essential both to ingest the document text into the vector store and to retrieve from it, as we are doing here.
The index value should match the index you created when you loaded the document (below I read it from a placeholder environment variable).
const embeddings = new OpenAIEmbeddings();
// Use the same index name you used when loading the document
// (PINECONE_INDEX_NAME is a placeholder for however you store it)
const pineconeIndex = client.Index(process.env.PINECONE_INDEX_NAME);
const pineconeStore = await PineconeStore.fromExistingIndex(embeddings, {
  pineconeIndex,
});
console.log("Pinecone store created successfully");
Initialize the LLM, here using OpenAI, with temperature: 0 to stop it from using its creativity while answering.
const model = new OpenAI({ temperature: 0 });
Create a "RetrievalQAChain" chain and pass it the model and the vector store we initialized above.
const chain = RetrievalQAChain.fromLLM(model, pineconeStore.asRetriever());
Call the chain by passing the question.
The question here is simply the input taken from the user through a text field on the web page.
const response = await chain.call({
  query: question,
});
console.log({ response });
return response;
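Putting the snippets together, the whole method looks roughly like this. It is a sketch assembled from the pieces above: the import paths assume the 2023-era LangChain JS layout, and PINECONE_INDEX_NAME is my placeholder for your index name.

import { PineconeClient } from "@pinecone-database/pinecone";
import { OpenAIEmbeddings } from "langchain/embeddings/openai";
import { PineconeStore } from "langchain/vectorstores/pinecone";
import { OpenAI } from "langchain/llms/openai";
import { RetrievalQAChain } from "langchain/chains";

export async function queryPineconeStore(question) {
  // Create Pinecone client and index
  const client = new PineconeClient();
  await client.init({
    apiKey: process.env.PINECONE_API_KEY,
    environment: process.env.PINECONE_ENVIRONMENT,
  });
  const pineconeIndex = client.Index(process.env.PINECONE_INDEX_NAME); // placeholder env var

  // Reconnect to the existing index with the same embeddings used at load time
  const embeddings = new OpenAIEmbeddings();
  const pineconeStore = await PineconeStore.fromExistingIndex(embeddings, {
    pineconeIndex,
  });

  // LLM with temperature 0, chained to the vector store used as a retriever
  const model = new OpenAI({ temperature: 0 });
  const chain = RetrievalQAChain.fromLLM(model, pineconeStore.asRetriever());

  // Ask the question and return the chain's answer
  const response = await chain.call({ query: question });
  console.log({ response });
  return response;
}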
A simple form lets the user ask a question.
On submit, the retrieval process explained above runs and the result is returned back on the same form.
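For context, the front end does not need to be anything fancy; the sketch below is one way to wire it up. Everything in it is hypothetical: the /api/query route, the question and answer element IDs, and the assumption that the server route simply calls queryPineconeStore and returns its result as JSON.

// Hypothetical client-side handler: posts the question to a server route
// that calls queryPineconeStore and sends back the chain's answer.
async function onSubmit(event) {
  event.preventDefault();
  const question = document.getElementById("question").value;

  const res = await fetch("/api/query", {
    method: "POST",
    headers: { "Content-Type": "application/json" },
    body: JSON.stringify({ question }),
  });

  const { response } = await res.json();
  document.getElementById("answer").textContent = response.text;
}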
Below is the section of the PDF representing the result:
Obviously, this is not the best example to show, but it should at least give you an idea, and you can definitely play with LangChain to find the best way to adapt this to your needs.
Potential Use Cases
- Medical search
- Legal search
- Corporate Content Search
- Call and meeting transcript search
Conclusion
I am not sure this article does #langchain the justice it is due, but if you are a developer or a tech company, you would do yourself a great service by looking into this framework.
Check out all of its capabilities and the use cases already implemented; they have just added #babyagi and #autogpt to the framework as well.
If you plan on getting involved in any AI development in the future, for personal or business purposes, then I believe this should be your immediate next step.
References
A. LangChain documentation (JS version): Welcome to LangChain
B. YouTube channel that inspired this article (I also used some of his images): https://www.youtube.com/@samwitteveenai
C. Sam's Python Colab notebook for a better Python version of the same functionality: https://colab.research.google.com/drive/13FpBqmhYa5Ex4smVhivfEhk2k4S5skwG?usp=sharing