Building a Conversational Web Application for PDF Documents using Mistral-7B-v0.1
Kshitij Sharma
IEEE Member | CSI Member | AI & ML Engineer | Generative AI, LLMs, NLP, RAG, Computer Vision | Researcher & Developer | Conference Presenter | Open-Source Contributor | Building Intelligent Systems for Healthcare
In this article, we'll walk through the development of a web application that allows users to interact with PDF documents via a conversational AI interface. The application leverages modern AI tools and frameworks to process and query text extracted from multiple PDF files. We'll cover the design choices, key components, and code implementation.
Introduction
Our application gives users a simple way to interact with the information contained in PDF documents. Users upload PDFs and ask questions about the material, and the application uses a conversational AI model to deliver relevant answers. This approach benefits anyone who needs to extract information quickly from large or complex documents. We will build on a model from Mistral AI, a team developing large language models that are capable yet small enough to run on a desktop computer. In contrast to the restrictive practices of many large organizations toward NLP developers, Mistral AI makes its model weights freely available for download.
Technology Stack
- Python with Streamlit for the web interface
- PyPDF2 for extracting text from PDFs
- LangChain for text splitting, retrieval, and conversation chaining
- HuggingFace sentence-transformers (all-MiniLM-L6-v2) for embeddings
- FAISS for vector similarity search
- Mistral-7B-v0.1, accessed through the HuggingFace Hub, as the language model
Application Components
The application has five components, covered in the code overview below: PDF text extraction, text chunking, vector store creation, the conversational retrieval chain, and the Streamlit interface.
Code Overview
1. PDF Processing
The get_pdf_text function handles the extraction of text from PDF documents.
from PyPDF2 import PdfReader

def get_pdf_text(pdf_docs):
    text = ""
    for pdf in pdf_docs:
        pdf_reader = PdfReader(pdf)
        for page in pdf_reader.pages:
            # extract_text() can return None for image-only pages
            text += page.extract_text() or ""
    return text
2. Text Chunking
The get_text_chunks function splits the extracted text into smaller chunks for easier processing.
from langchain.text_splitter import RecursiveCharacterTextSplitter

def get_text_chunks(text):
    text_splitter = RecursiveCharacterTextSplitter(
        chunk_size=1000,
        chunk_overlap=200,
        length_function=len
    )
    chunks = text_splitter.split_text(text)
    return chunks
Using the RecursiveCharacterTextSplitter, we break the text into 1000-character chunks with a 200-character overlap. The overlap preserves context across chunk boundaries, which helps when managing large documents and improves retrieval accuracy.
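To make the chunk_size and chunk_overlap parameters concrete, here is a deliberately naive stand-in splitter (not the LangChain implementation, which also prefers to break on separators such as paragraph and sentence boundaries):

```python
def naive_split(text, chunk_size=1000, chunk_overlap=200):
    """Toy character splitter: each chunk starts (chunk_size - chunk_overlap)
    characters after the previous one, so consecutive chunks share
    chunk_overlap characters."""
    chunks = []
    step = chunk_size - chunk_overlap  # 800 with the defaults above
    for start in range(0, len(text), step):
        chunk = text[start:start + chunk_size]
        if chunk:
            chunks.append(chunk)
    return chunks

chunks = naive_split("a" * 2500, chunk_size=1000, chunk_overlap=200)
# 4 chunks; the last 200 characters of each chunk repeat at the
# start of the next one.
```

The same overlap idea is what lets a sentence that straddles a chunk boundary still appear whole in at least one chunk.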
3. Vector Store Creation
The get_vectorstore function creates a vector store for efficient similarity search.
from langchain.embeddings import HuggingFaceEmbeddings
from langchain.vectorstores import FAISS

def get_vectorstore(text_chunks):
    model_id = "sentence-transformers/all-MiniLM-L6-v2"
    model_kwargs = {'device': 'cpu'}
    encode_kwargs = {'normalize_embeddings': False}
    embeddings = HuggingFaceEmbeddings(
        model_name=model_id,
        model_kwargs=model_kwargs,
        encode_kwargs=encode_kwargs
    )
    vectorstore = FAISS.from_texts(texts=text_chunks, embedding=embeddings)
    return vectorstore
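Conceptually, the vector store embeds every chunk and, at query time, ranks chunks by similarity to the embedded question. FAISS does this with an optimized index over 384-dimensional MiniLM embeddings; the sketch below shows the underlying idea with tiny hypothetical 2-dimensional vectors:

```python
import math

def cosine_similarity(a, b):
    """Cosine similarity of two non-zero vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

def top_k(query_vec, chunk_vecs, k=3):
    """Return the indices of the k chunks most similar to the query."""
    scored = [(cosine_similarity(query_vec, v), i) for i, v in enumerate(chunk_vecs)]
    scored.sort(reverse=True)
    return [i for _, i in scored[:k]]

# Hypothetical chunk embeddings; a real store would hold one per text chunk.
chunk_vecs = [[1.0, 0.0], [0.0, 1.0], [0.7, 0.7]]
top_k([1.0, 0.1], chunk_vecs, k=2)  # → [0, 2]
```

This is the same retrieval step that `vectorstore.as_retriever(search_kwargs={"k": 3})` performs later, returning the three best-matching chunks to the language model.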
4. Conversational AI
The get_conversation_chain function sets up the conversational AI using HuggingFace models.
from langchain.memory import ConversationBufferMemory
from langchain.chains import ConversationalRetrievalChain
from langchain.llms import HuggingFaceHub

def get_conversation_chain(vectorstore):
    memory = ConversationBufferMemory(memory_key='chat_history', return_messages=True)
    llm = HuggingFaceHub(
        repo_id="mistralai/Mistral-7B-v0.1",
        model_kwargs={"temperature": 0.5, "max_length": 512}
    )
    conversation_chain = ConversationalRetrievalChain.from_llm(
        llm=llm,
        chain_type="stuff",
        retriever=vectorstore.as_retriever(search_kwargs={"k": 3}),
        memory=memory
    )
    return conversation_chain
We use HuggingFaceHub to load the conversational AI model and set up the ConversationalRetrievalChain to handle user queries based on the indexed text.
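The ConversationBufferMemory is what makes the chain conversational rather than single-shot: it keeps an append-only history of human and AI turns that is fed back to the chain on every query, so follow-up questions can refer to earlier ones. A toy sketch of that behavior (not the LangChain class itself):

```python
class BufferMemory:
    """Minimal stand-in for ConversationBufferMemory: stores alternating
    human/AI turns and exposes them as chat_history."""
    def __init__(self):
        self.chat_history = []

    def save_context(self, question, answer):
        self.chat_history.append(("human", question))
        self.chat_history.append(("ai", answer))

mem = BufferMemory()
mem.save_context("What is the report about?", "Quarterly sales.")
mem.save_context("Who wrote it?", "The finance team.")
# mem.chat_history now holds 4 turns that a follow-up query can draw on.
```

With memory_key='chat_history' and return_messages=True, the real class exposes this history as message objects under st.session_state once the chain is stored there.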
5. Web Interface
The main function defines the web interface using Streamlit.
import streamlit as st
from htmlTemplates import css

def main():
    st.set_page_config(page_title="Chat with multiple PDFs", page_icon=":books:")
    st.write(css, unsafe_allow_html=True)

    if "conversation" not in st.session_state:
        st.session_state.conversation = None
    if "chat_history" not in st.session_state:
        st.session_state.chat_history = None

    st.header("Chat with multiple PDFs :books:")
    user_question = st.text_input("Ask a question about your documents:")
    if user_question:
        handle_userinput(user_question)

    with st.sidebar:
        st.subheader("Your documents")
        pdf_docs = st.file_uploader(
            "Upload your PDFs here and click on 'Process'",
            accept_multiple_files=True
        )
        if st.button("Process"):
            with st.spinner("Processing"):
                st.write("Processing your documents...")
                raw_text = get_pdf_text(pdf_docs)
                text_chunks = get_text_chunks(raw_text)
                vectorstore = get_vectorstore(text_chunks)
                st.session_state.conversation = get_conversation_chain(vectorstore)
            st.success("Documents processed successfully!")

if __name__ == '__main__':
    main()
The Streamlit interface allows users to upload PDF files and ask questions. The uploaded PDFs are processed, and the conversation chain is set up to handle user interactions.
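main() calls a handle_userinput function that is not shown above. A minimal sketch of what it might look like, assuming the templates from htmlTemplates.py and the session state set up in main() (the render_message helper is our own addition, introduced here so the placeholder substitution is easy to see):

```python
def render_message(template: str, msg: str) -> str:
    """Fill the {{MSG}} placeholder in an HTML chat template."""
    return template.replace("{{MSG}}", msg)

def handle_userinput(user_question):
    # Imported lazily here so the sketch stays self-contained.
    import streamlit as st
    from htmlTemplates import bot_template, user_template

    # Run the query through the conversational retrieval chain.
    response = st.session_state.conversation({'question': user_question})
    st.session_state.chat_history = response['chat_history']

    # Even-indexed messages are the user's turns, odd-indexed are the bot's.
    for i, message in enumerate(st.session_state.chat_history):
        template = user_template if i % 2 == 0 else bot_template
        st.write(render_message(template, message.content), unsafe_allow_html=True)
```

This renders each turn of the conversation with the matching avatar and background color defined in the CSS below.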
Finally, you need to create an htmlTemplates.py file containing the CSS and chat templates used by the web application.
css = '''
<style>
.chat-message {
padding: 1.5rem; border-radius: 0.5rem; margin-bottom: 1rem; display: flex
}
.chat-message.user {
    background-color: #2b313e
}
.chat-message.bot {
    background-color: #475063
}
.chat-message .avatar {
width: 20%;
}
.chat-message .avatar img {
max-width: 78px;
max-height: 78px;
border-radius: 50%;
object-fit: cover;
}
.chat-message .message {
width: 80%;
padding: 0 1.5rem;
color: #fff;
}
'''
bot_template = '''
<div class="chat-message bot">
<div class="avatar">
<img src="https://i.ibb.co/jfWFfnk/chatbot-2-logo.jpg">
</div>
<div class="message">{{MSG}}</div>
</div>
'''
user_template = '''
<div class="chat-message user">
<div class="avatar">
<img src="https://i.ibb.co/kKfsbY0/human-chat-head.jpg">
</div>
<div class="message">{{MSG}}</div>
</div>
'''
Conclusions
This application demonstrates how to build a web-based conversational interface for querying information from PDF documents. By integrating various tools and technologies, we can provide a seamless experience for users to interact with document content meaningfully.