LangChain - Memory Module

What is Memory?

Chains and Agents are stateless by default, treating each query independently. However, in applications like chatbots, it's crucial to remember past interactions. The concept of "Memory" serves exactly that purpose.
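
As a minimal sketch of how this works in LangChain (the messages below are illustrative), every memory class exposes the same two core operations: saving a conversational turn and loading the stored history back:

from langchain.memory import ConversationBufferMemory

# Every memory class can record a turn and return the accumulated history.
memory = ConversationBufferMemory()
memory.save_context({"input": "Hi there!"}, {"output": "Hello! How can I help you?"})

print(memory.load_memory_variables({}))
# {'history': 'Human: Hi there!\nAI: Hello! How can I help you?'}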

Different Types of Memory

Let's delve into a few of the most commonly used memory types in more detail.

ConversationBufferMemory

Imagine you're having a conversation with someone, and you want to remember what you've discussed so far. The ConversationBufferMemory does exactly that in a chatbot or similar system. It keeps a record, or "buffer," of the past parts of the conversation.

from langchain import OpenAI
from langchain.chains import ConversationChain
from langchain.memory import ConversationBufferMemory

llm = OpenAI(temperature=0)
conversation = ConversationChain(
    llm=llm,
    verbose=True,
    memory=ConversationBufferMemory()
)

The buffer is passed to the model as context, helping the chatbot generate better, more informed responses.

What sets this memory apart is its ability to store previous conversations exactly as they occurred, without any alterations.

It preserves the raw form of the conversation, allowing the chatbot to refer back to specific parts accurately. In summary, the ConversationBufferMemory helps the chatbot remember the conversation history, enhancing the overall conversational experience.
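A quick usage sketch (the name and responses are illustrative): each call to predict appends the new exchange to the buffer, which you can inspect directly:

conversation.predict(input="Hi, my name is Sam.")
conversation.predict(input="What is my name?")

# The full, unmodified transcript is kept in memory.buffer:
print(conversation.memory.buffer)
# Human: Hi, my name is Sam.
# AI: Hello Sam! How can I help you today?
# Human: What is my name?
# AI: Your name is Sam.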

Pros of ConversationBufferMemory:

  • Complete conversation history: It retains the entire conversation history, ensuring comprehensive context for the chatbot.
  • Accurate references: By storing conversation excerpts in their original form, it enables precise referencing to past interactions, enhancing accuracy.
  • Contextual understanding: The preserved raw form of the conversation helps the chatbot maintain a deep understanding of the ongoing dialogue.
  • Enhanced responses: With access to the complete conversation history, the chatbot can generate more relevant and coherent responses.

Cons of ConversationBufferMemory:

  • Increased memory usage: Storing the entire conversation history consumes memory resources, potentially leading to memory constraints.
  • Potential performance impact: Large conversation buffers may slow down processing and response times, affecting the overall system performance.
  • Limited scalability: As the conversation grows, the memory requirements and processing load may become impractical for extremely long conversations.
  • Privacy concerns: Storing the entire conversation history raises privacy considerations, as sensitive or personal information may be retained in the buffer.

ConversationBufferWindowMemory

Imagine you have a limited space in your memory to remember recent conversations.

The ConversationBufferWindowMemory is like having a short-term memory that only keeps track of the k most recent interactions. It intentionally drops the oldest ones to make room for new ones.

This helps manage the memory load and reduces the number of tokens used. The important thing is that it still keeps the latest parts of the conversation in their original form, without any modifications. So, it retains the most recent information for the chatbot to refer to, ensuring a more efficient and up-to-date conversation experience.

from langchain import OpenAI
from langchain.chains import ConversationChain
from langchain.memory import ConversationBufferWindowMemory

llm = OpenAI(temperature=0)

conversation = ConversationChain(
    llm=llm,
    verbose=True,
    memory=ConversationBufferWindowMemory(k=3)
)
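
To see the windowing behaviour in isolation, here is a sketch against the memory object directly (the turns are placeholders): with k=3, a fourth exchange pushes the first one out:

from langchain.memory import ConversationBufferWindowMemory

memory = ConversationBufferWindowMemory(k=3)
memory.save_context({"input": "turn 1"}, {"output": "reply 1"})
memory.save_context({"input": "turn 2"}, {"output": "reply 2"})
memory.save_context({"input": "turn 3"}, {"output": "reply 3"})
memory.save_context({"input": "turn 4"}, {"output": "reply 4"})

# Only the last k=3 exchanges survive; "turn 1" has been dropped.
print(memory.load_memory_variables({}))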

Pros of ConversationBufferWindowMemory:

  • Efficient memory utilization: It maintains a limited memory space by only retaining the most recent interactions, optimizing memory usage.
  • Reduced token count: Dropping the oldest interactions helps to keep the token count low, preventing potential token limitations.
  • Unmodified context retention: The latest parts of the conversation are preserved in their original form, ensuring accurate references and contextual understanding.
  • Up-to-date conversations: By focusing on recent interactions, it allows the chatbot to stay current and provide more relevant responses.

Cons of ConversationBufferWindowMemory:

  • Limited historical context: Since older interactions are intentionally dropped, the chatbot loses access to the complete conversation history, potentially impacting long-term context and accuracy.
  • Loss of older information: Valuable insights or details from earlier interactions are not retained, limiting the chatbot's ability to refer back to past conversations.
  • Reduced depth of understanding: Without the full conversation history, the chatbot may have a shallower understanding of the user's context and needs.
  • Potential loss of context relevance: Important information or context from older interactions may be disregarded, affecting the chatbot's ability to provide comprehensive responses in certain scenarios.

ConversationSummaryMemory

With the ConversationBufferMemory, the length of the conversation keeps increasing, which can become a problem if it becomes too large for our LLM to handle.

To overcome this, we introduce ConversationSummaryMemory. Instead of the raw history, it keeps a running summary of the past conversation. But how does it summarize? Here the LLM (Large Language Model) comes to the rescue, condensing the conversation while capturing the key information.

So, instead of storing the entire conversation, we store a summarized version. This helps manage the token count and allows the LLM to process the conversation effectively. In summary, ConversationSummaryMemory keeps a condensed version of previous conversations using the power of LLM summarization.

from langchain import OpenAI
from langchain.chains import ConversationChain
from langchain.memory import ConversationSummaryMemory

llm = OpenAI(temperature=0)

conversation = ConversationChain(
    llm=llm,
    verbose=True,
    memory=ConversationSummaryMemory(llm=llm)
)
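
A sketch of what the summarization produces, reusing the llm defined above (the exchange is illustrative, and the exact summary text depends on the LLM):

from langchain.memory import ConversationSummaryMemory

memory = ConversationSummaryMemory(llm=llm)
memory.save_context(
    {"input": "Hi, I'm looking for a laptop for video editing."},
    {"output": "For video editing you'll want a fast CPU and a dedicated GPU."},
)

# Instead of the raw transcript, the history is an LLM-written summary,
# e.g. "The human asks about laptops for video editing; the AI recommends
# a fast CPU and a dedicated GPU."
print(memory.load_memory_variables({}))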

Pros of ConversationSummaryMemory:

  • Efficient memory management: It keeps the conversation history in summarized form, reducing the memory load.
  • Improved processing: By condensing the conversation snippets, it makes it easier for the language model to process and generate responses.
  • Avoids maxing out limits: It helps prevent exceeding the token count limit, ensuring the prompt remains within the processing capacity of the model.
  • Retains important information: The summary captures the essential aspects of previous interactions, allowing relevant context to be maintained.

Cons of ConversationSummaryMemory:

  • Potential loss of detail: Since the conversation is summarized, some specific details or nuances from earlier interactions might be omitted.
  • Reliance on summarization quality: The accuracy and effectiveness of the summarization process depend on the language model's capability, which might introduce potential errors or misinterpretations.
  • Limited historical context: Due to summarization, the model's access to the complete conversation history may be limited, potentially impacting the depth of understanding.
  • Reduced granularity: The summarized form may lack the fine-grained information present in the original conversation, potentially affecting the accuracy of responses in certain scenarios.

ConversationTokenBufferMemory

ConversationTokenBufferMemory is a memory mechanism that stores recent interactions in a buffer within the system's memory.

Unlike other methods that rely on the number of interactions, this memory system decides when to flush old interactions based on token length. Tokens are the units of text a model reads (roughly words or pieces of words), and the buffer is pruned once the token count exceeds a set threshold. Using token length as the criterion keeps the buffer at a manageable size for the model's context window.

This approach helps maintain efficient memory management and enables the system to handle conversations of varying lengths effectively.

from langchain import OpenAI
from langchain.chains import ConversationChain
from langchain.memory import ConversationTokenBufferMemory

llm = OpenAI(temperature=0)

conversation = ConversationChain(
    llm=llm,
    verbose=True,
    memory=ConversationTokenBufferMemory(llm=llm, max_token_limit=60),
)
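
A sketch of the token-based pruning, reusing the llm defined above (the exchanges are placeholders): once the stored history exceeds max_token_limit, the oldest turns are evicted first:

from langchain.memory import ConversationTokenBufferMemory

memory = ConversationTokenBufferMemory(llm=llm, max_token_limit=60)
memory.save_context({"input": "First question, about pricing."},
                    {"output": "A long answer describing all the pricing tiers..."})
memory.save_context({"input": "Second question, about refunds."},
                    {"output": "Refunds are processed within 30 days."})

# Whatever still fits under ~60 tokens is kept; the oldest turns go first.
print(memory.load_memory_variables({}))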

Pros of ConversationTokenBufferMemory:

  • Efficient memory management: By using token length instead of the number of interactions, the memory system optimizes memory usage and prevents excessive memory consumption.
  • Flexible buffer size: The system adapts to conversations of varying lengths, ensuring that the buffer remains manageable and scalable.
  • Accurate threshold determination: Flushing interactions based on token count provides a more precise measure of memory usage, resulting in a better balance between memory efficiency and retaining relevant context.
  • Improved system performance: With efficient memory utilization, the overall performance of the system, including response times and processing speed, can be enhanced.

Cons of ConversationTokenBufferMemory:

  • Potential loss of context: Flushing interactions based on token length may result in the removal of earlier interactions that could contain important context or information, potentially affecting the accuracy of responses.
  • Complexity in threshold setting: Determining the appropriate token count threshold for flushing interactions may require careful consideration and experimentation to find the optimal balance between memory usage and context retention.
  • Difficulty in long-term context retention: Due to the dynamic nature of token-based flushing, retaining long-term context in the conversation may pose challenges as older interactions are more likely to be removed from the buffer.
  • Impact on response quality: In situations where high-context conversations are required, the token-based flushing approach may lead to a reduction in the depth of understanding and the quality of responses.


There are a few other memory types available; for those, we can refer to the official LangChain documentation on Memory types.

In the next article, I will cover other components of LangChain.

#llm #langchain-memory

