Build a Powerful RAG Chatbot with Cohere's Command-R

Full tutorial - https://www.youtube.com/watch?v=HTihFrMzur4

In this tutorial, we're going to build a powerful retrieval-augmented generation (RAG) chatbot using Cohere's new Command-R large language model.

Command-R is a scalable language model optimized for Retrieval Augmented Generation (RAG) and tool use.

The best part of Command-R is that we don't have to use any embedding models or vector databases to build RAG-based applications.

Cohere incorporates RAG into its LLMs, especially within question-answering frameworks. Command-R seamlessly integrates with Cohere's Embed and Rerank models to deliver best-in-class RAG capabilities. Notably, Command-R's outputs include clear citations, mitigating the risk of hallucinations and enabling users to easily access additional context from source materials.

Retrieval Augmented Generation (RAG) is a method for generating text using additional relevant information fetched from an external data source. The core idea is that providing relevant documents or context to the language model can greatly increase the factual accuracy and groundedness of the generated text.

With Command-R, the RAG workflow typically involves three main steps (a minimal code sketch follows the list):

  1. Generating search queries that can retrieve relevant documents for the given input prompt or question.
  2. Fetching those relevant documents from the specified data source using the generated search queries.
  3. Generating the final response augmented with the retrieved documents, often including inline citations to ground the output in the source material.
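Here is a minimal sketch of that three-step flow against Cohere's chat API. It assumes the v5-style Python SDK, where returned search queries expose a text field; fetch_documents is a hypothetical retrieval helper standing in for whatever data source you use:

import cohere

co = cohere.Client("YOUR_API_KEY")  # placeholder key
question = "What is retrieval augmented generation?"

# Step 1: ask the model to generate search queries instead of an answer.
query_response = co.chat(
    model="command-r",
    message=question,
    search_queries_only=True,
)
search_queries = [q.text for q in (query_response.search_queries or [])]

# Step 2: fetch relevant documents with those queries.
# fetch_documents is a hypothetical helper -- plug in your own data source.
documents = fetch_documents(search_queries)  # e.g. [{"title": ..., "snippet": ...}]

# Step 3: generate the final answer, grounded in the retrieved documents.
answer = co.chat(
    model="command-r",
    message=question,
    documents=documents,
)
print(answer.text)
print(answer.citations)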

Setup and Dependencies

The app relies on a few key Python libraries:

  1. Streamlit: A framework for building data-driven web apps rapidly with Python.
  2. Cohere: The official Python client library for interacting with Cohere's AI models and APIs.

To access Cohere's APIs, you need to have a valid API key stored as an environment variable named COHERE_API_KEY. The code checks if this key is present and notifies the user if it's missing.
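For reference, a typical setup might look like the following shell commands (the key value is a placeholder):

pip install streamlit cohere
export COHERE_API_KEY="your-cohere-api-key"  # macOS/Linux; use setx on Windows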

Core Functionality

The heart of the app is the generate_rag_response_with_citations function, which takes a user query and a list of user-uploaded documents as input and returns a response generated by Cohere's AI along with a list of citations pointing back to those documents.

Here's how it works:

  1. The function formats the uploaded documents into the {"title": ..., "snippet": ...} shape the chat API expects.
  2. It calls Cohere's chat endpoint, specifying the command-r model, the user query, and a documents argument, which enables the RAG capability: the model searches the supplied documents for relevant information to ground the response.
  3. Cohere's AI generates a response (response.text) along with a list of citations (response.citations) referencing the documents used (an example citation shape is shown after the list).
  4. The function returns the response text and the list of citations.
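For illustration, an individual citation typically carries the cited text span and the IDs of the documents that support it. The exact shape depends on the SDK version; the values below are made up:

# Illustrative citation entry (dict-style response; values invented):
# {
#     "start": 12,
#     "end": 43,
#     "text": "retrieval augmented generation",
#     "document_ids": ["doc_0"]
# }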

Streamlit UI

The app presents a simple user interface built with Streamlit:

  1. A file uploader for the text documents the answer should be grounded in.
  2. A text area for the user to enter their query.
  3. A button to trigger the request to Cohere's AI.
  4. Once the button is clicked, the app calls generate_rag_response_with_citations with the user's query and the uploaded documents.
  5. The response from the AI is displayed.
  6. If there are any citations, they are displayed along with the document snippets they were drawn from.

Putting it all together, here is the complete app:

import streamlit as st
import cohere
import os

# The Cohere API key is read from the COHERE_API_KEY environment variable
api_key = os.getenv('COHERE_API_KEY')

# Ensure the API key is actually retrieved; otherwise, notify the user and stop.
if api_key is None:
    st.error("COHERE_API_KEY environment variable not found. Please set it.")
    st.stop()  # halt here so the client below is never used uninitialized

# Initialize the Cohere client with the API key
co = cohere.Client(api_key)

def generate_rag_response_with_citations(query, documents):
    """
    Generates a response to the user query using Command-R model with RAG capability
    by referencing a set of user-uploaded documents and includes citations in the response.
    
    Parameters:
    - query (str): The user's query.
    - documents (list): A list of documents provided by the user.
    
    Returns:
    - Tuple[str, list]: The generated response and a list of citations.
    """
    # Format documents for the API
    formatted_documents = [{"title": f"doc_{i}", "snippet": doc} for i, doc in enumerate(documents)]
    
    # Call the Cohere chat endpoint, passing the documents so the model can
    # ground its answer in them (RAG)
    response = co.chat(
        model="command-r",
        message=query,
        documents=formatted_documents
    )

    # Extracting text and citations from the response
    response_text = response.text
    citations = response.citations

    return response_text, citations

# Streamlit UI
st.title('RAG with Citations - Command-R')

uploaded_files = st.file_uploader("Upload documents related to your query (text files):", accept_multiple_files=True, type=['txt'])
user_query = st.text_area("Enter your query:")

if st.button('Get Answer'):
    if not user_query:
        st.write("Please enter a query to proceed.")
    elif not uploaded_files:
        st.write("Please upload at least one document to proceed.")
    else:
        # Read the content of the uploaded files
        documents = [file.getvalue().decode("utf-8") for file in uploaded_files]
        
        response, citations = generate_rag_response_with_citations(user_query, documents)
        st.write("Answer:")
        st.write(response)
        
        if citations:
            st.write("Citations:")
            for citation in citations:
                cited_text = citation['text']
                document_ids = citation['document_ids']
                # Assuming document IDs are in the format "doc_x", extract and display the cited document snippets
                for doc_id in document_ids:
                    index = int(doc_id.split('_')[-1])
                    st.write(f"- {cited_text} (from document: {documents[index]})")        