登录查看更多内容

A Guide To Integrating Pythia With AWS Bedrock

Wisecube

Accelerating biomedical research by synthesizing billions of data points

发布日期: 2024年9月26日

Amazon Web Services (AWS) offers a fully managed AWS Bedrock that streamlines generative AI application development. AWS Bedrock offers developers pre-trained generative AI foundation models and customization tools. The foundation models are the building blocks in generative AI applications, and customization tools allow customizing these models according to specific use cases.

Wisecube’s Pythia further boosts the performance of generative AI applications with continuous hallucination monitoring and analysis. Real-time AI hallucination detection directs developers toward continuously improving LLMs, resulting in reliable outputs.?

In this guide, we’ll integrate Wisecube Pythia with AWS Bedrock using the Wisecube Python SDK.?

Getting an API key

To authenticate Wisecube Pythia, you need a unique API key. To get your unique API key, fill out the API key request form with your email address and the purpose of the API request.?

Installing Wisecube

You must install the Wisecube Python SDK in your Python environment to detect AI hallucinations with Pythia. Copy the following command in your Python console and run the code to install Wisecube:

pip install wisecube

Installing AWS, LangChain, and Vector Database

Developing an LLM with AWS Bedrock requires installing AWS, LangChain, and a vector database. The code snippet below installs the following libraries:

awscli: Facilitates interaction with AWS from the terminal.
boto3: Provides an interface to interact with Amazon Web Services.
langchain: Allows to create Natural Language Processing (NLP) pipelines.
faiss-cpu: Implements the CPU-only version of the Faiss database.

pip install boto3
pip install awscli
pip install wisecube
pip install langchain
pip install faiss-cpu

Authenticating API key

You need to authenticate the Wisecube API key to interact with Pythia. Copy and run the following command to authenticate your API key:

from wisecube_sdk.client import WisecubeClientAPI_KEY = "YOUR_API_KEY"
client = WisecubeClient(API_KEY).client

Developing LLM with AWS Bedrock

Developing an LLM with AWS Bedrock goes through the following steps:

Import Required Libraries

The following libraries are required to build an NLP pipeline and interact with AWS. Copy and run the code snippet below to import these libraries:?

import boto3import json
from langchain.embeddings import BedrockEmbeddings
from langchain.indexes import VectorstoreIndexCreator
from langchain.vectorstores import FAISS
from langchain.text_splitter import CharacterTextSplitter
from langchain.document_loaders.csv_loader import CSVLoader

Create a Bedrock Client

Next, you need to create a Bedrock client to use the service finally. service_name specifies that we’re using bedrock_runtime service, and region_name sets the region to us-east-1, which can differfor your configuration. Lastly, we define modelId, accept, and contentType variables to specify your pre-trained model and set the data to JSON format. We’re using the Amazon Titan Express model here.

bedrock = boto3.client(service_name='bedrock-runtime', 
region_name='us-east-1')

modelId = 'amazon.titan-text-express-v1'
accept = 'application/json'
  
contentType = 'application/json'

Build an LLM and Generate Response

Now, you can build your LLM using the model specified above. This begins with building the request body, which includes the inputText that specifies the user query and textGenerationConfig. This defines the configuration for the text generation process.

The following code converts the body into a JSON object and uses bedrock.invoke_model method to send the user query to the AI model. Lastly, it extracts the raw content from the response using response.get(“body”).read() and converts the raw content to JSON format using json.loads.

After we get an LLM response, response_body['results'][0]['outputText'] extracts only the string part from the response because Pythia accepts arguments in string format.

body = {? ? ?
        "inputText": “What are the symptoms of type 2 diabetes?”,? ? 
        "textGenerationConfig": {? ? ?
             "maxTokenCount": 4096,? ? ? ? ? 
             "stopSequences": ["User:"],? ? ? ? ? ?
             "temperature": 0,? ? ? ? ? ?
             "topP": 1? ? ? ? 
             }? ? 
         }

body=json.dumps(body)

response = bedrock.invoke_model(body=body, modelId=modelId, 
accept=accept, contentType=contentType)
response_body=json.loads(response.get("body").read())
repsonse_text=response_body['results'][0]['outputText']

领英推荐

AI for the rest of us

GitHub 1 年前

OpenAI Hype Cycle

AIM 1 年前

The Future of AI Tech Stacks

Udit Goenka 2 个月前

Using Pythia to Detect Hallucinations

Now that you’ve got an LLM that generates responses based on user queries, you can integrate it with Pythia to detect real-time hallucinations. To do this, you need to store data in a vector database, which will act as a reference in Pythia for fact verification. This can be achieved in two simple steps:?

Use Retrieved Data as Reference

The following code snippet loads diabetes.csv data, generates vector embeddings for data, and creates two functions, get_index() and get_similarity_search_results(). The get_index() function returns an in-memory vector database to be used in the application.

The get_similarity_search_results() function retrieves similar data points from the vector database based on the input vector. Lastly, it flattens the retrieved similar data points and returns them.

embeddings = BedrockEmbeddings() #create a Titan Embeddings client

loader = CSVLoader(file_path="diabetes.csv")

documents = loader.load()
  
index_creator = VectorstoreIndexCreator( ? 
   vectorstore_cls=FAISS, ? 
   embedding=embeddings, ? 
   text_splitter=CharacterTextSplitter(chunk_size=300, chunk_overlap=0),? ? 
   )

def get_index(): #returns an in-memory vector store to be used in the application
? ? index_from_loader = index_creator.from_loaders([loader])
? ? return index_from_loader

def get_similarity_search_results(index, question):????
    results = index.vectorstore.similarity_search_with_score(question)????
    flattened_results = [{"content":res[0].page_content, "score":res[1]} 
for res in results] #flatten results for easier display and handling?

return flattened_results

Use Pythia To Detect Hallucinations

Now, we can use Pythia to detect real-time hallucinations in LLM responses using the references retrieved in the previous step. To do this, we define our question, which is the same as the question we passed in the body object above. Then, we create a vector index using the get_index() function and retrieve the reference with get_similarity_search_results function. Don’t forget to extract the string portion from the reference like we did for the response above.

Lastly, the client.ask_pythia detects hallucinations based on reference, response, and question provided to it. Note that our response is passed as response_text in the following code because our LLM responses are stored in the response_text variable.

question = “What are the symptoms of type 2 diabetes?”
index = get_index()
reference = get_similarity_search_results(index, question)[0]["content"]
client.ask_pythia(reference,response_text,question)

The final output for our query is in the screenshot below, where SDK Response categorizes LLM claims into relevant classes, including entailment, contradiction, neutral, and missing facts. Finally, it highlights the LLM's overall performance with the percentage contribution of each class in the metrics dictionary.

Full Code

The steps we discussed are laid out in a procedural approach to make it easier to understand. However, compiling logic into functions is recommended in Python applications to make the code reusable, clean, and maintainable. Therefore, we compile the logic to develop an LLM with AWS Bedrock and use Pythia to detect AI hallucinations into reusable functions:

pip install wisecube
pip install boto3
pip install awscli
pip install langchain
pip install faiss-cpu
  
import boto3
import json
  
from wisecube_sdk.client import WisecubeClient
from wisecube_sdk.model_formats import OutputFormat, WisecubeModel
from langchain.embeddings import BedrockEmbeddings
from langchain.indexes import VectorstoreIndexCreator
from langchain.vectorstores import FAISS
from langchain.text_splitter import CharacterTextSplitter
from langchain.document_loaders.csv_loader import CSVLoader
  
API_KEY = "YOUR_API_KEY"
client = WisecubeClient(API_KEY).client
  
embeddings = BedrockEmbeddings() #create a Titan Embeddings client
  
loader = CSVLoader(file_path="diabetes.csv")
  
documents = loader.load()
  
index_creator = VectorstoreIndexCreator( ? 
  vectorstore_cls=FAISS, ? 
  embedding=embeddings, ? 
  text_splitter=CharacterTextSplitter(chunk_size=300, chunk_overlap=0),? ? 
  )

def get_index(): #returns an in-memory vector store to be used in the application
? ? index_from_loader = index_creator.from_loaders([loader])? ? 
    return index_from_loader
      
def get_similarity_search_results(index, question):? ? 
    results = index.vectorstore.similarity_search_with_score(question)? ? 
      
    flattened_results = [{"content":res[0].page_content, "score":res[1]} 
for res in results] #flatten results for easier display and handling? ? 
      
return flattened_results
  
bedrock = boto3.client(service_name='bedrock-runtime', 
region_name='us-east-1')
  
modelId = 'amazon.titan-text-express-v1'
accept = 'application/json'
contentType = 'application/json'
  
def bedrock_and_sdk_response(question) :? ? 
    body = {? ? ? ? 
        "inputText":? question,? ? ? ? 
        "textGenerationConfig": {? ? ? ? ? ? 
            "maxTokenCount": 4096,? ? ? ? ? ? 
            "stopSequences": ["User:"],? ? ? ? ? ?
            "temperature": 0,? ? ? ? ? ? 
            "topP": 1? ? ? ? 
        }? ? 
    }? ? 
      
    body=json.dumps(body)? ? 
      
    response = bedrock.invoke_model(body=body, modelId=modelId, 
accept=accept, contentType=contentType)? ? 
    response_body=json.loads(response.get("body").read())????
    repsonse_text=response_body['results'][0]['outputText']? ? 
      
    index = get_index()? ? 
    reference = get_similarity_search_results(index, question)[0]["content"]? ? 
    
    response_from_pythia = client.ask_pythia(reference,response_text,question)? ? 
      
    return response_body, response_from_pythia
      
question ="What are the symptoms of type 2 diabetes?"
  
bedrock_and_sdk_response(question)

Benefits of Using Pythia with AWS Bedrock

Pythia offers a range of benefits when integrated into your workflows. These benefits allow LLM developers to continually improve their systems while tracking performance with the help of Pythia. The benefits of integrating Pythia with AWS Bedrock include:

Advanced Hallucination Detection

Pythia uses a billion-scale knowledge graph with 10 billion biomedical facts and 30 million biomedical articles to verify LLM claims and detect hallucinations. Pythia extracts claims from LLM responses in the form of knowledge triplets and verifies them against the billion-scale knowledge graph. Together, these increase the contextual understanding and reliability of LLMs.

Real-time Monitoring

Pythia continuously monitors LLM responses against relevant references and generates an audit report. This allows developers to address risks and fix hallucinations as soon as they occur.?

Robust LLMs

Real-time hallucination detection against a vast range of information promises the development of robust LLMs. These LLMs generate reliable outputs, resulting in disruptive biomedical research.?

Enhanced Trust

Reliable LLMs improve company reputation and enhance user trust in AI. Users are more likely to adopt AI systems when they trust AI.?

Privacy Protection

Pythia protects customer data so developers can focus on the LLM performance without worrying about losing their data. This makes Pythia a trusted hallucination detection tool.

Contact us today to get started with Pythia and build reliable LLMs to speed up your research process and enhance user trust.

The article was originally published on Pythia's website.

Joy Curtis

1 个月

Our team has been exploring hallucinations too

1 次回应

查看更多评论

要查看或添加评论，请登录

Wisecube的更多文章

See all articles

A Guide To Integrating Pythia With AWS Bedrock

Wisecube

Accelerating biomedical research by synthesizing billions of data points

Getting an API key

Installing Wisecube

Installing AWS, LangChain, and Vector Database

Authenticating API key

Developing LLM with AWS Bedrock

Import Required Libraries

Create a Bedrock Client

Build an LLM and Generate Response

领英推荐

Using Pythia to Detect Hallucinations

Use Retrieved Data as Reference

Use Pythia To Detect Hallucinations

Full Code

Benefits of Using Pythia with AWS Bedrock

Advanced Hallucination Detection

Real-time Monitoring

Robust LLMs

Enhanced Trust

Privacy Protection

Wisecube的更多文章

社区洞察

其他会员也浏览了

Importance of Frameworks in AI

Importance of Frameworks in AI

Top Data Analytics Skills and Platforms for 2023, PyTorch 2.0 Released, and 5 Huge Data Science Career?Mistakes

Why LLMs Hallucinate; GraphGPT; Inside Microsoft’s small LLM; Deploy Tiny Llama on AWS EC2; Fine-Tune LLM using PyTorch; and More

OpenAI Hype Cycle

Introducing Gemma: New Open Source Model from Google outperformed Llama 2 and Mistral Models!

Unlocking the Power of LLMs: A Deep Dive into Streamlit, Azure OpenAI, and LangChain

Implementing AdaGrad Optimizer in Spark

End to end LLMOps Pipeline - Part 2 - FastAPI

The Rise of AI-Powered Code Generation Tools: How Developers are Accelerating Workflow

Getting an API key

Installing Wisecube

Installing AWS, LangChain, and Vector Database

Authenticating API key

Developing LLM with AWS Bedrock

Import Required Libraries

Create a Bedrock Client

Build an LLM and Generate Response

领英推荐

Using Pythia to Detect Hallucinations

Use Retrieved Data as Reference

Use Pythia To Detect Hallucinations

Full Code

Benefits of Using Pythia with AWS Bedrock

Advanced Hallucination Detection

Real-time Monitoring

Robust LLMs

Enhanced Trust

Privacy Protection

Wisecube的更多文章

A Guide to Integrating Pythia API with RAG-based Systems Using Wisecube Python SDK

AI Compliance and Governance: Meeting Regulatory Standards with Pythia

How AI Hallucinations Impact Business Operations and Reputation

A Guide to Integrating the Pythia API Using Wisecube Python SDK

Why AI Models Fail in Production: Common Issues and How Observability Helps

How AI Observability Enhances Model Reliability and Diagnoses Issues Faster

The Role of Knowledge Graphs in Enhancing AI Accuracy

Evaluating LLM Hallucination Detectors

Orpheus Vs. Competition: Why Orpheus is the Better AI Drug Discovery Tool?

A Guide To Integrating Pythia With Text Summarizers

社区洞察

其他会员也浏览了

Importance of Frameworks in AI

Importance of Frameworks in AI

Top Data Analytics Skills and Platforms for 2023, PyTorch 2.0 Released, and 5 Huge Data Science Career?Mistakes

Why LLMs Hallucinate; GraphGPT; Inside Microsoft’s small LLM; Deploy Tiny Llama on AWS EC2; Fine-Tune LLM using PyTorch; and More

OpenAI Hype Cycle

Introducing Gemma: New Open Source Model from Google outperformed Llama 2 and Mistral Models!

Unlocking the Power of LLMs: A Deep Dive into Streamlit, Azure OpenAI, and LangChain

Implementing AdaGrad Optimizer in Spark

End to end LLMOps Pipeline - Part 2 - FastAPI

The Rise of AI-Powered Code Generation Tools: How Developers are Accelerating Workflow