How to work with LangChain and Oracle Generative AI
In this article, we will learn how to work with LangChain's open-source building blocks, components, and LLM integrations; Streamlit, an open-source Python framework for data scientists and AI/ML engineers; and Oracle Generative AI to build the next generation of intelligent applications.
We will do the groundwork of setting up the OCI command-line interface, create a simple LangChain app, write an interactive healthcare application, and build a search application that can scan through a training PDF document.
Does it sound exciting? Let us get started.
If you are new to Oracle Generative AI, I recommend reading my previous article How to build intelligent applications using Oracle Generative AI and Oracle APEX.
What is LangChain?
LangChain is a framework for developing applications powered by large language models (LLMs).
LangChain simplifies every stage of the LLM application lifecycle. With its flexible framework, we can build context-aware reasoning applications that leverage your company's data and APIs, and future-proof the application by making vendor optionality part of the LLM infrastructure design.
What is Streamlit?
A faster way to build and share data apps
Streamlit turns data scripts into shareable web apps in minutes, all in pure Python. No front-end experience is required.
Update Jan 2025:
The cohere.command, cohere.command-light, and meta.llama-2-70b-chat models (among a few others) have been retired over the last few months, so any application using these models will fail with an error such as "Entity with key <model-name> not found" or "Error 400: <model-name> does not support TextGeneration". Please see the list of models that are already retired or scheduled to retire.
For updated LangChain with Oracle GenAI source code, please visit the following GitHub repository.
A short demo video of what we plan to build in this article
How does LangChain work with Oracle Generative AI?
This is a 3-step process.
Assumption:
Step 1: Installing OCI CLI and Configuring
Get User's OCID
On the same page, click the Add API Key button.
Generate and Download RSA Key Pair
You should now see the configuration file preview. Copy and paste it into a file; we will need it later.
Click the Close button.
We can now see the newly created fingerprint.
Get Compartment OCID
Under Identity > Compartments, note your compartment OCID; we will need this OCID later.
Install OCI Command Line Interface (CLI)
/bin/bash -c "$(curl -fsSL https://raw.githubusercontent.com/Homebrew/install/HEAD/install.sh)"
brew update && brew install oci-cli
For other operating systems and more details on the OCI CLI, please check this link. You can also refer to this video if you have issues setting up the OCI command-line interface.
Update OCI Configuration file
Update the DEFAULT profile values to match your OCI parameters
vi ~/.oci/config
The parameters will look something like this
[DEFAULT]
user=ocid1.user.oc1..aaaaaaXXXX63smy6knndy5q
fingerprint=d5:84:a7:0e:XXXX:ed:11:1a:50
tenancy=ocid1.tenancy.oc1..aaaaaaaaj4XXXymyo4xwxyv3gfa
region=us-phoenix-1 #(replace-with-your-oci-region)
key_file=/your-folder/your-oci-key.pem
Reality check: list all buckets in a compartment to verify that the configuration is correct. Provide the OCID of the compartment where your OCI buckets were created.
-- set chmod on .pem file --
chmod 600 /your-folder/your-oci-key.pem
-- get tenancy namespace --
oci os ns get
{
"data": "yournamespace"
}
-- run oci cli to list buckets in OCI Object Storage --
oci os bucket list --compartment-id ocid1.compartment.oc1..aXXXn4hgg
-- expect a similar JSON output without errors --
{
"compartment-id": "ocid1.compartment.oc1..aXXX32q",
"defined-tags": null,
"etag": "25973f28-5125-4eae-a37c-73327f5c2644",
"freeform-tags": null,
"name": "your-bucket-name",
"namespace": "your-tenancy-namespace",
"time-created": "2023-03-26T16:18:17.991000+00:00"
}
If this lists the buckets in the compartment (or returned your tenancy namespace earlier), we are good to move forward; if you don't have a bucket yet, create one in OCI Object Storage and test again. If there are issues, check the troubleshooting section at the end of this article.
Step 2: Install the required Python libraries
Please add any additional libraries as required to run the Python code.
python3 --version
Python 3.10.4
pip install -U langchain langchain-core langchain-community langchain-cli langchain_cohere cohere langgraph langsmith streamlit oci
Step 3: Write and Run the Python code.
The basic code to get the Oracle LLM handle is shown below:
from langchain_community.llms import OCIGenAI
...
llm = OCIGenAI(
    model_id="cohere.command",
    service_endpoint="https://inference.generativeai.us-chicago-1.oci.oraclecloud.com",
    compartment_id="<Your-Compartment-Id>",
    model_kwargs={"temperature": 0.7, "top_p": 0.75, "max_tokens": 1000},
)
To get OCI Generative AI Embeddings
from langchain_community.llms import OCIGenAI
from langchain_community.embeddings import OCIGenAIEmbeddings
...
embeddings = OCIGenAIEmbeddings(
    model_id="cohere.embed-english-light-v3.0",
    service_endpoint="https://inference.generativeai.us-chicago-1.oci.oraclecloud.com",
    compartment_id="<Your-Compartment-Id>",
)
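What are the embeddings for? The embedding model turns text into numeric vectors so that document chunks can be ranked by similarity to a query. A standard-library-only sketch of that ranking step, with stub vectors standing in for real `embed_query`/`embed_documents` output:

```python
# Similarity ranking with stub vectors. In the real application, the vectors
# come from embeddings.embed_query(question) / embeddings.embed_documents(pages).
import math

def cosine(a, b):
    # Cosine similarity: dot product divided by the product of vector norms.
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

query_vec = [0.9, 0.1, 0.0]            # stand-in for embed_query(question)
chunk_vecs = {                         # stand-ins for embed_documents(pages)
    "budget chunk": [0.8, 0.2, 0.1],
    "unrelated chunk": [0.0, 0.1, 0.9],
}

# Pick the chunk whose vector is closest to the query vector.
best = max(chunk_vecs, key=lambda k: cosine(query_vec, chunk_vecs[k]))
print(best)  # -> budget chunk
```

This nearest-vector lookup is exactly what the Chroma vector store automates for us in the PDF examples later in this article.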
Example 1: Simple Generative AI Console App
Let's start with basic Python code in the console to check connectivity, authentication, and the AI response.
Download basic.py, run the Python code, and view the output.
python3 basic.py
The Egyptians built the pyramids. The Egyptian pyramids are ancient pyramid-shaped masonry structures located in Egypt.
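The script itself boils down to constructing the handle from Step 3 and printing one completion. A hypothetical reconstruction (the downloadable basic.py may differ); the import sits inside the function so the sketch loads even where LangChain is not installed:

```python
# Hypothetical reconstruction of basic.py; the actual download may differ.
def ask(question: str) -> str:
    # Imported here so the module loads even without langchain installed.
    from langchain_community.llms import OCIGenAI

    # Constructing the handle requires a valid ~/.oci/config (see Step 1).
    llm = OCIGenAI(
        model_id="cohere.command",
        service_endpoint="https://inference.generativeai.us-chicago-1.oci.oraclecloud.com",
        compartment_id="<Your-Compartment-Id>",
        model_kwargs={"temperature": 0.7, "top_p": 0.75, "max_tokens": 1000},
    )
    return llm.invoke(question)

# Usage (needs OCI credentials configured):
# print(ask("Who built the pyramids?"))
```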
Example 2: Simple Generative AI Web Application with Input
In this simple generative AI application, we will simply take an input prompt and display the AI result.
Download quickstart.py and run it from your laptop or desktop command prompt
% streamlit run quickstart.py
You can now view your Streamlit app in your browser.
Local URL: http://localhost:8501
Network URL: http://192.168.X.Y:8501
Example 3: Simple Healthcare AI Application
Build your first LangChain and Oracle Generative AI-based healthcare application. This application takes a disease name as input and builds a chain that lists the disease, its symptoms, and first aid steps.
Code: Get the required imports and initialise the Oracle Generative AI LLM
Build the Streamlit framework and input prompts.
Initialise Memory and Chains
Set SequentialChain and Print output on the browser
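The sequential flow above (disease, then symptoms, then first aid) can be sketched offline with the standard library only; the template and function names here are hypothetical, and a lambda stands in for the Oracle LLM call. In symptoms.py the same flow is expressed with LangChain prompts and SequentialChain:

```python
# Offline sketch of the symptoms.py flow: the answer to the first prompt
# feeds the second prompt, which is what SequentialChain automates.
symptom_template = "List the common symptoms of {disease}."
firstaid_template = "Given these symptoms:\n{symptoms}\nSuggest basic first aid."

def run_chain(disease, ask):
    """Run the two prompts in sequence; `ask` is the LLM call (e.g. llm.invoke)."""
    symptoms = ask(symptom_template.format(disease=disease))
    first_aid = ask(firstaid_template.format(symptoms=symptoms))
    return {"disease": disease, "symptoms": symptoms, "first_aid": first_aid}

# Exercise the wiring with a stub in place of the Oracle LLM:
demo = run_chain("influenza", lambda prompt: "[model answer]")
print(demo["first_aid"])  # -> [model answer]
```

Swapping the stub for the real `llm.invoke` from Step 3 turns this wiring into a working chain.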
Download symptoms.py and run the code.
Run the Application
% streamlit run symptoms.py
You can now view your Streamlit app in your browser.
Local URL: http://localhost:8501
Network URL: http://192.168.X.Y:8501
Example 4: PDF Search AI Application
A smart application that can read and search PDFs and provide AI output for a given prompt. Download the Budget PDF and put it in the same directory as the Python code.
Prompt: "What is Amrit Kaal?" Let's check the PDF document.
from langchain_community.document_loaders import PyPDFLoader
from langchain_community.llms import OCIGenAI
from langchain_core.prompts import PromptTemplate
from langchain.schema.output_parser import StrOutputParser
from langchain.schema.runnable import RunnablePassthrough
from langchain_community.embeddings import OCIGenAIEmbeddings
from langchain_community.vectorstores import Chroma
loader = PyPDFLoader("budget_speech.pdf")
pages = loader.load_and_split()
llm = OCIGenAI(
    model_id="cohere.command",
    service_endpoint="https://inference.generativeai.us-chicago-1.oci.oraclecloud.com",
    compartment_id="<Your-Compartment-Id>",
    model_kwargs={"temperature": 0.7, "top_p": 0.75, "max_tokens": 1000},
)

embeddings = OCIGenAIEmbeddings(
    model_id="cohere.embed-english-light-v3.0",
    service_endpoint="https://inference.generativeai.us-chicago-1.oci.oraclecloud.com",
    compartment_id="<Your-Compartment-Id>",
)

# Index the PDF pages in an in-memory Chroma vector store.
vectorstore = Chroma.from_documents(pages, embedding=embeddings)
retriever = vectorstore.as_retriever()

template = """
{context}

Indian Budget Speech : {input}
"""
prompt = PromptTemplate.from_template(template)

# Retrieval-augmented chain: fetch context, fill the prompt, call the LLM.
chain = (
    {"context": retriever, "input": RunnablePassthrough()}
    | prompt
    | llm
    | StrOutputParser()
)
print(chain.invoke("What is Amrit Kaal"))
Run the code in the terminal.
python3 searchpdf.py
-- Output from Generative AI after searching the PDF --
The Indian Budget Speech outlines the government's plans and priorities for the fiscal year 2024-25. One of the key focus areas highlighted in the speech is the concept of "Amrit Kaal," which translates to "Era of Immortality" or "Golden Era."
The government aims to foster inclusive and sustainable development to improve productivity, create opportunities for all, and contribute to the generation of resources to power investments and fulfill aspirations.
To achieve this, the government intends to adopt economic policies that facilitate growth and transform the country. This includes ensuring timely and adequate finances, relevant technologies, and appropriate training for Micro, Small and Medium Enterprises (MSME) to compete globally.
....
Change the prompt and re-run the same code
-- update the code --
print(chain.invoke("What is India's fiscal deficit"))
-- run the code --
python3 searchpdf.py
The AI output after searching through the Budget PDF:
India's fiscal deficit for the year 2024-25 is estimated to be 5.1% of GDP, according to a budget speech by the country's Finance Minister Nirmala Sitharaman. This is in line with the government's goal to reduce the fiscal deficit to below 4.5% by 2025-26.
Would you like me to tell you more about India's budgetary process or fiscal policies?
Example 5: PDF Search AI Application Streamlit version
import streamlit as st
from langchain_community.document_loaders import PyPDFLoader
from langchain_community.llms import OCIGenAI
from langchain_core.prompts import PromptTemplate
from langchain.schema.output_parser import StrOutputParser
from langchain.schema.runnable import RunnablePassthrough
from langchain_community.embeddings import OCIGenAIEmbeddings
from langchain_community.vectorstores import Chroma

st.title('PDF AI Search Application')

def generate_response(input_text):
    # Load and split the PDF into pages.
    loader = PyPDFLoader("budget_speech.pdf")
    pages = loader.load_and_split()
    llm = OCIGenAI(
        model_id="cohere.command",
        service_endpoint="https://inference.generativeai.us-chicago-1.oci.oraclecloud.com",
        compartment_id="<Your-Compartment-Id>",
        model_kwargs={"temperature": 0.7, "top_p": 0.75, "max_tokens": 300},
    )
    embeddings = OCIGenAIEmbeddings(
        model_id="cohere.embed-english-light-v3.0",
        service_endpoint="https://inference.generativeai.us-chicago-1.oci.oraclecloud.com",
        compartment_id="<Your-Compartment-Id>",
    )
    # Index the pages and build the retrieval-augmented chain from Example 4.
    vectorstore = Chroma.from_documents(pages, embedding=embeddings)
    retriever = vectorstore.as_retriever()
    template = """
{context}

Indian Budget Speech : {input}
"""
    prompt = PromptTemplate.from_template(template)
    chain = (
        {"context": retriever, "input": RunnablePassthrough()}
        | prompt
        | llm
        | StrOutputParser()
    )
    st.info(chain.invoke(input_text))

with st.form('my_form'):
    text = st.text_area('Enter Search:', 'What is Amrit Kaal?')
    submitted = st.form_submit_button('Submit')
    if submitted:
        generate_response(text)

# streamlit run pdfsearch.py
Download Python Code and Budget PDF used
Troubleshooting:
Error Message
Traceback (most recent call last):
File "/some-folder/basic.py", line 3, in <module>
llm = OCIGenAI(
File "/Library/Frameworks/Python.framework/Versions/3.10/lib/python3.10/site-packages/langchain_core/load/serializable.py", line 120, in __init__
super().__init__(**kwargs)
File "/Library/Frameworks/Python.framework/Versions/3.10/lib/python3.10/site-packages/pydantic/v1/main.py", line 341, in __init__
raise validation_error
pydantic.v1.error_wrappers.ValidationError: 1 validation error for OCIGenAI
__root__
Could not authenticate with OCI client. Please check if ~/.oci/config exists. If INSTANCE_PRINCIPLE or RESOURCE_PRINCIPLE is used, Please check the specified auth_profile and auth_type are valid. (type=value_error)
Solution: This is an authentication issue; verify the config file settings.
vi ~/.oci/config
[DEFAULT]
user=ocid1.user.oc1..aaaaaaaaompuufgfXXXXndy5q
fingerprint=d5:84:a7:0e:bf:43:XXXXX:11:1a:50
tenancy=ocid1.tenancy.oc1..aaaaaaaaXXXXXgfa
region=us-phoenix-1
key_file=/Users/somefolder/oci_api_key_xxx.pem
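To rule out simple typos, a standard-library-only checker (a hypothetical helper, not part of the OCI SDK) can confirm the DEFAULT profile contains every key the SDK expects:

```python
# Sanity-check ~/.oci/config with only the standard library: report any key
# the OCI SDK expects that is missing from the chosen profile.
import configparser
import os

REQUIRED = ("user", "fingerprint", "tenancy", "region", "key_file")

def missing_oci_keys(path="~/.oci/config", profile="DEFAULT"):
    cfg = configparser.ConfigParser()
    cfg.read(os.path.expanduser(path))  # silently yields an empty config if absent
    section = cfg[profile] if profile in cfg else {}
    return [k for k in REQUIRED if k not in section]

# Example: missing_oci_keys() == [] means the profile looks complete.
```

An empty list does not prove the values are valid (the fingerprint or key file could still be wrong), but a non-empty list pinpoints which required entry is absent.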
Thanks for reading, liking and sharing
Regards, Madhusudhan Rao