Building LangChain ReAct Agents with create_json_chat_agent
Rany ElHousieny, PhD
LangChain offers a powerful way to create agents that use tools. This article focuses on the create_json_chat_agent function, which has agents interact through JSON-based responses, a format well suited to tool-based actions. The goal is to show how to implement create_json_chat_agent with OpenAI, with tools as the main focus. In the previous article we used ZeroShotAgent.from_llm_and_tools; in this one we will use create_json_chat_agent.
This article is a continuation of the previous article:
Key Components
Step-by-Step Implementation
1. Install Dependencies
First, make sure you have LangChain installed:

```shell
pip install langchain openai
```

Depending on your LangChain version, you may also need the langchain-community package, which provides the ChatOpenAI import used below.
2. Define Tools
Create simple tools the agent can invoke. We will use the tool that we used in the previous article:
```python
from langchain.tools import BaseTool

class last_name_for_Rany(BaseTool):
    # Type annotations are required on these fields in newer LangChain
    # (Pydantic v2) versions.
    name: str = "Last name for Rany"
    description: str = "Use this tool to get the last name of Rany"

    def _run(self, expression: str):
        return "ElHousieny"

last_name_for_Rany_tool = last_name_for_Rany()
```
The description of a tool is essential because it tells the language model (LLM) how and when to use the tool correctly. Clear descriptions guide the LLM's decisions during task execution. In addition, prompt engineering is crucial for refining how the model interacts with the tools. By adjusting the prompt, developers can enhance the model's understanding and performance, especially in cases where the LLM might make mistakes or attempt to use tools inappropriately, like handling multiple tasks at once. This process helps improve overall accuracy and efficiency. So, we might go back and adjust the description as we start using the tool.
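As a hypothetical illustration of that kind of refinement (this is not the description used elsewhere in this article), a description can be tightened so the LLM knows exactly when the tool applies and when it does not:

```python
# Hypothetical refined description: explicit about when to use the tool and
# when not to, which helps the LLM avoid calling it inappropriately.
refined_description = (
    "Returns the last name of the person named Rany. "
    "Use ONLY when the user asks for Rany's last name; "
    "do not use it for other people or other questions."
)
print(refined_description)
```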
Let's test the tool:
```python
last_name_for_Rany_tool.run("Rany")  # returns "ElHousieny"
```
Now, add it to the tools:
```python
tools = [last_name_for_Rany_tool]
```
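To see why the tool's name matters, here is a plain-Python sketch (no LangChain required) of what the executor effectively does at run time: it looks the tool up by the name the model emitted and calls it with the model-supplied input.

```python
# Simplified stand-in for the executor's dispatch step: map tool names to
# callables, then invoke the one named in the model's JSON "action" field.
tools_by_name = {
    "Last name for Rany": lambda expression: "ElHousieny",
}

action = "Last name for Rany"   # from the model's JSON "action" field
action_input = "Rany"           # from the model's "action_input" field

observation = tools_by_name[action](action_input)
print(observation)  # ElHousieny
```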
3. Create the Prompt Template
Prompt engineering is very important; that is why I started my course with it. Please refer back to my prompt engineering article if needed.
We need a prompt that instructs the model on how to format responses as JSON:
```python
system = """
You are tasked with solving problems step-by-step, responding with a JSON structure in markdown format.
The JSON should contain:
- thought: your reasoning
- action: the name of the tool
- action_input: the parameters for the tool

You can use the following tools: {tool_names}

Descriptions of these tools:
{tools}

If you have enough information, use the "Final Answer" tool with the solution as its input. Otherwise, continue using the available tools to gather more information.
"""
```
System messages are used to set the behavior, rules, or context for the conversation. They typically define the AI's role, instructions on how it should respond, and the boundaries within which it should operate. System messages guide the AI on how to interact throughout the conversation. They are not visible to the end-user in most interfaces and are primarily used to configure the AI's behavior. This prompt has several distinct sections, each serving a specific function in guiding the agent's behavior:
Purpose of the Sections:
This format is designed to ensure that agents using the create_json_chat_agent function can interact with tools in a systematic and consistent way while maintaining clear reasoning behind their actions.
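For intuition, here is a plain-Python sketch of how the {tool_names} and {tools} placeholders in the system prompt get filled from the tool definitions. In the real agent, create_json_chat_agent does this rendering internally from the tools you pass it.

```python
# Simplified rendering of the system prompt's placeholders from a
# name -> description mapping of the available tools.
tool_descriptions = {
    "Last name for Rany": "Use this tool to get the last name of Rany",
}

tool_names = ", ".join(tool_descriptions)
tools_block = "\n".join(f"{name}: {desc}" for name, desc in tool_descriptions.items())

system_template = (
    "You can use the following tools: {tool_names}\n"
    "Descriptions of these tools:\n"
    "{tools}"
)
rendered = system_template.format(tool_names=tool_names, tools=tools_block)
print(rendered)
```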
````python
human = """
After each markdown snippet, include the word "STOP". Example:

```json
{{"thought": "<your reasoning>",
"action": "<tool name or Final Answer to give a final answer>",
"action_input": "<tool parameters or the final output>"}}
```

STOP

This is my query="{input}". Write only the next step needed to solve it.
Remember to add STOP after each snippet.
"""
````
In the context of LangChain and agents, the human message acts as the input provided to the agent. It defines the task or query that the agent needs to solve, outlining the structure for how the agent should respond. It’s essentially the instruction layer that helps guide the agent on how to interact with tools, process data, and return outputs.
Breakdown of the Human Message:
Explanation of the Parts:
1. Instruction for JSON Structure:
- "After each markdown snippet, include the word 'STOP'": This ensures that each JSON response ends with the word "STOP," which acts as a separator, allowing the agent to send and execute actions step-by-step.
2. JSON Example:
- This section shows how the agent should format its responses to comply with the ReAct pattern. The structure includes:
- thought: Describes the agent's reasoning process.
- action: Specifies the name of the tool the agent will use or signals that the final answer is ready.
- action_input: The parameters or inputs needed for the tool or the final result.
3. Query Input:
- "{input}": This placeholder represents the actual query or problem posed by the user. The agent must process this query based on the available tools and previously gathered information.
4. Step-Based Instruction:
- "Write only the next step needed to solve it": Guides the agent to approach problem-solving incrementally, responding one step at a time, rather than jumping to a final answer immediately.
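The "STOP" marker works together with the stop_sequence setting used when the agent is created: generation is cut off at the first "STOP", so only a single JSON step reaches the parser per turn. A minimal sketch of that truncation in plain Python:

```python
# Sketch of what a stop sequence does to the raw model output: everything
# from the first "STOP" onward is discarded before parsing.
raw_output = (
    '```json\n'
    '{"thought": "I need the last name.", '
    '"action": "Last name for Rany", '
    '"action_input": "Rany"}\n'
    '```\n'
    'STOP\n'
    'Some extra rambling the model might have produced.'
)

one_step = raw_output.split("STOP", 1)[0]
print(one_step)
```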
Why Is It Necessary?
- Structure & Clarity: The human message helps the agent follow a structured format, ensuring the use of JSON responses, which makes the agent's actions clear and traceable.
- Incremental Problem-Solving: By instructing the agent to solve tasks step-by-step, it prevents the agent from making assumptions and encourages reasoning for each action.
- Tool Guidance: By explicitly outlining how and when to use tools, the human message helps optimize the agent’s tool selection process.
In summary, the human message plays a critical role in ensuring that the agent interacts logically and incrementally while providing structured, predictable outputs.
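Concretely, a well-formed first step from the agent for our sample query might look like the snippet below. This is a hypothetical model response, parsed here with Python's json module the way the agent's output parser would handle it.

```python
import json

# Hypothetical model response for "What is the last name for Rany?",
# following the format the human message prescribes.
step = '''
{"thought": "The user wants Rany's last name, so I should call the lookup tool.",
 "action": "Last name for Rany",
 "action_input": "Rany"}
'''

parsed = json.loads(step)
print(parsed["action"])        # tool the executor will call
print(parsed["action_input"])  # argument passed to the tool
```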
```python
from langchain_core.prompts import ChatPromptTemplate, MessagesPlaceholder

prompt = ChatPromptTemplate.from_messages(
    [
        ("system", system),
        MessagesPlaceholder("chat_history", optional=True),
        ("human", human),
        MessagesPlaceholder("agent_scratchpad"),
    ]
)
```
4. Initialize the Agent
We’ll use OpenAI's gpt-3.5-turbo model and create the agent:
```python
from langchain_community.chat_models import ChatOpenAI
# Note: newer LangChain versions provide this as
# `from langchain_openai import ChatOpenAI`.

llm = ChatOpenAI(model="gpt-3.5-turbo")  # use OpenAI for simplicity
```
```python
from langchain.agents import create_json_chat_agent, AgentExecutor

agent = create_json_chat_agent(
    tools=tools,
    llm=llm,
    prompt=prompt,
    stop_sequence=["STOP"],
    template_tool_response="{observation}",
)
```
```python
agent_executor = AgentExecutor(
    agent=agent, tools=tools, verbose=True, handle_parsing_errors=True
)

agent_executor.invoke({"input": "What is the last name for Rany?"})
```
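If the run succeeds, invoke returns a dictionary echoing the input alongside the final answer. A sketch of the expected shape (the actual object can also carry intermediate steps, depending on configuration):

```python
# Illustrative shape of the AgentExecutor.invoke result for this query;
# the final answer comes from the tool, which returns "ElHousieny".
result = {
    "input": "What is the last name for Rany?",
    "output": "ElHousieny",
}
print(result["output"])
```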
Let's check the Galileo traces. [Trace screenshots from the original post: a high-level view covering the first input through the last output, and a detailed view of the full flow.] As the traces show, this chain is longer than in the previous article, and the tool invocation appears as its own step in the flow.
Conclusion
In this article, we demonstrated how to create a JSON-based chat agent using create_json_chat_agent in LangChain. We showed how to set up tools, define prompts, and initialize the agent using OpenAI. This framework enables agents to perform various tasks by leveraging external tools, making them highly adaptable for real-world applications.