登录查看更多内容

OpenAI API Guide: Using JSON Mode

Cohen Reuven

发明家“IaaS”，天使投资人，成长黑客，导师

发布日期: 2023年12月13日

This is an advanced how-to focusing and how I built the GuardRail system using the new JSON mode for OpenAI API

OpenAI’s API now features a JSON mode, streamlining response structuring and enhancing integration capabilities. As a practical example, I’ve developed GuardRail, an open-source project utilizing this mode, showcasing how JSON-formatted outputs can significantly improve system interactions and data processing in OpenAI applications.

A Few Practical Uses for JSON Mode:

Automated Data Analysis: JSON mode is ideal for applications that require automated analysis of large datasets, such as customer feedback analysis, market research, or social media monitoring.
Enhanced Chatbots and Virtual Assistants: Integrating JSON mode allows for more structured and nuanced responses, improving the quality of interactions in chatbots and virtual assistants across customer service, healthcare, and e-commerce platforms.
Personalized Content Recommendations: JSON mode can be used in content recommendation systems to parse user preferences and feedback efficiently, leading to more accurate and personalized content suggestions.
Natural Language Processing (NLP) Tasks: For tasks like sentiment analysis, language translation, or summarization, JSON mode provides a structured way to receive and process large volumes of text data.

Enabling JSON Mode

To enable JSON mode, set the response_format parameter to { "type": "json_object" }. This configuration is crucial for receiving outputs in JSON format.

Important Notes:

Explicit JSON Instructions: When using JSON mode, explicitly instruct the model to output JSON in your prompts. Without this, the output may consist of endless whitespace or appear ‘stuck’.
Truncated Outputs: Be aware that outputs might be partially cut off if finish_reason is “length”, indicating that the generation exceeded the token limit.

Seed Parameter (Beta Feature)

Deterministic Sampling: The seed parameter allows for deterministic results. Repeated requests with the same seed and parameters should yield identical outcomes.
Not Guaranteed: Determinism is not guaranteed. Monitor changes using the system_fingerprint response parameter.

Sample Code for Data Analysis & Guidance (based on GuardRail)

Below is an example script (prompts.py) for various analysis types using the JSON mode. It includes definitions for different analysis types and corresponding JSON schemas.

# OpenAI Data Analysis & Guiderails Script
# prompts.py - by @rUv

# Analysis Types with Descriptions
# These descriptions define what each analysis type does.
ANALYSIS_TYPES = {
    "sentiment_analysis": "Analyze the sentiment of the provided text. Determine whether the sentiment is positive, negative, or neutral and provide a confidence score.",
    "text_summarization": "Summarize the provided text into a concise version, capturing the key points and main ideas."
    # Add more analysis types as needed
}

# JSON Schemas for Each Analysis Type
# These schemas define the JSON structure for each analysis type's output.
JSON_SCHEMAS = {
    "sentiment_analysis": {
        "sentiment": "string (positive, negative, neutral)",
        "confidence_score": "number (0-1)"
        # Include additional fields as required
    },
    "text_summarization": {
        "summary": "string",
        "key_points": "array of strings",
        "length": "number (number of words in summary)"
        # Include additional fields as required
    }
    # Add more JSON schemas for other analysis types
}

# Template for Generating System Prompts
STANDARD_PROMPT_TEMPLATE = "You are a data analysis assistant capable of {analysis_type} analysis. {specific_instruction} Respond with your analysis in JSON format. The JSON schema should include '{json_schema}'."

# Function to Generate System Prompts
def get_system_prompt(analysis_type: str) -> str:
    # Fetch the specific instruction and JSON schema for the given analysis type
    specific_instruction = ANALYSIS_TYPES.get(analysis_type, "Perform the analysis as per the specified type.")
    json_schema = JSON_SCHEMAS.get(analysis_type, {})

    # Format the JSON schema into a string representation
    json_schema_str = ', '.join([f"'{key}': {value}" for key, value in json_schema.items()])

    # Construct the system prompt with updated instruction
    return (f"You are a data analyst API capable of {analysis_type} analysis. "
            f"{specific_instruction} Please respond with your analysis directly in JSON format "
            f"(without using Markdown code blocks or any other formatting). "
            f"The JSON schema should include: {{{json_schema_str}}}.")

In this script, ANALYSIS_TYPES holds descriptions for various analyses, JSON_SCHEMAS contains the structure for JSON responses, and get_system_prompt generates prompts for the AI model.

Analysis Types and JSON Schemas Samples

To illustrate how the JSON mode in the OpenAI API works, let’s delve into the ANALYSIS_TYPES and JSON_SCHEMAS, and examine the get_system_prompt function in detail.

1. ANALYSIS_TYPES Samples

ANALYSIS_TYPES is a dictionary mapping types of analysis to their descriptions. Here are a couple of examples:

Sentiment Analysis:Description: “Analyze the sentiment of the provided text. Determine whether the sentiment is positive, negative, or neutral and provide a confidence score.”
Text Summarization:Description: “Summarize the provided text into a concise version, capturing the key points and main ideas.”

Kev C. 8 个月前

Revolutionizing Data Queries: From Text to SQL with…

Santhoshkumar Mariappan 2 个月前

A Dive into HTMX, HyperScript, and AI Fusion

Carl G. 1 年前

2. JSON_SCHEMAS Samples

JSON_SCHEMAS outlines the expected JSON structure for each analysis type. Here are two examples corresponding to the above types:

Sentiment Analysis Schema:{ "sentiment": "string (positive, negative, neutral)", "confidence_score": "number (0-1)", "text_snippets": "array of strings (specific text portions contributing to sentiment)" }
Text Summarization Schema:{ "summary": "string", "key_points": "array of strings (main points summarized)", "length": "number (number of words in summary)" }

3. Function: get_system_prompt

The get_system_prompt function dynamically generates prompts based on the specified analysis type. It works as follows:

Fetching Instructions and Schema:

Retrieves specific instructions and the JSON schema for the given analysis type from ANALYSIS_TYPES and JSON_SCHEMAS.

Formatting JSON Schema:

Formats the retrieved JSON schema into a string representation.

Constructing the Prompt:

Constructs a system prompt that includes the analysis type, specific instruction, and a request for a JSON-formatted response. It also specifies the structure the JSON should follow based on the json_schema_str.

JSON Mode: Assembling Code and Ensuring Consistency

In JSON mode, responses from the OpenAI model are structured as valid JSON objects. This mode ensures consistency in the following ways:

Structured Responses: Responses are in a consistent, parseable format, which is crucial for applications that process the model’s output programmatically.
Schema Adherence: By specifying the JSON schema in the prompt, the model’s responses adhere to a predefined structure, making it easier to integrate and use the data.
Clear Instructions: The prompts explicitly instruct the model to produce JSON, reducing the likelihood of receiving unstructured or irrelevant data.

This methodical approach ensures that the model’s output is not only consistent but also tailored to specific analytical needs, making it highly effective for diverse applications ranging from sentiment analysis to text summarization.

The JSON mode in OpenAI’s API offers structured and consistent output formats, beneficial for various applications, especially those requiring precise data handling and analysis. With the seed feature in beta, users can experiment with deterministic outputs, aiding in consistent application behavior.

See it in action:

Fungibility

10,091 位关注者

Elliott A.

Senior System Reliability Engineer / Platform Engineer

10 个月

Excellent

Paul S.

AI/ML Engineer | Advancing Generative AI

10 个月

Cool, I'm sure it can be useful. I'm curious to see how this performs in comparison to other models. Is there a bias (for you specific example, sentiment analysis? Accuracy, etc.

查看更多评论

要查看或添加评论，请登录

Cohen Reuven的更多文章

Transforming Ideas into Reality: How AI Fuels My Productivity & Creativity

2024年10月12日

Transforming Ideas into Reality: How AI Fuels My Productivity & Creativity

Over the past year, I've embarked on a remarkable journey of creativity and productivity that even I find astounding…

7 条评论
Introduction to Programming with Codebots: A Detailed Tutorial

2024年10月11日

Introduction to Programming with Codebots: A Detailed Tutorial

Imagine being able to build complex applications fast, just by providing a detailed specification and letting a coding…

12 条评论
Introducing AgenticsJS - A full featured agentic style UI framework

2024年9月16日

Introducing AgenticsJS - A full featured agentic style UI framework

AgenticsJS is a powerful and flexible JavaScript library designed to provide an intelligent and interactive search…

12 条评论
Agentic Programming with OpenAi o1 Model: A 10-Step Recursive and Reflective Problem-Solving Process

2024年9月13日

Agentic Programming with OpenAi o1 Model: A 10-Step Recursive and Reflective Problem-Solving Process

This tutorial will guide you through the steps to customize the recursive and reflective prompt template for different…

4 条评论
Tutorial: Build Any App in Minutes with GPTEngineer, no coding required

2024年9月2日

Tutorial: Build Any App in Minutes with GPTEngineer, no coding required

The future of engineering is being redefined by AI-powered tools like GPT Engineer, which have brought the ability to…

25 条评论
Tutorial: The Hidden Power of System Prompts: Unlocking Purpose in Prompt Engineering

2024年8月21日

Tutorial: The Hidden Power of System Prompts: Unlocking Purpose in Prompt Engineering

When we talk about prompt engineering, we typically focus around structure, reasoning, and logic. But what’s often…

4 条评论
Tutorial: Run Aider Code Bot Free using Google Colab with Embedded UI

2024年8月20日

Tutorial: Run Aider Code Bot Free using Google Colab with Embedded UI

Ever wonder if you could program without needing to learn how to code? With Aider, you can make that dream a reality…

6 条评论
Choosing the Ideal Language Model: Frontier vs. Smaller, Older, Faster

2024年8月20日

Choosing the Ideal Language Model: Frontier vs. Smaller, Older, Faster

As AI models continue to evolve, there's an ongoing debate between using frontier models and smaller, older, faster…

1 条评论
Introduction to Programming with Prompts

2024年8月16日

Introduction to Programming with Prompts

by @rUv, just because. Prompt programming represents a significant update in the way developers interact with…

6 条评论
Unlock Agentic Ai with Free "Introduction to Agentic Engineering" Course and Certificate

2024年8月14日

Unlock Agentic Ai with Free "Introduction to Agentic Engineering" Course and Certificate

Reimagining Intelligence, Autonomy, and Interaction As we stand at the edge of an AI-driven future, the need for…

4 条评论

See all articles

OpenAI API Guide: Using JSON Mode

Cohen Reuven

发明家“IaaS”，天使投资人，成长黑客，导师

This is an advanced how-to focusing and how I built the GuardRail system using the new JSON mode for OpenAI API

A Few Practical Uses for JSON Mode:

Enabling JSON Mode

Important Notes:

Seed Parameter (Beta Feature)

Sample Code for Data Analysis & Guidance (based on GuardRail)

Analysis Types and JSON Schemas Samples

1. ANALYSIS_TYPES Samples

领英推荐

2. JSON_SCHEMAS Samples

3. Function: get_system_prompt

JSON Mode: Assembling Code and Ensuring Consistency

See it in action:

Fungibility

10,091 位关注者

Cohen Reuven的更多文章

社区洞察

其他会员也浏览了

A Dive into HTMX, HyperScript, and AI Fusion

Using text analytics in Internal Audit

Employing Large Language Models for Text-to-SQL Tasks: A Comprehensive Survey

Aspect/sentiment-aware review summarization (Recent)

How a Semantic Search Platform Can Help Enterprises Today

Synthetic data generation reinvented: LLMs at the forefront of innovation

The Next Evolution in BI: RAG — Natural Language, SQL-LLM, and the End of Data as We Know It

Data Mesh and LLM – Is it the Future of the New Organization's Data Lake?

GPT-2 in Excel: Understanding Language Models Through Spreadsheets

How does AI web scraping adapt to changes in website structures without manual updates?

This is an advanced how-to focusing and how I built the GuardRail system using the new JSON mode for OpenAI API

A Few Practical Uses for JSON Mode:

Enabling JSON Mode

Important Notes:

Seed Parameter (Beta Feature)

Sample Code for Data Analysis & Guidance (based on GuardRail)

Analysis Types and JSON Schemas Samples

1. ANALYSIS_TYPES Samples

领英推荐

2. JSON_SCHEMAS Samples

3. Function: get_system_prompt

JSON Mode: Assembling Code and Ensuring Consistency

See it in action:

Fungibility

10,091 位关注者

Cohen Reuven的更多文章

Transforming Ideas into Reality: How AI Fuels My Productivity & Creativity

Introduction to Programming with Codebots: A Detailed Tutorial

Introducing AgenticsJS - A full featured agentic style UI framework

Agentic Programming with OpenAi o1 Model: A 10-Step Recursive and Reflective Problem-Solving Process

Tutorial: Build Any App in Minutes with GPTEngineer, no coding required

Tutorial: The Hidden Power of System Prompts: Unlocking Purpose in Prompt Engineering

Tutorial: Run Aider Code Bot Free using Google Colab with Embedded UI

Choosing the Ideal Language Model: Frontier vs. Smaller, Older, Faster

Introduction to Programming with Prompts

Unlock Agentic Ai with Free "Introduction to Agentic Engineering" Course and Certificate

社区洞察

其他会员也浏览了

A Dive into HTMX, HyperScript, and AI Fusion

Using text analytics in Internal Audit

Employing Large Language Models for Text-to-SQL Tasks: A Comprehensive Survey

Aspect/sentiment-aware review summarization (Recent)

How a Semantic Search Platform Can Help Enterprises Today

Synthetic data generation reinvented: LLMs at the forefront of innovation

The Next Evolution in BI: RAG — Natural Language, SQL-LLM, and the End of Data as We Know It

Data Mesh and LLM – Is it the Future of the New Organization's Data Lake?

GPT-2 in Excel: Understanding Language Models Through Spreadsheets

How does AI web scraping adapt to changes in website structures without manual updates?