Pingping's Productive Week: Improving ChatGPT Trust and Semantic Governance
Image produced by Scriblediffusion.com from the prompt "beautiful plants surrounded by a fence"

Question: Can ChatGPT's answers be trusted?

Pingping's Answer:

Because ChatGPT's internal inference mechanism lacks transparency, it is difficult to assess its trustworthiness directly. Nevertheless, we can still evaluate the reliability of its responses in the context of how the answers will be used. Since ChatGPT's answers are expressed in human-readable language, we can describe their intended usage scenario in a formal language that defines the situation precisely, and then categorize their performance in that scenario.

In essence, if we are able to articulate the precise context of the answers' usage in a formal language, we can assess their trustworthiness based on the criteria that are relevant to the defined context.
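To make the idea concrete, here is a minimal sketch of a usage context written down as explicit, checkable criteria. Everything in it is an illustrative assumption: `UsageContext`, the criterion names, and the sample answer are hypothetical and are not part of any ChatGPT API or of the author's framework.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict


# Hypothetical names throughout: UsageContext and the criteria below are
# illustrative assumptions, not part of any ChatGPT API.
@dataclass
class UsageContext:
    name: str
    # Each criterion maps a label to a predicate over the answer text.
    criteria: Dict[str, Callable[[str], bool]] = field(default_factory=dict)

    def assess(self, answer: str) -> Dict[str, bool]:
        """Return a pass/fail result for every criterion in this context."""
        return {label: check(answer) for label, check in self.criteria.items()}


# Example context: the answer will be displayed as a short list of names.
name_list_context = UsageContext(
    name="short list of names",
    criteria={
        "non_empty": lambda a: bool(a.strip()),
        "at_most_five_lines": lambda a: len(a.strip().splitlines()) <= 5,
    },
)

# Placeholder answer text, not real model output.
report = name_list_context.assess("Person A\nPerson B")
print(report)
```

The point of the sketch is only that trust is judged per context: the same answer could pass the criteria of one usage scenario and fail those of another.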

Question: Why is monitoring ChatGPT performance hard?

Pingping's Answer:

When we refer to "monitoring," we are implicitly referring to certain events or occurrences of interest that we wish to track. In conventional system monitoring, these events are usually limited in number, enabling us to apply detection logic to each one individually. Similarly, in the context of search, we can consider each keyword as an "event." Although keywords are unlimited in number, big data analytics can be employed to monitor and evaluate the performance of frequent keywords.
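As a rough illustration of keyword-level monitoring in search, the sketch below computes a success rate for each frequently seen keyword. The log format, the "clicked" signal, and the `min_count` threshold are assumptions made for this example, and the data is invented.

```python
from collections import Counter, defaultdict

# Hypothetical search log: (keyword, whether the user clicked a result).
search_log = [
    ("weather", True),
    ("weather", True),
    ("weather", False),
    ("pizza near me", True),
    ("pizza near me", False),
    ("rare query", False),
]


def frequent_keyword_success(log, min_count=2):
    """Success rate per keyword, restricted to keywords seen at least min_count times."""
    counts = Counter(keyword for keyword, _ in log)
    successes = defaultdict(int)
    for keyword, clicked in log:
        if clicked:
            successes[keyword] += 1
    return {
        keyword: successes[keyword] / total
        for keyword, total in counts.items()
        if total >= min_count
    }


rates = frequent_keyword_success(search_log)
print(rates)
```

This works because identical keywords can be grouped by exact string match, so rare keywords can simply be dropped from the report.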

In the age of large language models, particularly in the ChatGPT context, events can be thought of as the distinct meanings conveyed by natural language prompts. However, because these prompts are lengthy and can express countless possible meanings, there is no efficient representation available for grouping these events. Consequently, the conventional approach to system monitoring is no longer viable in this scenario.

Question: Why is improving ChatGPT trust possible?

Pingping's Answer:

Unlike its predecessors in deep learning technologies, ChatGPT exhibits an emerging capability to provide meaningful question answering across diverse domains. This capability enables something crucial for building trust: the ability to gather incremental evidence.

As an illustration, a single question-answering session with ChatGPT may not be sufficient to establish its trustworthiness. Nevertheless, we can ask follow-up questions based on prior knowledge, such as the fact that "man" and "woman" refer to non-overlapping individuals. We can then partially validate the answer to the question "What are California male mayors?" by also asking "What are California female mayors?" and verifying that the results complement each other, that is, that no name appears in both answers.
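The complement check can be sketched in a few lines. This is a minimal sketch under stated assumptions: the function name `check_complementary` is hypothetical, the names are placeholders rather than real model output, and the optional third list (an answer to "all California mayors") is an extra query added here for illustration.

```python
def check_complementary(part_a, part_b, whole=None):
    """Check that two answer sets are disjoint and, if a reference set is
    given, that together they match it exactly."""
    a, b = set(part_a), set(part_b)
    report = {"overlap": sorted(a & b)}
    report["disjoint"] = not report["overlap"]
    if whole is not None:
        w = set(whole)
        report["missing"] = sorted(w - (a | b))
        report["unexpected"] = sorted((a | b) - w)
        report["covers_whole"] = not report["missing"] and not report["unexpected"]
    return report


# Placeholder answers standing in for three separate ChatGPT responses.
male = ["Mayor A", "Mayor B"]
female = ["Mayor B", "Mayor C"]
everyone = ["Mayor A", "Mayor B", "Mayor C", "Mayor D"]

result = check_complementary(male, female, everyone)
print(result)
```

A passing check does not prove the answers correct, since both could be wrong in a consistent way. A failing check, however, is concrete evidence that at least one answer should not be trusted.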


Thank you for being patient thus far. Above are the highlights of what I've been thinking about over the past week.

It was a productive week. I achieved three separate things, shared in three separate posts.

How ChatGPT answer verification works, on a toy example

Instead of examining the internal scores of deep neural networks to gain insights into the confidence of answers, our work assumes that ChatGPT performs more reliably on translation tasks than on tasks such as writing technical solutions. Therefore, our attention is directed towards how a reasoning framework can place quality checks on various aspects of translation, including type and proof checking, to ensure a high-quality output. Ultimately, the business logic should determine the final evaluation of the answer's quality.
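As one possible reading of "type checking a translation", the sketch below assumes ChatGPT has translated a natural language question into a small structured query, and checks that query against a schema before it is used. The schema, the field names, and the function `type_check` are hypothetical and are not taken from the referenced post. Proof checking is not covered here.

```python
# Hypothetical schema: field name -> (expected Python type, allowed values or None).
QUERY_SCHEMA = {
    "entity": (str, {"mayor"}),
    "state": (str, {"California"}),
    "gender": (str, {"male", "female"}),
}


def type_check(query, schema=QUERY_SCHEMA):
    """Return a list of readable errors; an empty list means the query is well-typed."""
    errors = []
    for name, (expected_type, allowed) in schema.items():
        if name not in query:
            errors.append(f"missing field: {name}")
            continue
        value = query[name]
        if not isinstance(value, expected_type):
            errors.append(
                f"{name}: expected {expected_type.__name__}, got {type(value).__name__}"
            )
        elif allowed is not None and value not in allowed:
            errors.append(f"{name}: value {value!r} not allowed")
    # Fields the schema does not know about are also reported.
    for name in query:
        if name not in schema:
            errors.append(f"unknown field: {name}")
    return errors


# Stand-ins for model-produced translations of "What are California male mayors?"
good = {"entity": "mayor", "state": "California", "gender": "male"}
bad = {"entity": "mayor", "state": "California", "gender": 1, "party": "x"}

good_errors = type_check(good)
bad_errors = type_check(bad)
print(good_errors)
print(bad_errors)
```

Whether a well-typed query is acceptable for a given purpose is still a business decision; the check only rules out translations that are malformed.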

How to represent a practical knowledge domain using semantics

The key to effective governance lies in the ability to define its logic precisely. In the question-and-answer domain, practical knowledge is typically organized hierarchically, and small variations such as homonyms can complicate the logic if not handled properly. To delve deeper into these practical considerations, please refer to the following post.

How to build semantic governance rules incrementally

Builders always work incrementally, which means they are constantly working at the interface between old and new. It is crucial to be able to integrate newly formalized language semantics into existing structures in a feasible manner.


Dear readers, please subscribe to my Substack.
