Design Patterns for ChatGPT Governance System
"LEGO Duplo Minecraft" by tdm911 is licensed under CC BY-SA 2.0.

Design Patterns for ChatGPT Governance System

Abstract

In this article, we introduce four foundational building blocks for semantically modeling a natural language domain for ChatGPT governance: Hierarchy, Homonym, Antonym, and Disambiguation. These building blocks together represent one step progress towards a framework for generating acceptable language patterns at scale for ChatGPT to operate safely with minimal human specification effort.

Background

Since ChatGPT can make assertions that violates domain knowledge, it's crucial that we create a safety mechanism to regulate its usage on a topic-by-topic basis. However, the scope of possible topics is vast, which makes it impractical to provide training examples for each conceivable subject. Therefore, our only viable option is to convert the unrefined text into a language that is formally defined, allowing us to perform a semantic analysis to determine if it conforms to the necessary semantic rules for a valid topic. If it satisfies the requirements, we can then pinpoint the relevant business rules that are relevant.

In a previous study, we established that utilizing ChatGPT's summarization capabilities makes it effortless to transform natural language text into a topic format. We subsequently demonstrated that the transformed topic can be validated (or invalidated) utilizing a Proof Assistant. This validation is based on predefined semantic rules that consider the "adjective" and "noun" properties, as well as some deep concept relationships. The methodology is detailed in the following link: (https://www.dhirubhai.net/pulse/how-organizations-establish-chatgpt-safe-zone-pingping-xiu/). Moreover, we introduced a technique for gradually integrating new rules into the existing rule system that governs ChatGPT. This method can be found in the following article: (https://www.dhirubhai.net/pulse/incremental-formalization-strategy-chatgpt-governance-pingping-xiu/).

This article aims to showcase the practicality of using design patterns to populate and structure semantic rule sets for a specific domain. The four design patterns we will be examining are Hierarchy, Homonym, Antonym, and Disambiguation. By leveraging these patterns, we will gain the ability to model the domain's semantics concerning ChatGPT governance.

Design Patterns

Why we need Design Patterns

Throughout history, humans have recognized the existence of knowledge structures, with Aristotle developing "Category Theory" to group concepts into hierarchies. In our daily lives, we organize our knowledge, such as in libraries, using various useful patterns. In the business world, Salesforce.com's CRM defines "Data Categories" for organizations to create their own category taxonomy.

Regulating ChatGPT poses a challenge of integrating traditional knowledge systems with frontiers of Generative AI. A framework is necessary to translate established structures into a formal format that can govern ChatGPT effectively.

The design pattern-based framework must be tailored to each domain and can be built up using foundational building blocks. This approach is an incremental method towards achieving an ideal framework.

And in this post, we will present four such "foundational building blocks." Namely, Hierarchy, Homonym, Antonym, and Disambiguation.

Hierarchy

Hierarchy is a fundamental characteristic for depicting any significant domain of interest. For instance, if we intend to create a list of brief phrases for ChatGPT to monitor, with some of them requiring special attention, the list will likely take the form of a hierarchy tree as we keep adding entries.

To illustrate the application of the chosen framework, Modern Type Theory, in representing a tree hierarchy and identifying valid topics, we will use a toy example with two levels of ranks and a few entries. This example will demonstrate how the framework can be leveraged for tree hierarchy representation and applied to non-trivial tasks.

We will conduct the exercise interactively to gain practical experience or to build confidence in the methodology. The example we chose is resembling the ones we used in that post. Again, you can go to?Coq Online IDE?for verifying the code.?

Definition CN:=Set.

Inductive entity: CN:=
| toy
| tool
with entityCat:=
| entity1(c: entity)
.
Definition entityCat_entity_manifest: entityCat->entity. intro. destruct H. assumption. Defined. Coercion entityCat_entity_manifest: entityCat>->entity.

Inductive Brand: CN:=
| disney
| lego
.
Definition Brand_entityCat_subtyping: Brand->entityCat. intro. apply entity1. apply toy. Defined. Coercion Brand_entityCat_subtyping: Brand>->entityCat.

Parameter playful: entity->Prop.
Check playful toy.
Check playful lego.
Parameter popular: Brand->Prop.
Check popular lego.
(** "Check popular toy." would fail *)        

In the above code, we depict a toy hierarchy tree as below:

No alt text provided for this image

Some notable comments on the above snippets:

  • A parent category concept such as Entity or "Brand" is a Inductive type.
  • In an inductive type, the membership relationship between a specific instance and its category is determined through the use of constructors, such as "tool."
  • The subtleties lie in the transition between levels, as the inductive definitions do not explicitly connect entities like Lego and Disney (or their parent category "Brand") to the higher-level category "Toy".
  • To establish the relationship between "Brand" and "Toy", the constant function Brand_entityCat_subtyping is used, which is an arrow type (entailment) from "Brand" to "entityCat" (category of entity) and requires the insertion of an entity during construction to connect "Brand" to a specific member of the Entity category.
  • By using the constant function entityCat_entity_manifest as a coercive conversion from "entityCat" to "entity", the complete hierarchy chain between a "Brand" and an "entity" is established, allowing any properties applied to "entity" to be automatically applied to the corresponding "Brand".
  • The adjectives "playful" and "popular" that are defined later serve as good examples of this concept. In general, the example highlights how the framework enables reasoning through hierarchies.

Based on the example provided, it is easy to see how multiple levels of hierarchical concepts can be interconnected, with each level having more members, making the structure a flexible and adaptable means of handling complex practical domains.

Homonym

Category Theory proposes an eligible method for constructing circular dependencies between concepts, wherein concepts A, B, and C are interconnected by A->B, B->C, and C->A arrow types, which can be useful for representing phenomena like Homonym in a language.

Circular sub-typing relationships are not typically expected to form under the traditional intuition of hierarchies, and as most software type systems follow this intuition, they fail to address important language phenomena; however, Modern Type Theory with Coercive Subtyping features can support these patterns. ( Coercive Subtyping was first proposed by Zhao et al in the paper Coercive Subtyping.)

Inductive Tiger:=
| tiger.

Inductive Panthera_tigris:=
| pathera_tigris.

Axiom tiger_Panthera_tigris: Tiger->Panthera_tigris. Coercion tiger_Panthera_tigris: Tiger>->Panthera_tigris.?
Axiom Panthera_tigris_tiger: Panthera_tigris->Tiger. Coercion Panthera_tigris_tiger: Panthera_tigris>->Tiger.?

Parameter fearful: Tiger->t.

Check (fearful tiger).
Check (fearful pathera_tigris).        

The loop formed by the two arrow types tiger_Panthera_tigris and Panthera_tigris_tiger enables a single definition for the adjective "fearful" to be applied to all related concepts within the loop.

The homonym pattern establishes a meaning equivalence among a group of concepts, wherein a property defined for one member applies to all members within the loop.

It should be noted that the coordination between the Hierarchy pattern and the Homonym pattern is not yet explored in this post, leaving potential gaps that may need to be addressed in future work.

Antonym

If Homonym is the pattern that helps us achieve rule efficiency, then Antonym is the to-go choice for ensuring quality control.

Antonyms are essential in defining data properties that ensure the integrity of ChatGPT applications. This can be seen by looking at the example of male and female mayors in California. As an illustration, consider the screenshot I extracted today, where we can see that London Breed (among others) appears in both the list of male and female mayors in California.

No alt text provided for this image

It is important to note that by utilizing prompt strategies in a chain-of-thought style, ChatGPT's behavior can demonstrate improvement. The pattern we have presented can be useful for building the trigger for such improvement.

We need to document pairs of concepts that are mutually exclusive and need extra attention in prompt strategies, especially on the gender bias or social ethical domain. This is where having an Antonym pattern in our language rule system can be helpful in reducing errors and improving ChatGPT's behavior.

The essential idea under the Antonym is the "Copredication" pattern introduced by Prof. Zhaohui Luo in Formal Semantics of Modern Type Theories, which introduced a useful pattern for the dot-type concept in Dependent Type theory, which allows for the representation of overloaded semantic meanings with the same language context.

Let's using one example to illustrate under our context:

Definition CN:=Set.
Parameter Men: CN.
Parameter Women: CN.
Record AntonymMenWomen: CN:=
? mkAntonymMenWomen
? ? {m:> Men; w:> Women}.

Parameter is_male_mayor: Men->Prop.
Parameter is_female_mayor: Women->Prop.

Parameter Pingping: CN.
Axiom pingping_gender: Pingping->AntonymMenWomen. Coercion pingping_gender: Pingping>->AntonymMenWomen.
Parameter me: Pingping.

Goal (is_male_mayor me) /\ (is_female_mayor me)->False.        

So the main interest point is the "Record AntonymMenWomen", which assumes two opposite semantic concept men and women (not mentioning other categories in this example). And to prevent insulting individuals, I decided to put my name instead of real mayor names.

The AntonymMenWomen type possesses the ability to combine both male and female related properties, making it a useful tool for representing contradicting semantic meanings within a single language context. Without such a pattern, we wouldn't be able to express the following goal: "(is_male_mayor me) /\ (is_female_mayor me)->False," because the type system would not allow such an expression to be compiled in the first place.

To gain a better understanding of the significance of this pattern, let's consider a hypothetical scenario. If we assume that each opposite concept in an antonym pattern has 10 adjectives that can modify it, we can use this pattern to create at least 100 prompts that requires ChatGPT to detect conflicting situations and respond appropriately, which would push ChatGPT to its limits.

Furthermore, in any pattern that includes only one member of an antonym pair, we can search for its semantic opposite to verify the accuracy of existing answers and pinpoint areas where additional reasoning or discussion may be required.

Disambiguation

While we have proposed a feasible solution for gradually formalizing a specific language domain, it's important to keep in mind that most applications involve multiple domains. Therefore, the most effective approach for achieving scalability and rapidly formalizing domain languages is the divide-and-conquer strategy. This entails each sub-domain focusing solely on its own domain without concerning themselves with other domains. Once each domain is completed, the Disambiguation pattern can be utilized to address any conflicting meanings associated with the same term.

Through the use of semantic annotations, as demonstrated in the example below, we can illustrate how a single term can possess different semantic meanings in varying contexts.

Definition CN:=Set.
Parameter Way: CN.
Parameter way: Way.
Inductive Oneman: Set:=man.
Definition man_N:= CN.
Definition man_ADJ:= Way->Prop.?
Parameter man_N': man_N. Parameter man_ADJ': man_ADJ.
Definition man_N_r (r: Oneman): man_N:= man_N'. Coercion man_N_r: Oneman >-> man_N.
Definition man_ADJ_r (r: Oneman): man_ADJ:= man_ADJ'. Coercion man_ADJ_r: Oneman >-> man_ADJ.

Check { man:man_ADJ & man way}.?
Check { man:man_N & man }.        

As demonstrated in the aforementioned examples, the term "man" can function as different parts of speech and appear as either "man way" or simply "man" as a valid component. To enable this functionality, the premises must be enclosed in a specialized specification format, which we won't delve into here. However, our main point is that any additional semantic annotations for a term should be lightweight and easily discernible. For instance, the following check should succeed since the type checker can accurately deduce the semantic sense of "man" based on the context.

Check { man:_ & man way}.?        

Summary

In this article, we have presented four crucial design patterns that can be utilized to create a practical and scalable semantic rule set for ChatGPT's governance. It's worth noting that these four patterns are closely intertwined. The hierarchy pattern addresses the essential nature of a vast domain taxonomy, while the homonym pattern deals with the different variations that naturally arise from language use. Quality is maintained through the use of the antonym pattern, and the Disambiguation pattern is employed to resolve the term multi-meaning issue arising from domain overlapping.

We have also provided Coq code examples that utilize these patterns, which have been verified. In practical applications, we recommend referring to the article below for a complete workflow of the following processes: prompting, translating, type checking, and proof checking.

There are several open problems that remain:

  1. How can we obtain a comprehensive preview of an entire rule set, given the diverse range of patterns? Additionally, how can we organize the rule set in a scalable fashion?
  2. We must expand our language structures beyond simple "Adjective Noun" to include more diverse patterns, such as quantifiers (some, all, none), event semantics (to describe tenses), and modal logic (e.g., "I believe...").
  3. We need to train ChatGPT to translate natural language into formal semantics. As we've seen in a previous post, this isn't a difficult task. However, as we expand to more diverse languages, we'll require more prompt engineering to accomplish this.

Alexandru Armasu

Founder & CEO, Group 8 Security Solutions Inc. DBA Machine Learning Intelligence

8 个月

Appreciate your contribution!

要查看或添加评论,请登录

社区洞察

其他会员也浏览了