Why we cannot allow invalid formations of patterns in semantic representations
In the Large Language Model era, few-shot learning is the new norm (see "Augmented Language Models: A Survey", https://arxiv.org/abs/2302.07842).
In a sense, we are seeing a paradigm shift from traditional classification-based learning to few-shot learning. In classification-based learning, one has to prepare a handful of examples under each class, and when only one class is of interest, one has to "fake" a negative class to turn the task into a multi-class classification problem. In the LLM era, by contrast, few-shot learning is convenient enough to have become the default way of learning.
In few-shot learning, the "target" is a semantically meaningful text. Unlike classification-based learning, where a class label carries no internal structure, few-shot learning allows the target to be natural-language text.
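To make this concrete, here is a minimal sketch of a few-shot prompt whose targets are free-form natural language. The task, the example messages, and the prompt wording are all made up for illustration:

```python
# A made-up few-shot prompt whose "targets" are natural-language texts
# rather than class labels. The task and examples are purely illustrative.
FEW_SHOT_PROMPT = """\
Extract the customer's request from each message.

Message: "Hi, my invoice from March is wrong, please resend it."
Request: The customer wants the March invoice corrected and resent.

Message: "Can I move my subscription to the annual plan?"
Request: The customer wants to switch to the annual plan.

Message: "{message}"
Request:"""


def build_prompt(message: str) -> str:
    """Fill a new input message into the few-shot template."""
    return FEW_SHOT_PROMPT.format(message=message)


print(build_prompt("Please cancel my order 1234."))
```

Note that each "target" line after "Request:" is an ordinary sentence, not a label drawn from a fixed set.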
Now, if we want to "normalize" this, by modeling each target text with a semantic structure and fine-tuning a model to predict that structure, we can encode organization-specific logic. We also save costs.
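As a sketch of what "normalizing" the target could look like, here is a hypothetical semantic structure for the same kind of request, serialized as the fine-tuning target. The field names and intent values are assumptions made for illustration, not part of any real schema:

```python
import json
from dataclasses import asdict, dataclass
from typing import Optional


@dataclass
class RequestFrame:
    """A hypothetical semantic structure standing in for the free-form target."""
    intent: str                      # e.g. "cancel_order", "switch_plan"
    object_type: str                 # e.g. "order", "subscription"
    object_id: Optional[str] = None


def to_training_example(message: str, frame: RequestFrame) -> dict:
    """Pair the input text with the serialized structure as the target."""
    return {"input": message, "target": json.dumps(asdict(frame))}


print(to_training_example(
    "Please cancel my order 1234.",
    RequestFrame(intent="cancel_order", object_type="order", object_id="1234"),
))
```

The fine-tuned model then predicts the serialized structure instead of free text, which is what makes the output machine-checkable in the first place.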
However, we must make sure that the semantic structure is sound, meaning that it does not allow an invalid pattern to occur.
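One way to approach this is to reject invalid patterns at construction time, so they can never be represented at all. The allowed combinations below are hypothetical; the point is only that the validity check lives inside the structure itself:

```python
from dataclasses import dataclass

# Hypothetical whitelist: which object types each intent may apply to.
ALLOWED_COMBINATIONS = {
    "cancel_order": {"order"},
    "switch_plan": {"subscription"},
}


@dataclass(frozen=True)
class ValidatedFrame:
    intent: str
    object_type: str

    def __post_init__(self):
        # Refuse to construct any (intent, object_type) pair outside the whitelist.
        if self.object_type not in ALLOWED_COMBINATIONS.get(self.intent, set()):
            raise ValueError(
                f"invalid pattern: intent={self.intent!r} "
                f"with object_type={self.object_type!r}"
            )


ValidatedFrame("cancel_order", "order")         # accepted
ValidatedFrame("cancel_order", "subscription")  # raises ValueError
```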
An invalid pattern, if allowed within a semantic framework, would be disastrous. To see why, remember that a semantic framework is by nature a logical framework, and the most important property of a logical framework is consistency: no contradictions.
If an invalid pattern is allowed due to a design flaw in the semantic framework, the first thing that happens is that the framework's universal rules can interact with that invalid pattern and derive "knowledge" about some valid pattern. That "knowledge" is fake, because the invalid pattern carries no real-world meaning. We may therefore produce fake knowledge that contradicts valid knowledge, and that is the end of the logical framework.
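To see the failure mode concretely, here is a toy sketch (the facts and the rule are entirely made up) of a universal rule interacting with an admitted invalid pattern and deriving fake knowledge that contradicts a valid fact:

```python
# Facts the framework holds, as (subject, relation, object) triples.
valid_facts = {("order_1234", "payment", "never_received")}

# An invalid pattern that a flawed schema failed to reject: a refund on an
# order that was never paid carries no real-world meaning.
invalid_fact = ("order_1234", "refunded", "true")

# A universal rule of the framework: whatever was refunded must have been paid.
def refunded_implies_paid(facts):
    derived = set()
    for subject, relation, obj in facts:
        if relation == "refunded" and obj == "true":
            derived.add((subject, "payment", "received"))
    return derived

all_facts = valid_facts | {invalid_fact}
derived = refunded_implies_paid(all_facts)

# The derived "knowledge" says payment was received, while a valid fact says
# it was never received: the framework now contains a contradiction.
print("derived:", derived)
print("contradiction:",
      ("order_1234", "payment", "received") in derived
      and ("order_1234", "payment", "never_received") in valid_facts)
```

The rule itself is perfectly reasonable; the damage comes entirely from letting the meaningless pattern into the framework in the first place.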
Takeaway