How does ChatGPT manage contextual understanding?
In natural language processing (NLP), moving beyond individual words to grasp broader context is a significant challenge. It shows up in translation, where the grammatical connections between words must be worked out, and in narrative writing, where a character's development has to be maintained across an entire story. The meaning of a word can also shift with context, as with "run" in the phrase "Can I run something past you real quick?"
This interplay between words is exactly what ChatGPT has to model, and the core technology underpinning it is the transformer architecture. The first step is to split text into tokens (byte-pairs, which are fragments of words) and turn them into numbers a computer can process, removing the ambiguity inherent in raw characters. Assigning each token a unique integer ID works like a numerical dictionary for the language; each ID is then mapped to a vector of numbers called an "embedding," which is what the model actually works with.
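A minimal sketch of that two-step process is below. The tiny vocabulary, the random embedding matrix, and the eight-dimensional vectors are illustrative stand-ins, not ChatGPT's actual tokenizer or weights; in a real model the embedding matrix is learned during training.

```python
import numpy as np

# Toy "numerical dictionary": each token gets an integer ID.
vocab = {"can": 0, " i": 1, " run": 2, " something": 3, " past": 4, " you": 5}
embedding_dim = 8  # real models use hundreds or thousands of dimensions

# Stand-in embedding matrix; in practice these weights are learned.
rng = np.random.default_rng(0)
embedding_matrix = rng.normal(size=(len(vocab), embedding_dim))

tokens = ["can", " i", " run", " something", " past", " you"]
token_ids = [vocab[t] for t in tokens]        # dictionary lookup: token -> ID
token_vectors = embedding_matrix[token_ids]   # embedding: ID -> vector of numbers
print(token_vectors.shape)                    # (6, 8): one vector per token
```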
However, the challenge lies in the sheer number of ways words can relate to one another within a given text. The possible combinations grow exponentially with sequence length: with a vocabulary of 20,000 words, a sentence of just 10 words already allows 20,000 ^ 10 possible combinations, on the order of 10^43.
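A quick sanity check of that figure (the vocabulary size and sentence length are the illustrative numbers from above, not ChatGPT's actual values):

```python
# Number of distinct 10-token sequences over a 20,000-token vocabulary.
vocab_size = 20_000
sequence_length = 10

possible_sequences = vocab_size ** sequence_length
print(f"{possible_sequences:.3e}")  # ~1.024e+43 possible sequences
```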
To tackle this complexity, transformers employ positional encoding: each word is given a distinctive "address" within the sequence. This is achieved by adding a position-dependent value to the embedding of each word, encoding where that word sits in the sequence relative to the others.
Transformers, including ChatGPT, use a blend of sine and cosine functions to compute the positional encoding. The technique is reminiscent of the Fourier Transform, a mathematical operation at work in everyday devices such as smartphones. Just as the Fourier Transform represents a signal as a combination of frequencies, positional encoding represents each position as a combination of sine and cosine waves at different frequencies, giving every position in the sequence its own unique signature.
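Here is a minimal sketch of the sinusoidal scheme from the original transformer paper ("Attention Is All You Need"); the six-token sequence and eight dimensions are illustrative, and the zero-valued token vectors are a placeholder for real embeddings. OpenAI has not disclosed exactly how ChatGPT encodes positions, so this shows the general technique rather than the production implementation.

```python
import numpy as np

def sinusoidal_positional_encoding(seq_len: int, d_model: int) -> np.ndarray:
    """PE[pos, 2i] = sin(pos / 10000^(2i/d_model)), PE[pos, 2i+1] = cos(...)."""
    positions = np.arange(seq_len)[:, None]                  # (seq_len, 1)
    dims = np.arange(0, d_model, 2)[None, :]                 # (1, d_model/2)
    angles = positions / np.power(10000.0, dims / d_model)   # one frequency per pair of dims

    pe = np.zeros((seq_len, d_model))
    pe[:, 0::2] = np.sin(angles)   # even dimensions use sine
    pe[:, 1::2] = np.cos(angles)   # odd dimensions use cosine
    return pe

seq_len, d_model = 6, 8
token_vectors = np.zeros((seq_len, d_model))   # stand-in for real token embeddings
# The position signal is simply added to the embeddings,
# giving every token a unique "address" in the sequence.
inputs = token_vectors + sinusoidal_positional_encoding(seq_len, d_model)
print(inputs.shape)  # (6, 8)
```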
So, why does ChatGPT encounter challenges when composing a novel or a screenplay? GPT-2, an earlier model in the GPT family, used a context size of 1,024 tokens (tokens are byte-pairs, so this corresponds to a few hundred words). The specific parameters of ChatGPT remain undisclosed, but we can estimate its workable context length. ChatGPT likely incorporates multiple attention heads (the components responsible for capturing relationships between tokens), although the exact count may differ from GPT-2's. A reasonable approximation for its maximum context is around 10,000 words, notably less than the 50,000 to 100,000 words typical of a novel. While OpenAI initially indicated a maximum input length of about 3,000 words, ChatGPT appears to exceed this limit in practice (source: https://ai.stackexchange.com/questions/38150/how-does-chatgpt-retain-the-context-of-previous-questions), which suggests that a context length of around 10,000 words is within its capabilities.
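The practical consequence of a fixed context window can be illustrated with OpenAI's open-source tiktoken tokenizer. The sketch below uses GPT-2's byte-pair encoding and its published 1,024-token limit; ChatGPT's real window is larger and not public, and the repeated sentence is just a stand-in for a long manuscript.

```python
import tiktoken

enc = tiktoken.get_encoding("gpt2")
context_size = 1024  # GPT-2's documented limit; ChatGPT's actual window is larger

manuscript = "Once upon a time, the hero set out again. " * 2000  # a long document
tokens = enc.encode(manuscript)
print(len(tokens))                    # far more than 1,024 tokens

visible = tokens[-context_size:]      # only the most recent tokens fit in the window
print(enc.decode(visible)[:60])       # the model "sees" only this tail of the text
```

Anything that falls outside the window, such as a character introduced 40,000 words earlier, is simply not visible to the model when it generates the next token, which is why very long narratives remain difficult.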