GPT3: technological breakthrough

Alexander Polonsky

VP R&D, co-founder at Bloom

Published: January 30, 2023

Every once in a while, a major technological advance brings to mind Arthur C. Clarke’s observation: “Any sufficiently advanced technology is indistinguishable from magic.” OpenAI’s GPT3 demonstrated this just recently. The WOW effect generated a lot of discussion, but it seems to focus mostly on the HOW: how it works, how to use it, how it will affect the future. I think it's equally important to understand the WHAT: what technological breakthrough underlies its impressive performance?

In fact, it’s a combination of achievements that makes it so remarkable.

Fundamentally, GPT3 is a text matching engine: it matches the input text with an output text. A search engine, such as Google, is also a matching engine. However, unlike the older generation of matching engines, GPT3 does not match pre-existing pieces of content - it creates the best match instead.

At the core of every matching engine is a mathematical matching function that defines how the matches are computed. In the case of GPT3, it’s a behemoth of a function containing 175 billion variables (its parameters). The values of these variables were computed from billions of sentences written by humans (the training data) and are then used to generate new text. Effectively, GPT3 extrapolates patterns learned from human-generated text to produce new content. NB: GPT3 will generate new text every time, even when a better text already exists in its training data.
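The difference between matching pre-existing content and creating a match can be sketched with a toy example. This is only an illustration under loud assumptions: the three-sentence corpus is invented, the function names are hypothetical, and a word-pair (bigram) count table stands in for GPT3's neural network, which works very differently and at a vastly larger scale.

```python
from collections import Counter, defaultdict

# Invented stand-in for the training data (not from the article).
CORPUS = [
    "the cat sat on the mat",
    "the dog sat on the rug",
    "the cat chased the dog",
]

def retrieve(query, corpus):
    """Older-style matching: return the stored sentence that shares
    the most distinct words with the query."""
    query_words = set(query.split())
    return max(corpus, key=lambda s: len(query_words & set(s.split())))

def fit_bigrams(corpus):
    """Fit the toy 'matching function': count which word follows which.
    These counts play the role of the model's variables."""
    counts = defaultdict(Counter)
    for sentence in corpus:
        words = sentence.split()
        for prev, nxt in zip(words, words[1:]):
            counts[prev][nxt] += 1
    return counts

def generate(prompt, counts, max_new_words=2):
    """Generative matching: extend the prompt one word at a time with the
    most frequent follower of the last word (ties broken alphabetically)."""
    words = prompt.split()
    if not words:
        return ""
    for _ in range(max_new_words):
        followers = counts.get(words[-1])
        if not followers:
            break  # the last word was never followed by anything in the corpus
        words.append(min(followers, key=lambda w: (-followers[w], w)))
    return " ".join(words)

counts = fit_bigrams(CORPUS)
print(retrieve("the dog chased", CORPUS))  # an existing sentence from the corpus
print(generate("the dog chased", counts))  # a sentence assembled from learned patterns
```

For the query "the dog chased", the retrieval function can only hand back a sentence that is already stored, while the generative function assembles "the dog chased the cat", which appears nowhere in the corpus. The same mechanism also shows the NB above in miniature: the generator produces new text even when a perfectly good stored sentence exists.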

In other words, GPT3 has captured the collective wisdom of billions of human-produced pieces of content and encoded this wisdom in a highly complex mathematical function. This collective wisdom can be broken down into three main components:

1. Linguistic expertise

GPT3 writes perfectly, from a syntactic point of view, in 46 natural languages. The key achievement here is that it does so without any constraints, rather than only within narrow contexts such as spell check or translation (which it can also do).

2. Knowledge

GPT3 has also captured the knowledge contained in the training data. The capture is not perfect this time – it makes many mistakes. Nevertheless, the accuracy level overall is very high and in some domains the knowledge capture is near perfect. This is a real breakthrough for information retrieval and shows a way to improve search, both on the Web and within organizations. This improvement can come in two ways: through direct question answering or by helping to extract structured data from unstructured content.

3. Human thinking & behavior

Human-produced content necessarily also reflects human thinking & behavior, and such patterns did not escape GPT3. Here the accuracy is not so high, but still, GPT3 is quite polite, can express emotion on demand, and is able to carry out many logical and mathematical operations. This means that machines can now learn to mimic human behavior as well as extract and reuse logic from situational descriptions in natural language.

It's important to understand that GPT3 is a phenomenological model – it directly models the output of a complex system (human writing) without modeling any intermediate steps. This means it has no explicit notion of language structure, knowledge, logic or behavior – all it does is detect and reproduce patterns. It’s truly remarkable that this approach, relatively simple from a conceptual point of view, is able to perform so well. The biggest achievement of GPT3 lies precisely in demonstrating this to a wide audience, including the general public.
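One way to make "directly modeling the output" concrete is the standard autoregressive language-modeling formulation that GPT-style models are generally described as following. The notation is introduced here for illustration and does not come from the article: w_t is the t-th token of a text, n is its length, and θ stands for the model's parameters.

```latex
p_\theta(w_1, \dots, w_n) = \prod_{t=1}^{n} p_\theta(w_t \mid w_1, \dots, w_{t-1}),
\qquad
\theta^{\ast} = \arg\max_{\theta} \sum_{t=1}^{n} \log p_\theta(w_t \mid w_1, \dots, w_{t-1})
```

Nothing in this objective refers to grammar, facts or logic. Whatever the model appears to know about them has to emerge from predicting the next token well, which is what makes the approach phenomenological in the sense used above.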

Jean Rohmer

President at Institut Fredrik R. Bull

1 year ago

Alexander Polonsky Very clearly stated, Alexander. But we (the general public and even engineers and scientists) still need to understand what happens in the 175-billion-parameter network, to have an idea of HOW and, more importantly, WHY it works. A sort of "brain imagery" of this neural system. How can we trace the pattern matching activity you mention? Or imagine a new kind of system where "pattern matching" just does not mean anything.
