Generative AI: time to learn a whole new vocabulary

I have no idea how to talk about sport. This was a disadvantage when growing up as a teenager at an all-boys school. I felt as if I'd missed an important lesson, or failed to read the manual that the other boys had been issued at an early age. How else did everybody else have a vocabulary, a set of concepts, a whole language, that was opaque to me?

I initially felt the same way when attempting to learn in public about generative AI, the set of solutions such as ChatGPT and DALL-E which are receiving a lot of attention right now.

This was the week when I was supposed to read a few more detailed papers, to find a couple of books, and to go deep enough to get to grips with the main concepts. However, I found that, unlike my similar experiment with quantum computing, it was hard to find accessible entry points. Perhaps this is because, despite rapid developments in recent years, the ideas behind quantum computing have been around for a long while - long enough for experts to write introductions for curious laypeople like me. By contrast, most of the material describing generative AI technologies was quite new, and either so high level that it told me little I didn’t already know (and much that I had reason to be sceptical of), or dived so deep that I was as baffled as if listening to the dissection of a football match. No-one has had time to write the accessible introduction yet.

However, with some perseverance, and a few helpful links (shared at the end of this article), I was able to figure some things out. The main lessons I learnt were:

There are a few important terms . . .

I saw the same few terms over and over again, and derived the following meaning:

  • Large Language Model (LLM): a model with a very high number of parameters, trained on huge quantities of data (e.g. a large fraction of the text available on the Internet).
  • Transformer: a multi-layer machine learning architecture in which an input (for example, a prompt) is encoded into a mathematical representation, then used to predict a valid output, then decoded back into a linguistic form.
  • Attention: a way of processing the outputs from hidden layers in the architecture to figure out which parts of a piece of text are most relevant to each other, and to overcome the problem that many language models forget what they are talking about partway through.
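To make the attention term a little more concrete, here is a minimal sketch of the scaled dot-product attention calculation that sits at the heart of transformers, written in NumPy. The matrix sizes and random values are purely illustrative, and real models add learned projections, multiple heads, and many stacked layers on top of this:

```python
import numpy as np

def softmax(x, axis=-1):
    # Numerically stable softmax: turn raw scores into weights summing to 1.
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def attention(Q, K, V):
    # Scaled dot-product attention: score each query against every key,
    # normalise the scores into weights, then take a weighted sum of values.
    # The weights say, for each position, which other positions matter most.
    d_k = Q.shape[-1]
    scores = Q @ K.T / np.sqrt(d_k)
    weights = softmax(scores, axis=-1)
    return weights @ V, weights

rng = np.random.default_rng(0)
Q = rng.normal(size=(4, 8))   # 4 token positions, 8-dimensional representations
K = rng.normal(size=(4, 8))
V = rng.normal(size=(4, 8))

out, w = attention(Q, K, V)
print(out.shape)        # (4, 8): one blended representation per position
print(w.sum(axis=-1))   # each row of weights sums to 1
```

Because every position can attend directly to every other position, the model does not have to carry context forward step by step, which is how attention addresses the "forgetting partway through" problem.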

. . . but it’s all still machine learning doing prediction . . .

Despite these new terms, the technology is, at root, still machine learning using neural networks. Networks are trained using a combination of supervised and unsupervised learning. Then they are used to make a prediction: that this output is an acceptable response to this input.
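The "prediction" at the core of this can be illustrated with a deliberately tiny toy: a bigram model that counts which word follows which in its training data, then predicts the most frequent continuation. An LLM is vastly more sophisticated, but the principle, producing a plausible continuation based on statistics learned from data, is the same. The corpus and names below are invented for illustration:

```python
from collections import Counter, defaultdict

# A toy "training set": the model will know nothing beyond these words.
corpus = "the cat sat on the mat the cat ate".split()

# Count which word follows which. A (tiny) language model is just
# learned statistics about what plausibly comes next.
follows = defaultdict(Counter)
for prev, nxt in zip(corpus, corpus[1:]):
    follows[prev][nxt] += 1

def predict(word):
    # Predict the continuation seen most often in training.
    return follows[word].most_common(1)[0][0]

print(predict("the"))   # 'cat': the most plausible continuation, given the data
```

Note what this toy makes obvious: the output is plausible rather than true, and the model can never produce a word it has not seen, which foreshadows the points about accuracy and creativity below.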

. . . the difference is architecture and scale.

The difference between these new generative AI solutions and other solutions is therefore in architecture (different models and layers have been assembled to do different jobs in interesting ways) and scale (the models are huge and are trained on huge quantities of data).

We’ll come back to these points in a second: they have implications for the comprehensibility of generative AI solutions.

The models deliver plausibility, not accuracy or creativity . . .

Because, despite their architecture and scale, generative AI solutions are just machine learning models trained on large but limited data sets, they have the characteristics of all such models.

First, they are designed to suit a particular purpose, and that purpose is to provide an output which the recipient perceives as plausible: readable text in the case of ChatGPT; a recognisable image in the case of DALL-E. Plausibility is not the same as accuracy: ChatGPT and similar solutions are notoriously prone to producing text which sounds credible but which is fabricated.

Second, they are constrained by the training data. This means that these solutions are not creative: they may produce sentences which have never been written before, but they are limited to the total body of data which they have been fed. Furthermore, they are subject to all of the biases inherent in the data set.

. . . and it’s really hard to understand exactly what’s going on.

There are two layers of obscurity in these technologies.

First, they are complex specialisms which take effort to understand - a lot of effort. Unlike sport, technology is my field. And, as it's a fast moving field, I often find myself running to catch up, and having to learn quickly. With this set of technologies, though, the field is developing so fast, and the specialism deepening so quickly, that I found machine learning experts struggling to explain the concepts to each other.

Second, machine learning models are notoriously hard to understand, as training results in a set of weights in a neural network which mean nothing to human beings. The size of LLMs puts them far beyond the limits of human comprehension, leaving us to probe the models through the interface (the prompt) to understand how they really work.

So, a few weeks of reading have led me to this conclusion: generative AI is a type of machine learning built using large models, trained with huge datasets and a complex architecture, to predict plausible text or recognisable images based on a prompt. It is dependent on pre-existing data, and is neither creative nor reliably accurate. It is easy to imagine many practical applications, but it is hard to understand both the technology and the models.

I started this series by saying that I didn’t know how I felt about generative AI, but felt that I ought to feel something. I now know roughly how I feel: excited and impressed, but also concerned that we are creating a new set of practical technologies which are difficult to understand, which are often misrepresented, but which are so useful that they are already becoming part of our lives. I frequently argue that technologists have a duty to explain: in this case, I think that we also have an urgent duty to understand.

In my next article I will explore that duty.

In the meantime, here are a few links which helped me get to grips with this topic:

(Views in this article are my own.)
