Understanding Parameters and Tokens in an LLM: A Simple Breakdown
Article 3 in the LLM series. Previous article: How an LLM Works: A Simple Way to Explain It | LinkedIn
We’ve already established that in an LLM (like GPT), parameters capture the relationships between words, and tokens are the words (or word pieces) themselves. Now, let’s dig a little deeper into how these tokens and parameters work together to help the model understand and respond to prompts.
Tokens and Relationships
First, think of tokens as individual words or parts of words. Each word has certain relationships with other words, and these relationships are built during the model’s training. The more training data the model has, the more relationships (or parameters) it can establish between different words.
When you give the LLM a prompt (we can think of this as a "project"), the model analyzes the relationships between the words in that prompt. It evaluates how strongly the words are connected and decides which connections are most relevant. These relationships are measured by vectors, which are like numerical scores that show how related two words are. These vectors aren’t static—they change dynamically based on the context of the prompt.
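To make that concrete, here is a minimal Python sketch of how relatedness between two words can be scored numerically. The four-dimensional vectors are made up for illustration (real models learn hundreds or thousands of dimensions), and cosine similarity is just one common way to compare word vectors:

```python
import math

# Hypothetical 4-dimensional vectors, invented for this example.
# In a real model these values are learned during training.
vectors = {
    "bank":  [0.8, 0.1, 0.6, 0.2],
    "money": [0.7, 0.2, 0.5, 0.1],
    "river": [0.1, 0.9, 0.2, 0.8],
}

def cosine_similarity(a, b):
    """Score between -1 and 1: higher means more closely related."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(x * x for x in b))
    return dot / (norm_a * norm_b)

print(cosine_similarity(vectors["bank"], vectors["money"]))  # high: strongly related
print(cosine_similarity(vectors["bank"], vectors["river"]))  # lower: weakly related
```

Running this prints a score near 1 for "bank" and "money" and a noticeably lower one for "bank" and "river", which is the numerical version of the thick and thin lines described below.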
Visualizing the Relationships
Imagine each word is inside a circle, and from that circle, you have multiple lines connecting it to other words. Each line represents a potential relationship between the word in the circle and another word. Some of these lines might be thick and strong because those words are closely related, while others might be thin and weak if the relationship is less relevant.
Note: to visualize word relationships, see Semantically related words for "dubai_NOUN" (nlpl.eu).
For example, if you take the word "bank" and place it in a circle, you might see lines connecting it to words like "money," "loan," "river," and "finance." The strength of these connections depends on the context. If you’re talking about banking, the connection between "bank" and "money" will be strong, but if the context is about rivers, the connection between "bank" and "river" will become stronger instead.
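Here is a toy illustration of that idea (the words, senses, and strengths are all invented for the example): the "sense" of "bank" that wins is simply the one whose connections to the surrounding words are strongest. A real LLM does this with high-dimensional vector math rather than a lookup table, but the intuition is the same:

```python
# Hypothetical connection strengths between two senses of "bank"
# and other words. These numbers are made up for illustration.
relatedness = {
    ("bank/finance", "money"): 0.9,
    ("bank/finance", "loan"):  0.8,
    ("bank/finance", "water"): 0.1,
    ("bank/river",   "money"): 0.1,
    ("bank/river",   "loan"):  0.1,
    ("bank/river",   "water"): 0.9,
}

def pick_sense(context_words):
    """Choose the sense of 'bank' whose connections to the context are strongest."""
    senses = ["bank/finance", "bank/river"]
    scores = {
        sense: sum(relatedness.get((sense, word), 0.0) for word in context_words)
        for sense in senses
    }
    return max(scores, key=scores.get)

print(pick_sense(["money", "loan"]))  # bank/finance
print(pick_sense(["water"]))          # bank/river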
Changing Contexts and Relationships
The beauty of an LLM is that these relationships adapt depending on the context of your prompt. Say you move from talking about healthcare to finance and then to geography: the same words take on the meaning that fits each domain.
This means that for each new context or domain, the LLM dynamically adjusts the vectors (relationships between words) to match the meaning that’s most relevant.
The Role of Training Data in Building These Relationships
So, how does the LLM know to adjust these connections? This is where training data comes in. During training, the model is exposed to a huge amount of text across different topics and domains. It learns the patterns in language and builds initial vectors (or relationships) between words.
At the start, these vectors are generic and might even be random. But as the model processes more data, it refines these relationships. For instance, the more the model reads about banks and finance, the stronger the connection between "bank" and "loan" becomes.
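Here is a highly simplified sketch of that refinement, loosely inspired by word-embedding training (like word2vec) rather than GPT's actual training procedure: each time two words appear together in the training data, their vectors are nudged a little closer, and their similarity score climbs.

```python
import math

def cosine(a, b):
    """Similarity score between -1 and 1."""
    dot = sum(x * y for x, y in zip(a, b))
    return dot / (math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b)))

# Start with arbitrary (effectively random) vectors, as at the start of training.
vec = {"bank": [0.9, -0.4, 0.1], "loan": [-0.2, 0.8, 0.3]}

def nudge_closer(w1, w2, lr=0.1):
    """Move each word's vector a small step toward the other's."""
    v1, v2 = vec[w1], vec[w2]
    vec[w1] = [a + lr * (b - a) for a, b in zip(v1, v2)]
    vec[w2] = [b + lr * (a - b) for a, b in zip(v1, v2)]

print("before:", round(cosine(vec["bank"], vec["loan"]), 3))
# Pretend the training data mentions "bank" and "loan" together many times.
for _ in range(30):
    nudge_closer("bank", "loan")
print("after: ", round(cosine(vec["bank"], vec["loan"]), 3))
```

Before the "training" loop the two vectors are barely related (the score is even negative); after repeated co-occurrences the score approaches 1, which is the toy version of the connection between "bank" and "loan" getting stronger.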
Once trained, the model doesn’t need to go back to the original data every time it gets a new prompt. Instead, it uses what it already learned—just like how you don’t need to look up a fact in a book once you’ve memorized it. The LLM adjusts the relationships between words on the fly, based on the context of the question or prompt.
In Summary
In short, every word is connected to many other words, and the strength of those connections shifts depending on the project or prompt you're working on. It’s like having a web of words, and the model knows how to highlight the right connections depending on what you ask.
Going a Bit Deeper: Vector Representation
If you're curious about how these relationships are actually represented in the LLM, here's a bit more detail. In models like GPT, every word (or token) is represented by a vector—a long list of numbers, typically hundreds or thousands of dimensions (1,024, for example). Each of those numbers contributes to capturing the word's meaning.
For example, here is a made-up stand-in for one word's vector (illustrative Python, not a real model's learned values):
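```python
import random

# A made-up stand-in for a learned embedding: 1,024 random numbers
# between -1 and 1. In a trained model these values are learned,
# not random, and together they encode the word's meaning.
random.seed(0)  # so the example is reproducible
dimensions = 1024
embedding = {"bank": [random.uniform(-1, 1) for _ in range(dimensions)]}

print(len(embedding["bank"]))  # 1024 numbers represent one token
print(embedding["bank"][:5])   # the first few values
```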
These numbers are typically small (often between -1 and 1) and reflect the word's relationship with other words in the model. Words with similar meanings will have vectors that are closer together in this high-dimensional space, while unrelated words will have vectors that are farther apart. The model uses these vectors to understand and adjust the relationships between words based on the context of your prompt.
So, while we visualize these connections as lines between words in circles, under the hood, the model is using mathematical relationships between these high-dimensional vectors to decide how words are related. This allows the model to adjust dynamically and provide relevant answers, no matter the domain or topic.