First step in AI and LLM.

Quoc Viet Ha

Software Engineer | ReactJS | NextJS | React Native | NodeJS | Firebase | AWS

Published: November 13, 2024

What is AI, and What Are Its Types?

AI stands for Artificial Intelligence. It is a major field of computer science that emerged in the 1950s. AI can do many amazing things, like recognize faces, help cars drive by themselves, predict the weather, and even make music.

AI is usually pictured as four nested layers, each a subset of the one before it. The years are rough markers of when each layer drew wide attention, not of when its ideas first appeared:

Artificial Intelligence (1956): The field in which people build computers that can perform tasks normally requiring human intelligence.
Machine Learning (1997): A part of AI that lets computers learn from data so they can make better choices or predictions. The term itself dates back to 1959.
Deep Learning (2017): A type of machine learning that uses many “layers” of artificial neural networks to understand complex information.
Generative AI (2021): A type of deep learning that creates new pictures, sounds, or writing based on what it has learned.

What is a Large Language Model (LLM), and What Does It Do?

A Large Language Model (LLM) is a special type of AI that can understand and create text. It can recognize letters, words, and sentences and figure out how they work together to make sense.

To do this, LLMs use a method called deep learning to study tons of text. Over time, they learn to recognize patterns in words and sentences, so they can answer questions or write text without human help.
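The idea of learning patterns from text can be shown at toy scale. The sketch below is not how a real LLM works internally (real models use deep neural networks, not word counts); it only counts which word tends to follow which in a tiny made-up corpus and predicts the most frequent follower. The corpus and the function names `train_bigram_model` and `predict_next` are assumptions made for illustration.

```python
from collections import Counter, defaultdict

def train_bigram_model(text):
    """Count, for every word, which words follow it in the training text."""
    words = text.lower().split()
    followers = defaultdict(Counter)
    for current_word, next_word in zip(words, words[1:]):
        followers[current_word][next_word] += 1
    return followers

def predict_next(model, word):
    """Return the most frequent follower of `word`, or None if it was never seen."""
    counts = model.get(word.lower())
    if not counts:
        return None
    return counts.most_common(1)[0][0]

# Tiny made-up corpus; a real LLM studies billions of words
corpus = (
    "the cat sat on the mat . "
    "the cat chased the mouse . "
    "the dog sat on the rug ."
)
model = train_bigram_model(corpus)
print(predict_next(model, "the"))  # cat
print(predict_next(model, "sat"))  # on
```

With more text the counts become more informative. An LLM follows the same basic goal of predicting what comes next, but with a far richer model of context.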

LLMs are used for lots of things, like chatbots (for example, ChatGPT), and can help people write stories, answer questions, and much more.

How Are LLMs Trained?

To train an LLM, we need a huge amount of text (billions of words) so it can learn a lot about language and how words fit together.

There are two main ways to train LLMs:

The first way is called supervised learning: the AI learns from labeled data, which means humans help it by telling it what’s right or wrong. For example, to help the AI learn numbers, we might show it lots of pictures of numbers with labels so it knows what each one is.

One of the best-known videos on supervised learning shows a Convolutional Neural Network (CNN) identifying numbers in pictures.
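Learning from labeled pictures can be sketched in a few lines. The example below is a simplification: it uses a nearest-centroid classifier rather than a CNN, and the 3x3 "images" are invented for illustration. It averages the labeled examples of each digit and then assigns a new image to the closest average. The names `train_centroids` and `classify` are hypothetical.

```python
def train_centroids(examples):
    """Average the pixel vectors of each label to get one 'typical' image per label."""
    sums, counts = {}, {}
    for pixels, label in examples:
        if label not in sums:
            sums[label] = [0.0] * len(pixels)
            counts[label] = 0
        sums[label] = [s + p for s, p in zip(sums[label], pixels)]
        counts[label] += 1
    return {label: [s / counts[label] for s in sums[label]] for label in sums}

def classify(centroids, pixels):
    """Return the label whose centroid is closest (squared Euclidean distance)."""
    def distance(centroid):
        return sum((c - p) ** 2 for c, p in zip(centroid, pixels))
    return min(centroids, key=lambda label: distance(centroids[label]))

# 3x3 images flattened row by row; 1 = ink, 0 = blank (made-up data)
labeled_examples = [
    ([1, 1, 1, 1, 0, 1, 1, 1, 1], "0"),
    ([1, 1, 1, 1, 0, 1, 0, 1, 1], "0"),  # a zero with one missing pixel
    ([0, 1, 0, 0, 1, 0, 0, 1, 0], "1"),
    ([1, 1, 0, 0, 1, 0, 0, 1, 0], "1"),  # a one with a small flag
]
centroids = train_centroids(labeled_examples)

unseen_image = [0, 1, 0, 0, 1, 0, 0, 1, 1]  # not in the training set
print(classify(centroids, unseen_image))  # 1
```

The labels are what make this supervised: without the "0" and "1" tags supplied by a human, the program would have no notion of which pictures belong together under which name.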

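The other way is unsupervised learning, in which the machine learns by itself without human instruction. For LLMs, this usually means predicting the next word in raw text, so the text itself supplies the correct answer.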

Unsupervised learning suits complex processing tasks such as organizing large datasets into clusters. It is useful for finding previously undetected patterns in data, which can in turn reveal features useful for categorizing that data.
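Clustering without labels can be illustrated with k-means, one common clustering algorithm (chosen here as an example; the article does not name a specific one). The sketch below groups made-up article lengths into two clusters. The data, the starting centers, and the function name `k_means_1d` are assumptions.

```python
def k_means_1d(values, centers, iterations=10):
    """Plain k-means on a list of numbers, starting from the given centers."""
    centers = list(centers)
    clusters = []
    for _ in range(iterations):
        # Assignment step: put each value in the cluster of its nearest center
        clusters = [[] for _ in centers]
        for value in values:
            nearest = min(range(len(centers)), key=lambda i: abs(value - centers[i]))
            clusters[nearest].append(value)
        # Update step: move each center to the mean of its cluster
        # (an empty cluster keeps its old center)
        centers = [
            sum(cluster) / len(cluster) if cluster else center
            for cluster, center in zip(clusters, centers)
        ]
    return centers, clusters

# Made-up article lengths in words; no labels are given
article_lengths = [120, 150, 130, 900, 950, 1000]
centers, clusters = k_means_1d(article_lengths, centers=[100, 500])
print(clusters)  # [[120, 150, 130], [900, 950, 1000]]
```

Nobody told the program that there are "short" and "long" articles. The two groups emerge from the data alone, which is the point of unsupervised learning.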

Training an LLM takes three stages:

Gather Data: Engineers give the LLM a large dataset of words, sentences, and text such as posts, articles, and websites. This dataset contains billions or even trillions of words. For example, Meta's Llama 3 model was pre-trained on over 15T tokens, which is roughly on the order of 10 trillion words. The model is then pre-trained on this natural language and tries to learn the context between words.
Fine-tuning: Engineers fine-tune the model on data labeled with the correct answers, so the LLM can learn from them over repeated passes.
Reinforcement Learning: After the LLM provides answers, reinforcement learning (a machine learning technique that trains software to make decisions that achieve the best result) is applied to score those answers and steer the LLM toward the best one.
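The reinforcement learning stage can be sketched at toy scale. The example below is a heavy simplification and not the procedure any LLM provider actually uses: the "model" is just a preference score per candidate answer, the reward values are assigned by hand, and each step nudges the scores in the direction that raises the expected reward. All names and numbers are assumptions for illustration.

```python
import math

def softmax(scores):
    """Turn raw preference scores into probabilities that sum to 1."""
    highest = max(scores.values())
    exps = {answer: math.exp(score - highest) for answer, score in scores.items()}
    total = sum(exps.values())
    return {answer: value / total for answer, value in exps.items()}

def reinforce_step(scores, rewards, learning_rate=1.0):
    """One gradient-ascent step on expected reward for a softmax policy."""
    probs = softmax(scores)
    expected_reward = sum(probs[a] * rewards[a] for a in scores)
    # Answers with above-average reward gain score; the others lose score
    return {
        a: scores[a] + learning_rate * probs[a] * (rewards[a] - expected_reward)
        for a in scores
    }

# Hypothetical starting preferences: the toy model favors the wrong answer
scores = {"9.11 is greater": 1.0, "9.9 is greater": 0.5, "they are equal": 0.0}
# Hand-assigned rewards: 1 for the correct answer, 0 otherwise
rewards = {"9.11 is greater": 0.0, "9.9 is greater": 1.0, "they are equal": 0.0}

before = softmax(scores)
for _ in range(50):
    scores = reinforce_step(scores, rewards)
after = softmax(scores)

best_answer = max(after, key=after.get)
print(best_answer)  # 9.9 is greater
```

The toy model starts out preferring the wrong answer and ends up preferring the rewarded one. That shift in preference is the effect this stage aims for.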

Pros and cons of LLMs

Pros:

LLMs handle human language very well: they can read and make sense of words, sentences, and even complex ideas, much like a person would.
LLM applications like ChatGPT can save people time by answering questions quickly, and can sometimes even give better answers than people.

Cons:

Knowledge cutoff: LLMs can’t know about events after they were last trained. For example, they might not know about very recent news.
Hallucinations: Sometimes, the model can produce outputs that are coherent and grammatically correct but factually incorrect or nonsensical.

For example, asking ChatGPT which number is greater, 9.11 or 9.9, can produce an answer of this kind.
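The correct result is easy to check in code. The sketch below compares the two values as decimal numbers and then, for contrast, as dotted version numbers. The version-style reading is only one plausible source of confusion for a model trained on text. It is an assumption here, not a claim from the article. The function names are hypothetical.

```python
from decimal import Decimal

def compare_as_decimals(a, b):
    """Return the larger of two decimal strings, compared as numbers."""
    return a if Decimal(a) > Decimal(b) else b

def compare_as_versions(a, b):
    """Return the larger of two dotted strings, compared part by part like software versions."""
    def parts(text):
        return tuple(int(piece) for piece in text.split("."))
    return a if parts(a) > parts(b) else b

print(compare_as_decimals("9.11", "9.9"))  # 9.9  (9.90 > 9.11)
print(compare_as_versions("9.11", "9.9"))  # 9.11 (minor version 11 > 9)
```

The same pair of strings has two defensible orderings depending on context, so a fluent answer is not necessarily a correct one and is worth verifying.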
