课程: Generative AI: Working with Large Language Models
今天就学习课程吧!
今天就开通帐号,24,700 门业界名师课程任您挑!
Inference time vs. pre-training
课程: Generative AI: Working with Large Language Models
Inference time vs. pre-training
- [Instructor] Over this course, we've looked at scaling laws and lessons from the chinchilla models. Let's bring this all together as we look at the current trends for large language models. When training of large language models first kicked off, most of the focus was on improving the models, and then the scaling laws were the guiding principle, and the scaling laws suggested that you would get the biggest improvement by increasing the size of the models. So you do this by increasing the model's architecture, so that's the number of layers in a model, the number of attention heads, and so on. But that was only one dimension because as model providers created large models, the associated training cost became an important consideration. So the second important dimension became training cost, because being able to train a model effectively given a budget was important. And the learnings from the chinchilla paper earlier in the course suggested that large language models were being…
随堂练习,边学边练
下载课堂讲义。学练结合,紧跟进度,轻松巩固知识。
内容
-
-
-
-
-
GPT-34 分钟 32 秒
-
(已锁定)
GPT-3 use cases5 分钟 27 秒
-
(已锁定)
Challenges and shortcomings of GPT-34 分钟 17 秒
-
(已锁定)
GLaM3 分钟 6 秒
-
(已锁定)
Megatron-Turing NLG Model1 分钟 59 秒
-
(已锁定)
Gopher5 分钟 23 秒
-
(已锁定)
Scaling laws3 分钟 14 秒
-
(已锁定)
Chinchilla7 分钟 53 秒
-
(已锁定)
BIG-bench4 分钟 24 秒
-
(已锁定)
PaLM5 分钟 49 秒
-
(已锁定)
OPT and BLOOM2 分钟 51 秒
-
(已锁定)
GitHub models2 分钟 43 秒
-
(已锁定)
Accessing Large Language Models using an API6 分钟 25 秒
-
(已锁定)
Inference time vs. pre-training4 分钟 5 秒
-
-