Course: Introduction to Large Language Models
Making large language models follow instructions
- [Instructor] We've seen the problem with a base large language model: out of the box, it doesn't follow our instruction to create a shopping list. So how do we go about creating a large language model that will follow the instructions we give it? In 2022, the OpenAI team released a paper called "Training Language Models to Follow Instructions with Human Feedback," which is still the industry standard. There are two components to this training: supervised fine-tuning, and RLHF, or reinforcement learning from human feedback. Let's head over to the paper and take a look at the supervised training in the diagram on the left. The OpenAI research team would create a prompt, for example, "Explain the moon landing to a six-year-old," and then a labeler, a person who's skilled at working with text data, would write out what the model should produce as output. So for example, they may include…
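To make the supervised step concrete, here is a minimal Python sketch of how one prompt and its labeler-written demonstration might be stored and turned into a training example. The names `Demonstration`, `to_training_text`, and `loss_mask` are hypothetical, the completion is a placeholder rather than text from the paper, and whitespace splitting stands in for a real tokenizer. Scoring only the completion tokens is a common choice that this sketch assumes; it is not something stated in the course.

```python
from dataclasses import dataclass


@dataclass
class Demonstration:
    """One supervised fine-tuning example (hypothetical structure)."""
    prompt: str       # written by the research team
    completion: str   # written by a human labeler


def to_training_text(demo: Demonstration, separator: str = "\n\n") -> str:
    """Join prompt and completion into the single string the model is trained on."""
    return demo.prompt + separator + demo.completion


def loss_mask(prompt_tokens: list, completion_tokens: list) -> list:
    """Return 0 for each prompt token and 1 for each completion token.

    Assumption: only the labeler's text contributes to the training loss.
    """
    return [0] * len(prompt_tokens) + [1] * len(completion_tokens)


demo = Demonstration(
    prompt="Explain the moon landing to a six-year-old",
    # Placeholder only: the actual labeler-written answer is not reproduced here.
    completion="<labeler-written answer goes here>",
)

text = to_training_text(demo)

# Whitespace splitting is a stand-in for a real tokenizer.
mask = loss_mask(demo.prompt.split(), demo.completion.split())
```

A real pipeline would collect many such pairs and fine-tune the base model to predict each completion given its prompt.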