ChatGPT
What is ChatGPT?
ChatGPT is OpenAI’s latest language model, trained specifically for conversational interaction. Since the release of GPT-3, many problems have surfaced, such as the model answering some questions poorly or making up information. While you can try to change the prompt (i.e. the text given to the AI), it is still you who is trying to cue the AI. Logically, it would be much better if the AI cued you instead. That is, instead of you iteratively rephrasing a question until the AI understands what you are looking for and answers correctly, the AI should simply ask you, in a conversational manner, when it doesn’t understand.
This is the direction ChatGPT moves us towards. For now it is still more inclined to answer than to ask, but the conversational format allows for more dynamic interactions. Previously you could simulate a conversation with GPT-3, but that isn’t what it was trained for. ChatGPT, on the other hand, has been trained explicitly for this purpose. It uses a technique called reinforcement learning from human feedback. Reinforcement learning is an area within machine learning where agents are trained to complete objectives in an environment, driven by rewards. Iteratively, the agent interacts with the environment by performing actions, obtaining rewards and then adjusting its behavior, both to explore the environment and to obtain better rewards.
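The act, observe, adjust loop described above can be illustrated with a toy example. The following is a minimal sketch, not how ChatGPT is trained: it assumes a hypothetical three-action "environment" with fixed average rewards and an epsilon-greedy agent. The names `run_bandit`, `true_means`, `epsilon` and `noise` are made up for this illustration.

```python
import random

def run_bandit(true_means, steps=5000, epsilon=0.1, noise=0.1, seed=0):
    """Epsilon-greedy agent on a toy multi-armed bandit (hypothetical setup)."""
    rng = random.Random(seed)
    n_actions = len(true_means)
    estimates = [0.0] * n_actions  # agent's current estimate of each action's reward
    counts = [0] * n_actions       # how often each action has been tried

    for _ in range(steps):
        # Explore with probability epsilon, otherwise exploit the best estimate.
        if rng.random() < epsilon:
            action = rng.randrange(n_actions)
        else:
            action = max(range(n_actions), key=lambda a: estimates[a])

        # The environment answers with a noisy reward for the chosen action.
        reward = rng.gauss(true_means[action], noise)

        # Adjust behavior: move the running average toward the observed reward.
        counts[action] += 1
        estimates[action] += (reward - estimates[action]) / counts[action]

    return estimates, counts

# Assumed average rewards for three actions; the agent does not see these.
estimates, counts = run_bandit([0.2, 0.5, 0.9])
print("estimated rewards:", [round(e, 2) for e in estimates])
print("times each action was chosen:", counts)
```

Over many steps the agent should come to choose the highest-reward action most often, while the small `epsilon` keeps it occasionally trying the alternatives.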
Games, in particular, have been a playground for these techniques, with impressive results in chess, the Atari games, Go and poker. While games have predefined rules and rewards, a conversation does not, so human feedback becomes essential. This was done by prompting a model, sampling several responses and then letting a human manually rank them. These rankings then become training data for a reward model. Finally, a fine-tuned language model is further trained using reinforcement learning to respond to questions in a way that maximizes the output of the reward model. For more information, see OpenAI’s blog post on ChatGPT.
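To make the ranking step more concrete, here is a minimal sketch of how human rankings could be turned into training data for a reward model. It rests on several assumptions: each response is represented by two hand-made numeric features, the reward model is a plain linear function, and training uses a pairwise logistic loss, which is a common way to learn from comparisons. Real reward models are large neural networks, and all names here (`ranking_to_pairs`, `train_reward_model`, the feature values) are hypothetical.

```python
import math

def ranking_to_pairs(ranked_ids):
    """Turn one human ranking (best first) into (preferred, rejected) pairs."""
    return [(ranked_ids[i], ranked_ids[j])
            for i in range(len(ranked_ids))
            for j in range(i + 1, len(ranked_ids))]

def reward(w, x):
    """Linear reward: a weighted sum of the response's features."""
    return sum(wi * xi for wi, xi in zip(w, x))

def train_reward_model(features, pairs, lr=0.5, epochs=200):
    """Fit linear reward weights so preferred responses score higher."""
    dim = len(next(iter(features.values())))
    w = [0.0] * dim
    for _ in range(epochs):
        for better, worse in pairs:
            diff = [a - b for a, b in zip(features[better], features[worse])]
            margin = reward(w, diff)  # reward(better) - reward(worse)
            # Loss is -log(sigmoid(margin)); its gradient step scales diff
            # by (1 - sigmoid(margin)), which shrinks as the pair is learned.
            scale = 1.0 - 1.0 / (1.0 + math.exp(-margin))
            w = [wi + lr * scale * di for wi, di in zip(w, diff)]
    return w

# Made-up features per sampled response: [helpfulness, amount of invented facts]
features = {"A": [0.9, 0.0], "B": [0.6, 0.3], "C": [0.2, 0.8]}
pairs = ranking_to_pairs(["A", "B", "C"])  # a labeler ranked A over B over C
w = train_reward_model(features, pairs)
scores = {name: reward(w, x) for name, x in features.items()}
print(pairs)
print(scores)
```

In the final stage described above, the language model would then be trained with reinforcement learning to produce responses that score highly under such a reward model.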
What Does ChatGPT Mean?
ChatGPT (Chat Generative Pre-trained Transformer) is a complex machine learning model that carries out natural language generation (NLG) tasks with such fluency that its output can often be mistaken for text written by a human, in the spirit of a Turing test.
ChatGPT was trained on massive amounts of unlabeled data scraped from the internet before 2022. The model is continually monitored and fine-tuned for specific language-oriented tasks with additional datasets labeled by humans.
Tasks ChatGPT excels at include:
ChatGPT is built on a large language model (LLM) from OpenAI’s GPT-3.5 series, a successor to GPT-3 (Generative Pre-trained Transformer 3). A large language model is a type of deep neural network that uses billions of parameters (GPT-3 has 175 billion) and is trained on hundreds of billions of tokens of text. The model was developed by OpenAI, an AI research company co-founded in 2015 by a group that included Sam Altman and Elon Musk.