What is Generative AI, How it is different from existing Machine Learning Techniques
PART 4: what is Really Prompt, Prompt Design
Before learning Part 4 , please read part 3:https://www.dhirubhai.net/pulse/what-generative-ai-how-different-from-existing-machine-sunkara-2c
Large language models are one type of generative AI?since they generate novel combinations of text?in the form of natural sounding language.
A generative image model takes an image?as input and can output text, another image, or video.
For example, under the output text,?you can get visual question answering?while under output image, an image completion is generated.
Generative language models learn?about patterns and language through training data,?then, given some text, they predict what comes next.
?Thus generative language models are pattern matching systems. They learn about patterns based on the data you provide.
The power of generative AI comes from the use of transformers.?Transformers produced a 2018 revolution?in natural language processing.?
At a high level, a transformer model?consists of an encoder and decoder.?The encoder encodes the input sequence?and passes it to the decoder, which?learns how to decode the representation?for a relevant task.
In transformers, hallucinations are words or phrases?that are generated by the model that?are often nonsensical or grammatically incorrect.
In transformers, hallucinations are words or phrases?that are generated by the model that?are often nonsensical or grammatically incorrect.?
Hallucinations can be caused by a number of factors,?including the
model is not trained on enough data,?
or the model is trained on noisy or dirty data,?
or the model is not given enough context,?
or the model is not given enough constraints.
Hallucinations can be a problem for transformers?because they can make the output text difficult to understand.
领英推荐
They can also make the model more?likely to generate incorrect or misleading information.
?A prompt is a short piece of text?that is given to the large language model as input.?And it can be used to control the output of the model?in a variety of ways.?
Prompt design is the process of creating?a prompt that will generate the desired output?from a large language model.?As previously mentioned, gen AI depends a lot?on the training data that you have fed into it.?And it analyzes the patterns and structures of the input data?and thus learns.
model types in Gen AI
Text-to-text.?
Text-to-text models take a natural language input?and produces a text output.?These models are trained to learn the mapping?between a pair of text, e.g. for example, translation from one language to another.
Text-to-image Text-to-image models are trained on a large set of images,?each captioned with a short text description.?Diffusion is one method used to achieve this.?
Text-to-video and text-to-3D
Text-to-video models aim to generate a video representation?from text input.?The input text can be anything from a single sentence?to a full script. And the output is a video that corresponds to the input text.?
Similarly, text-to-3D models generate?three dimensional objects that correspond to a user's text?description.?For example, this can be used in games or other 3D worlds.
Text-to-task
Text-to-task models are trained to perform a defined task?or action based on text input.?This task can be a wide range of actions?such as answering a question, performing a search,?making a prediction, or taking some sort of action.?For example, a text-to-task model?could be trained to navigate a web UI or make changes to a doc?through the GUI.
A foundation model is a large AI model pre-trained?on a vast quantity of data designed to be adapted or fine?tuned to a wide range of downstream tasks,?such as sentiment analysis, image captioning, and object?recognition.?