Introduction to Gen AI
Maduri Madduru
Product Manager@TCS, Certified Product Professional, Lean Six Sigma(Green Belt), Ex-Infosys
The first session as part of the Gen AI rush cohort, an initiative by The Product Folks kick-started with Sudalai Rajkumar - SRK walking us through the basics of Generative AI
Evolution of AI
What is Gen AI?
Gen AI refer to describing algorithms, that can be used to create new content including text, images, audio and video
Gen AI in Natural Langaguge Processing(NLP)
Gen AI Large Language Models(LLMs)
Large Language Models are the language models that are trained on large amounts of text data to generate human-like text. They are called Large language models as they accept billions of parameters. Ex. ChatGPT
As we can obviously make out, the output text that gets generated depends upon the input text. This input text that we provide to the AI models is called Prompt.
Two types of prompting:
Gen AI - Diffusion Models
领英推荐
The stable Diffusion model is an iterative model which removes noise from images.
Evolution of NLP Models
Bag of Words: In BoW, we count the number of times a particular word occurs in a particular text and then we put the count over there so it's just counting the words and then we build some machine learning models on top of them. Limitation: For ex. if we are doing a sentiment prediction and the input text had the words happy and glad and if the model has not seen the word glad in the training set, it will not be able to predict that glad is the same as happy.
In order to overcome such limitations, the models have to be trained across all the words that are present in the vocabulary.
Word Vectors & Deep Learning: Each word is represented in a vector representation and words that are close to each other are also closer in high dimensional space. For example, in the above vector representation, the cat and kitten appear closer to each other whereas the word dog or house appears farther. But, the problem here is what if we encounter a word like apple? An apple could mean a fruit or an organization and the model could not differentiate between the two unless we have a clear context.
Context vectors & Transformers: In this model, the entire sentence or the entire paragraph is converted into a vector format
The 4 stages in GPT training process are as follows:
Some of the tools that can be leveraged for various areas of work:
This was followed by a Demo of the application called Document QA which is an AI/ML-powered automation platform built using the concepts discussed during the session.