Generative AI for Image Generation - GAN

Generative adversarial networks (GANs) are one of the hottest topics in deep learning. They can generate an unlimited number of new image samples that resemble a given dataset. The underlying idea behind a GAN is that it contains two neural networks that compete against each other in a zero-sum game framework: a generator and a discriminator.

Welcome! It is always a great idea to tell a story through a picture. Here is a story about a rather clever forger. He sells fake milk, and the milk shop owner can tell that it is fake. The forger is clever because he learns from the feedback the shop owner gives him. Each time, he produces slightly less fake milk, learning from the feedback. Eventually, the forger outsmarts the shop owner and produces milk that is almost indistinguishable from the real thing.

Congratulations! You have just understood a GAN. Here the forger is the generator, and the shop owner is the discriminator. A GAN consists of two deep learning models: one generator model and one discriminator model. In short, we can summarize a GAN as follows:

  1. The generator (the forger) creates the milk.
  2. The discriminator (the milk shop owner, an expert who can tell real milk from fake) judges it.
  3. In the first round, the generator (forger) produces milk and the discriminator (shop owner) can tell that it is fake. Learning from this feedback (the loss function), the generator improves the next time; the discriminator again says the milk is fake, but it is less fake than before. In this way, feedback helps the generator improve the milk's quality until the discriminator says the milk is real.

Here is a more formal look at the GAN architecture.

GANs are inspired by zero-sum, non-cooperative games: if one player wins, the other loses. A zero-sum game is also known as a minimax game. Player A wants to maximize the value function, while player B wants to minimize it. In game theory, a GAN converges when the discriminator (player A) and the generator (player B) reach a Nash equilibrium, which is the optimal point of the minimax equation.
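Formally, the minimax game described above is usually written as the following value function, where the discriminator D tries to maximize V and the generator G tries to minimize it:

```latex
\min_G \max_D V(D, G) =
  \mathbb{E}_{x \sim p_{\text{data}}(x)}\bigl[\log D(x)\bigr]
  + \mathbb{E}_{z \sim p_z(z)}\bigl[\log\bigl(1 - D(G(z))\bigr)\bigr]
```

Here D(x) is the discriminator's estimated probability that x is a real sample, and G(z) is the sample the generator produces from random noise z.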

Training a GAN is equivalent to minimizing the JS divergence (a symmetrized variant of the KL divergence) between the probability distribution q (the estimated distribution, produced by the generator) and the probability distribution p (the real-world data distribution). In layman's terms, JS divergence (or KL divergence) measures the distance between two probability distributions.
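To make the divergence concrete, here is a minimal sketch (using NumPy; the helper names and the example distributions are illustrative, not from the article) that computes KL and JS divergence for two discrete distributions:

```python
import numpy as np

def kl_divergence(p, q):
    """KL(p || q) for discrete distributions; assumes q > 0 wherever p > 0."""
    p = np.asarray(p, dtype=float)
    q = np.asarray(q, dtype=float)
    mask = p > 0                       # terms with p == 0 contribute nothing
    return float(np.sum(p[mask] * np.log(p[mask] / q[mask])))

def js_divergence(p, q):
    """Jensen-Shannon divergence: symmetric and bounded above by log(2)."""
    p = np.asarray(p, dtype=float)
    q = np.asarray(q, dtype=float)
    m = 0.5 * (p + q)                  # mixture distribution
    return 0.5 * kl_divergence(p, m) + 0.5 * kl_divergence(q, m)

p = np.array([0.5, 0.5])
q = np.array([0.9, 0.1])
print(js_divergence(p, p))  # identical distributions -> 0.0
print(js_divergence(p, q))  # a positive "distance"
```

Unlike KL, the JS divergence is symmetric in its arguments, which is one reason it appears in the GAN analysis.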

Generator

The generator takes random noise as input and produces samples as output. Its goal is to generate samples that fool the discriminator into thinking it is seeing real images when it is actually seeing fakes. We can think of the generator as a counterfeiter.
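As a rough sketch of this noise-to-sample mapping, the following NumPy toy generator (the layer sizes and random, untrained weights are hypothetical, chosen purely for illustration) turns noise vectors into fake "image" vectors:

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes for illustration: 16-dim noise -> 28x28 "image".
NOISE_DIM, HIDDEN, IMG_DIM = 16, 32, 28 * 28

# Randomly initialised weights stand in for a trained generator.
W1 = rng.normal(0.0, 0.1, (NOISE_DIM, HIDDEN))
W2 = rng.normal(0.0, 0.1, (HIDDEN, IMG_DIM))

def generator(z):
    """Map a batch of noise vectors to fake samples with pixels in [-1, 1]."""
    h = np.maximum(0.0, z @ W1)        # ReLU hidden layer
    return np.tanh(h @ W2)             # tanh keeps outputs in [-1, 1]

z = rng.normal(size=(4, NOISE_DIM))    # batch of 4 noise vectors
fakes = generator(z)
print(fakes.shape)                     # (4, 784)
```

In a real GAN the weights would of course be learned; the point here is only the shape of the computation: noise in, sample out.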

Discriminator

The discriminator takes both real images from the input dataset and fake images from the generator, and outputs a verdict on whether a given image is real or fake. We can think of the discriminator as a policeman trying to catch the bad guys while letting the good guys go free.
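The discriminator's computation can be sketched the same way (again with hypothetical sizes and untrained weights): a small NumPy network that maps an image vector to a probability that the image is real:

```python
import numpy as np

rng = np.random.default_rng(1)
IMG_DIM, HIDDEN = 28 * 28, 32          # hypothetical sizes, for illustration

# Randomly initialised weights stand in for a trained discriminator.
V1 = rng.normal(0.0, 0.1, (IMG_DIM, HIDDEN))
V2 = rng.normal(0.0, 0.1, (HIDDEN, 1))

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

def discriminator(x):
    """Return P(real) in (0, 1) for each image in the batch."""
    h = np.maximum(0.0, x @ V1)        # ReLU hidden layer
    return sigmoid(h @ V2).ravel()     # sigmoid -> probability per image

batch = rng.normal(size=(4, IMG_DIM))  # 4 stand-in "images"
scores = discriminator(batch)
print(scores.shape)                    # (4,)
```

This is exactly the binary classifier described below: one probability per input image.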

The discriminator has the task of determining whether a given image looks natural (that is, comes from the dataset) or has been artificially created; it is essentially a binary classifier, typically taking the form of a standard CNN. The task of the generator is to create natural-looking images that follow the original data distribution closely enough to fool the discriminator. First, random noise is fed to the generator, which uses it to create fake images; these fake images, along with real images, are then sent to the discriminator.

The generator tries to fool the discriminator, while the discriminator tries not to be fooled. As the models train through alternating optimization, both improve until the fake images are indistinguishable from the dataset images.
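The alternating optimization described above can be sketched end to end on a toy problem. This is a deliberately minimal, hand-derived 1-D GAN, and everything in it is an assumption for illustration: the real data is N(3, 1), the generator is a linear map of noise, and the discriminator is a single logistic unit with gradients written out by hand:

```python
import numpy as np

rng = np.random.default_rng(0)
REAL_MEAN = 3.0                        # toy "dataset": samples from N(3, 1)

def sigmoid(x):
    return 1.0 / (1.0 + np.exp(-x))

# Generator x = a*z + b and discriminator D(x) = sigmoid(w*x + c);
# a minimal parameterisation so the gradients fit in a few lines.
a, b = 1.0, 0.0                        # generator starts far from the data
w, c = 0.1, 0.0
lr, batch = 0.05, 64

for step in range(2000):
    z = rng.normal(size=batch)
    x_fake = a * z + b
    x_real = rng.normal(loc=REAL_MEAN, size=batch)

    # --- discriminator step: push D(real) -> 1 and D(fake) -> 0 ---
    d_real = sigmoid(w * x_real + c)
    d_fake = sigmoid(w * x_fake + c)
    g_logit_real = d_real - 1.0        # d(BCE)/d(logit) for real labels
    g_logit_fake = d_fake              # d(BCE)/d(logit) for fake labels
    w -= lr * np.mean(g_logit_real * x_real + g_logit_fake * x_fake)
    c -= lr * np.mean(g_logit_real + g_logit_fake)

    # --- generator step: push D(fake) -> 1 (non-saturating loss) ---
    d_fake = sigmoid(w * x_fake + c)
    g_x = (d_fake - 1.0) * w           # gradient flows through D into x_fake
    a -= lr * np.mean(g_x * z)
    b -= lr * np.mean(g_x)

print(f"learned generator offset b = {b:.2f}")
```

With this setup the generator's offset b should drift from 0 toward the real mean of 3.0: each discriminator step re-draws the boundary between real and fake, and each generator step moves the fakes across it, which is the minimax game in miniature.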

The content is inspired by the book https://www.amazon.in/Generative-Adversarial-Networks-Industrial-Cases/dp/9389423856

