Artificial Intelligence - Part 7.3 - GENERATIVE AI - VAEs

Alessandro Ciappei

Senior Manager | Cloud Infrastructure, Edge Devices Technical Lead | Datacentre Model Transformation | Artificial Intelligence

Published: January 15, 2025

Variational Autoencoders (VAEs): A Complete Guide

Variational Autoencoders (VAEs) are a powerful class of generative models in machine learning that combine principles from neural networks and probability theory. Unlike traditional autoencoders, VAEs are designed to learn latent representations of data that enable the generation of new, similar samples. This article explores how VAEs work, their underlying principles, use cases, and examples.

What Are Variational Autoencoders (VAEs)?

A Variational Autoencoder is a type of neural network designed for unsupervised learning tasks. It is used to encode data into a latent space (a compressed representation) and then decode it back to reconstruct the original input. The defining feature of VAEs is their probabilistic nature, which enables the generation of new data samples by sampling from the learned latent space.

How Do VAEs Work?

VAEs consist of two primary components:

Encoder: Maps the input data into a latent space represented by a probability distribution.
Decoder: Reconstructs the input data from samples drawn from the latent space.

Key Steps in VAE Functionality

Input Encoding: The encoder maps an input $x$ to a latent representation $z$. Instead of a deterministic mapping, the encoder outputs the parameters of a probability distribution over $z$ (usually a Gaussian):

$$q_\phi(z \mid x) = \mathcal{N}\big(z;\ \mu,\ \sigma^2 I\big)$$

Here:

$\mu$: Mean of the latent space distribution.

$\sigma^2$: Variance of the latent space distribution.

$\phi$: Parameters of the encoder.

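Latent Space Sampling: To generate a new data point, a sample $z$ is drawn from the latent distribution. Drawing a random sample is not a differentiable operation, so backpropagation cannot pass through it directly. VAEs therefore use the reparameterization trick, which moves the randomness into an auxiliary noise variable $\epsilon$ so that gradients can flow through $\mu$ and $\sigma$:

$$z = \mu + \sigma \odot \epsilon, \qquad \epsilon \sim \mathcal{N}(0, I)$$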
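As a minimal sketch of the sampling step above, the trick amounts to drawing standard normal noise and then shifting and scaling it. This version uses NumPy rather than TensorFlow for brevity; the function name `reparameterize` and the example values are illustrative assumptions, not part of the original implementation.

```python
import numpy as np

def reparameterize(mu, log_var, rng):
    """Sample z = mu + sigma * eps with eps ~ N(0, I)."""
    sigma = np.exp(0.5 * log_var)          # log-variance -> standard deviation
    eps = rng.standard_normal(mu.shape)    # all randomness lives in eps
    return mu + sigma * eps

rng = np.random.default_rng(0)
mu = np.array([1.0, -2.0])                 # hypothetical encoder outputs
log_var = np.log(np.array([0.25, 4.0]))    # variances 0.25 and 4.0

# Draw many samples to check that they follow N(mu, sigma^2).
samples = np.stack([reparameterize(mu, log_var, rng) for _ in range(20000)])
sample_mean = samples.mean(axis=0)
sample_std = samples.std(axis=0)
```

Because `mu` and `log_var` enter only through deterministic arithmetic, an automatic differentiation framework can compute gradients with respect to both, which is the point of the trick.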

Decoding and Reconstruction: The decoder maps $z$ back to the original data space, generating a reconstructed sample $x'$:

$$x' \sim p_\theta(x \mid z)$$

Here, $\theta$ represents the parameters of the decoder.

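Loss Function: VAEs optimize a composite loss function comprising:

Reconstruction Loss: Ensures the output resembles the input. For continuous data, this is often the mean squared error (MSE).

KL Divergence: Regularizes the latent space by minimizing the divergence between the approximate posterior $q_\phi(z \mid x)$ and a prior distribution $p(z)$ (typically $\mathcal{N}(0, I)$). For a Gaussian posterior with a latent space of dimension $d$, it has the closed form:

$$D_{KL}\big(q_\phi(z \mid x)\,\|\,p(z)\big) = -\frac{1}{2} \sum_{j=1}^{d} \big(1 + \log \sigma_j^2 - \mu_j^2 - \sigma_j^2\big)$$

The total loss is:

$$\mathcal{L} = \mathcal{L}_{\text{reconstruction}} + D_{KL}\big(q_\phi(z \mid x)\,\|\,p(z)\big)$$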
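To make the two loss terms concrete, here is a small NumPy sketch under stated assumptions: a Gaussian posterior, a standard normal prior, and a summed squared error as the reconstruction term (an unnormalized variant of MSE). The function names `kl_divergence` and `vae_loss` are hypothetical helpers chosen for illustration.

```python
import numpy as np

def kl_divergence(mu, log_var):
    """Closed-form KL(N(mu, sigma^2 I) || N(0, I)), one value per sample."""
    return -0.5 * np.sum(1.0 + log_var - mu**2 - np.exp(log_var), axis=-1)

def vae_loss(x, x_recon, mu, log_var):
    """Summed squared error per sample plus KL, averaged over the batch."""
    recon = np.sum((x - x_recon) ** 2, axis=-1)
    return float(np.mean(recon + kl_divergence(mu, log_var)))

# Posterior equal to the prior: the KL term vanishes.
kl_zero = kl_divergence(np.zeros((1, 2)), np.zeros((1, 2)))

# Shifting the mean by 1 in each of 2 dimensions adds 0.5 per dimension.
kl_shifted = kl_divergence(np.ones((1, 2)), np.zeros((1, 2)))

x = np.array([[0.0, 1.0]])
x_recon = np.array([[0.0, 0.5]])
loss = vae_loss(x, x_recon, np.ones((1, 2)), np.zeros((1, 2)))
```

The KL term grows as the encoder's distribution drifts away from the prior, which is what pulls the latent codes toward a region that can later be sampled from.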

Implementing VAEs

Below is a simple implementation of a VAE using Python and TensorFlow/Keras:

Step 1: Import Libraries

import tensorflow as tf
from tensorflow.keras import layers, models
import numpy as np
import matplotlib.pyplot as plt

Step 2: Define the Encoder

latent_dim = 2  # Dimensionality of the latent space

def build_encoder(input_shape):
    inputs = layers.Input(shape=input_shape)
    x = layers.Flatten()(inputs)
    x = layers.Dense(128, activation='relu')(x)
    x = layers.Dense(64, activation='relu')(x)
    z_mean = layers.Dense(latent_dim, name='z_mean')(x)
    z_log_var = layers.Dense(latent_dim, name='z_log_var')(x)
    return models.Model(inputs, [z_mean, z_log_var], name='encoder')

Step 3: Define the Sampling Layer

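class Sampling(layers.Layer):
    """Draws z = z_mean + std * epsilon (the reparameterization trick)."""

    def call(self, inputs):
        z_mean, z_log_var = inputs
        # epsilon ~ N(0, I), with the same shape as z_mean
        epsilon = tf.random.normal(shape=tf.shape(z_mean))
        # exp(0.5 * log_var) converts the log-variance to a standard deviation
        return z_mean + tf.exp(0.5 * z_log_var) * epsilon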

Step 4: Define the Decoder

def build_decoder(output_shape):
    latent_inputs = layers.Input(shape=(latent_dim,))
    x = layers.Dense(64, activation='relu')(latent_inputs)
    x = layers.Dense(128, activation='relu')(x)
    x = layers.Dense(np.prod(output_shape), activation='sigmoid')(x)
    outputs = layers.Reshape(output_shape)(x)
    return models.Model(latent_inputs, outputs, name='decoder')

Step 5: Combine into a VAE Model

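def build_vae(input_shape, output_shape):
    encoder = build_encoder(input_shape)
    decoder = build_decoder(output_shape)
    z_mean, z_log_var = encoder.output
    z = Sampling()([z_mean, z_log_var])
    outputs = decoder(z)
    vae = models.Model(encoder.input, outputs, name='vae')

    # Reconstruction loss: binary cross-entropy averaged over every pixel in
    # the batch, then rescaled by the pixel count to a per-image sum.
    reconstruction_loss = tf.keras.losses.binary_crossentropy(
        tf.keras.backend.flatten(encoder.input),
        tf.keras.backend.flatten(outputs)
    )
    reconstruction_loss *= np.prod(input_shape)
    # KL divergence between N(z_mean, exp(z_log_var)) and N(0, I), per sample.
    kl_loss = -0.5 * tf.reduce_sum(1 + z_log_var - tf.square(z_mean) - tf.exp(z_log_var), axis=-1)
    # Assumes the Keras 2 API (TensorFlow 2.15 or earlier); Keras 3 no longer
    # supports add_loss on symbolic tensors outside of call().
    vae.add_loss(tf.reduce_mean(reconstruction_loss + kl_loss))
    vae.compile(optimizer='adam')
    return vae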

Step 6: Train the VAE

(input_train, _), (_, _) = tf.keras.datasets.mnist.load_data()
input_train = input_train.astype('float32') / 255.0
input_train = np.expand_dims(input_train, axis=-1)

vae = build_vae(input_shape=(28, 28, 1), output_shape=(28, 28, 1))
vae.fit(input_train, input_train, epochs=10, batch_size=128)

Applications of VAEs

Image Generation

Generating new faces, handwritten digits, or artistic designs.

Example: Using a VAE trained on the MNIST dataset to create new handwritten digits.
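Generation itself is a two-step procedure: draw latent codes from the prior and pass them through the decoder. The sketch below illustrates this in NumPy. The `generate` helper and the `toy_decoder` are assumptions made for illustration; the toy decoder is an untrained random map that exists only so the example runs, and a trained decoder would take its place.

```python
import numpy as np

def generate(decoder_fn, num_samples, latent_dim, rng):
    """Draw latent codes from the prior N(0, I) and decode them."""
    z = rng.standard_normal((num_samples, latent_dim))
    return decoder_fn(z)

# Placeholder decoder (an untrained random linear map plus a sigmoid),
# used here only so the sketch runs; a trained decoder would replace it.
rng = np.random.default_rng(0)
weights = rng.standard_normal((2, 28 * 28))

def toy_decoder(z):
    logits = z @ weights
    pixels = 1.0 / (1.0 + np.exp(-logits))   # squash into [0, 1]
    return pixels.reshape(-1, 28, 28)

images = generate(toy_decoder, num_samples=4, latent_dim=2, rng=rng)
```

With a decoder trained on MNIST, each entry of `images` would be a 28 by 28 grayscale digit rather than noise.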

Data Denoising

Reconstructing clean data from noisy inputs, such as removing noise from images or audio signals.

Anomaly Detection

Identifying outliers by comparing reconstruction errors.

Example: Detecting fraudulent transactions or defective manufacturing components.
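One common way to turn reconstruction errors into an anomaly detector is to threshold them at a high percentile of the errors seen on normal data. The sketch below assumes that approach; the helper names, the 99th percentile, and the synthetic data are illustrative choices, and the "reconstructions" are simulated rather than produced by a trained VAE.

```python
import numpy as np

def reconstruction_errors(x, x_recon):
    """Mean squared error per sample, averaged over all feature axes."""
    diff = (x - x_recon).reshape(len(x), -1)
    return np.mean(diff ** 2, axis=1)

def flag_anomalies(errors, reference_errors, percentile=99.0):
    """Flag samples whose error exceeds a percentile of the reference errors."""
    threshold = np.percentile(reference_errors, percentile)
    return errors > threshold, threshold

rng = np.random.default_rng(0)
# Hypothetical data: normal samples are reconstructed closely.
x_normal = rng.random((200, 8))
recon_normal = x_normal + 0.01 * rng.standard_normal((200, 8))
x_test = rng.random((3, 8))
recon_test = x_test.copy()
recon_test[:2] += 0.01 * rng.standard_normal((2, 8))
recon_test[2] += 0.5   # the third sample is reconstructed poorly

ref_err = reconstruction_errors(x_normal, recon_normal)
test_err = reconstruction_errors(x_test, recon_test)
flags, threshold = flag_anomalies(test_err, ref_err)
```

The percentile controls the trade-off between false alarms and missed anomalies, so in practice it would be tuned on held-out data.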

Latent Space Interpolation

Exploring the latent space to create smooth transitions between data points.

Example: Generating morphing sequences of images, such as transitioning between two faces.
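A simple way to build such a transition is linear interpolation between two latent codes, decoding each intermediate point. The sketch below covers only the interpolation step; `interpolate_latents` and the two latent vectors are hypothetical, and linear interpolation is one choice among several.

```python
import numpy as np

def interpolate_latents(z_start, z_end, num_steps):
    """Return num_steps latent vectors evenly spaced from z_start to z_end."""
    t = np.linspace(0.0, 1.0, num_steps)[:, None]   # shape (num_steps, 1)
    return (1.0 - t) * z_start + t * z_end

z_a = np.array([-1.0, 0.5])   # hypothetical latent code of the first image
z_b = np.array([2.0, -1.5])   # hypothetical latent code of the second image
path = interpolate_latents(z_a, z_b, num_steps=5)
```

Passing each row of `path` through a trained decoder would yield the frames of the morphing sequence.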

Text Generation

Using VAEs with recurrent networks for generating coherent text sentences.

Healthcare

Simulating medical images for training models or augmenting datasets in fields like radiology.

Advantages of VAEs

Generative Power:

VAEs can generate new data samples while maintaining diversity and realism.

Latent Space Representation:

The structured latent space makes it easier to explore and manipulate data representations.

Flexibility:

Can be adapted for various data types, including images, text, and audio.

Challenges of VAEs

Blurred Outputs:

Generated images are often blurrier than those produced by other generative models such as GANs, in part because pixel-wise reconstruction losses tend to average over plausible outputs.

Computational Complexity:

Training VAEs, especially with large latent spaces, can be resource-intensive.

KL Divergence Trade-off:

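Balancing reconstruction accuracy and latent space regularization can be challenging. Weighting the KL term too heavily pushes the latent codes toward the prior and degrades reconstructions, while weighting it too lightly leaves the latent space poorly structured for sampling.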
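Two widely used ways to manage this trade-off are an explicit weight on the KL term (as in β-VAE-style objectives) and a warm-up schedule that increases that weight gradually during training. Neither appears in the implementation above; the sketch below is an assumption-laden illustration, with `kl_weight` and `weighted_loss` as hypothetical helpers and a linear schedule as one possible choice.

```python
def kl_weight(epoch, warmup_epochs=10, max_weight=1.0):
    """Linearly increase the KL weight from 0 to max_weight over the warm-up."""
    if warmup_epochs <= 0:
        return max_weight
    return max_weight * min(1.0, epoch / warmup_epochs)

def weighted_loss(reconstruction_loss, kl_loss, weight):
    """Total loss with an explicit weight on the KL term."""
    return reconstruction_loss + weight * kl_loss

# Weight used at each of the first six epochs with a 4-epoch warm-up.
schedule = [kl_weight(e, warmup_epochs=4) for e in range(6)]
example_total = weighted_loss(10.0, 4.0, kl_weight(2, warmup_epochs=4))
```

Starting with a small weight lets the model learn to reconstruct first, and the regularization is then phased in as the weight approaches its final value.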

Conclusion

Variational Autoencoders represent a significant step forward in generative modeling, offering a blend of probabilistic inference and deep learning. Their ability to model complex data distributions while enabling generative capabilities makes them invaluable across industries. Whether you're generating images, detecting anomalies, or exploring latent spaces, VAEs provide a versatile and powerful tool in the AI toolbox.
