Anomaly Detection with VAE
Anomaly detection is a machine learning technique used to identify patterns that are considered unusual or out of the ordinary. Think of it as the machine learning equivalent of that one friend in your group who always notices when something is off.
In machine learning, we train an algorithm to recognize what normal behavior looks like, and when it detects something that doesn't fit the norm, it raises a flag. It's like having a bouncer at a party who kicks out anyone who's behaving in a suspicious or unusual way.
And just like that bouncer, sometimes the machine learning algorithm can be a bit overzealous and kick out someone who was just having a bit too much fun. So, it's important to tweak the settings to make sure it's not flagging too many false positives.
But hey, it's better to have an overzealous bouncer than no bouncer at all, right?
There are many machine learning models that can play the bouncer we are looking for, but autoencoders, and specifically variational autoencoders (VAEs), stand out because they automatically learn the general structure of the training data and isolate only its discriminative features, i.e., the latent vector. The latent vector acts as an information bottleneck that forces the model to be very selective about what to encode.
During training, an encoder produces the latent vector, and a decoder reconstructs the original data from that latent vector as faithfully as possible. Since the model only learns to reconstruct data that resembles its training set, samples it reconstructs poorly are likely outliers; by measuring the reconstruction error, we can tell which samples those are.
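As a minimal sketch of that idea (the helper name anomaly_scores and the choice of mean squared error are illustrative, not from a specific library), scoring boils down to comparing each sample with its reconstruction:

import numpy as np

def anomaly_scores(model, x, threshold):
    # reconstruct the inputs with any trained autoencoder-style
    # Keras model whose output has the same shape as its input
    x_hat = model.predict(x)
    # per-sample mean squared reconstruction error
    errors = np.mean(np.square(x - x_hat), axis=-1)
    # samples whose error exceeds the threshold are flagged
    return errors, errors > threshold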
A plain autoencoder (AE) learns to generate a latent vector that the decoder can reproduce. A VAE, however, learns to generate two vectors that represent the parameters (mean and log-variance) of a distribution from which the latent vector is sampled. In other words, the VAE's learning task is to learn a function that outputs the parameters of a distribution from which a latent vector the decoder can reproduce can be sampled.
Below is example code for setting up a VAE model with the Keras functional API:
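One piece the snippets below rely on but never define is the sampling function used by the encoder's Lambda layer. A standard reparameterization-trick implementation, along the lines of the well-known Keras VAE example (an assumption here, since the original snippet omits it), looks like this:

from tensorflow.keras import backend as K

def sampling(args):
    # reparameterization trick: z = mean + sigma * epsilon,
    # with epsilon drawn from a standard normal distribution
    latent_mean, latent_log_var = args
    batch = K.shape(latent_mean)[0]
    dim = K.int_shape(latent_mean)[1]
    epsilon = K.random_normal(shape=(batch, dim))
    return latent_mean + K.exp(0.5 * latent_log_var) * epsilon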
Encoder -
from tensorflow.keras.layers import Input, Dense, Lambda
from tensorflow.keras.models import Model

# hyperparameters (example values)
original_dim = 784
intermediate_dim = 64
latent_dim = 2

# encoder model
inputs = Input(shape=(original_dim,), name='encoder_input')
hidden_encode = Dense(intermediate_dim, activation='relu')(inputs)
latent_mean = Dense(latent_dim, name='z_mean')(hidden_encode)
latent_log_var = Dense(latent_dim, name='z_log_var')(hidden_encode)

# sampling
latent = Lambda(sampling, output_shape=(latent_dim,), name='z')([latent_mean, latent_log_var])

# instantiate encoder model
encoder = Model(inputs, [latent_mean, latent_log_var, latent], name='encoder')
encoder.summary()
Decoder -
# decoder model
latent_inputs = Input(shape=(latent_dim,), name='z_sampling')
hidden_decode = Dense(intermediate_dim, activation='relu')(latent_inputs)
outputs = Dense(original_dim, activation='sigmoid')(hidden_decode)

# instantiate decoder model
decoder = Model(latent_inputs, outputs, name='decoder')
decoder.summary()
VAE Loss -
To achieve such a latent vector, VAEs use a loss function with two components:
Reconstruction loss component - penalizes the difference between the input and its reconstruction, forcing the encoder to generate latent features from which the decoder can faithfully rebuild the input.
KL loss component - forces the distribution generated by the encoder to stay close to the prior over the latent space, typically a standard normal distribution.
This results in a heavily regularized encoder, which in turn produces a more continuous and smoother latent space.
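Written out, the objective the code below implements is the standard VAE loss (the negative evidence lower bound), where the closed form of the KL term holds for a Gaussian encoder and a standard normal prior:

$$
\mathcal{L} = \mathbb{E}_{q(z \mid x)}\big[-\log p(x \mid z)\big] + D_{\mathrm{KL}}\big(q(z \mid x)\,\|\,\mathcal{N}(0, I)\big)
$$

$$
D_{\mathrm{KL}} = -\tfrac{1}{2} \sum_{j=1}^{d} \big(1 + \log\sigma_j^2 - \mu_j^2 - \sigma_j^2\big)
$$

Here $\mu$ and $\log\sigma^2$ are the latent_mean and latent_log_var vectors produced by the encoder.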
# VAE loss
from tensorflow.keras.losses import binary_crossentropy
from tensorflow.keras import backend as K

# wire encoder and decoder end to end; index 2 of the encoder's
# outputs is the sampled latent vector z
outputs = decoder(encoder(inputs)[2])

reconstruction_loss = binary_crossentropy(inputs, outputs)
reconstruction_loss *= original_dim
kl_loss = 1 + latent_log_var - K.square(latent_mean) - K.exp(latent_log_var)
kl_loss = K.sum(kl_loss, axis=-1)
kl_loss *= -0.5  # closed-form KL divergence carries a factor of -1/2
vae_loss = K.mean(reconstruction_loss + kl_loss)
VAE Model -
# instantiate VAE model from the end-to-end tensors wired up above
vae = Model(inputs, outputs, name='vae')
vae.add_loss(vae_loss)
vae.compile(optimizer='adam')
vae.summary()
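Finally, putting the bouncer to work: a usage sketch (the dataset names x_train/x_test and the 95th-percentile cutoff are illustrative assumptions, not from the original post) that trains on mostly-normal data and flags poorly reconstructed samples:

import numpy as np

# train on data that is assumed to be mostly normal;
# no targets are needed since the loss was attached via add_loss
vae.fit(x_train, epochs=50, batch_size=128)

# pick a threshold from the reconstruction errors on the training set
train_errors = np.mean(np.square(x_train - vae.predict(x_train)), axis=-1)
threshold = np.percentile(train_errors, 95)

# score new samples: anything reconstructed badly is flagged
test_errors = np.mean(np.square(x_test - vae.predict(x_test)), axis=-1)
anomalies = x_test[test_errors > threshold]

The percentile cutoff is exactly the "bouncer setting" mentioned earlier: raise it and fewer guests get thrown out, lower it and the bouncer gets more aggressive.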
Happy reading!