A Gentle Introduction to the Siamese Neural Network Architecture
What are Siamese Neural Networks?
Siamese Neural Networks, or SNNs, are among the most popular neural network architectures that use the one-shot learning strategy: they can predict multiple classes from very little data. This ability has made siamese neural networks popular in real-world applications such as security, face recognition, signature verification, and more.
So how does the neural network architecture of Siamese networks make this possible?
Siamese Neural Networks: An Overview
A siamese network consists of two or more identical subnetworks: neural networks with the same architecture, configuration, and weights. During training, parameter updates are mirrored across the subnetworks, so their weights always stay identical.
The purpose of having identical subnetworks is to train the model on a similarity function that measures how different the feature vector of one image is from that of the other. Because the model learns to compare images rather than to classify them outright, it can be trained without much data.
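To make the weight sharing concrete, below is a minimal PyTorch sketch of a siamese network. The encoder layout and embedding size are illustrative assumptions, not a prescribed architecture; the essential point is that both inputs pass through the same module, so the twin subnetworks share weights by construction.

```python
# Minimal siamese network sketch in PyTorch. Layer sizes and the
# 128-dimensional embedding are illustrative assumptions.
import torch
import torch.nn as nn

class SiameseNetwork(nn.Module):
    def __init__(self, embedding_dim: int = 128):
        super().__init__()
        # A single shared encoder: both inputs pass through this SAME
        # module, so the "twin" subnetworks are identical by construction.
        self.encoder = nn.Sequential(
            nn.Conv2d(1, 32, kernel_size=3, padding=1), nn.ReLU(),
            nn.MaxPool2d(2),
            nn.Conv2d(32, 64, kernel_size=3, padding=1), nn.ReLU(),
            nn.MaxPool2d(2),
            nn.Flatten(),
            nn.LazyLinear(embedding_dim),
        )

    def forward(self, x1: torch.Tensor, x2: torch.Tensor):
        # One backward pass updates the shared parameters once,
        # keeping both subnetworks in sync automatically.
        return self.encoder(x1), self.encoder(x2)

model = SiameseNetwork()
a = torch.randn(8, 1, 28, 28)  # a batch of 8 grayscale 28x28 images
b = torch.randn(8, 1, 28, 28)
emb_a, emb_b = model(a, b)
distance = torch.pairwise_distance(emb_a, emb_b)  # Euclidean distance per pair
```

The Euclidean distance between the two embeddings is exactly what the loss functions discussed below operate on.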
Why Use Siamese Neural Networks?
With siamese neural networks, the common class imbalance problem can be addressed, since the network does not need many samples of a given class in the training data.
Moreover, a new class can be added after the siamese neural network has been trained and deployed, without retraining the entire network from scratch. Since the model learns how similar or dissimilar image pairs are, samples from a new class can simply be added to the trained siamese network and training resumed: the network compares the new images against the existing classes and updates its weights and fully connected layer accordingly.
This behaviour is unique to architectures that use one-shot learning; other categories of neural networks would have to be retrained from scratch on a large, class-balanced dataset to reach comparable performance. The sketch below illustrates the idea.
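As a rough illustration (not the article's own code), the following sketch registers a brand-new class from a single labeled example and classifies by nearest reference embedding. `model` is the siamese network sketched earlier; the helper names are hypothetical.

```python
# Hypothetical sketch: adding a new class to a trained siamese encoder
# without retraining, by storing one reference embedding per class.
import torch

references: dict[str, torch.Tensor] = {}  # class name -> reference embedding

@torch.no_grad()
def add_class(name: str, example: torch.Tensor) -> None:
    # A single labeled example is enough to register a new class.
    references[name] = model.encoder(example.unsqueeze(0)).squeeze(0)

@torch.no_grad()
def classify(image: torch.Tensor) -> str:
    emb = model.encoder(image.unsqueeze(0)).squeeze(0)
    # The class whose reference embedding is nearest wins.
    return min(references, key=lambda n: torch.dist(emb, references[n]).item())
```

In practice one would also resume training on pairs involving the new class, as described above, but even this embed-and-compare step works because the network was trained to measure similarity, not to predict a fixed set of classes.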
But how does a siamese network learn from such a small set of samples? Let's look at the architecture and how siamese neural networks are trained.
Siamese Neural Network Architecture Explained
As described above, the architecture below shows the two identical subnetworks that make up a siamese neural network. The feature vectors from both subnetworks are compared using a loss function L. There are two strategies for training the siamese network, each using a different loss function, but both must satisfy two requirements.
First, the feature vectors of similar and dissimilar pairs should be descriptive, informative, and distinct enough from each other that the separation between classes can be learned effectively.
And second, the feature vectors of similar image pairs should be close enough, and those of dissimilar pairs far enough apart, that the model can quickly learn semantic similarity.
To make sure the model learns such feature vectors quickly, the loss function should reward learning both similarity and dissimilarity strongly enough. This is where the siamese strategy helps: by comparing one image with all the other images, the model learns what "similar" means and how to recognize dissimilar pairs.
Cross-entropy loss cannot provide this kind of signal, as it works on a per-class prediction basis; mean squared error likewise does not capture the information our goal requires. The most commonly used loss functions are the contrastive loss and the triplet loss. Let's look at each of them in detail.
Contrastive Loss Function
The contrastive loss is a distance-based loss function that updates the weights so that two similar feature vectors end up with a minimal Euclidean distance, while the distance between two dissimilar vectors is maximized.
In the equation shown below, y indicates whether the vectors are dissimilar, and Dw is the Euclidean distance between them. When the vectors are dissimilar (y = 1), minimizing the loss means minimizing the second term, which requires Dw to grow: more distance between dissimilar vectors is encouraged. We want these vectors to be at least m units apart, and once they are, the max(0, m − Dw) term defaults to 0 so no further penalty is computed.
Similarly, if the vectors are similar (y=0), the loss function must minimize Dw.
L(y, Dw) = (1 − y) · ½ · Dw² + y · ½ · [max(0, m − Dw)]²

Contrastive Loss Function in Siamese Neural Networks
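Translated directly into code, the loss might look like the following PyTorch sketch, keeping the article's convention that y = 1 marks a dissimilar pair and y = 0 a similar one; the default margin value is an assumption.

```python
# Contrastive loss sketch in PyTorch. y is a float tensor of 0s (similar
# pairs) and 1s (dissimilar pairs); the margin m of 1.0 is only an
# illustrative default.
import torch

def contrastive_loss(emb1: torch.Tensor, emb2: torch.Tensor,
                     y: torch.Tensor, margin: float = 1.0) -> torch.Tensor:
    d = torch.pairwise_distance(emb1, emb2)                       # D_w
    similar = (1 - y) * 0.5 * d.pow(2)                            # pulls similar pairs together
    dissimilar = y * 0.5 * torch.clamp(margin - d, min=0).pow(2)  # pushes dissimilar pairs apart, up to m
    return (similar + dissimilar).mean()
```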
However, because this function only pushes vectors either close together or far apart in a binary fashion, it cannot tell us how similar two vectors are. Another loss function lets us learn both a similarity score and dissimilarity in a better way.
Triplet Loss Function in Siamese Network
By using the triplet loss, we can tell how similar an image is to others, within or outside its class. The siamese network learns a similarity ranking from the scores computed this way.
For this, the loss is computed by comparing a given image (called the anchor) with a positive image (which is similar to the anchor) and a negative image (which is dissimilar to it). By computing the distance for each of these pairs, the model learns what similarity looks like and how far the given image must stay from the other classes.
So, in the equation below, f(A) is the embedding of the anchor image, and f(P) and f(N) are the embeddings of the positive and negative images, respectively. For the loss to be minimized, the distance term with f(N) must be maximized and the one with f(P) minimized, which aligns with the strategy of pulling similar pairs closer and pushing dissimilar pairs further apart. α is a margin hyperparameter that forces the negative to be at least α further from the anchor than the positive.
L = max(||f(A) − f(P)||² − ||f(A) − f(N)||² + α, 0)

Triplet Loss Function in Siamese Networks
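A short PyTorch sketch of this loss follows, with f_a, f_p, and f_n standing in for f(A), f(P), and f(N); the default margin of 0.2 is an assumption. PyTorch also ships a built-in torch.nn.TripletMarginLoss, which uses plain rather than squared distances by default.

```python
# Triplet loss sketch in PyTorch, using squared Euclidean distances as
# in the equation above. alpha is the margin hyperparameter.
import torch

def triplet_loss(f_a: torch.Tensor, f_p: torch.Tensor,
                 f_n: torch.Tensor, alpha: float = 0.2) -> torch.Tensor:
    pos_dist = (f_a - f_p).pow(2).sum(dim=1)  # ||f(A) - f(P)||^2
    neg_dist = (f_a - f_n).pow(2).sum(dim=1)  # ||f(A) - f(N)||^2
    # Loss is zero once the negative is at least alpha farther than the positive.
    return torch.clamp(pos_dist - neg_dist + alpha, min=0).mean()
```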
Read here for further explanation on the Triplet Loss Function in Siamese Networks
Pros and Cons of Siamese Neural Networks
As we saw when getting introduced to siamese neural networks, they offer many benefits over conventional CNNs in certain specific tasks.
Advantages of Siamese Network
Semantic Similarity: Firstly, siamese networks do not learn from training errors or mispredictions but from semantic similarity. This encourages the model to learn increasingly better embeddings that represent the images in the support set and bring related concepts close together in the feature space. By learning such a feature space, much as textual models learn word embeddings, the model learns concepts and attempts to understand why certain images are more similar than others, instead of just extracting static features using convolutions.
Class Imbalance: The biggest benefit directly applicable to the real world is the capability of delivering benchmark performance on very little data. With the per-class data requirement reduced, the class imbalance problem also vanishes.
Siamese Neural Network for Face Recognition
Face recognition is just another image recognition or classification task. One-shot learning is particularly applicable here because, in practical cases, it is nearly impossible to gather sufficient samples of one person's face (one label). Face recognition is often used as an attendance system or a security measure to restrict access to buildings and offices to employees only.
In this setting, not only is it impractical to collect many images of one person to reach a decent success rate, but granting access to a new employee would mean retraining the entire CNN from scratch and risking the existing performance. A siamese network sidesteps both problems, as the sketch below suggests.
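Here is a hedged sketch of what such an access-control flow could look like with a trained siamese encoder: enrolling an employee is a single forward pass, and verification reduces to a distance threshold. `model` refers to the encoder sketched earlier; the threshold value and helper names are assumptions, and the threshold would be tuned on held-out genuine/impostor pairs.

```python
# Hypothetical one-shot face verification flow with a trained siamese
# encoder. Enrolment stores one embedding per employee; verification
# compares the probe embedding against it.
import torch

THRESHOLD = 0.8  # assumed value; tune on validation pairs
enrolled: dict[str, torch.Tensor] = {}  # employee id -> face embedding

@torch.no_grad()
def enroll(employee_id: str, photo: torch.Tensor) -> None:
    # Adding a new employee is one forward pass; no retraining needed.
    enrolled[employee_id] = model.encoder(photo.unsqueeze(0)).squeeze(0)

@torch.no_grad()
def verify(employee_id: str, probe: torch.Tensor) -> bool:
    emb = model.encoder(probe.unsqueeze(0)).squeeze(0)
    return torch.dist(emb, enrolled[employee_id]).item() < THRESHOLD
```

The same embed-and-threshold pattern carries over to the signature verification use case below.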
Siamese Neural Network for Image Classification
Signature verification is a common use of image classification in the context of one-shot learning. A signature verification system checks the authenticity of a given signature against one existing in a dataset; based on the signatures' similarity, the sample can be classified as genuine or forged. With this task widely prevalent in banks and financial institutions worldwide, siamese networks quickly became the go-to solution for an otherwise manually laborious job.
Are Siamese Neural Networks Supervised?
Yes, siamese networks are trained in a supervised fashion: they need labeled information to know whether the images they compare are similar. However, siamese networks can also be tuned to learn in a self-supervised (SSL: self-supervised learning) manner.