Transfer Learning: how to apply pre-trained neural networks to your own problems
Juan David Tuta Botero
Data Science | Machine Learning | Artificial Intelligence
Abstract
This article provides a detailed overview of how to develop a neural network able to recognize and categorize the classes in the CIFAR-10 dataset, which consists of 60,000 32x32 color images in 10 classes, with 6,000 images per class; there are 50,000 training images and 10,000 test images. We use InceptionV3, a pre-trained image-classification model available in keras.applications, add two dense layers on top, and compile with the Adam optimizer. Training with a batch size of 100 for 2 epochs yields a final accuracy of 89.7%.
Introduction
Object detection is a computer vision technique that allows us to identify and locate objects in an image or video. With this kind of identification and localization, object detection can be used to count objects in a scene and determine and track their precise locations, all while accurately labeling them.
Imagine, for example, an image that contains two cats and a person. Object detection allows us to at once classify the types of things found while also locating instances of them within the image.
This technology has countless applications, and as time passes we see more of them in our daily lives. But there is a wall that ordinary people and businesses run into: as models grow more complex, the computing power and time needed to train them increase. To get past this problem we can rely on knowledge already generated by others, through a process we call transfer learning. The first thing to remember is that transfer learning is not a new concept specific to deep learning. There is a stark difference between the traditional approach of building and training machine learning models from scratch and a methodology that follows transfer learning principles.
By applying transfer learning we can take pre-trained models from universities or technology companies, developed by highly skilled people and trained for weeks or even months to accomplish specific tasks such as self-driving, medical diagnosis, or financial modeling. We can import these models into our own and save time while still getting very good predictions.
For a better understanding of the problem, I am going to take a pre-trained model and modify it to recognize the 10 object classes in the CIFAR-10 dataset from the University of Toronto, which consists of 60,000 32x32 color images in 10 classes, with 6,000 images per class; there are 50,000 training images and 10,000 test images.
How to solve the problem
Now that we know we have an image-recognition problem, the next step is to find a neural network developed to solve it. Fortunately, the keras.applications module offers a list of deep learning models made available alongside pre-trained weights. These models can be used for prediction, feature extraction, and fine-tuning. Among them, we are going to use the InceptionV3 model, which suits our task perfectly. This model was trained to recognize 1,000 different object classes, and since we only need 10, it works perfectly well. So let's start with the code. The first thing to do is import our machine learning framework and the model we are going to use.
import tensorflow.keras as K

# Load InceptionV3 with its ImageNet weights, dropping the original
# 1000-class classification head (include_top=False)
inception = K.applications.InceptionV3(include_top=False,
                                       weights="imagenet",
                                       input_shape=(128, 128, 3))
Now that we have successfully imported the model, we must "freeze" it. Freezing means that during backpropagation the model's optimizable parameters stay fixed and are not updated. Because we exported the entire model with weights="imagenet", its parameters are already optimized; if we don't freeze them, we would have to optimize them all over again, which can take a lot of time.
inception.trainable = False  # freeze the pre-trained layers
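To double-check that the freeze worked, you can verify that the base model no longer exposes any trainable weights (a quick sanity check of my own, not part of the original article's code):

print(len(inception.trainable_weights))      # 0 once the model is frozen
print(len(inception.non_trainable_weights))  # all of InceptionV3's weights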
Now that we have an exported and frozen model, we must add some extra layers to adapt our modified deep neural network to the CIFAR-10 problem. As we saw in previous articles, this network is convolutional, so before connecting it to regular dense layers its output must be flattened. We then add two layers: one with 128 nodes and the "relu" activation function, and an output layer with only 10 nodes, one per class in the dataset, using the "softmax" activation. Finally, in the compile step we use the Adam optimizer to speed up backpropagation. The code should look like this:
# Stack the frozen base, a flatten step, and two new dense layers
model = K.Sequential()
model.add(inception)
model.add(K.layers.Flatten())
model.add(K.layers.Dense(units=128,
                         activation='relu',
                         kernel_initializer='he_normal'))
model.add(K.layers.Dense(units=10,
                         activation='softmax',
                         kernel_initializer='he_normal'))

# Compile with the Adam optimizer
model.compile(optimizer='adam',
              loss='categorical_crossentropy',
              metrics=['accuracy'])
Results
Running the training for 2 epochs with a batch size of 100, we obtained a pretty good result: 89.7% accuracy.
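The article does not show the training call itself, so here is a minimal sketch of how it might look. The batch size of 100 and the 2 epochs match the article; the tf.data pipeline that upscales the 32x32 CIFAR images to the 128x128 input shape declared above is my own assumption, not the original code:

import tensorflow as tf

# Load CIFAR-10: 50,000 training and 10,000 test images of size 32x32x3
(x_train, y_train), (x_test, y_test) = K.datasets.cifar10.load_data()

# One-hot encode the 10 class labels for categorical_crossentropy
y_train = K.utils.to_categorical(y_train, 10)
y_test = K.utils.to_categorical(y_test, 10)

def preprocess(image, label):
    # Upscale the 32x32 images to the 128x128 input the model was built for
    image = tf.image.resize(image, (128, 128))
    # Scale pixels the same way InceptionV3 was trained on (to [-1, 1])
    image = K.applications.inception_v3.preprocess_input(image)
    return image, label

# Batch size of 100, as in the article
train_ds = (tf.data.Dataset.from_tensor_slices((x_train, y_train))
            .map(preprocess).batch(100))
test_ds = (tf.data.Dataset.from_tensor_slices((x_test, y_test))
           .map(preprocess).batch(100))

model.fit(train_ds, epochs=2, validation_data=test_ds)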
Discussion
As we can see, the accuracy is pretty good at 89.7%. Of course, if we want, we can keep training for longer or add more layers to the model, but for the purposes of this article this is good enough. In this way we can create a very powerful network able to recognize images with little coding and training time, so this approach is a really good starting point whenever you face a complicated problem and want a fast result to compare against, or are exploring a totally new framework. If you want to check the code yourself, you can find it here.
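For example, if you want to squeeze out a bit more accuracy, a common follow-up is to unfreeze the base and fine-tune the whole network at a very low learning rate. This sketch illustrates that idea (it reuses the train_ds/test_ds datasets from above and is my own addition, not part of the original code):

# Unfreeze the InceptionV3 base so its weights can also be fine-tuned
inception.trainable = True

# Recompile with a very low learning rate to avoid destroying
# the pre-trained weights, then train a little longer
model.compile(optimizer=K.optimizers.Adam(learning_rate=1e-5),
              loss='categorical_crossentropy',
              metrics=['accuracy'])
model.fit(train_ds, epochs=2, validation_data=test_ds)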
Bibliography
https://keras.io/api/applications/inceptionv3/
https://github.com/Juand0145/holbertonschool-machine_learning/blob/main/supervised_learning/0x09-transfer_learning/Manual_review.ipynb
https://www.cs.toronto.edu/~kriz/cifar.html
https://towardsdatascience.com/a-comprehensive-hands-on-guide-to-transfer-learning-with-real-world-applications-in-deep-learning-212bf3b2f27a