Object Detection Using EfficientNet in TensorFlow 2

In this tutorial, I'll show the necessary steps to create an object detection algorithm using Google Research's EfficientNet in TensorFlow 2. The paper, EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks, is available here. The detection model we will fine-tune is EfficientDet-D0, which uses EfficientNet as its backbone.

First, let's create the Anaconda environment:

conda create -n tf2 python=3.8 anaconda
source activate tf2        

If source activate does not work, add the full path, like: /home/user/anaconda3/bin/activate tf2

Install Tensorflow:

pip install tensorflow-gpu        

Now, go to your tf2 environment inside the anaconda3 envs folder and find where tensorflow is installed inside site-packages. It will be something like this:

cd /home/user/.conda/envs/tf2/lib/python3.8/site-packages/tensorflow        

Now, inside the tensorflow folder, clone the tensorflow/models repository:

git clone https://github.com/tensorflow/models        

Good, now, cd into the object detection folder:

cd models/research/object_detection        

Now you must configure the Object Detection API and compile the protos:

cd models/research
protoc object_detection/protos/*.proto --python_out=.
cp object_detection/packages/tf2/setup.py .
python -m pip install .        
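
To confirm the installation worked, you can optionally run the API's own model builder tests (still from the models/research folder):

python object_detection/builders/model_builder_tf2_test.py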

After that, create a folder inside the object_detection folder called "demo"

mkdir demo
cd demo        

Now, inside the demo folder you must create 5 other folders (a command to create them all at once is sketched after the list):

  • Annotations: for the labelmap.pbtxt, train.record, and test.record files that will be used by the TensorFlow model during training.
  • Images: with two subfolders, train and test. In each subfolder you will add the training and test images together with the corresponding .xml files containing the rectangle annotations of the objects.
  • Inference_graph: this is where you will save the exported .pb model after training, for inference.
  • Output: folder that will store the checkpoints as the model trains. Important: every time you start a training run, this folder must be empty.
  • Pre_Trained_Models: this is where you will save the pre-trained EfficientDet model and checkpoints used to initialize your training.
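
For convenience, the whole structure can be created in one go from inside the demo folder (I use lowercase images here to match the commands later in the tutorial; keep the capitalization consistent with whatever you use in those commands):

mkdir -p Annotations images/train images/test inference_graph output Pre_Trained_Models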

Now, let's do the annotation part. The images are located at /demo/images/train and /demo/images/test. You must first install labelImg from GitHub:

git clone https://github.com/tzutalin/labelImg        

In Ubuntu:

sudo apt-get install pyqt5-dev-tools
sudo pip3 install -r requirements/requirements-linux-python3.txt
make qt5py3
python3 labelImg.py        

Then press w and drag a rectangle to annotate each image in PascalVOC format, which produces a .xml file per image.

Now, inside the demo folder, add these 6 files from https://github.com/RubensZimbres/Repo-2022/tree/main/Object-Detection-Tensorflow2

Great. These are the steps:

  • You annotate the images, generating a .xml file for each one.
  • With the script xml_to_csv.py you transform the image names and .xml files into a .csv (a sketch of this step appears after the list).
  • With the .csv in hand you generate the TensorFlow .record files that will be used for training the neural net, with the file generate_tfrecord.py.
  • With the file model_main_tf2.py you train the model.
  • With the file exporter_main_v2.py you export the trained model to a .pb file.
  • With the file webcam.py you can run inference in real time.
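
For reference, here is a rough sketch of what an xml_to_csv.py script of this kind usually does; the exact script in the repository may differ, but the idea is to write one CSV row per annotated box:

# A rough sketch of xml_to_csv.py: walk the Pascal VOC .xml files and
# write one CSV row per annotated bounding box.
import glob
import xml.etree.ElementTree as ET
import pandas as pd

def xml_to_csv(path):
    rows = []
    for xml_file in glob.glob(path + "/*.xml"):
        root = ET.parse(xml_file).getroot()
        for member in root.findall("object"):
            bbox = member.find("bndbox")
            rows.append((
                root.find("filename").text,
                int(root.find("size/width").text),
                int(root.find("size/height").text),
                member.find("name").text,
                int(bbox.find("xmin").text),
                int(bbox.find("ymin").text),
                int(bbox.find("xmax").text),
                int(bbox.find("ymax").text),
            ))
    columns = ["filename", "width", "height", "class",
               "xmin", "ymin", "xmax", "ymax"]
    return pd.DataFrame(rows, columns=columns)

for split in ["train", "test"]:
    xml_to_csv(f"images/{split}").to_csv(f"images/{split}_labels.csv", index=False)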

Now it's the tricky part: you have to customize some files in order to succeed.

After applying the xml_to_csv.py file, you will get a train_labels.csv and a test_labels.csv, each with one row per annotated bounding box (filename, width, height, class, and box coordinates).

Now you must edit generate_tfrecord.py. For instance, if you have one class, change the following part of the code:

def class_text_to_int(row_label):
    if row_label == 'desod':
        return 1
    else:
        return None

For two classes:

def class_text_to_int(row_label):
    if row_label == 'desod':
        return 1
    elif row_label == 'car':
        return 2
    else:
        return None

Be sure to run these files inside the object_detection folder, as Python will need the utils package (a folder inside object_detection); otherwise you will get a "no module named utils" error. Another issue you may find is "no module named nets". In this case, run:

export PYTHONPATH=$PYTHONPATH:/home/theone/.conda/envs/tf2/lib/python3.8/site-packages/tensorflow/models/research:/home/theone/.conda/envs/tf2/lib/python3.8/site-packages/tensorflow/models/research/slim

Now let's generate the .record files:

python generate_tfrecord.py --csv_input=images/train_labels.csv --image_dir=images/train --output_path=train.record
python generate_tfrecord.py --csv_input=images/test_labels.csv --image_dir=images/test --output_path=test.record        
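
As a quick sanity check, you can count how many examples ended up in each .record file (a minimal snippet, assuming it is run from the folder where train.record and test.record were written):

# Count how many serialized examples ended up in each TFRecord file
import tensorflow as tf

for split in ["train", "test"]:
    count = sum(1 for _ in tf.data.TFRecordDataset(f"{split}.record"))
    print(f"{split}.record contains {count} examples")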

Then, let's download the EfficientDet-D0 pre-trained checkpoint (the detection model built on the EfficientNet backbone):

wget https://download.tensorflow.org/models/object_detection/tf2/20200711/efficientdet_d0_coco17_tpu-32.tar.gz

tar -xvf efficientdet_d0_coco17_tpu-32.tar.gz        

Extract this file to the Pre_Trained_Models folder. Inside this folder you will see a pipeline.config file. We are going to edit it, and this part is crucial. At the beginning of the file, change num_classes to the number of classes you want:

model {
  ssd {
    num_classes: 1
    image_resizer {
      keep_aspect_ratio_resizer {
        min_dimension: 512
        max_dimension: 512
        pad_to_max_dimension: true
      }

Then, in the train_config section, adjust the batch size to a value that fits in your computer's memory:

train_config {
  batch_size: 4
  data_augmentation_options {
    random_horizontal_flip {
    }
  }
  data_augmentation_options {
    random_scale_crop_and_pad_to_square {
      output_size: 512
      scale_min: 0.10000000149011612
      scale_max: 2.0
    }
  }

Then, fill in 5 more paths: fine_tune_checkpoint pointing to the EfficientDet checkpoint, the two label_map_path entries pointing to your labelmap.pbtxt file (inside the Annotations folder), and the paths to your train.record and test.record files (also inside the Annotations folder). Two more setups:

Set the number of steps to 5000 and fine_tune_checkpoint_type to "detection". If you keep fine_tune_checkpoint_type as "classification", the output will be a class (a number), but we need a dictionary with the annotated boxes for object detection. This part will look like this:

  fine_tune_checkpoint: "/home/theone/.conda/envs/tf2/lib/python3.8/site-packages/tensorflow/models/research/object_detection/training-demo-tf2/pre-trained-models/efficientdet_d0_coco17_tpu-32/checkpoint/ckpt-0"
  num_steps: 5000
  startup_delay_steps: 0.0
  replicas_to_aggregate: 4
  max_number_of_boxes: 100
  unpad_groundtruth_tensors: false
  fine_tune_checkpoint_type: "detection"
  use_bfloat16: true
  fine_tune_checkpoint_version: V2
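
Further down in pipeline.config you will find the input reader sections, which is where the label map and .record paths from the previous step go. They should end up looking something like this (the paths below are examples; use the full paths to your own Annotations folder):

train_input_reader {
  label_map_path: "Annotations/labelmap.pbtxt"
  tf_record_input_reader {
    input_path: "Annotations/train.record"
  }
}

eval_input_reader {
  label_map_path: "Annotations/labelmap.pbtxt"
  shuffle: false
  num_epochs: 1
  tf_record_input_reader {
    input_path: "Annotations/test.record"
  }
}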

Now, one last setup: in the Annotations folder, edit labelmap.pbtxt to contain this (adjust to the number of classes you have):

item {
  id: 1
  name: 'desod'
}
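
For two classes (matching the generate_tfrecord.py example above), it would be:

item {
  id: 1
  name: 'desod'
}
item {
  id: 2
  name: 'car'
}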

Now we can train the object detection model (from inside the /object_detection folder) using the model_main_tf2.py file. Remember, the output folder inside the demo folder must be empty in order to avoid errors.

python model_main_tf2.py --pipeline_config_path=./training-demo-tf2/pre-trained-models/efficientdet_d0_coco17_tpu-32/pipeline.config --model_dir=./training-demo-tf2/output --alsologtostderr        
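
While it trains, you can follow the loss curves with TensorBoard pointed at the same output folder:

tensorboard --logdir=./training-demo-tf2/output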

Good, now we can export the inference_graph. Still inside the /object_detection folder:

python exporter_main_v2.py --trained_checkpoint_dir=./training-demo-tf2/output --pipeline_config_path=./training-demo-tf2/pre-trained-models/efficientdet_d0_coco17_tpu-32/pipeline.config --output_directory ./training-demo-tf2/inference_graph        

This will generate a .pb file whose path must be passed on the command line to webcam.py for inference, along with the labelmap.pbtxt path. Be sure to install OpenCV:

pip3 install opencv-python        
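
For reference, a minimal inference loop along the lines of webcam.py could look like the sketch below; the saved_model and labelmap paths are examples, and the actual webcam.py in the repository may differ:

# A minimal real-time inference sketch. The saved_model and labelmap paths
# below are examples -- adjust them to your own folders.
import cv2
import numpy as np
import tensorflow as tf
from object_detection.utils import label_map_util
from object_detection.utils import visualization_utils as viz_utils

MODEL_DIR = "./training-demo-tf2/inference_graph/saved_model"   # example path
LABELMAP = "./training-demo-tf2/Annotations/labelmap.pbtxt"     # example path

detect_fn = tf.saved_model.load(MODEL_DIR)
category_index = label_map_util.create_category_index_from_labelmap(
    LABELMAP, use_display_name=True)

cap = cv2.VideoCapture(0)  # default webcam
while True:
    ok, frame = cap.read()
    if not ok:
        break
    # OpenCV gives BGR frames; the model was trained on RGB, so you may
    # want to convert with cv2.cvtColor for best results.
    input_tensor = tf.convert_to_tensor(frame[np.newaxis, ...], dtype=tf.uint8)
    detections = detect_fn(input_tensor)

    viz_utils.visualize_boxes_and_labels_on_image_array(
        frame,
        detections["detection_boxes"][0].numpy(),
        detections["detection_classes"][0].numpy().astype(np.int32),
        detections["detection_scores"][0].numpy(),
        category_index,
        use_normalized_coordinates=True,
        min_score_thresh=0.5)

    cv2.imshow("detections", frame)
    if cv2.waitKey(1) & 0xFF == ord("q"):
        break

cap.release()
cv2.destroyAllWindows()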

Now, if you want, you can deploy the model in the cloud. Add the Python inference file containing the Flask app, requirements.txt (the required libraries), the .yaml configuration files, and the Dockerfile to a folder. The idea is to build an image in Google Cloud Container Registry and deploy it to Cloud Run for inference.
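
As an illustration, a minimal Flask inference app could look like the sketch below; the file name app.py, the /predict route, and the model path are assumptions to be adapted to your own project:

# A minimal Flask inference app sketch. The route and model path below are
# assumptions -- adapt them to your own project.
import numpy as np
import tensorflow as tf
from flask import Flask, request, jsonify

app = Flask(__name__)

# Load the exported SavedModel once, at startup (example path)
detect_fn = tf.saved_model.load("inference_graph/saved_model")

@app.route("/predict", methods=["POST"])
def predict():
    # Expects a JSON body: {"image": [[[r, g, b], ...], ...]} with uint8 pixels
    image = np.array(request.get_json()["image"], dtype=np.uint8)
    detections = detect_fn(tf.convert_to_tensor(image[np.newaxis, ...]))
    return jsonify({
        "boxes": detections["detection_boxes"][0].numpy().tolist(),
        "classes": detections["detection_classes"][0].numpy().tolist(),
        "scores": detections["detection_scores"][0].numpy().tolist(),
    })

if __name__ == "__main__":
    # Cloud Run routes requests to port 8080 by default
    app.run(host="0.0.0.0", port=8080)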

Supposing you are in the folder where the files (requirements.txt, .yaml files) and Dockerfile are, you can create a docker image:

gcloud builds submit --tag gcr.io/your_project_id/image_name .        

Then deploy in Cloud Run:

gcloud run deploy --image gcr.io/your_project_id/image_name        
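
Once the service is deployed, you can test it with a request along these lines (the URL and the /predict route are examples and must match your own service and Flask app):

curl -X POST https://your-service-url.run.app/predict -H "Content-Type: application/json" -d @image.json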

For GKE (Google Kubernetes Engine):

$ gcloud container clusters create example-gke --num-nodes 1 --enable-basic-auth --issue-client-certificate --enable-autoscaling --min-nodes 1 --max-nodes 3 --region us-central1 --machine-type n1-standard-2 --local-ssd-count 1 --scopes=cloud-platform,storage-full,compute-rw,service-control,cloud-source-repos --service-account [email protected] 

$ gcloud container clusters get-credentials example-gke --region=us-central1

$ kubectl create secret generic my-app-sa-key --from-file service-account.json

$ kubectl get secret

$ kubectl apply -f deployment.yaml

$ kubectl get pods

$ kubectl get service

$ kubectl describe pod example-gke-12345

$ kubectl apply -f allow-all-network-policy.yaml

$ kubectl apply -f service.yaml        
