登录查看更多内容

Covid-19 detection analyzing X-Ray images using a simple Convolutional Neural Net architecture

Saikat Chakraborty

Managing Director @ Accenture | Enterprise AI Value Strategy Executive

发布日期: 2020年12月7日

Just like all my articles, the opinions expressed in this article are of mine only as per my understanding and in no way represents anything that has to do with the organization I work for.

As several references mention,Chest X-Ray scans may be helpful to diagnose COVID-19 in individuals with a high clinical suspicion of infection. This small article is a demonstration of this using a simple deep learning architecture with a pretty reasonable accuracy, which is not perfect but makes a definite point in proving the effectiveness of AI as it may evolve in future. It is amazing to observe that with only a few lines of code, great results can be achieved!

The data that has been used for this experimentation, is publicly available in here:

https://github.com/shervinmin/DeepCovid/tree/master/data

Let's start by importing the required libraries as usual.

Created on Sat Nov 28 10:37:45 2020


@author: chakr
"""
import tensorflow as tf
from keras.preprocessing.image import ImageDataGenerator
import numpy as np
import glob

Next we need to preprocess the data.The ImageDataGenerator class is particularly useful and I would request the readers to try with different options to make the prediction even more accurate.

#data preprocessing
traingen = ImageDataGenerator(rescale = 1./255,shear_range=0.2,zoom_range=[0.2,1.8],horizontal_flip=True,vertical_flip=True)
training_set = traingen.flow_from_directory('covid19\\train',target_size = (256,256),batch_size=32,class_mode='binary')
testgen = ImageDataGenerator(rescale = 1./255)
test_set = traingen.flow_from_directory('covid19\\test',target_size = (256,256),batch_size=32,class_mode='binary')

Now, simply build the CNN model and train it with the training set.

cnn = tf.keras.models.Sequential()
cnn.add(tf.keras.layers.Conv2D(filters =32,kernel_size=3,activation='relu',input_shape = [256,256,3]))
cnn.add(tf.keras.layers.MaxPool2D(pool_size=(2,2),strides=2))
cnn.add(tf.keras.layers.Conv2D(filters =32,kernel_size=3,activation='relu'))
cnn.add(tf.keras.layers.MaxPool2D(pool_size=(2,2),strides=2))
cnn.add(tf.keras.layers.Flatten())
cnn.add(tf.keras.layers.Dense(units = 256,activation = 'relu'))
cnn.add(tf.keras.layers.Dense(units = 1,activation = 'sigmoid'))
cnn.compile(optimizer = 'adam',loss = 'binary_crossentropy',metrics=['accuracy'])
cnn.fit(x = training_set,validation_data=test_set,epochs=100)

Note the number of epochs in the snippet above. It usually takes 60 minutes to 90 minutes in my computer to finish the entire training for a 100 epoch run. So babysit the training in the beginning and based on the learning, you may adjust the same.

In my case, this is how the metrics look at the end of the run...

....
....
Epoch 99/100
21/21 [==============================] - 66s 3s/step - loss: 0.0645 - accuracy: 0.9759 - val_loss: 0.2242 - val_accuracy: 0.9381

Epoch 100/100
21/21 [==============================] - 67s 3s/step - loss: 0.0770 - accuracy: 0.9699 - val_loss: 0.0830 - val_accuracy: 0.9810

Now all that remains is the prediction for your data. You may feed single or multiple images in the model and collec predictions and check accuracy.

k = training_set.class_indices['covid']

def imageconverter(picture):
    import numpy as np
    from keras.preprocessing import image
    chk_img = image.load_img(picture,target_size = (256,256))
    chk_img = np.expand_dims(image.img_to_array(chk_img),axis = 0)
    return chk_img


imagelist = glob.glob('covid19\\val\\*.jpg')


chkimages = [imageconverter(i) for i in imagelist]


collector = []
for j in [imageconverter(i) for i in imagelist]: 
    predictions = cnn.predict(x = j)
    if predictions[0][0] == k:
        predictions = "Covid-19"
    else:
        predictions = "Normal"
    

    collector.append(predictions)

In my case, just without any optimization, in the first run, 88% accuracy was obtained in the validation images. I would encourage the readers to further optimize and share the results in the comments below.

Manish Kapoor

Senior Principal Engineer

4 年

On the onset accuracy is good...still can work on hyperparameters of CNN...Image size is 256*256...we can add more conv layers to have a better receptive field...stride of 2 may make the image blur can try with stride 1..also we can increase the filters on every layer added to extract more features from the image....will work on dataset and let you the results.

1 次回应

查看更多评论

要查看或添加评论，请登录

Saikat Chakraborty的更多文章

Optimizing Business Transformation Initiatives

2025年2月5日

Optimizing Business Transformation Initiatives

In their 2023 article published in Harvard Business Review,” What’s Derailing Your Company’s Transformation?”, Scott D.…
How DeepSeek Works? The Mixture of Experts Architecture

2025年1月29日

How DeepSeek Works? The Mixture of Experts Architecture

Opinions expressed in this short article are mine and has no connection to the organization I work for. DeepSeek works…

6 条评论
Multi Agent Orchestration using Autogen to create sequential data processing & demand forecasting

2024年12月10日

Multi Agent Orchestration using Autogen to create sequential data processing & demand forecasting

All opinions and contents expressed in this article are mine and not of the organization I work for AutoGen is an…
Creating an AI Agent, that drives Data Analysis through ML Model Creation

2024年11月23日

Creating an AI Agent, that drives Data Analysis through ML Model Creation

All opinions and contents expressed in this article are mine and not of the organization I work for AI agents can…

6 条评论
Kolmogorov-Arnold Networks or KAN, the latest advance in Neural Networks

2024年5月20日

Kolmogorov-Arnold Networks or KAN, the latest advance in Neural Networks

Opinions expressed in this article are mine and not connected in anyway with the organization I work for. On April…
Playing with the neural responses of human brain to deliver optimal presentations

2024年5月12日

Playing with the neural responses of human brain to deliver optimal presentations

In his groundbreaking work “Thinking Fast and Slow’, Nobel laureate Daniel Kahneman points to two distinct thinking…

4 条评论
Generative models that we carry with us!

2024年4月28日

Generative models that we carry with us!

Have you ever wondered about the fact that the generative models define who you are and how you perceive the world…

2 条评论
OWASP: Security Challenges of Large Language Models

2024年2月17日

OWASP: Security Challenges of Large Language Models

The information shared here is taken from OWASP published documents and the opinions expressed are entirely mine and…

2 条评论
Generative AI : A Primer

2023年10月1日

Generative AI : A Primer

All the content and opinion expressed in this article are mine and not of the organization I work for. What is GenAI…

3 条评论
Quantum Computing based Machine Learning using IBM Qiskit

2023年8月20日

Quantum Computing based Machine Learning using IBM Qiskit

Today I plan to discuss very briefly the application of quantum computing in machine learning using Qiskit which is an…

8 条评论

See all articles

Covid-19 detection analyzing X-Ray images using a simple Convolutional Neural Net architecture

Saikat Chakraborty

Managing Director @ Accenture | Enterprise AI Value Strategy Executive

Saikat Chakraborty的更多文章

社区洞察

其他会员也浏览了

Chapter 2.2 : Self-Driving Car [Intro to TensorFlow & Deep Neural Network]

Deep Learning

Continuous value prediction with decision forest algorithm

Long Short-Term Memory (LSTM)

k-Nearest Neighbors (KNN) Algorithm: Simple Yet Powerful (Part5)

Building a Neural Network from Scratch

#11 Finding Nemo: Exploring pre-trained Keras models

Copy of FACE-Recognization| VggFace and FaceEmbaddings.

Using LSTM Networks and Markov Chains to Predict Market Movements: A Neural Network Approach for Trading Analysis

Which Scaler is suitable for LSTM

Saikat Chakraborty的更多文章

Optimizing Business Transformation Initiatives

How DeepSeek Works? The Mixture of Experts Architecture

Multi Agent Orchestration using Autogen to create sequential data processing & demand forecasting

Creating an AI Agent, that drives Data Analysis through ML Model Creation

Kolmogorov-Arnold Networks or KAN, the latest advance in Neural Networks

Playing with the neural responses of human brain to deliver optimal presentations

Generative models that we carry with us!

OWASP: Security Challenges of Large Language Models

Generative AI : A Primer

Quantum Computing based Machine Learning using IBM Qiskit

社区洞察

其他会员也浏览了

Chapter 2.2 : Self-Driving Car [Intro to TensorFlow & Deep Neural Network]

Deep Learning

Continuous value prediction with decision forest algorithm

Long Short-Term Memory (LSTM)

k-Nearest Neighbors (KNN) Algorithm: Simple Yet Powerful (Part5)

Building a Neural Network from Scratch

#11 Finding Nemo: Exploring pre-trained Keras models

Copy of FACE-Recognization| VggFace and FaceEmbaddings.

Using LSTM Networks and Markov Chains to Predict Market Movements: A Neural Network Approach for Trading Analysis

Which Scaler is suitable for LSTM