Building an Image Classification Model: Thanos vs. Joker


Introduction

As a passionate computer vision enthusiast, I embarked on an exciting journey to build an image classification model capable of distinguishing between two iconic characters: Thanos and Joker. In this article, I’ll walk you through the entire process, from data collection to model evaluation.

1. Data Collection

To create a robust dataset, I used the simple_image_download library. This Python package allowed me to download images related to both Thanos and Joker. The dataset included various poses, lighting conditions, and backgrounds, ensuring diversity for effective training.

from simple_image_download import simple_image_download as sim

# Download 60 images for each character; the library stores them under simple_images/<keyword>/
response = sim.simple_image_download()
response.download('thanos', 60)
response.download('joker', 60)
        

2. Model Architecture

For image classification, I designed a Convolutional Neural Network (CNN) model. Let’s break down the architecture:

from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Conv2D, MaxPooling2D, Flatten, Dense

model = Sequential()
model.add(Conv2D(32, (3, 3), input_shape=(64, 64, 3), activation='relu'))
model.add(MaxPooling2D(pool_size=(2, 2)))
model.add(Flatten())
model.add(Dense(units=128, activation='relu'))
model.add(Dense(units=1, activation='sigmoid'))
model.compile(optimizer='adam', loss='binary_crossentropy', metrics=['accuracy'])
        

  • The first layer is a 2D convolutional layer with 32 filters and a 3x3 kernel size. It uses the ReLU activation function.
  • Max-pooling reduces spatial dimensions.
  • The Flatten layer reshapes the pooled feature maps into a single vector for the fully connected layers.
  • Two dense layers follow: one with 128 units (ReLU activation) and a final output layer with a single unit (sigmoid activation) that outputs a probability for binary classification.
  • The model is compiled with the Adam optimizer and binary cross-entropy loss, which suit a two-class problem.
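To sanity-check these dimensions, you can call model.summary() on the compiled model. The expected output shapes below are derived from a 64×64×3 input with Keras's default 'valid' convolution padding; exact layer names in the printout may differ.

model.summary()
# Expected shapes and parameter counts:
#   Conv2D        -> (None, 62, 62, 32)        896 parameters
#   MaxPooling2D  -> (None, 31, 31, 32)          0 parameters
#   Flatten       -> (None, 30752)               0 parameters
#   Dense(128)    -> (None, 128)         3,936,384 parameters
#   Dense(1)      -> (None, 1)                 129 parameters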

3. Data Split

I split the dataset into three subsets:

  • Training set: Used for model training
  • Validation set: Used for hyperparameter tuning
  • Test set: Remained unseen until model evaluation
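Since simple_image_download saves its downloads under a simple_images/<keyword>/ folder, a small helper can shuffle and copy the files into the directory layout the Keras generators expect. The sketch below is illustrative: the 70/15/15 split ratio and the Dataset/val and Dataset/test folder names are assumptions chosen to mirror the Dataset/train path used in the training code.

import os
import random
import shutil

def split_class(src_dir, class_name, ratios=(0.7, 0.15, 0.15)):
    """Copy one class's images into Dataset/train, Dataset/val and Dataset/test."""
    files = os.listdir(src_dir)
    random.shuffle(files)
    n_train = int(len(files) * ratios[0])
    n_val = int(len(files) * ratios[1])
    splits = {
        'train': files[:n_train],
        'val': files[n_train:n_train + n_val],
        'test': files[n_train + n_val:],
    }
    for split, names in splits.items():
        dest = os.path.join('Dataset', split, class_name)
        os.makedirs(dest, exist_ok=True)
        for name in names:
            shutil.copy(os.path.join(src_dir, name), dest)

split_class('simple_images/thanos', 'thanos')
split_class('simple_images/joker', 'joker')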

Data augmentation techniques (shear, zoom, and horizontal flip) were applied to enhance model generalization.
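In Keras, these augmentations are applied on the fly through ImageDataGenerator. The sketch below also builds the validation generator (val_set) used during training; the 0.2 shear and zoom factors and the Dataset/val path are assumptions, and validation images are only rescaled.

from tensorflow.keras.preprocessing.image import ImageDataGenerator

# Augment the training images on the fly; only rescale the validation images
train_datagen = ImageDataGenerator(rescale=1./255,
                                   shear_range=0.2,
                                   zoom_range=0.2,
                                   horizontal_flip=True)
val_datagen = ImageDataGenerator(rescale=1./255)

# 'Dataset/val' mirrors the 'Dataset/train' layout (assumed folder name)
val_set = val_datagen.flow_from_directory('Dataset/val',
                                          target_size=(64, 64),
                                          batch_size=8,
                                          class_mode='binary')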

4. Model Training

The model was trained using the training set:

training_set = train_datagen.flow_from_directory('Dataset/train',
                                                 target_size=(64, 64),
                                                 batch_size=8,
                                                 class_mode='binary')

# fit_generator is deprecated in recent Keras; model.fit accepts generators directly
model.fit(training_set,
          steps_per_epoch=10,
          epochs=50,
          validation_data=val_set,
          validation_steps=2)
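
The evaluation step below reloads the network from model.json and model.h5, so after training the architecture and weights have to be serialized first. A minimal sketch using the standard Keras serialization calls:

# Save the architecture as JSON and the learned weights as HDF5
with open('model.json', 'w') as json_file:
    json_file.write(model.to_json())
model.save_weights('model.h5')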
        

5. Model Evaluation

After training, I evaluated the model’s performance on the test set:

from tensorflow.keras.models import model_from_json
from tensorflow.keras.preprocessing import image
import numpy as np
import os

# Reload the trained architecture and weights
with open('model.json', 'r') as json_file:
    model = model_from_json(json_file.read())
model.load_weights("model.h5")

def classify(img_file):
    # Load and preprocess the test image to match the 64x64 training input
    img = image.load_img(img_file, target_size=(64, 64))
    x = image.img_to_array(img) / 255.0
    x = np.expand_dims(x, axis=0)
    # Sigmoid output; flow_from_directory assigns classes alphabetically: joker=0, thanos=1
    prediction = 'thanos' if model.predict(x)[0][0] > 0.5 else 'joker'
    print(prediction, img_file)

# Iterate through test images (folder path is an example; repeat for each class)
test_dir = 'Dataset/test/thanos'
for f in os.listdir(test_dir):
    classify(os.path.join(test_dir, f))
        

The Result

[Figure: sample predictions and the model's accuracy]


GitHub: https://github.com/heerthiraja/Deep-Learning-Projects/tree/main/Image-Classificaton-Project

Conclusion

The model successfully identified Thanos and Joker in unseen images. Feel free to explore further, fine-tune hyperparameters, and expand the dataset for even better results!


#computervision #imageclassification #deeplearning #cnn #dnn
