Deep Learning Project on MNIST Handwritten Digits Dataset: Step-by-Step Guide for Beginners
This guide will walk you through building a simple yet effective deep-learning model to classify handwritten digits using the MNIST dataset. We’ll use Python, TensorFlow/Keras, and Jupyter Notebook for this project.
Project Overview
We will load the MNIST dataset, explore and preprocess the images, build a fully connected neural network, train and evaluate it, make predictions, and finally save the trained model for later use.
Step 1: Setting Up the Environment
1.1 Install Required Libraries
Open your terminal and install the necessary libraries (in a Jupyter Notebook cell, prefix the command with an exclamation mark):
pip install tensorflow matplotlib numpy
1.2 Import Libraries
Create a Python script or Jupyter Notebook and start by importing the required libraries:
import tensorflow as tf
from tensorflow.keras import Sequential
from tensorflow.keras.layers import Dense, Flatten
from tensorflow.keras.datasets import mnist
import matplotlib.pyplot as plt
import numpy as np
Step 2: Load and Explore the MNIST Dataset
2.1 Load the Dataset
# Load MNIST dataset
(x_train, y_train), (x_test, y_test) = mnist.load_data()
print(f"Training Data Shape: {x_train.shape}")
print(f"Testing Data Shape: {x_test.shape}")
2.2 Visualize the Data
# Display sample images
plt.figure(figsize=(10, 5))
for i in range(10):
    plt.subplot(2, 5, i + 1)
    plt.imshow(x_train[i], cmap='gray')
    plt.title(f"Label: {y_train[i]}")
    plt.axis('off')
plt.show()
Step 3: Preprocess the Data
3.1 Normalize the Data
Normalize the image data to scale pixel values between 0 and 1.
x_train = x_train / 255.0
x_test = x_test / 255.0
3.2 Reshape the Data (if using Dense Layers)
Flatten the 28x28 images into 1D arrays of 784 pixels.
x_train = x_train.reshape(-1, 28 * 28)
x_test = x_test.reshape(-1, 28 * 28)
Step 4: Build the Neural Network Model
4.1 Define the Model
# Build the model
model = Sequential([
    Dense(128, activation='relu', input_shape=(784,)),  # Hidden layer with 128 neurons
    Dense(64, activation='relu'),                        # Hidden layer with 64 neurons
    Dense(10, activation='softmax')                      # Output layer with 10 classes (0-9)
])
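If you prefer to skip the manual reshape from Step 3.2, an equivalent sketch is to keep the images as 28x28 arrays and let the Flatten layer (already imported above) do the reshaping inside the model:
model = Sequential([
    Flatten(input_shape=(28, 28)),   # Flattens each 28x28 image into a 784-element vector
    Dense(128, activation='relu'),
    Dense(64, activation='relu'),
    Dense(10, activation='softmax')
])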
4.2 Compile the Model
Specify the optimizer, loss function, and evaluation metric:
model.compile(
    optimizer='adam',
    loss='sparse_categorical_crossentropy',
    metrics=['accuracy']
)
4.3 Model Summary
model.summary()
Step 5: Train the Model
Train the neural network on the training data, holding out 20% of it for validation:
history = model.fit(x_train, y_train, epochs=5, validation_split=0.2)
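The history object returned by fit records the loss and accuracy for each epoch. As a quick sketch (using the variable names from the code above), you can plot them to check how training is progressing and whether the model starts to overfit:
# Plot training vs. validation accuracy per epoch
plt.plot(history.history['accuracy'], label='train accuracy')
plt.plot(history.history['val_accuracy'], label='validation accuracy')
plt.xlabel('Epoch')
plt.ylabel('Accuracy')
plt.legend()
plt.show()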
Step 6: Evaluate the Model
Test the model's performance on unseen test data.
test_loss, test_accuracy = model.evaluate(x_test, y_test)
print(f"Test Accuracy: {test_accuracy * 100:.2f}%")
Step 7: Make Predictions
Use the trained model to make predictions on the test images:
predictions = model.predict(x_test)
# Display sample predictions
plt.figure(figsize=(10, 5))
for i in range(5):
    plt.subplot(1, 5, i + 1)
    plt.imshow(x_test[i].reshape(28, 28), cmap='gray')
    plt.title(f"Pred: {np.argmax(predictions[i])}")
    plt.axis('off')
plt.show()
Step 8: Save and Load the Model
8.1 Save the Model
model.save('mnist_digit_classifier.h5')
8.2 Load the Model
loaded_model = tf.keras.models.load_model('mnist_digit_classifier.h5')
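As a quick sanity check (a sketch that reuses the test data already in memory), you can confirm the reloaded model gives the same accuracy as the original:
loaded_loss, loaded_accuracy = loaded_model.evaluate(x_test, y_test)
print(f"Reloaded Model Accuracy: {loaded_accuracy * 100:.2f}%")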
Step 9: Fine-Tuning and Optimization
A simple dense network like this already performs well on MNIST, but there is room for improvement. Common options are adding regularization such as Dropout, tuning the learning rate and batch size, training for more epochs with early stopping, or switching to a convolutional architecture (see FAQ 5). A sketch of the first two ideas follows.
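This is one illustrative sketch, not the only approach: it adds Dropout after each hidden layer and stops training when the validation loss stops improving.
from tensorflow.keras.layers import Dropout
from tensorflow.keras.callbacks import EarlyStopping

# Same architecture as before, with dropout after each hidden layer
model = Sequential([
    Dense(128, activation='relu', input_shape=(784,)),
    Dropout(0.2),                      # Randomly drops 20% of activations during training
    Dense(64, activation='relu'),
    Dropout(0.2),
    Dense(10, activation='softmax')
])
model.compile(optimizer='adam', loss='sparse_categorical_crossentropy', metrics=['accuracy'])

# Stop when validation loss has not improved for 2 consecutive epochs
early_stop = EarlyStopping(monitor='val_loss', patience=2, restore_best_weights=True)
history = model.fit(x_train, y_train, epochs=20, validation_split=0.2, callbacks=[early_stop])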
Step 10: Conclusion
You have built, trained, evaluated, and saved a neural network that classifies handwritten digits. The same workflow of loading data, preprocessing, building a model, training, evaluating, and saving applies to most deep learning projects, so use this guide as a template for your next dataset.
Frequently Asked Questions (FAQs)
1. What is the MNIST Dataset?
The MNIST dataset is a collection of 70,000 handwritten digit images (0–9), commonly used for machine learning and deep learning.
2. Why Normalize the Data?
Normalization scales pixel values to a range (0–1), improving model convergence and performance.
3. What is the Purpose of Flattening the Data?
Flattening converts 2D images (28x28) into 1D arrays for input into fully connected layers.
4. What is Sparse Categorical Crossentropy?
It's a loss function used for multi-class classification problems where target labels are integers.
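As a small illustrative sketch, the difference is only in how the labels are encoded; with one-hot encoded labels you would use categorical_crossentropy instead:
# Integer labels (what MNIST provides) -> sparse_categorical_crossentropy
print(y_train[:3])                      # e.g. [5 0 4]

# One-hot encoded labels -> categorical_crossentropy
y_train_onehot = tf.keras.utils.to_categorical(y_train, num_classes=10)
print(y_train_onehot[:1])               # e.g. [[0. 0. 0. 0. 0. 1. 0. 0. 0. 0.]]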
5. How Can I Improve the Accuracy?
Use Convolutional Neural Networks (CNNs), increase training epochs, and fine-tune hyperparameters.
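As a rough sketch of the CNN approach (this assumes the original 28x28 images rather than the flattened version from Step 3.2):
from tensorflow.keras.layers import Conv2D, MaxPooling2D

cnn = Sequential([
    Conv2D(32, (3, 3), activation='relu', input_shape=(28, 28, 1)),  # 32 filters scan the image
    MaxPooling2D((2, 2)),                                            # Downsample by a factor of 2
    Conv2D(64, (3, 3), activation='relu'),
    MaxPooling2D((2, 2)),
    Flatten(),
    Dense(64, activation='relu'),
    Dense(10, activation='softmax')
])
cnn.compile(optimizer='adam', loss='sparse_categorical_crossentropy', metrics=['accuracy'])
# The inputs must include a channel dimension, i.e. shape (num_samples, 28, 28, 1):
# cnn.fit(x_train.reshape(-1, 28, 28, 1), y_train, epochs=5, validation_split=0.2)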
6. How Can I Use This Model in Real-World Applications?
Deploy it using tools like TensorFlow Lite or integrate it into web and mobile applications.
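For example, here is a minimal sketch of converting the trained Keras model to TensorFlow Lite for mobile or edge deployment (the output file name is illustrative):
# Convert the trained Keras model to the TensorFlow Lite format
converter = tf.lite.TFLiteConverter.from_keras_model(model)
tflite_model = converter.convert()

# Write the converted model to disk
with open('mnist_digit_classifier.tflite', 'wb') as f:
    f.write(tflite_model)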