登录查看更多内容

Demystifying Deep Learning: A Beginner's Guide to Artificial Neural Networks with TensorFlow

Hayat Ullah Abid

Geo-Informatics Engineer | Full Stack Web Developer | Spatial Data Science

发布日期: 2023年12月3日

Introduction

Artificial Neural Networks (ANN) form the backbone of modern deep learning, enabling machines to mimic the human brain's ability to learn and make decisions. In this article, we'll embark on a journey into the realm of deep learning using TensorFlow and Keras, unraveling the intricacies of building a neural network for image classification.

Motivation

The choice of the MNIST dataset serves as an excellent starting point for our deep learning exploration. MNIST, a collection of handwritten digits, is a classic dataset widely used for training and testing machine learning models. Its simplicity allows beginners to grasp the fundamentals while still posing an interesting challenge.

Code Walkthrough

Loading and Preprocessing the Data

We begin by loading the MNIST dataset using TensorFlow's convenient mnist.load_data() function. The dataset is then preprocessed by normalizing pixel values to a range between 0 and 1, preparing it for neural network training.

# Loading and preprocessing the data
import tensorflow as tf
from tensorflow.keras.datasets import mnist
from sklearn.model_selection import train_test_split

(X_train, y_train), (X_test, y_test) = mnist.load_data()

# Normalize pixel values
X_train = X_train / 255.0
X_test = X_test / 255.0

# Reshape the data
X_train = X_train.reshape(-1, 28*28)
X_test = X_test.reshape(-1, 28*28)

# Split the data
X_train, X_val, y_train, y_val = train_test_split(X_train, y_train, test_size=0.2, random_state=42)

Building the Neural Network Model

Our neural network is quite simple. It has a layer with 128 "neurons" that learn from the data using a method called ReLU (Rectified Linear Unit). We also include a dropout layer to prevent overfitting, followed by a layer with 10 neurons using softmax activation for classifying the digits.

# Building the neural network model

model = tf.keras.models.Sequential()
model.add(tf.keras.layers.Dense(units=128, activation='relu', input_shape=(784,)))
model.add(tf.keras.layers.Dropout(0.2))
model.add(tf.keras.layers.Dense(units=10, activation='softmax'))

Compiling and Training the Model

Next, we prepare the model for training by specifying how it should learn. We choose the "Adam" method for learning and "sparse categorical crossentropy" to measure how well it's doing. The model is then trained using our training data.

# Compiling and training the model
model.compile(optimizer='adam', loss='sparse_categorical_crossentropy', metrics=['sparse_categorical_accuracy'])
model.fit(X_train, y_train, epochs=5, validation_data=(X_val, y_val))

Evaluation on the Test Set

Finally, we want to see how well our model performs on new, unseen data—our test set.

领英推荐

Build Deep Learning model for the Image Classification…

Saigon Technology - Accelerate Software Development 1 年前

Deep Learning Demystified: Understanding Neural…

Crest Infotech ? 1 个月前

Deep Learning training

Bluechip Technologies Asia 1 年前

# Evaluating the model on the test set
test_loss, test_accuracy = model.evaluate(X_test, y_test)
print(f'Test Accuracy: {test_accuracy}')

Model Architecture

Let's take a closer look at how our neural network is structured. The decisions we made, like having 128 neurons and using dropout, influence how well the model learns.

Training Process and Results

During training, we keep an eye on important numbers like accuracy and loss. Seeing how these change over time helps us understand if our model is improving.

Performance Metrics

We use specific metrics to measure how good our model is at recognizing digits. These metrics, like "sparse categorical crossentropy," give us insights into its performance.

Visualization

Visualizing our training progress can be helpful. Creating graphs using a tool like Matplotlib allows us to see how our model gets better over time.

Challenges Faced and Tips for Beginners

Learning about deep learning can be challenging. I encountered some hurdles, and here are a few tips to make your journey smoother:

Take it step by step: Break down complex concepts into smaller, manageable parts.
Experiment: Don't be afraid to try different things and see what happens.
Learn from mistakes: It's okay to make errors; they're a natural part of the learning process.

Conclusion

In wrapping up, we've covered the basics of creating a neural network using TensorFlow. Armed with this knowledge, you're ready to explore more complex aspects of deep learning.

Future Directions

As you continue your deep learning adventure, consider exploring more advanced topics like different model architectures and diverse datasets. The field is vast and ever-evolving, offering endless possibilities for exploration.

Acknowledgments

A big thank you to the machine learning community and the numerous online resources that have made my learning journey exciting and rewarding.

Demystifying Deep Learning: A Beginner's Guide to Artificial Neural Networks with TensorFlow

Hayat Ullah Abid

Geo-Informatics Engineer | Full Stack Web Developer | Spatial Data Science

Introduction

Motivation

Code Walkthrough

Loading and Preprocessing the Data

Building the Neural Network Model

Compiling and Training the Model

Evaluation on the Test Set

领英推荐

Model Architecture

Training Process and Results

Performance Metrics

Visualization

Challenges Faced and Tips for Beginners

Conclusion

Future Directions

Acknowledgments

社区洞察

其他会员也浏览了

3 real world deep learning projects

What Is Deep Learning ?

What is deep learning?

Deep Learning with Graphs: Part 9 of my Graph Series of blogs

Top 12 Deep learning Features

Essential Concepts From Little Book of Deep Learning

How to fine-tune your algorithms

What is deep learning?

DEEP LEARNING INTERVIEW QUESTIONS

Artificial Intelligence - Part 4 - Deep Learning