Using Autoencoders for Dimensionality Reduction: A Practical Guide with MNIST
Rany ElHousieny, PhD
Dimensionality reduction is a crucial technique in data preprocessing, particularly for high-dimensional datasets. It simplifies the data, reduces storage requirements, and often improves the performance of machine learning models. One powerful method for dimensionality reduction is the autoencoder. In this article, we’ll explore how to use autoencoders for this purpose on the MNIST dataset and then compare the resulting classification accuracy with PCA.
What is an Autoencoder?
An autoencoder is a type of neural network designed to learn efficient codings of input data. It consists of two main parts:
Encoder: compresses the input into a lower-dimensional latent representation.
Decoder: reconstructs the original input from that latent representation.
For dimensionality reduction, we train the full autoencoder on a reconstruction objective and then keep only the encoder, as we'll do step by step below.
Steps to Use Autoencoders for Dimensionality Reduction
1. Load and preprocess the dataset.
2. Define and train the autoencoder.
3. Extract the encoded features from the trained encoder.
4. Use the encoded features for further analysis.
Let’s walk through these steps using the MNIST dataset.
1. Load and Preprocess the Dataset
First, we need to load the MNIST dataset, which contains images of handwritten digits.
from tensorflow.keras.datasets import mnist
import numpy as np
# Load the MNIST dataset
(X_train, y_train), (X_test, y_test) = mnist.load_data()
# Normalize the data to the range [0, 1]
X_train = X_train.astype('float32') / 255.
X_test = X_test.astype('float32') / 255.
# Flatten the images to vectors of size 784 (28*28)
X_train = X_train.reshape((X_train.shape[0], -1))
X_test = X_test.reshape((X_test.shape[0], -1))
import matplotlib.pyplot as plt
# Plot some of the images
num_images = 10
plt.figure(figsize=(10, 1))
for i in range(num_images):
    # Reshape the flattened image back to 28x28
    image = X_train[i].reshape(28, 28)
    # Plot the image
    plt.subplot(1, num_images, i + 1)
    plt.imshow(image, cmap='gray')
    plt.axis('off')
plt.show()
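A quick sanity check on the flattened arrays confirms the preprocessing (MNIST ships with 60,000 training and 10,000 test images):
# Verify the flattened shapes
print(X_train.shape)  # (60000, 784)
print(X_test.shape)   # (10000, 784)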
2. Define and Train the Autoencoder
We define an autoencoder with a simple, symmetric architecture: the encoder compresses each 784-dimensional input through layers of 128 and 64 units down to a 32-dimensional representation, and the decoder mirrors this path back to 784 dimensions.
from tensorflow.keras.layers import Input, Dense
from tensorflow.keras.models import Model
# Define the input layer
input_shape = (784,) # 28*28 pixels
input_layer = Input(shape=input_shape)
# Define the encoder
hidden_layer_1 = Dense(128, activation='relu')(input_layer)
hidden_layer_2 = Dense(64, activation='relu')(hidden_layer_1)
encoded_representation = Dense(32, activation='relu')(hidden_layer_2)
# Define the decoder
decoded = Dense(64, activation='relu')(encoded_representation)
decoded = Dense(128, activation='relu')(decoded)
output_layer = Dense(784, activation='sigmoid')(decoded)
# Create the autoencoder model
autoencoder = Model(inputs=input_layer, outputs=output_layer)
# Compile the model
autoencoder.compile(optimizer='adam', loss='binary_crossentropy')
# Train the autoencoder
autoencoder.fit(X_train, X_train, epochs=50, batch_size=256, shuffle=True, validation_data=(X_test, X_test))
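Before moving on, it is worth eyeballing the reconstructions to confirm the autoencoder has learned something useful. Here is a minimal sketch that reuses the variables defined above (num_images, X_test, and the trained autoencoder):
# Reconstruct the test images with the trained autoencoder
reconstructed = autoencoder.predict(X_test)
plt.figure(figsize=(10, 2))
for i in range(num_images):
    # Original image on the top row
    plt.subplot(2, num_images, i + 1)
    plt.imshow(X_test[i].reshape(28, 28), cmap='gray')
    plt.axis('off')
    # Reconstruction on the bottom row
    plt.subplot(2, num_images, num_images + i + 1)
    plt.imshow(reconstructed[i].reshape(28, 28), cmap='gray')
    plt.axis('off')
plt.show()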
3. Extract Encoded Features
After training, we extract the encoder part of the model to obtain the encoded features.
# Extract the encoder model
encoder = Model(inputs=input_layer, outputs=encoded_representation)
# Obtain the encoded features
encoded_train_features = encoder.predict(X_train)
encoded_test_features = encoder.predict(X_test)
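A shape check confirms the compression: each 784-pixel image is now represented by a 32-dimensional vector.
# Each image is now a 32-dimensional vector
print(encoded_train_features.shape)  # (60000, 32)
print(encoded_test_features.shape)   # (10000, 32)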
4. Use Encoded Features for Further Analysis
We can now use these encoded features for further analysis. For instance, we can train a classifier on the encoded features and evaluate its performance.
from sklearn.ensemble import RandomForestClassifier
from sklearn.metrics import accuracy_score
# Train a classifier on the encoded features
clf = RandomForestClassifier()
clf.fit(encoded_train_features, y_train)
# Evaluate the classifier
predictions = clf.predict(encoded_test_features)
accuracy = accuracy_score(y_test, predictions)
print(f'Accuracy: {accuracy}')
Accuracy: 0.9453
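For context, it helps to know what the same classifier achieves without any dimensionality reduction. The following sketch trains a Random Forest on the raw 784-dimensional pixels (expect it to be slower, since it works with roughly 25 times more features):
# Baseline: train the same classifier on the raw pixels
clf_raw = RandomForestClassifier()
clf_raw.fit(X_train, y_train)
raw_predictions = clf_raw.predict(X_test)
print(f'Accuracy on raw pixels: {accuracy_score(y_test, raw_predictions)}')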
Autoencoder vs PCA
Let's compare autoencoders with Principal Component Analysis (PCA) using the same MNIST dataset.
1. Dimensionality Reduction with PCA
Let's perform dimensionality reduction using PCA:
from sklearn.decomposition import PCA
# Initialize PCA with the same number of components as the autoencoder
pca = PCA(n_components=32)
# Fit PCA on the training data and transform both training and test data
pca_train_features = pca.fit_transform(X_train)
pca_test_features = pca.transform(X_test)
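One practical advantage of PCA is that it reports how much of the data's total variance the chosen components retain, via scikit-learn's explained_variance_ratio_ attribute:
# Fraction of the total variance captured by the 32 components
print(f'Explained variance retained: {pca.explained_variance_ratio_.sum():.4f}')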
2. Train and Evaluate a Classifier using PCA Features
We'll train and evaluate a Random Forest classifier using the features obtained from the PCA:
# Train a classifier on the PCA encoded features
clf_pca = RandomForestClassifier()
clf_pca.fit(pca_train_features, y_train)
pca_predictions = clf_pca.predict(pca_test_features)
pca_accuracy = accuracy_score(y_test, pca_predictions)
print(f'Accuracy using PCA: {pca_accuracy}')
Comparison and Results
Accuracy using PCA: 0.9535
Recall the accuracy obtained earlier with the autoencoder features:
Accuracy: 0.9453
This example demonstrates how to perform dimensionality reduction with both autoencoders and PCA, and how to evaluate a classifier trained on the reduced features. In this run, PCA came out slightly ahead (0.9535 vs. 0.9453), though the outcome can shift with the autoencoder's architecture, training time, and random initialization.
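Classification accuracy is only one lens; another is reconstruction error, i.e., how faithfully each 32-dimensional representation can be decoded back into the original image. A minimal sketch, assuming the trained autoencoder and fitted pca objects from above are still in scope:
# Reconstruct the test images from each 32-dimensional representation
ae_reconstructed = autoencoder.predict(X_test)
pca_reconstructed = pca.inverse_transform(pca_test_features)
# Mean squared reconstruction error over the test set
ae_mse = np.mean((X_test - ae_reconstructed) ** 2)
pca_mse = np.mean((X_test - pca_reconstructed) ** 2)
print(f'Autoencoder reconstruction MSE: {ae_mse:.5f}')
print(f'PCA reconstruction MSE: {pca_mse:.5f}')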
Conclusion
Both autoencoders and PCA are powerful tools for dimensionality reduction. Autoencoders can learn more complex and non-linear transformations, potentially capturing more intricate structures in the data. PCA, on the other hand, is a linear method that is often simpler and faster to apply. The choice between them depends on the specific characteristics of your data and the requirements of your task. In practice, it's valuable to experiment with both methods to determine which one works best for your particular application.