登录查看更多内容

Build Your Own Text Classification Model From Scratch

Vishal Verma

AI & Data Science Professional | Data Scientist | Machine Learning | Generative AI | AI Agents | NLP | Career Coach | AI Enthusiast | Helping Others Succeed

发布日期: 2024年2月23日

Hey there, tech explorers! Ever wonder how computers can understand our feelings through words? Well, today, we're diving into the cool world of Sentiment Analysis with TensorFlow - a fancy name for teaching computers to know if we're happy or not, excited or a bit bummed out, just by reading what we write!

Setting the Stage: Let's Gather Our Tools

Okay, first things first. We need some tools to make the magic happen. We bring in our computer language called Python and a special helper library called TensorFlow. It's like giving our computer a superhero suit! Then, we grab a bunch of sentences - some happy, some not-so-happy - to teach our computer the difference.

import tensorflow as tf
import numpy as np
from tensorflow.keras.preprocessing.text import Tokenizer
from tensorflow.keras.preprocessing.sequence import pad_sequences

# Our happy and not-so-happy sentences
train_sentences = [
    "The new restaurant in town exceeded my expectations.",
    "I was disappointed with the service at the hotel.",
    "The concert last night was amazing!",
    "The traffic on the way to work this morning was unbearable.",
    "I love the atmosphere of this place.",
    "The customer service was excellent.",
    "Yesterday's weather was fantastic.",
    "The book I read last night was boring.",
    "The food at the cafe was delicious.",
    "The flight got delayed, and it was frustrating.",
    "The park is a beautiful place to relax.",
    "The smartphone's battery life is impressive.",
    "The company's customer support needs improvement.",
    "The play at the theater was captivating.",
    "I had a wonderful experience with the tech support team.",
    "The hiking trail offers breathtaking views.",
    "The traffic signal system in the city is inefficient.",
    "The museum exhibits were informative and interesting.",
    "The new software update caused my computer to crash.",
    "The beach vacation was incredibly relaxing."
]

# Labels: 1 for positive, 0 for negative

train_labels = [1, 0, 1, 0, 1, 1, 1, 0, 1, 0, 1, 1, 0, 1, 1, 1, 0, 1, 0, 1]

Building Blocks: Turning Words into Numbers

Computers don't understand words like we do, so we have to turn our sentences into numbers. We do this with something called tokenization and padding. It's like translating our sentences into a language computers understand.

# Tokenizing and padding sequences for training
tokenizer = Tokenizer(oov_token="<OOV>")
tokenizer.fit_on_texts(train_sentences)
word_index = tokenizer.word_index
sequences = tokenizer.texts_to_sequences(train_sentences)
padded_sequences = pad_sequences(sequences)

Neural Network: Our Computer Brain

Now, we create a simple computer brain, kind of like a tiny robot, to learn from all those numbers. Our robot brain has layers - one to understand words, one to think, and one to decide if it's a happy or sad sentence.

# Simple Neural Network for Text Classification
model = tf.keras.Sequential([
    tf.keras.layers.Embedding(input_dim=len(word_index) + 1, output_dim=16, input_length=padded_sequences.shape[1]),
    tf.keras.layers.Flatten(),
    tf.keras.layers.Dense(8, activation='relu'),
    tf.keras.layers.Dense(1, activation='sigmoid')
])

Let's Train Our Robot: Learning Time!

Just like teaching a pet a new trick, we show our computer brain lots of sentences and tell it if they're happy or not. We do this several times (epochs) until our computer gets really good at guessing feelings.

领英推荐

Deep Learning Roadmap 2022 - The Ultimate Guide

Abhinavan Sarikonda ? 2 年前

OpenAI's o1 Model: Einstein in a Box - A Breakthrough…

Stanislav Sorokin 6 个月前

AI Developer tech skillsets.

Darko Medin 1 个月前

model.compile(optimizer='adam', loss='binary_crossentropy', metrics=['accuracy'])

# Convert labels to NumPy array
train_labels = np.array(train_labels)

# Train the model
model.fit(padded_sequences, train_labels, epochs=10)

Saving Our Robot's Knowledge: For Later!

We don't want to lose all the hard work, so we save our computer brain's knowledge in a file. It's like putting our robot on pause and telling it, "Remember everything you learned!"

# Save the model
model.save('text_classification_model.h5')

Testing Time: Let's See How Our Robot Does!

Now, the fun part! We give our robot some new sentences it has never seen before and ask it, "Hey, are these happy or not?" It gives us its best guess.

# Sample comments for prediction
predict_sentences = [
    "I love this product!",
    "The movie was fantastic.",
    "I had a terrible experience with customer service.",
    "The book I read last night was boring."
]

# Tokenize and pad the input sentences for prediction
predict_sequences = tokenizer.texts_to_sequences(predict_sentences)
predict_padded_sequences = pad_sequences(predict_sequences, maxlen=padded_sequences.shape[1])

# Make predictions using the trained model
predictions = model.predict(predict_padded_sequences)

# Convert probability predictions to binary labels (1 for positive, 0 for negative)
binary_predictions = np.round(predictions).astype(int)

# Display the results
for sentence, prediction in zip(predict_sentences, binary_predictions):
    sentiment = "Positive" if prediction == 1 else "Negative"
    print(f"Sentence: '{sentence}' - Predicted Sentiment: {sentiment}")

Output

Conclusion: Cheers to Understanding Feelings!

And there you have it! We just took our computer on a journey to understand feelings through words. Imagine all the cool things we can do with this - like making sure customers are happy or helping machines chat with us better. The world of tech is full of wonders, and we've just scratched the surface. Keep exploring, and who knows what amazing things you might create!

The Intelligent Future

463 位关注者

要查看或添加评论，请登录

Vishal Verma的更多文章

Unleashing the Power of Clustering: Empowering Decision Making for Success

2023年6月9日

Unleashing the Power of Clustering: Empowering Decision Making for Success

In today's data-driven world, making informed decisions is crucial for individuals and organizations alike…
Large Language Models: The Backbone of Generative AI

2023年6月8日

Large Language Models: The Backbone of Generative AI

The Power of Language in AI Language has always been a defining characteristic of human intelligence. It allows us to…
Mastering SQL for Data Analysts: Top 10 Interview Questions Explained with Comprehensive Answers and Theoretical Concepts

2023年6月7日

Mastering SQL for Data Analysts: Top 10 Interview Questions Explained with Comprehensive Answers and Theoretical Concepts

Top 10 Theoretical SQL interview questions with answers for Data Analysts: 1. What is SQL? SQL stands for Structured…
Evolving Data Landscape: Why Upskilling is Essential for Success?

2023年6月6日

Evolving Data Landscape: Why Upskilling is Essential for Success?

In today's rapidly evolving world, data has become the lifeblood of organizations across industries. However, the data…
The Power of Prompt Engineering in Generative AI: Unlocking Precision and Control

2023年6月5日

The Power of Prompt Engineering in Generative AI: Unlocking Precision and Control

In the realm of generative AI, prompt engineering emerges as a critical tool, offering developers the ability to shape…
Pros and Cons of Generative AI: Exploring Benefits and Challenges

2023年6月4日

Pros and Cons of Generative AI: Exploring Benefits and Challenges

Generative AI, like any technology, has its own set of pros and cons. Here are some of the key advantages and…
Roadmap to Become a Successful Data Analyst in 6 Months: A Step-by-Step Guide

2023年6月4日

Roadmap to Become a Successful Data Analyst in 6 Months: A Step-by-Step Guide

Becoming a successful data analyst in just six months requires focused effort and a strategic roadmap. While it's…
Why Microsoft Excel Remains Popular: Unraveling Its Enduring Market Dominance

2023年6月3日

Why Microsoft Excel Remains Popular: Unraveling Its Enduring Market Dominance

Microsoft Excel has maintained its popularity and dominance in the market for several reasons: Versatility Excel is a…
Mastering Data Analysis: Essential Skills for Becoming a Successful Data Analyst

2023年6月3日

Mastering Data Analysis: Essential Skills for Becoming a Successful Data Analyst

To become a skilled data analyst, you need a combination of technical and soft skills. Here are some key skills…
Harnessing Data Science and Analytics for Advancements in Civil Engineering: Practical Applications and Impacts

2023年6月2日

Harnessing Data Science and Analytics for Advancements in Civil Engineering: Practical Applications and Impacts

Data science and analytics have several practical applications in the field of Civil Engineering. A few examples are…

See all articles

Build Your Own Text Classification Model From Scratch

Vishal Verma

AI & Data Science Professional | Data Scientist | Machine Learning | Generative AI | AI Agents | NLP | Career Coach | AI Enthusiast | Helping Others Succeed

Setting the Stage: Let's Gather Our Tools

Building Blocks: Turning Words into Numbers

Neural Network: Our Computer Brain

Let's Train Our Robot: Learning Time!

领英推荐

Saving Our Robot's Knowledge: For Later!

Testing Time: Let's See How Our Robot Does!

Output

Conclusion: Cheers to Understanding Feelings!

The Intelligent Future

463 位关注者

Vishal Verma的更多文章

社区洞察

其他会员也浏览了

Frameworks and Libraries for AI Development: A Comprehensive Guide ????

How to Learn AI: A Comprehensive Self-Paced Learning Path

Demystifying Machine Learning: Build Your First Model in Python

DeepSeek R1: A Game-Changing AI Model Challenging Industry Leaders

GPT-4o vs Gemini 1.5 Pro: Battle of The Best AI of 2024

Command line tools for Machine learning

AI and Machine Learning Essentials: A Beginner's Guide with Hands-On Practice in Python

Amazing TensorFlow Application to Try Right Now

Course Directive: Foundational Principles and Methodologies for Mastery in Artificial Intelligence

Setting the Stage: Let's Gather Our Tools

Building Blocks: Turning Words into Numbers

Neural Network: Our Computer Brain

Let's Train Our Robot: Learning Time!

领英推荐

Saving Our Robot's Knowledge: For Later!

Testing Time: Let's See How Our Robot Does!

Output

Conclusion: Cheers to Understanding Feelings!

The Intelligent Future

463 位关注者

Vishal Verma的更多文章

Unleashing the Power of Clustering: Empowering Decision Making for Success

Large Language Models: The Backbone of Generative AI

Mastering SQL for Data Analysts: Top 10 Interview Questions Explained with Comprehensive Answers and Theoretical Concepts

Evolving Data Landscape: Why Upskilling is Essential for Success?

The Power of Prompt Engineering in Generative AI: Unlocking Precision and Control

Pros and Cons of Generative AI: Exploring Benefits and Challenges

Roadmap to Become a Successful Data Analyst in 6 Months: A Step-by-Step Guide

Why Microsoft Excel Remains Popular: Unraveling Its Enduring Market Dominance

Mastering Data Analysis: Essential Skills for Becoming a Successful Data Analyst

Harnessing Data Science and Analytics for Advancements in Civil Engineering: Practical Applications and Impacts

社区洞察

其他会员也浏览了

Frameworks and Libraries for AI Development: A Comprehensive Guide ????

How to Learn AI: A Comprehensive Self-Paced Learning Path

Demystifying Machine Learning: Build Your First Model in Python

DeepSeek R1: A Game-Changing AI Model Challenging Industry Leaders

GPT-4o vs Gemini 1.5 Pro: Battle of The Best AI of 2024

Command line tools for Machine learning

AI and Machine Learning Essentials: A Beginner's Guide with Hands-On Practice in Python

Amazing TensorFlow Application to Try Right Now

Course Directive: Foundational Principles and Methodologies for Mastery in Artificial Intelligence