Sentiment Analysis using Bi-Directional LSTM

As mentioned in my previous article, Sentiment Analysis using Deep Learning (1-D CNN), this post performs sentiment analysis on the same data using a bidirectional LSTM (Bi-LSTM), a form of Recurrent Neural Network (RNN).

Why RNN: RNNs are designed to exploit sequential data, where the current step depends on previous steps. This makes them well suited to applications with a time component (audio, time-series data) and to natural language processing. RNNs perform well wherever order matters, because ignoring sequential information can distort meaning or break grammar. Applications include image captioning, language modeling and machine translation.

Why LSTM: A Long Short-Term Memory (LSTM) network stores historical information in a memory cell: at each time step the cell carries forward information from previous inputs, which effectively alleviates the long-range dependency problem of plain RNNs.

Why Bi-LSTM: A standard LSTM only sees past context. A Bi-LSTM processes the sequence in both directions, capturing both historical and future information at every step, which typically yields better performance on tasks like sentiment analysis.
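
To make this concrete, below is a minimal sketch of how such a Bi-LSTM classifier might be defined in Keras. The vocabulary size, LSTM units and other hyperparameters are illustrative assumptions, not the exact values from my code (the embedding dimension of 50 and the 500-word input length do match the setup described later in this article):

from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Embedding, Bidirectional, LSTM, Dense

# Illustrative hyperparameters (assumptions, not the exact values used here)
VOCAB_SIZE = 10000  # words kept in the tokenizer vocabulary
EMBED_DIM = 50      # embedding dimension (this article uses 50)
MAX_LEN = 500       # first 500 words of each review (as noted below)

model = Sequential([
    # Map each word index to a dense 50-dimensional vector
    Embedding(input_dim=VOCAB_SIZE, output_dim=EMBED_DIM, input_length=MAX_LEN),
    # The Bidirectional wrapper runs one LSTM forward and one backward over
    # the sequence and concatenates their outputs, so every position sees
    # both past and future context
    Bidirectional(LSTM(64)),
    # Single sigmoid unit for binary positive/negative sentiment
    Dense(1, activation='sigmoid'),
])
model.compile(optimizer='adam', loss='binary_crossentropy', metrics=['accuracy'])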

Note: I trained the Bi-LSTM on a Windows 10 machine with an i7 CPU, and training took around 8 hours to complete (1 hour per epoch on average). I highly recommend using a GPU if you want to save time. This snippet shows the time and resources it took to train the model:

[Screenshot: training time and resource usage]

For details on the input data and callback functions, refer to my previous article on sentiment analysis: Sentiment Analysis using Deep Learning (1-D CNN).

Once again, I have used TensorBoard to monitor training and validation metrics.

TensorBoard

TensorBoard provides an excellent way to visualize the various metrics generated during model training and validation. Some of the features I found immensely useful:

·    Tracking and visualizing metrics such as loss and accuracy

·    Visualizing the model graph (ops and layers)

·    Viewing histograms of weights, biases, or other tensors as they change over time
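
For reference, hooking TensorBoard into Keras training only takes a callback. Here is a minimal sketch (the log directory name is an illustrative assumption):

from tensorflow.keras.callbacks import TensorBoard

# histogram_freq=1 records weight/bias histograms every epoch
tensorboard_cb = TensorBoard(log_dir='logs/bilstm', histogram_freq=1)

# Pass it alongside any other callbacks when fitting the model:
# model.fit(X_train, y_train, validation_data=(X_val, y_val),
#           epochs=8, callbacks=[tensorboard_cb])

# Inside a Jupyter notebook, TensorBoard can then be launched inline:
#   %load_ext tensorboard
#   %tensorboard --logdir logs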

It can easily be invoked on a Windows 10 machine from a Jupyter notebook with minimal effort, so I leveraged it to monitor some metrics. Here are a couple of samples:

[TensorBoard screenshots: training and validation metrics]

Here are some sample predictions generated by the Bi-LSTM model on IMDB reviews:

[Screenshot: sample Bi-LSTM predictions on IMDB reviews]
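
For context, generating a prediction like the ones above boils down to tokenizing the raw text, padding it to the training length, and calling model.predict. A minimal sketch, assuming a Keras tokenizer fitted during training (the tokenizer name and the 0.5 decision threshold are illustrative):

from tensorflow.keras.preprocessing.sequence import pad_sequences

def predict_sentiment(model, tokenizer, review, max_len=500):
    """Return a positive-sentiment probability for a raw review string."""
    # Convert the review text into a sequence of word indices
    seq = tokenizer.texts_to_sequences([review])
    # Pad/truncate to the same length used during training
    padded = pad_sequences(seq, maxlen=max_len)
    # Sigmoid output: close to 1.0 means positive, close to 0.0 means negative
    return float(model.predict(padded, verbose=0)[0][0])

# Example usage:
# score = predict_sentiment(model, tokenizer, "A wonderful, moving film.")
# print("positive" if score >= 0.5 else "negative", round(score, 3))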

Further Enhancements

There are a number of improvements that could be made to this model, including (but not limited to):

·     Improving the word embeddings by increasing the embedding dimension from 50 to 100 or more

·     Leveraging existing pre-trained word embeddings, such as the Word2Vec vectors trained on the Google News dataset (about 100 billion words); see the sketch after this list

·     Training the model on a larger dataset, possibly using an adversarial network to generate a large amount of additional training data

·     Currently the model is trained on only the first 500 words of each review/comment; this could be increased to 1,000 words

·     Leveraging Transformer-based models
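
To illustrate the pre-trained embedding idea from the list above, here is a sketch of loading the Google News Word2Vec vectors with gensim and using them to initialise the Embedding layer. The file name is the standard Word2Vec download; the vocabulary size and tokenizer name are illustrative assumptions:

import numpy as np
from gensim.models import KeyedVectors
from tensorflow.keras.initializers import Constant
from tensorflow.keras.layers import Embedding

# Load the pre-trained 300-dimensional Word2Vec vectors
w2v = KeyedVectors.load_word2vec_format(
    'GoogleNews-vectors-negative300.bin', binary=True)

VOCAB_SIZE = 10000  # illustrative; should match the tokenizer vocabulary
EMBED_DIM = 300

# Build an embedding matrix with one row per word in the tokenizer vocabulary
embedding_matrix = np.zeros((VOCAB_SIZE, EMBED_DIM))
for word, idx in tokenizer.word_index.items():  # `tokenizer` fitted on the reviews
    if idx < VOCAB_SIZE and word in w2v:
        embedding_matrix[idx] = w2v[word]

# Initialise the Embedding layer with the pre-trained vectors and freeze it
embedding_layer = Embedding(VOCAB_SIZE, EMBED_DIM,
                            embeddings_initializer=Constant(embedding_matrix),
                            trainable=False)

Setting trainable=False keeps the pre-trained vectors fixed; allowing them to fine-tune is also an option once the rest of the model has converged.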

Here is the link to the GitHub repository with the full code and test data: https://github.com/Ankit-DA/Sentiment_Analysis_Deep_Learning
