登录查看更多内容

Deep Learning - Long Short Term Memory(LSTM)

Varun Machupally

Employee at Deloitte

发布日期: 2022年4月29日

INTRODUCTION

We know that Traditional RNNs are not good at capturing Long-range dependencies. When we are working with a huge dataset and multiple RNN layers we are at a vanishing gradient problem. To overcome these issues Long Short Term Memory is introduced. LSTM is nothing but an Artificial Recurrent Neural Network used in the field of DeepLearning. Unlike standard feedforward neural networks. LSTM has a feedback connection. It cannot only process single data points but also an entire sequence of data.

LSTM

LSTM is capable of capturing long-range dependencies and also capable of remembering RNNs weights and their inputs over a very long period of time. It can store the previous inputs for a very extended time duration. LSTM does this by using three gates:

Forget gate: It removes the information that is no longer useful in the cell state.
Input gate: Additional information in the cell state is added by the input gate.
Output gate: It also adds some additional information to the cell state.

This gating mechanism of LSTM has allowed the network to learn the condition for when to remove or ignore or keep the information in the memory cell.

LSTM Usecases

Apple is the first major tech company to integrate smart assistance in its operating system. Siri was actually a by-product of some other company, Siri was the company's adaption of a standalone app it had purchased along with creators.

领英推荐

How to optimize an AI algorithm

Algolia 1 年前

Unlocking the Potential of Pre-Trained Models

Bhuwan Mittal 1 年前

The Game-Changer in Deep Learning: Transformers

Abhishek Srivastav 5 个月前

Google implemented google voice search, compared to deep neural network LSTM RNN have additional recurrent connections and memory cell that allows them to remember the previous data.

Real-Time applications of LSTM

Name Entity Recognition: It is an NLP task that seems to locate and classify named entities mentioned in unstructured text into predefined categories.

Sentiment Analysis: It is contextual mining of text which identifies and extracts subjective information in the source material.

Machine Translation: NLP is a subfield of linguistics, in particular how to program computers to process and analyze large amounts of Natural Language data.

References:

https://goo.gl/cck4hE

https://www.analyticsvidhya.com/blog/2017/12/fundamentals-of-deep-learning-introduction-to-lstm/

Deep Learning - Long Short Term Memory(LSTM)

Varun Machupally

Employee at Deloitte

领英推荐

社区洞察

其他会员也浏览了

Foundation Models

How Transformers work in deep learning and NLP: an intuitive introduction?

BxD Primer Series: Transfer Learning Techniques

How Transformer Models Compare to Traditional RNNs in Sequence-to-Sequence Tasks

What content my trained model have?

Why "Attention is All you need" in Machine Learning

Deep Learning with a tale of two cities (Part VI/IX): the time

Classifying Short-Form Text With Neural Networks – Our Journey So Far

Role of Sparse matrix in machine learning