Recurrent Neural Networks
Mounic Madiraju
An RNN is a class of artificial neural network (ANN) whose major applications are in Natural Language Processing (NLP) and speech recognition. These networks are designed to identify sequential characteristics in data and use the detected patterns to predict what comes next.
RNNs are used in deep learning and in the design of models that attempt to simulate neural activity in the human brain. They are especially useful when context is crucial to predicting an outcome, and they differ from other artificial neural networks in that they incorporate feedback loops to process sequences of data, and those sequences inform the final output. The final output can itself be a sequence. The loops allow information to persist, an effect usually described as the network holding memory.
Working:
An RNN keeps a memory of the past, and its current decisions depend on that past information. The network takes one or more vectors as input and generates one or more output vectors. As in simple neural networks, these outputs are affected by the weights applied to the inputs, but they are also influenced by a state vector that represents the context accumulated from previous inputs or outputs. So the output for a given input varies depending on the earlier inputs in the sequence.
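As a rough illustration (the names and sizes below are illustrative assumptions, not something from the original text), a single recurrence step can be sketched in NumPy: the new hidden state mixes the current input with the previous state, which is how earlier inputs influence later outputs.

```python
# A minimal sketch of one RNN step; shapes and initialization are assumptions.
import numpy as np

def rnn_step(x_t, h_prev, W_xh, W_hh, b_h):
    """One recurrence step: h_t depends on the current input and the past state."""
    return np.tanh(x_t @ W_xh + h_prev @ W_hh + b_h)

rng = np.random.default_rng(0)
input_size, hidden_size, seq_len = 4, 8, 5
W_xh = rng.normal(size=(input_size, hidden_size)) * 0.1
W_hh = rng.normal(size=(hidden_size, hidden_size)) * 0.1
b_h = np.zeros(hidden_size)

h = np.zeros(hidden_size)                    # the "memory" starts empty
for t in range(seq_len):
    x_t = rng.normal(size=input_size)
    h = rnn_step(x_t, h, W_xh, W_hh, b_h)    # h carries context forward to the next step
print(h.shape)                               # (8,)
```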
Parameter Sharing in RNN:
Parameter sharing means that the same weights are reused by many units. In a convolutional layer, for example, a single filter is applied across the whole previous layer, so every unit in the resulting feature map uses the same weights; this reuse is why the technique is called "parameter sharing".
In RNNs, the same weights are applied to each item of the input sequence, again and again, so the parameters are shared across time steps. If these parameters were not shared, the RNN would behave like a vanilla feed-forward network in which each input position requires its own weights. That would constrain the input length to be fixed, making it impossible to handle sequences whose length is unknown or varies.
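A small sketch of what sharing means in practice (again with made-up sizes): one set of weight matrices is reused at every time step, so the same network can process sequences of any length.

```python
# Parameter sharing sketch: the same W_xh, W_hh, b_h are applied at every position,
# so sequences of different lengths need no additional parameters.
import numpy as np

rng = np.random.default_rng(1)
input_size, hidden_size = 4, 8
W_xh = rng.normal(size=(input_size, hidden_size)) * 0.1
W_hh = rng.normal(size=(hidden_size, hidden_size)) * 0.1
b_h = np.zeros(hidden_size)

def run_rnn(sequence):
    h = np.zeros(hidden_size)
    for x_t in sequence:                          # one shared set of parameters,
        h = np.tanh(x_t @ W_xh + h @ W_hh + b_h)  # applied at every time step
    return h

short_seq = rng.normal(size=(3, input_size))      # length 3
long_seq = rng.normal(size=(10, input_size))      # length 10
print(run_rnn(short_seq).shape, run_rnn(long_seq).shape)  # (8,) (8,)
```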
Elman networks and Jordan networks
These are also called "simple RNNs". An Elman network is a three-layer network augmented with a set of context units; the middle (hidden) layer is connected to these units with a fixed weight of one. A Jordan network, on the other hand, feeds its context units from the output layer rather than the hidden layer. The context units in these networks are also known as the state layer, and they connect to themselves recurrently.
In the case of Elman networks, the hidden state and output are computed as
h_t = σ_h(W_h x_t + U_h h_(t-1) + b_h) and y_t = σ_y(W_y h_t + b_y).
In the case of Jordan networks, the feedback comes from the previous output rather than the previous hidden state:
h_t = σ_h(W_h x_t + U_h y_(t-1) + b_h) and y_t = σ_y(W_y h_t + b_y),
where x_t is the input vector, h_t the hidden-layer vector, y_t the output vector, W, U and b the weight matrices and bias vector, and σ_h and σ_y the activation functions.
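To make the contrast concrete, here is a minimal sketch (sizes chosen arbitrarily, biases omitted) in which the Elman step feeds back the hidden state while the Jordan step feeds back the previous output.

```python
# Elman vs. Jordan recurrence: only the source of the context signal differs.
import numpy as np

rng = np.random.default_rng(2)
n_in, n_hid, n_out = 3, 6, 2
W_h = rng.normal(size=(n_in, n_hid)) * 0.1
U_h = rng.normal(size=(n_hid, n_hid)) * 0.1   # Elman: hidden state -> hidden state
U_y = rng.normal(size=(n_out, n_hid)) * 0.1   # Jordan: previous output -> hidden state
W_y = rng.normal(size=(n_hid, n_out)) * 0.1

def elman_step(x_t, h_prev):
    h_t = np.tanh(x_t @ W_h + h_prev @ U_h)
    return h_t, h_t @ W_y                      # next context is the hidden state h_t

def jordan_step(x_t, y_prev):
    h_t = np.tanh(x_t @ W_h + y_prev @ U_y)
    return h_t, h_t @ W_y                      # next context is the output y_t

h, y = np.zeros(n_hid), np.zeros(n_out)
for x_t in rng.normal(size=(4, n_in)):
    h, _ = elman_step(x_t, h)                  # Elman carries h forward
    _, y = jordan_step(x_t, y)                 # Jordan carries y forward
```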
Hopfield Network
In this type of RNN, the connections are symmetric. Stationary inputs are required because this network does not process sequences of data. Convergence to a stable state is guaranteed.
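A toy sketch of the idea (the stored patterns here are arbitrary): symmetric Hebbian weights, bipolar states, and asynchronous updates that settle into a fixed point, typically one of the stored patterns.

```python
# Toy Hopfield network: store two patterns, then recover one from a noisy state.
import numpy as np

patterns = np.array([[1, -1, 1, -1, 1, -1],
                     [1, 1, 1, -1, -1, -1]])
n = patterns.shape[1]
W = sum(np.outer(p, p) for p in patterns).astype(float)
np.fill_diagonal(W, 0)                          # symmetric weights, no self-connections

state = np.array([1, -1, 1, -1, -1, -1])        # noisy version of the first pattern
for _ in range(10):                             # a few asynchronous sweeps
    prev = state.copy()
    for i in range(n):                          # update one neuron at a time
        state[i] = 1 if W[i] @ state >= 0 else -1
    if np.array_equal(prev, state):             # settled into a fixed point
        break
print(state)
```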
IndRNN (Independently RNN)
These networks address the vanishing and exploding gradient problems of fully connected RNNs. Each neuron is independent of the others and receives only its own past state as context information. The backpropagated gradient can be regularized to avoid vanishing and exploding, so both short- and long-term memories can be kept. An IndRNN can be trained robustly with non-saturated nonlinear activation functions such as ReLU, and deep networks can be trained using skip connections.
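A minimal sketch of the IndRNN recurrence (sizes and initialization chosen arbitrarily): each hidden unit receives only its own previous value through an element-wise recurrent weight, combined with a ReLU activation.

```python
# IndRNN-style recurrence: u * h is element-wise, not a matrix product,
# so each hidden unit only sees its own past value.
import numpy as np

rng = np.random.default_rng(3)
n_in, n_hid, seq_len = 4, 8, 6
W = rng.normal(size=(n_in, n_hid)) * 0.1
u = rng.uniform(0.0, 1.0, size=n_hid)   # per-unit recurrent weight (can be constrained
b = np.zeros(n_hid)                      # to help control vanishing/exploding gradients)

h = np.zeros(n_hid)
for x_t in rng.normal(size=(seq_len, n_in)):
    h = np.maximum(0.0, x_t @ W + u * h + b)   # ReLU activation
print(h.shape)                                 # (8,)
```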
Continuous Time RNNs
A continuous-time recurrent neural network (CTRNN) uses a system of ordinary differential equations to model the effect of an incoming spike train on a neuron.
For a neuron i with activation y_i, the rate of change of activation is given by
τ_i · dy_i/dt = -y_i + Σ_j w_ji · σ(y_j − Θ_j) + I_i(t),
where τ_i is the neuron's time constant, w_ji is the weight of the connection from neuron j to neuron i, σ is a sigmoid nonlinearity, Θ_j is the bias of neuron j, and I_i(t) is the external input.
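As a rough illustration (the parameters below are chosen arbitrarily), the equation above can be simulated with simple Euler integration:

```python
# Euler integration of the CTRNN equation for a small network of 3 neurons.
import numpy as np

def sigma(z):
    return 1.0 / (1.0 + np.exp(-z))

rng = np.random.default_rng(4)
n, dt, steps = 3, 0.01, 1000
tau = np.array([1.0, 0.5, 2.0])        # time constants tau_i
W = rng.normal(size=(n, n))            # W[i, j] holds w_ji, the weight from neuron j to i
theta = np.zeros(n)                    # biases Theta_j
I = np.array([0.5, 0.0, 0.0])          # constant external input I_i

y = np.zeros(n)
for _ in range(steps):
    dydt = (-y + W @ sigma(y - theta) + I) / tau   # the differential equation above
    y = y + dt * dydt                              # Euler step
print(y)
```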
Libraries
There are a number of libraries available for building RNNs, the most common of which are:
- Apache Singa
- Caffe: Developed in C++, it supports both GPU and CPU and has wrappers for MATLAB and Python.
- Deeplearning4j: A library built for deep learning in Java and Scala. It runs on both CPU and GPU, allows the creation of custom layers, and integrates with Hadoop and Kafka.
- Microsoft Cognitive Toolkit (CNTK)
- PyTorch: Developed for Python, it offers strong GPU acceleration and supports tensors and dynamic neural networks (see the small sketch below).
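As an example of how little code a basic RNN takes with one of these libraries, here is a small PyTorch usage sketch (the sizes are arbitrary):

```python
# Running a batch of sequences through PyTorch's built-in nn.RNN layer.
import torch
import torch.nn as nn

rnn = nn.RNN(input_size=10, hidden_size=20, num_layers=1, batch_first=True)
x = torch.randn(4, 7, 10)          # batch of 4 sequences, 7 time steps, 10 features each
output, h_n = rnn(x)               # output: hidden state at every step; h_n: final state
print(output.shape, h_n.shape)     # torch.Size([4, 7, 20]) torch.Size([1, 4, 20])
```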
Applications of RNNs
A few of the many applications of RNNs include robot control, machine translation, speech recognition and synthesis, anomaly detection in time series, music composition, action recognition, and prediction of medical care pathways.