
What is an RNN?

A recurrent neural network, or RNN, is a deep neural network trained on
sequential or time series data to create a machine learning model that can
make sequential predictions or conclusions based on sequential inputs.
An RNN might be used to predict daily flood levels based on past daily flood,
tide, and meteorological data. But RNNs can also be used to solve ordinal or
temporal problems such as language translation, natural language processing
(NLP), speech recognition, and image captioning. RNNs are incorporated into
popular applications such as Siri, voice search, and Google Translate.

How does a recurrent neural network work?



RNNs are made of neurons: data-processing nodes that work together to
perform complex tasks. The neurons are organized as input, output, and
hidden layers. The input layer receives the information to process, and the
output layer provides the result. Data processing, analysis, and prediction take
place in the hidden layer.
Hidden layer
RNNs work by passing the sequential data that they receive to the hidden
layers one step at a time. However, they also have a self-looping
or recurrent workflow: the hidden layer keeps previous inputs in a short-term
memory component and uses them for future predictions. It combines the current
input with the stored memory to predict the next element in the sequence.
For example, consider the sequence: Apple is red. You want the RNN to
predict red when it receives the input sequence Apple is. When the hidden
layer processes the word Apple, it stores a copy in its memory. Next, when it
sees the word is, it recalls Apple from its memory and now has the full
sequence Apple is as context. It can then predict red with improved accuracy.
This makes RNNs useful in speech recognition, machine translation, and other
language modeling tasks.
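
To make the recurrence concrete, here is a minimal sketch in plain NumPy. The
weight names (Wxh, Whh, bh) and sizes are illustrative assumptions, not from
the article: each step mixes the current input with the previous hidden state,
which acts as the short-term memory.

import numpy as np

def rnn_step(x, h_prev, Wxh, Whh, bh):
    """One recurrent step: combine the current input with the stored memory."""
    return np.tanh(Wxh @ x + Whh @ h_prev + bh)

# Toy sizes (illustrative): 4-dimensional inputs, 8-dimensional hidden state.
rng = np.random.default_rng(0)
input_size, hidden_size = 4, 8
Wxh = rng.normal(0, 0.1, (hidden_size, input_size))
Whh = rng.normal(0, 0.1, (hidden_size, hidden_size))
bh = np.zeros(hidden_size)

# Unroll over a short sequence, e.g. the three tokens of "Apple is red".
sequence = [rng.normal(size=input_size) for _ in range(3)]
h = np.zeros(hidden_size)             # empty memory before the first word
for x in sequence:
    h = rnn_step(x, h, Wxh, Whh, bh)  # h now summarizes everything seen so far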

Training
Machine learning (ML) engineers train deep neural networks like RNNs by
feeding the model with training data and refining its performance. In ML, a
neuron's weights determine how influential the information learned during
training is when predicting the output. Every time step in an unrolled RNN
shares the same weights.
ML engineers adjust weights to improve prediction accuracy. They use a
technique called backpropagation through time (BPTT) to calculate the model
error and adjust the weights accordingly. BPTT unrolls the network and
propagates the error backward from the output through each earlier time step.
This way, it can identify which hidden state in the sequence is causing a
significant error and readjust the weights to reduce the error margin.
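
The following NumPy sketch shows the idea of BPTT on a tiny next-token task.
All names and sizes here are illustrative assumptions: the forward pass stores
every hidden state, and the backward pass walks the sequence in reverse,
accumulating gradients for the shared weights.

import numpy as np

rng = np.random.default_rng(0)
vocab, hidden = 5, 16                      # toy vocabulary and hidden size
Wxh = rng.normal(0, 0.1, (hidden, vocab))  # input-to-hidden weights (shared)
Whh = rng.normal(0, 0.1, (hidden, hidden)) # hidden-to-hidden weights (shared)
Why = rng.normal(0, 0.1, (vocab, hidden))  # hidden-to-output weights
bh, by = np.zeros(hidden), np.zeros(vocab)

def one_hot(i):
    v = np.zeros(vocab); v[i] = 1.0
    return v

def bptt(inputs, targets):
    """Forward pass, then backpropagation through time; returns loss and grads."""
    hs = {-1: np.zeros(hidden)}
    xs, ps, loss = {}, {}, 0.0
    for t, (i, tgt) in enumerate(zip(inputs, targets)):    # forward through time
        xs[t] = one_hot(i)
        hs[t] = np.tanh(Wxh @ xs[t] + Whh @ hs[t - 1] + bh)
        y = Why @ hs[t] + by
        ps[t] = np.exp(y - y.max()); ps[t] /= ps[t].sum()  # softmax
        loss -= np.log(ps[t][tgt])                         # cross-entropy loss
    dWxh, dWhh, dWhy = np.zeros_like(Wxh), np.zeros_like(Whh), np.zeros_like(Why)
    dbh, dby = np.zeros_like(bh), np.zeros_like(by)
    dh_next = np.zeros(hidden)
    for t in reversed(range(len(inputs))):                 # backward through time
        dy = ps[t].copy(); dy[targets[t]] -= 1             # softmax gradient
        dWhy += np.outer(dy, hs[t]); dby += dy
        dh = Why.T @ dy + dh_next                          # error from later steps
        dh_raw = (1 - hs[t] ** 2) * dh                     # through tanh
        dbh += dh_raw
        dWxh += np.outer(dh_raw, xs[t])
        dWhh += np.outer(dh_raw, hs[t - 1])
        dh_next = Whh.T @ dh_raw                           # pass error further back
    return loss, (dWxh, dWhh, dWhy, dbh, dby)

loss, grads = bptt([0, 1, 2, 3], [1, 2, 3, 4])  # predict each next token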

What are the types of recurrent neural networks?


RNNs are often characterized by a one-to-one architecture: one input is
associated with one output. However, you can flexibly adjust them into
various configurations for specific purposes. The following are several common
RNN types.
One-to-many
This RNN type channels one input to several outputs. It enables linguistic
applications like image captioning by generating a sentence from a single
keyword.

Many-to-many
The model uses multiple inputs to predict multiple outputs. For example, you
can create a language translator with an RNN, which analyzes a sentence and
correctly structures the words in a different language.

Many-to-one
Several inputs are mapped to an output. This is helpful in applications like
sentiment analysis, where the model predicts customers’ sentiments
like positive, negative, and neutral from input testimonials.
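Under the same recurrence, these configurations differ mainly in which inputs
are fed and which hidden states are read out. A rough NumPy sketch, with
illustrative readout weights that are assumptions for demonstration:

import numpy as np

def rnn_step(x, h, Wxh, Whh, bh):
    return np.tanh(Wxh @ x + Whh @ h + bh)

# Illustrative sizes and weights (assumptions, not from the article).
rng = np.random.default_rng(0)
d_in, d_h, d_out = 4, 8, 3
Wxh = rng.normal(0, 0.1, (d_h, d_in))
Whh = rng.normal(0, 0.1, (d_h, d_h))
Why = rng.normal(0, 0.1, (d_out, d_h))
bh = np.zeros(d_h)

xs = [rng.normal(size=d_in) for _ in range(5)]  # a 5-step input sequence

# Many-to-one (e.g. sentiment analysis): read out only the final hidden state.
h = np.zeros(d_h)
for x in xs:
    h = rnn_step(x, h, Wxh, Whh, bh)
sentiment_scores = Why @ h           # one output for the whole sequence

# Many-to-many (e.g. per-step tagging): read out at every step.
h, outputs = np.zeros(d_h), []
for x in xs:
    h = rnn_step(x, h, Wxh, Whh, bh)
    outputs.append(Why @ h)          # one output per input step

# One-to-many (e.g. captioning from one keyword): feed one input, then zeros.
h, caption = np.zeros(d_h), []
x = xs[0]                            # the single input
for _ in range(5):
    h = rnn_step(x, h, Wxh, Whh, bh)
    caption.append(Why @ h)
    x = np.zeros(d_in)               # subsequent steps have no new input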
How do recurrent neural networks compare to other
deep learning networks?
RNNs are one of several different neural network architectures.
Recurrent neural network vs. feed-forward neural network
Like RNNs, feed-forward neural networks are artificial neural networks that
pass information from one end to the other end of the architecture. A feed-
forward neural network can perform simple classification, regression, or
recognition tasks, but it can’t remember the previous input that it has
processed. For example, it forgets Apple by the time its neuron processes the
word is. The RNN overcomes this memory limitation by including a hidden
memory state in the neuron.
Recurrent neural network vs. convolutional neural network
Convolutional neural networks are artificial neural networks that are designed
to process spatial data. You can use convolutional neural networks to extract
spatial information from videos and images by passing them through a series of
convolutional and pooling layers in the neural network. RNNs are designed to
capture long-term dependencies in sequential data.

Common activation functions


As discussed in the Learn article on Neural Networks, an activation function
determines whether a neuron should be activated. These nonlinear functions
typically convert the output of a given neuron to a value between 0 and 1 or
between -1 and 1.
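As a quick illustration in NumPy (the values shown are approximate), the
sigmoid squashes outputs into (0, 1) and tanh into (-1, 1):

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))    # output in (0, 1)

z = np.array([-5.0, 0.0, 5.0])
print(sigmoid(z))    # approx. [0.0067 0.5    0.9933]
print(np.tanh(z))    # approx. [-0.9999 0.    0.9999], output in (-1, 1)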
Variant RNN architectures
Popular RNN architecture variants include:
• Bidirectional recurrent neural networks (BRNNs)
• Long short-term memory (LSTM)
• Gated recurrent units (GRUs)
1. Bidirectional recurrent neural networks (BRNNs)

While unidirectional RNNs can only draw from previous inputs to make
predictions about the current state, bidirectional RNNs, or BRNNs, also pull in
future data to improve their accuracy. Consider the phrase "feeling under the
weather": a model based on a BRNN can better predict that the second word in
that phrase is "under" if it knows that the last word in the sequence is
"weather."
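
A minimal sketch of the bidirectional idea in NumPy (the separate forward and
backward weight sets are illustrative assumptions): one pass reads the sequence
left to right, another right to left, and each position concatenates both views.

import numpy as np

def rnn_step(x, h, Wxh, Whh, bh):
    return np.tanh(Wxh @ x + Whh @ h + bh)

rng = np.random.default_rng(0)
d_in, d_h = 4, 8
fwd = [rng.normal(0, 0.1, (d_h, d_in)), rng.normal(0, 0.1, (d_h, d_h)), np.zeros(d_h)]
bwd = [rng.normal(0, 0.1, (d_h, d_in)), rng.normal(0, 0.1, (d_h, d_h)), np.zeros(d_h)]

xs = [rng.normal(size=d_in) for _ in range(5)]

h, hs_fwd = np.zeros(d_h), []
for x in xs:                      # left-to-right pass: past context
    h = rnn_step(x, h, *fwd)
    hs_fwd.append(h)

h, hs_bwd = np.zeros(d_h), []
for x in reversed(xs):            # right-to-left pass: future context
    h = rnn_step(x, h, *bwd)
    hs_bwd.append(h)
hs_bwd.reverse()                  # realign with the original order

# Each position now sees both past and future context.
states = [np.concatenate([f, b]) for f, b in zip(hs_fwd, hs_bwd)]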
2. Long short-term memory (LSTM)

LSTM is a popular RNN architecture, introduced by Sepp Hochreiter and Juergen
Schmidhuber as a solution to the vanishing gradient problem. In their paper,
they address the problem of
long-term dependencies. That is, if the previous state that is influencing the
current prediction is not in the recent past, the RNN model may not be able to
accurately predict the current state.
As an example, let's say we wanted to predict the italicized words in the following:
“Alice is allergic to nuts. She can’t eat peanut butter.” The context of a nut
allergy can help us anticipate that the food that cannot be eaten contains nuts.
However, if that context was a few sentences prior, then it would make it
difficult, or even impossible, for the RNN to connect the information.
To remedy this, LSTMs have “cells” in the hidden layers of the neural network,
which have three gates: an input gate, an output gate, and a forget gate. These
gates control the flow of information that is needed to predict the output in
the network. For example, if a gender pronoun, such as "she," was repeated
multiple times in prior sentences, the network may exclude it from the cell state.
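
A minimal sketch of one LSTM cell step in NumPy (weight names and sizes are
illustrative assumptions): the forget gate decides what to drop from the cell
state, the input gate what to add, and the output gate what to expose.

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def lstm_step(x, h, c, W, b):
    """One LSTM step. W maps [x; h] to the four stacked gate pre-activations."""
    z = W @ np.concatenate([x, h]) + b
    d = len(h)
    i = sigmoid(z[0 * d:1 * d])       # input gate: what new info to write
    f = sigmoid(z[1 * d:2 * d])       # forget gate: what to erase from memory
    o = sigmoid(z[2 * d:3 * d])       # output gate: what to expose
    g = np.tanh(z[3 * d:4 * d])       # candidate values to write
    c = f * c + i * g                 # updated cell state (long-term memory)
    h = o * np.tanh(c)                # updated hidden state (visible output)
    return h, c

rng = np.random.default_rng(0)
d_in, d_h = 4, 8
W = rng.normal(0, 0.1, (4 * d_h, d_in + d_h))
b = np.zeros(4 * d_h)

h, c = np.zeros(d_h), np.zeros(d_h)
for x in [rng.normal(size=d_in) for _ in range(5)]:
    h, c = lstm_step(x, h, c, W, b)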
3. Gated recurrent units (GRUs)

A GRU is similar to an LSTM as it also works to address the short-term memory
problem of RNN models. Instead of using a "cell state" to regulate information,
it uses hidden states, and instead of three gates, it has two: a reset gate and an
update gate. Similar to the gates within LSTMs, the reset and update gates
control how much and which information to retain.
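
A matching sketch of one GRU step (again with illustrative names and sizes),
following the convention where the update gate blends old memory with the
candidate state:

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def gru_step(x, h, Wz, Wr, Wh, bz, br, bh):
    """One GRU step: two gates, and the hidden state itself is the memory."""
    z = sigmoid(Wz @ np.concatenate([x, h]) + bz)            # update gate
    r = sigmoid(Wr @ np.concatenate([x, h]) + br)            # reset gate
    h_cand = np.tanh(Wh @ np.concatenate([x, r * h]) + bh)   # candidate state
    return (1 - z) * h + z * h_cand   # blend old memory with the candidate

rng = np.random.default_rng(0)
d_in, d_h = 4, 8
Wz, Wr, Wh = (rng.normal(0, 0.1, (d_h, d_in + d_h)) for _ in range(3))
bz, br, bh = np.zeros(d_h), np.zeros(d_h), np.zeros(d_h)

h = np.zeros(d_h)
for x in [rng.normal(size=d_in) for _ in range(5)]:
    h = gru_step(x, h, Wz, Wr, Wh, bz, br, bh)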
