LSTM Presentation
LSTM Presentation
4
Long Short Term
Memory (LSTM) Model
Group Members
Burhan Ahmed(bai222001)
Saif Ur Rehman(bcs222001)
↙
Halima Sadia(bse222002)
CONTENT
O O
Introduction to LSTM Features of LSTM
1 2
O
3
O
4
↖ Introduction
1
↗ To LSTM
↘ Overview of Long Short-Term Memory Networks
↙
What is LSTM?
LSTM is a type of recurrent neural network (RNN) that can
process and analyze sequential data, such as text, speech,
and time series.
They use a memory cell and gates to control the flow of
information.
Memory cell stores information from previous time steps and
uses it to influence the output of the cell at the current time
step.
The output of each LSTM cell is passed to the next cell in the
network, allowing the LSTM to process and analyze
sequential data over multiple time steps.
LSTM
HISTORY
1990: LSTM concept proposed by Sepp Hochreiter
and Jürgen Schmidhuber.
1997: Paper published explaining the design with
input, forget, and output gates.
2015: Rise of Attention Mechanisms and
Transformers challenging LSTMs.
2020: New architectures and training algorithms.
2021: Introduction of Corrector LSTM for accurate
predictions.
2022: NXAI invented xLSTM (Extended LSTM) with
billions of parameters.
Recurrent Neural
Network RNN
What is RNN? Basic Structure
Definition: Recurrent Neural Networks
(RNNs) are neural networks designed to
process sequential data with temporal
dependencies.
Key Characteristics:
Analyze data with a temporal dimension
(e.g., time-series, speech, text).
Use a hidden state passed from one
timestep to the next.
Hidden state updates based on current
input and previous hidden state.
Strengths: Excellent at capturing short-term
dependencies.
Challenges: Struggle to handle long-term
dependencies due to vanishing or exploding
gradients.
Vanilla RNN
↙
PROBLEMS IN RNN
Long Term Dependency Issue in RNN
↙
Structure of LSTM
Structure of LSTM
Memory Cell
Memory cells maintain information across time
steps, enabling LSTMs to learn and utilize long-
term dependencies in data.
The content of the memory cell is updated or
modified by the interaction of three gates:
Input Gate: Determines what new information to
add to the memory.
Forget Gate: Decides which information to erase.
Output Gate: Controls what part of the memory is
used for the current output.
Memory Cell
LSTM Memory Cell
Input Gate