
Neural Networks & Deep Learning

Unit-6
Dr. D. SUDHEER
Assistant Professor
Department of CSE
VNR VJIET (NAAC: A++, NIRF: 113)
Hyderabad, Telangana.



How to apply NN over an Image?
Multi-layer Neural Network & Image
• Stretch the pixels into a single column vector.

Problems?
• High dimensionality
• Loss of local relationships between pixels
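As a rough illustration of both problems (the 32x32 RGB image and the 100 hidden units below are assumptions, not values from the slides):

import numpy as np

image = np.random.rand(32, 32, 3)   # a small hypothetical RGB image
x = image.reshape(-1)                # stretch the pixels into a single column vector
print(x.shape)                       # (3072,) -> 3072 input neurons

hidden_units = 100
# A fully connected first layer alone needs 3072 * 100 weights,
# and flattening discards which pixels were neighbours in the image.
print(x.shape[0] * hidden_units)     # 307200 parameters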


Solutions?


Convolutional Neural Networks
• Also known as
CNN, ConvNet, DCN
• CNN = a multi-layer neural network with
1. Local connectivity 2. Weight sharing



Convolutional Neural Network (CNN)


For the convolution and pooling operations, open the "CNN layers unit5.pdf" file.


CNN Local and Global connectivity
Input neurons: 7, Hidden units: 3

Number of parameters:
• Global connectivity: 3 * 7 = 21
• Local connectivity: 3 * 3 = 9


CNN Local and Global connectivity
Input neurons: 7, Hidden units: 3

Number of parameters:
• Without weight sharing: 3 * 3 = 9
• With weight sharing: 3 * 1 = 3
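A minimal NumPy sketch of the same counting argument, assuming a shared filter of size 3 slid with stride 2 so that three local windows cover the seven inputs (the stride and the weight values are illustrative assumptions):

import numpy as np

x = np.arange(7.0)                       # 7 input neurons
w_shared = np.array([0.2, 0.5, 0.3])     # one shared filter of size 3 -> only 3 parameters

hidden = np.array([np.dot(w_shared, x[i:i + 3])   # each hidden unit sees a local 3-neuron window
                   for i in (0, 2, 4)])            # window starts for stride 2
print(hidden)                            # 3 hidden units computed from 3 shared weights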


Layers in CNN

• Input layer (e.g., the input image)
• Convolution layer
• Non-linearity layer
• Pooling layer
• Fully connected layer
• Classification layer
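The layer order listed above can be sketched in Keras roughly as follows (the 28x28x1 input size, the filter counts, and the 10-class output are illustrative assumptions, not values from the slides):

from tensorflow.keras import layers, models

model = models.Sequential([
    layers.Input(shape=(28, 28, 1)),        # input layer (e.g. a grayscale input image)
    layers.Conv2D(16, (3, 3)),              # convolution layer
    layers.Activation("relu"),              # non-linearity layer
    layers.MaxPooling2D((2, 2)),            # pooling layer
    layers.Flatten(),
    layers.Dense(64, activation="relu"),    # fully connected layer
    layers.Dense(10, activation="softmax"), # classification layer
])
model.summary()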



3D ConvNet



Figure 3: Difference between 2D convolution and 3D convolution [2]


Difference: 2D convolution and 3D convolution
• 2D convolution applied on an image will output an image.
• 2D convolution applied on multiple images (treating them as different channels) also results in an image.
• Hence, 2D ConvNets lose the temporal information of the input signal right after every convolution operation.
• Only 3D convolution preserves the temporal information of the input signals, resulting in an output volume.
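A small Keras sketch of this shape difference (the 16-frame, 112x112 clip size is an assumption for illustration):

import tensorflow as tf
from tensorflow.keras import layers

clip = tf.random.normal((1, 16, 112, 112, 3))                    # batch, frames, height, width, channels

out_2d = layers.Conv2D(8, (3, 3), padding="same")(clip[:, 0])    # one frame in -> an image out
print(out_2d.shape)                                              # (1, 112, 112, 8): no time axis left

out_3d = layers.Conv3D(8, (3, 3, 3), padding="same")(clip)       # whole clip in -> a volume out
print(out_3d.shape)                                              # (1, 16, 112, 112, 8): time axis preserved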



Batch normalization and layers:
• To accelerate training in CNNs we can normalize the activations of the previous
layer at each batch.
• This technique applies a transformation that keeps the mean activation close to
0.0 while also keeping the activation standard deviation close to 1.0.
• By applying normalization for each training mini-batch of input records, we can
use much higher learning rates.
• Batch normalization also reduces the sensitivity of training toward weight
initialization and acts as a regularizer.
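A minimal Keras sketch of where a batch-normalization layer is typically placed (the layer sizes are illustrative assumptions, not values from the slides):

from tensorflow.keras import layers, models

model = models.Sequential([
    layers.Input(shape=(28, 28, 1)),
    layers.Conv2D(16, (3, 3)),
    layers.BatchNormalization(),     # keeps mean activation near 0.0 and std near 1.0 per mini-batch
    layers.Activation("relu"),
])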
Fully Connected Layers:
• We use this layer to compute class scores that we’ll use as output of the
network.
• Fully connected layers perform transformations on the input data volume that
are a function of the activations in the input volume and the parameters.
Applications of CNN:
• MRI data
• 3D shape data
• Graph data
• NLP applications
Recurrent Neural Networks

• Historically, these networks have been difficult to train, but more recently,
advances in research (optimization, network architectures, parallelism, and
graphics processing units [GPUs]) have made them more approachable for the
practitioner.
• Recurrent Neural Networks take each vector from a sequence of input
vectors and model them one at a time.
• Modeling the time dimension is a hallmark of Recurrent Neural Networks.
Modeling the Time Dimension:
• Recurrent Neural Networks are considered Turing complete and can simulate
arbitrary programs (with weights).
• Recurrent neural networks are well suited for modeling functions for which
the input and/or output is composed of vectors that involve a time dependency
between the values.
• Recurrent neural networks model the time aspect of data by creating cycles
in the network (hence, the "recurrent" part of the name).
Lost in Time:
• Many classification tools (support vector machines, logistic regression, and
regular feed-forward networks) have been applied successfully without
modeling the time dimension, assuming independence.
• Other variations of these tools capture the time dynamic by modeling a
sliding window of the input (e.g., the previous, current, and next input together
as a single input vector).
• A drawback of these tools is that assuming independence in the time
connection between model inputs does not allow our model to capture long-
range time dependencies.
• Sliding window techniques have a limited window width and will fail to
capture any effects larger than the fixed window size.
• A good example is automated replies generated by machines for conversations over time.



Temporal feedback and loops in connections:
• Recurrent Neural Networks can have loops in the connections.
• This allows them to model temporal behavior and gain accuracy in domains such
as time-series, language, audio, and text.
• Data in these domains are inherently ordered and context sensitive where later
values depend on previous ones.
• A Recurrent Neural Network includes a feedback loop that it uses to learn
from sequences, including sequences of varying lengths.
• Recurrent Neural Networks contain an extra parameter matrix for the
connections between time-steps, which are used/trained to capture the temporal
relationships in the data.
• Recurrent Neural Networks are trained to generate sequences, in which the
output at each time-step is based on both the current input and the input at all
previous time steps.
• Recurrent Neural Networks compute a gradient with an algorithm called
backpropagation through time (BPTT).
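A minimal NumPy sketch of this extra time-step parameter matrix (the sizes and the tanh non-linearity are assumptions for illustration, not the slides' own model):

import numpy as np

input_size, hidden_size = 4, 3
W_x = np.random.randn(hidden_size, input_size)    # input-to-hidden weights
W_h = np.random.randn(hidden_size, hidden_size)   # extra matrix for connections between time-steps
b = np.zeros(hidden_size)

h = np.zeros(hidden_size)                         # initial hidden state
for x_t in np.random.randn(5, input_size):        # 5 time-steps
    h = np.tanh(W_x @ x_t + W_h @ h + b)          # output depends on the current input and all past inputs
print(h)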
Applications for Sequences and time-series data:

• Image captioning
• Speech synthesis
• Music generation
• Playing video games
• Language modeling
• Character-level text generation models

Understanding model input and output:


• Recurrent Neural Networks replace the fixed-size input with a dynamic one:
multiple input vectors, one for each time-step, and each vector can have many
columns.
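A small sketch of this dynamic input shape in Keras (the batch of 2 sequences, 10 time-steps, and 8 features per step are illustrative assumptions):

import numpy as np
from tensorflow.keras import layers

x = np.random.rand(2, 10, 8).astype("float32")    # (sequences, time-steps, features per step)
out = layers.SimpleRNN(16)(x)                     # many-to-one: one output vector per sequence
print(out.shape)                                  # (2, 16)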



• One-to-many: sequence output. For example, image captioning takes an
image and outputs a sequence of words.
• Many-to-one: sequence input. For example, sentiment analysis, where a given
sentence is the input.
• Many-to-many: sequence input and sequence output. For example, video
classification: label each frame.



Traditional RNN



LSTM Neuron


• It is useful to remember past data along with the present data when making a
decision.
• For example, in a sentence the beginning words can be more important than
the last words for understanding the meaning.
• An LSTM stores all the words along with the recent words to make its decision.

LSTM = Long-term memory + Short-term memory


• Long-term memory represents all the words starting from the first word.
• Short-term memory represents the recent words from the past state of the model.
• When the LSTM keeps on storing data, it may reach a point where it cannot store
any more.
• It will remove the unwanted information from time to time.
• The removing or keeping of data is implemented by logic gates.

Figure: LSTM gates: (1) the forget gate forgets irrelevant information; (2) the input
gate adds new updated information; (3) the output gate passes on the updated
information.
Layers of RNN
There are two important layers: 1. Embedding  2. LSTM
1. Embedding
• It is useful for converting positive integers into vectors of values.
• A fixed range of input values should be provided to this layer.
• It is especially useful in language translation to capture the meaning of words.

Embedding(input_dim, output_dim, input_length)
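A hedged Keras example of this signature (the vocabulary size of 1000, the 64-dimensional output vectors, and the length-10 input are assumptions; newer Keras versions infer the input length, so the input_length argument can be omitted):

import numpy as np
from tensorflow.keras import layers

embed = layers.Embedding(input_dim=1000, output_dim=64)     # input_length is optional in newer Keras
word_ids = np.array([[4, 17, 250, 3, 0, 0, 0, 0, 0, 0]])    # positive integers (word indices), length 10
vectors = embed(word_ids)
print(vectors.shape)                                         # (1, 10, 64): one 64-value vector per word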



LSTM:
• The LSTM network is different from a classical MLP.
• Input data is propagated through the network in order to make a prediction.
• Like RNNs, the LSTMs have recurrent connections so that the state from
previous activations of the neuron from the previous time step is used as
context for formulating an output.
• But unlike other RNNs, the LSTM has a unique formulation that allows it to
avoid the problems that prevent the training and scaling of other RNNs.
• LSTM overcomes problems such as vanishing gradients and exploding
gradients.
LSTM Gates
• Forget Gate: Decides what information to discard from the cell.
• Input Gate: Decides which values from the input to update the memory
state.
• Output Gate: Decides what to output based on the input and the memory of the
cell.
• The forget gate and input gate are used in the updating of the internal state.
• The output gate is a final limiter on what the cell actually outputs.
• It is these gates and the consistent data flow, called the constant error
carrousel or CEC, that keep each cell stable (neither exploding nor vanishing).
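A conceptual NumPy sketch of one LSTM step with these three gates (all weights are random placeholders and biases are omitted; a real LSTM learns these parameters during training):

import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

hidden = 4
x_t = np.random.randn(hidden)              # current input (same size as the state, for simplicity)
h_prev = np.zeros(hidden)                  # previous output
c_prev = np.zeros(hidden)                  # previous cell (memory) state
W = {g: np.random.randn(hidden, 2 * hidden) for g in ("f", "i", "o", "c")}
v = np.concatenate([h_prev, x_t])

f = sigmoid(W["f"] @ v)                    # forget gate: what to discard from the cell
i = sigmoid(W["i"] @ v)                    # input gate: which new values to write to the state
o = sigmoid(W["o"] @ v)                    # output gate: what the cell actually outputs
c = f * c_prev + i * np.tanh(W["c"] @ v)   # updated internal state (the constant error carrousel path)
h = o * np.tanh(c)                         # cell output at this time-step
print(h)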

Applications of LSTM:
• Image caption generation.
• Text translation.
• Handwriting recognition.
Limitations of LSTM:
• In time series forecasting, often the information relevant for making a
forecast is within a small window of past observations. Often an MLP with a
window or a linear model may be a less complex and more suitable model.
• An important limitation of LSTMs is the memory

