
Artificial Neural Networks and Deep Learning
Contents

• Introduction
Motivation, Biological Background

• Threshold Logic Units
Definition, Geometric Interpretation, Limitations, Networks of TLUs, Training

• General Neural Networks
Structure, Operation, Training

• Multi-layer Perceptrons
Definition, Function Approximation, Gradient Descent, Backpropagation, Variants, Sensitivity Analysis

• Deep Learning
Many-layered Perceptrons, Rectified Linear Units, Auto-Encoders, Feature Construction, Image Analysis

• Radial Basis Function Networks
Definition, Function Approximation, Initialization, Training, Generalized Version

• Self-Organizing Maps
Definition, Learning Vector Quantization, Neighborhood of Output Neurons

• Hopfield Networks and Boltzmann Machines
Definition, Convergence, Associative Memory, Solving Optimization Problems, Probabilistic Models

• Recurrent Neural Networks
Differential Equations, Vector Networks, Backpropagation through Time

Multi-layer Perceptrons (MLPs)

Multi-layer Perceptrons

An r-layer perceptron is a neural network with a graph G = (U, C) that satisfies the following conditions:

(i) $U_{\mathrm{in}} \cap U_{\mathrm{out}} = \emptyset$,

(ii) $U_{\mathrm{hidden}} = U_{\mathrm{hidden}}^{(1)} \cup \cdots \cup U_{\mathrm{hidden}}^{(r-2)}$,
$\forall\, 1 \le i < j \le r-2:\ U_{\mathrm{hidden}}^{(i)} \cap U_{\mathrm{hidden}}^{(j)} = \emptyset$,

(iii) $C \subseteq \bigl(U_{\mathrm{in}} \times U_{\mathrm{hidden}}^{(1)}\bigr) \cup \Bigl(\bigcup_{i=1}^{r-3} U_{\mathrm{hidden}}^{(i)} \times U_{\mathrm{hidden}}^{(i+1)}\Bigr) \cup \bigl(U_{\mathrm{hidden}}^{(r-2)} \times U_{\mathrm{out}}\bigr)$

Multi-layer Perceptrons

Figure: general structure of a multi-layer perceptron — the inputs $x_1, \dots, x_n$ feed $U_{\mathrm{in}}$, which connects through the hidden layers $U_{\mathrm{hidden}}^{(1)}, \dots, U_{\mathrm{hidden}}^{(r-2)}$ to $U_{\mathrm{out}}$ with outputs $y_1, \dots, y_m$.

Multi-layer Perceptrons

• The network input function of each hidden neuron and of each output neuron is the weighted sum of its inputs, that is,

$\forall u \in U_{\mathrm{hidden}} \cup U_{\mathrm{out}}:\quad f_{\mathrm{net}}^{(u)}(\vec{w}_u, \vec{\mathrm{in}}_u) = \vec{w}_u^{\top}\vec{\mathrm{in}}_u = \sum_{v \in \mathrm{pred}(u)} w_{uv}\, \mathrm{out}_v.$

• The activation function of each hidden neuron is a so-called sigmoid function, that is, a monotonically increasing function

$f: \mathbb{R} \to [0, 1]$ with $\lim_{x \to -\infty} f(x) = 0$ and $\lim_{x \to \infty} f(x) = 1$.

• The activation function of each output neuron is either also a sigmoid function or a linear function, that is,

$f_{\mathrm{act}}(\mathrm{net}, \theta) = \alpha\,\mathrm{net} - \theta.$

Only the step function is a neurobiologically plausible activation function.
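
Read together, these two definitions describe one forward step per layer. The sketch below (Python/NumPy, with made-up layer sizes, weights, and thresholds) computes the weighted-sum network input and applies the logistic activation for a single hidden layer; all names are illustrative, not from the slides.

```python
import numpy as np

def logistic(net, theta):
    """Logistic activation: f_act(net, theta) = 1 / (1 + exp(-(net - theta)))."""
    return 1.0 / (1.0 + np.exp(-(net - theta)))

# Hypothetical layer: 3 predecessor neurons feeding 2 hidden neurons.
out_pred = np.array([0.2, 0.7, 1.0])      # outputs out_v of the predecessors pred(u)
W = np.array([[0.5, -1.0,  2.0],          # w_uv: one row per hidden neuron u
              [1.5,  0.3, -0.7]])
theta = np.array([0.1, -0.2])             # threshold of each hidden neuron

net = W @ out_pred                        # f_net: weighted sum over pred(u)
act = logistic(net, theta)                # f_act: sigmoid activation
print(act)
```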

Sigmoid Activation Functions

Figure: four unipolar sigmoid activation functions, each rising from 0 over 1/2 to 1 around the threshold θ:

• step function
• semi-linear function (linear ramp between θ − 1/2 and θ + 1/2)
• sine until saturation (rising between θ − π/2 and θ + π/2)
• logistic function:

$f_{\mathrm{act}}(\mathrm{net}, \theta) = \dfrac{1}{1 + e^{-(\mathrm{net} - \theta)}}$
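
Only the logistic formula survives in the slide text, so the sketch below (Python/NumPy) spells out plausible forms of all four functions; the step, semi-linear, and sine-until-saturation definitions are assumptions read off the plot titles and the tick marks θ ± 1/2 and θ ± π/2, not formulas given in the document.

```python
import numpy as np

def step(net, theta):
    # assumed form: jumps from 0 to 1 at net = theta
    return np.where(net >= theta, 1.0, 0.0)

def semi_linear(net, theta):
    # assumed form: linear ramp of width 1 centred at theta, clipped to [0, 1]
    return np.clip((net - theta) + 0.5, 0.0, 1.0)

def sine_until_saturation(net, theta):
    # assumed form: half sine wave between theta - pi/2 and theta + pi/2
    x = np.clip(net - theta, -np.pi / 2, np.pi / 2)
    return (np.sin(x) + 1.0) / 2.0

def logistic(net, theta):
    # given on the slide: 1 / (1 + exp(-(net - theta)))
    return 1.0 / (1.0 + np.exp(-(net - theta)))
```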

Sigmoid Activation Functions

• All sigmoid functions on the previous slide are unipolar, that is, they range from 0 to 1.

• Sometimes bipolar sigmoid functions are used (ranging from −1 to +1), like the hyperbolic tangent (tangens hyperbolicus).

hyperbolic tangent:

$f_{\mathrm{act}}(\mathrm{net}, \theta) = \tanh(\mathrm{net} - \theta) = \dfrac{e^{(\mathrm{net}-\theta)} - e^{-(\mathrm{net}-\theta)}}{e^{(\mathrm{net}-\theta)} + e^{-(\mathrm{net}-\theta)}} = \dfrac{1 - e^{-2(\mathrm{net}-\theta)}}{1 + e^{-2(\mathrm{net}-\theta)}} = \dfrac{2}{1 + e^{-2(\mathrm{net}-\theta)}} - 1$
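
As a quick sanity check (not part of the slides), the four expressions for the hyperbolic tangent above agree numerically:

```python
import numpy as np

net, theta = np.linspace(-3, 3, 7), 0.5
d = net - theta
f1 = np.tanh(d)
f2 = (np.exp(d) - np.exp(-d)) / (np.exp(d) + np.exp(-d))
f3 = (1 - np.exp(-2 * d)) / (1 + np.exp(-2 * d))
f4 = 2 / (1 + np.exp(-2 * d)) - 1
assert np.allclose(f1, f2) and np.allclose(f1, f3) and np.allclose(f1, f4)
```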

Multi-layer Perceptrons: Weight Matrices

Let $U_1 = \{v_1, \dots, v_m\}$ and $U_2 = \{u_1, \dots, u_n\}$ be the neurons of two consecutive layers of a multi-layer perceptron.

Their connection weights are represented by an $n \times m$ matrix

$W = \begin{pmatrix} w_{u_1 v_1} & \cdots & w_{u_1 v_m} \\ \vdots & & \vdots \\ w_{u_n v_1} & \cdots & w_{u_n v_m} \end{pmatrix},$

where $w_{u_i v_j} = 0$ if there is no connection from neuron $v_j$ to neuron $u_i$.

Advantage: The computation of the network input can be written as

$\vec{\mathrm{net}}_{U_2} = W \cdot \vec{\mathrm{in}}_{U_2} = W \cdot \vec{\mathrm{out}}_{U_1}$

where $\vec{\mathrm{net}}_{U_2} = (\mathrm{net}_{u_1}, \dots, \mathrm{net}_{u_n})^{\top}$ and $\vec{\mathrm{in}}_{U_2} = \vec{\mathrm{out}}_{U_1} = (\mathrm{out}_{v_1}, \dots, \mathrm{out}_{v_m})^{\top}$.
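
In code, this matrix form is a single matrix-vector product. A minimal sketch with arbitrary example numbers (m = 3 neurons in U1, n = 2 neurons in U2):

```python
import numpy as np

# W[i, j] holds the weight from neuron v_j (layer U1) to neuron u_i (layer U2).
W = np.array([[0.5, 0.0, -1.0],      # zero entry: no connection from v_2 to u_1
              [2.0, 1.0,  0.3]])
out_U1 = np.array([1.0, 0.5, -0.2])  # output vector of layer U1

net_U2 = W @ out_U1                  # net_U2 = W · out_U1
print(net_U2)
```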

Multi-layer Perceptrons: Biimplication

Solving the biimplication problem with a multi-layer perceptron.

Figure: the inputs x1 and x2 (layer U_in) feed two hidden neurons with threshold −1, via weights −2/2 for x1 and 2/−2 for x2; both hidden neurons connect with weight 2 to the output neuron y with threshold 3 (layer U_out).

Note the additional input neurons compared to the TLU solution.
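
The weights can be checked directly. A small sketch that reads the values from the figure above (hidden thresholds −1, input weights ±2, output threshold 3, output weights 2) and, for simplicity, uses step-function activations instead of sigmoids:

```python
import numpy as np

def tlu(weights, theta, inputs):
    """Threshold unit: fires (1) iff the weighted input sum reaches theta."""
    return int(np.dot(weights, inputs) >= theta)

def biimplication_mlp(x1, x2):
    h1 = tlu([-2, 2], -1, [x1, x2])   # computes x1 -> x2
    h2 = tlu([ 2, -2], -1, [x1, x2])  # computes x2 -> x1
    return tlu([2, 2], 3, [h1, h2])   # conjunction of the two implications

for x1 in (0, 1):
    for x2 in (0, 1):
        print(x1, x2, biimplication_mlp(x1, x2))  # 1 exactly when x1 == x2
```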

Multi-layer Perceptrons: Fredkin Gate

Figure: the Fredkin gate symbol — the control line s is passed through unchanged, while the data lines x1 and x2 produce the outputs y1 and y2.

Truth table of the Fredkin gate:

s  : 0 0 0 0 1 1 1 1
x1 : 0 0 1 1 0 0 1 1
x2 : 0 1 0 1 0 1 0 1
y1 : 0 0 1 1 0 1 0 1
y2 : 0 1 0 1 0 0 1 1

Figure: the functions y1 and y2 drawn as unit cubes over the three inputs.
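
Reading the truth table, s = 0 passes the data lines through while s = 1 swaps them. A short check of that reading against the table (a sketch, not from the slides):

```python
def fredkin(s, x1, x2):
    """Fredkin (controlled swap) gate as read from the truth table above."""
    if s == 0:
        return x1, x2          # pass through
    return x2, x1              # swap

# columns of the truth table on this slide: (s, x1, x2, y1, y2)
table = [
    (0, 0, 0, 0, 0), (0, 0, 1, 0, 1), (0, 1, 0, 1, 0), (0, 1, 1, 1, 1),
    (1, 0, 0, 0, 0), (1, 0, 1, 1, 0), (1, 1, 0, 0, 1), (1, 1, 1, 1, 1),
]
assert all(fredkin(s, x1, x2) == (y1, y2) for s, x1, x2, y1, y2 in table)
```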

Multi-layer Perceptrons: Fredkin Gate

Figure: a multi-layer perceptron computing the Fredkin gate — inputs s, x1, x2 in U_in, a hidden layer of threshold units in U_hidden, and the output neurons y1 and y2 in U_out; the visible connection weights are 2 and −2, the visible thresholds are 1 and 3.

Why Non-linear Activation Functions?

With weight matrices we have for two consecutive layers $U_1$ and $U_2$

$\vec{\mathrm{net}}_{U_2} = W \cdot \vec{\mathrm{in}}_{U_2} = W \cdot \vec{\mathrm{out}}_{U_1}.$

If the activation functions are linear, that is,

$f_{\mathrm{act}}(\mathrm{net}, \theta) = \alpha\,\mathrm{net} - \theta,$

the activations of the neurons in the layer $U_2$ can be computed as

$\vec{\mathrm{act}}_{U_2} = D_{\mathrm{act}} \cdot \vec{\mathrm{net}}_{U_2} - \vec{\theta},$

where

• $\vec{\mathrm{act}}_{U_2} = (\mathrm{act}_{u_1}, \dots, \mathrm{act}_{u_n})^{\top}$ is the activation vector,
• $D_{\mathrm{act}}$ is an $n \times n$ diagonal matrix of the factors $\alpha_{u_i}$, $i = 1, \dots, n$, and
• $\vec{\theta} = (\theta_{u_1}, \dots, \theta_{u_n})^{\top}$ is a bias vector.

Why Non-linear Activation Functions?

If the output function is also linear, it is analogously

$\vec{\mathrm{out}}_{U_2} = D_{\mathrm{out}} \cdot \vec{\mathrm{act}}_{U_2} - \vec{\xi},$

where

• $\vec{\mathrm{out}}_{U_2} = (\mathrm{out}_{u_1}, \dots, \mathrm{out}_{u_n})^{\top}$ is the output vector,
• $D_{\mathrm{out}}$ is again an $n \times n$ diagonal matrix of factors, and
• $\vec{\xi} = (\xi_{u_1}, \dots, \xi_{u_n})^{\top}$ is a bias vector.

Combining these computations we get

$\vec{\mathrm{out}}_{U_2} = D_{\mathrm{out}} \cdot D_{\mathrm{act}} \cdot W \cdot \vec{\mathrm{out}}_{U_1} - D_{\mathrm{out}} \cdot \vec{\theta} - \vec{\xi}$

and thus

$\vec{\mathrm{out}}_{U_2} = A_{12} \cdot \vec{\mathrm{out}}_{U_1} + \vec{b}_{12}$

with an $n \times m$ matrix $A_{12}$ and an $n$-dimensional vector $\vec{b}_{12}$.

Why Non-linear Activation Functions?

Therefore we have

$\vec{\mathrm{out}}_{U_2} = A_{12} \cdot \vec{\mathrm{out}}_{U_1} + \vec{b}_{12}$

and

$\vec{\mathrm{out}}_{U_3} = A_{23} \cdot \vec{\mathrm{out}}_{U_2} + \vec{b}_{23}$

for the computations of two consecutive layers $U_2$ and $U_3$.

These two computations can be combined into

$\vec{\mathrm{out}}_{U_3} = A_{13} \cdot \vec{\mathrm{out}}_{U_1} + \vec{b}_{13},$

where $A_{13} = A_{23} \cdot A_{12}$ and $\vec{b}_{13} = A_{23} \cdot \vec{b}_{12} + \vec{b}_{23}$.

Result: With linear activation and output functions any multi-layer perceptron can be reduced to a two-layer perceptron.
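
A small numerical sketch of this reduction (arbitrary random matrices, NumPy): composing two affine layers is exactly one affine layer with A13 = A23 · A12 and b13 = A23 · b12 + b23.

```python
import numpy as np

rng = np.random.default_rng(0)
A12, b12 = rng.normal(size=(4, 3)), rng.normal(size=4)   # layer U1 -> U2
A23, b23 = rng.normal(size=(2, 4)), rng.normal(size=2)   # layer U2 -> U3

out_U1 = rng.normal(size=3)
out_U3_two_layers = A23 @ (A12 @ out_U1 + b12) + b23

A13, b13 = A23 @ A12, A23 @ b12 + b23                    # collapsed single layer
out_U3_one_layer = A13 @ out_U1 + b13

assert np.allclose(out_U3_two_layers, out_U3_one_layer)
```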

Multi-layer Perceptrons: Function Approximation

• Up to now: representing and learning Boolean functions $f: \{0,1\}^n \to \{0,1\}$.

• Now: representing and learning real-valued functions $f: \mathbb{R}^n \to \mathbb{R}$.

General idea of function approximation:


• Approximate a given function by a step function.
• Construct a neural network that computes the step function.

Figure: a step function over x with borders $x_1, \dots, x_4$ and levels $y_0, \dots, y_4$.

Multi-layer Perceptrons: Function Approximation

Figure: a four-layer perceptron for the step function — the input x feeds four threshold units with thresholds $x_1, \dots, x_4$ (input weight 1 each); these feed a second hidden layer via weights 2 and −2 (thresholds 1), whose units indicate the interval the input falls into; the output neuron (identity activation, "id") weights them with the step heights $y_1, y_2, y_3$. The step function with levels $y_0, \dots, y_4$ is shown alongside.

A neural network that computes the step function shown on the preceding slide.
According to the input value only one step is active at any time.
The output neuron has the identity as its activation and output functions.
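
Below is a sketch of such a network in code, under the assumption (consistent with the figure and caption) that the first hidden layer holds one threshold unit per border x_i, the second hidden layer detects the interval [x_i, x_{i+1}) via weights +2/−2 and threshold 1, and the output neuron sums the step heights with the identity activation; all function and variable names are made up.

```python
import numpy as np

def heaviside(z):
    return (z >= 0).astype(float)

def step_approx(x, borders, heights):
    """Four-layer MLP sketch: borders x_1..x_k, heights y_i on [x_i, x_{i+1})."""
    x = np.asarray(x, dtype=float)
    # hidden layer 1: one threshold unit per border, fires iff x >= x_i
    h1 = heaviside(x[:, None] - np.asarray(borders)[None, :])
    # hidden layer 2: unit i fires iff x lies in [x_i, x_{i+1}) (weights +2/-2, threshold 1)
    h2 = heaviside(2 * h1[:, :-1] - 2 * h1[:, 1:] - 1)
    # output neuron: identity activation, weighted sum of the step heights
    return h2 @ np.asarray(heights)

xs = np.linspace(0, 5, 11)
print(step_approx(xs, borders=[1, 2, 3, 4], heights=[0.5, 2.0, 1.0]))
```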

Multi-layer Perceptrons: Function Approximation

Theorem: Any Riemann-integrable function can be approximated with arbitrary accuracy by a four-layer perceptron.

• But: Error is measured as the area between the functions.

• More sophisticated mathematical examination allows a stronger assertion:
With a three-layer perceptron any continuous function can be approximated with arbitrary accuracy (error: maximum function value difference).

Multi-layer Perceptrons: Function Approximation

Figure: left, the step function with absolute levels $y_0, \dots, y_4$ at the borders $x_1, \dots, x_4$; right, the same function decomposed into relative step heights $\Delta y_1, \dots, \Delta y_4$, each realized by a single step from 0 to $\Delta y_i$.

By using relative step heights one layer can be saved.
Multi-layer Perceptrons: Function Approximation

Figure: a three-layer perceptron computing the step function — one threshold unit per border $x_1, \dots, x_4$ (input weight 1 each) and an output neuron with the identity activation ("id") whose weights are the relative step heights $\Delta y_1, \dots, \Delta y_4$.

A neural network that computes the step function shown on the preceding slide.
The output neuron has the identity as its activation and output functions.
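
The corresponding sketch for this three-layer version: each border x_i gets one threshold unit, and the identity output neuron weights it with the relative height Δy_i, so all units that fire for a given x add up to the absolute level (names are illustrative, not from the slides).

```python
import numpy as np

def step_approx_relative(x, borders, deltas):
    """Three-layer sketch: one threshold unit per border x_i, output weight Δy_i."""
    x = np.asarray(x, dtype=float)
    h = (x[:, None] >= np.asarray(borders)[None, :]).astype(float)  # step units
    return h @ np.asarray(deltas)   # identity output neuron: sum of fired heights

# cumulative levels y_i are recovered because all units with x_j <= x fire at once
print(step_approx_relative(np.linspace(0, 5, 11),
                           borders=[1, 2, 3, 4], deltas=[0.5, 1.5, -1.0, 0.8]))
```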

Multi-layer Perceptrons: Function Approximation

Figure: the same target function and its decomposition into relative heights $\Delta y_1, \dots, \Delta y_4$ at the borders $x_1, \dots, x_4$, now realized by ramps from 0 to $\Delta y_i$ instead of jumps.

By using semi-linear functions the approximation can be improved.

Multi-layer Perceptrons: Function Approximation

Figure: a three-layer perceptron with semi-linear hidden units — the input x is connected to each hidden unit with weight $1/\Delta x$, the hidden thresholds are $\theta_i = x_i/\Delta x$ with $\Delta x = x_{i+1} - x_i$, and the output neuron (identity activation, "id") weights the hidden outputs with $\Delta y_1, \dots, \Delta y_4$.

A neural network that computes the step function shown on the preceding slide.
The output neuron has the identity as its activation and output functions.
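
A sketch of this last variant, assuming the semi-linear activation from the earlier slide (a ramp of width 1 around θ); the input weight 1/Δx and the thresholds θ_i = x_i/Δx are taken from the figure, everything else is illustrative.

```python
import numpy as np

def semi_linear(net, theta):
    # assumed ramp of width 1 around theta, clipped to [0, 1]
    return np.clip((net - theta) + 0.5, 0.0, 1.0)

def piecewise_linear_approx(x, borders, deltas, dx):
    """Sketch of the slide's network: input weight 1/Δx, θ_i = x_i/Δx, output weights Δy_i."""
    x = np.asarray(x, dtype=float)
    net = x[:, None] / dx                                  # weighted network input (1/Δx)·x
    thetas = np.asarray(borders)[None, :] / dx             # θ_i = x_i / Δx
    return semi_linear(net, thetas) @ np.asarray(deltas)   # identity output neuron

print(piecewise_linear_approx(np.linspace(0, 5, 11),
                              borders=[1, 2, 3, 4], deltas=[0.5, 1.5, -1.0, 0.8], dx=1.0))
```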
