
Foundations of Machine Learning

Module 6: Neural Network


Part B: Multi-layer Neural
Network
Sudeshna Sarkar
IIT Kharagpur
Limitations of Perceptrons
• Perceptrons have a monotonicity property: if a link has a positive weight, the activation can only increase as the corresponding input value increases (irrespective of the other input values).
• They cannot represent functions in which input interactions can cancel one another's effect (e.g. XOR; a short argument is sketched below).
• They can represent only linearly separable functions.
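To see why XOR is out of reach (a worked argument added for illustration, not from the original slides): suppose a single threshold unit with weights $w_1, w_2$ and bias $b$ computed XOR of $x_1, x_2$. It would need $w_1 + b > 0$ and $w_2 + b > 0$ (to output 1 on inputs (1,0) and (0,1)) together with $b \le 0$ and $w_1 + w_2 + b \le 0$ (to output 0 on (0,0) and (1,1)). Adding the first two inequalities gives $w_1 + w_2 + 2b > 0$, which with $w_1 + w_2 + b \le 0$ forces $b > 0$, contradicting $b \le 0$. No choice of weights works.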
A solution: multiple layers
[Figure: a network with an input layer (x1, x2), a hidden layer (z1, z2), and an output layer (y).]
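As a concrete illustration (an assumed construction, not taken from the original slides), such a network can compute XOR by letting the hidden units act as OR and NAND and the output unit as AND. A minimal Python sketch with hand-chosen weights and step activations:

def step(a):
    # threshold activation: 1 if the weighted sum is positive, else 0
    return 1.0 if a > 0 else 0.0

def xor_net(x1, x2):
    # hidden layer: z1 behaves like OR(x1, x2), z2 like NAND(x1, x2)
    z1 = step(1.0 * x1 + 1.0 * x2 - 0.5)
    z2 = step(-1.0 * x1 - 1.0 * x2 + 1.5)
    # output layer: y behaves like AND(z1, z2), giving XOR overall
    return step(1.0 * z1 + 1.0 * z2 - 1.5)

for x1 in (0, 1):
    for x2 in (0, 1):
        print(x1, x2, "->", xor_net(x1, x2))   # prints the XOR truth table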
Power/Expressiveness of Multilayer Networks
• Can represent interactions among inputs
• Two-layer networks can represent any Boolean function, and any continuous function (to within a tolerance), as long as the number of hidden units is sufficient and appropriate activation functions are used
• Learning algorithms exist, but with weaker guarantees than perceptron learning algorithms
Multilayer Network

[Figure: a multilayer network in which the inputs feed a first hidden layer, then a second hidden layer, and finally the output layer.]
Two-layer back-propagation neural network
[Figure: a two-layer back-propagation network. Input signals x1 … xn enter the input layer; hidden unit j receives them through weights wij; output unit k produces yk (outputs y1 … yn2) through weights wjk. Error signals travel in the opposite direction, from the output layer back towards the inputs.]
The back-propagation training algorithm
• Step 1: Initialisation
Set all the weights and threshold levels of the network to
random numbers uniformly distributed inside a small range
1

v01
v11 1
x1 1 1 w11
v21 w01

1 y1
v22
x2 2 2 w21
v22
Input v02 Output

1
x z y
Backprop
• Initialization
– Set all the weights and threshold levels of the network to
random numbers uniformly distributed inside a small
range
• Forward computing:
– Apply an input vector x to the input units
– Compute the activation/output vector z on the hidden layer:
$z_j = \varphi\left(\sum_i v_{ij} x_i\right)$
– Compute the output vector y on the output layer:
$y_k = \varphi\left(\sum_j w_{jk} z_j\right)$
y is the result of the computation.
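A minimal NumPy sketch of the initialisation and forward pass for one hidden layer; the logistic sigmoid, the ±0.5 initialisation range and the bias handling are illustrative assumptions, while the formulas for z and y follow the slide above:

import numpy as np

def sigmoid(a):
    # logistic activation: phi(a) = 1 / (1 + exp(-a))
    return 1.0 / (1.0 + np.exp(-a))

rng = np.random.default_rng(0)
n_in, n_hidden, n_out = 2, 2, 1

# initialisation: small uniform random weights; row 0 of each matrix holds the bias/threshold
V = rng.uniform(-0.5, 0.5, size=(n_in + 1, n_hidden))   # input  -> hidden weights v_ij
W = rng.uniform(-0.5, 0.5, size=(n_hidden + 1, n_out))  # hidden -> output weights w_jk

def forward(x, V, W):
    x1 = np.concatenate(([1.0], x))   # prepend a constant 1 so the bias acts as a weight
    z = sigmoid(x1 @ V)               # z_j = phi(sum_i v_ij x_i)
    z1 = np.concatenate(([1.0], z))
    y = sigmoid(z1 @ W)               # y_k = phi(sum_j w_jk z_j)
    return z, y

z, y = forward(np.array([0.0, 1.0]), V, W)
print("hidden activations:", z, "output:", y)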
Learning for BP Nets
• Update of the weights in W (between the output and hidden layers):
– delta rule
• Not applicable to updating V (between the input and hidden layers):
– we don't know the target values for the hidden units z1, z2, …, zp
• Solution: propagate the errors at the output units back to the hidden units to drive the update of the weights in V (again by the delta rule) – this is error BACKPROPAGATION learning
• Error backpropagation can be continued downward if the net has more than one hidden layer
• How do we compute the errors on the hidden units?
Derivation
• For one output neuron, the error function is
$E = \frac{1}{2}(y - \hat{y})^2$
• For each unit $j$, the output $o_j$ is defined as
$o_j = \varphi(\mathrm{net}_j) = \varphi\left(\sum_{k=1}^{n} w_{kj}\, o_k\right)$
The input $\mathrm{net}_j$ to a neuron is the weighted sum of the outputs $o_k$ of the previous $n$ neurons.
• Finding the derivative of the error:
$\frac{\partial E}{\partial w_{ij}} = \frac{\partial E}{\partial o_j} \frac{\partial o_j}{\partial \mathrm{net}_j} \frac{\partial \mathrm{net}_j}{\partial w_{ij}}$
Derivation
• Finding the derivative of the error:
$\frac{\partial E}{\partial w_{ij}} = \frac{\partial E}{\partial o_j} \frac{\partial o_j}{\partial \mathrm{net}_j} \frac{\partial \mathrm{net}_j}{\partial w_{ij}}$
$\frac{\partial \mathrm{net}_j}{\partial w_{ij}} = \frac{\partial}{\partial w_{ij}} \sum_{k=1}^{n} w_{kj}\, o_k = o_i$
$\frac{\partial o_j}{\partial \mathrm{net}_j} = \frac{\partial}{\partial \mathrm{net}_j} \varphi(\mathrm{net}_j) = \varphi(\mathrm{net}_j)\left(1 - \varphi(\mathrm{net}_j)\right)$ (for the logistic sigmoid $\varphi$)
• Consider $E$ as a function of the inputs of all neurons $Z = \{z_1, z_2, \ldots\}$ receiving input from neuron $j$:
$\frac{\partial E(o_j)}{\partial o_j} = \frac{\partial E(\mathrm{net}_{z_1}, \mathrm{net}_{z_2}, \ldots)}{\partial o_j}$
Taking the total derivative with respect to $o_j$, a recursive expression for the derivative is obtained:
$\frac{\partial E}{\partial o_j} = \sum_l \frac{\partial E}{\partial \mathrm{net}_{z_l}} \frac{\partial \mathrm{net}_{z_l}}{\partial o_j} = \sum_l \frac{\partial E}{\partial o_l} \frac{\partial o_l}{\partial \mathrm{net}_{z_l}}\, w_{j z_l}$
• Therefore, the derivative with respect to $o_j$ can be calculated if all the derivatives with respect to the outputs $o_{z_l}$ of the next layer – the one closer to the output neuron – are known.
• Putting it all together:
$\frac{\partial E}{\partial w_{ij}} = \delta_j\, o_i$
with
$\delta_j = \frac{\partial E}{\partial o_j} \frac{\partial o_j}{\partial \mathrm{net}_j} =
\begin{cases}
(o_j - t_j)\, o_j (1 - o_j) & \text{if } j \text{ is an output neuron} \\
\left(\sum_{z_l} \delta_{z_l} w_{j z_l}\right) o_j (1 - o_j) & \text{if } j \text{ is an inner neuron}
\end{cases}$
• To update the weight $w_{ij}$ using gradient descent, one must choose a learning rate $\eta$:
$\Delta w_{ij} = -\eta\, \frac{\partial E}{\partial w_{ij}}$
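Instantiating these formulas for the single-hidden-layer network used earlier (inputs x, hidden outputs z, outputs y, weights v and w, targets t; this restatement is added for concreteness and is not from the original slides):
$\delta_k = (y_k - t_k)\, y_k (1 - y_k)$ and $\Delta w_{jk} = -\eta\, \delta_k z_j$ for the output layer;
$\delta_j = \left(\sum_k \delta_k w_{jk}\right) z_j (1 - z_j)$ and $\Delta v_{ij} = -\eta\, \delta_j x_i$ for the hidden layer.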
Backpropagation Algorithm
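A compact NumPy sketch of the whole procedure – initialisation, forward pass, backward pass and gradient-descent update – trained on XOR for illustration. It is an assumed implementation of the update rules derived above, not the listing from the original slide; the learning rate, initialisation range, network size, random seed and epoch count are arbitrary choices.

import numpy as np

def sigmoid(a):
    return 1.0 / (1.0 + np.exp(-a))

rng = np.random.default_rng(1)
eta = 0.5                                    # learning rate

# XOR training set: inputs X and targets T
X = np.array([[0., 0.], [0., 1.], [1., 0.], [1., 1.]])
T = np.array([[0.], [1.], [1.], [0.]])

# Step 1: initialisation – small uniform random weights; row 0 holds the bias/threshold
V = rng.uniform(-0.5, 0.5, size=(3, 2))      # input (2 + bias) -> hidden (2)
W = rng.uniform(-0.5, 0.5, size=(3, 1))      # hidden (2 + bias) -> output (1)

for _ in range(20000):
    for x, t in zip(X, T):
        # forward pass
        x1 = np.concatenate(([1.0], x))      # bias input
        z = sigmoid(x1 @ V)                  # hidden activations z_j
        z1 = np.concatenate(([1.0], z))
        y = sigmoid(z1 @ W)                  # outputs y_k

        # backward pass: deltas for output and hidden units
        delta_out = (y - t) * y * (1 - y)                 # (y_k - t_k) y_k (1 - y_k)
        delta_hid = (W[1:] @ delta_out) * z * (1 - z)     # (sum_k delta_k w_jk) z_j (1 - z_j)

        # gradient-descent updates: Delta(weight) = -eta * delta * (unit's input)
        W -= eta * np.outer(z1, delta_out)
        V -= eta * np.outer(x1, delta_hid)

for x in X:
    x1 = np.concatenate(([1.0], x))
    z1 = np.concatenate(([1.0], sigmoid(x1 @ V)))
    # typically approaches 0, 1, 1, 0 (convergence depends on the random initialisation)
    print(x, "->", sigmoid(z1 @ W).item())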
Thank You
