
Chapter 2: Adaline

Dr. Gaur Sanjay B.C.


Visible and Hidden Neurons

• Visible neurons: they form the interface between the network and the
environment in which it operates. They can also be clamped onto specific
states determined by the environment.
• Hidden neurons: they operate independently of the environment. In the
free-running condition, all the neurons are allowed to operate freely.
Hebb Net
• The first learning law for artificial neural networks was designed by
Donald Hebb in 1949.
• For the Hebb net, the input and output data should be in bipolar form.
A limitation of the Hebb net is that it cannot learn from binary data.
• The Hebb net includes a bias, which acts as a weight on a connection
from a unit whose activation is always 1.

Figure: Architecture of a single-layer net
Algorithm
Step 1: Initially, all weights and the bias are set to zero:
        wi = 0 and b = 0.
Step 2: For each input-target pair, perform Steps 3-6.
Step 3: Set activations for the input units to the input vector: xi = si.
Step 4: Set the activation of the output unit to the target: y = t.
Step 5: Adjust the weights by applying the Hebb rule:
        wi(new) = wi(old) + xi·y   (Δwi = xi·y)
Step 6: Adjust the bias by applying the Hebb rule:
        b(new) = b(old) + y   (Δb = y)
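As an illustration (not part of the original slides), here is a minimal Python sketch of these steps for a bipolar training set; the AND-gate pairs of the worked example below are assumed as the input data.

```python
# Hebb rule for a single neuron with bias (bipolar inputs and targets).
def hebb_train(samples):
    """samples: list of ((x1, x2), target) pairs in bipolar form."""
    w1 = w2 = b = 0                      # Step 1: weights and bias start at zero
    for (x1, x2), t in samples:          # Step 2: loop over input/target pairs
        y = t                            # Step 4: output unit is set to the target
        w1 += x1 * y                     # Step 5: wi(new) = wi(old) + xi*y
        w2 += x2 * y
        b += y                           # Step 6: b(new) = b(old) + y
        print(f"x=({x1},{x2}) t={t}  ->  w1={w1}, w2={w2}, b={b}")
    return w1, w2, b

# Bipolar AND-gate training pairs, as in the worked example that follows.
and_gate = [((1, 1), 1), ((1, -1), -1), ((-1, 1), -1), ((-1, -1), -1)]
hebb_train(and_gate)                     # final weights: w1=2, w2=2, b=-2
```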
Worked example (bipolar AND gate):

Step 1: Initially
w1 = w2 = b = 0

Input (x1, x2, b) | Target y | Weight change (Δw1, Δw2, Δb) | Final weights (w1, w2, b)
Initially         |          |                              | (0, 0, 0)

Step 2: x1 = 1, x2 = 1, b = 1, y = 1 as per the table.

Weight changes:
Δw1 = x1·y = 1·1 = 1    Δw2 = x2·y = 1·1 = 1    Δb = y = 1

Updated weights:
w1(new) = w1(old) + x1·y = 0 + 1 = 1
w2(new) = w2(old) + x2·y = 0 + 1 = 1
b(new)  = b(old) + y     = 0 + 1 = 1

Input (x1, x2, b) | Target y | Weight change (Δw1, Δw2, Δb) | Final weights (w1, w2, b)
(1, 1, 1)         | 1        | (1, 1, 1)                    | (1, 1, 1)

Step 3: x1 = 1, x2 = -1, b = 1, y = -1 as per the table.

Weight changes:
Δw1 = x1·y = 1·(-1) = -1    Δw2 = x2·y = (-1)·(-1) = 1    Δb = y = -1

Updated weights:
w1(new) = w1(old) + x1·y = 1 - 1 = 0
w2(new) = w2(old) + x2·y = 1 + 1 = 2
b(new)  = b(old) + y     = 1 - 1 = 0

Input (x1, x2, b) | Target y | Weight change (Δw1, Δw2, Δb) | Final weights (w1, w2, b)
(1, -1, 1)        | -1       | (-1, 1, -1)                  | (0, 2, 0)

Step 4: Finally,

Input (x1, x2, b) | Target y | Weight change (Δw1, Δw2, Δb) | Final weights (w1, w2, b)
Initially         |          |                              | (0, 0, 0)
(1, 1, 1)         |  1       | (1, 1, 1)                    | (1, 1, 1)
(1, -1, 1)        | -1       | (-1, 1, -1)                  | (0, 2, 0)
(-1, 1, 1)        | -1       | (1, -1, -1)                  | (1, 1, -1)
(-1, -1, 1)       | -1       | (1, 1, -1)                   | (2, 2, -2)
Finally, the Hebb net for the AND gate is given by

v = b + w1·x1 + w2·x2
y = +1 if v ≥ 0
    -1 if v < 0

Input (x1, x2, b) | v = b + w1·x1 + w2·x2 | Output y
(1, 1, 1)         |  2                    |  1
(1, -1, 1)        | -2                    | -1
(-1, 1, 1)        | -2                    | -1
(-1, -1, 1)       | -6                    | -1
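A small Python check (an illustration, assuming the final weights w1 = 2, w2 = 2, b = -2 obtained above) that the trained Hebb net reproduces the AND-gate table:

```python
# Verify that the trained Hebb net (w1=2, w2=2, b=-2) reproduces the bipolar AND gate.
def classify(x1, x2, w1=2, w2=2, b=-2):
    v = b + w1 * x1 + w2 * x2            # net input
    return 1 if v >= 0 else -1           # hard-limit activation

for x1, x2 in [(1, 1), (1, -1), (-1, 1), (-1, -1)]:
    print((x1, x2), "->", classify(x1, x2))   # 1, -1, -1, -1 as in the table above
```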
The straight line separating the regions can be obtained after presenting
each input pair, from

x1·w1 + x2·w2 + b = 0,  i.e.  x2 = -(w1/w2)·x1 - b/w2  (of the form y = mx + c).

Thus, the line after the first iteration (w1 = 1, w2 = 1, b = 1):
x2 = -x1 - 1

Line after the second iteration (w1 = 0, w2 = 2, b = 0):
x2 = -(0/2)·x1 - 0/2 = 0
i.e. the boundary coincides with the x1-axis (x2 is plotted on the vertical axis).

Line after the third iteration (w1 = 1, w2 = 1, b = -1):
x2 = -(1/1)·x1 - (-1/1) = -x1 + 1

Line after the fourth iteration (w1 = 2, w2 = 2, b = -2):
x2 = -(2/2)·x1 - (-2/2) = -x1 + 1
Example: Develop a Perceptron for the AND function with binary inputs and
bipolar targets, without bias, up to the second epoch. (Take Case I with the
input (0,0) and Case II without it.) (SN Deepa)

Solution: Case I: with (0,0)

x1  x2
 1   1
 1   0
 0   1
 0   0
Step 1: Initially

Input (x1, x2) | Output y | Target t | (Δw1, Δw2) | Final weights (w1, w2)
Initially      |          |          |            | (0, 0)

The net input:  Yin = w1·x1 + w2·x2

The activation function (with threshold θ, taken as θ = 0 here):
f(Yin) =  1  if Yin > θ
          0  if -θ ≤ Yin ≤ θ
         -1  if Yin < -θ

The weight change (applied when y ≠ t, with learning rate α = 1 here):
Δwi = α·t·xi

The weight update:
w(new) = w(old) + Δw
Step 2: EPOCH 1

Input (x1, x2) | Output y | Target t | (Δw1, Δw2) | Final weights (w1, w2)
Initially      |          |          |            | (0, 0)
(1, 1)         | 0        |  1       | (1, 1)     | (1, 1)
(1, 0)         | 1        | -1       | (-1, 0)    | (0, 1)
(0, 1)         | 1        | -1       | (0, -1)    | (0, 0)
(0, 0)         | 0        | -1       | (0, 0)     | (0, 0)   <- final weights of epoch 1

Step 3: EPOCH 2

Input (x1, x2) | Output y | Target t | (Δw1, Δw2) | Final weights (w1, w2)
Initially      |          |          |            | (0, 0)
(1, 1)         | 0        |  1       | (1, 1)     | (1, 1)
(1, 0)         | 1        | -1       | (-1, 0)    | (0, 1)
(0, 1)         | 1        | -1       | (0, -1)    | (0, 0)
(0, 0)         | 0        | -1       | (0, 0)     | (0, 0)   <- final weights of epoch 2
Solution: Case II: without (0,0)

x1  x2
 1   1
 1   0
 0   1

Step 1: Initially

Input (x1, x2) | Output y | Target t | (Δw1, Δw2) | Final weights (w1, w2)
Initially      |          |          |            | (0, 0)

The net input:  Yin = w1·x1 + w2·x2          The weight change:  Δwi = α·t·xi (α = 1)

The activation function (threshold θ = 0):
f(Yin) = 1 if Yin > θ;  0 if -θ ≤ Yin ≤ θ;  -1 if Yin < -θ

The weight update:  w(new) = w(old) + Δw

Step 2: EPOCH 1

Input (x1, x2) | Output y | Target t | (Δw1, Δw2) | Final weights (w1, w2)
Initially      |          |          |            | (0, 0)
(1, 1)         | 0        |  1       | (1, 1)     | (1, 1)
(1, 0)         | 1        | -1       | (-1, 0)    | (0, 1)
(0, 1)         | 1        | -1       | (0, -1)    | (0, 0)   <- final weights of epoch 1

Step 3: EPOCH 2

Input (x1, x2) | Output y | Target t | (Δw1, Δw2) | Final weights (w1, w2)
Initially      |          |          |            | (0, 0)
(1, 1)         | 0        |  1       | (1, 1)     | (1, 1)
(1, 0)         | 1        | -1       | (-1, 0)    | (0, 1)
(0, 1)         | 1        | -1       | (0, -1)    | (0, 0)   <- final weights of epoch 2

Thus, from the above solution it is clear that without a bias, convergence
does not occur: the weights return to (0, 0) at the end of every epoch. Even
after omitting the input (0,0), convergence still does not occur.
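A minimal Python sketch of this experiment, assuming the learning rate α = 1 and threshold θ = 0 used in the tables above; it reproduces the weights returning to (0, 0) at the end of each epoch in both cases.

```python
# Perceptron rule WITHOUT a bias term, binary inputs / bipolar targets (theta = 0, alpha = 1).
def perceptron_epochs(samples, epochs=2):
    w1 = w2 = 0
    for epoch in range(1, epochs + 1):
        for (x1, x2), t in samples:
            y_in = w1 * x1 + w2 * x2
            y = 1 if y_in > 0 else (-1 if y_in < 0 else 0)   # threshold theta = 0
            if y != t:                                        # update only on error
                w1 += t * x1
                w2 += t * x2
        print(f"epoch {epoch}: w1={w1}, w2={w2}")
    return w1, w2

# AND with binary inputs and bipolar targets; Case I includes (0,0), Case II omits it.
case1 = [((1, 1), 1), ((1, 0), -1), ((0, 1), -1), ((0, 0), -1)]
perceptron_epochs(case1)        # ends every epoch at w=(0,0): no convergence
perceptron_epochs(case1[:-1])   # Case II, without (0,0): still w=(0,0)
```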
Application (Image Processing): The Iris Dataset Classification

The Iris flower data set is a multivariate data set conceived by Ronald
Fisher in 1936. Fisher was a British statistician and biologist.

He recorded the length and width of sepals and petals in centimeters for
three different species of flowers: Iris Setosa, Iris Virginica, and Iris
Versicolor.

The total number of records is 150, with 50 for every species. The
columns of the data set are organized as follows:
SepalLengthCm | SepalWidthCm | PetalLengthCm | PetalWidthCm | Species
5.2           | 2.7          | 3.9           | 1.4          | Iris-versicolor
5.5           | 4.2          | 1.4           | 0.2          | Iris-setosa
5.6           | 2.5          | 3.9           | 1.1          | Iris-versicolor
6.3           | 2.5          | 5.0           | 1.9          | Iris-virginica
Activation Function
• Identity (Linear) Function:

Output: y = f(v) = v

Equation: the linear function has the equation of a straight line, i.e. y = ax.
No matter how many layers we have, if all of them are linear, the activation of the
last layer is still just a linear function of the input to the first layer.
Range: -inf to +inf
Uses: the linear activation function is normally used in only one place, the output layer.
Issues: the derivative of a linear function is a constant that no longer depends on the
input x, so it cannot introduce any non-linearity into the network.
For example: predicting the price of a house is a regression problem. The house price can
take any large or small value, so we can apply a linear activation at the output layer;
even in this case the network must have non-linear functions in its hidden layers.
Activation Function…
• Sigmoidal: an S-shaped curve; hyperbolic or logistic functions are commonly
used. The two main types of sigmoidal function are the binary and the bipolar
sigmoidal function.
• a). Binary sigmoidal function / logistic function (range 0 to 1):

Output = 1 / (1 + e^(-λx))

where λ is the steepness parameter.

Figure: Binary sigmoid curves for λ = 0.5, 1, 15 and 30.

Nature: non-linear. For x values between -2 and 2 the curve is very steep, so
small changes in x bring about large changes in the value of y.
Value range: 0 to 1
Uses: usually used in the output layer of a binary classifier, where the result is
either 0 or 1; since the sigmoid output lies between 0 and 1, the result can be
predicted as 1 if the value is greater than 0.5 and as 0 otherwise.
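A one-function Python sketch of the binary sigmoid with the steepness parameter λ (called lam below), for illustration:

```python
import math

# Binary sigmoid (logistic) activation with steepness parameter lam (lambda).
def binary_sigmoid(x, lam=1.0):
    return 1.0 / (1.0 + math.exp(-lam * x))   # output range (0, 1)

print(binary_sigmoid(0.0))          # 0.5 at the origin
print(binary_sigmoid(2.0, lam=15))  # close to 1 for a steep curve
```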
Activation Function…
b). Bipolar sigmoidal / hyperbolic tangent
• The desired range is between +1 and -1.

Output = 2 / (1 + e^(-λx)) - 1

where λ is the steepness parameter.

Figure: Bipolar sigmoid curves for λ = 0.5, 15, 25 and 30.

Value range: -1 to +1
Nature: non-linear
Uses: usually used in the hidden layers of a neural network; since its values lie
between -1 and 1, the mean of the hidden-layer activations comes out to be 0 or very
close to it. This helps centre the data by bringing the mean close to 0, which makes
learning for the next layer much easier.
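A corresponding sketch of the bipolar sigmoid; it also checks the standard identity that this function equals tanh(λx/2):

```python
import math

# Bipolar sigmoid with steepness lam; algebraically equal to tanh(lam*x/2).
def bipolar_sigmoid(x, lam=1.0):
    return 2.0 / (1.0 + math.exp(-lam * x)) - 1.0   # output range (-1, 1)

x = 0.8
print(bipolar_sigmoid(x, lam=1.0), math.tanh(0.5 * x))   # the two values coincide
```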
Activation Function…
Signum Function / Hard limit:
• The function is defined as

Output = +1 for net ≥ 0
         -1 for net < 0

Activation Function…
Binary Activation Function
a). Unipolar:

Output = 1 for net ≥ 0
         0 for net < 0

b). Bipolar:

Output = +1 for net ≥ 0
         -1 for net < 0

Figure: Unipolar and bipolar step functions f(net).
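For completeness, a short sketch of the hard-limit and step activations; returning the upper value at net = 0 is an assumption, since the slide's inequality signs did not survive extraction:

```python
# Hard-limit (signum), unipolar step and bipolar step activations.
def signum(net):
    return 1 if net >= 0 else -1       # +1 for net >= 0, -1 otherwise

def unipolar_step(net):
    return 1 if net >= 0 else 0        # binary {0, 1} output

def bipolar_step(net):
    return 1 if net >= 0 else -1       # bipolar {-1, +1} output (same as signum)

for f in (signum, unipolar_step, bipolar_step):
    print(f.__name__, [f(v) for v in (-0.5, 0.0, 0.5)])
```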
Linear Networks
• The Adaline
Adaline (Adaptive Linear Element)
• In 1960, Widrow and Hoff developed the learning rule (Delta Rule), which is
very closely related to the Perceptron learning rule.
• Adaline and Madaline both use the Least Mean Square (LMS) error for
learning.
• Adaline uses bipolar activation for its input signals and target output.
• The weights and bias are adjusted by the Delta Rule, also known as the LMS
rule or the Widrow-Hoff rule.
Delta Rule
• "The adjustment made to a synaptic weight of a neuron is proportional to
the product of the error signal and the input signal of the synapse."

Δwi = α(t - y_in)·xi

where α    = learning rate
      t    = target value
      y_in = net input to the output unit = Σ wi·xi
      xi   = input
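A minimal Python sketch of one delta-rule update for a single output unit; the numeric check at the bottom uses the values that appear in the ANDNOT example later in this chapter (w1 = w2 = b = 0.2, α = 0.2):

```python
# One delta-rule (LMS / Widrow-Hoff) update for a single output unit with bias.
def delta_update(w, b, x, t, alpha):
    """w, x: lists of weights/inputs; t: target; alpha: learning rate."""
    y_in = b + sum(wi * xi for wi, xi in zip(w, x))       # net input
    err = t - y_in                                        # error signal (t - y_in)
    w_new = [wi + alpha * err * xi for wi, xi in zip(w, x)]
    b_new = b + alpha * err
    return w_new, b_new

# Example: the first ANDNOT step used later (w1=w2=b=0.2, alpha=0.2, x=(1,1), t=-1).
print(delta_update([0.2, 0.2], 0.2, [1, 1], -1, 0.2))     # approx. ([-0.12, -0.12], -0.12)
```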
Derivation (Delta Rule) for a single output unit
• The mean squared error for a particular training pattern is

E = Σj (tj - y_j-in)²

• Take the partial derivative of E with respect to each weight (the change in
error with weight w1j):

∂E/∂w1j = ∂/∂w1j Σj (tj - y_j-in)²

Since w1j influences the error only at output unit yj,

∂E/∂w1j = ∂/∂w1j (tj - y_j-in)²
        = 2(tj - y_j-in)(-1) ∂y_j-in/∂w1j
        = -2(tj - y_j-in) ∂y_j-in/∂w1j
        = -2(tj - y_j-in) x1

• Thus, the error is reduced most rapidly by adjusting the weights in the
direction opposite to this gradient, which gives the delta rule

Δw1 = α(t - y_in)·x1
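A quick numerical sanity check of this derivation (illustrative values, not from the slides): the centred finite difference of E with respect to w1 matches the analytic gradient -2(t - y_in)x1.

```python
# Numerical check of the gradient used in the delta-rule derivation:
# E = (t - y_in)^2 with y_in = w1*x1 + w2*x2, so dE/dw1 = -2*(t - y_in)*x1.
def error(w1, w2, x1, x2, t):
    y_in = w1 * x1 + w2 * x2
    return (t - y_in) ** 2

w1, w2, x1, x2, t, eps = 0.3, -0.2, 1.0, -1.0, 1.0, 1e-6
numeric = (error(w1 + eps, w2, x1, x2, t) - error(w1 - eps, w2, x1, x2, t)) / (2 * eps)
analytic = -2 * (t - (w1 * x1 + w2 * x2)) * x1
print(numeric, analytic)   # both approximately -1.0 here
```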
Architecture (SN Deepa)
• Like a single-layer neuron, the Adaline has only one output unit.
• The output unit receives input from several input units and also from a
bias, whose activation is always +1.
• Each input neuron is connected to the output neuron by a weighted
interconnection (w1, w2, …, wn).
• These weights change as the training progresses.
Algorithm
Step 1: To start the training process, the weights and the bias are initially
set to random values (small non-zero random values).
Step 2: While the stopping condition is false, do Steps 3-7.
Step 3: For each bipolar training pair s:t, perform Steps 4-6.
Step 4: Set activations of the input units: xi = si, for i = 1 to n.
Step 5: Compute the net input y_in. For an Adaline with inputs x1, x2, x3 and
one bias b, the net input is given by:

net input y_in = b + Σi wi·xi

The activation function is used to compute the output y (with threshold θ):

y = f(y_in) =  1  if y_in > θ
               0  if -θ ≤ y_in ≤ θ
              -1  if y_in < -θ

Algorithm…
Step 6: If t ≠ y, update the bias and weights, for i = 1 to n:
wi(new) = wi(old) + α(t - y_in)·xi
b(new)  = b(old) + α(t - y_in)
else
wi(new) = wi(old)
b(new)  = b(old)
Step 7: Test for the stopping condition.
Stopping conditions:
➢ The weight changes become sufficiently small.
➢ A pre-decided number of iterations/epochs is reached.
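A compact Python sketch of this training loop. For simplicity it applies the delta-rule update on every pattern (pure LMS) rather than gating on t ≠ y, and it uses a fixed number of epochs as the stopping condition; the ANDNOT data of the next example is assumed.

```python
# Adaline training loop (delta rule), following the steps above; bipolar data assumed.
import random

def adaline_train(samples, alpha=0.2, epochs=2, seed=0):
    rng = random.Random(seed)
    n = len(samples[0][0])
    w = [rng.uniform(-0.1, 0.1) for _ in range(n)]   # Step 1: small random weights
    b = rng.uniform(-0.1, 0.1)
    for _ in range(epochs):                          # Steps 2/7: fixed-epoch stopping rule
        for x, t in samples:                         # Step 3: each bipolar pair s:t
            y_in = b + sum(wi * xi for wi, xi in zip(w, x))      # Step 5: net input
            err = t - y_in
            w = [wi + alpha * err * xi for wi, xi in zip(w, x)]  # Step 6: delta rule
            b = b + alpha * err
    return w, b

# ANDNOT example used in the next slides (bipolar inputs and targets).
andnot = [((1, 1), -1), ((1, -1), 1), ((-1, 1), -1), ((-1, -1), -1)]
print(adaline_train(andnot))
```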
Example: Develop an Adaline network for the ANDNOT function with bipolar
inputs and targets. Find the final weights after the second epoch.

• Truth table                    • Output = (x1 ANDNOT x2), i.e. y = x1 · x̄2
x1   x2   Output
 1    1   -1
 1   -1    1
-1    1   -1
-1   -1   -1

Figure (Architecture): inputs x1 and x2 connected to the output unit y through
weights w1 and w2, plus a bias b from a unit whose activation is always 1.
• Step 1: Initially the weights and the bias are assigned a small random value,
say 0.2 each. The learning rate is α = 0.2.
• Step 2: The weights are adjusted until the least mean square error is obtained.
Consider w1 = w2 = b = 0.2 and learning rate α = 0.2.

• The operations are carried out for two epochs.
Epoch 1, first iteration:
• x1 = 1, x2 = 1, t = -1, α = 0.2
• y_in = w1·x1 + w2·x2 + b = 0.2·1 + 0.2·1 + 0.2 = 0.6
• t - y_in = -1 - 0.6 = -1.6

Weight and bias changes, Δwi = α·(t - y_in)·xi and Δb = α·(t - y_in):
Δw1 = 0.2·(-1.6)·1 = -0.32
Δw2 = 0.2·(-1.6)·1 = -0.32
Δb  = 0.2·(-1.6)   = -0.32

Updated weights:
w1 = 0.2 - 0.32 = -0.12
w2 = 0.2 - 0.32 = -0.12
b  = 0.2 - 0.32 = -0.12

Summary (first iteration):
w1 = 0.2, w2 = 0.2, b = 0.2, x1 = 1, x2 = 1, α = 0.2, t = -1
y_in = w1·x1 + w2·x2 + b = 0.6
t - y_in = -1.6
Δw1 = -0.32, Δw2 = -0.32, Δb = -0.32
w1 = -0.12, w2 = -0.12, b = -0.12
Second iteration:
• x1 = 1, x2 = -1, t = 1, α = 0.2
• y_in = w1·x1 + w2·x2 + b = (-0.12)(1) + (-0.12)(-1) + (-0.12) = -0.12
• t - y_in = 1 - (-0.12) = 1.12

Δw1 = 0.2·(1.12)·1    =  0.224
Δw2 = 0.2·(1.12)·(-1) = -0.224
Δb  = 0.2·(1.12)      =  0.224

Updated weights:
w1 = -0.12 + 0.224 =  0.104
w2 = -0.12 - 0.224 = -0.344
b  = -0.12 + 0.224 =  0.104
Error with the weights w1 = 0.55, w2 = -0.33, b = 0.43:

Input (x1, x2, b) | Net input y_in | t  | E = (t - y_in)²
(1, 1, 1)         |  0.65          | -1 | 2.72
(1, -1, 1)        |  1.31          |  1 | 0.09
(-1, 1, 1)        | -0.45          | -1 | 0.30
(-1, -1, 1)       |  0.21          | -1 | 1.46

Net error = 4.57

Thus, the error is reduced from 5.7 to 4.57 after the second iteration.
It can be reduced further by taking more iterations.
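The error table above can be reproduced with a few lines of Python (the sum comes out to about 4.59 with unrounded arithmetic; the slides round it to 4.57):

```python
# Reproduce the error table: squared error per pattern with w1=0.55, w2=-0.33, b=0.43.
w1, w2, b = 0.55, -0.33, 0.43
patterns = [((1, 1), -1), ((1, -1), 1), ((-1, 1), -1), ((-1, -1), -1)]

total = 0.0
for (x1, x2), t in patterns:
    y_in = w1 * x1 + w2 * x2 + b
    e = (t - y_in) ** 2
    total += e
    print(f"x=({x1:2d},{x2:2d})  y_in={y_in:5.2f}  E={e:.2f}")
print("net error ~", round(total, 2))   # about 4.59 (reported as 4.57 in the slides)
```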


Madaline Rule 1
Architecture (from :SN Deepa)
Algorithm
Madaline Rule II
Algorithm
Algorithm…
Example
Q. Form a Madaline network for the XOR function
with bipolar inputs and targets using the MR-I
algorithm.

• Solution :
x1 x2 target
1 1 -1
1 -1 1
-1 1 1
-1 -1 -1
A scatterplot with two features of the Iris dataset

The scatterplot shown in the figure is generated by combining two features of
the dataset, more specifically the petal width and sepal width, for each species.
From the figure we can get an overall view of the three varieties as three classes.
The perceptron needs to be trained on the available data, which must be linearly
separable. (Linear separability is a property of two sets of points: if there
exists at least one line in the plane that separates the two sets of points, then
they are linearly separable.) The records of Iris-versicolor and Iris-virginica,
for example, are not linearly separable, whereas Iris-setosa and Iris-versicolor
fit the rule.
The perceptron used for the classification of the Iris data is shown in the
figure. Four input features (x1, x2, x3 and x4), namely SepalLengthCm,
SepalWidthCm, PetalLengthCm and PetalWidthCm, are used as inputs for training
the network to classify the data into the three classes Iris-versicolor,
Iris-setosa and Iris-virginica.
The network consists of a bias, which acts as a weight on a connection from a
unit whose activation is always 1.
During the training, the weights are updated as:
If t ≠ y
    w(new) = w(old) + Δw
else
    w(new) = w(old);
    stop the training.
The diagram shows the perceptron's process of receiving inputs and combining
them with weights. After training, in fact, the perceptron has determined a set
of weights. New records can then go through the net input function, which is
defined as follows:
net = Σ(i=1 to 4) wi·xi + b

The sigmoid activation function is then applied for the proper classification
of the data. The sigmoid function takes net as its input and gives an output
within the range 0 to 1:

y = 1 / (1 + e^(-λ·net))

where λ decides the steepness of the curve.

The output y decides which class the record belongs to.
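A minimal sketch (not from the slides) of a single sigmoid neuron trained on the two linearly separable species, Iris-setosa and Iris-versicolor, using the four features; it assumes scikit-learn is available just to load the Iris data, and it standardizes the features for faster convergence.

```python
import numpy as np
from sklearn.datasets import load_iris

iris = load_iris()
mask = iris.target < 2                       # keep class 0 (setosa) and class 1 (versicolor)
X = iris.data[mask]
t = iris.target[mask].astype(float)
X = (X - X.mean(axis=0)) / X.std(axis=0)     # standardize the four features

rng = np.random.default_rng(0)
w = rng.normal(0.0, 0.01, 4)                 # small random initial weights
b, lam, alpha = 0.0, 1.0, 0.1                # bias, sigmoid steepness, learning rate

for _ in range(100):                         # simple gradient-style training loop
    net = X @ w + b                          # net = sum_i w_i x_i + b
    y = 1.0 / (1.0 + np.exp(-lam * net))     # sigmoid output in (0, 1)
    grad = (t - y) * y * (1.0 - y)           # error times sigmoid derivative
    w += alpha * X.T @ grad / len(X)
    b += alpha * grad.mean()

pred = ((X @ w + b) > 0.0).astype(float)     # y > 0.5 is equivalent to net > 0
print("training accuracy:", (pred == t).mean())   # should be 1.0 for these two classes
```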


Back Propagation
Features of Back-propagation
• In 1961, the basic concepts of continuous backpropagation were derived in
the context of control theory by Henry J. Kelley and Arthur E. Bryson.
• Backpropagation is short for "backward propagation of errors". It is a
standard method of training artificial neural networks.
• Backpropagation is fast, simple and easy to program.
• It is applied to feedforward artificial neural networks.
• Backpropagation simplifies the network structure by removing weighted
links that have a minimal effect on the trained network.
• It is especially useful for deep neural networks working on error-prone
projects, such as image or speech recognition.
• The biggest drawback of backpropagation is that it can be sensitive to
noisy data.
Advantages of Back propagation
• Back-propagation is fast, simple and easy to program.
• It has no parameters to tune apart from the number of inputs.
• It is a flexible method, as it does not require prior knowledge about the
network.
• It is a standard method that generally works well.
• It does not need any special mention of the features of the function to be
learned.
Types of Backpropagation Networks
The two types of backpropagation networks are:
• Static back-propagation: a backpropagation network that produces a
mapping from a static input to a static output. It is useful for solving
static classification problems such as optical character recognition.
• Recurrent backpropagation: the activity is fed forward until a fixed
value is achieved; after that, the error is computed and propagated
backward.
The main difference between the two methods is that the mapping is rapid
in static back-propagation, while it is non-static in recurrent
backpropagation.
Example:
inputs = [0.05, 0.10] and target outputs = [0.01, 0.99]
• Figure: the initial weights, the biases, and the training inputs/outputs.

Given training set: inputs 0.05 and 0.10, and target outputs 0.01 and 0.99.
The Forward Pass for Node H1
Here is how we calculate the total net input for h1 (the weighted sum of the
inputs plus the bias), after which the activation function is applied to get
the output of h1.

The Forward Pass for Node H2
Carrying out the same process for h2, we get its output in the same way.

The Forward Pass for Nodes O1 and O2
We repeat this process for the output-layer neurons, using the outputs of the
hidden-layer neurons as inputs.

Calculating the Total Error
Calculate the error for each output neuron using the squared error function
and sum them to get the total error.

For example, the target output for O1 is 0.01 but the neural network outputs
0.75136507, so its error can be computed as shown below.
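The numeric error for O1 can be completed from the quoted values, assuming the conventional E = ½(target − output)² form of the squared error used in this kind of worked example:

```python
# Squared error for output neuron O1, assuming E = 1/2*(target - out)^2;
# target and actual output are the values quoted in the text above.
target_o1, out_o1 = 0.01, 0.75136507
e_o1 = 0.5 * (target_o1 - out_o1) ** 2
print(e_o1)   # approximately 0.2748; E_total adds the corresponding O2 term
```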
The Backwards Pass
"Our goal with backpropagation is to update each of the weights in the network
so that they cause the actual output to be closer to the target output."
By applying the chain rule we know that, for an output-layer weight such as w5,

∂E_total/∂w5 = (∂E_total/∂out_o1) · (∂out_o1/∂net_o1) · (∂net_o1/∂w5)

First, how much does the total error change with respect to the output?
Next, how much does the output of O1 change with respect to its total net input?
The partial derivative of the logistic function is the output multiplied by one
minus the output.
Finally, how much does the total net input of O1 change with respect to w5?
Putting it all together, we multiply the three factors to obtain ∂E_total/∂w5.
To decrease the error, we then subtract this value (scaled by the learning rate)
from the current weight.
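A small sketch of these three chain-rule factors for w5. The hidden output out_h1, the initial value of w5 and the learning rate η are illustrative assumptions, since the slide images carrying the actual numbers are not reproduced here:

```python
# Chain rule for an output-layer weight w5 (illustrative values).
target_o1, out_o1, out_h1, eta = 0.01, 0.75136507, 0.5932, 0.5

dE_dout = -(target_o1 - out_o1)          # dE_total/dout_o1 for E = 1/2*(t - out)^2
dout_dnet = out_o1 * (1 - out_o1)        # logistic derivative: out * (1 - out)
dnet_dw5 = out_h1                        # net_o1 = ... + w5*out_h1, so d(net_o1)/d(w5) = out_h1

grad_w5 = dE_dout * dout_dnet * dnet_dw5
w5_new = 0.40 - eta * grad_w5            # subtract the gradient scaled by the learning rate
print(grad_w5, w5_new)
```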
Hidden Layer
Next, we continue the backwards pass by calculating new values for w1, w2, w3
and w4.
We know that out_h1 affects both out_o1 and out_o2, therefore ∂E_total/∂out_h1
needs to take into consideration its effect on both output neurons:

∂E_total/∂out_h1 = ∂E_o1/∂out_h1 + ∂E_o2/∂out_h1

Similarly, the second part can be calculated through the output equation, and
substituting the values gives ∂E_o1/∂out_h1. Following the same process for
∂E_o2/∂out_h1, and adding the two terms, we obtain ∂E_total/∂out_h1.

Now that we have ∂E_total/∂out_h1, we need to figure out ∂out_h1/∂net_h1 and
then ∂net_h1/∂w for each weight, exactly as was done for the output layer.
We can now update w1 (and, in the same way, w2, w3 and w4), as sketched below.
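Putting the whole example together, here is a compact sketch of one forward and backward pass through a 2-2-2 network of this kind. The concrete weights, biases and learning rate are illustrative assumptions (the slides' values are in images), but the update equations are the ones derived above:

```python
# End-to-end sketch of a 2-2-2 forward/backward pass in the style of this example.
import numpy as np

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

x = np.array([0.05, 0.10])               # inputs
t = np.array([0.01, 0.99])               # target outputs
W1 = np.array([[0.15, 0.20],             # hidden-layer weights (rows: h1, h2) - illustrative
               [0.25, 0.30]])
W2 = np.array([[0.40, 0.45],             # output-layer weights (rows: o1, o2) - illustrative
               [0.50, 0.55]])
b1, b2, eta = 0.35, 0.60, 0.5            # biases and learning rate - illustrative

# Forward pass
h = sigmoid(W1 @ x + b1)                 # hidden outputs out_h1, out_h2
o = sigmoid(W2 @ h + b2)                 # network outputs out_o1, out_o2
E = 0.5 * np.sum((t - o) ** 2)           # total squared error

# Backward pass (delta = dE/dnet for each layer)
delta_o = -(t - o) * o * (1 - o)         # output-layer deltas
delta_h = (W2.T @ delta_o) * h * (1 - h) # hidden-layer deltas via the chain rule

W2 -= eta * np.outer(delta_o, h)         # dE/dW2[i,j] = delta_o[i] * out_h[j]
W1 -= eta * np.outer(delta_h, x)         # dE/dW1[i,j] = delta_h[i] * x[j]
print("total error:", E)                 # about 0.2984 with these illustrative weights
```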
