0% found this document useful (0 votes)

21 views

AI31

Uploaded by

ANANTHI K

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

21 views

AI31

Uploaded by

ANANTHI K

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 13

ROHINI COLLEGE OF ENGINEERING AND TECHNOLOGY

5.2 PERCEPTRON IN MACHINE LEARNING

Perceptron is Machine Learning algorithm for supervised learning of various binary

classification tasks. Further, Perceptron is also understood as an Artificial Neuron or neural
network unit thathelps to detect certain input data computations in business intelligence.
Perceptron model is also treated as one of the best and simplest types of Artificial Neural
networks. However, it is a supervised learning algorithm of binary classifiers. Hence, we can
consider it as a single-layer neural network with four main parameters, i.e., input values, weights
and Bias, net sum, and an activation function.

Basic Components of Perceptron

Mr. Frank Rosenblatt invented the perceptron model as a binary classifier which contains
three main components. These are as follows:

Fig: 5.1

o Input Nodes or Input Layer:

This is the primary component of Perceptron which accepts the initial data into the system for
further processing. Each input node contains a real numerical value.
o Wight and Bias:

CS3491-ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING

ROHINI COLLEGE OF ENGINEERING AND TECHNOLOGY

Weight parameter represents the strength of the connection between units. This is another most
important parameter of Perceptron components. Weight is directly proportional to the strength
of the associated input neuron in deciding the output. Further, Bias can be considered as the
line of intercept in a linear equation.

o Activation Function:
These are the final and important components that help to determine whether the neuron
will fire or not. Activation Function can be considered primarily as a step function.
Types of Activation functions:
o Sign function

o Step function, and

o Sigmoid function

Fig:5.2

The data scientist uses the activation function to take a subjective decision based on various
problem statements and forms the desired outputs. Activation function may differ (e.g., Sign,
Step,and Sigmoid) in perceptron models by checking whether the learning process is slow or
has vanishing or exploding gradients.

How does Perceptron work?

In Machine Learning, Perceptron is considered as a single-layer neural network that

CS3491-ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING

ROHINI COLLEGE OF ENGINEERING AND TECHNOLOGY

consists of four main parameters named input values (Input nodes), weights and Bias, net sum,
and an activation function. The perceptron model begins with the multiplication of all input
values and their weights, then adds these values together to create the weighted sum. Then this
weighted sum is applied to the activation function 'f' to obtain the desired output. This
activation function is also known as the step function and is represented by 'f'.

Fig:5.3
This step function or Activation function plays a vital role in ensuring that output is mapped
between required values (0,1) or (-1,1). It is important to note that the weight of input is
indicative of the strength of a node. Similarly, an input's bias value gives the ability to shift the
activation function curve up or down.
Perceptron model works in two important steps as follows:

Step-1

CS3491-ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING

ROHINI COLLEGE OF ENGINEERING AND TECHNOLOGY

In the first step first, multiply all input values with corresponding weight values and then add
themto determine the weighted sum. Mathematically, we can calculate the weighted sum as
follows:

∑wixi = x1w1 + x2w2 +…wnxn

Add a special term called bias 'b' to this weighted sum to improve the model's performance.

∑wi*xi + b

Step-2

In the second step, an activation function is applied with the above-mentioned weighted sum,
which gives us output either in binary form or a continuous value as follows:

Y = f(∑wi*xi + b)

Types of Perceptron Models

Based on the layers, Perceptron models are divided into two types. These are as follows:
1. Single-layer Perceptron Model

2. Multi-layer Perceptron model

Single Layer Perceptron Model:

This is one of the easiest Artificial neural networks (ANN) types. A single-layered
perceptron model consists feed-forward network and also includes a threshold transfer function
inside the model. The main objective of the single-layer perceptron model is to analyze the
linearly separableobjects with binary outcomes.

In a single layer perceptron model, its algorithms do not contain recorded data, so it begins
with inconstantly allocated input for weight parameters. Further, it sums up all inputs (weight).

CS3491-ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING

ROHINI COLLEGE OF ENGINEERING AND TECHNOLOGY

After adding all inputs, if the total sum of all inputs is more than a pre-determined value, the
model gets activated and shows the output value as +1.

If the outcome is same as pre-determined or threshold value, then the performance of

this modelis stated as satisfied, and weight demand does not change. However, this model
consists of a few discrepancies triggered when multiple weight inputs values are fed into the
model. Hence, to find desired output and minimize errors, some changes should be necessary
for the weights input.

Multi-Layered Perceptron Model:

Like a single-layer perceptron model, a multi-layer perceptron model also has the same
model structure but has a greater number of hidden layers.
The multi-layer perceptron model is also known as the Backpropagation algorithm, which
executesin two stages as follows:

o Forward Stage: Activation functions start from the input layer in the forward stage and
terminate on the output layer.
o Backward Stage: In the backward stage, weight and bias values are modified as per
the model's requirement. In this stage, the error between actual output and demanded
originated backward on the output layer and ended on the input layer.

5.2.1 Multilayer Perceptron:

 Multilayer perceptron is one of the most commonly used machine learning method.
 The Multi-layer Perceptron network, consisting of multiple layers of connected
neurons.
 Multilayer perceptron is an artificial neural network structure and is a non-parametric
estimator that can be used for classification and regression.

CS3491-ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING

ROHINI COLLEGE OF ENGINEERING AND TECHNOLOGY

Fig: 5.4 The Multi-layer Perceptron network

 In the multi-layer perceptron diagram above, we can see that there are three inputs and
thusthree input nodes and the hidden layer has three nodes.
 The output layer gives two outputs, therefore there are two output nodes.
 The nodes in the input layer take input and forward it for further process, in the diagram
above the nodes in the input layer forwards their output to each of the three nodes in
the hidden layer, and in the same way, the hidden layer processes the information and
passesit to the output layer.
 Every node in the multi-layer perception uses a sigmoid activation function. The
sigmoid activation function takes real values as input and converts them to numbers
between 0 and 1 using the sigmoid formula.
 The most commonly used form of this function (where β is some positive parameter)
is:

 The multi-layer perceptron is also known as back propagation algorithm, which

executes in two stages as follows:
i. Forward stage:
In Figure 5.1, we start at the left by filling in the values for the inputs. We then use these
inputs and the first level of weights to calculate the activations of the hidden layer, and

CS3491-ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING

ROHINI COLLEGE OF ENGINEERING AND TECHNOLOGY

thenwe use those activations and the next set of weights to calculate the activations of
the output layer. Now that we’ve got the outputs of the network, we can compare them
to the targets and compute the error.
ii. Backward stage: BACK-PROPAGATION OF ERROR
Backpropagation, or backward propagation of errors, is an algorithm that is designed
to test for errors working back from output nodes to input nodes. The error function that
we used for the Perceptron was

Where N is the number of output nodes.

The Multi-layer Perceptron Algorithm:

The MLP training algorithm using back-propagation of error is described below:

1. an input vector is put into the input nodes
2. the inputs are fed forward through the network
• the inputs and the first-layer weights (here labelled as v) are used to decide whether
the hidden nodes fire or not. The activation function g(·) is the sigmoidfunction given
in

• the outputs of these neurons and the second-layer weights (labelled as w) areused to
decide if the output neurons fire or not
3. the error is computed as the sum-of-squares difference between the network outputsand
the targets

4. this error is fed backwards through the network in order to

• first update the second-layer weights

CS3491-ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING

ROHINI COLLEGE OF ENGINEERING AND TECHNOLOGY

• and then afterwards, the first-layer weights

Advantages of Multi-layer perceptron:

 It can be used to solve complex nonlinear problems.

 It handles large amounts of input data well.
 Makes quick predictions after training.

CS3491-ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING

ROHINI COLLEGE OF ENGINEERING AND TECHNOLOGY

 The same accuracy ratio can be achieved even with smaller samples.

Disadvantages of Multi-layer perceptron:

 In Multi-layer perceptron, computations are difficult and time-consuming.In

multi-layer Perceptron, it is difficult to predict how much the dependent
variableaffects each independent variable.
 The model functioning depends on the quality of the training.

5.2.2 Activation Functions:

 Artificial neurons are elementary units in an artificial neural network. The artificial
neuron receives one or more inputs and sums them to produce an output. Each input
isseparately weighted, and the sum is passed through a function known as an
activation function or transfer function.
 In an artificial neural network, the function which takes the incoming signals as input
andproduces the output signal is known as the activation function.

Fig: 5.5 Artificial neuron

x1, x2,……,xn : input signals

w1,w2,……,wn : weights associated with input signals
x0 : input signal taking the constant value 1
w0 : weight associated with x0 (called bias)
Ʃ : indicates summation of input signals

CS3491-ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING

ROHINI COLLEGE OF ENGINEERING AND TECHNOLOGY

f : function which produces the output

y : output signal

 The function f can be expressed in the following form:

Some simple activation functions

The following are some of the simple activation functions.
Threshold activation function
 The threshold activation function is defined by

 The graph of this function is shown as follows:

Fig: 5.6 Threshold activation function

2. Unit step functions:

 Sometimes, the threshold activation function is also defined as a unit step function in
which case it is called a unit-step activation function.
 This is defined as follows:

CS3491-ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING

ROHINI COLLEGE OF ENGINEERING AND TECHNOLOGY

 The graph of this function is shown as follows:

Fig: 5.7 Unit step functions

3. Sigmoid activation function (logistic function):
 One of the most commonly used activation functions is the sigmoid activation
function.

 It is a function which is plotted as ‘S’ shaped graph

 This is defined as follows:

 Value Range :- 0to +1

 Nature :- non-linear
 Uses : Usually used in output layer of a binary classification, where result is either 0 or
1, as value for sigmoid function lies between 0 and 1 only so, result can be predicted
easilyto be 1 if value is greater than 0.5 and 0 otherwise.
 The graph of this function is shown as follows:

Fig: 5.8 Sigmoid activation function

4. Linear activation function

 The linear activation function is defined by

CS3491-ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING

ROHINI COLLEGE OF ENGINEERING AND TECHNOLOGY

F(x) = mx + c
 This defines a straight line in the xy-plane.

Fig: 5.9 Linear activation function

5. Tanh or Hyperbolic tangential activation function

 The activation that works almost always better than sigmoid function is Tanh function
alsoknown as Tangent Hyperbolic function. It’s actually mathematically shifted version
of the sigmoid function. Both are similar and can be derived from each other.
 Value Range :- -1 to +1
 Nature :- non-linear

 Uses :- Usually used in hidden layers of a neural network as it’s values lies between -
1 to1 hence the mean for the hidden layer comes out be 0 or very close to it, hence
helps in centering the data by bringing mean close to 0. This makes learning for the next
layer mucheasier.
 This is defined by

Fig: 5.10 Hyperbolic tangent activation function

CS3491-ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING

ROHINI COLLEGE OF ENGINEERING AND TECHNOLOGY

6. RELU Activation Function

It Stands for Rectified linear unit. It is the most widely used activation function. Chiefly
implemented in hidden layers of Neural network.

Equation :- A(x) = max(0,x). It gives an output x if x is positive and 0 otherwise.Value

Range :- [0, inf)
Nature :- non-linear, which means we can easily backpropagate the errors and have multiple
layersof neurons being activated by the ReLU function.

Uses :- ReLu is less computationally expensive than tanh and sigmoid because it involves
simpler mathematical operations. At a time only a few neurons are activated making the
network sparse making it efficient and easy for computation.

Fig: 5.11 RELU activation Function

CS3491-ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING

CS3351 AIML UNIT 5 NOTES EduEngg
No ratings yet
CS3351 AIML UNIT 5 NOTES EduEngg
35 pages
unit 5
No ratings yet
unit 5
46 pages
ML UNIT 3-2-18
No ratings yet
ML UNIT 3-2-18
17 pages
Unit 4
No ratings yet
Unit 4
9 pages
Unit 3
No ratings yet
Unit 3
29 pages
chp1 NN, MLFFN, weight, bias, threshold, activation fn, loss fn
No ratings yet
chp1 NN, MLFFN, weight, bias, threshold, activation fn, loss fn
19 pages
3 - Perceptron in Machine Learning
No ratings yet
3 - Perceptron in Machine Learning
7 pages
UNIT-II MLT1
No ratings yet
UNIT-II MLT1
45 pages
Unit 2
No ratings yet
Unit 2
15 pages
Perceptrons
No ratings yet
Perceptrons
8 pages
1 - Perceptron in Machine Learning
No ratings yet
1 - Perceptron in Machine Learning
6 pages
Unit VML
No ratings yet
Unit VML
14 pages
unit2ml-230101150634-5590aaef
No ratings yet
unit2ml-230101150634-5590aaef
202 pages
ADVANCED_SUPERVISED_LEARNING[1]
No ratings yet
ADVANCED_SUPERVISED_LEARNING[1]
17 pages
3rd Lecture
No ratings yet
3rd Lecture
21 pages
DL2_Perceptron.pptx
No ratings yet
DL2_Perceptron.pptx
14 pages
NN Unit 2
No ratings yet
NN Unit 2
20 pages
Percept Ron
No ratings yet
Percept Ron
49 pages
The Perceptrons
No ratings yet
The Perceptrons
41 pages
perceptron
No ratings yet
perceptron
32 pages
A Presentation On: By: Edutechlearners
No ratings yet
A Presentation On: By: Edutechlearners
33 pages
Unit 1 NNDL
No ratings yet
Unit 1 NNDL
8 pages
Aiml Unit 5
No ratings yet
Aiml Unit 5
16 pages
1 Neural Networks
No ratings yet
1 Neural Networks
16 pages
DL Unit 2
No ratings yet
DL Unit 2
107 pages
Unit - 2
No ratings yet
Unit - 2
24 pages
Unit II - Perceptron
No ratings yet
Unit II - Perceptron
20 pages
4.3 Perceptron and MFFN
No ratings yet
4.3 Perceptron and MFFN
12 pages
Percptron
No ratings yet
Percptron
25 pages
AIML-UNIT-5
No ratings yet
AIML-UNIT-5
34 pages
Group 6 Perceptron
No ratings yet
Group 6 Perceptron
23 pages
ML Unit4
No ratings yet
ML Unit4
38 pages
unit-3_ml[1]
No ratings yet
unit-3_ml[1]
21 pages
Unit_2
No ratings yet
Unit_2
20 pages
Unit 1
No ratings yet
Unit 1
19 pages
Perceptron Notes
No ratings yet
Perceptron Notes
4 pages
20200428135045cfbc718e2c (1)
No ratings yet
20200428135045cfbc718e2c (1)
30 pages
Ann Mid1: Artificial Neural Networks With Biological Neural Network - Similarity
No ratings yet
Ann Mid1: Artificial Neural Networks With Biological Neural Network - Similarity
13 pages
Machine Learning
No ratings yet
Machine Learning
13 pages
ML Unit 5
No ratings yet
ML Unit 5
33 pages
MODULE 1 DL
No ratings yet
MODULE 1 DL
6 pages
Deep Learning Unit1
No ratings yet
Deep Learning Unit1
25 pages
AI - II - Cihan - Lect 6 PDF
No ratings yet
AI - II - Cihan - Lect 6 PDF
31 pages
Unit 1 Until MLP
No ratings yet
Unit 1 Until MLP
56 pages
Percept Ron
No ratings yet
Percept Ron
15 pages
Unit - II ML
No ratings yet
Unit - II ML
9 pages
ANN MODULE 1 Part2
No ratings yet
ANN MODULE 1 Part2
58 pages
ML UNIT 2
No ratings yet
ML UNIT 2
23 pages
Soft Computing Unit 2 Notes..
No ratings yet
Soft Computing Unit 2 Notes..
24 pages
DL Question Bank Answers
No ratings yet
DL Question Bank Answers
55 pages
Session XX - Neural Network
No ratings yet
Session XX - Neural Network
43 pages
AI: Neural Network For Beginners (Part 1 of 3) : Sacha Barber
No ratings yet
AI: Neural Network For Beginners (Part 1 of 3) : Sacha Barber
9 pages
ML Exp-7
No ratings yet
ML Exp-7
5 pages
Lecture 19 NN
No ratings yet
Lecture 19 NN
32 pages
Lecture 19 NN
No ratings yet
Lecture 19 NN
32 pages
Deep Leaning
No ratings yet
Deep Leaning
117 pages
Supervised ANN
No ratings yet
Supervised ANN
19 pages
unitV (1)
No ratings yet
unitV (1)
29 pages
Backpropagation: Fundamentals and Applications for Preparing Data for Training in Deep Learning
From Everand
Backpropagation: Fundamentals and Applications for Preparing Data for Training in Deep Learning
Fouad Sabry
No ratings yet
Digital Circuit Simulation Using Excel
From Everand
Digital Circuit Simulation Using Excel
Anthony Mazzurco
No ratings yet
AI10
No ratings yet
AI10
5 pages
AI18
No ratings yet
AI18
11 pages
AI22
No ratings yet
AI22
3 pages
AI24
No ratings yet
AI24
4 pages
Smart Data Monitoring System For Power Loom Using IOT
No ratings yet
Smart Data Monitoring System For Power Loom Using IOT
6 pages
HPC Practical 2025 (3)
No ratings yet
HPC Practical 2025 (3)
19 pages
Applied Numerical Methods: Dr. Khaled Ahmida Ashouri
No ratings yet
Applied Numerical Methods: Dr. Khaled Ahmida Ashouri
12 pages
Generative AI (21CS733) AAT-1 Final Marks
No ratings yet
Generative AI (21CS733) AAT-1 Final Marks
8 pages
Newton Raphson Method:: Find Roots of The Equation
No ratings yet
Newton Raphson Method:: Find Roots of The Equation
15 pages
CM mid sem 2
No ratings yet
CM mid sem 2
1 page
Association
No ratings yet
Association
40 pages
Product of Monomial
No ratings yet
Product of Monomial
3 pages
Lecture 6
No ratings yet
Lecture 6
90 pages
What Is Backpropagation
No ratings yet
What Is Backpropagation
8 pages
ADA Techmax Searchable
No ratings yet
ADA Techmax Searchable
100 pages
Daa Imp QP
No ratings yet
Daa Imp QP
4 pages
3. LPP CLASS XI 2024-25
No ratings yet
3. LPP CLASS XI 2024-25
2 pages
FEM Assignment 3
100% (1)
FEM Assignment 3
11 pages
Design and Analysis of Algorithm Practicals
No ratings yet
Design and Analysis of Algorithm Practicals
93 pages
Worksheet Sig Fig 9 11 08 PDF
No ratings yet
Worksheet Sig Fig 9 11 08 PDF
2 pages
Assign 1 MTH308 Sol
No ratings yet
Assign 1 MTH308 Sol
3 pages
Thesis
No ratings yet
Thesis
115 pages
CH 05 Transportation Model and Its Variance
No ratings yet
CH 05 Transportation Model and Its Variance
70 pages
Chapter 1 Introduction
No ratings yet
Chapter 1 Introduction
18 pages
ML DSBA Lab7
No ratings yet
ML DSBA Lab7
6 pages
AI Python Lab Report CSIT 4th Semester Part II
No ratings yet
AI Python Lab Report CSIT 4th Semester Part II
28 pages
Maths Formula Part 1
No ratings yet
Maths Formula Part 1
25 pages
13 - Polynomial and Rational Functions - Finding X and y Intercepts Given A Polynomial Function
No ratings yet
13 - Polynomial and Rational Functions - Finding X and y Intercepts Given A Polynomial Function
2 pages
10th Mathematics MCQ
No ratings yet
10th Mathematics MCQ
7 pages
Chapter3 ProblemSolvingBySearching
No ratings yet
Chapter3 ProblemSolvingBySearching
61 pages
Week 12 - Integer Programming - Part 1
No ratings yet
Week 12 - Integer Programming - Part 1
28 pages
NEURAL NETWORKS Basics Using Matlab
100% (2)
NEURAL NETWORKS Basics Using Matlab
51 pages
CH 3 - Linear System of Equations
No ratings yet
CH 3 - Linear System of Equations
15 pages
Operation Research
No ratings yet
Operation Research
20 pages
Coding Using MATLAB
No ratings yet
Coding Using MATLAB
27 pages

AI31

Uploaded by

AI31

Uploaded by

ROHINI COLLEGE OF ENGINEERING AND TECHNOLOGY

5.2 PERCEPTRON IN MACHINE LEARNING

Perceptron is Machine Learning algorithm for supervised learning of various binary

Basic Components of Perceptron

o Input Nodes or Input Layer:

CS3491-ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING

o Step function, and

How does Perceptron work?

CS3491-ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING

CS3491-ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING

∑wi*xi = x1*w1 + x2*w2 +…wn*xn

Types of Perceptron Models

2. Multi-layer Perceptron model

Single Layer Perceptron Model:

CS3491-ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING

If the outcome is same as pre-determined or threshold value, then the performance of

Multi-Layered Perceptron Model:

5.2.1 Multilayer Perceptron:

CS3491-ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING

Fig: 5.4 The Multi-layer Perceptron network

 The multi-layer perceptron is also known as back propagation algorithm, which

CS3491-ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING

Where N is the number of output nodes.

The Multi-layer Perceptron Algorithm:

The MLP training algorithm using back-propagation of error is described below:

4. this error is fed backwards through the network in order to

CS3491-ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING

• and then afterwards, the first-layer weights

Advantages of Multi-layer perceptron:

 It can be used to solve complex nonlinear problems.

CS3491-ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING

Disadvantages of Multi-layer perceptron:

 In Multi-layer perceptron, computations are difficult and time-consuming.In

5.2.2 Activation Functions:

Fig: 5.5 Artificial neuron

x1, x2,……,xn : input signals

CS3491-ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING

f : function which produces the output

 The function f can be expressed in the following form:

Some simple activation functions

 The graph of this function is shown as follows:

Fig: 5.6 Threshold activation function

2. Unit step functions:

CS3491-ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING

 The graph of this function is shown as follows:

Fig: 5.7 Unit step functions

 It is a function which is plotted as ‘S’ shaped graph

 Value Range :- 0to +1

Fig: 5.8 Sigmoid activation function

4. Linear activation function

CS3491-ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING

Fig: 5.9 Linear activation function

5. Tanh or Hyperbolic tangential activation function

Fig: 5.10 Hyperbolic tangent activation function

CS3491-ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING

6. RELU Activation Function

Equation :- A(x) = max(0,x). It gives an output x if x is positive and 0 otherwise.Value

Fig: 5.11 RELU activation Function

CS3491-ARTIFICIAL INTELLIGENCE AND MACHINE LEARNING

You might also like

∑wixi = x1w1 + x2w2 +…wnxn