9-MLP-EXAMPLE-08-08-2024

The document discusses deep learning concepts, focusing on deep neural networks and the multilayer perceptron (MLP) architecture. It explains the perceptron as a linear classifier, the limitations of single-layer perceptrons in solving non-linearly separable problems like XOR, and introduces backpropagation for weight updates in MLPs. Additionally, it covers practical applications of MLPs in sentiment analysis and image classification using the CIFAR-10 dataset, highlighting the importance of model architecture and preprocessing steps.

CSE4006 - Deep Learning
Dr. D. Sumathi
Deep Neural Networks
Multilayer Perceptron - Gradient-Based Learning - Backpropagation Algorithm -
Regularization for Deep Learning - Optimization for Training Deep Models
RECAP
• The perceptron is a classification algorithm. Specifically, it works as
a linear binary classifier. It was invented in the late 1950s by Frank
Rosenblatt.
• The perceptron basically works as a threshold function: inputs whose weighted sum is non-negative are put into one class, while the rest are put into the other class.
Perceptron – Components
• Input nodes
• Output node
• An activation function
• Weights and biases
• Error function
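• A minimal sketch of these components in Python (the function names step, predict, and error are illustrative, not from the slides):

import numpy as np

def step(z):
    # Activation function: threshold at zero
    return 1 if z >= 0 else 0

def predict(x, w, b):
    # Output node: weighted sum of the input nodes plus the bias,
    # passed through the activation function
    return step(np.dot(w, x) + b)

def error(target, prediction):
    # Error function: difference between the target and the prediction
    return target - prediction

# Two input nodes with hand-picked weights and bias (this happens to implement AND)
w = np.array([0.5, 0.5])
b = -0.7
print(predict(np.array([1, 1]), w, b))   # 1
print(predict(np.array([1, 0]), w, b))   # 0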
Classification
Implement linear classification in terms of AND, OR gates.
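• A sketch of how a perceptron can learn the AND gate with the classic perceptron learning rule (OR works the same way with targets [0, 1, 1, 1]); the variable names and hyperparameters are illustrative:

import numpy as np

X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]])
y_and = np.array([0, 0, 0, 1])        # AND targets

w, b, lr = np.zeros(2), 0.0, 0.1
for _ in range(20):                   # a few passes over the data
    for x, t in zip(X, y_and):
        pred = 1 if np.dot(w, x) + b >= 0 else 0
        w += lr * (t - pred) * x      # perceptron weight update
        b += lr * (t - pred)          # bias update

print([1 if np.dot(w, x) + b >= 0 else 0 for x in X])   # expected [0, 0, 0, 1]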
Model to mimic the XOR problem
Attempt #1: The Single-Layer Perceptron
• A perceptron can only converge on linearly separable data, so it is not capable of imitating the XOR function.
• A single perceptron draws one linear decision boundary that must correctly classify the entire training data in one go.
• Non-linearity allows for more complex decision boundaries; for the XOR data, a potential decision boundary is a curve that separates (0, 1) and (1, 0) from (0, 0) and (1, 1).
The 2D XOR problem - Attempt #2
• Imitating the XOR function would require a non-linear decision
boundary.
• The XOR problem with neural networks can be solved by using Multi-
Layer Perceptrons or a neural network architecture with an input
layer, hidden layer, and output layer.
• During the forward pass, the inputs flow through the hidden layer to the output; during backpropagation, the weights of each layer are updated, and the trained network reproduces the XOR logic.
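• A minimal sketch of an MLP learning XOR with scikit-learn's MLPClassifier (the hyperparameters below are illustrative, not from the slides):

from sklearn.neural_network import MLPClassifier

X = [[0, 0], [0, 1], [1, 0], [1, 1]]
y = [0, 1, 1, 0]                      # XOR targets

# One small hidden layer is enough to bend the decision boundary
mlp = MLPClassifier(hidden_layer_sizes=(4,), activation='tanh',
                    solver='lbfgs', max_iter=2000, random_state=1)
mlp.fit(X, y)
print(mlp.predict(X))                 # typically [0 1 1 0]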
Perceptron to solve the XOR problem
• Out of all the 2-input logic gates, the XOR and XNOR gates are the only ones that are not linearly separable.
• A potential decision boundary therefore has to be non-linear to classify the XOR outputs correctly.
The Multi-layered Perceptron
• Components are:
• input and output nodes,
• activation function,
• weights and
• biases.
• An MLP can have several hidden layers; by definition it must have at least one hidden layer.
• Activation functions should be differentiable, so that a network’s
parameters can be updated using backpropagation.
• Though the output generation process is a direct extension of that of the
perceptron, updating weights isn’t so straightforward.
• Backpropagation is an algorithm for updating the weights and biases of a model based on their gradients with respect to the error function, starting from the output layer and working back to the first layer.
MLP Architecture
• The architecture of a network refers to its general structure: the number of hidden layers, the number of nodes in each layer, and how these nodes are interconnected.
Example 1
For w1 (with respect to E1), the gradient is obtained by the chain rule and used to compute the new value of w1.
Process
• update all the old weights with these new weights.
• Once the weights are updated, one backpropagation cycle is finished.
• Now the forward pass is done and the total new error is computed.
• And based on this newly computed total error the weights are again
updated.
• This goes on until the loss value converges to a minimum.
• This way a neural network starts with random values for its weights and finally converges to optimal values.
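• A minimal NumPy sketch of this cycle: forward pass, error, backpropagated gradients, weight update, repeated until the loss settles (the network size, learning rate, and the use of XOR as the training data are assumptions for illustration):

import numpy as np

rng = np.random.default_rng(0)
X = np.array([[0, 0], [0, 1], [1, 0], [1, 1]], dtype=float)
y = np.array([[0], [1], [1], [0]], dtype=float)

W1, b1 = rng.normal(size=(2, 4)), np.zeros((1, 4))   # random starting weights
W2, b2 = rng.normal(size=(4, 1)), np.zeros((1, 1))
sigmoid = lambda z: 1.0 / (1.0 + np.exp(-z))
lr = 0.5

for epoch in range(10000):
    # forward pass
    h = sigmoid(X @ W1 + b1)
    out = sigmoid(h @ W2 + b2)
    # backward pass: gradients of the squared error w.r.t. each layer
    d_out = (out - y) * out * (1 - out)
    d_h = (d_out @ W2.T) * h * (1 - h)
    # update the old weights with the new ones
    W2 -= lr * (h.T @ d_out); b2 -= lr * d_out.sum(axis=0, keepdims=True)
    W1 -= lr * (X.T @ d_h);   b1 -= lr * d_h.sum(axis=0, keepdims=True)

print(out.round(2).ravel())   # should approach [0, 1, 1, 0]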
Key takeaways
• #1 Adding more layers or nodes: this gives increasingly complex decision boundaries, which can also lead to overfitting, where a model achieves very high accuracy on the training data but fails to generalize.
• #2 Choosing a loss function: the chosen loss makes assumptions about the data (such as it being Gaussian) and is not always convex for a classification problem.
Using Perceptron for Sentiment Analysis
• With the final labels assigned to the entire corpus, you decided to fit the data to a Perceptron, the simplest neural network of all.
• Encode the text from the guestbooks as a vector using Term Frequency-Inverse Document Frequency (TF-IDF). This method encodes any kind of text as a statistic of how frequent each word, or term, is in each sentence and in the entire document.
• In Python, use the TfidfVectorizer class from scikit-learn.
• Remove English stop-words and apply L1 normalization:

from sklearn.feature_extraction.text import TfidfVectorizer
TfidfVectorizer(stop_words='english', lowercase=True, norm='l1')
Step 1: Corpus is initialized along with targets
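• The corpus itself is not reproduced on the slide; a hypothetical stand-in, showing how the targets and TF-IDF features would be produced:

from sklearn.feature_extraction.text import TfidfVectorizer

# Hypothetical guestbook sentences and sentiment targets (1 = positive, 0 = negative)
corpus = ["the stay was wonderful and the staff were friendly",
          "terrible room, noisy and dirty",
          "lovely view, great breakfast",
          "worst hotel experience ever"]
targets = [1, 0, 1, 0]

vectorizer = TfidfVectorizer(stop_words='english', lowercase=True, norm='l1')
features = vectorizer.fit_transform(corpus)
print(features.shape)   # (number of sentences, vocabulary size)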
How would MultiLayer Perceptron perform
in this case?
• Activation function: ReLU, specified with the parameter activation=’relu’
• Optimization function: Stochastic Gradient Descent, specified with the
parameter solver=’sgd’
• Learning rate: Inverse Scaling, specified with the
parameter learning_rate=’invscaling’
• Number of iterations: 20, specified with the
parameter max_iter=20
• In this example, the Multilayer Perceptron is set up with three hidden layers, but you want to see how the number of neurons in each layer impacts performance.
• Here the code started with 2 neurons per hidden layer, setting the parameter num_neurons=2.
• Finally, to see the value of the loss function at each iteration, you also
added the parameter verbose=True.
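• The body of buildMLPerceptron is not shown in the slides; a plausible sketch using scikit-learn's MLPClassifier with the parameters listed above (the three hidden layers of num_neurons units each and the accuracy reporting are assumptions):

from sklearn.neural_network import MLPClassifier
from sklearn.metrics import accuracy_score

def buildMLPerceptron(train_features, test_features, train_targets,
                      test_targets, num_neurons=2):
    model = MLPClassifier(hidden_layer_sizes=(num_neurons,) * 3,
                          activation='relu', solver='sgd',
                          learning_rate='invscaling', max_iter=20,
                          verbose=True)
    model.fit(train_features, train_targets)
    predictions = model.predict(test_features)
    print("Mean accuracy:", accuracy_score(test_targets, predictions))
    return model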
• What about if you added more capacity to the neural network? What
happens when each hidden layer has more neurons to learn the patterns of
the dataset?
• Simply change the num_neurons parameter and set it, for instance, to 5:

buildMLPerceptron(train_features, test_features, train_targets,
                  test_targets, num_neurons=5)
• Adding more neurons to the hidden layers definitely improved model accuracy!
Inferences
• Same neural network structure, 3 hidden layers, but with the increased computational power of the 5 neurons, the model got better at understanding the patterns in the data.
• It converged much faster and mean accuracy doubled!
• In the end, for this specific case and dataset, the Multilayer Perceptron
performs as well as a simple Perceptron. But it was definitely a great
exercise to see how changing the number of neurons in each hidden layer impacts model performance.
Example 2: CIFAR-10 dataset - deploy MLP
• Step 1: Preparing the Data for Training the MLP Network
• Pixel scaling is an important preprocessing step that is often applied to
the input data
• In image classification tasks, the pixel values in the images can range
from 0 to 255 (for 8-bit images).
• Scaling the pixel values to be between 0 and 1 can make the model
training process more stable and efficient. This can be done by dividing
each pixel value by 255.
from tensorflow.keras.datasets import cifar10
from tensorflow.keras.utils import to_categorical

# Load the CIFAR-10 dataset
(x_train, y_train), (x_test, y_test) = cifar10.load_data()

# Scale the pixel values to between 0 and 1
x_train = x_train / 255.0
x_test = x_test / 255.0

# Convert the labels to one-hot encoding so the predicted probabilities
# can easily be compared to the true labels
y_train = to_categorical(y_train)
y_test = to_categorical(y_test)
Defining the MLP Model Architecture
• Using the Sequential model
• Using the Functional API
• In the next slide, let us see the Keras Sequential model structure for the MLP.
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense, Flatten

# Create a Sequential model
model = Sequential()

# Add a Flatten layer to flatten the input image
model.add(Flatten(input_shape=(32, 32, 3)))

# Add two dense layers with 200 and 150 units and 'relu' activation
model.add(Dense(200, activation='relu'))
model.add(Dense(150, activation='relu'))

# Add a softmax output layer with 10 units
model.add(Dense(10, activation='softmax'))

# Print the model summary
model.summary()
Model summary
Keras Functional API Model Structure for MLP

from tensorflow.keras.layers import Input, Dense, Flatten
from tensorflow.keras.models import Model

# Define the input layer
inputs = Input(shape=(32, 32, 3))

# Flatten the input image
x = Flatten()(inputs)

# Add two dense layers with 200 and 150 units and 'relu' activation
x = Dense(200, activation='relu')(x)
x = Dense(150, activation='relu')(x)

# Add a softmax output layer with 10 units
outputs = Dense(10, activation='softmax')(x)

# Create the model
model = Model(inputs=inputs, outputs=outputs)

# Print the model summary
model.summary()
Compile and Train the MLP Model

from tensorflow.keras import optimizers

opt = optimizers.Adam(learning_rate=0.0005)
model.compile(loss='categorical_crossentropy', optimizer=opt,
              metrics=['accuracy'])
model.fit(x_train, y_train, batch_size=32, epochs=10, shuffle=True)
Evaluate the MLP Model
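• The evaluation code is not shown on the slide; a minimal sketch, assuming the compiled model and the scaled, one-hot-encoded test data from the earlier steps:

test_loss, test_acc = model.evaluate(x_test, y_test, batch_size=32)
print('Test accuracy:', test_acc)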
Functional API vs Sequential
• The Functional API method in Keras is recommended for creating
more complex models that have multiple inputs, multiple outputs, or
require layers to share connections.
• The Sequential model, on the other hand, is suitable for creating
simple models where the layers are stacked linearly.
