2B MultiLayer Perceptron Assignment
Duke Community Standard: By typing your name below, you are certifying that you
have adhered to the Duke Community Standard in completing this assignment.
Name:
The simple logistic regression example we went over in the previous notebook is essentially a one-
layer neural network, projecting straight from the input to the output predictions. While this can
be effective for linearly separable data, occasionally a little more complexity is necessary. Neural
networks with additional layers are typically able to learn more complex functions, leading to better
performance. These additional layers (called “hidden” layers) transform the input into one or more
intermediate representations before making a final prediction.
In the logistic regression example, the way we performed the transformation was with a fully-
connected layer, which consisted of a linear transform (matrix multiply plus a bias). A neural
network consisting of multiple successive fully-connected layers is commonly called a Multi-Layer
Perceptron (MLP). In the simple MLP below, a 4-d input is projected to a 5-d hidden representation,
which is then projected to a single output that is used to make the final prediction.
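As a concrete sketch of that structure in code (the variable names are just for illustration, and this toy model is not part of the assignment):
[ ]: import torch
import torch.nn as nn

# Toy MLP from above: 4-d input -> 5-d hidden -> 1-d output
hidden_layer = nn.Linear(4, 5)   # linear transform: matrix multiply plus bias
output_layer = nn.Linear(5, 1)

x = torch.rand(1, 4)             # one 4-d input example
h = hidden_layer(x)              # 5-d hidden representation
y = output_layer(h)              # single output used for the prediction
print(y.shape)                   # torch.Size([1, 1])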
For the assignment, you will be building an MLP for MNIST. Mechanically, this is done very similarly
to our logistic regression example, but instead of going straight to a 10-d vector representing our
output predictions, we might first transform to a 500-d vector with a “hidden” layer, then to the
output of dimension 10. Before you do so, however, there’s one more important thing to consider.
1.0.2 Nonlinearities
We typically include nonlinearities between layers of a neural network. There are a number of reasons
to do so. For one, without anything nonlinear between them, successive linear transforms (fully
connected layers) collapse into a single linear transform, which means the model isn’t any more
expressive than a single layer. On the other hand, intermediate nonlinearities prevent this collapse,
allowing neural networks to approximate more complex functions.
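To see why the collapse happens, compose two fully-connected layers with weights W1, W2 and biases b1, b2:
W2(W1x + b1) + b2 = (W2W1)x + (W2b1 + b2)
The result is a single linear transform with weight W2W1 and bias W2b1 + b2, no matter how many purely linear layers you stack.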
There are a number of nonlinearities commonly used in neural networks, but one of the most
popular is the rectified linear unit (ReLU):
ReLU(x) = max(0, x) (1)
There are a number of ways to implement this in PyTorch. We could do it with elementary PyTorch
operations:
[ ]: import torch
x = torch.rand(5, 3)*2 - 1
x_relu_max = torch.max(torch.zeros_like(x), x)
print("x: {}".format(x))
print("x after ReLU with max: {}".format(x_relu_max))
Of course, PyTorch also provides ReLU out of the box, for example in torch.nn.functional:
[ ]: import torch.nn.functional as F
x_relu_F = F.relu(x)
Same result.
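Two other equivalent routes are clamping at zero and the torch.nn.ReLU module (the module form is convenient inside nn.Sequential):
[ ]: x_relu_clamp = x.clamp(min=0)          # clamp negative entries to zero
x_relu_module = torch.nn.ReLU()(x)     # ReLU as an nn.Module
print(torch.equal(x_relu_max, x_relu_clamp))   # True
print(torch.equal(x_relu_max, x_relu_module))  # True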
1.0.3 Assignment
Build a 2-layer MLP for MNIST digit classification. Feel free to play around with the model
architecture and see how the training time/performance changes, but to begin, try the following:
Image (784 dimensions) ->
fully connected layer (500 hidden units) -> nonlinearity (ReLU) ->
fully connected (10 hidden units) -> softmax
Try building the model first with basic PyTorch operations, and then again with the more
object-oriented, higher-level APIs. You should get similar results!
Some hints:
- Even as we add additional layers, we still only require a single optimizer to learn the parameters. Just make sure to pass all parameters to it!
- As you’ll calculate in the Short Answer, this MLP model has many more parameters than the logistic regression example, which makes it more challenging to learn. To get the best performance, you may want to play with the learning rate and increase the number of training epochs.
- Be careful using torch.nn.CrossEntropyLoss(). If you look at the PyTorch documentation, you’ll see that torch.nn.CrossEntropyLoss() combines the softmax operation with the cross-entropy. This means you need to pass in the logits (predictions pre-softmax) to this loss. Computing the softmax separately and feeding the result into torch.nn.CrossEntropyLoss() will significantly degrade your model’s performance! A sketch of correct usage follows below.
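Here is a minimal sketch of that correct usage (the batch size and tensors are hypothetical placeholders for illustration):
[ ]: import torch
import torch.nn as nn

criterion = nn.CrossEntropyLoss()
logits = torch.randn(32, 10)            # hypothetical pre-softmax outputs for a batch of 32
labels = torch.randint(0, 10, (32,))    # hypothetical integer class labels
loss = criterion(logits, labels)        # applies softmax + cross-entropy internally
print(loss.item())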
[ ]: ### YOUR CODE HERE
# Make sure to print out your accuracy on the test set at the end.
How many trainable parameters does your model have? How does this compare to the logistic
regression example?
[Your answer here]
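If you’d like to sanity-check your hand calculation programmatically, a one-line sketch (assuming model is the torch.nn.Module you built above):
[ ]: # 'model' is assumed to be the nn.Module you built above
num_params = sum(p.numel() for p in model.parameters() if p.requires_grad)
print("Trainable parameters: {}".format(num_params))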