Report: Implementing a Simple Neural Network for MNIST Dataset

1. Introduction

1.1 Objective
The primary goal of this assignment was to implement a simple neural network with up to five
layers to classify handwritten digits from the MNIST dataset. This involved designing the
architecture, implementing both forward and backward propagation, and optimizing the weights. The implementation was written entirely in Python with the NumPy library, without relying on pre-built frameworks such as PyTorch or TensorFlow.
1.2 Dataset
The MNIST dataset consists of 60,000 training images and 10,000 testing images of handwritten
digits, each labeled as a number between 0 and 9. The images are grayscale with a resolution of
28 \times 28 pixels.
2. Methodology
2.1 Data Preparation
The dataset was processed to make it suitable for training:
• Images were normalized to values between 0 and 1 for consistency.
• Labels were extracted as integers ranging from 0 to 9.
• Mini-batches were created using the create_mini_batches function, which shuffled
the data to ensure unbiased training.
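As an illustration of this batching step, the following is a minimal NumPy sketch of a helper with the signature used above; the batch size of 64 and the normalization comment are assumptions, not values stated in this report.

import numpy as np

def create_mini_batches(X, y, batch_size=64):
    # Shuffle the samples so every epoch sees the data in a different order.
    indices = np.random.permutation(X.shape[0])
    X, y = X[indices], y[indices]
    # Slice the shuffled arrays into consecutive mini-batches.
    return [(X[i:i + batch_size], y[i:i + batch_size])
            for i in range(0, X.shape[0], batch_size)]

# Normalization step (assumed): scale 8-bit pixel values into [0, 1].
# X_train = X_train.astype(np.float32) / 255.0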
2.2 Model Architecture
The neural network architecture was implemented as follows:
1. Convolutional Layer:
• 3 \times 3 filters, with 16 feature maps.
• Extracts spatial features from the input images.
2. Flatten Layer:
• Converts the feature maps into a 1D vector.
3. Fully Connected Layers:
• FC1: 128 neurons, using the ReLU activation function.
• FC2: 10 neurons, using the Softmax activation function to output class
probabilities.
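As a rough sketch of the parameter shapes implied by this architecture, the snippet below assumes a stride-1, unpadded convolution and a simple random initialization, neither of which the report states explicitly; all variable names are illustrative.

import numpy as np

rng = np.random.default_rng(0)

# Convolutional layer: 16 filters of size 3x3.
conv_filters = rng.standard_normal((16, 3, 3)) * 0.1

# A 28x28 image convolved with 3x3 filters (stride 1, no padding) yields
# 16 feature maps of size 26x26, i.e. 16 * 26 * 26 = 10816 values after flattening.
flat_dim = 16 * 26 * 26

# Fully connected layers: FC1 (flat_dim -> 128, ReLU) and FC2 (128 -> 10, Softmax).
W1 = rng.standard_normal((flat_dim, 128)) * np.sqrt(2.0 / flat_dim)
b1 = np.zeros(128)
W2 = rng.standard_normal((128, 10)) * np.sqrt(2.0 / 128)
b2 = np.zeros(10)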
Activation Functions:

• ReLU: f(x) = \max(0, x)

• Softmax: Converts raw scores into probabilities:

Softmax(x_i) = \frac{\exp(x_i)}{\sum_j \exp(x_j)}
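A minimal NumPy sketch of these two activations; subtracting the row-wise maximum inside the softmax is a standard numerical-stability trick added here, not something the report describes.

import numpy as np

def relu(x):
    # Element-wise max(0, x).
    return np.maximum(0, x)

def softmax(x):
    # Shift by the row-wise maximum for numerical stability, then normalize.
    shifted = x - np.max(x, axis=-1, keepdims=True)
    exp = np.exp(shifted)
    return exp / np.sum(exp, axis=-1, keepdims=True)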

2.3 Forward Propagation


The input data is passed through each layer in the following order:
1. Convolutional Layer produces feature maps.
2. Flatten Layer converts the feature maps into a 1D array.
3. Fully Connected Layers process the data, with Softmax applied at the output to
generate class probabilities.
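A sketch of this forward pass for a single image, assuming layer objects that expose a forward method; conv, flatten, fc1, and fc2 are illustrative names, not the report's code.

import numpy as np

def forward(x, conv, flatten, fc1, fc2):
    # 1. Convolutional layer: input image -> 16 feature maps.
    out = conv.forward(x)
    # 2. Flatten layer: feature maps -> 1D vector.
    out = flatten.forward(out)
    # 3. FC1 with ReLU, then FC2 followed by Softmax for class probabilities.
    out = np.maximum(0, fc1.forward(out))
    scores = fc2.forward(out)
    exp = np.exp(scores - np.max(scores))
    return exp / np.sum(exp)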

2.4 Backward Propagation


Backward propagation involves calculating the gradients of the loss function with respect to the
weights in each layer:
1. Gradients for the output layer are calculated using the derivative of the Softmax and
Cross Entropy Loss functions.
2. Gradients are propagated backward through the fully connected and convolutional
layers.
3. The computed gradients are used to update the weights in each layer.
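A minimal sketch of step 1 and the output-layer weight gradients, assuming one-hot labels; it uses the standard simplification that the combined Softmax / Cross Entropy derivative is probs - y, and the variable names are illustrative rather than taken from the report's code.

def backward_output_layer(probs, y_onehot, a_prev, W_out):
    # Combined derivative of Softmax and Cross Entropy Loss: dL/dz = probs - y.
    batch = probs.shape[0]
    dz = (probs - y_onehot) / batch
    # Gradients for the output layer's weights and bias.
    dW = a_prev.T @ dz
    db = dz.sum(axis=0)
    # Gradient propagated back toward the earlier fully connected / conv layers.
    da_prev = dz @ W_out.T
    return dW, db, da_prev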
Loss Function:
Cross Entropy Loss was used, defined as:
L = -\frac{1}{N} \sum_{i=1}^{N} \log p_{i, y_i},

where p_{i, y_i} is the predicted probability of the correct class for sample i.
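A short NumPy sketch of this loss, assuming probs has shape (N, 10) and labels holds the integer class indices; the small epsilon guarding against log(0) is an assumption, not something stated in the report.

import numpy as np

def cross_entropy_loss(probs, labels, eps=1e-12):
    # Predicted probability of the correct class for each sample.
    n = probs.shape[0]
    correct_probs = probs[np.arange(n), labels]
    # Average negative log-likelihood over the batch.
    return -np.mean(np.log(correct_probs + eps))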
2.5 Optimization
Weights were updated using the Gradient Descent algorithm:
W = W - \alpha \cdot \nabla L,

where W represents the weights, \alpha is the learning rate, and \nabla L is the gradient of the
loss. The learning rate was set to 0.01 for this implementation.
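Applied to each parameter array in turn, the update rule reads as follows (a sketch using the learning rate of 0.01 given above):

learning_rate = 0.01  # alpha

def sgd_update(W, grad_W):
    # W = W - alpha * gradient of the loss with respect to W
    return W - learning_rate * grad_W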
3. Experimental Results
3.1 Experimental Setup
• Programming Environment: Python 3 and NumPy.
• Hardware: MacBook Air.
• Dataset: MNIST (60,000 training images, 10,000 testing images).
3.2 Results
The following tables present the loss value for each epoch and the accuracy on the training and test sets:

Epoch   Loss
1       1.134826686807056
2       0.4221529610636374
3       0.357064769553238
4       0.3322257751385145
5       0.3188174469358134
6       0.3096424919533025
7       0.30356329993117026
8       0.2986148860533998
9       0.29462226426888455
10      0.291294958006831

Training Accuracy (%)    92.00
Test Accuracy (%)        92.05

4. Conclusion
This project successfully implemented a neural network from scratch using Python and NumPy
to classify MNIST digits. The model achieved a training accuracy of 92.00% and a testing
accuracy of 92.05%. Future improvements could include the use of dropout and batch
normalization to enhance performance further.
5. References

1. NumPy Documentation: https://numpy.org/
2. MNIST Dataset: https://www.kaggle.com/datasets/hojjatk/mnist-dataset
