DL-basics-of-neural-networks-MNIST-dataset.ipynb
The objective is to classify grayscale images of handwritten digits (0-9) from the MNIST dataset, a collection of 70,000 such images. Each image is a 28x28 pixel matrix, where each pixel stores a grayscale intensity from 0 to 255. The dataset consists of 60,000 training samples and 10,000 test samples and is widely used as a benchmark for image classification.
The goal is to train a neural network to accurately predict the correct digit for each image based on patterns learned during training. This is a supervised learning problem where the input is the image and the output is the digit label.
Here is an example of how the computer interprets each image from the dataset:
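As a minimal sketch of that idea (this snippet is illustrative and assumes only the Keras MNIST loader), one training image can be printed as the raw 28x28 matrix of integer intensities the computer actually works with:
# Illustrative sketch: show one MNIST image as the grid of numbers the computer sees.
from tensorflow.keras.datasets import mnist

(images, labels), _ = mnist.load_data()

print("Label:", labels[0])        # the digit this image represents
print("Shape:", images[0].shape)  # (28, 28)
print(images[0])                  # pixel intensities from 0 (black) to 255 (white)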
2. Setup
# Libraries
import tensorflow as tf
from tensorflow.keras.models import Sequential
from tensorflow.keras.layers import Dense, Flatten
from tensorflow.keras.datasets import mnist
import matplotlib.pyplot as plt
import numpy as np
TensorFlow is a popular deep learning framework used to build and train neural networks and
will be used here due to its flexibility, scalability, and ease of integration. For beginners,
TensorFlow offers high-level APIs like Keras, which simplify the process of creating and training
models. These features make TensorFlow an excellent choice for learning and experimentation.
Before building the model, let's load the dataset and understand its structure visually. This will help us comprehend the nature of the data we are working with and check whether the dataset is balanced (i.e., roughly equal representation of all digits).
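This step can be sketched as follows; the sketch assumes the standard Keras loader, and a bar chart of label counts is one simple way to check balance (the variable names X_train, y_train, X_test, y_test match the cells that follow):
# Load MNIST and check the class balance of the training labels.
(X_train, y_train), (X_test, y_test) = mnist.load_data()

digits, counts = np.unique(y_train, return_counts=True)
plt.bar(digits, counts, color='steelblue')
plt.xticks(digits)
plt.title('Number of Training Samples per Digit')
plt.xlabel('Digit')
plt.ylabel('Count')
plt.show()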
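Before plotting the intensity histograms, the pixel values are scaled to the [0, 1] range. A minimal sketch of that step, keeping an unscaled copy in temp_X_train so it can be compared with the normalized X_train used below:
# Normalization: keep an unscaled copy, then scale pixel values from [0, 255] to [0, 1].
temp_X_train = X_train.copy()
X_train = X_train / 255.0
X_test = X_test / 255.0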
# Before normalization
plt.subplot(1, 2, 1)
plt.hist(temp_X_train.flatten(), bins=50, color='green', alpha=0.7)
plt.title('Pixel Intensity Before Normalization')
plt.xlabel('Pixel Intensity')
plt.ylabel('Frequency')
# After normalization
plt.subplot(1, 2, 2)
plt.hist(X_train.flatten(), bins=50, color='blue', alpha=0.7)
plt.title('Pixel Intensity After Normalization')
plt.xlabel('Pixel Intensity (Normalized)')
plt.ylabel('Frequency')
plt.tight_layout()
plt.show()
Normalization ensures that the input data has a uniform range, which helps the model converge
faster during training. Neural networks perform better when the data is scaled, as it prevents
larger input values from dominating the learning process.
4. Modelling
The model is built using TensorFlow's Sequential API, which allows us to stack layers
sequentially. Each layer processes data in a specific way:
1. Flatten Layer: Converts the 2D input images (28x28 pixels) into a 1D array of 784 features.
This is necessary to feed the data into the Dense layers.
2. Dense Layer (Hidden): A fully connected layer with ReLU activation. ReLU introduces non-linearity, allowing the model to learn complex patterns.
3. Dense Layer (Output): The final layer has 10 neurons, each representing a digit (0-9). It
uses the softmax activation function to produce probabilities for each class, enabling
classification.
model = Sequential([
    Flatten(input_shape=(28, 28)),   # 2D image (28x28) -> 1D array of 784 features
    Dense(128, activation='relu'),   # hidden layer; 128 neurons is an assumed size
    Dense(10, activation='softmax')  # Output layer with 10 neurons (one for each digit)
])
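The compile-and-train step can be summarized as a hedged sketch; 'adam' and the epoch count echo the suggestions in the experimentation section, while the loss, batch size, and validation split here are illustrative assumptions rather than the notebook's exact settings:
# Compile and train the model (settings are illustrative assumptions).
model.compile(optimizer='adam',
              loss='sparse_categorical_crossentropy',  # integer labels 0-9
              metrics=['accuracy'])

history = model.fit(X_train, y_train,
                    epochs=5,
                    batch_size=32,
                    validation_split=0.1)

# Evaluate on the held-out test set.
test_loss, test_acc = model.evaluate(X_test, y_test, verbose=0)
print(f"Test accuracy: {test_acc:.4f}")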
# Make predictions
predictions = model.predict(X_test)
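Each row of predictions contains ten softmax probabilities, one per digit; taking the argmax gives the predicted class, which can then be compared against the true labels:
# The predicted digit is the class with the highest probability.
predicted_digits = np.argmax(predictions, axis=1)

print("Predicted:", predicted_digits[:10])
print("Actual:   ", y_test[:10])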
5. Experimentation
Run experiments with the model architecture and training process. Here are some suggestions:
Add more hidden layers: Try increasing the depth of the network by stacking additional Dense layers.
Change the number of neurons: Modify the number of neurons in each Dense layer to see how it affects performance.
Use different activation functions: Experiment with alternatives like 'sigmoid', 'tanh', or 'LeakyReLU'.
Try other optimization algorithms: Replace 'adam' with optimizers like 'sgd', 'rmsprop', or 'nadam'.
Alter the number of epochs: Train the model for more or fewer epochs and observe overfitting or underfitting.
Use dropout: Introduce dropout layers to prevent overfitting and enhance generalization.
Modify the learning rate: Adjust the optimizer's learning rate to see how it influences convergence.
Document your changes and note whether and how each modification impacts the training, validation, and test accuracy.
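As one illustration, here is a hedged variant combining a few of these suggestions (an extra hidden layer, dropout, and an explicit learning rate); the specific values are assumptions to experiment with, not recommendations:
from tensorflow.keras.layers import Dropout
from tensorflow.keras.optimizers import Adam

# Illustrative variant: deeper network with dropout and a custom learning rate.
experimental_model = Sequential([
    Flatten(input_shape=(28, 28)),
    Dense(256, activation='relu'),
    Dropout(0.3),                    # randomly drops 30% of activations during training
    Dense(128, activation='relu'),
    Dense(10, activation='softmax')
])

experimental_model.compile(optimizer=Adam(learning_rate=0.0005),
                           loss='sparse_categorical_crossentropy',
                           metrics=['accuracy'])

experimental_model.fit(X_train, y_train, epochs=10,
                       batch_size=32, validation_split=0.1)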
6. Conclusion