0% found this document useful (0 votes)

124 views

Image Classification With The MNIST Dataset: Objectives

The document provides an introduction to classifying handwritten digits with the MNIST dataset using deep learning. It discusses how deep learning can solve image classification problems where traditional programming cannot. It then describes loading the MNIST dataset into memory using Keras, which contains 60,000 training images and 10,000 validation images of handwritten digits from 0-9 sized at 28x28 pixels each. The document explores the characteristics of the image data.

Uploaded by

Praveen Singh

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

124 views

Image Classification With The MNIST Dataset: Objectives

Uploaded by

Praveen Singh

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 21

01_mnist about:srcdoc

Image Classification with the MNIST Dataset

In this section we will do the "Hello World" of deep learning: training a deep learning model to correctly
classify hand-written digits.

Objectives

Understand how deep learning can solve problems traditional programming methods cannot
Learn about the MNSIT handwritten digits dataset (http://yann.lecun.com/exdb/mnist/)
Use the Keras API (https://keras.io/) to load the MNIST dataset and prepare it for training
Create a simple neural network to perform image classification
Train the neural network using the prepped MNIST dataset
Observe the performance of the trained neural network

The Problem: Image Classification

In traditional programming, the programmer is able to articulate rules and conditions in their code that their
program can then use to act in the correct way. This approach continues to work exceptionally well for a huge
variety of problems.

Image classification, which asks a program to correctly classify an image it has never seen before into its
correct class, is near impossible to solve with traditional programming techniques. How could a programmer
possibly define the rules and conditions to correctly classify a huge variety of images, especially taking into
account images that they have never seen?

The Solution: Deep Learning

Deep learning excels at pattern recognition by trial and error. By training a deep neural network with sufficient
data, and providing the network with feedback on its performance via training, the network can identify,
though a huge amount of iteration, its own set of conditions by which it can act in the correct way.

1 of 21 12/07/2021, 04:57 pm
01_mnist about:srcdoc

The MNIST Dataset

In the history of deep learning, the accurate image classification of the MNSIT dataset (http://yann.lecun.com
/exdb/mnist/), a collection of 70,000 grayscale images of handwritten digits from 0 to 9, was a major
development. While today the problem is considered trivial, doing image classification with MNIST has
become a kind of "Hello World" for deep learning.

Here are 40 of the images included in the MNIST dataset:

Training and Validation Data and Labels

When working with images for deep learning, we need both the images themselves, usually denoted as X ,
and also, correct labels (https://developers.google.com/machine-learning/glossary#label) for these images,
usually denoted as Y . Furthermore, we need X and Y values both for training the model, and then, a
separate set of X and Y values for validating the performance of the model after it has been trained.
Therefore, we need 4 segments of data for the MNIST dataset:

1. x_train : Images used for training the neural network

2. y_train : Correct labels for the x_train images, used to evaluate the model's predictions during
training
3. x_valid : Images set aside for validating the performance of the model after it has been trained
4. y_valid : Correct labels for the x_valid images, used to evaluate the model's predictions after it has
been trained

The process of preparing data for analysis is called Data Engineering (https://medium.com/@rchang/a-
beginners-guide-to-data-engineering-part-i-4227c5c457d7). To learn more about the differences between
training data and validation data (as well as test data), check out this article
(https://machinelearningmastery.com/difference-test-validation-datasets/) by Jason Brownlee.

Loading the Data Into Memory (with Keras)

2 of 21 12/07/2021, 04:57 pm
01_mnist about:srcdoc

There are many deep learning frameworks (https://developer.nvidia.com/deep-learning-frameworks), each

with their own merits. In this workshop we will be working with Tensorflow 2 (https://www.tensorflow.org
/tutorials/quickstart/beginner), and specifically with the Keras API (https://keras.io/). Keras has many useful
built in functions designed for the computer vision tasks. It is also a legitimate choice for deep learning in a
professional setting due to its readability (https://blog.pragmaticengineer.com/readable-code/) and efficiency,
though it is not alone in this regard, and it is worth investigating a variety of frameworks when beginning a
deep learning project.

One of the many helpful features that Keras provides are modules containing many helper methods for many
common datasets (https://www.tensorflow.org/api_docs/python/tf/keras/datasets), including MNIST.

We will begin by loading the Keras dataset module for MNIST:

In [1]: from tensorflow.keras.datasets import mnist

With the mnist module, we can easily load the MNIST data, already partitioned into images and labels for
both training and validation:

In [2]: # the data, split between train and validation sets

(x_train, y_train), (x_valid, y_valid) = mnist.load_data()

Downloading data from https://storage.googleapis.com/tensorflow/tf-ke

ras-datasets/mnist.npz
11493376/11490434 [==============================] - 0s 0us/step

Exploring the MNIST Data

We stated above that the MNIST dataset contained 70,000 grayscale images of handwritten digits. By
executing the following cells, we can see that Keras has partitioned 60,000 of these images for training, and
10,000 for validation (after training), and also, that each image itself is a 2D array with the dimensions 28x28:

In [3]: x_train.shape

Out[3]: (60000, 28, 28)

In [4]: x_valid.shape

Out[4]: (10000, 28, 28)

Furthermore, we can see that these 28x28 images are represented as a collection of unsigned 8-bit integer
values between 0 and 255, the values corresponding with a pixel's grayscale value where 0 is black, 255
is white, and all other values are in between:

3 of 21 12/07/2021, 04:57 pm
01_mnist about:srcdoc

In [5]: x_train.dtype

Out[5]: dtype('uint8')

In [6]: x_train.min()

Out[6]: 0

In [7]: x_train.max()

Out[7]: 255

4 of 21 12/07/2021, 04:57 pm
01_mnist about:srcdoc

In [8]: x_train[0]

5 of 21 12/07/2021, 04:57 pm
01_mnist about:srcdoc

Out[8]: array([[ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0],
[ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0],
[ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0],
[ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0],
[ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0],
[ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
3,
18, 18, 18, 126, 136, 175, 26, 166, 255, 247, 127, 0,
0,
0, 0],
[ 0, 0, 0, 0, 0, 0, 0, 0, 30, 36, 94, 154, 1
70,
253, 253, 253, 253, 253, 225, 172, 253, 242, 195, 64, 0,
0,
0, 0],
[ 0, 0, 0, 0, 0, 0, 0, 49, 238, 253, 253, 253, 2
53,
253, 253, 253, 253, 251, 93, 82, 82, 56, 39, 0, 0,
0,
0, 0],
[ 0, 0, 0, 0, 0, 0, 0, 18, 219, 253, 253, 253, 2
53,
253, 198, 182, 247, 241, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0],
[ 0, 0, 0, 0, 0, 0, 0, 0, 80, 156, 107, 253, 2
53,
205, 11, 0, 43, 154, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0],
[ 0, 0, 0, 0, 0, 0, 0, 0, 0, 14, 1, 154, 2
53,
90, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0],
[ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 139, 2

6 of 21 12/07/2021, 04:57 pm
01_mnist about:srcdoc

53,
190, 2, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0],
[ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 11, 1
90,
253, 70, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0],
[ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
35,
241, 225, 160, 108, 1, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0],
[ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
81, 240, 253, 253, 119, 25, 0, 0, 0, 0, 0, 0,
0,
0, 0],
[ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 45, 186, 253, 253, 150, 27, 0, 0, 0, 0, 0,
0,
0, 0],
[ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 16, 93, 252, 253, 187, 0, 0, 0, 0, 0,
0,
0, 0],
[ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 249, 253, 249, 64, 0, 0, 0, 0,
0,
0, 0],
[ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 46, 130, 183, 253, 253, 207, 2, 0, 0, 0, 0,
0,
0, 0],
[ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
39,
148, 229, 253, 253, 253, 250, 182, 0, 0, 0, 0, 0,
0,
0, 0],
[ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 24, 114, 2
21,
253, 253, 253, 253, 201, 78, 0, 0, 0, 0, 0, 0,
0,
0, 0],
[ 0, 0, 0, 0, 0, 0, 0, 0, 23, 66, 213, 253, 2
53,
253, 253, 198, 81, 2, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0],
[ 0, 0, 0, 0, 0, 0, 18, 171, 219, 253, 253, 253, 2
53,

7 of 21 12/07/2021, 04:57 pm
01_mnist about:srcdoc

195, 80, 9, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0],
[ 0, 0, 0, 0, 55, 172, 226, 253, 253, 253, 253, 244, 1
33,
11, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0],
[ 0, 0, 0, 0, 136, 253, 253, 253, 212, 135, 132, 16,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0],
[ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0],
[ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0],
[ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0]], dtype=uint8)

Using Matplotlib (https://matplotlib.org/), we can render one of these grayscale images in our dataset:

In [9]: import matplotlib.pyplot as plt

image = x_train[0]
plt.imshow(image, cmap='gray')

Out[9]: <matplotlib.image.AxesImage at 0x7f3d61f19828>

8 of 21 12/07/2021, 04:57 pm
01_mnist about:srcdoc

In this way we can now see that this is a 28x28 pixel image of a 5. Or is it a 3? The answer is in the
y_train data, which contains correct labels for the data. Let's take a look:

In [10]: y_train[0]

Out[10]: 5

Preparing the Data for Training

In deep learning, it is common that data needs to be transformed to be in the ideal state for training. For this
particular image classification problem, there are 3 tasks we should perform with the data in preparation for
training:

1. Flatten the image data, to simplify the image input into the model
2. Normalize the image data, to make the image input values easier to work with for the model
3. Categorize the labels, to make the label values easier to work with for the model

Flattening the Image Data

Though it's possible for a deep learning model to accept a 2-dimensional image (in our case 28x28 pixels),
we're going to simplify things to start and reshape (https://www.tensorflow.org/api_docs/python/tf/reshape)
each image into a single array of 784 continuous pixels (note: 28x28 = 784). This is also called flattening the
image.

Here we accomplish this using the helper method reshape :

In [11]: x_train = x_train.reshape(60000, 784)

x_valid = x_valid.reshape(10000, 784)

We can confirm that the image data has been reshaped and is now a collection of 1D arrays containing 784
pixel values each:

In [12]: x_train.shape

Out[12]: (60000, 784)

9 of 21 12/07/2021, 04:57 pm
01_mnist about:srcdoc

In [13]: x_train[0]

10 of 21 12/07/2021, 04:57 pm
01_mnist about:srcdoc

Out[13]: array([ 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 3, 18, 18, 1
8,
126, 136, 175, 26, 166, 255, 247, 127, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 30, 36, 94, 154, 170, 25
3,
253, 253, 253, 253, 225, 172, 253, 242, 195, 64, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 49, 238, 253, 253, 25
3,
253, 253, 253, 253, 253, 251, 93, 82, 82, 56, 39, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 18, 219, 25
3,
253, 253, 253, 253, 198, 182, 247, 241, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
80, 156, 107, 253, 253, 205, 11, 0, 43, 154, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 14, 1, 154, 253, 90, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 139, 253, 190, 2, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 11, 190, 253, 7
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,

11 of 21 12/07/2021, 04:57 pm
01_mnist about:srcdoc

0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 3
5,
241, 225, 160, 108, 1, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 81, 240, 253, 253, 119, 25, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 45, 186, 253, 253, 150, 27, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 16, 93, 252, 253, 18
7,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 24
9,
253, 249, 64, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 46, 13
0,
183, 253, 253, 207, 2, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 39, 14
8,
229, 253, 253, 253, 250, 182, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 24, 11
4,
221, 253, 253, 253, 253, 201, 78, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 23, 6
6,
213, 253, 253, 253, 253, 198, 81, 2, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 18, 17
1,
219, 253, 253, 253, 253, 195, 80, 9, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 55, 17
2,
226, 253, 253, 253, 253, 244, 133, 11, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
136, 253, 253, 253, 212, 135, 132, 16, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,

12 of 21 12/07/2021, 04:57 pm
01_mnist about:srcdoc

0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0,
0,
0, 0, 0, 0], dtype=uint8)

Normalizing the Image Data

Deep learning models are better at dealing with floating point numbers between 0 and 1 (more on this topic
later). Converting integer values to floating point values between 0 and 1 is called normalization
(https://developers.google.com/machine-learning/glossary#normalization), and a simple approach we will take
here to normalize the data will be to divide all the pixel values (which if you recall are between 0 and 255) by
255:

In [14]: x_train = x_train / 255

x_valid = x_valid / 255

We can now see that the values are all floating point values between 0.0 and 1.0 :

In [15]: x_train.dtype

Out[15]: dtype('float64')

In [16]: x_train.min()

Out[16]: 0.0

In [17]: x_train.max()

Out[17]: 1.0

Categorical Encoding

13 of 21 12/07/2021, 04:57 pm
01_mnist about:srcdoc

Consider for a moment, if we were to ask, what is 7 - 2? Stating that the answer was 4 is closer than stating
that the answer was 9. However, for this image classification problem, we don't want the neural network to
learn this kind of reasoning: we just want it to select the correct category, and understand that if we have an
image of the number 5, that guessing 4 is just as bad as guessing 9.

As it stands, the labels for the images are integers between 0 and 9. Because these values represent a
numerical range, the model might try to draw some conclusions about its performance based on how close to
the correct numerical category it guesses.

Therefore, we will do something to our data called categorical encoding. This kind of transformation modifies
the data so that each value is a collection of all possible categories, with the actual category that this
particular value is set as true.

As a simple example, consider if we had 3 categories: red, blue, and green. For a given color, 2 of these
categories would be false, and the other would be true:

Actual Color Is Red? Is Blue? Is Green?

Red True False False

Green False False True

Blue False True False

Green False False True

Rather than use "True" or "False", we could represent the same using binary, either 0 or 1:

Actual Color Is Red? Is Blue? Is Green?

Red 1 0 0

Green 0 0 1

Blue 0 1 0

Green 0 0 1

This is what categorical encoding is, transforming values which are intended to be understood as categorical
labels into a representation that makes their categorical nature explicit to the model. Thus, if we were using
these values for training, we would convert...

values = ['red, green, blue, green']

... which a neural network would have a very difficult time making sense of, instead to:

14 of 21 12/07/2021, 04:57 pm
01_mnist about:srcdoc

values = [
[1, 0, 0],
[0, 0, 1],
[0, 1, 0],
[0, 0, 1]
]

Categorically Encoding the Labels

Keras provides a utility to categorically encode values (https://www.tensorflow.org/api_docs/python/tf/keras

/utils/to_categorical), and here we use it to perform categorical encoding for both the training and validation
labels:

In [18]: import tensorflow.keras as keras

num_categories = 10

y_train = keras.utils.to_categorical(y_train, num_categories)

y_valid = keras.utils.to_categorical(y_valid, num_categories)

Here are the first 10 values of the training labels, which you can see have now been categorically encoded:

In [19]: y_train[0:9]

Out[19]: array([[0., 0., 0., 0., 0., 1., 0., 0., 0., 0.],
[1., 0., 0., 0., 0., 0., 0., 0., 0., 0.],
[0., 0., 0., 0., 1., 0., 0., 0., 0., 0.],
[0., 1., 0., 0., 0., 0., 0., 0., 0., 0.],
[0., 0., 0., 0., 0., 0., 0., 0., 0., 1.],
[0., 0., 1., 0., 0., 0., 0., 0., 0., 0.],
[0., 1., 0., 0., 0., 0., 0., 0., 0., 0.],
[0., 0., 0., 1., 0., 0., 0., 0., 0., 0.],
[0., 1., 0., 0., 0., 0., 0., 0., 0., 0.]], dtype=float32)

Creating the Model

15 of 21 12/07/2021, 04:57 pm
01_mnist about:srcdoc

With the data prepared for training, it is now time to create the model that we will train with the data. This first
basic model will be made up of several layers and will be comprised of 3 main parts:

1. An input layer, which will receive data in some expected format

2. Several hidden layers (https://developers.google.com/machine-learning/glossary#hidden-layer), each
comprised of many neurons. Each neuron (https://developers.google.com/machine-learning
/glossary#neuron) will have the ability to affect the network's guess with its weights, which are values that
will be updated over many iterations as the network gets feedback on its performance and learns
3. An output layer, which will depict the network's guess for a given image

Instantiating the Model

To begin, we will use Keras's Sequential (https://www.tensorflow.org/api_docs/python/tf/keras/Sequential)

model class to instantiate an instance of a model that will have a series of layers that data will pass through in
sequence:

In [20]: from tensorflow.keras.models import Sequential

model = Sequential()

Creating the Input Layer

Next, we will add the input layer. This layer will be densely connected, meaning that each neuron in it, and its
weights, will affect every neuron in the next layer. To do this with Keras, we use Keras's Dense
(https://www.tensorflow.org/api_docs/python/tf/keras/layers/Dense) layer class.

In [21]: from tensorflow.keras.layers import Dense

The units argument specifies the number of neurons in the layer. We are going to use 512 which we
have chosen from experimentation. Choosing the correct number of neurons is what puts the "science" in
"data science" as it is a matter of capturing the statistical complexity of the dataset. Try playing around with
this value later to see how it affects training and to start developing a sense for what this number means.

We will learn more about activation functions later, but for now, we will use the relu activation function,
which in short, will help our network to learn how to make more sophisticated guesses about data than if it
were required to make guesses based on some strictly linear function.

The input_shape value specifies the shape of the incoming data which in our situation is a 1D array of
784 values:

In [22]: model.add(Dense(units=512, activation='relu', input_shape=(784,)))

16 of 21 12/07/2021, 04:57 pm
01_mnist about:srcdoc

Creating the Hidden Layer

Now we will add an additional densely connected layer. Again, much more will be said about these later, but
for now know that these layers give the network more parameters to contribute towards its guesses, and
therefore, more subtle opportunities for accurate learning:

In [23]: model.add(Dense(units = 512, activation='relu'))

Creating the Output Layer

Finally, we will add an output layer. This layer uses the activation function softmax which will result in each
of the layer's values being a probability between 0 and 1 and will result in all the outputs of the layer adding to
1. In this case, since the network is to make a guess about a single image belonging to 1 of 10 possible
categories, there will be 10 outputs. Each output gives the model's guess (a probability) that the image
belongs to that specific class:

In [24]: model.add(Dense(units = 10, activation='softmax'))

Summarizing the Model

Keras provides the model instance method summary (https://www.tensorflow.org/api_docs/python

/tf/summary) which will print a readable summary of a model:

In [25]: model.summary()

Model: "sequential"
_________________________________________________________________
Layer (type) Output Shape Param #
=================================================================
dense (Dense) (None, 512) 401920
_________________________________________________________________
dense_1 (Dense) (None, 512) 262656
_________________________________________________________________
dense_2 (Dense) (None, 10) 5130
=================================================================
Total params: 669,706
Trainable params: 669,706
Non-trainable params: 0
_________________________________________________________________

17 of 21 12/07/2021, 04:57 pm
01_mnist about:srcdoc

Note the number of trainable parameters. Each of these can be adjusted during training and will contribute
towards the trained model's guesses.

Compiling the Model

Again, more details are to follow, but the final step we need to do before we can actually train our model with
data is to compile (https://www.tensorflow.org/api_docs/python/tf/keras/Sequential#compile) it. Here we
specify a loss function (https://developers.google.com/machine-learning/glossary#loss) which will be used for
the model to understand how well it is performing during training. We also specify that we would like to track
accuracy while the model trains:

In [26]: model.compile(loss='categorical_crossentropy', metrics=['accuracy'])

Training the Model

Now that we have prepared training and validation data, and a model, it's time to train our model with our
training data, and verify it with its validation data.

"Training a model with data" is often also called "fitting a model to data." Put this latter way, it highlights that
the shape of the model changes over time to more accurately understand the data that it is being given.

When fitting (training) a model with Keras, we use the model's fit (https://www.tensorflow.org/api_docs/python
/tf/keras/Model#fit) method. It expects the following arguments:

The training data

The labels for the training data
The number of times it should train on the entire training dataset (called an epoch)
The validation or test data, and its labels

Run the cell below to train the model. We will discuss its output after the training completes:

18 of 21 12/07/2021, 04:57 pm
01_mnist about:srcdoc

In [27]: history = model.fit( x_train, y_train, epochs=5, verbose=1, validation_

data=(x_valid, y_valid))

Epoch 1/5
1875/1875 [==============================] - 4s 2ms/step - loss: 0.19
11 - accuracy: 0.9434 - val_loss: 0.1234 - val_accuracy: 0.9671
Epoch 2/5
1875/1875 [==============================] - 4s 2ms/step - loss: 0.09
85 - accuracy: 0.9752 - val_loss: 0.1083 - val_accuracy: 0.9738
Epoch 3/5
1875/1875 [==============================] - 4s 2ms/step - loss: 0.08
05 - accuracy: 0.9801 - val_loss: 0.1312 - val_accuracy: 0.9741
Epoch 4/5
1875/1875 [==============================] - 4s 2ms/step - loss: 0.07
13 - accuracy: 0.9841 - val_loss: 0.1331 - val_accuracy: 0.9747
Epoch 5/5
1875/1875 [==============================] - 4s 2ms/step - loss: 0.06
35 - accuracy: 0.9868 - val_loss: 0.1324 - val_accuracy: 0.9773

Observing Accuracy

For each of the 5 epochs, notice the accuracy and val_accuracy scores. accuracy states how well
the model did for the epoch on all the training data. val_accuracy states how well the model did on the
validation data, which if you recall, was not used at all for training the model.

The model did quite well! The accuracy quickly reached close to 100%, as did the validation accuracy. We
now have a model that can be used to accurately detect and classify hand-written images.

The next step would be to use this model to classify new not-yet-seen handwritten images. This is called
inference (https://blogs.nvidia.com/blog/2016/08/22/difference-deep-learning-training-inference-ai/). We'll
explore the process of inference in a later exercise.

Summary

It's worth taking a moment to appreciate what we've done here. Historically, the expert systems that were built
to do this kind of task were extremely complicated, and people spent their careers building them (check out
the references on the official MNIST page (http://yann.lecun.com/exdb/mnist/) and the years milestones were
reached).

MNIST is not only useful for its historical influence on Computer Vision, but it's also a great benchmark
(http://www.cs.toronto.edu/~serailhydra/publications/tbd-iiswc18.pdf) and debugging tool. Having trouble
getting a fancy new machine learning architecture working? Check it against MNIST. If it can't learn on this
dataset, chances are it won't learn on more complicated images and datasets.

19 of 21 12/07/2021, 04:57 pm
01_mnist about:srcdoc

Clear the Memory

Before moving on, please execute the following cell to clear up the GPU memory. This is required to move on
to the next notebook.

In [28]: import IPython

app = IPython.Application.instance()
app.kernel.do_shutdown(True)

Out[28]: {'status': 'ok', 'restart': True}

In this section you learned how to build and train a simple neural network for image classification. In the next
section, you will be asked to build your own neural network and perform data preparation to solve a different
image classification problem.

☆ Bonus Exercise ☆
Have time to spare? In the next section, we will talk about how we arrived at some of the numbers above, but
we can try imagining what it was like to be a researcher developing the techniques commonly used today.

Ultimately, each neuron is trying to fit a line to some data. Below, we have some datapoints and a randomly
drawn line using the equation y = mx + b (https://www.mathsisfun.com/equation_of_line.html).

Try changing the m and the b in order to find the lowest possible loss. How did you find the best line? Can
you make a program to follow your strategy?

20 of 21 12/07/2021, 04:57 pm
01_mnist about:srcdoc

In [1]: import numpy as np

from numpy.polynomial.polynomial import polyfit
import matplotlib.pyplot as plt

m = -2 # -2 to start, change me please

b = 40 # 40 to start, change me please

# Sample data
x = np.array([ 0, 1, 2, 3, 4, 5, 6, 7, 8, 9])
y = np.array([10, 20, 25, 30, 40, 45, 40, 50, 60, 55])
y_hat = x * m + b

plt.plot(x, y, '.')
plt.plot(x, y_hat, '-')
plt.show()

print("Loss:", np.sum((y - y_hat)**2)/len(x))

Loss: 475.5

Have an idea? Excellent! Please shut down the kernel before moving on.

In [2]: import IPython

app = IPython.Application.instance()
app.kernel.do_shutdown(True)

Out[2]: {'status': 'ok', 'restart': True}

In [ ]:

21 of 21 12/07/2021, 04:57 pm

Image Classification Using CNN (Convolutional Neural Networks)
No ratings yet
Image Classification Using CNN (Convolutional Neural Networks)
16 pages
Thanksgiving Day
100% (4)
Thanksgiving Day
6 pages
Air Force Heraldry
No ratings yet
Air Force Heraldry
17 pages
MNIST Dataset
No ratings yet
MNIST Dataset
12 pages
24mcs1025-ex2-part-a
No ratings yet
24mcs1025-ex2-part-a
6 pages
TF Mannual
No ratings yet
TF Mannual
19 pages
Data Augmentation: Objectives
No ratings yet
Data Augmentation: Objectives
10 pages
Your First Neural Network
No ratings yet
Your First Neural Network
15 pages
A First Look On Nueral Network
No ratings yet
A First Look On Nueral Network
8 pages
8 - Logistic - Regression - Multiclass - Ipynb - Colaboratory
No ratings yet
8 - Logistic - Regression - Multiclass - Ipynb - Colaboratory
6 pages
Convolutional Neural Networks: Objectives
No ratings yet
Convolutional Neural Networks: Objectives
10 pages
logistic-regressions
No ratings yet
logistic-regressions
5 pages
Deep Learning With PyTorch
No ratings yet
Deep Learning With PyTorch
19 pages
Image Classification of An American Sign Language Dataset: Objectives
No ratings yet
Image Classification of An American Sign Language Dataset: Objectives
11 pages
Scikit Learn What Were Covering
No ratings yet
Scikit Learn What Were Covering
15 pages
1 s2.0 S0925231221009486 Main
No ratings yet
1 s2.0 S0925231221009486 Main
7 pages
Homework: Prediction Methods and Machine Learning
No ratings yet
Homework: Prediction Methods and Machine Learning
3 pages
DL Mini Project Siddhesh
No ratings yet
DL Mini Project Siddhesh
9 pages
Large-Scale Image Classification
No ratings yet
Large-Scale Image Classification
8 pages
B4
No ratings yet
B4
8 pages
Ihic-2022 PPT Paper - Id 101
No ratings yet
Ihic-2022 PPT Paper - Id 101
9 pages
How to Develop a CNN for MNIST Handwritten Digit Classification
No ratings yet
How to Develop a CNN for MNIST Handwritten Digit Classification
43 pages
Logistic Multiclass Classification
No ratings yet
Logistic Multiclass Classification
2 pages
CV Lab 9
No ratings yet
CV Lab 9
4 pages
AutoEncoders with TensorFlow_ Medium
No ratings yet
AutoEncoders with TensorFlow_ Medium
12 pages
Logistic_Regression_with_a_Neural_Network_mindset_v6a
No ratings yet
Logistic_Regression_with_a_Neural_Network_mindset_v6a
25 pages
Lab 1 - Machine Learning with Python - ML Engineering مهم
No ratings yet
Lab 1 - Machine Learning with Python - ML Engineering مهم
10 pages
Pre-Trained Models: Objectives
No ratings yet
Pre-Trained Models: Objectives
12 pages
Proyecto IA2
No ratings yet
Proyecto IA2
14 pages
02-DL-Deep Learning For Image Data (Convnets) 03
No ratings yet
02-DL-Deep Learning For Image Data (Convnets) 03
10 pages
DLT Record Final
No ratings yet
DLT Record Final
120 pages
Image Classification Using Convolutional Neural Network With Python
No ratings yet
Image Classification Using Convolutional Neural Network With Python
8 pages
Recommendation Systems With Neural Networks
No ratings yet
Recommendation Systems With Neural Networks
36 pages
Ronakk Exp1
No ratings yet
Ronakk Exp1
6 pages
Maneesha Nidigonda Major Project
No ratings yet
Maneesha Nidigonda Major Project
11 pages
AI lsn5 pdf
No ratings yet
AI lsn5 pdf
18 pages
Why Convolutions?: Till Now in MLP
No ratings yet
Why Convolutions?: Till Now in MLP
38 pages
Building Powerful Image Classification Models Using Very Little Data
No ratings yet
Building Powerful Image Classification Models Using Very Little Data
12 pages
Fast Multiresolution Image Querying: Charles E. Jacobs Adam Finkelstein David H. Salesin
No ratings yet
Fast Multiresolution Image Querying: Charles E. Jacobs Adam Finkelstein David H. Salesin
10 pages
COCo
No ratings yet
COCo
7 pages
Module3 forSesitivityAnalysisTopic
No ratings yet
Module3 forSesitivityAnalysisTopic
8 pages
X CH 2 AI ProjectCycle Notes Revised
No ratings yet
X CH 2 AI ProjectCycle Notes Revised
9 pages
CNN with TensorFlow and Keras
No ratings yet
CNN with TensorFlow and Keras
11 pages
Experiment 4 DNN.ipynb - Colaboratory
No ratings yet
Experiment 4 DNN.ipynb - Colaboratory
32 pages
ML0101EN Clas K Nearest Neighbors CustCat Py v1
100% (1)
ML0101EN Clas K Nearest Neighbors CustCat Py v1
11 pages
Classifying Authentic and AI-Generated Images with a Fine- Tuned ResNet50 Model.
No ratings yet
Classifying Authentic and AI-Generated Images with a Fine- Tuned ResNet50 Model.
7 pages
Implementing Artificial Neural Network in Python From Scratch
No ratings yet
Implementing Artificial Neural Network in Python From Scratch
16 pages
Deploying Your Model: Objectives
No ratings yet
Deploying Your Model: Objectives
9 pages
Maneesha Nidigonda Verzeo Major Project
No ratings yet
Maneesha Nidigonda Verzeo Major Project
11 pages
تمثيل النص كموترات - تدريب _ مايكروسوفت ليرن
No ratings yet
تمثيل النص كموترات - تدريب _ مايكروسوفت ليرن
14 pages
UMDFaces: An Annotated Face Dataset For Training Deep Networks
No ratings yet
UMDFaces: An Annotated Face Dataset For Training Deep Networks
10 pages
Optimal Multi-Scale Patterns in Time Series Streams: Spiros Papadimitriou Philip S. Yu
No ratings yet
Optimal Multi-Scale Patterns in Time Series Streams: Spiros Papadimitriou Philip S. Yu
12 pages
Lesson 2.2
No ratings yet
Lesson 2.2
15 pages
NoSQL Module -5
No ratings yet
NoSQL Module -5
28 pages
FODS
No ratings yet
FODS
6 pages
AnDevGuide MachineLearning
No ratings yet
AnDevGuide MachineLearning
35 pages
Week6_Bai
No ratings yet
Week6_Bai
14 pages
Unit5 - Linear Regression
No ratings yet
Unit5 - Linear Regression
4 pages
DEEPLEARNINGTUTORIAL.ipynb-Colaboratory
No ratings yet
DEEPLEARNINGTUTORIAL.ipynb-Colaboratory
8 pages
Breast_Cancer_1725709979
No ratings yet
Breast_Cancer_1725709979
30 pages
DeepSR A Deep Learning Tool For Image Super Resolution. DOI-10.1016j.softx.2022.101261
No ratings yet
DeepSR A Deep Learning Tool For Image Super Resolution. DOI-10.1016j.softx.2022.101261
16 pages
Digital Image Processing: Fundamentals and Applications
From Everand
Digital Image Processing: Fundamentals and Applications
Fouad Sabry
No ratings yet
Assessment: The Dataset
No ratings yet
Assessment: The Dataset
5 pages
Sequence Data: Objectives
No ratings yet
Sequence Data: Objectives
15 pages
Transfer Learning: Objectives
No ratings yet
Transfer Learning: Objectives
16 pages
Symptomatically Brain Tumor Detection Using Convolutional Neural Networks
No ratings yet
Symptomatically Brain Tumor Detection Using Convolutional Neural Networks
10 pages
Brain Tumour Segmentation Using Convolutional Neural Network With Tensor Flow
No ratings yet
Brain Tumour Segmentation Using Convolutional Neural Network With Tensor Flow
7 pages
Fundamentals of Deep Learning: Part 5: Pre-Trained Models
No ratings yet
Fundamentals of Deep Learning: Part 5: Pre-Trained Models
18 pages
Jupyterlab: Clearing Gpu Memory
No ratings yet
Jupyterlab: Clearing Gpu Memory
2 pages
Fundamentals of Deep Learning: Part 2: How A Neural Network Trains
No ratings yet
Fundamentals of Deep Learning: Part 2: How A Neural Network Trains
54 pages
Fundamentals of Deep Learning: Part 6: Advanced Architectures
No ratings yet
Fundamentals of Deep Learning: Part 6: Advanced Architectures
35 pages
An Overview of Microprocessor
No ratings yet
An Overview of Microprocessor
16 pages
Unit - 3 8255: (Programmable Peripheral Interface)
No ratings yet
Unit - 3 8255: (Programmable Peripheral Interface)
7 pages
Advanced Mechatronic Systems: 2014 International Conference On
No ratings yet
Advanced Mechatronic Systems: 2014 International Conference On
6 pages
Iot Based Solar Energy Monitoring System: Suprita M. Patil Vijayalashmi M
No ratings yet
Iot Based Solar Energy Monitoring System: Suprita M. Patil Vijayalashmi M
6 pages
Anand
No ratings yet
Anand
3 pages
Jurisdiction of Civil Courts Meaning Q. What Is The Meaning of The Word 'Jurisdiction'?
No ratings yet
Jurisdiction of Civil Courts Meaning Q. What Is The Meaning of The Word 'Jurisdiction'?
5 pages
Cause and Effect of Stereotyping
No ratings yet
Cause and Effect of Stereotyping
4 pages
Example 2: Vodafone SFA: Description, Role & Objectives
No ratings yet
Example 2: Vodafone SFA: Description, Role & Objectives
4 pages
J of Neuroscience Research - 2021 - Zhang
No ratings yet
J of Neuroscience Research - 2021 - Zhang
14 pages
Analytical Scoring Rubrics For Telling A Story or A Personal Anecdote
No ratings yet
Analytical Scoring Rubrics For Telling A Story or A Personal Anecdote
3 pages
Rsudza: Dr. Bobby H.E Fermi S
No ratings yet
Rsudza: Dr. Bobby H.E Fermi S
3 pages
Cardiovascular Regulatory Mechanisms
No ratings yet
Cardiovascular Regulatory Mechanisms
8 pages
Soalan Math
No ratings yet
Soalan Math
6 pages
Assigement On Lemon Uses
No ratings yet
Assigement On Lemon Uses
8 pages
1.DSM - Predicate Logic
No ratings yet
1.DSM - Predicate Logic
24 pages
HB UoE Unit 1
No ratings yet
HB UoE Unit 1
13 pages
Incidence of Leptospirosis in Captive Asiatic Elephant (Elephas Maximus)
No ratings yet
Incidence of Leptospirosis in Captive Asiatic Elephant (Elephas Maximus)
2 pages
Science Project Final 1
No ratings yet
Science Project Final 1
10 pages
Alternative Banking Channels and Economic Growth in Nigeria: Further Empirical Evidence
No ratings yet
Alternative Banking Channels and Economic Growth in Nigeria: Further Empirical Evidence
6 pages
2017 Yarmuch Et - Al Evaluating Crusher System Location in An Open Pit Mine Using Markov Chains PDF
No ratings yet
2017 Yarmuch Et - Al Evaluating Crusher System Location in An Open Pit Mine Using Markov Chains PDF
15 pages
LENG1161-Speaking To Inform Organizational Patterns (STUDENT WKT)
No ratings yet
LENG1161-Speaking To Inform Organizational Patterns (STUDENT WKT)
2 pages
Discourse As Dialogue Part 1
No ratings yet
Discourse As Dialogue Part 1
37 pages
Faculty of Economics and Management: Fakulti Ekonomi Dan Pengurusan
No ratings yet
Faculty of Economics and Management: Fakulti Ekonomi Dan Pengurusan
2 pages
Advanced-1-Reading-Practice-Test-5
No ratings yet
Advanced-1-Reading-Practice-Test-5
28 pages
Occupiers Liability 1
No ratings yet
Occupiers Liability 1
9 pages
CA21155 - DL The Loser - Press Release
No ratings yet
CA21155 - DL The Loser - Press Release
4 pages
Unit 2 Sources of History
No ratings yet
Unit 2 Sources of History
7 pages
Governor Generals List
No ratings yet
Governor Generals List
7 pages
CHEM PROJECT CLASS 12 With INVESTIGATORY PROJECT ON ' ANTACIDS' PDF
No ratings yet
CHEM PROJECT CLASS 12 With INVESTIGATORY PROJECT ON ' ANTACIDS' PDF
10 pages
What Is The Best Book To Prepare From For The New SAT - Quora
No ratings yet
What Is The Best Book To Prepare From For The New SAT - Quora
11 pages
Lesson Exemplar in Primary and Secondary Sources
No ratings yet
Lesson Exemplar in Primary and Secondary Sources
13 pages
Research Design: Meaning and Types. Formulation of Research Problem
No ratings yet
Research Design: Meaning and Types. Formulation of Research Problem
28 pages
As Math-10 Q3 W6
No ratings yet
As Math-10 Q3 W6
3 pages

Image Classification With The MNIST Dataset: Objectives

Uploaded by

Image Classification With The MNIST Dataset: Objectives

Uploaded by

01_mnist about:srcdoc

Image Classification with the MNIST Dataset

The Problem: Image Classification

The Solution: Deep Learning

The MNIST Dataset

Here are 40 of the images included in the MNIST dataset:

Training and Validation Data and Labels

1. x_train : Images used for training the neural network

Loading the Data Into Memory (with Keras)

There are many deep learning frameworks (https://developer.nvidia.com/deep-learning-frameworks), each

We will begin by loading the Keras dataset module for MNIST:

In [1]: from tensorflow.keras.datasets import mnist

In [2]: # the data, split between train and validation sets

Downloading data from https://storage.googleapis.com/tensorflow/tf-ke

Exploring the MNIST Data

Out[3]: (60000, 28, 28)

Out[4]: (10000, 28, 28)

In [9]: import matplotlib.pyplot as plt

Out[9]: <matplotlib.image.AxesImage at 0x7f3d61f19828>

Preparing the Data for Training

Flattening the Image Data

Here we accomplish this using the helper method reshape :

In [11]: x_train = x_train.reshape(60000, 784)

Out[12]: (60000, 784)

Normalizing the Image Data

In [14]: x_train = x_train / 255

Actual Color Is Red? Is Blue? Is Green?

Red True False False

Green False False True

Blue False True False

Green False False True

Actual Color Is Red? Is Blue? Is Green?

values = ['red, green, blue, green']

Categorically Encoding the Labels

Keras provides a utility to categorically encode values (https://www.tensorflow.org/api_docs/python/tf/keras

In [18]: import tensorflow.keras as keras

y_train = keras.utils.to_categorical(y_train, num_categories)

Creating the Model

1. An input layer, which will receive data in some expected format

Instantiating the Model

To begin, we will use Keras's Sequential (https://www.tensorflow.org/api_docs/python/tf/keras/Sequential)

In [20]: from tensorflow.keras.models import Sequential

Creating the Input Layer

In [21]: from tensorflow.keras.layers import Dense

In [22]: model.add(Dense(units=512, activation='relu', input_shape=(784,)))

Creating the Hidden Layer

In [23]: model.add(Dense(units = 512, activation='relu'))

Creating the Output Layer

In [24]: model.add(Dense(units = 10, activation='softmax'))

Summarizing the Model

Keras provides the model instance method summary (https://www.tensorflow.org/api_docs/python

Compiling the Model

In [26]: model.compile(loss='categorical_crossentropy', metrics=['accuracy'])

Training the Model

The training data

In [27]: history = model.fit( x_train, y_train, epochs=5, verbose=1, validation_

Clear the Memory

In [28]: import IPython

Out[28]: {'status': 'ok', 'restart': True}

In [1]: import numpy as np

m = -2 # -2 to start, change me please

print("Loss:", np.sum((y - y_hat)**2)/len(x))

In [2]: import IPython

Out[2]: {'status': 'ok', 'restart': True}

You might also like