0% found this document useful (0 votes)

62 views

Handwritten Character Recognition With Neural Network

This document discusses developing a neural network model for handwritten character recognition. Key steps include: 1. Importing Python libraries and reading in a dataset of images and labels for characters A-Z. 2. Splitting the data into training and test sets and reshaping the images. 3. Defining a convolutional neural network model with convolutional and max pooling layers followed by dense layers. 4. Training the model, evaluating accuracy on the test set, and using the model to make predictions on new images.

Uploaded by

shreyash sonone

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

62 views

Handwritten Character Recognition With Neural Network

Uploaded by

shreyash sonone

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as DOCX, PDF, TXT or read online on Scribd

You are on page 1/ 12

Handwritten Character

Recognition with Neural Network

We offer you a brighter future with FREE online courses Start Now!!
In this machine learning project, we will recognize handwritten characters,
i.e, English alphabets from A-Z. This we are going to achieve by modeling a
neural network that will have to be trained over a dataset containing images
of alphabets.

Project Prerequisites
Below are the prerequisites for this project:

1. Python (3.7.4 used)

2. IDE (Jupyter used)
Required frameworks are

1. Numpy (version 1.16.5)

2. cv2 (openCV) (version 3.4.2)
3. Keras (version 2.3.1)
4. Tensorflow (Keras uses TensorFlow in backend and for some
image preprocessing) (version 2.0.0)
5. Matplotlib (version 3.1.1)
6. Pandas (version 0.25.1)
Stay updated with latest technology trends
Join DataFlair on Telegram!!

Download Dataset
The dataset for this project contains 372450 images of alphabets of 28×2,
all present in the form of a CSV file:
Handwritten character recognition dataset

Steps to develop handwritten character

recognition
Download Project Code
Please download project source code: Handwritten Character Recognition
with Neural Network
import matplotlib.pyplot as plt
import cv2
import numpy as np
from keras.models import Sequential
from keras.layers import Dense, Flatten, Conv2D, MaxPool2D, Dropout
from keras.optimizers import SGD, Adam
from keras.callbacks import ReduceLROnPlateau, EarlyStopping
from keras.utils import to_categorical
import pandas as pd
import numpy as np
from sklearn.model_selection import train_test_split
from sklearn.utils import shuffle

First of all, we do all the necessary imports as stated above. We


will see the use of all the imports as we use them.
Read the data:
data = pd.read_csv(r"D:\a-z alphabets\A_Z Handwritten Data.csv").astype('float32')
print(data.head(10))

 Now we are reading the dataset using the pd.read_csv() and

printing the first 10 images using data.head(10)
(The above image shows some of the rows of the dataframe data using the
head() function of dataframe)

Split data into images and their labels:

X = data.drop('0',axis = 1)
y = data['0']

Splitting the data read into the images & their corresponding labels. The ‘0’
contains the labels, & so we drop the ‘0’ column from the data dataframe
read & use it in the y to form the labels.

Reshaping the data in the csv file so that it can be displayed as an image
train_x, test_x, train_y, test_y = train_test_split(X, y, test_size = 0.2)
train_x = np.reshape(train_x.values, (train_x.shape[0], 28,28))
test_x = np.reshape(test_x.values, (test_x.shape[0], 28,28))
print("Train data shape: ", train_x.shape)
print("Test data shape: ", test_x.shape)

 In the above segment, we are splitting the data into training &
testing dataset using train_test_split().
 Also, we are reshaping the train & test image data so that they can
be displayed as an image, as initially in the CSV file they were
present as 784 columns of pixel data. So we convert it to 28×28
pixels.
word_dict =
{0:'A',1:'B',2:'C',3:'D',4:'E',5:'F',6:'G',7:'H',8:'I',9:'J',10:'K',11:'L',12:'M',13:'N',14:'O',15:'P',16:'Q',17:'R',18:'S',19:
'T',20:'U',21:'V',22:'W',23:'X', 24:'Y',25:'Z'}

All the labels are present in the form of floating point values, that

we convert to integer values, & so we create a dictionary
word_dict to map the integer values with the characters.
Plotting the number of alphabets in the dataset
y_int = np.int0(y)
count = np.zeros(26, dtype='int')
for i in y_int:
count[i] +=1
alphabets = []
for i in word_dict.values():
alphabets.append(i)
fig, ax = plt.subplots(1,1, figsize=(10,10))
ax.barh(alphabets, count)
plt.xlabel("Number of elements ")
plt.ylabel("Alphabets")
plt.grid()
plt.show()

 Here we are only describing the distribution of the alphabets.

 Firstly we convert the labels into integer values and append into
the count list according to the label. This count list has the
number of images present in the dataset belonging to each
alphabet.
 Now we create a list – alphabets containing all the characters
using the values() function of the dictionary.
 Now using the count & alphabets lists we draw the horizontal bar
plot.

Shuffling the data

shuff = shuffle(train_x[:100])
fig, ax = plt.subplots(3,3, figsize = (10,10))
axes = ax.flatten()
for i in range(9):
_, shu = cv2.threshold(shuff[i], 30, 200, cv2.THRESH_BINARY)
axes[i].imshow(np.reshape(shuff[i], (28,28)), cmap="Greys")
plt.show()

 Now we shuffle some of the images of the train set.

 The shuffling is done using the shuffle() function so that we can
display some random images.
 We then create 9 plots in 3×3 shape & display the thresholded
images of 9 alphabets.

(The above image depicts the grayscale images that we got from the
dataset)

Data Reshaping
Reshaping the training & test dataset so that it can be put in the model
train_X = train_x.reshape(train_x.shape[0],train_x.shape[1],train_x.shape[2],1)
print("New shape of train data: ", train_X.shape)
test_X = test_x.reshape(test_x.shape[0], test_x.shape[1], test_x.shape[2],1)
print("New shape of train data: ", test_X.shape)
Now we reshape the train & test image dataset so that they can be put in the model.
New shape of train data: (297960, 28, 28, 1)
New shape of train data: (74490, 28, 28, 1)

Now we reshape the train & test image dataset so that they can be put in the
model.

New shape of train data: (297960, 28, 28, 1)

New shape of train data: (74490, 28, 28, 1)

train_yOHE = to_categorical(train_y, num_classes = 26, dtype='int')
print("New shape of train labels: ", train_yOHE.shape)
test_yOHE = to_categorical(test_y, num_classes = 26, dtype='int')
print("New shape of test labels: ", test_yOHE.shape)

Here we convert the single float values to categorical values. This is done as
the CNN model takes input of labels & generates the output as a vector of
probabilities.

Now we define the CNN.

What is CNN?
CNN stands for Convolutional Neural Networks that are used to extract the
features of the images using several layers of filters.

(Example of how a CNN looks logically)

The convolution layers are generally followed by maxpool layers that are
used to reduce the number of features extracted and ultimately the output
of the maxpool and layers and convolution layers are flattened into a vector
of single dimension and are given as an input to the Dense layer (The fully
connected network).

The model created is as follows:

model = Sequential()
model.add(Conv2D(filters=32, kernel_size=(3, 3), activation='relu', input_shape=(28,28,1)))
model.add(MaxPool2D(pool_size=(2, 2), strides=2))
model.add(Conv2D(filters=64, kernel_size=(3, 3), activation='relu', padding = 'same'))
model.add(MaxPool2D(pool_size=(2, 2), strides=2))
model.add(Conv2D(filters=128, kernel_size=(3, 3), activation='relu', padding = 'valid'))
model.add(MaxPool2D(pool_size=(2, 2), strides=2))
model.add(Flatten())
model.add(Dense(64,activation ="relu"))
model.add(Dense(128,activation ="relu"))
model.add(Dense(26,activation ="softmax"))

Above we have the CNN model that we designed for training the model over
the training dataset.

Compiling & Fitting Model

model.compile(optimizer = Adam(learning_rate=0.001), loss='categorical_crossentropy', metrics=['accuracy'])
history = model.fit(train_X, train_yOHE, epochs=1, validation_data = (test_X,test_yOHE))

 Here we are compiling the model, where we define the optimizing

function & the loss function to be used for fitting.
 The optimizing function used is Adam, that is a combination of
RMSprop & Adagram optimizing algorithms.
 The dataset is very large so we are training for only a single
epoch, however, as required we can even train it for multiple
epochs (which is recommended for character recognition for
better accuracy).
model.summary()
model.save(r'model_hand.h5')

Now we are getting the model summary that tells us what were the different
layers defined in the model & also we save the model
using model.save() function.
(Summary of the defined model)

Getting the Train & Validation Accuracies & Losses

print("The validation accuracy is :", history.history['val_accuracy'])
print("The training accuracy is :", history.history['accuracy'])
print("The validation loss is :", history.history['val_loss'])
print("The training loss is :", history.history['loss'])

In the above code segment, we print out the training & validation
accuracies along with the training & validation losses for character
recognition.
Doing Some Predictions on Test Data
fig, axes = plt.subplots(3,3, figsize=(8,9))
axes = axes.flatten()
for i,ax in enumerate(axes):
img = np.reshape(test_X[i], (28,28))
ax.imshow(img, cmap="Greys")
pred = word_dict[np.argmax(test_yOHE[i])]
ax.set_title("Prediction: "+pred)
ax.grid()

 Here we are creating 9 subplots of (3,3) shape & visualize some of

the test dataset alphabets along with their predictions, that are
made using the model.predict() function for text recognition.
Doing Prediction on External Image
img = cv2.imread(r'C:\Users\abhij\Downloads\img_b.jpg')
img_copy = img.copy()
img = cv2.cvtColor(img, cv2.COLOR_BGR2RGB)
img = cv2.resize(img, (400,440))

 Here we have read an external image that is originally an image of

alphabet ‘B’ and made a copy of it that is to go through some
processing to be fed to the model for the prediction that we will
see in a while.
 The img read is then converted from BGR representation (as
OpenCV reads the image in BGR format) to RGB for displaying
the image, & is resized to our required dimensions that we want
to display the image in.
img_copy = cv2.GaussianBlur(img_copy, (7,7), 0)
img_gray = cv2.cvtColor(img_copy, cv2.COLOR_BGR2GRAY)
_, img_thresh = cv2.threshold(img_gray, 100, 255, cv2.THRESH_BINARY_INV)
img_final = cv2.resize(img_thresh, (28,28))
img_final =np.reshape(img_final, (1,28,28,1))

 Now we do some processing on the copied image (img_copy).

 We convert the image from BGR to grayscale and apply
thresholding to it. We don’t need to apply a threshold we could
use the grayscale to predict, but we do it to keep the image
smooth without any sort of hazy gray colors in the image that
could lead to wrong predictions.
 The image is to be then resized using cv2.resize() function into the
dimensions that the model takes as input, along with reshaping
the image using np.reshape() so that it can be used as model
input.
img_pred = word_dict[np.argmax(model.predict(img_final))]
cv2.putText(img, "Dataflair _ _ _ ", (20,25), cv2.FONT_HERSHEY_TRIPLEX, 0.7, color = (0,0,230))
cv2.putText(img, "Prediction: " + img_pred, (20,410), cv2.FONT_HERSHEY_DUPLEX, 1.3, color =
(255,0,30))
cv2.imshow('Dataflair handwritten character recognition _ _ _ ', img)

 Now we make a prediction using the processed image & use the
np.argmax() function to get the index of the class with the highest
predicted probability. Using this we get to know the exact
character through the word_dict dictionary.
 This predicted character is then displayed on the frame.
while (1):
k = cv2.waitKey(1) & 0xFF
if k == 27:
break
cv2.destroyAllWindows()

 Here we are setting up a waitKey in a while loop that will be stuck

in loop until Esc is pressed, & when it gets out of loop using
cv2.destroyAllWindows() we destroy any active windows created
to stop displaying the frame.
Conclusion
We have successfully developed Handwritten character recognition (Text
Recognition) with Python, Tensorflow, and Machine Learning libraries.

Handwritten characters have been recognized with more than 97% test
accuracy. This can be also further extended to identifying the handwritten
characters of other languages too.

C1W3 Assignment
No ratings yet
C1W3 Assignment
7 pages
Mnist Handwritten Digit Classification
No ratings yet
Mnist Handwritten Digit Classification
26 pages
505H Digital Governor For Hydraulic Turbines
No ratings yet
505H Digital Governor For Hydraulic Turbines
248 pages
ppt for presentation
No ratings yet
ppt for presentation
27 pages
Classifying Hand-Written Digits Using Neural Network
No ratings yet
Classifying Hand-Written Digits Using Neural Network
21 pages
748747019-ad3511-deep-learning-lab-manual-iii-yearjnn (1)-1
No ratings yet
748747019-ad3511-deep-learning-lab-manual-iii-yearjnn (1)-1
51 pages
DL Lab-final
No ratings yet
DL Lab-final
22 pages
DL Lab-III-II
No ratings yet
DL Lab-III-II
98 pages
Deep Learning Manual (1)
No ratings yet
Deep Learning Manual (1)
53 pages
How to Develop a CNN for MNIST Handwritten Digit Classification
No ratings yet
How to Develop a CNN for MNIST Handwritten Digit Classification
43 pages
Arabic OCR Report
No ratings yet
Arabic OCR Report
20 pages
Capstone Project Report (Digit-Recognition Using CNN)
No ratings yet
Capstone Project Report (Digit-Recognition Using CNN)
11 pages
Assignment 2
No ratings yet
Assignment 2
3 pages
Assignment 02# - Machine Learning 2023
No ratings yet
Assignment 02# - Machine Learning 2023
8 pages
Aishwarya MiniProjectReport - SC
No ratings yet
Aishwarya MiniProjectReport - SC
6 pages
hand writing using _cnn (1)
No ratings yet
hand writing using _cnn (1)
5 pages
Capstone Project-1
No ratings yet
Capstone Project-1
15 pages
Ad3511 Deep Learning Lab Manual
No ratings yet
Ad3511 Deep Learning Lab Manual
80 pages
CD 601 Lab Manual
No ratings yet
CD 601 Lab Manual
61 pages
Final Code
No ratings yet
Final Code
16 pages
Traffic Signs Recognition
No ratings yet
Traffic Signs Recognition
12 pages
Lab 1 Assignment_W2022
No ratings yet
Lab 1 Assignment_W2022
7 pages
ASNM Program Explain
No ratings yet
ASNM Program Explain
4 pages
"I C U N N ": Mage Lassification Sing Eural Etworks
No ratings yet
"I C U N N ": Mage Lassification Sing Eural Etworks
15 pages
AD3511 - Deep Learning Lab Manual - - Copy
No ratings yet
AD3511 - Deep Learning Lab Manual - - Copy
61 pages
Case Study - AP23322130042
No ratings yet
Case Study - AP23322130042
7 pages
Assignment 2.4.1 Multiclass Classification
No ratings yet
Assignment 2.4.1 Multiclass Classification
5 pages
A First Look On Nueral Network
No ratings yet
A First Look On Nueral Network
8 pages
Project Documentation
No ratings yet
Project Documentation
24 pages
Deep Learning for Vision Lab Manual 2024
100% (1)
Deep Learning for Vision Lab Manual 2024
25 pages
Downloaded by R GAYATHRI (R.gayathri@aalimec - Ac.in)
No ratings yet
Downloaded by R GAYATHRI (R.gayathri@aalimec - Ac.in)
56 pages
DL Practical
No ratings yet
DL Practical
23 pages
21BCP167_AI_9
No ratings yet
21BCP167_AI_9
10 pages
CNN MATLAB Lab Instructions
No ratings yet
CNN MATLAB Lab Instructions
7 pages
Assignment 2 Dl
No ratings yet
Assignment 2 Dl
10 pages
Image Classification of An American Sign Language Dataset: Objectives
No ratings yet
Image Classification of An American Sign Language Dataset: Objectives
11 pages
This Code Fragment Defines A Single Layer With Artificial Neurons, and It Expects Input Variables
No ratings yet
This Code Fragment Defines A Single Layer With Artificial Neurons, and It Expects Input Variables
9 pages
This Code Fragment Defines A Single Layer With Artificial Neurons, and It Expects Input Variables
No ratings yet
This Code Fragment Defines A Single Layer With Artificial Neurons, and It Expects Input Variables
9 pages
Cad and Dog 2
No ratings yet
Cad and Dog 2
5 pages
Dl 5 Excuted
No ratings yet
Dl 5 Excuted
13 pages
Assignment 3 DS5620
No ratings yet
Assignment 3 DS5620
11 pages
Project
No ratings yet
Project
4 pages
dl lab_merged (2)
No ratings yet
dl lab_merged (2)
60 pages
Deep Learning Lab With Output
No ratings yet
Deep Learning Lab With Output
12 pages
Deep Learning Lab Manual
No ratings yet
Deep Learning Lab Manual
46 pages
Convolutional Neural Networks: Objectives
No ratings yet
Convolutional Neural Networks: Objectives
10 pages
Deep Learning Lab Practicals
No ratings yet
Deep Learning Lab Practicals
24 pages
cat_dog_classification_CNN_Model
No ratings yet
cat_dog_classification_CNN_Model
13 pages
C2_W1_Assignment
No ratings yet
C2_W1_Assignment
24 pages
DL Programs
No ratings yet
DL Programs
12 pages
DL_LAB_MANUAL_mugesh
No ratings yet
DL_LAB_MANUAL_mugesh
12 pages
C2_W1_Assignment
No ratings yet
C2_W1_Assignment
25 pages
Experiement 1,2,4 and 5
No ratings yet
Experiement 1,2,4 and 5
12 pages
Practical 08 Solutions
No ratings yet
Practical 08 Solutions
6 pages
BATCH 6 for presentation
No ratings yet
BATCH 6 for presentation
37 pages
DL Practical 6,7 Outputs
No ratings yet
DL Practical 6,7 Outputs
9 pages
G54 Midterm
No ratings yet
G54 Midterm
15 pages
Explore the Implementation of CNNs in Python
No ratings yet
Explore the Implementation of CNNs in Python
10 pages
MNIST Based Handwritten Digits Recognition
No ratings yet
MNIST Based Handwritten Digits Recognition
5 pages
Profound Python Data Science
From Everand
Profound Python Data Science
Onder Teker
No ratings yet
Advanced C Concepts and Programming: First Edition
From Everand
Advanced C Concepts and Programming: First Edition
Gayatri
3/5 (1)
Datasheet
No ratings yet
Datasheet
28 pages
Data Analytics and Reporting - Notes Unit 1 and 2
No ratings yet
Data Analytics and Reporting - Notes Unit 1 and 2
11 pages
AIN2601 - Assessment 5 - 2023 - S2
No ratings yet
AIN2601 - Assessment 5 - 2023 - S2
35 pages
Jai Narain College of Technology, Bhopal Department of Information Technology Course Completion Unit Plan E-Commerce and Governance
No ratings yet
Jai Narain College of Technology, Bhopal Department of Information Technology Course Completion Unit Plan E-Commerce and Governance
2 pages
Muppai Rojullo Python
No ratings yet
Muppai Rojullo Python
85 pages
Assignment For ISO27001 Consultant
No ratings yet
Assignment For ISO27001 Consultant
17 pages
Msbte Campus Agri Report-2
No ratings yet
Msbte Campus Agri Report-2
75 pages
Chinese-Thai-English Translation Audible Electronic Dictionary Design and Implementation
No ratings yet
Chinese-Thai-English Translation Audible Electronic Dictionary Design and Implementation
7 pages
Bill 2
No ratings yet
Bill 2
1 page
Digital Marketing: Social Media Marketing Part 1
No ratings yet
Digital Marketing: Social Media Marketing Part 1
75 pages
Servo Control System
No ratings yet
Servo Control System
19 pages
Fortigate Lab
No ratings yet
Fortigate Lab
242 pages
Getting Started With: Dataverse
No ratings yet
Getting Started With: Dataverse
40 pages
ECB-641 2nd Quick Installation Guide PDF
No ratings yet
ECB-641 2nd Quick Installation Guide PDF
20 pages
Sample Report Cyber Threat Assessment Black Hat
No ratings yet
Sample Report Cyber Threat Assessment Black Hat
10 pages
Syriac Open Fonts For Windows
No ratings yet
Syriac Open Fonts For Windows
20 pages
GlobalLogic Nautilus Platform
No ratings yet
GlobalLogic Nautilus Platform
2 pages
User's Manual For FLP Spreadsheet Solver: University of Bath
No ratings yet
User's Manual For FLP Spreadsheet Solver: University of Bath
12 pages
Etl Syllabus
No ratings yet
Etl Syllabus
2 pages
M-Trends 2022 Report
No ratings yet
M-Trends 2022 Report
95 pages
Karim Elsayed
No ratings yet
Karim Elsayed
6 pages
Chapter 9 Introduction of Object Oriented Programming: Lecturer: Mrs Rohani Hassan
No ratings yet
Chapter 9 Introduction of Object Oriented Programming: Lecturer: Mrs Rohani Hassan
40 pages
TOSN LoRa
No ratings yet
TOSN LoRa
35 pages
C-Zone SDN BHD: Price List Effective 10 AUG 2019
No ratings yet
C-Zone SDN BHD: Price List Effective 10 AUG 2019
2 pages
2D Array and Insertion Sort
No ratings yet
2D Array and Insertion Sort
3 pages
Intrusion Detection System in MANET: A Survey: S.Parameswari, G.Michael
No ratings yet
Intrusion Detection System in MANET: A Survey: S.Parameswari, G.Michael
4 pages
Lab9 SQL Injection - SQL Injection UNION Attacks
No ratings yet
Lab9 SQL Injection - SQL Injection UNION Attacks
4 pages
MemorySeg and Banking
No ratings yet
MemorySeg and Banking
25 pages
radcom-LoLog 450
No ratings yet
radcom-LoLog 450
2 pages