Hybrid_Deep_Learning_Algorithms_for_Dog_Breed_IdentificationA_Comparative_Analysis

Uploaded by

thailadevona6

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

22 views

Hybrid_Deep_Learning_Algorithms_for_Dog_Breed_IdentificationA_Comparative_Analysis

Uploaded by

thailadevona6

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 12

Received 15 June 2023, accepted 10 July 2023, date of publication 24 July 2023, date of current version 28 July 2023.

Digital Object Identifier 10.1109/ACCESS.2023.3297440

Hybrid Deep Learning Algorithms for Dog Breed

Identification—A Comparative Analysis
B. VALARMATHI 1 , N. SRINIVASA GUPTA 2 , G. PRAKASH 3, R. HEMADRI REDDY 4,

S. SARAVANAN 5 , AND P. SHANMUGASUNDARAM 6

1 Department of Software and Systems Engineering, School of Information Technology and Engineering, Vellore Institute of Technology, Vellore, Tamil Nadu
632014, India
2 Department of Manufacturing Engineering, School of Mechanical Engineering, Vellore Institute of Technology, Vellore, Tamil Nadu 632014, India
3 Department of Database Systems, School of Computer Science and Engineering, Vellore Institute of Technology, Vellore, Tamil Nadu 632014, India
4 Department of Mathematics, School of Advanced Sciences, Vellore Institute of Technology, Vellore, Tamil Nadu 632014, India
5 Department of Electronics and Communication Engineering, Srinivasa Ramanujan Centre, SASTRA Deemed University, Kumbakonam, Tamil Nadu 612001,

India
6 Department of Mathematics, College of Natural and Computational Sciences, Mizan-Tepi University, Mizan Teferi 5140, Ethiopia

Corresponding author: P. Shanmugasundaram ([email protected])

ABSTRACT Deep learning and computer vision algorithms will be applied to find the breed of the dog
from an image. The goal is to have the user submit an image of a dog, and the model will choose one of
the 120 breeds stated in the dataset to determine the dog’s breed. The proposed work uses various deep
learning algorithms like Xception, VGG19, NASNetMobile, EfficientNetV2M, ResNet152V2, Hybrid of
Inception &Xception, and Hybrid of EfficientNetV2M, NASNetMobile, Inception &Xception to predict
dog breeds. ResNet101, ResNet50, InceptionResNetV2, and Inception-v3 on the Stanford Dogs Standard
Datasetswere used in the existing system. The proposed models are considered a hybrid of Inception-v3
&Xception and a hybrid of EfficientNetV2M, NASNetMobile, Inception & Xception. This hybrid model
outperforms single models like Xception, VGG19, InceptionV3, ResNet50, and ResNet101.The authors used
a transfer learning algorithm with data augmentation to increase their accuracy and achieved a validation
accuracy score of 71.63% for ResNet101, 63.78% for ResNet50, 40.72% for InceptionResNetV2, and
34.84% for InceptionV3. This paper compares the proposed algorithms with existing ones like ResNet101,
ResNet50, InceptionResNetV2, and InceptionV3. In the existing system, ResNet101 gave the highest
accuracy of 71.63%. The proposed algorithms give a validation accuracy score of 91.9% for Xception,
55% for VGG19, 83.47% for NASNetMobile, 89.05% for EfficientNetV2M, 87.38% for ResNet152V2,
92.4% for Hybrid of Inception-v3 &Xception, and 89.00% for Hybrid of EfficientNetV2M, NASNetMobile,
Inception &Xception. Among these algorithms, the Hybrid of Inception-v3 &Xception gives the highest
accuracy of 92.4%.

INDEX TERMS EfficientNetV2M, hybrid of EfficientNetV2M, hybrid of inception and Xception, NasNet-
Mobile, ResNet152V2, VGG19, Xception.

I. INTRODUCTION tool in artificial intelligence applications. Optical Charac-

Nowadays, there is an increasing demand and usage of image ter Recognition and facial recognition are two examples of
classification and verification techniques; the most signifi- computer vision applications. These areas offer impressive
cant technique used for image data classification is a deep results, fueling increasing interest in deep learning. Classify-
learning model known as Convolutional Neural Networks ing images is a field in which deep learning excels. CNN is the
(CNN). Deep learning is increasingly becoming a crucial most popular deep-learning method for classifying images.
The proposed work is to investigate a variety of Convolu-
The associate editor coordinating the review of this manuscript and tional Neural Network models for classification. The created
approving it for publication was Nuno M. Garcia . algorithm might be used in a mobile or online application.

This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/
77228 VOLUME 11, 2023
B. Valarmathi et al.: Hybrid Deep Learning Algorithms for Dog Breed Identification

Transfer training, which helps us produce effective outcomes, set for this study. The data set was split into two categories:
was used to create the CNN that categorizes dog breeds. training and testing, with the training set being used to train
Convolutional Neural Networks (CNNs) are made up of the established model. Using more widespread pet dogs as
neurons with adjustable biases and weights. The dot product samples, it was suggested by the authors a yolov3-based
is computed by each neuron using input data. Convolution pet dog categorization model. Instead of using the pet dog’s
Neural Network architectures differ from other neural net- general characteristics, the model chose the face of the pet
works in that itrelies on actual images as inputs, unlike other dog and marked the dog category by detecting facial features,
neural network types. This enables the inclusion of partic- similar to face detection. The trials’ findings demonstrated
ular components in the design. CNN lower the number of that this approach was capable of rapidly and correctly locat-
observable variables that characterize a network. Single-layer ing the position of a pet dog’s face, and it resolved the issue
neurons act independently from prior layers. of pet dog detection and categorization.
A convolutional neural network architecture from the In this paper [2], the authors studied how animal identifi-
Inception family called Inception-v3 employs an additional cation in veterinary practice was managed through machine
classifier, factorized 7 × 7 convolutions, and label smoothing learning. Electronic animal health records included digi-
to carry label information further down the network. tal image graphs using image processing and recognition
Deep-separable convolution layers make up the complete technology to identify animals. The authors studied how
architecture of the CNN called Xception. combining ‘‘soft’’ biometrics, such as breed and facial bio-
A type of Convolution Neural Network called NASNet was metrics, could improve dog identification. The researchers
found through research into neuronal design. Normal cells applied transfer learning from GoogLeNet to propose Breed-
and decreasing cells serve as the foundation. Net for breed classification and subsequently to propose
EfficientNet is a CNN architecture; all depth, width, DogNet for identifying individual dogs within the classified
and resolution factors are scaled using a compound coeffi- breeds.
cient method. The EfficientNet scaling technique, in contrast In this article [3], by determining a dog’s breed in a given
to conventional practice, uniformly adjusts network width, image, this work aimed to solve the problem of fine-grain and
depth, and resolution using a collection of fixed scaling coef- multi-class image recognition. One of the sophisticated deep
ficients. learning approaches used in the research system was convo-
Residual Networks, or ResNets, rather than learning non- lutional neural networks. Two distinct networks were con-
referenced functions, learn residual functions about layer structed and assessed using the Stanford Dogs dataset. Con-
inputs.The residual nets enable those layers to match a resid- volutive neural networks’ application and evaluation were
ual map instead, assuming that each stacked layer exactly demonstrated using a software system. It had a central server
matches a desired underlying map. ResNets stack residual and a mobile client with resources and tools for online and
blocks together to build networks, such as ResNet-50, which offline neural network analysis. Two distinct convolutional
has 50 layers. neural network architectures were presented: the Inception-
A deep convolutional neural network is generally chal- ResNet-v2 deep architecture and the NASNet-A mobile
lenging to train from the start. Because size and depth data architecture. Deep Inception-ResNet-v2 model outperformed
set relevant to the neural network are rare, using transfer even the smaller, mobile-friendly CNN, with results that were
learning as a feature extractor is optimal. A model wasalready still encouraging.
trained on a large data collection. Transfer learning is a crucial This study [4] provided two models for categorizing dogs
component of deep convolutional neural networks that offers into different breeds. Due to the increasing difficulty of clas-
solutions to these issues. In computer vision, transfer learning sifying dogs and the fact that these classifications were based
is defined as using a pre-trained model. A pre-trained model on deep learning, forming the two models that provide differ-
must have been trained on a sizable benchmark dataset to ent levels of accuracy at both ends requires a fully defined
handle an issue similar to ours. The magnitude of the new data set. Since every model was periodically subjected to
dataset and how closely it resembles the existing dataset predictions, the researchers encountered numerous function-
play a significant role in determining transfer learning. So, ing levels during the investigation that weren’t considered in
before using transfer learning, a few circumstances shall be earlier research. The essential concept of transfer learning,
considered. The efficacy of the CNN is reduced by overfitting which dealt with the data augmentation technique and its
when the new dataset is smaller but contains the same data as capacity to increase the size of the data set, is also built upon
the old dataset. If the new dataset is large and has content by their approach. Afterward, accuracy levels were matched
comparable to the old data, the model can be refined using or compared with both models to establish a comparison for
the entire network. both models. A detailed procedure was also used to classify
the data. The comparison between Inception V3 and VGG16
II. LITERATURE SURVEY was offered in the publication. Observations showed that
In this research [1], to categorize and identify pet dogs’ faces, Inception V3 offered an accuracy of 85, whereas VGG16
the researchers offered an improved Yolov3 model. Eight provided a much lower accuracy of 69 than the Inception V3
distinct breeds of pet dogs were used to construct the data model.