WritingSample_3
WritingSample_3
3.2.2. Rescaling
The images have been collected from different sources and captured with different
devices. So, the images are not of the same size, therefore the images need to be
resized.
3.3. Segmentation
Image segmentation is required to separate a digital image into distinct areas
(Bhargava and Bansal 2021).
3.3.1. K-Means Clustering
K-means clustering is a type of vector quantization that divides n numbers of data
into k number of clusters. Every observation is assigned to the cluster with the
closest mean which acts as the prototype for the cluster. As a result, Voronoi cells
are created in the data space (Vertica, 2022). Within-cluster variances are
minimized using K-means clustering (squared Euclidean distances). Regular
Euclidean distances, on the other hand, are not the more difficult Weber problem:
the mean minimizes squared errors, but only the geometric median reduces
Euclidean distances. Better Euclidean solutions can be produced, for example, by
employing k-medians and k-medoids. The supervised KNN classifier, which is
commonly confused with k-means due to its name, is connected to the
unsupervised k-means algorithm. Using the cluster generated by k-means, the 1-
nearest neighbor classifier is used to categorize incoming data into existing
clusters.
3.3.2. Thresholding
Thresholding is an image segmentation technique that applies to the change of the
pixels of an image to make it easier to analyze. It turns into a binary image from a
color or grayscale image using thresholding. While the user can or should pick the
threshold T manually in some circumstances, the user often prefers the threshold
to be established automatically by an algorithm. The threshold should be the "best"
threshold in certain circumstances, separating the brighter foreground items from
the darker background objects. The threshold for picture intensity (image
brightness) can be preset manually or automatically by some applications. Pixels
whose bit values are zero turn black, whereas those with a bit value larger than
zero become white (a bit value of one).
74 | Fruits and Vegetables Disease Detection System Based on Indications Using
Machine Learning Approach: A Systematic Review
3.4. Feature Selection
Feature selection is the process of selecting the most important features to input in
machine learning algorithms. Feature selection techniques are implemented to
reduce the number of input variables by eliminating redundant or irrelevant
features and narrowing down the set of features to those most relevant to the
machine learning model. Some of the most common feature selection methods are:
3.4.1. Global Color Histogram (GCH)
GCH is a common term in machine learning. Color is regarded as one of the most
important forms of visual representation. Changes to an image's size, rotation, or
translation into different forms do not affect the image's color. Color space, color
reduction, and the color feature extraction procedure are all factors to consider
when extracting color characteristics from a picture (Han et al., 2022).
3.4.3. SVM
LBP is a robust method for image feature extraction in image processing. LBP
reflects the correlation among pixels within a local area (usually a 3×3 area) which
represents the local information (Bingham et al., 2015). By applying LBP, texture
pattern probability can be precise in a histogram. LBP values need to be
determined for all the image pixels (Prakasa, 2016).
3.4.6. LVQNN
LVQNN is a superior classification technique for digital pictures. In general, it's a
good idea to prepare data for LVQNN the same way it is done for KNN.
K-Means
JJMCO clustering
Image More than
E and J Apple Fruit segmentation
enhancement 95%
(2017) algorithm
LVQNN
Contrast
Apple
enhancement
Bell paper
Blueberry
Cherry(sour)
Corn(maize)
Ullah et Grape
al. Orange Leaf CNN 97.33%
(2019) Peach Image resizing
Potato
Raspberry
Soybean
Strawberry
Squash
Tomato
Top hat
transform
Ouyang Image
Median
et al. Strawberry Fruit segmentation 99%
filtering for
(2012) algorithm
removing noise
interference
Grape 84.65%
Mango and 76.6%
Pujari et
ANN for healthy
al. Fruit None
classifier and
(2013) Pomegranate
diseased
fruits
Qin et SID method
al. Citrus Fruit None for 96.2%
(2009) classification
Journal of FST, Volume 01, Issue 01, July 2022 | 77
Keras image
Khattak
Fruit pre-processing 94.55%
et al.
Citrus and using data CNN
(2021)
leaves generator class
and API
Grayscale
processing
Sorting
Yang et Binarization
image
al. Apple Fruit Enhancement 99.70%
recognition
(2022) Processing
system
Feature
extraction
Image SVM
ANN
segmentation (93.12%)
Doh et
SVM
al. Citrus Fruit
Feature Phenotyping ANN
(2019)
extraction K-means (88.96%)
clustering
Resizing and
Chowdh
Normalizing
ury et
Tomato Leaves Leaf CNN 99.89%
al.
segmentation
(2021)
Augmentation
Image DCNN
acquisition structures (
AlexNet,
Mostafa
SqueezeNet,
et al. Guava Plant 97.74%
GoogLeNet,
(2021) Illumination
ResNet-50,
and ResNet-
101)
78 | Fruits and Vegetables Disease Detection System Based on Indications Using
Machine Learning Approach: A Systematic Review
4. Comparative Analysis
After reviewing the works of multiple researchers, we can summarize that different
algorithms work best on different fruits and vegetables. In Figure 1, we can get a
brief idea about the accuracy rate for detecting disease in mango plants by using
different algorithms.
Journal of FST, Volume 01, Issue 01, July 2022 | 79
Accuracy Vs Algorithm
120.00%
97.13% 96%
100.00%
80%
80.00%
60.00%
40.00%
20.00%
0.00%
MCNN SVM, K-means DCNN
clustering
Accuracy Vs Algorithm
98.00% 97.18% 96.90%
97.00%
96.00%
95.00%
94.00%
93%
93.00%
92.00%
91.00%
90.00%
CNN LBP K-means clustering
segmentation
algorithm
Accuracy vs Algorithm
98.00% 96.20%
96.00% 94.55%
94.00% 93.12%
92.00%
90.00%
88.00% 86.96%
86.00%
84.00%
82.00%
SID method for CNN ANN SVM
classification
Han, Ju & Ma, Kai-Kuang. (2002). Fuzzy color histogram and its use in color
image retrieval. IEEE transactions on image processing: a publication of
the IEEE Signal Processing Society, 11, pp. 944-52.
JJMCOE, J. (2017). Detection and Classification of Apple Fruit Diseases using K-
means clustering and Learning Vector Quantization Neural Network.
Khan, A. I., Quadri, S. M. K., & Banday, S. (2020). Deep learning for apple
diseases: classification and identification. arXiv preprint
arXiv:2007.02980.
Khattak, A., Asghar, M. U., Batool, U., Asghar, M. Z., Ullah, H., Al-Rakhami, M.,
& Gumaei, A. (2021). Automatic detection of citrus fruit and leaves
diseases using deep neural network model. IEEE Access, 9, pp. 112942-
112954.
McGregor, M. (2020, July 02). SVM machine learning tutorial – what is the
support vector machine algorithm, explained with code examples.
Retrieved June 23, 2022, from https://www.freecodecamp.org/news/svm-
machine-learning-tutorial-what-is-the-support-vector-machine-
algorithm-explained-with-code-examples/ (Last accessed 21 July 2022)
Mia, M., Roy, S., Das, S. K., & Rahman, M. (2020). Mango leaf disease
recognition using neural network and support vector machine. Iran Journal
of Computer Science, 3(3), pp.185-193.
Mostafa, A. M., Kumar, S. A., Meraj, T., Rauf, H. T., Alnuaim, A. A., &
Alkhayyal, M. A. (2021). Guava Disease Detection Using Deep
Convolutional Neural Networks: A Case Study of Guava Plants. Applied
Sciences, 12(1), 239, pp. 1-19.
Ouyang, C., Li, D., Wang, J., Wang, S., & Han, Y. (2012, October). The research
of strawberry disease identification based on image processing and pattern
recognition. In International Conference on Computer and Computing
Technologies in Agriculture. Springer, Berlin, Heidelberg. pp. 69-77.
Öztürk, Ş., & Akdemir, B. (2018). Application of feature extraction and
classification methods for histopathological image using GLCM, LBP,
LBGLCM, GLRLM and SFTA. Procedia computer science, 132, pp. 40-
46.
84 | Fruits and Vegetables Disease Detection System Based on Indications Using
Machine Learning Approach: A Systematic Review
Pass, G., Zabih, R., & Miller, J. (1997, February). Comparing images using color
coherence vectors. In Proceedings of the fourth ACM international
conference on Multimedia, pp. 65-73.
Prakasa, E. (2016). Texture feature extraction by using local binary pattern.
INKOM Journal, 9(2), pp. 45-48.
Pujari, J. D., Yakkundimath, R., & Byadgi, A. S. (2013). Grading and
classification of anthracnose fungal disease of fruits based on statistical
texture features. International Journal of Advanced Science and
Technology, 52(1), pp. 121-132.
Qin, J., Burks, T. F., Ritenour, M. A., & Bonn, W. G. (2009). Detection of citrus
canker using hyperspectral reflectance imaging with spectral
information divergence. Journal of food engineering, 93(2), pp.183-191.
Singh, U. P., Chouhan, S. S., Jain, S., & Jain, S. (2019). Multilayer convolution
neural network for the classification of mango leaves infected by
anthracnose disease. IEEE Access, 7, pp. 43721-43729.
Ullah, M. R., Dola, N. A., Sattar, A., & Hasnat, A. (2019, November). Plant
Diseases Recognition Using Machine Learning. In 2019 8th
International Conference System Modeling and Advancement in
Research Trends (SMART). IEEE, pp. 67-73.
VerticaPy. (2022). Retrieved 22 June 2022, from
https://www.vertica.com/python/documentation_last/learn/KMeans/
(Last accessed 21 July 2022)
Yang, M., Kumar, P., Bhola, J., & Shabaz, M. (2022). Development of image
recognition software based on artificial intelligence algorithm for the
efficient sorting of apple fruit. International Journal of System
Assurance Engineering and Management, 13(1), pp. 322-330.
Zaki, M. A., Narejo, S., Ahsan, M., Zai, S., Anjum, M. R., & u Din, N. (2021).
Image-based Onion Disease (Purple Blotch) Detection using Deep
Convolutional Neural Network. Image, 12(5), pp. 448-458.