Classification of Sugarcane Diseases Using Machine Learning Algorithms
Classification of Sugarcane Diseases Using Machine Learning Algorithms
How to cite: E. R. Arboleda and J. L. N. Vergara, " Classification of sugarcane diseases using machine learning
algorithms," Jurnal Teknologi dan Sistem Komputer, vol. x, no. x, pp. xx-xx, 202x. doi:
10.14710/jtsiskom.x.x.202x.xx-xx, [Online].
Abstract - Sugarcane (Saccharum officinarum L.) many parts of Asia due to an abundance of insect pests
belongs to the Poaceae (grass) family's and diseases [6]. Sugarcane is contaminated with over
Andropogoneae tribe. Modern sugarcane cultivars are 200 diseases caused by fungi, viruses, bacteria,
primarily the product of interspecific hybridizations phytoplasmas, and nematodes, which are among the
between Saccharum species. Sugarcane yields have leading causes of sugarcane yield losses worldwide.
been rigorously abridged in various parts of Asia as Viruses are among the world's most essential sugarcane
attacked by number of diseases. Amongst all the diseases, both historically and in the future, causing
diseases, red rot and smut are causing significant large epidemics and significant losses. As a result,
outbreaks in recent years, causing 30-100% yield loss sugarcane viruses must still be recognized in breeding
in commercial sugarcane cultivars. The MATLAB programs worldwide, where the collection and removal
2021 Classification Learner App having 28 different of susceptible clones after resistance tests have become
machine learning was used in classifying the different standard practice [7].
diseases. The study came up to a Medium Neural Sugarcane crop is attacked by a massive number of
Network classifier. This model obtained an accuracy insect pests and diseases [8]. A large number of insect
of 100% with a training time of 0.85113 seconds. pests and diseases attack the sugarcane crop. Complexes
Keywords – sugarcane; medium neural network, of branch feeders, sap-sucking insects (e.g., aphids,
classification learner application thrips, mealybugs), root feeders (e.g., white grubs, stem
borers), and spider mites are among the arthropod pests
associated with the crop across the world [9].
I. INTRODUCTION
Being a long durational crop, sugarcane is prone to
Sugarcane (Saccharum officinarum L.) belongs to several diseases caused by pathogens, viz. fungi,
the Poaceae (grass) family's Andropogoneae tribe. bacteria, viruses, and phytoplasmas-like organisms.
Modern sugarcane cultivars are primarily the product of Amongst all the diseases, red rot and smut are causing
interspecific hybridizations between Saccharum species significant outbreaks in recent years, causing 30-100%
[1]. Saccharum is a complex genus with a high yield loss in commercial sugarcane cultivars throughout
polyploidy rate and regular aneuploidy [2]. Sugarcane India [10].
is the world's largest crop, with more than 1800 mt Generally, this research aims to classify sugarcane
generated in 2017 on a surface of nearly 26 mt. diseases with the help of MATLAB Classification
Sugarcane is infected by a range of viruses all over the Learner App.
world, owing to its vegetative propagation [3].
Sugarcane is one of the top ten most commonly II. METHODOLOGY
planted crops on the planet. Globally, about one billion
tons are harvested per year [4]. Sugarcane diseases are A. Classification
either seed or soil-borne, making it nearly difficult to
contain the disease once it has spread through the field, The researcher used MATLAB Classification
except with agrochemicals. One or more preventive Learner App and it contains 28 machine learning
mechanisms may be used to reduce disease incidence. algorithms for classifying the samples. The App trains
For the control of sugarcane diseases, no single method the data and validates the classification in terms of
has proven to be reliable. The most successful method accuracy and training speed. The data were inputted to
for managing all diseases is integrated sugarcane disease the Classification Learner App using 5 folds, this means
management. Agronomic, cultural, chemical, and that 180 were used for training and the 36 were used for
biological control steps are all included [5]. testing and validation [11].
.
Sugarcane yields have been severely reduced in
*)
Correspondence author (Edwin R. Arboleda)
Email: [email protected]
@cvsu.edu.ph
B. Image Processing Prolonged
storage or
To make some predictions in the data in a large set exposure to
of information, the image processing technique will
high relative
study and construct algorithms. Hence Image Processing 1 REDROT humidity,
is processing the images with a different form of environmenta
mathematical functions and operations by some signal l pollution,
processing. It is usually referring to digital image and high
processing to acquits the images for producing the input temperature
images. The digital image used in this research is a two-
dimensional image that has values by pixels and Multicellular
pictures. So by this image processing, similar sets of fungi
plant leaf diseases are identified from various plants. WHIP characterized by
2
SMUT their large
III. RESULTS AND DISCUSSION numbers of
teliospores
A. Description of Dataset
Affect the
The datasets are collected from Kaggle [11]. The vascular
3 WILT
sugarcane data frame has 180 rows and 6 columns as system of
shown in Table 1. It represents a randomized block plants.
design with 45 varieties of sugarcane and 4 blocks as
shown in Table 3. Yellow bud
mosaic in
Table I. Dataset Description peaches,
No. of No. of yellow vein
Datasets
Attributes Instances 4 RINGSPOT in grapes,
Sugarcane and stunted
6 180 growth in
Datasets
gladiolus and
Narcissus.
Table II. Sugarcane Dataset Attributes
Sample No. Attributes ID Different
genotypes in
1 S S0
one
2 N S1
individual,
3 R S2
5 MOSAIC who has
4 X S3
developed
5 Var S4
from a single
6 Block S5
fertilized
egg.
Table III. Sugarcane Dataset Attributes The presence
Sample No. Attributes ID of pin head
1 S S0 like orange-
2 N S1 colored dots
RATOON
3 R S2 6 of bacteria
STUNT
4 X S3 on the
internal soft
tissue in the
Where: nodal region.
R – No. of diseased shoots
X – No. of pieces of the stem
Var – Indicates variety of sugarcane in each plot Table 5 shows the classification results. The
Block – Factor for the blocks classifier used are the 28 machine learning algorithms of
MATLAB Classification Learner App. There are 45
varieties of sugarcane in each block, and the chosen
Table IV. Image Processing [12] settings are five folds. Five Folds is the number of
samples per class; 45 divided by 5 is equal to 9. A total
Name of the
S. No Images Description of 45 samples per class was used as training samples,
disease
and nine samples were used as test samples.
Table V. Classification Results Figure 1 shows the scatter plot of the Medium
Accuracy Training Neural Network for the sugarcane disease. The violet
Classifier Type colors represent Block A, the yellow colors represent
(%) Time (S)
Fine Tree 98.3 5.6625 Block B, the red colors represent Block C, and the blue
Medium Tree 98.3 2.4572 colors represent Block D. It can be seen in Figure 1 that
Coarse Tree 98.3 2.0329 values of the four groups are separable in the viewpoint
Linear Discriminant 100 2.4833 of the Medium Neural Network.
Gaussian Naïve Bayes 93.9 2.028
Kernel Naïve Bayes 92.9 2.5349
Linear SVM 100 3.2829
Quadratic SVM 100 2.1459
Cubic SVM 100 2.0904
Fine Gaussian SVM 74.4 1.8858
Medium Gaussian SVM 94.4 1.8058
Coarse Gaussian SVM 99.4 1.5785
Fine KNN 80 3.9228
Medium KNN 71.1 2.8147
Coarse KNN 48.3 2.8183
Cosine KNN 58.9 3.0026
Cubic KNN 64.4 1.6503
Weighted KNN 82.8 4.4522
Boosted Trees 25 3.5517
Bagged Trees 98.3 4.793
Subspace Discriminant 86.7 2.8469
Subspace KNN 96.7 2.6402
RUS Boosted Trees 25 0.9279
Narrow Neural Network 98.3 1.4661
Medium Neural Network 100 0.85113
Wide Neural Network 100 2.8794 Figure 2. Scatter Plot Confusion Matrix of the Medium
Bi-layered Neural Neural Network
98.9 4.7855
Network
Tri-layered Neural It can be seen in Figure 2, the confusion matrix that
97.8 1.9022 the Medium Neural Network Classifier was able to
Network
correctly identify all of the blocks by 100%.