An Identification and Detection Process For Wheat Using ML
An Identification and Detection Process For Wheat Using ML
Communication
Biosc.Biotech.Res.Comm Vol 12 (4) 1081-1091 (2019)
ABSTRACT
In India, the economic, political, and social stability depends directly as well as indirectly on agricultural
productivity. The agricultural product quality and productivity are affected by the different diseases in plants.
Therefore, diseases detection in plants is very important in the agriculture field. Leaf disease detection by
using different machine learning techniques is a very popular field of study. There have already been many
promising outcomes but only a few real-life applications that can make life simpler for the farmer. Machine
learning technique becomes the most accurate and precise paradigm for the detection of plant’s disease which is
helpful to reduce an oversized work of watching in huge farms of crops, and an early stage itself it detects the
symptoms of diseases on plant leaves. In this paper, we use a novel approach for detection and identification
of leaf diseases by K-means clustering, multi-class SVM, and advanced neural network techniques to process
data, on plant disease detection. Gray Level Co-occurrence Matrix (GLCM) is used for feature extraction. The
disease classification is done using SVM classifier, and the detection accuracy is improved by optimizing the
data using the Advanced Neural Network (NN). The extracted features are fit into the network. This is a great
success, demonstrating the feasibility of this approach in plant disease diagnosis and high crop yielding. The
aim of our research is solving the problem of detection and preventing diseases of crops. We also demonstrate
the model for leaf diseases detection system for future trends of hyperspectral data.
KEY WORDS: ADVANCED NEURAL NETWORK, GRAY LEVEL CO-OCCURRENCE MATRIX, HYPERSPECTRAL DATA,
K-MEANS CLUSTERING, SUPPORT VECTOR MACHINE
ARTICLE INFORMATION
*Corresponding Author: [email protected]
Received 05th Oct 2019
Accepted after revision 30th Nov 2019
Print ISSN: 0974-6455 Online ISSN: 2321-4007
CODEN: BBRCBA
Chronologically, the primary goal of farming Here we take some of the articles concerning
is to provide the country with food and feed the identification of plant leaf illnesses using
it. Plant diseases not only pose a global threat different sophisticated methods and some of them
to food security but can also have disastrous shown below. (Fujita et al.,2016) recommended
consequences for smallholder farmers who the CNN classifier for cucumber diseases. It
depend on healthy crops for their livelihoods. uses two datasets for training and validation.
There are more than 80 percent crop production In addition to the healthy class, these datasets
generated by small farmers in the developing comprise seven distinct kinds of disease. The first
countries (UNEP, 2013), and accounts of yield dataset is made up of 7320 targeted images of
losses of more than 50 percent are prevalent captured leaves under excellent circumstances.
owing to pests and diseases (Abebe et.al.,2017). The second dataset comprises of 7520 images
Also, the biggest proportion of hungry individuals captured that are healthy and unhealthy, Lu
(50%) live in smallholder farming homes (Sanchez et al.(2017) defined an automatic diagnostic
et.al, 2005), making smallholder farmers a group system for wheat disease based on a supervised
especially susceptible to pathogen-derived food deep learning framework that integrates wheat
supply disruptions. So, these leaf diseases in disease identification and disease region. In
any forms in crop tend to cause a reduction in addition, a fresh wheat disease image dataset,
quality, yield, and fiscal progression, respectively. Wheat Disease Database 2017 (WDD, 2017), is
Therefore, looking to the current farming arena, being gathered to confirm the system's efficiency.
instead of watching the crop through the naked Kamilaris et al.(2018) discussed and studied 40
eyes by a designated specialist where it does research papers using deep learning methods
need a lot of efforts to implicate. Hence, as a that addressed multiple problems in agricultural
result, the automation essentiality of leaf disease and food production. Also, examined the specific
identification and its management has turned the agricultural problems with the help of the specific
BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS An Identification and Detection Process for Leaves 1082
Farooqui & Ritika
models and frameworks. Ferentinos et al. (2018) management, (Bannerjee et al.,2018). There are
explained convolutional neural network models various methods had been already developed to
to detect and diagnose crop disease using plain prevent the losses of the leaf disease. However,
leaf images of healthy and diseased crops using for many farmers, these methods are unavailable
deep learning methods. The training model was and require thorough knowledge of the domain or
carried out using an open database of 87,848 a lot of money and resources to be implemented.
images, consisting of 25 distinct crops in a In this field of research, there are several existing
set of 58 different classes, including healthy limitations. One of them is that currently available
plants. Several model architectures were trained data sets do not include images from real-life
with the greatest outcomes reaching 99.53 situations that have been compiled and labelled.
percent achievement rate when defining the Another limitation is that the existing methods
corresponding combination. Kulkarni et al.(2012) accuracy is not good for the multiple disease in
described a methodology for early and accurate same plant, (Arsenovic et al.,2019). Therefore,
detection of plant diseases using the artificial in this paper our agenda is to resolve the above
neural network (ANN) and various techniques for limitations by proposing the architectural model
image processing. Since the suggested method is for disease detection in wheat leaf by combination
based on the classification ANN classifier and the of two methods and compare their accuracy.
extraction function Gabor filter, it delivers better
outcomes with a recognition rate of up to 91%. Leaf Disease Learning Model From Image: We are
proposing a Leaf Disease Learning Model from
In Malus domestica Bashir et al (2012) provided Image data, processing begins in this scheme
disease detection using an efficient technique with data collection in the form of the knowledge
such as K-mean clustering, texture, and color base, then training the data, which is used to
examination. It utilizes the texture and color apply the different machine learning algorithm.
characteristics that usually appear in ordinary Feature extractor steps to be permitted through
and impacted fields to classify and acknowledge pre-processing and then identify the illnesses by
distinct agriculture. Kaundal et al.,(2006) implementing the various algorithms. Finally, it is
contrasted the results of standard multiple possible to classify the defective region from the
regression, neural network, and support vector images. Fig.1 shows our proposed architecture.
machine (SVM). It was discovered that the
SVM-based regression approach gives better material and methods
result. The best description is the association of
environmental circumstances with the amount The proposed methodology for disease
of disease that could be helpful in disease identification is explained in the following
1083 An Identification and Detection Process for Leaves BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS
Farooqui & Ritika
BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS An Identification and Detection Process for Leaves 1084
Farooqui & Ritika
include designing is a manual part of the machine relatives can be estimated in Table1. The graphical
learning technique, and it takes more time (Le Cun. representation of the extracted feature is shown
et al., 2015). On the other hand, in an advanced in Fig 4.
machine learning technique feature extraction
is embedded with the learning algorithm where Training: In this step, train the image of the wheat
features are extracted in a fully automated way leaf by using K-means Clustering, Support Vector
without any intervention by a human expert. Machine, and advance neural network for making
Feature extraction plays a vital role in the process an image classification model.
of facsimile cataloging. Hence, GLCM could be
an effectual and right resourceful technique for Testing: The test set for leaf prediction as healthy/
statistical parameter extraction based on texture unhealthy with its disease name will be used in
mining (Gebejes et. al, 2013). this stage to assess the classifier's output.
The image features include Correlation, Entropy, Approaches K-Means Based Clustering Method:
Variance, Homogeneity, Contrast, Energy, and This stage involves the use of the K-Means
Mean are computed. The resulted topographies algorithm to cluster an image. Using a group of
using the monochrome concentration and K-classes, it is a very helpful technique for entity
positioning can be indicated by the association recognition (Dubey et al., 2013). It can be done by
of active dotted particles whose numerical limiting the square summation distance between
S. Contr Corre Energy Homog Mean Standard Ent RMS Variance Kurtosis Skew
No. ast lation eneity Deviation ropy ness
1 0.520 0.82726 0.81 0.956992 11.42 42.14 1.088 3.194 1523.1 17.921 1.8645
2 0.453 0.88268 0.741 0.942985 18.62 51.96 1.6572 4.679 2206.1 10.582 2.3426
3 1.195 0.76266 0.57 0.931926 25.981 60.49 2.1984 7.044 3429.7 7.2729 1.8866
4 0.6984 0.91896 0.602 0.955868 35.465 71.82 2.391 7.347 4946.8 4.3601 2.3403
5 1.2426 0.76266 0.78 0.939879 12.649 46.03 1.0954 3.338 1823.7 16.636 3.0963
6 1.2304 0.91896 0.376 0.932311 36.484 61.36 3.7752 9.044 3009.9 5.777 2.3511
7 0.7108 0.68088 0.655 0.929282 24.433 58.77 2.0447 4.267 2085.4 7.2405 3.2904
8 0.9014 0.80841 0.576 0.927381 35.799 71.94 3.6071 7.978 3948 5.0127 3.1041
9 1.1222 0.87418 0.698 0.943924 28.194 69.98 2.4453 6.015 3597.8 6.7964 3.0347
10 1.361 0.90097 0.724 0.935854 18.952 55.79 1.9201 5.197 2359.1 11.386 4.1762
11 0.7258 0.8651 0.614 0.925867 23.879 56 2.135 5.346 2258.1 7.4625 3.7641
12 0.4065 0.73879 0.75 0.94786 17.785 56.2 1.5335 4.562 2827.8 12.644 0.8359
13 1.221 0.8602 0.734 0.923988 15.923 48.57 1.4202 4.269 2039.3 11.61158 1.6122
14 0.7435 0.91903 0.757 0.938948 18.945 56.55 1.8782 4.812 2798.3 10.949 2.836
15 0.2743 0.70682 0.823 0.976112 13.225 44.84 2.1005 5.159 1703.5 20.069 1.3342
16 0.2681 0.85746 0.786 0.962688 14.843 47.85 2.0682 4.844 1490.1 16.601 1.3215
17 0.6847 0.91649 0.378 0.906731 59.88 78.53 4.5886 9.207 2859.6 2.1288 4.221
18 1.1217 0.92901 0.48 0.896516 40.152 73.24 3.153 7.711 4661.7 4.0614 3.0734
19 0.5468 0.92833 0.723 0.947347 17.922 49.77 1.6294 4.534 2051.4 10.111 0.4327
20 1.8418 0.87307 0.478 0.883095 45.042 77.61 2.9739 7.068 4540.2 3.109 0.745
21 1.5751 0.84231 0.843 0.903862 47.808 80.98 3.1664 7.936 5354.2 3.0829 1.8645
22 0.8019 0.81905 0.709 0.96402 12.901 46.13 2.0809 5.212 2015.2 19.969 2.3426
23 0.328 0.86446 0.237 0.944129 16.123 44.55 1.6311 2.662 352.88 12.241 1.8866
24 1.4987 0.76745 0.27 0.877284 77.399 84.8 4.79162711.2647 6629.51 1.5772 2.3406
25 1.346 0.86944 0.458 0.912759 62.05059 74.62 4.5629 10.71 4388.6 2.1224 3.0963
26 0.5789 0.87351 0.789 0.97456 68.999 66.78 4.3245 6.745 345.89 2.0789 2.0098
27 1.673 0.86979 0.399 0.982114 38.987 56.93 1.89789 3.679 661.78 9.0789 1.789
28 0.456 0.93412 0.897 0.911239 47.345 33.78 3.4567 5.8769 2897.9 13.1345 0.3344
29 1.876 0.8569 0.769 0.901987 59.8796 77.84 2.9897 10.78 2345.67 2.45908 5.789
30 1.798 0.789 0.749 0.80796 53.5346 79.1 1.79854 4.8999 3987.9 10.9089 1.0459
1085 An Identification and Detection Process for Leaves BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS
Farooqui & Ritika
the equivalent cluster and an entity to find the This process has already been implemented in
thought-provoking fragment of the source image. other leaf images to cluster more than one groups
In K-Means clustering techniques, clusters are having the respective diseases (Sethy et al., 2017).
determined by pixel groups with the same value After the successful implementation K-Means
current in an image. Practically, this new image cluster-based segmentation, the percentage of
processing technique's computational acceleration the infected area calculated, and features are
is very quick and provides a more precise output. extracted.
The input data set is divided into K set of clusters,
and each cluster is represented by a completely Classification of Diseases by SVM: In this phase,
variable cluster center. Initially, known values are the classification and comparison for the leaf
referred to as seed points and data points are also diseases of the wheat crop have done through by
known as inputs. Only the K-Means clustering storing the corresponding feature set of values to
method can be used to estimate the distances their respective dataset. Here a fully controlled
between the centers, inputs, and allocate inputs learning method of classification is used that is
to the closest center. Following are the steps for none other than support vector machines (SVM)
K-means grouping: with high dimension spaces, efficient memory,
and versatile decision function. Generally, SVM
Step 1: Image confirmation is categorized into two types: Linear SVM and
Step 2: Image transformation from primary color Multiclass SVM. Linear SVM is used to classify
space to L*a*b* Color Space, which helps in the two kinds of data set, and multiclass SVM is used
findings of pictorial transformations that present to classify more than two kind of data set. So
in the primary color space. multiclass SVM is used to classify four types of
wheat leaf diseases. First, the extracted feature
Step 3: Here the colors classification can be done dataset is optimized using the optimization
in 'a*b*' Space using K-means. technique, and then multiclass SVM is used
for the classification process. The quantity of
Step 4: From the results, pixel ordering, and cataloging is performed by the proportion of
labeling can be render using K-means by high-scale classification gain, Equation (1).
maintaining a return index to the corresponding
each cluster.
BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS An Identification and Detection Process for Leaves 1086
Farooqui & Ritika
techniques. First, they are extremely dependent of humans. ANNs are very helpful for pattern
on variables patterns as well as characteristics recognition, irrespective of any specific guidelines
to be extracted. Second, it is necessary to for recognition. Cui et al. (2018) stated that ANNs
train classifiers many times before applying to need fewer formal statistics and are capable of
applications in the real world (Zhang et.al., 2015) modeling complicated nonlinear situations. There
ANNs (Advanced Neural Networks) are the most is an increasing interest in applying ANNs to
promising tools for analyzing hyperspectral data. achieve the greater goal of disease detection in
ANNs ' mechanism is based on the nervous system wheat crop using hyperspectral data. ANNs have
Figure 7. (A) Apply as an Input Image (B) Black Figure 8. (A) Apply as an Input Image (B) Brown
Chaff Disease Detected Rust Disease Detected
1087 An Identification and Detection Process for Leaves BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS
Farooqui & Ritika
a higher degree of diagnosing ability than Neural proposed framework for the advanced neural
Networks (NNs) (Ettabaa KS, et. al.,2018). As a network represented in Fig. 6. This framework
non-invasive rapid detection method, (Zhu et helps to detect leaf diseases accurately and
al.,2017) explored the potential of hyperspectral automatically.
imaging. The Advanced Neural Networks support
the strongest ability to discriminate against plant results and discussion
diseases because they combine the finest sets
of trainers for precise classification. It can be There is various type of diseased wheat leaf
used to detect four diseases, namely black chaff, samples is taken as input. In this paper, four types
brown rust, powdery mildew, yellow rust. The of wheat leaf diseases are considered i.e. Brown
Figure 9. (A) Apply as an Input Image (B) Figure 10. (A) Apply as an Input Image (B)
Powdery Mildew Disease Detected Yellow Rust Disease Detected
Leaf Black Chaff Brown Rust Powdery Mildew Yellow Rust Accuracy
Disease Disease Disease Disease Disease (%)
BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS An Identification and Detection Process for Leaves 1088
Farooqui & Ritika
1089 An Identification and Detection Process for Leaves BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS
Farooqui & Ritika
Rust, Black Chaff, Powdery Mildew, and Yellow systems and gathered from different resources. Our
Rust. Fig. 7 shows the input image and output experimental findings and comparisons between
image classified as wheat Black Chaff disease. distinct techniques with feature extractors
Fig. 8 shows the input image and output image showed how our sophisticated machine-learning-
classified as Brown Rust disease. Fig. 9 shows based model could effectively identify distinct
the input image and output image classified as categories of diseases in distinct crops as well
Powdery Mildew disease. Fig. 10 shows the input as provide solutions for diseases. The proposed
image and output image classified as Yellow Rust methodology was successfully tested and verified
disease. By the help of this proposed method, from the execution point of view on various types
the total of 30 sets of wheat crop leaf disease of wheat leaf diseases such as black chaff, brown
specimens is taken and categorized into four rust, powdery mildew, yellow rust. Moreover, it
broad categories of diseases which has displayed was seen that the best result could be obtained
in Table 2 and Fig. 11. Among 30 number of test resourcefully using least methodical pains to
samples of Brown Rust Diseased leaf and Yellow check the productivity of scheduled techniques.
Rust Diseased are erroneously classified as Brown Another perspective of employing this method
and Yellow Spots on the leaf, which imply 93.33% is that the plant disease can be documented
and 90.00 % of accuracy respectively. And the at the beginning or primary stage only. SVM,
other two categories of diseased leaf i.e. Black therefore, proved to be the promising method
Chaff Disease and Powdery Mildew disease are for differentiating and categorizing wheat leaf
successfully classified with 100% of accuracy. diseases with other methods by using extracted
The average accuracy of the classification of feature vector dataset. Collecting a larger number
the proposed method is 95.83%. In this paper, of samples with more variation of the diseased
classification is first done using the K-Nearest wheat leaf can extend the study. We hope that our
Neighborhood (KNN) using K-Mean’s grouping suggested scheme will contribute to agricultural
with a productivity accuracy of 77.90%. research in a provocative manner.
BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS An Identification and Detection Process for Leaves 1090
Farooqui & Ritika
1091 An Identification and Detection Process for Leaves BIOSCIENCE BIOTECHNOLOGY RESEARCH COMMUNICATIONS