cnn7
cnn7
Neurocomputing
journal homepage: www.elsevier.com/locate/neucom
a r t i c l e i n f o a b s t r a c t
Article history: The automatic identification and diagnosis of rice diseases are highly desired in the field of agricultural
Received 6 February 2017 information. Deep learning is a hot research topic in pattern recognition and machine learning at present,
Revised 27 May 2017
it can effectively solve these problems in vegetable pathology. In this study, we propose a novel rice dis-
Accepted 17 June 2017
eases identification method based on deep convolutional neural networks (CNNs) techniques. Using a
Available online 1 July 2017
dataset of 500 natural images of diseased and healthy rice leaves and stems captured from rice exper-
Communicated by Zidong Wang imental field, CNNs are trained to identify 10 common rice diseases. Under the 10-fold cross-validation
strategy, the proposed CNNs-based model achieves an accuracy of 95.48%. This accuracy is much higher
Keywords:
Identification of rice diseases than conventional machine learning model. The simulation results for the identification of rice diseases
Convolutional neural networks show the feasibility and effectiveness of the proposed method.
Deep learning © 2017 Elsevier B.V. All rights reserved.
Image recognition
1. Introduction diseases remotely. The disadvantage is that not only the judgement
is easy to be wrong, but also the efficiency is low.
Rice is one of the most important food crops in China and even In order to improve the accuracy and rapidity of the diagnosis
in the world. Rice diseases have a devastating effect on rice pro- results, many researchers have studied the automated rice diseases
duction. Also, it is a major threat to food security. Thus, the diag- diagnosis based on pattern recognition and machine learning. Such
nosis and identification of rice diseases play a very important role as, using pattern recognition techniques [25], support vector ma-
in ensuring the high yield, high quality and high efficiency of rice. chine [16], digital image processing techniques [2] and computer
The traditional method of detecting rice diseases requires lots of vision[1]. Meanwhile, these advanced techniques are not only ap-
experts’ experience and knowledge. With the development of com- plied to the diagnosis of rice diseases, but also to other crops, such
puter and internet technology, farmers can search the rice diseases as wheat [20], maize [34], cotton [29], tomato [5], etc.
images database or consult the plant pathologists to judge rice The past 10 years have witnessed successful applications of
deep convolutional neural networks (CNNs) in diverse fields in-
cluding image classification [6,21], video classification [18], traffic
R
This work was supported in part by the National Natural Science Foundation of sign recognition [17] and human action recognition [15], etc. The
China under grants 61374127 and 61422301, the Outstanding Youth Science Foun- study of CNNs and related research have therefore gained persis-
dation of Heilongjiang Province under grant JC2015016, the Natural Science Foun- tent research interest since the early 1990s, see [10,12] and the
dation of Heilongjiang Province under grant F201428, the Science and Technology
references therein. During this period, Hinton and co-workers have
Research of Agricultural Bureau in Heilongjiang Province under grant HNK125B-04-
03, China Postdoctoral Science Foundation under grant 2016M591560, Heilongjiang
done widely meaningful and fundamental research on deep neu-
Postdoctoral Financial Assistance under grant LBH-Z15185, Heilongjiang Bayi Agri- ral network to improve algorithm performance and optimize archi-
cultural University Foundation under grant XA2016-05 and Fujian Provincial Key tecture, see e.g. [3,23,24,27], and the references cited. A number
Laboratory of Information Processing and Intelligent Control (Minjiang University) of research have been made for CNNs to improve the original ar-
Opening Fund under grant MJUKF201729.
∗
chitecture of Krizhevsky et al. [21]. For example, Zeiler and Fergus
Corresponding author at: College of Information Technology, Heilongjiang Bayi
Agricultural University, Daqing, Heilongjiang 163319, PR China.
[33] used stochastic pooling for regulation of CNNs and Simonyan
E-mail address: [email protected] (Y. Lu). and Zisserman [31] proposed very deep CNNs to 16–19 layers,
http://dx.doi.org/10.1016/j.neucom.2017.06.023
0925-2312/© 2017 Elsevier B.V. All rights reserved.
Y. Lu et al. / Neurocomputing 267 (2017) 378–384 379
disease of brown spot and blast diseases by using color texture of Layer name Function Weight filter sizes Output tensor
rice leaf images [28]. Input 3 × 512 × 512
As we all know that the machine learning technology is essen- Conv1 Convolutional 16 × 16 3 × 224 × 224
tial in the intelligent diagnosis of crop diseases. However, there ex- Pool1 Stochastic pooling 5×5 32 × 112 × 112
ist some defects and difficulties in the above research, involving Conv2 Convolutional 9×9 64 × 112 × 112
Pool2 Stochastic pooling 5×5 64 × 56 × 56
the recognition rate is higher for specific samples under certain
Conv3 Convolutional 5×5 128 × 28 × 28
circumstances. In the diagnosis model, some parameters are not Pool3 Stochastic pooling 5×5 128 × 14 × 14
optimal, the convergence speed of training algorithm is slow, and Output Softmax regression 10 × 1 × 1
it is easy to fall into local minima, and so on.
So far, no research has been published which explores deep
convolutional neural networks for rice diseases identification. The
goal of this research is to construct deep convolutional networks from a reduced data set applying convolution and pooling. The
model to achieve fast and accurate automated recognition by us- stochastic pooling method is applied in the proposed CNNs model.
ing rice diseases images. The 10 common rice diseases include For it is simple enough to randomly select elements in feature map
rice blast (RB), rice false smut (RFS), rice brown spot (RBS), rice according to their probability values, that is, the elements with
bakanae disease (RBD), rice sheath blight (RSHB), rice sheath rot large probability are easy to be chosen. Unlike max-pooling, the
(RSR), rice bacterial leaf blight (RBLB), rice bacterial sheath rot maximum element always chosen, and mean-pooling, the average
(RBSR), rice seeding blight (RSEB) and rice bacterial wilt (RBW). value of the elements is chosen. Also, it strengthened generaliza-
The key motivation for developing the deep convolutional net- tion ability of the proposed CNNs model. Third, softmax regression
works model for rice disease is to provide the farmers an easy-to- learning algorithm can solve multi-classification problem. Finally,
use system to detect early-stage infections by using common digi- we can distinguish between 10 common different rice diseases.
tal camera. Second, extracting effective features for identifying rice The remainder of this paper is organized as follows. In
diseases is a critical but challenging task, and CNNs are highly ex- Section 2, we describe the architecture and learning algorithm of
pected to be automated feature learning from the raw inputs in CNNs. Section 3 introduces the rice disease identification method.
a systematic way. Through the deep architecture, the learned fea- The application of the developed CNNs to the problem of rice dis-
tures are deemed as the higher level abstract representation of low eases recognition is presented in Section 4. Conclusions are given
level raw rice disease images. In addition, to improve diagnostic in Section 5.
results, the CNNs is regarded as one of the best classifications in
pattern recognition tasks. So we develop the deep convolutional
neural networks model for rice diseases identification in this pa-
per. 2. Architecture of deep convolutional neural networks model
In this paper, we present a novel rice diseases identification
method based on deep convolutional neural networks. The pro- Inspired by classical and successful LeNet-5, AlexNet CNNs ar-
posed CNNs-based model can get higher classification ratio. On chitecture and its performance improved by Ciresan et al. [6],
the basis of an analysis of structure and parameters in CNNs, the Krizhevsky et al. [21] and LeCun et al. [23], we describe a kind
gradient-descent algorithm can be applied to train CNNs. A total of our multi-stage-CNNs configuration. The CNNs-based model in-
of 500 rice leaves and stem images are pre-processed first, then cludes convolution layer, stochastic pooling layer, and softmax
the processed images are used to train CNNs. This CNNs model layer. An illustration and related parameters are shown in Fig. 1
can improve the convergence speed while training the parame- and Table 1.
ters in CNNs, and obtain a higher recognition accuracy than the The size of the input image is set to be 224 × 224 × 3 according
conventional model. The main contribution of this paper is mainly to the experience, which can be divided by 2, and computing capa-
twofold. (1) Convolutional neural network is first applied to the bility of the computer. It is composed of many slices in the depth
issue of the identification of rice diseases. Note that the proposed direction. One slice corresponds to many neurons, the weight in
method can correctly and effectively recognize the 10 common rice neurons can be thought as the convolution kernel, that is a square
diseases. (2) Experiment results show that the CNNs method can filter, such as 16 × 16, 9 × 9 and 5 × 5. These neurons respectively
not only improve the convergence speed, but also obtain a higher correspond to a local area in the image, and it is used to extract
recognition accuracy than other models: (1) standard BP algorithm; the feature of the region. Assume that the size of the input im-
(2) support vector machine (SVM); (3) particle swarm optimization age is W, the size of the convolution kernel is F, the mobile stride
(PSO). of the convolution kernel is S, generally S = 2, padding P is used
The advantages of the proposed CNNs are that images can be to fill in the input image boundary, usually P = 0, so the size of
input directly to the model. First, sparse-auto encoding is used to the image after convolution is (W − F + 2P )/S + 1. Then we get the
learn the features from images. Second, we can classify images output tensor.
380 Y. Lu et al. / Neurocomputing 267 (2017) 378–384
The proposed CNNs are of hierarchical architecture, which con- Softmax regression is applied in multi-class classification prob-
tain three convolutional layers. The first convolutional layer is used lem. The hypothesis function takes the form:
to extract different low-level features from the input image, such 1
as edges, lines and corners. The other two can get high-level fea- hθ ( x ) = (5)
1 + exp(−θ T x )
tures.
Each output map feature combines multiple input maps with Our goal is to train θ to minimize the cost function J(θ ).
convolutions. Generally, the output can be denoted by the follow-
1 (i )
m l
ing formula: J (θ ) = − l {y = j}log p(y(i ) = j|x(i ) ; θ ) (6)
m
i=1 j=0
xj = f xi −1 ∗ ki j + bj (1) For training dataset { ( x ( 1 ) , y ( 1 ) ), . . . , ( x ( m ) , y ( m ) )}, yi ∈
i∈M j
{1, 2, . . . , k}. In Softmax regression, the probability of classify-
ing x as category j is:
where represents the th layer, kij represents convolutional ker- T (i )
eθ j x
nel, bj represents bias and Mj represents a set of input maps. The p ( y ( i ) = j | x ( i ) ; θ ) = k (7)
θlT x(i)
detailed implementation of the CNNs can use a sigmoid function l=1 e
or a tanh function, also an additive bias can be applied in it. For Supervised learning algorithm is used to train the network to
example, the value of unit at position (x, y) in the jth feature map learn. The internal representation reflects the similarity among
xy
and the ith layer, denoted as vi j , is given by training samples. We design 10 classifications for rice disease iden-
tification. In order to know the feature representation learned by
Pi −1 Q j −1
the deep concolutional neural network, the maximized activation
xy
v = sigmoid bi j +
ij
wipqj ((xi−1
v + p)(y+q )
) (2) neuron represents the recognized rice disease. At last, we visual-
p=0 q=0 ize the image features by averaging image patches that are related
with the neurons with stochastic response in an upper-layer.
where sigmoid(·) is the sigmoid function, bij is the bias for the fea-
pq
ture map, Pi and Qj are the height and width of the kernel, and wi j 2.4. Training algorithm
is the kernel weight value at the position (p, q) connected to the (i,
j) layer. The parameters of CNNs, such as the bias bij and the ker- Also, we use back gradient-descent algorithm to train CNNs. It
pq includes two stages of feedforward pass and back propagation pass.
nel weight wi j , are usually trained using unsupervised approaches
[4,23]. In feedforward pass stage, we consider a multiclass problem with c
classes and N training samples. The squared-error function is given
by
2.2. Stochastic pooling layer c 2
1 n
N
EN = tk − ynk (8)
2
We use a stochastic-pooling layer to substitute a sub-sampling n=1 k=1
layer in CNNs [23]. Stochastic-pooling layer is used to reduce vari- where tkn is the kth dimension of the nth pattern’s corresponding
ance, it can compute the max value of a particular feature over a label, and ynk is the value of the kth output layer unit in response
region of the image. Suppose the image features have small trans- to the nth input pattern.
lations, it will ensure that we can obtain the same result. Stochas- Let represent the current layer, L denote the output layer and
tic pooling not only combines the advantages of meaning-pooling the input layer be denoted by l. The output of the current layer is
and max-pooling but also prevents over-fitting. It plays a very im- defined to be
portant role in rice diseases classification and detection.
In stochastic pooling, first, we need to compute the probabilities x = sigmoid W x−l + b (9)
p for each region j according to the Eq. (3)
where W represents weight value matrix and b represents bias.
αi In back propagation pass stage, from higher layers to lower lay-
pi = (3)
k∈R j αk ers, the back propagated error can use the following recurrence
function:
where Rj is pooling region j in feature map c, and i is the index of T
each element within it. δ = W +1 δ +1 f W x−l + b (10)
Then we sample from the multinomial distribution based on p
to pick a location l within the region. The stochastic pooling oper- where “ ” represents elements multiplication. Finally, we will up-
ation can be denoted by Stochastic(s), for each feature map c, the date the weight according to the following formulas:
Stochastic(s) is formulated by: ∂E
= x−1 (δ )T (11)
∂W
p,q = Stochastic(m,n,i, j )∈ p αm,n u (i, j )
α l,k l−1,k
(4)
∂E
W = −η (12)
∂W
where α l,k
p,q is the activation of the neuron at spatial coordinate
(p, q) in feature map k in lt h layer, u(i, j) is the weighing window 3. Rice disease identification method
function.
The advantages of this implementation are that stochastic pool- 3.1. Rice diseases images data acquisition and processing software
ing layer not only makes the CNNs have faster convergence, but
also improves generalization ability when processing superior in- Rice diseases images database is created, which consists of a
variant features. total of 500 rice diseases images. Some images of rice diseases
Y. Lu et al. / Neurocomputing 267 (2017) 378–384 381
Color
images
training
sample PCA &
Whitening Training
Color
Scale Random images test Feature
Rice The dataset of
Normalization sampling samples Map
Disease Normalized
Images Rice Disease
Mean Gray PCA &
Database Images
Normalization images Whitening
Test
training Feature
sample Map
[29] Z. Shicha, M. Hanping, H. Bo, Z. Yancheng, Morphological feature extraction for [33] M. Zeiler, R. Fergus, Stochastic pooling for regularization of deep convolutional
cotton disease recognition by machine vision, Microcomput. Inf. 23 (4) (2007) neural networks, arXiv preprint (2013). arXiv:1301.3557.
290–292. [34] L.N. Zhang, B. Yang, Research on recognition of maize disease based on mo-
[30] D. Silver, A. Huang, C.J. Maddison, A. Guez, L. Sifre, et al., Mastering the game bile internet and support vector machine technique, Trans. Tech. Publ. 108 (13)
of go with deep neural networks and tree search, Nature 529 (7587) (2016) (2014) 659–662.
484–489. [35] X. Zhang, Y. Lu, S. Zhang, Multi-task learning for food identification and anal-
[31] K. Simonyan, A. Zisserman, Very deep convolutional networks for large-scale ysis with deep convolutional neural networks, J. Comput. Sci. Technol. 31 (3)
image recognition, arXiv preprint (2014). arXiv:1409.1556. (2016) 489–500.
[32] F.Y. Wang, Lets go: from AlphaGo to parallel intelligence, Sci. Technol. Rev. 34 [36] W.S. Zhao, Y.L. Peng, Rice Diseases and Insect Pests Atlas, China Agriculture
(7) (2016) 72–74. Press, Beijing, China, 2012.
384 Y. Lu et al. / Neurocomputing 267 (2017) 378–384
Yang Lu received his M.S. and Ph.D. degrees from North- Yurong Liu received his B.Sc. degree in Mathematics from
east Petroleum University in Computer Application in Suzhou University, Suzhou, China, in 1986, the M.Sc. de-
2005 and Oil and Gas Engineering in 2013, respectively. gree in Applied Mathematics from Nanjing University of
Now he is an associate professor at Heilongjiang Bayi Science and Technology, Nanjing, China, in 1989, and the
Agricultural University. His research interests include ma- Ph.D. degree in Applied Mathematics from Suzhou Uni-
chine learning and pattern recognition, computer vision versity, Suzhou, China, in 20 0 0. Dr. Liu is currently a pro-
and neural network. fessor in the Department of Mathematics at Yangzhou
University, China. He has published more than 50 pa-
pers in refereed international journals. His current in-
terests include neural networks, complex networks, non-
linear dynamics, time-delay systems, multiagent systems,
and chaotic dynamics.
Shujuan Yi received her M.S. degree in Heilongjiang Bayi
Yong Zhang received his M.S. degree in Signal and Sys-
Agricultural University in 1990 and the Ph.D. degree in
tem Processing from Harbin Engineering University in
Agricultural Mechanization and Automation in Northeast
2011. Now he is an associate professor at Northeast
Agricultural University in 2008. Now she is a professor
Petroleum University and he is currently pursuing the
at Heilongjiang Bayi Agricultural University. Her research
Ph.D. degree in Chemical Engineering and Technology in
interests include control of complex systems, automation
Northeast Petroleum University. His research interests in-
and agricultural mechanization.
clude deep learning, neural networks and intelligent in-
strument.