Download as pdf or txt
Download as pdf or txt
You are on page 1of 7

International Conference on Sustainable Computing in Science, Technology & Management (SUSCOM-2019)

Disorder Detection in Tomato Plant Using Deep Learning

Saiqa Khana, Meera Narvekarb, Anam Ayesha Shaikh c, Hera Ansarid, Nida Ansari e
a
Dept of Computer Engg,M.H Saboo Siddik College of Engineering Mumbai University Mumbai, India Saiqa.comp@gmail.com
b
Dept of Computer Engg D.J Sanghvi College of Engineering Mumbai University Mumbai, India Narvekar.meera@gmail.com
c
Dept of Computer Engg M.H Saboo Siddik College of Engineering Mumbai University Mumbai, India Anams403@gmail.com
d
Dept of Computer Engg M.H Saboo Siddik College of Engineering Mumbai University Mumbai, India Ansarihera82@gmail.com
e
Dept of Computer Engg M.H Saboo Siddik College of Engineering Mumbai University Mumbai, India Nidaansari013@gmail.com

ARTICLE INFO ABSTRACT

Article history: Agricultural productivity is something on which the Economy highly depends. Plant Diseases and Pests are
Received 16 January 19 a major provocation in the agriculture sector. This is one of the reasons that disease detection in plants plays
Received in revised form 30 January 19 an important role. Accurate and faster detection of diseases in plants could help to develop an early treatment
Accepted 24 February 19 technique while substantially reducing economic losses. This paper proposes a deep learning-based
approach that automates the process of classifying tomato leaves diseases. The proposed system focuses on
major diseases like early blight, powdery mildew and downy mildew that occur in tomato plants. We make
Keywords:
use of Convolution Neural Network to classify the image data sets based on the visible effects of diseases.
Plant Disorder
Unlike Image Processing, Deep Learning learns and adapts to the changing data sets. The proposed
Convolution Neural Network (CNN)
methodology will be using a diverse dataset that includes images from the nursery, plant village, and farm.
Deep Learning
To test the diverse dataset, we use Receiver Operating Characteristic that will improve the classifier
Deep Neural Network(DNN)
performance. Thus, the proposed system serves as a phone based tool that helps in detecting tomato plant
Receiver Operating Characteristics
diseases based on capturing and analyzing the picture of a plant leaf. In this paper, we take a first step
(ROC) towards such a tool. As a result, the proposed system serves as a decision support tool to farmers in
identifying diseases that occur in tomato plant leaf.

© 2019SUSCOM. Hosting by Elsevier SSRN. All rights reserved.


Peer review under responsibility of International Conference on Sustainable Computing in Science, Technology and Management.

1. Introduction

Plants are an integral aspect of human life. The agricultural land mass is more than just being a feeding source in today's world. Indian economy is highly
dependent on agriculture. Smart farming is important for addressing the challenges of agricultural production in terms of productivity, food security and
environmental impact, to address these challenges, the complex, multivariate and unpredictable agricultural ecosystems need to be better understood by
monitoring, measuring and continuously analyzing various physical aspects and phenomena. Therefore, in the field of agriculture, the detection of disease
in plants plays an important role. To detect a plant disease at its initial stage, use of automatic disease detection technique is propitious. Tomato is by far the
most consuming crop in the world. According to the census obtained from the Food and Agriculture Organization of the United Nations, approximately 170
kilotons of tomatoes were produced in the year 2014 around the world (Jacobs and C.P. Bean, G.T. Rado and H. Suhl, 1963). In 2017, the worldwide
production of tomatoes was approximately 170.8 million tons. China, being the leading producer of tomatoes worldwide, accounted for 31% of the total
production, while India, accounted for 18.7% of the total production (T.Brosnan and D. W. Sun,2004). Tomatoes get easily infected by the pests and diseases
that occur in the fields. The diseases and pests affects not only the fruit but also the other parts of the plant, that is the roots, the stems, and the leaves of the
tomato plants. Tomato is affected by various diseases such as Early Blight, Late Blight, Powdery Mildew and Downy Mildew.
Computer Vision and Machine Learning techniques have been applied to different disease detection such as tomatoes, potatoes, banana and grapes, etc.
Motivated by development in computer visions especially in Convolution Neural Network which manage to produce significant results in the field of Image
Classification (Kumar at al, 2018).. This paper proposes a new system based on CNN for tomato plant disease detection. In order to develop accurate image
classifiers for the purposes of plant disease diagnosis, we needed a large, verified dataset (Farm, Nursery and Plant Village) of images of diseased and
healthy plants. In this system, we report on the classification of 4 diseases using 54,306 images with a convolutional neural network approach (Kumar,
Sharma, Sharma & Poonia, 2018).
Deep learning is an advanced and modern technique for image processing and data analysis that generates promising results. Deep Learning and Neural
Networks combined together is called Deep Neural Network and it has been applied to many diverse fields. Neural networks provide a mapping between
an input (an image of a diseased plant)—to an output— (a crop ∼ disease pair). Neurons are basically mathematical functions that accepts numerical input

February 26 - 28, 2019 | Amity University Rajasthan, Jaipur, India Page 2154
International Conference on Sustainable Computing in Science, Technology & Management (SUSCOM-2019)

and provides numerical output. Deep neural networks are comprised of input layer, a number of processing layers and an output layer. The network is trained
by fine tuning the hyper plane parameters in a manner that the mapping is improved during the training process. The performance of the proposed model is
measured depending on the probability of classification using ROC curve.

Fig .1. Worldwide Production of Tomato Plant ((Jacobs and C.P. Bean, G.T. Rado and H. Suhl, 1963)).

Section II discuss various Tomato Leaf Disorders. Section discuss about the related study of the proposed system. Section IV briefs about the technology;
CNN, various materials and methods and implementation methodology of proposed system.

2. TOMATO DISORDERS

Generally, there are two factors that can affect tomato plant; living (biotic) and non-living(abiotic) agents. Different living agents includes insects, bacteria,
fungi, and viruses. Non-living agent. Includes various environment effects such as rapid temperature change, excess moisture, insufficient nutrients, poor
soil pH and high humidity conditions.

Fig. 2. Tomato Plant Disease.

The above figure (see Fig. 2) shows various diseases of tomato plant.
 Early Blight: The tomato leaves will have one or two spots per leaf, approximately ¼ to ½ inch in diameter. Spots have tan centers with concentric
rings in them and yellow halos around the edges.
 Powdery Mildew: Cause a white powdery Fungal growth on upper surface of tomato leaves, stems and sometimes fruit. The spots on the leaf are
approximately 1/6 to 1/8 inch in diameter.
 Downy Mildew: Appears as yellow to white patches on the upper surfaces of older leaves. On the undersides, these areas are covered with white to
greyish, cotton-like fungi.
 Late Blight: Spots that start out pale green, usually near the edges of tips of foliage and turns brown to purplish-black. In humid conditions, a fuzzy
mole appears on the underside of leaves.

February 26 - 28, 2019 | Amity University Rajasthan, Jaipur, India Page 2155
International Conference on Sustainable Computing in Science, Technology & Management (SUSCOM-2019)

3. RELATED WORK

Deep Learning Techniques has shown immense improvement in the field of agriculture which includes disease detection in plants since 2014 (Table 1). The
table shows study experiments of different plants and their species with promising results. Some of the studies cited uses deep learning Convolution Neural
Network architecture are Satyam Srivastava et al.(2014); Hiteshwari Sabrol1 & Satish Kumar (2016); Sharada Prassanna Mohanty et al.(2016);Amara et al.
(2017); Lu, Yi, Zeng et al.(2017); Alvaro Fuentes et al.(2017); Oppenheim & Shani, (2017); Liu,et al.(2018).The different parameters discussed in the table
below (Table 1) includes:
i. Deep Learning Architecture : The Convolution Neural Network Architectures includes special kind of multi-layered neural networks,
designed to identify visual patterns directly from pixel images from minimal pre-processing.
ii. Accuracy: The correct amount of classification achieved by a CNN model. There are two types of accuracy.
a. Training Accuracy: The accuracy of the model on the dataset it was constructed.
b. Testing Accuracy: The accuracy of the model on an instantaneous sample.

Table 1. Related Work using Deep Learning for Plant Disease Detection.

DL
Sr.NoYearPaper Title Plant Disease Architecture DL Algorithm Accuracy

1. 2014 A Novel Vision TomatoEarly blight, CNN Pattern Principle 92%


Sensing System for Late Blight, Recognition Components
Tomato Quality Bacterial Canker, Supervised Analysis(PCA)
Detection. And Gray Mold. Learning and Linear
Classification Discriminant
Analysis(LDA)
Quadratic
Classifiers (QDA)
2. 2016 Fuzzy and Neural TomatoBacterial leaf spot. Sugeno-Type Subtractive The
Network based Fungal septoria leafFuzzy. Clustering. Hybridclassification
Tomato Plant spot. Bacterial Adaptive Learning accuracy is best
Disease Canker. Fungal late Neuro-Fuzzy Algorithm. Multi- yielded with
Classification using blight. Tomato leaf Classification Layer Feed multi- layer
Natural Outdoor curl. Model. Forward Back feed forward
Images Supervised Propagation back
Learning Network. propagation
Model. classifier of
87.2%

3. 2016 Using Deep Tomato26 different CNN (AlexNet, - 99%


Learning for Image- Disease GoogLeNet)
Based Plant Disease

Detection
4. 2017 A Robust Deep- TomatoPowdery Mildew. Faster-R CNN VGG-16 80%(Mean
Learning-Based Canker. SSD ResNet-50 accuracy of

Detector for Real- Gray Mold. RFCN ResNet-101 entire system


Time Tomato Plant ResNet-152 performance)
Diseases and Pests ResNeXt-50

Recognition
5. 2017 A Deep Learning Banana Black Sigatoka. CNN (LeNet Steps: 92-99%
Based Approach on Banana Speckle. architecture) Convolution

February 26 - 28, 2019 | Amity University Rajasthan, Jaipur, India Page 2156
International Conference on Sustainable Computing in Science, Technology & Management (SUSCOM-2019)

Banana Leaf Pooling.


Diseases Classification.
Classification.
6. 2017 Deep Learning for TomatoTomato Leaf CNN (AlexNet, Pre-training. 99%
Tomato Disease: Diseases. GoogLeNet) Training. Disease
Classification and Classification.
Symptoms. Symptoms
Detection.
7. 2017 Identification of rice Rice 10 different rice CNN (AlexNet - 95%
diseases using deep disease. inspired)
convolutional neural
networks.
8. 2017 Potato disease Potato Black Dot disease. CNN (VGG) - 96%
classification using Common Scab.
convolution neural Black Scurf

networks.[9] Silver Surf.

4. METHODOLOGY

4.1. Data Acquisition and Pre-processing

The proposed system takes input from three different data sources (nursery, farm and plant village). The nursery consists of 8-10 tomato plants located at
M.H. Saboo Siddik College of Engineering, Mumbai with latitude 18.9685103 N, longitude 72.8288362 E, temperature 29 C and humidity 69%. The dataset
also consists of images of plants from a farm at Karjat and a plant village.
The images collected are pre-processed using Image Processing techniques like Noise Reduction, Regression and Image Processing Enhancement that
helps in reducing computation cost and time.

4.2. Feature extraction model

The integral part of CNN is feature extraction. Through feature extraction; extracting high-level features from the available input images is very important
to create effective neural network solutions. This model consists a series of convolution and pooling layers (Cruz, A., Luvisi, A., Bellis, L. D., & Ampatzidis,
Y. (2017)).

4.2.1. Convolution Map


The main elementary unit in CNN architecture is Convolution maps. The goal of Convolution maps is to extract features from the input image. Convolution
Map is a mathematical operation using two different set of information. In this system convolution is applied on input data (infected/healthy tomato leaf
image) using convolution filter/kernel to produce feature maps. This is done by sliding The filter matrix over the input image matrix. At each location of the
input matrix we perform element-wise matrix multiplication (i.e. depending upon the Strides). After the matrix multiplication we sum the results. This sum
then goes into the feature map. Here, a single convolution map layer is not enough to extract the features of the tomato leaf. Thus, we perform multiple
convolutions on the input image each using different filters each resulting in unique feature maps. All the feature maps are then stacked together and the
result obtained is the final output of the convolution layer(Dertat, Arden,2017).

4.2.2. Non Linearity


A powerful neural network is constructed using non-linearity. By using non-linearity the network achieve true potential. The results of the convolution layer
is passed through the activation function.ReLU activation function is used so that the values in the final feature maps are not just the sums but an activation
function applied to them. Here, we have used ReLU because the training time of ReLU are several times faster than conventional tanh (Dertat, Arden,2017).

4.2.3. Pooling

February 26 - 28, 2019 | Amity University Rajasthan, Jaipur, India Page 2157
International Conference on Sustainable Computing in Science, Technology & Management (SUSCOM-2019)

Pooling is the most important layer after CNN. Pooling downsamples the feature maps by keeping the important information. Pooling downsamples each
feature map independently. Downsampling of feature maps are done by reducing the dimensionality. This enables us to reduce number of parameters which
both shortens the training time and combats overfitting. This system uses max-pooling.
Max-pooling takes the maximum value from the feature map(depending upon the Strides) in the window (Dertat, Arden,2017).

4.2.4. Fully Connected Layer

To wrap up the CNN architecture after Convolution and Pooling layers we add fully connected layer. Significance of fully connected layer is that, CNN
does not know what a noise is it considers it as a feature. Fully-connected layer learns how to use the important features produced by convolution in order
to classify the images accurately(Dertat, Arden,2017).

4.2.5. Training
The CNN network is now trained using Back propagation technique using gradient descent method (Dertat, Arden,2017).

4.2.6. Dropout
It is by far the most popular regularization technique for the Deep Neural Networks. Dropout gives the neural network model a 2% accuracy boost, which
is a fair substantial gain at this level. Dropout layer is implemented in order to combat the overfitting problem to the training data (Dertat, Arden,2017).

4.3. Classification model

Here, the classification model is used to compute the score of the classes using Soft-max activation function. The fully-connected layers provides a full-
connection to all the feature maps issues from the previous layers (Mohanty, S. P., Hughes, D. P., & Salathe, M. (2016),( DeChant, C., Wiesner-Hanks, T.,
Chen, S., Stewart, E. L., Yosinski, J., Gore, M. A., et al. (2017)). Classification model provides accurate detection of tomato leaf diseases with minimum
computational effort. Thus, the proposed system can serve as decision support tool to farmers in order to identify the diseases in tomato plant.
As shown in in Fig.3 a CNN is composed of three main parts, which are convolution, pooling and fully connected layers. (Durmus, Halil, et al,2017) .
Currently,the proposed system is composed of five Convolution Layers and three Pooling Layers. The convolution and pooling layers’ act as feature
extractors from the input images while the fully connected layer wraps up the CNN architecture.Additionally,the dropout layer is to provide accuracy boost
to our network.The essential purpose of convolution is to extract features automatically from each input image. The pooling layer then reduces the
dimensionality of these features. (Pan, S. J., & Yang, Q. (2010)) The activation function that the proposed system uses is ReLU (Dertat, Arden,2017) and
Softmax . Softmax is a classifier at the end of the neural network. At the end of the model, the fully connected layer with an Softmax activation function
makes use of the learned high-level features to classify the input images into predefined classes that are Early Blight,Late Blight, Powdery Mildew and
Healthy.

Fig.3. Proposed System

February 26 - 28, 2019 | Amity University Rajasthan, Jaipur, India Page 2158
International Conference on Sustainable Computing in Science, Technology & Management (SUSCOM-2019)

5. RESULTS AND DISCUSSION

To validate the performance of the adopted CNN architecture; we had performed survey of different architectures. This survey was done on 5 different
models
(i) AlexNet (Krizhevsky et al., 2012)
(ii) AlexNetOWTBn (Krizhevsky, 2014)
(iii) GoogLeNet (Szegedy et al., 2015)
(iv) Overfeat (Sermanet et al., 2013)
(v) VGG (Simonyan and Zisserman, 2014)

with a training and testing image ratio of 80/20 i.e 80% of training image and 20% of the testing image.These models and their training and testing
process was implemented on Torch7 machine learning framework.The training algorithms were implemented on GPU of an NVIDIA GTX1080 card
(Ferentinos, Konstantinos P.2-18).

The figure (fig.4) shown below compares various models based on different parameters:
 Success rate: percentage of successful classification of plant (healthy and infected) classes.
 Average error: this is average loss per batch over all the batches of the training data.
 Training epoch: the best performances were achieved at this epoch.
 Time (seconds per epoch): time required to train each model.

Fig.4. Comparison of different CNN Models (Ferentinos, Konstantinos P.,2018)

Based on the comparison the two most successful models are AlexNetOWTBn and VGG. The highest success rate 99.48% was achieved by VGG
Model on the other hand the lowest average testing was achieved by AlexNetOWTBn. Furthermore, in figure (fig 5) when the comparison was performed
between only these two models, the final highest ssuccessful classification percentage was achieved by VGG (i.e. 99.53% with top-1 error rate of 0.
47%).As from comparing the entire scenario and depending upon the application of our system we choose VGG Model for tomato plant disease detection.
VGG is a CNN model from the researches at Oxford's Visual Geometry Group, abbreviated as VGG.VGG is the runner-up at ImageNet(ILSVRC) 2014
competition was developed by Karen Simonyan and Andrew Zisserman.VGG is considered to be one of the most influential model because of its principle
"Keep it simple. Keep it deep”. Simplicity of VGG; That it is 19 layer CNN model that strictly use 3x3 filters with pad and stride of 1, also 2x2 max-
pooling layers with stride 2.VGG also uses ReLU activation function after each convolution map layer and trains using gradient descent (Deshpande,
Adit). Since, the proposed system CNN model is being designed from the scratch we choose VGG depending upon its performance and its simplicity to
understand the model in deep. Based upon all the validation and specification, developing an application specific system inspired by VGG architecture
helps in better performance and precise results.

Fig. 5. Final Model Performance (Ferentinos, Konstantinos P.,2018)

February 26 - 28, 2019 | Amity University Rajasthan, Jaipur, India Page 2159
International Conference on Sustainable Computing in Science, Technology & Management (SUSCOM-2019)

6. CONCLUSION AND FUTURE WORK

Agriculture suffers from various problems, plant diseases contributing to the major factor. The diseases in plants reduce the production and quality of yield.
In addition, the underdeveloped countries have a devastating impact on their development and quality of life due to the shortage of diagnostics tools and
technologies. As a result, there is an urgent need to detect plant diseases at an early stage to improve the production. In this paper, we have proposed a
system that focusses on disorder detection of a tomato plant. The proposed system classifies tomato plant diseases such as early blight, powdery mildew and
downy mildew using Convolution Neural Network; CNN trains the system in order to classify the diseases of a tomato plant. Using CNN our system is able
to find good classification results, which have proven that the proposed method can significantly support the accurate detection of tomato leaf diseases with
minimum computational effort. Thus, the proposed system can serve as a decision support tool to farmers in order to identify the diseases in a tomato plant.
We intend in our future work to test more tomato plant and leaf diseases with our model. Besides this, we will also target the severity estimation of the
identified diseases, as it will be beneficial for the farmers in deciding how to intervene to stop the disease.

ACKNOWLEDGMENT

We would like to thank the Assistant Professor Saiqa Khan, Computer Department, M.H. Saboo Siddik College of Engineering, Mumbai for supporting this work.

REFERENCES

"Food and Agriculture Organization of the United Nations," [Online]. Available:http://www.fao.org/home/e/. [Accessed 2017].I.S. Jacobs and C.P. Bean, “Fine
particles, thin films and exchange anisotropy,” in Magnetism, vol. III, G.T. Rado and H. Suhl, Eds. New York: Academic, 1963, pp. 271-350.
Amara, J., Bouaziz, B., & Algergawy, A. (2017). A deep learningbased approach for banana leaf diseases classification. In Lecture notes in informatics (LNI) (pp.
79e88). Bonn, Germany: Gesellschaft fu¨ r Informatik.
Aydogdu, M. Fatih, et al. “Comparison of Three Different CNN Architectures for Age Classification.” 2017 IEEE 11th International Conference on Semantic
Computing (ICSC), 2017, doi:10.1109/icsc.2017.61.
Cruz, A., Luvisi, A., Bellis, L. D., & Ampatzidis, Y. (2017). X-FIDO: “An effective application for detecting olive quick decline syndrome with deep learning and
data fusion”, IEEE. Frontiers in Plant Science, 8, 1741.
DeChant, C., Wiesner-Hanks, T., Chen, S., Stewart, E. L., Yosinski, J., Gore, M. A., et al. (2017). Automated identification of northern leaf blight-infected maize
plants from field imagery using deep learning. Phytopathology, 107(11), 1426e1432.
Dertat, Arden. “Applied Deep Learning - Part 4: Convolutional Neural Networks.” Towards Data Science, Towards Data Science, 8 Nov. 2017,
towardsdatascience.com/applied-deep-learning-part-4-convolutional-neural-networks-584bc134c1e2.
Deshpande, Adit. “The 9 Deep Learning Papers You Need To Know
Durmus, Halil, et al. “Disease Detection on the Leaves of the Tomato Plants by Using Deep Learning.” 2017 6th International Conference on Agro-
Geoinformatics,2017,doi:10.1109/agro-geoinformatics.2017.8047016.
Ferentinos, Konstantinos P. “Deep Learning Models for Plant Disease Detection and Diagnosis.” Computers and Electronics in Agriculture, vol. 145, 2018, pp. 311–
318., doi:10.1016/j.compag.2018.01.009.
Kumar, S., Sharma, B., Sharma, V. K., & Poonia, R. C. (2018). Automated soil prediction using bag-of-features and chaotic spider monkey optimization algorithm.
Evolutionary Intelligence, 1-12.
Kumar, S., Sharma, B., Sharma, V. K., Sharma, H., & Bansal, J. C. (2018). Plant leaf disease identification using exponential spider monkey optimization. Sustainable
Computing: Informatics and Systems.
Mittal, S.(2017), “Can Horticulture be a Success Story for India?”, Indian Council for Research on International Economic Relations, Working Paper No. 197.
Mohanty, S. P., Hughes, D. P., & Salathe, M. (2016). Using deep learning for image-based plant disease detection. Frontiers in Plant Science, 7. article 1419.
Oppenheim, D., & Shani, G. (2017). Potato disease classification using convolution neural networks. Advances in Animal Biosciences: Precision Agriculture, 8(2),
244e249.
Pan, S. J., & Yang, Q. (2010). “A survey on transfer learning. IEEE Transactions on Knowledge and Data Engineering”, IEEE, 22(10), 1345e1359.
T.Brosnan and D. W. Sun, “Improving quality inspection of food products by computer vision—a review,” Journal of Food Engineering, vol. 61, no. 1, pp. 3–16,
2004.

February 26 - 28, 2019 | Amity University Rajasthan, Jaipur, India Page 2160

You might also like