Plant Leaf Recognition
● UCI’s 100 Leaf Dataset: Sixteen samples of each leaf from one hundred plant species, for a
total of 1,600 samples across 100 classes.
● Swedish Leaf Dataset: Contains pictures of 15 species of leaves, with 75 images per species.
Images are clean, with a white background and very little variation in luminance.
● Flavia Leaf Dataset: Consists of clean images only, each showing a well-aligned leaf on a
contrasting background with little or no variation in luminance or colour. Around 33
species, each with around 60 samples for training.
● ImageCLEF Dataset: Noisy dataset with considerable variation in lighting conditions,
viewpoints, background clutter and even occlusions. The dataset can be further split into two
subsets: uniform (~10,800 images in total over 66 species), taken in a more controlled
environment, and natural (~3,100 images in total over 57 species), taken in a natural
environment.
Approach and Classifier Advantages Disadvantages
This model is based on the architecture shown alongside. It performs well on the
Swedish Leaf Dataset, but we cannot conclude the same for the other datasets.
A deeper ConvNet with more aggressive filters is needed to extract features more
efficiently and to deal with the noise and variation in the other datasets.
Analysing Results
● 20% of the images in the dataset
are held out for validation.
● The model reaches a validation
accuracy of 96.8% by the end of
training.
● The graphs are not completely
monotonic, which is most likely
due to overfitting.
● However, both graphs do recover, and
the validation loss approaches its
lowest value.
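The 20% hold-out described above can be sketched as follows. This is a minimal illustration, not the code used for the reported results: the function name, the fixed seed, and the placeholder file names are assumptions, and the count of 1,125 images is simply the Swedish Leaf Dataset total (15 species × 75 images).

```python
import random


def split_train_validation(samples, validation_fraction=0.2, seed=0):
    """Shuffle the samples and hold out a fraction of them for validation."""
    if not 0.0 < validation_fraction < 1.0:
        raise ValueError("validation_fraction must be strictly between 0 and 1")
    shuffled = list(samples)                    # copy so the input is untouched
    random.Random(seed).shuffle(shuffled)       # seeded for a reproducible split
    n_val = round(len(shuffled) * validation_fraction)
    return shuffled[n_val:], shuffled[:n_val]   # (training set, validation set)


# Hypothetical file names standing in for the real leaf images.
image_paths = [f"leaf_{i:04d}.jpg" for i in range(15 * 75)]
train_paths, val_paths = split_train_validation(image_paths)
print(len(train_paths), len(val_paths))
```

Shuffling before splitting keeps the validation set from being dominated by whichever species happen to sit at the end of the file list; a per-species (stratified) split would be a natural refinement.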
Shortcomings
We were able to propose an automatic plant species identification approach that
employs machine learning techniques to classify plant leaf images. However, this
model has its own shortcomings:
● Trained on a larger dataset that was created from a
smaller one.
● Fails on images containing multiple leaves, since the model
is trained only on clean single-leaf images.
● Images with a cluttered background are classified
inaccurately by the proposed model, although applying a
customized mask does help improve the predictions.
● However, this mask works by picking out the green
elements of the image, so it does not help when the
background is green or when other leaves are present in the
background.
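A green-based mask of the kind described above might look like the sketch below. The actual mask is not shown in these slides, so the rule used here (keep a pixel when its green channel exceeds red and blue by a margin, otherwise paint it white) and the names `is_green`, `apply_green_mask` and `margin` are assumptions made for illustration. Images are plain nested lists of (R, G, B) tuples to keep the example dependency-free; a real pipeline would normally use an image library.

```python
def is_green(pixel, margin=20):
    """Return True when the green channel clearly dominates red and blue."""
    r, g, b = pixel
    return g > r + margin and g > b + margin


def apply_green_mask(image, background=(255, 255, 255), margin=20):
    """Keep green-dominant pixels and replace all others with the background colour."""
    return [
        [px if is_green(px, margin) else background for px in row]
        for row in image
    ]


# A 2x2 toy image: two leaf-like pixels, one brown pixel, one grey pixel.
tiny = [
    [(30, 160, 40), (120, 80, 60)],
    [(200, 200, 200), (60, 140, 50)],
]
masked = apply_green_mask(tiny)
```

The limitation noted above is visible in the rule itself: a pixel from a green background or from a neighbouring leaf passes `is_green` just as the target leaf does, so the mask cannot separate them.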
Working Model Screenshots
Final Product
Future Scope
● Acquire more data and fine-tune
the ConvNets to combat
overfitting.
● Apply advanced techniques for
image augmentation.
● Explore state-of-the-art methods to
detect and separate leaves from the
background. This should allow the
ConvNet to focus on the essential
features.
● Expand the dataset and improve the
model to improve predictions on
noisy images as well.
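As a starting point for the image augmentation mentioned above, the sketch below generates flipped and rotated variants of an image. These are basic geometric transforms rather than the advanced techniques the list calls for, and the function names are hypothetical. Images are represented as nested lists of pixel values so that the example runs without extra libraries.

```python
def flip_horizontal(image):
    """Mirror the image left to right."""
    return [row[::-1] for row in image]


def flip_vertical(image):
    """Mirror the image top to bottom."""
    return [list(row) for row in image[::-1]]


def rotate_90_clockwise(image):
    """Rotate the image by 90 degrees clockwise."""
    return [list(column) for column in zip(*image[::-1])]


def augment(image):
    """Return the original image followed by three transformed copies."""
    return [
        image,
        flip_horizontal(image),
        flip_vertical(image),
        rotate_90_clockwise(image),
    ]


# A 2x2 toy image with one value per pixel.
variants = augment([[1, 2], [3, 4]])
```

Each source image yields four training samples here, which is one simple way a small dataset can be expanded into a larger one.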