Professional Documents
Culture Documents
Final Presentation GROUP 6 2
Final Presentation GROUP 6 2
IMAGE PROCESSING
OUR TEAM
Number of Images:
• The PlantVillage Dataset contains over 50,000 images, making it a rich resource for
training and evaluating machine learning models.
Classes of Diseases Covered:
• The dataset covers a wide range of plant diseases across multiple plant species,
including but not limited to:
• Tomato: Early Blight, Late Blight, Bacterial Spot, Leaf Mold, Septoria Leaf Spot, etc.
• Potato: Early Blight, Late Blight
• Apple: Apple Scab, Cedar Apple Rust, Healthy, etc.
• Grape: Black Rot, Esca, Healthy, etc.
• And many more.
Preprocessing Steps Applied:
Prior to training our machine learning model, we performed several preprocessing steps on the dataset,
including:
• Image resizing to a standardized resolution to ensure uniformity.
• Normalization of pixel values to the range [0, 1] to facilitate convergence during model training.
• Splitting the dataset into training, validation, and test sets to assess the model's performance on unseen
data.
WORK FOR 499B
Image Data Preprocessing:
Part Two marks a crucial phase in the development of our disease detection system, where we transition from planning and data
collection to the technical execution of our project. This phase encompasses the following key tasks:
Data preprocessing, resizing, Noise Reduction and Normalization:
We will delve into the meticulous preparation of our dataset.
Data preprocessing is a foundational step that involves activities such as image resizing to a consistent format and normalization
to ensure uniformity in data distribution. These measures enhance the quality and suitability of our dataset for training and
evaluation.
Model exploration and optimization:
Model exploration entails fine-tuning and optimization of our models to adapt them to the specific demands of our crop disease
detection task. We will explore various techniques to improve model accuracy and efficiency .
Model Architecture
Layers:
1. Convolutional Layers
2. Pooling Layers
3. Dropout Layers
4. Fully Connected Layers
Activation Functions:
• The activation function used in the convolutional layers is the Rectified Linear Unit (ReLU),
which introduces non-linearity and helps the network learn complex patterns from the input
data.
• The final dense layer utilizes the softmax activation function, which converts the raw output
scores into probabilities, indicating the likelihood of each class.
• The CNN architecture leverages the hierarchical structure of the layers to automatically
learn discriminative features from input images.
• Initially, low-level features such as edges and textures are detected by the early
convolutional layers.
• As the network progresses through subsequent layers, higher-level features relevant to
disease classification, such as lesion patterns or discolorations, are extracted.
• The fully connected layers at the end of the network utilize these learned features to make
predictions, mapping them to the appropriate disease classes based on the training data.
Loading The Dataset:
The images cover 14 species of crops, including: apple, blueberry, cherry, grape,
orange, peach, pepper, potato, raspberry, soy, squash, strawberry and tomato. It
contains images of 17 basic diseases, 4 bacterial diseases, 2 diseases caused
by mold (oomycete), 2 viral diseases and 1 disease caused by a mite. 12 crop
species also have healthy leaf images that are not visibly affected by disease.
Data Preprocessing:
Data preprocessing is a crucial step in preparing the raw input data for training the Convolutional Neural Network (CNN)
model. This process involves several steps aimed at enhancing the quality and usability of the dataset for effective
model training. Standardizing the size of input images to a uniform resolution (e.g., 224x224 pixels) ensures
consistency across the dataset.
Build the model:
Our Convolutional Neural Network (CNN) model is meticulously designed to effectively classify plant diseases from
input images. The architecture comprises multiple layers, each meticulously crafted to extract hierarchical features and
enable accurate classification.
Model Training:
Model Evaluation:
Model Accuracy:
Final Testing:
Web App:
In addition to developing a Convolutional Neural Network (CNN) model for plant disease prediction, we have also
created a user-friendly web application using Streamlit. The web app serves as an intuitive interface for users to
interact with the model, enabling them to upload images of plant leaves and receive instant predictions regarding
potential diseases.
Key Features:
Upload Image:
• Users can easily upload images of plant leaves directly from their devices using the web app's user interface.
• Prediction Display:
• Upon uploading an image, the web app utilizes the trained CNN model to predict the likelihood of various plant
diseases affecting the uploaded leaf.
• Visualization:
• The web app provides visual feedback, including the uploaded image and the corresponding disease prediction,
enhancing user understanding and engagement.
• User-Friendly Interface:
• Streamlit's intuitive design allows for the creation of a user-friendly interface with minimal code, enabling seamless
navigation and interaction for users of all levels.
Transfer Learning: Using VGG 16
Transfer learning using VGG16 involves leveraging a pre-trained VGG16 model, which has been
trained on a large dataset (usually ImageNet) for image classification tasks. Instead of training a neural
network from scratch, transfer learning allows us to use the knowledge (learned features) gained by
the VGG16 model on a different but related task.
Transfer Learning: Using ResNet-50
Transfer learning using ResNet-50 involves leveraging a pre-trained ResNet-50 model, which has been trained on
a large dataset (typically ImageNet) for image classification tasks.