Professional Documents
Culture Documents
Computer_Vision
Computer_Vision
Abstract—This research conducted comparing hybrid segmen- images of blood cells. This method not only improves the
tation using K-Nearest Neighbor with Nick Thresholding applied classification effectiveness but also reduces computational time
to Naive Bayes as traditional classification and a new convo- significantly, achieving a notable accuracy rate of 98.87% with
lutional neural network named KR-NET in computer vision.
Specifically the research tried focusing on examining the perfor- the integration of Nick’s thresholding method.
mance of naive bayes with a hybrid model that integrates Fuzzy [2] research introduces a new convolutional neural net-
C-Means segmentation using two and three cluster configurations work (CNN) model, KR-NET, customized the network model
with nick threshold to KR NET for classifying the flower dataset. for the classification of retinal fundus images for glaucoma
The primary objective of the research is to evaluate each model’s detection. The paper details the application of a 26-layer
performance in handling diverse image datasets, with a focus on
metrics such as accuracy, and precision. CNN that utilizes variable mode decomposition for segmenting
Index Terms—computer vision, naive bayes, hybrid segmenta- the region of interest (ROI) in the fundus images, enhanc-
tion, convolutional neural network, flower dataset. ing the model’s ability to discriminate features crucial for
identifying glaucoma. The authors utilized a 10-fold cross-
I. I NTRODUCTION validation method and a 70:30 split ratio to validate their
In the field of computer vision, the effectiveness of clas- model, achieving notable accuracy rates across various public
sification and segmentation algorithms plays a pivotal role and private datasets.
in enhancing the interpretation and analysis of visual data.
III. M ETHODOLOGY
Traditional classification methods like Naı̈ve Bayes have used
for the long time due to its simplicity and ease of imple- This research has 2 methodologies applied to each approach
mentation. However, with the increasing complexity of image of classification method.
data and the need for higher precision and accuracy, there is a
continuous demand for more new approaches and techniques. A. Naive Bayes Classification
The integration of segmentation techniques can potentially This research employs a hybrid segmentation and classifi-
enhance its effectiveness, especially when applied to large cation methodology for the classification of flowers from the
visual data. in the other side, there is another technique for Oxford 102 Flower Dataset. The image will be segmented us-
classify the image using convolutional neural network (CNN), ing Fuzzy C-means for initial clustering followed by adaptive
CNN are designed to automatically and adaptively learn spatial thresholding to refine the segmentation, Nick adaptive thresh-
hierarchies of features through backpropagation, from low- to olding is applied after the Fuzzy C-means clustering to manage
high-level patterns. This makes them particularly suitable for variable image backgrounds and enhance the differentiation
image classification tasks where the preservation of spatial between each class. After the segmentation, key features such
relationships is key to recognizing visual patterns. as area, perimeter, major and minor axis lengths, aspect ratio,
This research focuses on evaluating the impact of hybrid eccentricity, and solidity are extracted. Then, the feature will
segmentation techniques on a traditional classifier and a convo- be trained for the model. After that, the model will be tried to
lutional neural network (CNN), using the Oxford 102 Flower classify the validation data. The methodology could be seen
Dataset as a test case. at 1
Layer Number of feature maps Stride Activation shape Total learnable parameters
Input 3 - 128 x 128 x 3 -
Convolution 48 1 126 x 126 x 48 1344
Batch normalization 48 - 126 x 126 x 48 96
ReLU 48 - 126 x 126 x 48 -
Max. pooling 48 2 63 x 63 x 48 -
Convolution 48 1 61 x 61 x 48 20784
Batch normalization 48 - 61 x 61 x 48 96
ReLU 48 - 61 x 61 x 48 -
Max. pooling 48 2 30 x 30 x 48 -
Convolution 48 1 4 x 4 x 48 20784
Batch normalization 48 - 4 x 4 x 48 96
ReLU 48 - 4 x 4 x 48 -
Convolution 48 1 2 x 2 x 48 20784
Batch normalization 48 - 2 x 2 x 48 96
ReLU 48 - 2 x 2 x 48 -
Fully connected 102 - 1 x 1 x 102
Softmax 102 - 1 x 1 x 102 -
”bee balm” shows improved recall but decreased precision, it and recall into a single metric, stands at 48.2%. This suggests
suggests the model is identifying more true positives but also that while the model is relatively consistent in identifying
more false positives, which might be a trade-off depending on correct positives.
the application of the algorithm. Classes like ”clematis” that
V. C ONCLUSION
might have improved in both metrics suggest a better fitting
of the model for those classes with an additional cluster. The comparative study on the classification of the Oxford
Among the high performers, ’passion flower’ stands out 102 Flower dataset reveals that the Convolutional Neural
with the highest F1-Score, which can be attributed to its Network (CNN) using the KR NET model significantly out-
distinct features that are easily captured by the model. In performs the hybrid segmentation technique that combines K-
contrast, ’clematis’, with the largest support but zero scores nearest neighbor clustering and nick adaptive thresholding,
across the board, presents a challenge, potentially due to its that applied to a Naive Bayes classifier. Despite the enhance-
visual similarity with other flowers or variability within the ments to the Naive Bayes classifier, its performance did not
class itself. reach that of the CNN, highlighting the substantial gap be-
tween traditional machine learning methods and modern deep
C. Experiment 2: Convolutional neural network using KR learning approaches in handling complex image classification
NET Model tasks.
Then for the second experiment, we doing it using the R EFERENCES
convolutional neural network using KR NET model. adapted to
[1] Chayashree Patgiri and Amrita Ganguly. “Adaptive
[2] the process begin with resize the image and then grayscale
thresholding technique based classification of red blood
the image before input those image into the model. but while
cell and sickle cell using Naı̈ve Bayes Classifier and
this dataset is a flower dataset, the grayscaling technique is
K-nearest neighbor classifier”. In: Biomedical Signal
not applied, the idea is using their color as a feature for the
Processing and Control 68 (2021), p. 102745. ISSN:
model.
1746-8094. DOI: https://doi.org/10.1016/j.bspc.2021.
The IV shows the performance of KR NET classification
102745.
on the Oxford 102 flower dataset. Certain classes like ”colt’s
[2] Kamesh Sonti and Ravindra Dhuli. “A new convolution
foot” and ”fire lily” show fine performance performance with
neural network model “KR-NET” for retinal fundus glau-
F1-scores of 1.0, indicating perfect precision and recall. This
coma classification”. In: Optik 283 (2023), p. 170861.
suggests that the model can distinctly identify features specific
ISSN : 0030-4026. DOI : https://doi.org/10.1016/j.ijleo.
to these classes without any false identifications.
2023.170861.
In contrast, classes like ”anthurium,” ”azalea,” and ”monks-
hood” exhibit zero values across precision, recall, and F1-
score, implying an inability of the model to correctly identify
any instances of these classes. This could be due to a lack of
distinctive features learned by the model for these flowers.
The KR NET model shows a moderate level of accuracy
(53.5%) on a complex task like flower classification, with
a balanced average precision and recall around 55.2% and
52.3% respectively. The F1-score, which combines precision
(a) (b)
(c)
Fig. 2. (a) Original Image. (b) Segmented image using 2 cluster.(c) Segmented image using 3 cluster.
VI. ATTACHMENT