Professional Documents
Culture Documents
Proceedings of The 2012 International Conference On Machine Learning and Cybernetics, Xian, 15-17 Uly, 2012
Proceedings of The 2012 International Conference On Machine Learning and Cybernetics, Xian, 15-17 Uly, 2012
M. SHAMSUL ARIFINI, M. GOLAM KIBRIA2, ADNAN FIROZE " M. ASHRAFUL AMINI, HONG YAN3
E-MAIL: arifin_cham@yahoo.com.golam.kibria@uits.edu.bd.adnan.firoze@gmail.com.aminmdashraful@ieee.org.
h.yan@cityu.edu.hk
Gomez et al. [10] introduced specialized neural techniques Disease Acne Eczema Psoriasis Tinea Scabies Vitiligo
Name Corporis
for the detection of Melanoma and Psoriasis. In an extensive
study, Handels et al. [11] introduced techniques to extract Patients 19 18 18 14 26 26
features from visual skin tumors. Hoffmann et al. [12]
focused on diagnosis of skin cancer using neural approaches Number 107 102 105 107 182 101
which was exemplary. of
Pictures
Taken
2. The Dermatological Disease Diagnosis System
1676
Proceedings of the 2012 International Conference on Machine Learning and Cybernetics, Xian, 15-17 J uly, 2012
Fig. 3. (a) Original Image with ROI-s Labeled (b) Image with the ROI-s
taken apart (c) Original image with no healthy skin present
• Area (in mm2 ) dy) is given by Shahbahrami et al. [21]. The (i, j) of Pdis the
• Energy, Entropy, Contrast and Homogeneity from GLCM
number of occurrences of the pair of gray-level i and j which
in each color channel [21]
are a distance "d" apart. The equations for obtaining the 4
features from a GLCM can be found in Shahbahrami et al.
At this point,we have separated the healthy skin and the
[21].
individual ROI-s in structures that contain the images of the
The distribution is how the diseased parts are spread out
RGB components. From the RGB components, we have
from each other from its manifestations. This was done from
computed the mean, variance (standard deviation) of the
calculating the Euclidean distances from one connected
channels individually. Energy, entropy, contrast and
component to another and getting their mean. It can be
homogeneity were computed from the regions by computing
expressed as the following equation (where, every i is a ROI
the gray level co-occurrence matrices (GLCM) in each
in a single image, dis�j is the distance from ROI i to j and m
channel as proposed by Shahbahrami et al. [21]. The
is the total number of ROI-s in an image). This feature
separation of ROI and healthy skin was done using masks as
calculates the "spread" of the ROIs in an image.
shown in Fig. 3.
Thus we can see how the healthy and normal skins were
separated to extract colors from them.
1677
Proceedings of the 2012 International Conference on Machine Learning and Cybernetics, Xian, 15-17 J uly, 2012
[m 1
should be noted that in the ROC curves: True Positives (TP)
L dist .. = Correctly detected Regions of Interest (diseased skin),
. . . ·-1
Dzstrzbutzon = 1-
Y
X �
ixels
False Positives (FP) = Incorrectly detected Regions of
m mm Interestlhealthy skin detected as ROI, True Negatives (TN) =
Correctly detected Healthy Skin (since the images are
(3) cropped to contain skin only, the value of this is constant at
The area or size of a ROI is very important to classify 100% at all times for our system), False Negatives (FN) =
the disease. It was done by fIrst using propping regions over Incorrectly detected healthy skin (healthy skin contains
the labeled ROI-s over the masks generated to detect the ROI/the measure of how much the system was unable to
regions of interest. Then we multiplied the area in pixel detect the diseased parts).
with the ratio of pixel to the reference object present in the From these results (Fig. 5) we see that we fInd the best
images (a 50 paisa coin with 2.2 c.m. in diameter) to fInally "detection of ROI accuracy" at the Color-Gradient threshold
get the area or size of the ROI in milimete? of 0.8 and the accuracies are 86.04% for unsupervised
clustering (Fig. 5-top) and 92.25% for semi-supervised (user
2.3.2 External features extraction inputted number of ROI) clustering (Fig. 5-bottom).
Consequently, we therefore choose the semi-supervised
The features that were collected from the patient history
clustering for further validation since it exhibited an
are the following: diseased body part, elevation and
improved detection rate over the unsupervised system.
frequency of occurrence (in months). The diseased body parts
were quantifIed and added manually to the sample
ROC Curves varying over Color Gradient Threshold
descriptions. Since stereo imaging was not feasible in a
(unsupervised system)
government hospital to take a 3D images, the elevations of 100.00 --True
.. Positive %
ROIs was taken from the patient history forms. A s::: 80.00
u
.. --False
dermatologist assisted in this regard. Elevation refers to the E 60.00 h,.,f-------''"'''
-' 'o;----1:-.JII'
;:- '-- '' OO.5 3 Positive %
a
depth or height of the diseased lesion or region of interest 1; --True
Negative %
40.00
Q.
(ROI). Naturally as it is a length that was measured and the 20.00 --False
unit was in millimeter. Negative %
0 .00
--Accuracy%
0.10 0.20 0.30 0.40 0.50 0.60 0.70 0.80 0.90 1.00
2.4. System Training and Testing Color Gradient
We have used feed-forward back-propagation neural ROC Curves varying over Color Gradient Threshold (semi
supervised system)
network training to perform this step. We validated and tested
our system using the tenfold cross validation process. The 100.00 -- True
Positive %
virtue of using a cross validation technique was that there ..
� 80.00
-- False
were no overlapping of the test data and training data,making Positive %
..
E
(; 60.00
the system testing results viable and dependable.
1:
-- True
Negative %
8'.. 40.00
--False
20.00 Negative %
0.10 0.20 0.30 0.40 0.50 0.60 0.70 0 .80 0.90 1.00
The selection of an optimized threshold of Color Gradient
color-gradient and neuron number of the feed-forward
back-propagation ANN was tested with different Fig. 5. ROC curves of ROI detection for varying Color Gradient using
configuration of ANN We have selected our semi-supervised
.
unsupervised clustering (top) and Sami-supervised clustering (bottom)
for Sample size: 2055.
system as it gives us better response than the unsupervised
system and they are discussed in next subsection.
3.2 Classification Performance
3.1. ROC Curves for Optimal Color Gradient Threshold
Upon creation of the training matrix of dimensions 103
x 2055 (2055 samples with 103 features each) and target
The ROC curves in Fig. 5 illustrate the selection of the
matrix of dimension 6 x 2055 (2055 total samples of 6
best threshold. Since the images were cropped to include only
diseases) we were set for tenfold cross validation. We
skin portions, thus in all cases True Negative=100%. It
1678
Proceedings of the 2012 International Conference on Machine Learning and Cybernetics, Xian, 15-17 J uly, 2012
partitioned each fold with 10% samples for testing and 90% 3.2.2 Disease Classification Accuracy Evaluation:
samples for training such that upon 10 folds we get
comprehensive accuracy results. We tested our system using We have evaluated our system's performance based on
70 neurons growing to 150 neurons in a single hidden layer. the sample sizes as weights to signify how well it performs.
From the plot in Fig. 6 we see that the highest classification Thus, based on table II, we conclude that our system
accuracy was found using 98 neurons in the hidden layer and performs with a classification accuracy of 94.0146% with
the accuracy was found to be 94.667%. training from 2055 samples of 6 diseases and an ANN with
one hidden layer with 98 neurons.
Neuron Numbers in Hidden Layervs.
Accuracy (Sample Size: 2055)
4. Conclusions
Accuracy
[1] Jianbo Gao,Jun Zhang,Matthew G. Fleming,IlyaPollak,
Disease Sample Size Accuracy X
Sample Size
Armand B. Cognetta. Segmentation of Dermatoscopic
Images by Stabilized Inverse Diffusion Equations.
Acne 405 96.66 % 391.347 Computerized Medical Imaging and Graphics,
Eczema 320 88.00 % 281.6 22(5):375-389 . 1998.
Psoriasis 288 89.33 % 257.2704 [2] Bushra Jalil . Multispectral Image Processing Applied to
T inea 280 88.33 % 247.324 Dermatology. Thesis Submitted for the Degree of MSc.
Corporis
Erasmus Mundus in Vision and Robotics (VIBOT).
Scabies 507 98.67% 500.2569
Vitiligo 255 99.67% 254.1585
Universite de Bourgogne .2008 .
Total: 2055 Total: 1932.083 [3] Michal Antkowiak . Artificial Neural Networks vs.
Accuracy using (equation 5) = (1931/2055) x 100% = Support Vector Machines for Skin Diseases
94.0146% Recognition . Master's Thesis in Computing Science .
Umea University,Sweden . 2006.
1679
Proceedings of the 2012 International Conference on Machine Learning and Cybernetics, Xian, 15-17 J uly, 2012
[4] S. Agatonovic-Kustrin and R. Beresford.Basic concepts [14] NIH: National Institute of Health (2011). Medline Plus -
of artificial neural network (ANN) modeling and its Scabies [Online]. Available:
application in pharmaceutical research. Journal of http://www.nlm.nih.gov/medlineplus/scabies.htrnl.
Pharmaceutical and Biomedical Analysis, 22:717-727, Retrieved: July 27,2011.
2000. [15] NIH: National Institute of Health (2011). Medline Plus -
[5] Michal Antkowiak. Recognition of skin diseases using Eczema [Online]. Available:
artificial neural networks. In Proceeding of USCCS'05, http://www.nlm.nih.gov/medlineplus/eczema.htrnl.
pages 313-325,2005. Retrieved: July 27,2011.
[6] Chris M. Bishop. Neural networks and their applications. [16] NIH: National Institute of Health (2011). Medline Plus -
Review of Scientific Instruments, 65(6):1803-1832, Psoriasis [Online]. Available:
1994. http://www.nlm.nih.gov/medlineplus/psoriasis.htrnl.
[7] Anthony du Vivier. Atlas of Clinical Dermatology. Retrieved: July 27,2011.
Churchill Livingstone,2002. [17] NIH: National Institute of Health (2011). Medline Plus -
[8] eCureMe. Medical Dictionary. [Online] . Available: Vitiligo [Online]. Available:
http://www.ecureme.com .Retrieved June 16,2011. http://www.nlm.nih.gov/medlineplus/vitiligo.htrnl.
[9] Monika Gniadecka, Peter Alshede Philipsen, Sigurdur Retrieved: July 27,2011.
Sigurdsson, Sonja Wessel, and et al. Melanoma [18] NIH: National Institute of Health (2011). Medline Plus
diagnosis by Raman spectroscopy and neural networks: TineaCorporis [Online]. Available:
Structure alterations in proteins and lipids in intact cancer http://www.nlm.nih.gov/medlineplus/tineacorporis.html.
tissue. J Invest Dermatol,122:443--449,2004. Retrieved: July 27,2011.
[10] David Delgado Gomez, Toke Koldborg Jensen, [19] NIH: National Institute of Health (2011). Medline Plus -
SuneDarkner, and Jens Michael Carstensen.Automated Scabies [Online]. Available:
visual scoring of psoriasis. [Online] . Available: http://www.nlm.nih.gov/medlineplus/scabies.htrnl.
http://www.imm.dtu.dkivisiondagIVD02/medicinsk/arti Retrieved: July 27,2011.
culo.pdf,2002. Retrieved: June 16,2011. [20] Rafael Gonzalez, Richard Woods . Color Image
[11] Handels H. ,RoB T. , Kreusch 1. , Wolff H. H. , and Processing. Digital Image Processing with MATLAB
P··oppl S. 1. . Feature selection for optimized skin tumor 2nd Edition. p. 204-207. Prentice Hall Publications ,
recognition using genetic algorithms. Artificial USA . 2003 .
Intelligence in Medicine, 16:283-297,1999. [21] Shahbahrami A. ,Borodin D. ,Juurlink B. . Comparison
[12] K. Hoffmann,T. Gambichler,A. Rick,M. Kreutz et al. . Between Color and Texture Features for Image
Diagnostic and neural analysis of skin cancer Retrieval . In Proceedings of ACM Multimedia
(DANAOS).A multicentre study for collection and Conference, Boston,MA. pp. 361 - 371,2008.
computer-aided analysis of data from pigmented skin [22] Health Policy of Bangladesh(2011). Ministry of Health
lesions using digital dermoscopy. British Journal of and Family Welfare - Government of People's Republic
Dermatology, 149:801-809,2003. of Bangladesh [Online]. Available:
[13] Samuel Freire da Silva .Dermatological Atlas [Online]. http://nasmis.dghs.gov.bdlmohfw/index.php?option=co
Available: m content&task=view&id=388<emid=483. Retrieved:
www.atlasdermatologico.com.br/index.htrnl .Retrieved:
Dec 11,2011.
July 27,2011.
1680