Professional Documents
Culture Documents
Brain Tumor Classification Using Resnet-101 Based Squeeze and Excitation Deep Neural Network
Brain Tumor Classification Using Resnet-101 Based Squeeze and Excitation Deep Neural Network
Brain Tumor Classification Using Resnet-101 Based Squeeze and Excitation Deep Neural Network
net/publication/336877091
Brain Tumor Classification Using ResNet-101 Based Squeeze and Excitation Deep
Neural Network
CITATIONS READS
7 579
6 authors, including:
Some of the authors of this publication are also working on these related projects:
Deep learning for screening of interstitial lung disease patterns in high-resolution CT images View project
All content following this page was uploaded by Lokesh Nandanwar on 25 May 2020.
Abstract—The brain tumor is one of the leading and most out in the country, out of which 20% are children which
alarming cause of death with a high socio-economic impact in amounts to over 2,500 children each year [1]–[3]. Due to the
Occidental as well as eastern countries. Differential diagnosis and lack of adequate doctors, many districts and rural parts of
classification of tumor types (Gliomas, Meningioma, and Pituitary
tumor) from MRI data are required to assist radiologists as well India is covered by a single specialist. As a result of this
as to avoid the dangerous histological biopsies. In the meantime, disparity, only a small proportion of the patients with the
improving the accuracy and stability of diagnosis is also one different socio-economic background can find their way to
challenging task. Many methods have been proposed for this the specialists. It is also anticipated that it will take a long
purpose till now. In this work, an automatic tool for classification time before the neurology workforce in clinical sciences is
of brain tumor from MRI data is presented where the image slice
samples are passed into a Squeeze and Excitation ResNet model adequate to deal with this kind of problems faced by the
based on Convolutional Neural Network (CNN). The use of zero- Indian population. Human assessment is one of the typical
centering and normalization of intensity for smooth variation ways for brain tumor detection and classification from MR
of the intensity over the tissues was also investigated as a pre- images and it highly depends on the experience of radiologists
processing step which together with data augmentation proved who evaluate the characteristics of image slices thoroughly
to be very effective. A relative study had been done to prove
the efficacy of the proposed CNN model in free tumor database. and makes it time-consuming and prone to error. The heavy
Experimental evaluation shows that the proposed CNN archives workload in a densely populated country like India may cause
an overall accuracy rate of 89.93% without data augmentation. fatigue and distraction for the physicians which may pose a
Addition of data augmentation has further improved the accura- constant threat to patient safety. Consequently, a demand has
cies up to 98.67%, 91.81% and 91.03% for Glioma, Meningioma been raised up from the neuroradiologists for the new approach
and Pituitary tumor respectively with an overall accuracy of
93.83%. Promising improvement with reference to sensitivity and of Computer-Aided Diagnosis (CAD). In view of this, there
specificity compared with some of the state-of-the-art methods is a good scope of developing a computerized diagnostic tool
was also observed. which will automatically detect and classify the brain tumors
Index Terms—ResNet, Convolutional Neural Network, Classi-
from the MR images and thus assist the medical practitioners.
fication, Brain Tumor Magnetic Resonance Imaging (MRI) is a non-invasive,
based on non-ionizing radiation imaging procedure and treated
I. I NTRODUCTION as the most popular techniques to diagnose and treatment of
brain tumor than the Computer Tomography (CT) or Ultra-
Among various diseases, the brain tumor has become a
sound image [4], [5]. Recognizing patterns or texture for clas-
pressing health concern in our country today. For many of
sification from highly variable images is time-consuming and
the cases, treatment options are extremely limited due to late
needs a lot of human effort especially if the data is very large.
detection and wrong diagnosis. Brain tumors are diagnosed
Various techniques have been proposed to help radiologists
with 40,000 to 50,000 new people living in India every year
to increase their diagnostic accuracy. Convolutional Neural
according to a recent hospital-based study that was carried
Network (CNN) which automatically learns a hierarchy of
Visvesvaraya PhD Scheme for Electronics and IT, Ministry of Electronics increasingly complex features directly from data have recently
and Information Technology(MeitY), Government of India. achieved impressive results by raising the accuracy of medical
∗ First three authors have equal contribution.
diagnosis and disease classification [6]–[9].
978-1-5386-7989-0/19/$31.00 ©2019 IEEE In this work, a fully automatic methodology which can
categorize brain tumors from a limited amount of data into 1) ROI segmentation: A region of interest (ROI) is a subset
different pathological types is proposed by using convolu- of an image or a dataset identified from the original samples
tional neural networks, which may release some workload for a specific purpose. The boundary of a brain tumor in T1
of the doctors. The squeeze and excitation [10] CNN model weighted MRI slices is the ROI in this case. The annotated
using Residual network architecture [11] was used for the masks of the tumors, which contained labels ‘1’ for tumor
classification of tumors. The remaining paper is structured region and ‘0’ for everything else, were provided in the brain
as follows: The dataset, preprocessing, data augmentation are tumor database. The exact tumors were extracted from the
described in Section II. Section III delivers an analysis of the brain MRI samples by multiplying them pixelwise with the
proposed model. The experimental setup and results are shown corresponding masks. Due to the variability of tumor sizes
in Section IV. Finally, some conclusion is drawn in Section V. in the samples, tumor ROI images were resized and added
zero padding to fit into the particular input shape for the
II. D ESCRIPTION OF DATASET proposed model. The dimension of each of the image after
A. Data Acquisition ROI segmentation was 256 × 256 pixels.
2) Intensity Zero-centering: Intensity zero-centering(or
The proposed method was tested and evaluated on the Brain mean-centering) [14] is a transformation of the data linearly
tumor database which has been collected in the year from which shifts the data so that it is centered at the origin.
2005 to 2010, from Nanfang Hospital, Guangzhou, China, Generally, because of zero-centering, non-linear operations
and General Hospital, Tianjin Medical University, China [12], and analysis perform well. Zero-centering to each of the ROI
[13]. The brain tumor dataset contained 3064 T1-weighted segmented dataset images was performed because it is much
contrast-enhanced images from 233 patients with three kinds easier to understand variation when you are centered at the
of brain tumor: meningioma (708 slices), glioma (1426 slices), origin. This process was done by subtracting the mean of all
and pituitary tumor (930 slices). The dimension of each MR the pixels of an image from all the pixels of that image.
slice was 512 × 512 pixels with a thickness and slice gap
3) Intensity Normalization: Generally, MRI intensities are
of 6 mm and 1 mm respectively. Only 3049 out of 3064
acquired in arbitrary units; thus, it becomes difficult to com-
images were used here because remaining 15 images were
pare images across scanners, subjects, and visits, even if the
of lesser resolution of 256 × 256 pixels. The dataset also
same procedure is used. Intensity normalization brings the
contained the tumor masks which was manually delineated
intensities to a common scale across people. This affected
and validated by three experienced radiologists from Medical
the model’s performance, prediction, and inference. It is an
College and Hospital Kolkata, India. One sample from each
important step in any image analysis with more than one
class of tumor is displayed in Fig.1 where the red line indicates
subject or time point to ensure comparability across images.
tumor boundary. Fig.2 shows the Preprocessed images for the
Division of all pixel values of one image by their standard
same.
deviation was performed to get the normalized images.
B. Pre-processing 4) Data Augmentation: Data augmentation [15] is another
way of Regularization which is used to reduce overfitting
of models by an increase in the amount of training data
using the present original information only. The idea of data
augmentation is very old, and in fact, various data augmen-
tation techniques have been applied to numerous problems.
In this case, several combinations of transformations on the
zero-centered, normalized, ROI segmented images of Glioma,
(a) (b) (c)
Meningioma, and Pituitary tumor data were applied. The
transformations were Flip, Rotate, Elastic transform [16] and
Fig. 1. Brain MRI dataset sample of Glioma(a), Meningioma (b), Pituitary Shear with variable degrees of transformations, for example,
tumor (c) 25 to 40 degrees of rotation, left-right, and top-bottom flip etc.
TensorLayer [17], a python library, is used as a framework for
this purpose.
TABLE I
AUGMENTED DATA AFTER APPLYING DATA AUGMENTATION ON ORIGINAL
DATASET
After applying all these preprocessing techniques, dataset observed that it is easier to train this network than training
was increased to 2850 Glioma, 2124 Meningioma, and 2745 simple deep convolutional neural networks. It also resolves
Pituitary tumor samples (Table I). Further, it was divided into the problem of accuracy degradation. ResNet-101 is 101-layer
training and testing data randomly at a ratio of 9:1. Testing Residual Network and is a modified version of the 50-layer
data contains 10% of the total data. ResNet.
2) Squeeze and Excitation block with ResNet: It was found
that Squeeze and Excitation (SE) blocks [10] combined with
X
ResNet architecture, has performed outstandingly good in clas-
Residual WxHxC sifying the brain tumors. By stacking the SE blocks one after
1x1xC
the other, SE-ResNet-101 [10] architecture is constructed that
X Global Pooling
generalise extremely well across our brain tumor dataset. For
Fully Connected 1 x 1 x C/r given transformation of a convolution or a set of convolutions,
Residual ReLU 1 x 1 x C/r equation (1) is as follows:
Fully Connected 1x1xC Ztr : X → Y, XRW0 ×H0 ×C0 , Y RW ×H×C (1)
Sigmoid
1x1xC Here width, height, and the number of channels are repre-
sented by W, H, and C respectively after taking X and Y as
X̃ Scale WxHxC input and output feature maps and r signifies the reduction
a) ResNet Module ratio [10]. The input features X are first passed through
WxHxC a squeeze operation, which is used to average the feature
maps in a spatial plane. After this, two fully connected layers
X̃
b) SE-ResNet Module
with ReLU and Sigmoid activations are used separately for
excitation operation. The Squeeze and Excitation block with
Fig. 3. Block Diagram of a) Residual Module b) SE-ResNet module ResNet-101 which performs feature recalibration is shown in
Fig.3(b) [10].
Augmented
Images
SE-ResNet-101
T1 - weighted CE- Segmented Tumor Segmented Tumor CNN
Normalized Image
MRI image (512 * 512) (256 * 256) One hot Encoded
Output
Legend
ROI Region of Interest
Segmentation
Annotated Tumor IN Intensity Normalisation
Mask ZC Zero Centering
Element wise
Multiplication
consisting of all 3 classes of the tumor to get the overall result 3) Sensitivity: Sensitivity [19] (True positive rate or recall)
of the model. The 2nd phase was about evaluating the model measures the proportion of actual positives that are correctly
on 3 separate classes of tumors to get the accuracy on the identified.
particular type of tumor.
No. of true positives
Sensitivity =
IV. E XPERIMENTAL S ETUP AND R ESULTS No. of true positives + No. of false negatives
(4)
A. Implementation 4) Cross-entropy loss: Cross-entropy loss [21], sometimes
known as log loss, computes the performance of a multi-class
The whole setup was implemented and trained in Linux
classification model whose output value lies in between 0 and
environment using NVIDIA GTX 1060 8GB GPU on a
1. Cross-entropy loss increases as the predicted probability
system of 16 GB RAM and core-i7 7th generation @4.0GHz
deviate from the actual value.
processor. Keras, a python module, was used as a framework
for all the networks. C. Results
After the training of the proposed neural network for 29
B. Performance metrics epochs, training and validations plots of accuracy and cross-
entropy loss are shown in Fig.6 and Fig.5. This model was
1) Categorical Accuracy i.e Accuracy: Categorical Ac- trained on the original dataset for 20 epochs and after that, it
curacy calculates the mean accuracy rate which calculates was trained on the augmented dataset with pre-trained weights.
the proportion of true positive and true negative across all This gives slightly low accuracy and high loss in 21st epoch,
predictions for multi-class classification problems. which becomes much better at around 29th epoch. During
testing the trained model on the testing data, the accuracy
No. of correct predictions was 88.18%. The accuracy of the model on testing data, when
Accuracy = (2)
Total No. of predictions model got further trained on augmented data, was improved
to 94.70%.
The performance of the proposed model can be evaluated
2) Specificity: Specificity [19] (True negative rate) mea- from the tables and plots provided. Table II shows the con-
sures the proportion of actual negatives that are correctly fusion matrix along with the Specificity [19], Sensitivity [19]
identified. and Accuracy (Categorical accuracy) obtained for each of the
3 classes. It was observed that classification of Glioma gave
No. of true negatives the best result as Glioma is having largest training data in
Specificity = comparison with other two tumor classes.
No. of true negatives + No. of false positives
(3) The presented model had the overall accuracy of 93.83%,
specificity of 0.9715 and sensitivity of 0.9384 respectively,
2019 Second International Conference on Advanced Computational and Communication Paradigms (ICACCP)
TABLE III
2.5
C OMPARISON OF S PECIFICITY AND S ENSITIVITY WITH SVM MODEL
Training without data augmentation Training with data augmentation
Validation without data augmentation Validation with data augmentation Glioma Meningioma Pituitary Tumor
2 Performance Metrics
SVM model SE-ResNet-101 SVM model SE-ResNet-101 SVM model SE-ResNet-101
CATEGORICAL CROSS-ENTROPY LOSS
94.5
1 94
93.5
0.5
93
Accuracy (%)
92.5
0
0 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29
92
NUMBER OF EPOCHS
91.5
91
Fig. 5. Plot of categorial cross-entropy during Training of Neural network
model 90.5
90
100
89.5
90 SVM DNN SE-ResNet-101
Accuracy 91.14 91.43 93.83
80
60
ACCURACY (%)
proper scientific environment. Funding is provided by Visves- [22] J. S. Paul, A. J. Plassard, B. A. Landman, and D. Fabbri, “Deep learning
varaya Ph.D. scheme of DeitY (Department of Electronics and for brain tumor classification,” in Proc. SPIE 10137, Medical Imaging
2017: Biomedical Applications in Molecular, Structural, and Functional
Information Technology), Govt. of India. The authors are also Imaging, vol. 1013710, pp. 10137–16, 2017.
thankful to Medical College Kolkata and EKO Diagnostics,
Kolkata for providing valuable input and feedback for this
work.
R EFERENCES