Professional Documents
Culture Documents
Cbmi2023 36
Cbmi2023 36
we propose to use the attention-based deep MIL approach, which Nodule bags
...
...
...
...
... Mechanism Non-nodule
I1 I2 Ik bags
Output layer
Outline of proposed scheme: FPR in lung nodule detection has Instances Backbone layer h1 h2 hk
MIL Attention
Weighted
layer
Feature vectors
always been demanding in the computer-aided detection system. A feature vectors
create a single vector to represent the bag. This vector was then
identified whether the bag as nodule or non-nodule.
Furthermore, class weighting should be considered to address
the imbalanced data classification problem between nodule and
non-nodule instances. This can be accomplished by giving different
weights to the majority and minority classes. In the training phase,
the weight difference will influence the final classification. However,
weight of each class is usually difficult to choose without prior
knowledge. A natural method of determining weight distribution
is using proportion of classes in training set. The whole purpose
of this method is to penalize the misclassification created by the
minority class by assigning a higher weight value while, at the
same time, reducing the weight of the majority class.
1 𝑛_𝑠𝑎𝑚𝑝𝑙𝑒𝑠
𝑤𝑗 = × (3)
𝑛_𝑠𝑎𝑚𝑝𝑙𝑒𝑠 𝑗 𝑛_𝑐𝑙𝑎𝑠𝑠𝑒𝑠
where 𝑤 𝑗 is weight of each class. 𝑛_𝑠𝑎𝑚𝑝𝑙𝑒𝑠 is total number of
samples in the dataset. 𝑛_𝑐𝑙𝑎𝑠𝑠𝑒𝑠 is total number of unique classes
in the target, here is 2 (nodule and non-nodule classes). 𝑛_𝑠𝑎𝑚𝑝𝑙𝑒𝑠 𝑗
is total number of sample of the respective class.
Figure 4: Example of attention weights for nodule bags.
3 EXPERIMENTS
Image dataset: Our experiment used the public dataset LUng Nod-
ule Analysis 2016 (LUNA16) challenge [9], a widely utilized dataset Experiment Setting: We first build the attention CNN model
for the task of lung nodule detection and FPR. Four experienced with a 64 × 64 input. A convolutional of 64 layers are added to
thoracic radiologists manually annotated the candidate nodules. help the model increase attention on learning instances. Finally,
Each radiologist labeled the nodules as nodules ≥ 3 mm, nodules the network is followed by two fully-connected layers, with 512
< 3 mm, or non-nodules. LUNA16 dataset consists of CT scans from and 256 neurons, respectively. We use an Adam optimizer to train
888 patients with 1,186 nodules, confirmed by at least three of four the CNN with an initial learning rate of 1 × 10 −8 . We exponentially
radiologists. LUNA16 dataset uses the candidate list provided by the decay the learning rate with a ratio of 1 × 10 −2 after every epoch.
competition host, a total of 754,975 candidates as non-nodules. We A lower learning rate can avoid a suboptimal set of weights that
use 1,068 nodules (90%) from the total of 1,186 ground-truth nodules are learned too quickly. The number of training epochs is 100, and
for generating the training and validation set; the remaining 118 the batch size is 1. The training loss is a sparse cross-entropy loss
nodules (10%) are kept for testing data averaged by the number of candidates. In addition, we used early
The dataset provided by the organizers of LUNA16 has about 460 stopping when performance on validation set was not improved.
times more non-nodules than nodules. While applying more train- To confirm the effectiveness of the proposed method, accuracy,
ing samples can improve the training performance, but training specificity, recall, and F1-score are used to evaluate the model pre-
on an imbalanced dataset can lead to overfitting problems. There- diction. We consider the classification ability between nodule and
fore, we implement several sampling and augmentation methods non-nodule of MIL model. We train our model from scratch without
to handle the data skewness problem. We included all the nodule pre-trained weights.
samples in the training data. The training set is further balanced by Results: Table 1 compares the nodule and non-nodule classifica-
up-sampling the nodules by applying the following adjacent slice tion performance of our proposal to other methods. The classifica-
and augmentation methods. In the adjacent slice strategy, most tion by our method, which adopts an attention-deep MIL approach
pulmonary nodules are in spheroidal shapes while radiologists only structure, outperforms all other methods in classification metrics.
annotated CT slices with the largest cross-sectional area. Therefore, The proposed method is the most accurate and received the highest
we also measure two upper and two lower adjacent slices of the recall. A higher recall is especially important in the medical domain
ground-truth nodule image marked by the radiologists and add since false negatives could lead to severe consequences, includ-
them to the training set. This method results in 5,210 lung nodules ing patient fatality. We also notice that the attention mechanism
for the training set. All nodule samples are then augmented to performs better than the plain CNN networks.
increase the total of training samples. Each sample is randomly ro- Finally, we present an exemplary result of the attention mecha-
tated by 20 degrees, horizontally and vertically flipped. In addition, nism of MIL in Fig. 4. In our experiments, each bag consists total of
each nodule image is slightly shifted to a random position. The ran- three images. For each image, the corresponding attention score
dom center shifting method prevents all objects from being located is given by our trained network. The bag is accurately predicted
in the center of the patch. These augmentation methods balance as nodules, and all nodule instances in a bag are correctly high-
the training set. All data sampling and augmentation techniques lighted with higher scores. Hence, the attention mechanism works
were implemented before the training model. as expected.
CBMI 2023, September 20–22, 2023, Orléans, France Chi Cuong et al.
4 CONCLUSION AND FUTURE WORK Maria Evelina Fantacci, Bram Geurts, et al. 2017. Validation, comparison, and
combination of algorithms for automatic detection of pulmonary nodules in
In this study, we developed a method for classifying nodule and computed tomography images: the LUNA16 challenge. Medical image analysis
non-nodule to reduce the false positive rate in CT images using 42 (2017), 1–13.
[10] Haiying Yuan, Zhongwei Fan, Yanrui Wu, and Junpeng Cheng. 2021. An efficient
attention-based MIL. Experimental show that our attention MIL multi-path 3D convolutional neural network for false-positive reduction of pul-
can outperform the learning ability with supervised learning meth- monary nodule detection. International Journal of Computer Assisted Radiology
ods and highlight the instance that contributed to the decision by and Surgery 16, 12 (2021), 2269–2277.
the attention mechanism. Our method automatically classifies nod-
ule candidates in CT images more efficiently and accurately than
traditional supervised learning methods. Our sampling and class
weighting can address the problem of imbalanced learning.
We used a binary classification problem to evaluate our proposed
approach. However, extending the current study to a multiple-class
classification task as nodule malignancy prediction through an
attention-based MIL network is an interesting direction for fur-
ther research. In addition, by taking advantage of three-dimension
spatial information of nodules in CT images, the three-dimension
attention-based MIL method might be one potential shortcoming
to be overcome in future research.
ACKNOWLEDGMENTS
This work was supported by the French Embassy in Vietnam under
the grant No. 16/2022/SCAC.
REFERENCES
[1] Guitao Cao, Tiantian Huang, Kai Hou, Wenming Cao, Peng Liu, and Jiawei Zhang.
2018. 3D convolutional neural networks fusion model for lung nodule detection
onclinical CT scans. In 2018 IEEE International Conference on Bioinformatics and
Biomedicine (BIBM). IEEE, 973–978.
[2] George Cybenko. 1989. Approximation by superpositions of a sigmoidal function.
Mathematics of control, signals and systems 2, 4 (1989), 303–314.
[3] Salsabil Amin El-Regaily, Mohammed Abdel Megeed Salem, Mohamed Has-
san Abdel Aziz, and Mohamed Ismail Roushdy. 2020. Multi-view Convolutional
Neural Network for lung nodule false positive reduction. Expert systems with
applications 162 (2020), 113017.
[4] Maximilian Ilse, Jakub Tomczak, and Max Welling. 2018. Attention-based deep
multiple instance learning. In International conference on machine learning. PMLR,
2127–2136.
[5] Khai Dinh Lai, Thuy Thanh Nguyen, and Thai Hoang Le. 2021. Detection of lung
nodules on ct images based on the convolutional neural network with attention
mechanism. Annals of Emerging Technologies in Computing (AETiC) 5, 2 (2021),
78–89.
[6] Chi Cuong Nguyen, Giang Son Tran, Jean-Christophe Burie, Thi Phuong Nghiem,
et al. 2021. Pulmonary nodule detection based on faster R-CNN with adaptive
anchor box. Ieee Access 9 (2021), 154740–154751.
[7] Jan Ramon and Luc De Raedt. 2000. Multi instance neural networks. In Proceedings
of the ICML-2000 workshop on attribute-value and relational learning. 53–60.
[8] Vikas C Raykar, Balaji Krishnapuram, Jinbo Bi, Murat Dundar, and R Bharat
Rao. 2008. Bayesian multiple instance learning: automatic feature selection and
inductive transfer. In Proceedings of the 25th international conference on Machine
learning. 808–815.
[9] Arnaud Arindra Adiyoso Setio, Alberto Traverso, Thomas De Bel, Moira SN
Berens, Cas Van Den Bogaard, Piergiorgio Cerello, Hao Chen, Qi Dou,