Transformer Embedded With Learnable Filters For Heart Murmur Detection
ture of data periodic characteristics. First, we perform a Fast Fourier Transform (FFT) along the item (sequence) dimension to transform the input information from the time domain to the frequency domain:

$$X = \mathrm{FFT}(F) \in \mathbb{C}^{n \times d}, \tag{1}$$

where $\mathrm{FFT}(\cdot)$ denotes the one-dimensional FFT, $X$ is a complex tensor representing the spectrum of $F$, and $n$ and $d$ denote the length of the feature sequence and the dimension of the feature embedding, respectively. We then multiply the obtained spectrum element-wise by the learnable filter $K$ to modulate the spectrum:

$$\tilde{X} = K \odot X. \tag{2}$$

Since our task is classification rather than time series prediction, we only need the encoder module of the Transformer model. Each encoder layer consists of a multi-head self-attention sub-layer followed by a fully connected feed-forward network. As described in [13], we use skip connections and layer normalization in each sub-layer. The transformer encoder module captures the long-term dependencies of the heart sound signal, which enables the network to learn more effective features from the input signal.

2.4. Voting Decision Strategy

According to the scoring rules of the 2022 PhysioNet/CinC challenge, the model's final predictions are made per patient. However, in the data preprocessing stage, each heart sound recording of a patient is divided into several segments. Therefore, how to aggregate the segment-level diagnostic results into the final diagnosis of the patient is crucial. Our aggregation strategy is based on the majority voting rule. First, we aggregate from the segment level to the record level, summarizing the predictions for the segments of a heart sound recording into a single prediction for that recording. Then, the predictions for the patient's one or more recordings are aggregated into the prediction for the patient. Ties may occur during aggregation. To break them, we set priorities according to the official cost calculation flow chart, giving the three categories the priority Present > Unknown > Absent.

Training        Validation   Test    Ranking
0.582 ± 0.013   0.394        0.402   39/40

Table 1. Weighted accuracy metric scores (official Challenge score) for our final selected entry (team Bear FH) for the murmur detection task, including the ranking of our team on the hidden test set.

Training        Validation   Test    Ranking
7493 ± 132      11331        15982   35/39

Table 2. Cost metric scores (official Challenge score) for our final selected entry (team Bear FH) for the clinical outcome identification task, including the ranking of our team on the hidden test set.
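The frequency-domain filtering of Eqs. (1) and (2) can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: the function name `apply_learnable_filter`, the array sizes, and the use of a fixed array in place of a trained filter are assumptions made for illustration. Any step that follows Eq. (2), such as an inverse transform back to the time domain, is not shown.

```python
import numpy as np

def apply_learnable_filter(F, K):
    """Modulate the spectrum of F with the filter K, following Eqs. (1)-(2).

    F: real array of shape (n, d); n is the sequence length and d the
       embedding dimension.
    K: complex array of shape (n, d). In a trained model this would be a
       learnable parameter; here it is a fixed array (assumption).
    """
    X = np.fft.fft(F, axis=0)   # Eq. (1): 1-D FFT along the sequence axis
    X_tilde = K * X             # Eq. (2): element-wise (Hadamard) product
    return X_tilde

# Hypothetical sizes, chosen only for the example.
rng = np.random.default_rng(0)
n, d = 8, 4
F = rng.standard_normal((n, d))

# An all-ones filter leaves the spectrum unchanged.
K_identity = np.ones((n, d), dtype=complex)
X_tilde = apply_learnable_filter(F, K_identity)
```

With an all-ones `K` the output equals the plain spectrum of `F`. A filter whose entries shrink toward zero at some frequencies would attenuate those frequencies, which is how such a filter could suppress noise.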
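The two-stage voting strategy of Section 2.4 can be sketched as follows. This is an illustrative reading of the text, not the authors' code: the function names and the data layout (a list of recordings, each a list of segment-level labels) are assumptions. Only the majority rule and the tie-break priority Present > Unknown > Absent come from the paper.

```python
from collections import Counter

# Tie-break priority stated in the text: Present > Unknown > Absent.
PRIORITY = {"Present": 2, "Unknown": 1, "Absent": 0}

def majority_vote(labels):
    """Return the most frequent label; ties go to the higher-priority class."""
    if not labels:
        raise ValueError("labels must be non-empty")
    counts = Counter(labels)
    # Compare by vote count first, then by class priority.
    return max(counts, key=lambda label: (counts[label], PRIORITY[label]))

def patient_prediction(recordings):
    """Aggregate segment-level labels to a single patient-level label.

    recordings: list of recordings, each a list of segment-level labels
    (hypothetical layout).
    """
    # Stage 1: segment level -> record level.
    record_level = [majority_vote(segments) for segments in recordings]
    # Stage 2: record level -> patient level.
    return majority_vote(record_level)

example = [
    ["Absent", "Present", "Present"],  # record-level: Present
    ["Absent", "Absent"],              # record-level: Absent
    ["Unknown", "Absent"],             # tie, resolved to Unknown
]
result = patient_prediction(example)   # three-way tie, resolved to Present
```

In the example, the three record-level votes differ, so the patient-level result falls back on the priority order and becomes Present.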
3. Conclusions

In this paper, we propose a learnable filter-based transformer architecture. The model uses learnable filters to adaptively reduce noise in the frequency domain and to better capture periodic features. At the same time, the transformer encoder module captures the long-term dependencies of the heart sound signal. However, our experiments showed that the method is at risk of over-fitting and does not fully exploit the strengths of the transformer. We will investigate more detailed solutions in future work.

Acknowledgments

This work was supported in part by the National Key Research and Development Project under Grant 2016YFC1000307-3 and Grant 2019YFE0110800, in part by the National Natural Science Foundation of China under Grant 61806032 and Grant 61976031, in part by the National Major Scientific Research Instrument Development Project of China under Grant 62027827, and in part by the Scientific and Technological Key Research Program of Chongqing Municipal Education Commission under Grant KJZD-K201800601.

References

[1] James SL, Abate D, Abate KH, Abay SM, Abbafati C, Abbasi N, et al. Global, Regional, and National Incidence, Prevalence, and Years Lived with Disability for 354 Diseases and Injuries for 195 Countries and Territories, 1990–2017: A Systematic Analysis for the Global Burden of Disease Study 2017. The Lancet 2018;392(10159):1789–1858.
[2] Markaki M, Germanakis I, Stylianou Y. Automatic Classification of Systolic Heart Murmurs. In 2013 IEEE International Conference on Acoustics, Speech and Signal Processing. IEEE. ISBN 1479903566, 2013; 1301–1305.
[3] Quiceno-Manrique A, Godino-Llorente J, Blanco-Velasco M, Castellanos-Dominguez G. Selection of Dynamic Features Based on Time–Frequency Representations for Heart Murmur Detection from Phonocardiographic Signals. Annals of Biomedical Engineering 2010;38(1):118–137. ISSN 1573-9686.
[4] Uğuz H. A Biomedical System Based on Artificial Neural Network and Principal Component Analysis for Diagnosis of the Heart Valve Diseases. Journal of Medical Systems 2012;36(1):61–72. ISSN 1573-689X.
[5] Dissanayake T, Fernando T, Denman S, Sridharan S, Ghaemmaghami H, Fookes C. A Robust Interpretable Deep Learning Classifier for Heart Anomaly Detection Without Segmentation. IEEE Journal of Biomedical and Health Informatics 2020;25(6):2162–2171.
[6] Dominguez-Morales JP, Jimenez-Fernandez AF, Dominguez-Morales MJ, Jimenez-Moreno G. Deep Neural Networks for the Recognition and Classification of Heart Murmurs Using Neuromorphic Auditory Sensors. IEEE Transactions on Biomedical Circuits and Systems 2017;12(1):24–34. ISSN 1932-4545.
[7] Zeng W, Yuan J, Yuan C, Wang Q, Liu F, Wang Y. A New Approach for the Detection of Abnormal Heart Sound Signals Using TQWT, VMD and Neural Networks. Artificial Intelligence Review 2021;54(3):1613–1647.
[8] Zhang W, Han J, Deng S. Abnormal Heart Sound Detection Using Temporal Quasi-Periodic Features and Long Short-Term Memory Without Segmentation. Biomedical Signal Processing and Control 2019;53:101560.
[9] Ahmedt-Aristizabal D, Armin MA, Denman S, Fookes C, Petersson L. Attention Networks for Multi-Task Signal Analysis. In 2020 42nd Annual International Conference of the IEEE Engineering in Medicine & Biology Society (EMBC). IEEE, 2020; 184–187.
[10] Reyna MA, Kiarashi Y, Elola A, Oliveira J, Renna F, Gu A, et al. Heart Murmur Detection from Phonocardiogram Recordings: The George B. Moody PhysioNet Challenge 2022. MedRxiv 2022;URL https://doi.org/10.1101/2022.08.11.22278688.
[11] Oliveira J, Renna F, Costa PD, Nogueira M, Oliveira C, Ferreira C, et al. The CirCor DigiScope Dataset: From Murmur Detection to Murmur Classification. IEEE Journal of Biomedical and Health Informatics 2021;26(6):2524–2535.
[12] Zhou K, Yu H, Zhao WX, Wen JR. Filter-Enhanced MLP is All You Need for Sequential Recommendation. In Proceedings of the ACM Web Conference 2022. 2022; 2388–2399.
[13] Vaswani A, Shazeer N, Parmar N, Uszkoreit J, Jones L, Gomez AN, et al. Attention is All You Need. Advances in Neural Information Processing Systems 2017;30.

Address for correspondence:

Pengfei Fan
School of Computer Science
No.2, Chongwen Road, Nan’an district, Chongqing, China
fpf1033016159@163.com

Yucheng Shu
School of Computer Science
No.2, Chongwen Road, Nan’an district, Chongqing, China
shuyc@cqupt.edu.cn

Yiming Han
School of Computer Science
No.2, Chongwen Road, Nan’an district, Chongqing, China
s210231061@stu.cqupt.edu.cn