Professional Documents
Culture Documents
1 s2.0 S1746809422001033 Main
1 s2.0 S1746809422001033 Main
A R T I C L E I N F O A B S T R A C T
Keywords: Blood pressure (BP) is an important clinical indicator for cardiovascular health assessment, and accurate
Blood pressure monitoring monitoring of continuous BP is still a challenging task. In this paper, an attention-based residual improved U-Net
Photoplethysmography signal (ARIU) model is proposed to improve the accuracy of continuous BP monitoring by using the photo
Deep learning
plethysmography (PPG) signal. This model consists of an improved U-Net employed to learn the high dimen
Attention mechanism
Residual mechanism
sional features from PPG signal, an attention module embedded in the skip connections to reduce redundancy of
learning features, and a residual module replaced common convolution to prevent degradation problems and
enhance generalization performance. The raw PPG signals and arterial BP download from the MIMIC-III data
base, the first and second derivatives of PPG signal are utilized as additional inputs to increase the multiform of
input information, and a data input way of parallel-based fusion are adopted to improve the effectiveness of
information mining. After data preprocessing, the dataset used in this study contains 150,000 samples, belonging
to 100 subjects. The reliability of the proposed model is verified by the ablation experiments, and the
advancement of the model is demonstrated by the comparison experiments with other state-of-art methods. The
mean absolute error (MAE) and standard deviation (STD) of systolic blood pressure (SBP) predicted by the
proposed model are 4.75 mmHg and 6.72 mmHg respectively, and that of diastolic blood pressure is 2.81 mmHg
and 4.59 mmHg. The results meet the requirements of the Advancement of Medical Instrumentation (AAMI) and
reach the “Grade A” of the British Hypertension Society (BHS) protocol.
1. Introduction technical requirements, this approach is only suitable for the patients
under major surgery or critical care [7,8]. Intermittent BP monitoring
Cardiovascular disease (CVD) is a serious health problem that causes method mainly includes the Korotkoff method and the oscillometric
millions of deaths every year around the world [1], and blood pressure method, which also can reach high accuracy [9], but have the disad
(BP) is the main clinical symptom used to detect, diagnose or monitor vantages of insufficient ability to track and reflect the instantaneous and
cardiovascular and hemodynamic diseases [2,3]. Therefore, BP moni continuous BP changes. In contrast, the continuous BP monitoring
toring is very necessary for the prevention, control, and research of these method can monitor the continuous BP change in the daily environment,
diseases. and the measurement results are closer to the real situation of the human
According to the different measurement techniques, the BP moni body with more physiological effectiveness and monitoring accuracy
toring method can be divided into three categories: direct BP monitoring [10,11].
method, intermittent BP monitoring method, and continuous BP moni At present, continuous BP monitoring methods used in cardiovas
toring method [4]. Direct BP monitoring method, also known as a direct cular health management mainly include arterial tonometry method,
measurement with an intra-arterial catheter, is an invasive monitoring volume clamp method, pulse wave velocity (PWV) or pulse wave transit
approach [5], and its measurement result is considered as the “gold time (PWTT) measurement method, and pulse wave parameter mea
standard” [6], but due to the disadvantages of complications and high surement method [12,13]. The arterial tonometry method uses pressure
* Corresponding author.
E-mail address: jmzhu_usst@163.com (J. Zhu).
1
Contributed equally to the work.
https://doi.org/10.1016/j.bspc.2022.103581
Received 24 September 2021; Received in revised form 28 January 2022; Accepted 12 February 2022
Available online 23 February 2022
1746-8094/© 2022 Elsevier Ltd. All rights reserved.
M. Yu et al. Biomedical Signal Processing and Control 75 (2022) 103581
sensors pressed directly against the skin to measure the pressure pulse signals through skip connections between the contracting and expansive
wave and calculate BP [14], but it is difficult to always maintain the path, but the direct fusion of feature information at different scales will
precise measurement position of the sensor and susceptible to motion inevitably generate some inefficient or redundant information for BP
artifacts [15]. The volume clamp method follows the principle of the monitoring. On the other hand, the conventional convolution operations
relaxed wall of the vessel and uses the counter-pressure instead of the in the classic U-Net generally have the information loss and vanishing
artery pressure, but the measurement accuracy is affected by observa gradient, which reduced the effectiveness of feature extraction from the
tion position. PWV or PWTT measurement method respectively utilizes PPG signals [29]. Aiming at the shortcomings of the classic U-Net model,
propagation speed or propagation time to calculate arterial BP and a handful of researchers have made new improvements to the model
generally requires at least two sensors to collect relevant physiological structure [30,31]. Although these optimizations have achieved good
signals, such as electrocardiogram (ECG) signals and pulse wave signals, results in the research fields, such as image segmentation, the
but multi-sensors is inconvenient to deploy and monitoring accuracy is improvement of the model is still beginning and developing in contin
limited by synchronization algorithms of sensors. The pulse wave uous BP monitoring by using physiological signals.
parameter measurement method uses the characteristic information Relevant researches show that the first derivative of the PPG signal
related to pulse wave waveform to establish the correlation between the (VPG) and the second derivative of the PPG signal (APG) can provide
waveform and BP, which is the research hot spot for high precision non- additional potential information for improving the accuracy of BP esti
invasive BP monitoring and needs to be further studied. mation [32]. Shimazaki et al. [33] used the serial combination of the
Due to the advantages of simple operation, stable performance, and PPG, VPG, and APG signals to estimate cuffless BP, which achieved
strong adaptability, photoplethysmography (PPG) has been gradually higher accuracy than only using the separate PPG signal. Slapničar et al.
used for non-invasive continuous BP monitoring, which can detect the [25] used the PPG, VPG, and APG signals to train their proposed
changes of blood volume in human microvessels [16] and contains rich network respectively, and further trained the network after fusing the
cardiovascular information [17]. Teng et al. [18] extracted manual features obtained from the three signals, which obtained better results
features from a single PPG signal and used a linear regression method to than the separate PPG signals. Nevertheless, these studies didn’t
estimate BP, which achieved good monitoring accuracy. Zadi et al. [19] consider the synchronization and correlation between the PPG, VPG,
established the autoregressive moving average (ARMA) model based on and APG signals in the time domain, so the utilization of these potential
the peak and valley waveform features of the PPG signal to estimate information also needs to be further improved.
systolic blood pressure (SBP) and diastolic blood pressure (DBP), which In order to address the aforementioned problems and further
obtained good predicting results. Li et al. [20] used semi-classical signal improve the accuracy of non-invasive BP monitoring, an attention-based
analysis (SCSA) to extract features from PPG signal and then developed residual improved U-Net (ARIU) model by using the parallel-based
and compared three machine learning methods to estimate BP, which fusion of PPG, VPG, and APG are proposed to monitor continuous BP.
revealed support vector machine (SVM) algorithm overall achieved the The main contributions of this paper are outlined as follows:
best estimation accuracy. Kurylyak et al. [21] extracted 21 feature pa
rameters from the PPG signals and established an artificial neural (1) An ARIU model was proposed to monitor continuous BP by using
network (ANN) model to estimate continuous BP, which achieved better the PPG signals, which the improved U-Net was employed to
results than the linear regression method. However, the handcrafted learn multi-scale features, the attention module was utilized to
features in these studies make the PPG signals lose the potentially sen reduce redundant features, and the residual module was used to
sitive features to some extent, which hindered the further improvement prevent performance degradations.
in accuracy of continuous BP monitoring. (2) Considering the synchronization and correlation in the time
With the powerful capabilities of data mining and feature extraction domain, the parallel-based information fusion of the PPG, VPG,
[22,23], deep learning has gradually been applied to the field of and APG signals can improve the efficiency and effectiveness of
continuous BP monitoring based on the PPG signals. Shimazaki et al. feature extraction.
[24] used an auto-encoder to extract features from the PPG signals and (3) The proposed model was validated by ablation and comparison
employed a multi-layered neural network (MNN) model to estimate BP, experiments. Compared with other state-of-art methods, the
and the accuracy is better than the multiple regression analysis (MRA) proposed model can obtain higher accuracy of BP monitoring.
model. But the methods still did not get rid of the limitations of the
feature engineering. Some researchers directly employ the deep learning The rest of this paper is organized as follows. Section 2 describes the
model to adaptively extract high-dimensional features from the PPG used dataset and the model for BP monitoring. The experiment and
signals. Slapničar et al. [25] proposed a deep neural network model to result analysis are provided in Section 3. The conclusion are drawn in
estimate BP based on time–frequency information fusion, which ach Section 4.
ieved better results than the random forest model based on manually
extracted features from the PPG signals. Harfiya et al. [26] designed an 2. Materials and methods
auto-encoding model based on Long Short Term Memory (LSTM) to
monitor the continuous BP by using the raw PPG signals, and the results In this Section, the overall framework of the proposed continuous BP
met the requirements of the Advancement of Medical Instrumentation monitoring method is shown in Fig. 1. This method mainly involves the
(AAMI) standard. Although these deep learning methods can work more data preparation phase and model building phase, which concretely
effectively than traditional methods, there is still space for further includes data acquisition, data preprocessing, dataset setup, model
improvement in the data mining of PPG signals. designing, and model training.
Recently, Ronneberger et al. [27] proposed the U-architecture con In the data preparation phase, in order to ensure the sufficiency and
volutional network (U-Net) for biomedical image segmentation, which richness of the samples, the PPG signals and their corresponding BP
had been attracted the attention of some researchers and helped them to records are downloaded firstly from the public Medical Information
solve many challenges in various fields. Athaya et al. [28] introduced Mart for Intensive Care III (MIMIC-III) database. Then, data pre
the U-Net into the continuous non-invasive BP monitoring based on the processing is used to eliminate motion artifacts and distortions of the
raw PPG signals, which obtained better estimate results than the other original data. Next, in order to mine more effective information from
compared methods and met the requirements of the AAMI and the preprocessed data, the VPG and APG signals are calculated and fused in
British Hypertension Society (BHS) standards. However, the classic U- parallel with the PPG signal, which is regarded as a single set of input
Net still has some shortcomings for mining the PPG signals. On the one data. Besides, the corresponding SBP and DBP of each sample are
hand, the U-Net implements the high-low scale feature fusion of the PPG calculated and regarded as the label of the subsequent proposed model.
2
M. Yu et al. Biomedical Signal Processing and Control 75 (2022) 103581
In the model building phase, the ARIU model is designed to learn In this part, we refer to the preprocessing method of other research [25].
high dimensional features from input signals adaptively, and the 5-fold As a widely used data stream smoothing and denoising method, the
cross validation strategy is utilized to train the designed model. In Savitzky-Golay filter can retain the distribution characteristics of rela
addition, the hyperparameters of the proposed model are adjusted ac tive maximum, minimum, and width in the time domain [36]. There
cording to the training results. Ultimately, after performance evaluation fore, this filter is employed to eliminate abnormal mutation values and
on the testing dataset, the model that met the performance requirements high-frequency noises in each reserved sample segment of PPG and BP
is used for continuous BP monitoring. signals. Then, based on the peak and valley detection algorithm pro
posed by Elgendi et al. [37], the effective peak and valley points of each
filtered sample segment. According to the detected points, the data be
2.1. Data preparation
tween two consecutive valley points in each sample segment is regarded
as one cycle. Moreover, if two valley points are too far or too close, this
2.1.1. Data acquisition
data of one cycle will be skipped. Next, the data with five consecutive
The dataset used in this study comes from the MIMIC-III waveform
cycles (e.g., from the first valley point to the sixth valley point) are
database [34,35], which contains 67,830 records of about 30,000 ICU
further segmented into one independent sample. And there is no overlap
patients, and almost all of these records consist of various digital
between two consecutive samples of five cycles. If the number of sam
physiological signals (such as ECG signals, BP signals, respiratory sig
ples for a patient does not exceed 1500, then this patient’s data should
nals, PPG signals, and other physiological signals). Each record corre
be discarded. In the end, if>1500 data segments are obtained from all
sponds to the physiological signals of an individual patient during the
the records of each patient, then these data segments of this patient have
whole hospitalization period. In this study, a total of 230 patients’ raw
been shuffled and 1500 data segments have been randomly selected.
data are downloaded from this open database, and the data of each
After the above preprocessing work, the number of patients in our
patient contains the PPG signals and the corresponding BP signals with a
dataset dropped from 230 to 100. Besides, for the PPG signal of each
sampling rate of 125 Hz with 8-bit, 10-bit, or 12-bit (occasionally) res
sample, the normalization is used to reduce its range to the interval of
olution. The lengths of these records also vary, most are a few days in
[0,1], which is shown in:
duration, but some are shorter and others are several weeks long.
xi − xmin
xnorm = (1)
2.1.2. Data preprocessing
i
xmax − xmin
The physiological signals from the MIMIC-III database contain a lot Finally, since the input of the subsequent deep neural network model
of noises, such as motion artifacts and distortions. No doubt that the 230 generally requires the same length, and the actual sampling points of the
patients’ raw data downloaded in our study also have quality problems, PPG signal of each sample which contains five consecutive cycles are
such as distorted signal (flat lines and flat peaks, etc) or motion artifacts, always inconsistent, such as three examples with different lengths as
so the preprocessing is indispensable for the acquired data. shown in Fig. 2. Therefore, this study uses the cubic spline interpolation
To ensure data synchronization between PPG signals and BP signals, method to establish a fitting function for the raw PPG signal of each
the acquired data are firstly divided into numerous sample segments normalized sample, then uses the fitting function to take fixed-length
according to the null position of the raw continuous signal. Each sample sampling points on the raw PPG signal of the sample. In this study,
segment that contained the abnormal data points (including flat lines, based on preliminary experiments, the length of sampling points is set to
flat peaks, etc.) is discarded to improve the efficiency of data processing.
3
M. Yu et al. Biomedical Signal Processing and Control 75 (2022) 103581
Fig. 2. Diagram of the resampling process for examples under different bpms.
600. Moreover, the examples from top to bottom in Fig. 2 respectively in each segment. Concretely, the example of PPG, VPG, and APG signal
correspond to a heart rate of 144 beats per minute (bpm), 67 bpm, and data from one sample is shown in Fig. 3.
59 bpm. Then, each sample (the parallel-based fusion of PPG, VPG, and APG
It can be seen that after the resampling process, this method does not signals) and its corresponding label (SBP and DBP) are assigned the same
change the waveform of the original signal, which not only retains the tag number. After repeating the above steps, a dataset containing the
physiological characteristics but also meets the input requirements of parallel-based fusion of PPG, VPG, and APG signals as input data and
the subsequent model. In addition, to overcome the time delay between SBP and DBP as labels are finally obtained in this study, which totally
the PPG signals and the BP signals, the average value of the five effective consisted of 100 patients and each patient has 1500 sets of input data
peak points and valley points of the BP signal which got through the and label. Then, the 150,000 samples processed in this study have been
same detection algorithm used on PPG signals [37] in each sample is randomly shuffled for the training and testing of the subsequent model.
used as the SBP and DBP respectively, that is, the label of the subsequent Correspondingly, the overall distribution of SBP and DBP is shown in
model. Fig. 4. It can be seen that the samples used in this study have covered
different BP situations, which basically meet the normal distribution in
2.1.3. Dataset setup both SBP and DBP.
To fully mine the potential information of raw time series PPG sig As shown in Table 1, the used dataset reach the requirement of the
nals, some researchers simultaneously used the first and second de AAMI standard for patient number. Besides, according to each patient’s
rivatives of PPG signals (i.e., VPG signals and APG signals) as
supplementary information without expanding other physiological sig
nals. Moreover, these studies have been shown that the fusion of PPG
signals and the derivative signals not only can effectively provide the
dimension of input information, but also can improve the accuracy of
continuous BP prediction obviously [25,33]. However, the cascade-
based fusion of input signals used in these studies didn’t consider the
synchronization and relationship in the time domain, which hindered
the efficiency and effectiveness of information extraction. Therefore, in
order to address this problem, the parallel-based fusion of PPG, VPG and
APG signals is used as the input signals of the subsequent proposed
model for continuous BP monitoring in this research.
For each processed PPG signal segment including five cycles in
Section 2.1.2, the first derivative of the fitting function of the PPG signal
is obtained in this section, that is, the VPG function. Then 600 points are
taken within the original data length (5 cycles) according to the function
so that the VPG signal is obtained. Similarly, the APG signal is obtained
Fig. 4. SBP and DBP distribution of patients in our dataset.
by taking the second derivative of the fitting function of the PPG signal
Fig. 3. Example of the PPG, VPG and APG signal data from one sample.
4
M. Yu et al. Biomedical Signal Processing and Control 75 (2022) 103581
Table 1 includes three modules: the basic network module, the attention
Requirements of subjects in the AAMI standard. mechanism module, and the residual mechanism module, which in
No. of subjects Female ratio Male ratio volves the contracting path and the expansive path.
As is shown in Fig. 5, the contracting path (on the left side of the
AAMI >85 >30% >30%
Our dataset 100 37% 63% attention modules) successively consists of 8 residual modules, which
divide into five down-sampling blocks with the channel size of 64, 128,
256, 512, and 1024 respectively. In addition, instead of the 2 × 2 max-
ID, the gender information is searched from the database, and the pooling operation with stride 2 for down-sampling, the 3 × 1 convolu
gender ratio of the used dataset is 63:37, which also meets the relevant tion operation with stride 2 is used for down-sampling between each
requirement of the AAMI standard. block to reduce the loss of key feature information. Motivated by that
different convolutional kernel sizes can increase the receptive field
2.2. Model building without loss of resolution and capture multi-scale key information, thus
after the last residual module in the contracting path, three parallel
2.2.1. Model designing convolution layers are set with kernel sizes of 1 × 1, 3 × 1, and 5 × 1 and
In this study, the ARIU model is designed to adaptively extract high- following with an attention module respectively.
dimensional features from input signals and effectively monitor In the expansive path, the output of three parallel attention modules
continuous BP (SBP and DBP values). The network structure of the is concatenated firstly by a convolution layer to achieve feature aggre
designed model is shown in Fig. 5. gation. Besides, the expansive path mainly contains 12 convolutional
In this model, the parallel-based fusion of PPG, VPG, and APG signal layers, which are divided into five up-sampling blocks with the same
is taken as input with a size of 600 × 3, and the array of SBP and DBP is channel size as the contracting path. Moreover, an attention mechanism
taken as output with a size of 2 × 1. Moreover, this model mainly is embedded in each skip connection between the contracting path and
5
M. Yu et al. Biomedical Signal Processing and Control 75 (2022) 103581
the expansive path, that is, the feature set obtained adaptively before [38]. Therefore, the attention mechanism based on the SE module was
each down-sampling in the contracting path is input into the attention embedded to the skip connections between the contracting and expan
module, and then its output is fused with the same-scale adaptive fea sive path, which was utilized to enhance the sensitivity of key multi-
tures set of the corresponding part in the expansive path. Through these scale features in the contracting path and reduce the information
attention mechanism modules, the high and low dimensional adaptive fusion of redundant features across paths [30,31]. Concretely, the
features obtained by the contracting and expansive path respectively can attention mechanism structure proposed in this paper based on the SE
be fully integrated, which effectively improves the multiscale repre module [38] is shown in Fig. 6.
sentation effectiveness of the extracted features. Finally, after a global As is shown in Fig. 6, for the input feature maps with the size of L × C,
average pooling layer behind the expansive path, SBP and DBP are the attention mechanism module applied global average pooling-based
respectively output through the fully connected layer. squeeze operations to aggregate the global information. In addition, to
take full advantage of the information, the exception operation is
2.2.1.1. Basic network module. As one of the typical structures of CNN, employed to recalibrate the channel-wise feature responses adaptively
the U-Net has achieved excellent performance in the field of medical according to interdependencies between channels. Concretely, the
image segmentation [27], which has demonstrated the strong ability of channel of aggregated features is firstly reduced from C to C/r through a
adaptive feature extraction and multi-scale feature learning and also has full connection layer in the exception operation. As a reduction ratio for
great potential for monitoring continuous BP by using PPG signals. varying the module capacity, the parameter r is set to 4 in this study by
Therefore, the basic network module of the proposed model is improved considering computation cost and model performance comprehensively.
based on the classic U-Net. For example, the channel size of the proposed Next, through the ReLU6 activation function layer shown as in formula 2
model is the same setting as the classic U-net [27]. Concretely, compared and a full connection layer, the amount of the channels is expanded from
to the original U-Net, the basic network module employed three parallel C/r to C. Then, a hard-swish activation function layer is adopted to
convolution operations with different feature filters in the last layer of replace the original sigmoid function layer, as shown in formula 3,
the contracting path to obtain different receptive fields, which has which further reduced the computation cost of the attention mechanism
effectively achieved the multi-scale feature extraction through the module [39].
feature aggregation. In addition, the classical image-oriented U-Net
ReLU6 = min(6, max(0, x)) (2)
usually adopted the max-pooling layer to realize the downsampling
operation. But for the low-width time-series signals, the max-pooling ReLU6(x + 3)
downsampling easily caused the loss of potential effective information hard - swish [x] = x (3)
6
to some extent. To this end, the convolution operation with the small
feature kernel is replaced the original max-pooling layer. Therefore, Finally, the recalibrated vector with a size of 1 × C is multiplied with
through these improvements on the classical U-Net, the basic network input features to obtain sensitive features with the size of L × C.
module is employed to adaptively learn the high dimensional multi- Consequently, by embedding the attention module in the skip connec
scale features from input signals. tions of the basic network in this study, the proposed model reduced the
redundancy of features from input signals and learn the sensitive fea
tures for adaptive BP monitoring.
2.2.1.2. Attention mechanism module. In the basic network module, the
convolution operations in both contracting and expansive path seam
2.2.2. Residual mechanism module
lessly integrate the spatial and channel information of the extracted
Although the attention mechanism can help the basic network
features on the different receptive fields, which obtained the richer and
extract sensitive features from the input signals, with the increasing of
more comprehensive multi-scale feature information. But the sensitivity
the convolution layers, the deep model confronted the vanishing
of this information to continuous BP monitoring is different, especially
gradient and the performance degradation. Through the convolutional
for the multi-channel input signals of PPG, VPG, and APG signals, the
mapping of skip connections, the residual network [40,41] can address
channel feature information is more important.
these problems and improve the ability of feature learning. Due to the
Through carrying out weighted learning on different feature chan
fact that the skip connections already existed in the expansive path of
nels, the squeeze and excitation (SE) module can effectively enhance
the basic network, the residual module shown in Fig. 7 is only utilized to
important feature information and suppress useless feature information
replace the conventional convolution in the expansive path of the
6
M. Yu et al. Biomedical Signal Processing and Control 75 (2022) 103581
7
M. Yu et al. Biomedical Signal Processing and Control 75 (2022) 103581
As shown in Fig. 5, based on the classic U-Net structure [27], the 3.3. Comparative experiment and analysis
proposed ARIU model is designed for continuous BP monitoring, which
mainly consists of three modules, namely an improved U-Net, an To further demonstrate the effectiveness and advancement of the
attention module embedded in the skip connections, and a residual proposed model, the comparative experiment is implemented with the
module replaced common convolution. To test the effectiveness of each existing state-of-art models in the publicized literature. Concretely, the
module, ablation experiments are conducted on the used dataset. details of these compared models are described in Table 4.
Firstly, based on the classic U-Net model, Model A is designed as the Since the fact that these studies have used different datasets or
basic reference model in the ablation experiment. Then, on the basis of different numbers of objects for BP monitoring, thus fair comparison
Model A, we modified the model step by step by optimizing the down- can’t be performed for BP measurement. To eliminate the impact of the
sampling layer, adding a new multi-scale feature extraction layer, object and database capacity, the comparisons of these models are car
attention modules, and residual modules. Concretely, the description of ried out on the same datasets used in this study by using the 5-fold cross-
each ablation model is described in Table 2, and the details of the validation. Finally, the results of the different models are shown in
improvement can be referred to in Section 2.2.1. In addition, the sepa Table 5.
rate PPG signals, the cascade-based fusion of the PPG, VPG, and APG It can be seen from Table 5, both MAE and STD of the SBP and DBP
signals (PPG + VPG + APG), and the parallel-based fusion of the PPG, predicted by the proposed model are obviously lower than other
VPG, and APG signals (PPG & VPG & APG) are used as input for training compared models obviously, which means that the proposed model in
the different models respectively. this paper has higher accuracy for BP monitoring. In addition, the scatter
In the ablation experiment, a five-fold cross-validation method is diagram of the predicted values of SBP and DBP relative to the actual
used to train each model described in Table 2. In the training and vali values are presented in Fig. 11 and Fig. 12 respectively.
dating process, the change curves of MAE of SBP and DBP under the It can be seen intuitively that, whether for SBP or DBP, the predicted
different models are drawn in Fig. 9 and Fig. 10 respectively. It can be values of the proposed model are closer to the actual values. As shown in
seen that the classic U-Net structure (Model A) converges faster Fig. 11 and Fig. 12, the Pearson correlation coefficient of SBP and DBP
benefiting from fewer total parameters, but the proposed model has predicted by the model proposed in this paper are 0.932 and 0.905
higher accuracy when the model converges. Besides, although the input respectively, which demonstrated that the proposed model has a better
form of “PPG + VPG + APG” increases the amount of information, the performance.
monitoring effects of Model A, Model B, and Model C deteriorate obvi The error distributions of the SBP and DBP predicted by each model
ously without considering the time-series correlation among PPG, VPG, are presented in Fig. 13 and Fig. 14 respectively. According to the dis
and APG. tribution histograms, it is obviously found that compared with other
Finally, the average results of two measurement criteria on the models, the error distribution of the proposed model is significantly
testing sets are shown in Table 3 below. more concentrated and closer to zero. In addition, to quantitatively
As can be seen from Table 3, the MAE and STD of SBP predicted by analyze the error distribution of these models, the percentage of samples
the proposed model are 4.75 mmHg and 6.72 mmHg respectively, which that the predicted error between + 5 mmHg and − 5 mmHg are shown in
are the minimum values in all ablation models under different inputs. the upper-left of these figures. It can be seen that the proposed model has
Meanwhile, the MAE and STD of DBP predicted by the proposed model more small error samples in predicting BP. Therefore, these results
are 2.81 mmHg and 4.59 mmHg respectively, which are also the mini further demonstrated that the proposed model can more accurately
mum values in all ablation models under different inputs. These results monitor continuous BP.
show that the proposed model can effectively improve the accuracy of Furthermore, the Bland-Altman analysis is carried out for the pro
continuous BP monitoring. posed model, and the analysis results are drawn in Fig. 15.
The superiorities of both the model network and the input forms As is shown in Fig. 15, it can be found that 94.95% of samples are
proposed in this research are effectively verified by the ablation exper within the consistency boundary for SBP, and 95.77% of samples are
iments. On the one hand, from the perspective of the model structure, within the consistency boundary for DBP. The results show that the
the predicted errors of the proposed model under three input forms are predicted BP values of the proposed model are in good agreement with
not only smaller than the basic U-Net model, but also as a whole smaller the actual BP value.
than other ablation models. On the other hand, from the perspective of
the input form, the predicted errors of the parallel-based “PPG & VPG & 3.4. Verification with international standards
APG” signals under five ablation models are not only smaller than the
separate PPG signals but also as a whole smaller than the cascade-based To verify the practical significance of the proposed model, the pre
dicted results are compared with two internationally recognized
sphygmomanometer standards, namely AAMI [45] and BHS [46]. The
Table 2
result of this comparison means whether the method of this research has
The description of the models for ablation experiment.
the ability to meet the error requirements of these standards in the
Model Description
actual application of the measurement process. Especially for contin
Model A Classic U-Net [27]: imitate the model designed by the U-net uous BP monitoring, every real-time calculation needs to meet this error
proponent, including structure, parameters, and dimensions. requirement.
Model B Improved U-Net: optimized the down-sampling layer and added a
new multi-scale feature extraction layer in the contracting path
AAMI standard is used for non-invasive BP monitoring, which mainly
based on Model A. includes the number of subjects, MAE, and STD criteria. The standard
Model C Model B integrated the residual mechanism: replaced the conventional requires MAE and STD to be less than 5 mmHg and 8 mmHg respec
convolution modules with the residual modules in the contracting tively. As is shown in Table 6, for the proposed model, the MAE of SBP
path based on Model B.
and DBP are 4.75 mmHg and 2.81 mmHg respectively, and the STD of
Model D Model B integrated the attention mechanism: embedded the attention
module in the skip connections and the multi-scale feature SBP and DBP are 6.72 and 4.69 respectively, which all of them are
extraction layer based on Model B. satisfied with the requirements of the AAMI standard.
Proposed Model B integrated the attention mechanism and the residual BHC standard takes the percentage of the samples within three
Model mechanism simultaneously: the details are described in Section 2.2.1 different error thresholds as the evaluation standard. The standard has
Model designing.
three grades with A/B/C, and each grade has three thresholds with 5/
8
M. Yu et al. Biomedical Signal Processing and Control 75 (2022) 103581
Fig. 9. Diagrams of the training and validation curves for SBP under different ablation models.
Fig. 10. Diagrams of the training and validation curves for DBP under different ablation models.
10/15 mmHg. As is shown in Table 7, both SBP and DBP predicted by the study passed the standard except for the proposed model. Therefore,
proposed model are achieved Grade A within all three thresholds. verified with the requirements of AAMI and BHS standards, it can be
It is worth noting that none of the other comparative models in this found that the model proposed in this paper has excellent performance
9
M. Yu et al. Biomedical Signal Processing and Control 75 (2022) 103581
Table 3
Performances of the ablation models under different inputs.
Model PPG PPG + VPG + APG PPG & VPG & APG
MAE STD MAE STD MAE STD MAE STD MAE STD MAE STD
Model A 6.65 7.58 4.51 5.77 6.46 7.31 3.89 5.12 6.29 7.02 3.77 4.93
Model B 6.36 7.51 3.75 5.20 6.33 7.24 3.67 4.98 6.29 7.43 3.67 5.07
Model C 6.29 7.44 3.69 5.08 6.31 7.36 3.63 4.95 6.27 7.49 3.67 5.10
Model D 5.23 7.26 3.08 4.97 5.29 7.30 3.07 4.98 5.00 6.95 2.95 4.86
ProposedModel 4.91 6.87 2.91 4.85 5.16 7.13 3.04 5.05 4.75 6.72 2.81 4.59
can improve the predicting accuracy of SBP and DBP certainly. In gen
Table 4
eral, the results of the ablation and comparison experiments and the
The description of the models for comparison experiment.
significance test demonstrate the effectiveness and improvement of the
Model Descriptions proposed model and the advancement of the designed structure and the
Model 1 Spectro-Temporal Deep Neural Network: Deep residual network fused input form.
model based on time–frequency domain information fusion [25]. Next, a significance test is performed between the proposed model
Model 2 PP-Net Model: Deep network model combined CNN with LSTM
and other advanced models, and the results are shown in Table 9.
[43].
Model 3 ResNet18: Classic deep residual network [40]. Analyzing the results of significant experiments, it can be seen that the
Model 4 Deep network model based on CNN, Bi-GRU, and attention module proposed model has a very significant difference compared with other
[44]. comparative models. Combining the results in Table 5, it can be
Model 5 U-Net deep learning model: Classic U-Net modified with Leaky ReLU concluded that the model proposed in this study has obvious
activation function, dropout layer, convolution layer in expansion
advantages.
path [28].
Proposed Improved U-Net integrated attention mechanism module and Finally, according to the comparison between the monitoring results
Model residual mechanism module. of the proposed model and the AAMI and BHS standards in Table 6 and
Table 7, the proposed model has practical application potential in the
current wearable smart device field. And it also provides new ideas for
Table 5 other researchers in terms of model structure design and data input form
Comparison results of the different models. to some extent.
Model SBP/mmHg DBP/mmHg
Since the dataset used in this study is limited, the model is not
lightweight enough, and there is still a great potential for optimization.
MAE STD MAE STD
In future work, on the one hand, we will increase the capacity of data to
Model 1 12.23 10.67 8.64 7.69 enhance the generalization ability of the model. On the other hand, try
Model 2 9.29 5.97 9.31 6.28 to reduce the model parameter to decrease computing resource
Model 3 8.09 8.20 4.99 5.66
Model 4 7.56 8.11 4.56 5.47
consumption.
Model 5 6.58 7.22 4.08 5.05
Proposed Model 4.75 6.72 2.81 4.59 4. Conclusion
10
M. Yu et al. Biomedical Signal Processing and Control 75 (2022) 103581
Fig. 11. Diagram of the correlation analysis between the predicted and actual SBP.
Fig. 12. Diagrams of the correlation analysis between the predicted and actual DBP.
11
M. Yu et al. Biomedical Signal Processing and Control 75 (2022) 103581
Fig. 13. Error distribution histograms of the predicted SBP under different models.
Fig. 14. Error distribution histograms of the predicted DBP under different models.
12
M. Yu et al. Biomedical Signal Processing and Control 75 (2022) 103581
Fig. 15. Bland-Altman plots for SBP (left) and DBP (right).
13
M. Yu et al. Biomedical Signal Processing and Control 75 (2022) 103581
[17] Y. Liang, M. Elgendi, Z. Chen, et al., An optimal filter for short [32] W.H. Lin, F. Chen, Y. Geng, et al., Towards accurate estimation of cuffless and
photoplethysmogram signals, Sci. Data 5 (1) (2018) 1–12. continuous blood pressure using multi-order derivative and multivariate
[18] Teng XF, Zhang YT. Continuous and noninvasive estimation of arterial blood photoplethysmogram features, Biomed. Signal Process. Control 63 (2021) 102198.
pressure using a photoplethysmographic approach[C]//Proceedings of the 25th [33] S. Shimazaki, H. Kawanaka, H. Ishikawa, et al.. Cuffless blood pressure estimation
Annual International Conference of the IEEE Engineering in Medicine and Biology from only the waveform of photoplethysmography using CNN[C]//2019 41st
Society (IEEE Cat. No. 03CH37439). IEEE, 2003, 4: 3153-3156. Annual International Conference of the IEEE Engineering in Medicine and Biology
[19] A.S. Zadi, R. Alex, R. Zhang, et al., Arterial blood pressure feature estimation using Society (EMBC), IEEE, 2019, pp. 5042–5045.
photoplethysmography, Comput. Biol. Med. 102 (2018) 104–111. [34] Moody, B., Moody, G., Villarroel, M., Clifford, G., & Silva, I. (2020). MIMIC-III
[20] P. Li, T.M. Laleg-Kirati, Schrödinger Spectrum Based PPG Features for the Waveform Database (version 1.0). PhysioNet. https://doi.org/10.13026/c2607m.
Estimation of the Arterial Blood Pressure[C]//2020 42nd Annual International [35] A.E.W. Johnson, T.J. Pollard, L. Shen, L.H. Lehman, M. Feng, M. Ghassemi,
Conference of the IEEE Engineering in Medicine & Biology Society (EMBC), IEEE B. Moody, P. Szolovits, L.A. Celi, R.G. Mark, MIMIC-III, a freely accessible critical
(2020) 2683–2686. care database, Sci. Data 3 (2016), 160035.
[21] Y. Kurylyak, F. Lamonaca, D. Grimaldi, A Neural Network-based method for [36] M.J. Niedźwiecki, M. Ciołek, A. Gańcza, et al., Application of regularized Savitzky-
continuous blood pressure estimation from a PPG signal[C]//2013 IEEE Golay filters to identification of time-varying systems✩, Automatica 133 (2021),
International instrumentation and measurement technology conference (I2MTC), 109865.
IEEE (2013) 280–283. [37] M. Elgendi, I. Norton, M. Brearley, et al., Systolic peak detection in acceleration
[22] E. Martinez-Ríos, L. Montesinos, M. Alfaro-Ponce, et al., A review of machine photoplethysmograms measured from emergency responders in tropical
learning in hypertension detection and blood pressure estimation based on clinical conditions, PLoS ONE 8 (10) (2013), e76585.
and physiological data, Biomed. Signal Process. Control 68 (2021), 102813. [38] Hu J, Shen L, Sun G. Squeeze-and-excitation networks[C]//Proceedings of the IEEE
[23] C. El-Hajj, P.A. Kyriacou, Deep learning models for cuffless blood pressure conference on computer vision and pattern recognition. 2018: 7132-7141.
monitoring from PPG signals using attention mechanism, Biomed. Signal Process. [39] A. Howard, M. Sandler, G. Chu, et al., Searching for mobilenetv3[C]//Proceedings
Control 65 (2021), 102301. of the IEEE/CVF, International Conference on Computer Vision (2019) 1314–1324.
[24] S. Shimazaki, S. Bhuiyan, H. Kawanaka, et al., Features extraction for cuffless blood [40] He K, Zhang X, Ren S, et al. Deep residual learning for image recognition[C]//
pressure estimation by autoencoder from photoplethysmography[C]//2018 40Th Proceedings of the IEEE conference on computer vision and pattern recognition.
annual international conference of the IEEE engineering in medicine and biology 2016: 770-778.
society (EMBC), IEEE (2018) 2857–2860. [41] M. Owais, M. Arsalan, J. Choi, et al., Artificial intelligence-based classification of
[25] G. Slapničar, N. Mlakar, M. Luštrek, Blood pressure estimation from multiple gastrointestinal diseases using endoscopy videos for clinical diagnosis,
photoplethysmogram using a spectro-temporal deep neural network, Sensors 19 J. Clin. Med. 8 (7) (2019) 986.
(15) (2019) 3420. [42] Kingma D P, Ba J. Adam: A method for stochastic optimization. arXiv preprint
[26] L.N. Harfiya, C.C. Chang, Y.H. Li, Continuous Blood Pressure Estimation Using arXiv:1412.6980, 2014.
Exclusively Photopletysmography by LSTM-Based Signal-to-Signal Translation, [43] M. Panwar, A. Gautam, D. Biswas, et al., PP-Net: A deep learning framework for
Sensors 21 (9) (2021) 2952. PPG-based blood pressure and heart rate estimation, IEEE Sens. J. 20 (17) (2020)
[27] O. Ronneberger, P. Fischer, T. Brox, U-net: Convolutional networks for biomedical 10000–10011.
image segmentation[C]//International Conference on Medical image computing [44] H. Eom, D. Lee, S. Han, et al., End-to-end deep learning structure for continuous
and computer-assisted intervention, Springer, Cham, 2015, pp. 234–241. blood pressure estimation using attention mechanism, Sensors 20 (8) (2020) 2338.
[28] T. Athaya, S. Choi, An estimation method of continuous non-invasive arterial blood [45] G.S. Stergiou, B. Alpert, S. Mieke, et al., A universal standard for the validation of
pressure waveform using photoplethysmography: A U-Net structure-based blood pressure measuring devices: Association for the Advancement of Medical
approach, Sensors 21 (5) (2021) 1867. Instrumentation/European Society of Hypertension/International Organization for
[29] Zhang L, Ji Y, Lin X, et al. Style transfer for anime sketches with enhanced residual Standardization (AAMI/ESH/ISO) Collaboration Statement, Hypertension 71 (3)
u-net and auxiliary classifier gan[C]//2017 4th IAPR Asian Conference on Pattern (2018) 368–374.
Recognition (ACPR). IEEE, 2017: 506-511. [46] E. Obrien, B. Waeber, G. Parati, et al., Blood pressure measuring devices:
[30] X. Chen, L. Yao, Y. Zhang. Residual attention u-net for automated multi-class recommendations of the European Society of Hypertension, BMJ-Br. Med. J. 322
segmentation of covid-19 chest ct images. arXiv preprint arXiv:2004.05645 (2020). (7285) (2001) 531–536.
[31] J. Wang, X. Zhang, P. Lv, et al. EAR-U-Net: EfficientNet and attention-based
residual U-Net for automatic liver segmentation in CT. arXiv preprint arXiv:
2110.01014 (2021).
14