Professional Documents
Culture Documents
A Blood Pressure Prediction Method Based On Imaging Photoplethysmography in Combination With Machine Learning
A Blood Pressure Prediction Method Based On Imaging Photoplethysmography in Combination With Machine Learning
A Blood Pressure Prediction Method Based On Imaging Photoplethysmography in Combination With Machine Learning
A R T I C L E I N F O A B S T R A C T
Keywords: This paper proposes a non-contact blood pressure implement (NCBP) system based on imaging photo
Non-contact measurement plethysmography (IPPG) The system collects facial videos through a webcam under ambient light, and extracts
Blood pressure (BP) pulse wave signals from the videos by means of IPPG technology. From the signals (also called IPPG signals), we
Imaging photolethysmography (IPPG)
extracted 26 features for estimating blood pressure (BP), and trained them through four machine learning al
Webcam-based
Machine learning (ML)
gorithms. Finally, we selected the most accurate model for blood pressure prediction. By experimenting on 191
volunteers and comparing four models, support vector regression (SVR) is the best model for predicting blood
pressure. The results of SVR are that the standard deviation (STD) and mean absolute error (MAE) of systolic
blood pressure (SBP) are 3.35 mmHg, 9.97 mmHg, and those of diastolic blood pressure (DBP) are 2.58 mmHg,
7.59 mmHg respectively. We conclude that through our proposed system based on IPPG technology, blood
pressure can be accurately predicted in a non-contact way. In addition, this paper proposes two new methods, the
region of interest (ROI) selection method based on colormaps and robust peak extraction method, which solve
the key steps in IPPG technology. Finally, we discussed the influence of light intensity on the experiment, and
simplified the NCBP experimental device. The system has the potential of replacing the traditional cuff-based
sphygmomanometers, and has guiding significance to the future development of blood pressure measurement
devices.
Abbreviations: NCBP, non-contact blood pressure; IPPG, imaging photoplethysmography; BP, blood pressure; SVR, support vector regression; STD, standard
deviation; MAE, mean absolute error; SBP, systolic blood pressure; DBP, diastolic blood pressure; ROI, region of interest; ML, machine learning; CVD, cardiovascular
disease; NIBP, non-invasive blood pressure; PPG, photoplethysmography; PTT, pulse transit time; ECG, electrocardiograph; ANN, artificial neural network; FPS,
frames per second; RGB, red green and blue; HR, heart rate; MLR, multiple linear regression; RF, random forest; MLP, multi-layer perceptron; ME, mean bias; BHS,
British Hypertension Society; AAMI, Advancement of Medical Instrumentation.
* Corresponding author.
E-mail addresses: rm0233@163.com (M. Rong), lky@whu.edu.cn (K. Li).
https://doi.org/10.1016/j.bspc.2020.102328
Received 25 May 2020; Received in revised form 17 October 2020; Accepted 4 November 2020
Available online 21 November 2020
1746-8094/© 2020 Elsevier Ltd. All rights reserved.
M. Rong and K. Li Biomedical Signal Processing and Control 64 (2021) 102328
blood pressure can be estimated based on PTT. Joonnyong Lee et al. [7] condition of sufficient light, NCBP measurement does not need
analyzed 2309 surgical patients to assess the correlation between PTT additional light source. Only one camera is needed to achieve blood
and BP. Correlation analysis shows that the average correlation between pressure prediction.
PTT and BP extracted from the intersection tangent point of PPG is the 3) We proposed a method for selecting region of interest (ROI) based on
highest. While estimating blood pressure through PTT, Vicent et al. [8] facial colormaps and human appearance features. This method
introduced an artificial neural network method for analysis and ach observe the distribution of light intensity on faces of volunteers by
ieved good results. By comparing the PPG signals obtained from the means of colormaps. At the same time, combine the physical char
wrist and fingers, Satu et al. [9] proved that the PPG signal at the wrist acteristics of both man and women to find the most suitable area for
can also be used to calculate PTT and then to estimate blood pressure. analysis.
Robert C et al. [10] analyzed the PPG signal (from fingers, toes and ears) 4) A new peak detection algorithm was proposed. This new method can
and ECG signal to obtain the PTT between different parts. The result accurately identify the peak and valley points, even if the waveform
shows that the best correlation by a substantial extent was between toe has noises.
PTT via the PPG foot and systolic BP.
In the NIBP field, another analysis method is to establish a blood The rest of this paper is structured as follows. Section 2 is an over
pressure prediction model through PPG signal features and other related view of the whole system. Section 3 describes the IPPG signal processing
features. Various studies [11–13] extracted dozens or hundreds of fea in detail. Section 4 shows the machine learning module. Section 5 is the
tures from PPG and ECG waveforms and personal information (such as result of our works. Section 6 and Section 7 are discussion and conclu
age, weight, etc.), and extracted features that are highly related to blood sion respectively.
pressure. The blood pressure prediction model is established through
these high correlation features, and got some good results. Although the 2. System overview
field of NIBP has made great progress, most of the work is to collect PPG
signals by directly or indirectly contacting the human body through This section provides an overview of the system from three aspects.
sensors. This will cause measurement inconvenience to a certain extent. One is the description of the experimental equipment. The second is the
Recently, some scholars have studied on non-contact blood pressure detailed information of the experimental data. The third is the flow chart
measurement (NCBP). The NCBP is to detect the physiological signals of the entire system.
without touching the human body through camera equipment. This
method can realize blood pressure estimation more conveniently, which 2.1. Experimental device
is a promising direction. Some studies [14–16] took videos from two
parts of body, and predicted blood pressure by PTT. However, their The system uses Logitech C922 PRO webcam for video capture. The
methods are complicated. Firstly, in their works, the acquisition of PTT webcam can reach a frame rate of 30FPS (frames per second) at 1080p
requires photographing from multiple parts of human body, this will and 60 FPS at 720p. In our experiments, the frame rate of facial videos is
cause a certain degree of inconvenience when shooting. Secondly, the 30FPS and the video resolution is 1280*720. Before taking videos, the
devices used in these methods are expensive and need external light volunteers rested for five minutes to ensure a stable physiological state.
sources, which lead to poor practicality. Secerbegovic et al. [17] After that, each volunteer took a 40 seconds video through the webcam.
established the model between PTT and blood pressure through simple The volunteers were told to keep their heads stationary to ensure the
linear regression model. The error of this method did not meet the in stability of IPPG signals. The video acquisition process was performed
ternational standards. Hong et al. [18] collected facial videos through a under ambient light without auxiliary light sources. At the same time,
smartphone camera. They extracted features related to blood pressure we collected blood pressure and heart rate through a sphygmomanom
by transdermal optical imaging technology, and used artificial neural eter (OMRON HEM-1020, which complies with American Medical De
network (ANN) model for training and predicting. They did a lot of vice Promotion Association standard) for comparison analysis. The
experiments to prove the correlation between blood flow in the facial measurement system and device is shown in Fig. 1.
vasculature and blood pressure. However, the experimental data of their
work were limited to the normotensive people, and only ANN model was 2.2. Experimental database
used for training and predicting. In addition, the system still required
external light sources. In our experiments, a total of 191 volunteers were collected in School
Based on the above background, this paper proposes a non-contact of Physical Science and Technology of Wuhan University (Wuhan,
blood pressure measurement method by IPPG technology. This system China) and Wuhan Haobo Technology Co., Ltd. (Wuhan, China, a
used a webcam to capture facial videos under ambient light without
extra light sources. IPPG signals extracted from the facial videos based
on imaging photoplethysmography technology. Anand et al. [19] veri
fied that using photoplethysmography of other parts from human body
can accurately estimate blood pressure without brachial cuff for cali
brating. The features related to blood pressure were extracted from IPPG
signal, and then several machine learning methods were used to build
the blood pressure prediction model. Our main contributions includes:
2
M. Rong and K. Li Biomedical Signal Processing and Control 64 (2021) 102328
1) Acquisition of facial videos through a webcam. Where, ε(λ) is the absorption coefficient, C is the medium concentration,
2) Obtain three RGB (red green and blue) channel signals from the and L is the travel distance of light in the material. When light irradiates
videos. the skin tissue, with the periodic change of blood volume in the blood
3) Filter and extract the peak points and valley points of IPPG signals. vessel, the light intensity will also change accordingly, and the reflected
4) Extract features from the clean IPPG signals. light intensity contains a lot of physiological information of human
5) Establish blood pressure prediction models through four machine body.
learning algorithms. IPPG signal is obtained by averaging the pixel values of ROI region of
each frame according to the Eq. 2.
The system flow chart is shown in Fig. 3. ∑
M ∑
N
sippg (i, j, t)
3. IPPG signal processing (2)
i=1 j=1
Sippg (t) =
M∗N
For IPPG signal processing, it mainly includes three steps. Firstly, Where t represents frame sequence, M and N are the height and width of
extract the initial IPPG signal from the facial video. Secondly, filter the region of ROI.
IPPG signal to obtain the clean signal. Thirdly, the peak point detection After getting the facial videos, these videos were decomposed into
of IPPG signal. frame pictures, and then three components of RGB color channels of
each frame were extracted form the videos. Finally, three channel IPPG
3.1. Extraction of IPPG signal signals are obtained by the above formula as shown in Fig. 4.
As the optical absorption property of hemoglobin reach the peak at
3.1.1. IPPG signal extraction 500nm-600 nm, which corresponds to the green channel signal [20],
IPPG technology is developed based on Lambert-Beer law and light [21]. Besides, in the period between systolic and diastolic, the signal of
scattering theory. Lambert-Beer law describes the attenuation charac green channel changes most obviously, so the green channel signal was
teristics of light passing through a homogeneous medium with absorp selected as IPPG signal in this paper. It is worth mentioned that our
tion properties [apped1]. According to the law, when the method only needs to shoot one region of human body.
monochromatic light with wavelength λ shines on a certain solution
substance, the relationship between the transmitted light intensity I and 3.1.2. Region of interest selection
the emitted light intensity I0 is as follows: Generally, we will use the whole facial region as ROI, but this is
I = I0 e− ε(λ)CL
(1) worth discussing. The choice of ROI will affect the quality of IPPG
Fig. 2. Blood pressure distribution histograms. (a) is the data distribution of SBP, where the minimum value is 92 and the maximum value is 148, (b) is the data
distribution of DBP, where the minimum value is 51 and the maximum value is 102.
3
M. Rong and K. Li Biomedical Signal Processing and Control 64 (2021) 102328
Fig. 4. IPPG signal extraction. Extract the red, green and blue signals from the face video separately. Because of the optical absorption property of hemoglobin reach
the peak at 500 nm–600 nm, which corresponds to the green channel signal. We choose the green channel signal as IPPG signal.
signals because of some factors such as facial motion artifacts (e.g., signal is down-sampled. After sampling, the signal is decomposed into
blinking, wrinkling nose, etc.), light intensity distribution, and human two components: one is the low-pass filter component that approximate
appearance features. In our experiments, volunteers were required to to the original signal and the other is the high-pass filter component with
keep their heads stable. Thus, we discussed the problem of ROI selection details of the original signal. Generally speaking, the baseline wander
from the perspective of facial colormaps and human appearance phenomenon occurs in the low frequency region [22]. Besides, the frame
features. rate of video capture is 30FPS and the typical respiration rate is between
IPPG signal is composed of the light intensity that transmitted 0.14 and 0.75 Hz [23], corresponding to the fifth layer of wavelet
through human face. Facial light intensity is actually the size of pixel decomposition. Based on the above analysis, we select Sym6 mother
values. The pixel values of the location are also large where the light wavelet to decompose IPPG signal in five layers, and take the fifth layer
intensity is strong, and the intensity of IPPG signal is strong. We selected as the baseline drift signal. We subtracted the fifth layer low-frequency
six volunteers to observe the distribution of light intensity on their faces component of wavelet signals from the original IPPG signal, so as to
by means of colormaps. achieve the purpose of eliminating the baseline drift phenomenon.
From Fig. 5, it can be observed that the light intensity distribution of Further, it was verified that the frequency range of heart rate is 0.7
forehead and cheek areas are more strong than other areas (shown as the Hz–4 Hz [24]. A Butterworth band-pass filter (0.7 Hz–4 Hz) was adopted
color of the facial area is more inclined to red, purple and white), to remove the noise caused by involuntary shaking and fluorescent
indicating that the IPPG signals amplitude obtained from these regions lamps power frequency. This method can make the whole waveform
are higher. In addition, women’s hair frequently covers their forehead smoother, so as to facilitate the extraction of feature points. Besides, the
(e.g., Fig. 5(C)). Although the light intensity distribution of forehead frequency response curve of Butterworth filter in the pass band is flat as
area is strong, this area is not selected as ROI in this paper. far as possible without fluctuation. The useful information in IPPG signal
Finally, the ROI was finally selected as cheek and nose areas, as can be retained to the greatest extent.
shown by the red rectangle in Fig. 4. Finally, we got the clean IPPG signal. The filtering process of original
IPPG signal is shown in Fig. 6.
4
M. Rong and K. Li Biomedical Signal Processing and Control 64 (2021) 102328
Fig. 5. Colormaps of facial area. The red to white area indicates that the light intensity distribution in this area is relatively strong. The strength of the IPPG signal
obtained from this area is also relatively strong.
5
M. Rong and K. Li Biomedical Signal Processing and Control 64 (2021) 102328
4.1.3. K value
Yang et al. have confirmed that there is a significant correlation
between K value and arterial blood pressure [26], so K value was used as
one of the features in this paper. The K value is calculated as follows:
oo’ I
Sippg − Sippg (to )
SKippg = I I
(9)
Sippg (to ) − Sippg (to’ )
In which
Fig. 7. Peak point extraction. The colored dots represent the peak points found
by the algorithm. 1 ∑
to ’
oo’
Sippg = SI (t) (10)
⎧ to’ − to t=to ippg
⎨ 1
III II II
Sippg (t) = Sippg (t + 1) − Sippg (t) = 0 , t ∈ (0, 1, ⋯, Nframe − 1) (6) Here o and o’ are the start and end points of each period of IPPG signal.
⎩
− 1
Since SIII
ippg (t) = 0 is unnecessary data, it is not shown in Fig. 7.
Where SMAX
ippg represents the maximum points sequence of IPPG signal,
and max() is the sign that find the maximum values.
This is the solution to distinguish the peak points of IPPG signal. Fig. 8. Time-domain features of IPPG signal. From the IPPG waveform, we
obtained 22 features based on the shape of the waveform.
6
M. Rong and K. Li Biomedical Signal Processing and Control 64 (2021) 102328
Fig. 9. Correlation heat map, 1-26 on the axis represents F1-F26 features. The darker the square, the higher the correlation between the corresponding features, and
the lighter the square, the lower the correlation between the features.
7
M. Rong and K. Li Biomedical Signal Processing and Control 64 (2021) 102328
C 1 Table 3
SBP
Gamma 30 Four machine learning model results. It can be seen from the table, the accuracy
SVR
C 0.5 of MLR model is poor for predicting blood pressure, the accuracy of RF is similar
DBP
Gamma 20
to that of MLP algorithm, the MAE and mean bias of SVR model are the smallest.
n_estimators 22
SBP
max_features 5 Model SBP DBP
RF
n_estimators 16
DBP STD MAE Mean STD MAE Mean
max_features 5
(mmHg) (mmHg) Bias (mmHg) (mmHg) Bias
hidden_layer_sizes (12,1)
(mmHg) (mmHg)
SBP activation Relu
solver adma MLR 7.92 12.05 6.04 5.95 8.75 1.58
ANN
hidden_layer_sizes (12,1) SVR 3.35 9.97 2.1 2.58 7.59 0.79
DBP activation Relu RF 6.52 10.41 3.92 6.52 8.65 1.34
solver adma MLP 1.65 10.76 − 3.87 1.65 8.79 − 1.79
8
M. Rong and K. Li Biomedical Signal Processing and Control 64 (2021) 102328
Table 4
Comparison with BHS standard. It can be seen from the table, our experimental
results comply with BHS standards.
<5 mmHg <10 mmHg <15 mmHg
SBP Totals 27 44 53
DBP Totals 31 48 55
This paper
SBP (%) 48.2% 78.6% 94.6%
DBP (%) 55.4% 85.7% 98.2%
Grade A (%) 60% 85% 95%
BHS Grade B (%) 50% 75% 90%
Grade C (%) 40% 76% 85%
15mmhg. For DBP, it achieved grade B on the index of <5 mmHg and
grade A on the two indexes of <15 mmHg and <10 mmHg. The proposed
system accuracy is within the allowable range of the BHS standard.
Table 5 shows a comparison between the results of SVR model and
the Advancement of Medical Instrumentation (AAMI) [35] standard.
The standard requires that the ME and the STD for predicting blood
pressure values be less than 5 mmHg and 8 mmHg respectively. It can be
seen from the table that the ME and the STD of SBP are 2.1 mmHg and
3.35 mmHg respectively, the ME and the STD of DBP are 0.79 mmHg
and 2.58 mmHg respectively. For SBP and DBP, these two indicators
meet the requirements of AAMI standard. The results show that the
proposed system accuracy is completely in conformity with the AAMI
standard.
9
M. Rong and K. Li Biomedical Signal Processing and Control 64 (2021) 102328
Table 6
Error information for the effect of the light intensity on BP
Statistic SBP(mmHg) DBP(mmHg)
internationally gold standard, but our system has not been verified.
Therefore, future experiments must use manual sphygmomanometers
for measurement, which is more convincing as a reference blood
pressure.
7. Conclusion
Ethical approval
10
M. Rong and K. Li Biomedical Signal Processing and Control 64 (2021) 102328
Declaration of Competing Interest [16] I.C. Jeong, J. Finkelstein, Introducing contactless blood pressure assessment using
a high speed video camera, J. Med. Syst. 40 (January (4)) (2016) 77.
[17] A. Secerbegovic, et al., Blood pressure estimation using video plethysmography,
The authors report no declarations of interest. Proc. IEEE Int. Symp. Biomedical Imaging (2016) 461–464.
[18] L. Hong, et al., Smartphone-based blood pressure measurement using transdermal
Appendix A. Supplementary data optical imaging technology, Circ-Cardiovasc. Imag. 12 (August (8)) (2019)
e008857.
[19] A. Chandrasekhar, et al., Smartphone-based blood pressure monitoring via the
Supplementary material related to this article can be found, in the oscillometric finger-pressing method, Sci. Transl. Med. 78 (March (431)) (2018)
online version, at https://doi.org/10.1016/j.bspc.2020.102328. eaap8674.
[20] U. Bal, Non-contact estimation of heart rate and oxygen saturation using ambient
light, Biomed. Opt. Express 6 (December (1)) (2015) 86–97.
References [21] A.R. Guazzi, et al., Non-contact measurement of oxygen saturation with an RGB
camera, Biomed. Opt. Express 6 (October (9)) (2015) 3320–3338.
[1] World Health Statistic Overview 2019, World Health Organization, Geneva, [22] S. Wang, et al., Adaptive pulse oximeter with dual-wavelength based on wavelet
Switzerland, 2019. transforms, Opt. Express 21 (October (20)) (2013) 23058–23067.
[2] R.B. D’Agostino, et al., General cardiovascular risk profile for use in primary care: [23] S. Fleming, et al., Normal ranges of heart rate and respiratory rate in children from
the Framingham Heart Study, Circulation 117 (January (6)) (2008) 743–753. birth to 18 years of age: a systematic review of observational studies, Lancet 377
[3] J.C. Petrie, et al., Recommendations on blood pressure measurement, Br. Med. J. (March (9770)) (2011) 1011–1018.
(Clinical research ed.) 293 (September (6547)) (1986) 611. [24] M.Z. Poh, et al., Advancements in noncontact, multiparameter physiological
[4] L. Peter, et al., A review of methods for non-invasive and continuous blood pressure measurements using a webcam, IEEE Trans. Biomed. Eng. 58 (January (1)) (2010)
monitoring: Pulse transit time method is promising? IRBM 35 (5) (2014) 271–282. 7–11.
[5] Y. Choi, et al., Noninvasive cuffless blood pressure estimation using pulse transit [25] R.B. Dunn, et al., Detection of transient signals using the energy operator, Proc.
time and Hilbert–Huang transform, Comput. Electr. Eng. 39 (January (1)) (2013) IEEE ICASSP-93 (1993) 145–148.
103–111. [26] H. Yang, et al., Relationship between vascular elasticity and human pulse
[6] R. Mukkamala, et al., Toward ubiquitous blood pressure monitoring via pulse waveform based on FFT analysis of pulse waveform with different age, Proc. 3rd
transit time: theory and practice, IEEE Trans. Biomed. Eng. 62 (June (8)) (2015) IEEE ICBBE (2009) 1–4.
1879–1901. [27] E. Monte-Moreno, Non-invasive estimate of blood glucose and blood pressure from
[7] J. Lee, et al., Analysis of pulse arrival time as an indicator of blood pressure in a a photoplethysmograph by means of machine learning techniques, Artif. Intell.
large surgical biosignal database: recommendations for developing ubiquitous Med. 53 (June) (2011) 127–138.
blood pressure monitoring methods, J. Clin. Med. 8 (February (11)) (2019) 23–27. [28] R. Fazan, et al., Frequency-dependent baroreflex modulation of blood pressure and
[8] V.R. Ripoll, et al., Blood pressure assessment with differential pulse transit time heart rate variability in conscious mice, Am. J. Physiol. Heart Circ. Physiol. 289
and deep learning: a proof of concept, Kidney Diseases 5 (September (1)) (2019) (November (5)) (2005) H1968–H1975.
1773. [29] P.A. Lanfranchi, V.K. Somers, Arterial baroreflex function and cardiovascular
[9] S. Rajala, et al., Comparison of photoplethysmogram measured from wrist and variability: interactions and implications, Am. J. Physiol. Regul. Integr. Comp.
finger and the effect of measurement location on pulse arrival time, Physiol. Meas. Physiol. 283 (October (4)) (2002) R815–R826.
39 (August (7)) (2018), 075010. [30] Y. Bao, Z. Liu, A fast grid search method in support vector regression forecasting
[10] R.C. Block, et al., Conventional pulse transit times as markers of blood pressure time series, in: Lecture Notes on Computer Science, vol. 4224, 2006, pp. 504–511.
changes in humans, Sci. Rep. 10 (October (1)) (2020) 1–9. September.
[11] A. Attarpour, et al., Cuff-less continuous measurement of blood pressure using [31] L. Breiman, Random forests, Machine Learning 45 (October (1)) (2001) 5–32.
wrist and fingertip photo-plethysmograms: evaluation and feature analysis, [32] F. Pedregosa, et al., Scikit-learn: Machine learning in Python, J. Mach. Learn. Res
Biomed. Signal Proces. 49 (March) (2019) 212–220. 12 (October) (2011) 2825–2830.
[12] S. Chen, et al., A non-invasive continuous blood pressure estimation approach [33] K. Dewitte, et al., Application of the Bland–Altman plot for interpretation of
based on machine learning, Sensors 19 (June (11)) (2019) 2585. method-comparison studies: a critical investigation of its practice, Clin. Chem. 48
[13] W.H. Lin, et al., Towards accurate estimation of cuffless and continuous blood (May (5)) (2002) 799–801.
pressure using multi-order derivative and multivariate photoplethysmogram [34] E.O. Brien, et al., The British Hypertension Society protocol for the evaluation of
features, Biomed. Signal Proces. 63 (January (11)) (2021) 102198. automated and semi-automated blood pressure measuring devices with special
[14] R. Takahashi, et al., Non‑contact method of blood pressure estimation using only reference to ambulatory systems, J. Hypertension 8 (7) (1990) 607–619.
facial video, Artif. Life Robot. 25 (July (3)) (2019) 343–350. [35] American National Standard for Electronic or Automated Sphygmomanometers,
[15] N. Sugita, et al., Techniques for estimating blood pressure variation using video ANSI/AAMI SP 10 2002, Association for the Advancement Instrumentation,
images, Proc. IEEE 28th Annu. Int. Conf. Eng. Med. Biol. Soc. (2015) 4218–4221. Arlington, VA, USA, 2002.
11