Download as pdf or txt
Download as pdf or txt
You are on page 1of 4

Selective heart rate variability analysis to account for uterine activity

during labor and improve classification of fetal distress


G.J.J. Warmerdam1 , R. Vullings1 , J.O.E.H. Van Laar2 , M.B. Van der Hout-Van der Jagt2 , J.W.M. Bergmans1 ,
L. Schmitt3 , and S.G. Oei 2

Abstract— Cardiotocography (CTG) is currently the most of- response of the fetal ANS to stabilize the cardiovascular sys-
ten used technique for detection of fetal distress. Unfortunately, tem under fluctuating conditions of the exterior (i.e. uterus).
CTG has a poor specificity. Recent studies suggest that, in If HRV features are calculated without distinguishing periods
addition to CTG, information on fetal distress can be obtained
from analysis of fetal heart rate variability (HRV). However, of contractions from rest periods, this effect is averaged over
uterine contractions can strongly influence fetal HRV. The aim the rest periods and is no longer visible.
of this study is therefore to investigate whether HRV analysis In this paper we examine whether detection of fetal
for detection of fetal distress can be improved by distinguishing distress by HRV analysis can be improved by separating
contractions from rest periods. Our results from feature selec- contractions from rest periods. Therefore, in addition to
tion indicate that HRV features calculated separately during
contractions or during rest periods are more informative on HRV features calculated over the entire heart rate, HRV
fetal distress than HRV features that are calculated over the features were also calculated separately during contractions
entire fetal heart rate. Furthermore, classification performance and during rest periods. A sequential feature selection al-
improved from a geometric mean of 69.0% to 79.6% when gorithm was implemented to determine the most informative
including the contraction-dependent HRV features, in addition subset of HRV features, both with and without the additional
to HRV features calculated over the entire fetal heart rate.
contraction-dependent HRV features. Using these subsets as
I. I NTRODUCTION input, performance of classification of healthy fetuses and
fetuses with adverse outcome was evaluated by Support
During labor, oxygen supply to the fetus can temporally Vector Machines (SVMs).
be reduced due to contractions. If the reduction in oxygen First, data acquisition is described in Section II-A. Then,
supply is severe and prolonged this can lead to neonatal Sections II-B to II-F describe the signal processing tech-
morbidity and mortality. In clinical practice, cardiotocogra- niques used in this paper: pre-processing, HRV analysis,
phy (CTG) is currently the most often used technique for HRV analysis during contractions and rest periods, feature
fetal monitoring. However, CTG is interpreted visually and selection, and classification. In section II-G, our validation
has a high inter- and intra-observer variability. Besides, the method is explained. Finally, results are presented in Section
poor specificity of CTG leads to unnecessary interventions. III and discussion in Section IV.
Therefore, additional information is required to reduce the
number of unnecessary operative deliveries. II. M ETHOD
Recent studies have shown interest in analysis of fetal This section describes the different processing techniques
heart rate variability (HRV) for detection of fetal distress that were used in this paper. The overall procedure is
[1]–[6]. Since the heart rate is regulated by the autonomic presented in Fig. 1.
nervous system (ANS), variations in the heart rate reflect
changes in ANS control and could indirectly provide infor- A. Data acquisition
mation on fetal distress. In the literature, many HRV features For this study, fetal heart rate (FHR) and uterine activity
have been developed that describe different properties of the (UA) signals were obtained from CTG registrations that
heart rate. However, most HRV features have been developed were recorded on a Neoventa STAN R
(Mölndal, Sweden).
for adults. Unlike for adults, it is not possible to control the We only used beat-to-beat FHR signals that were measured
fetal cardiovascular system. Moreover, during labor contrac- by a scalp electrode. The UA was recorded either with a
tions can strongly influence the fetal cardiovascular system. tocodynamometer or intra-uterine pressure catheter. Based
In [6] and [7] it was shown that for healthy fetuses, spectral on the UA signal, contractions were annotated by a clinical
power is significantly higher during contractions as compared expert that had no knowledge of the FHR signal or any other
to rest periods (i.e. the periods in between contractions). The medical information. An example of a recording is shown in
increased spectral power during contractions could indicate a Fig. 2.
1 G.J.J. Warmerdam, R. Vullings, and J.W.M. Bergmans are with the Fetal outcome was defined using the arterial acid-base
Faculty of Electrical Engineering, Eindhoven University of Technology, status: adverse outcome was defined as a pH below 7.05 and
Eindhoven, The Netherlands g.j.j.warmerdam@tue.nl healthy as a pH above 7.20 [8]. Pregnancies complicated by
2 J.O.E.H. Van Laar,, M.B. Van der Hout-Van der Jagt, and S.G. Oei
intra-uterine growth restriction or fetal congenital anomalies
are with the Department of Obstetrics and Gynecology, Máxima Medical
Center, Veldhoven, The Netherlands were excluded. In total, 22 cases were included with adverse
3 L. Schmitt is with Philips Research, Eindhoven, The Netherlands outcome, that were matched with 110 healthy cases. Note

978-1-4577-0220-4/16/$31.00 ©2016 IEEE 2950


Pre-processing of successive differences (RM SSD) [9]. Since SD is a
measure for the overall variability, it is related to both
Segment selection sympathetic and parasympathetic activity. RM SSD on the
other hand is mostly sensitive to beat-to-beat variations,
HRV analysis hence reflecting parasympathetic activity. Although SD and
RM SSD are actually defined on the beat-to-beat FHR, we
Feature selection
calculated these features on the resampled FHR so that all
Classification HRV features used in this study were extracted from the
same signal.
Validation Spectral analysis was performed to examine the heart rate
at specific frequency bands that are related to sympathetic
Fig. 1. Flowchart of signal processing. and parasympathetic activity [1]. We used a Continuous
200
Wavelet Transform with a fifth order symlet wavelet to
FHR (BPM)

150
calculate the power in the frequency bands [10]. Power
100 in the low frequency band (LF , 0.04-0.15 Hz) represents
50 both sympathetic and parasympathetic activity, and power
100
in the high frequency band (HF , 0.4-1.5 Hz) represents
UA (mmHg)

75

50
parasympathetic activity only. Besides absolute LF and HF ,
25 also normalized frequency powers were calculated (LFn and
0
0 2 4 6 8 10 HFn ), by dividing LF and HF by the total power (T P ,
time (min)
0.04-1.5 Hz).
Fig. 2. Example of a FHR and UA signal for a healthy fetus. Heart rate signals often show complex and irregular fluc-
tuations that cannot be explained by spectral analysis. The
presence of these irregularities in the heart rate is seen as
that certain medication can cross the placenta and influence healthy blood pressure control by the ANS. Complexity in
fetal HRV (e.g. anti-hypertensives or pain killers). We ap- short time signals can be quantified by Sample Entropy
proximately matched the prevalence of these medications in (SampEn) [11]. SampEn estimates the conditional proba-
both groups, in accordance with the difference in size of the bility that if patterns are similar for length m − 1, they will
two groups. also be similar for length m. A tolerance parameter r is used
to define a threshold for similarity between two patterns. In
B. Pre-processing this study we used m = 2 and r = 0.2SD, similar to settings
HRV features are calculated on interbeat interval (R-R used in [2] and [3].
interval) signals. Since beat-to-beat R-R intervals can only Finally, Detrended Fluctuation Analysis (DFA) was used
be determined at the instance a heartbeat occurs, the R- to describe the scaling properties of the HRV over different
R intervals are not equidistantly sampled. Therefore, linear time scales [12]. The scaling exponent α was calculated
interpolation was used to obtain an equidistantly sampled over a range of 1 to 60 seconds, corresponding to the
heart rate. When the difference between two consecutive R- physiological range of the baroreflex.
R intervals was more than 200 ms or more than 25 Beats Per
Minute (BPM), we defined both R-R intervals as artifacts. D. HRV analysis during contractions and rest periods
Artifacts were corrected by linear interpolation. To calculate HRV features separately during contrac-
For each CTG registration, a segment of 10 minutes was tions and during rest periods, the expert annotations of the
selected with less than 20% loss of FHR signal and at least contractions were used. Rest periods were defined as the
2 minutes of FHR signal during contractions and during rest periods in between contractions. If the time between two
periods. Segments were selected closest to birth, only during contractions was more than one minute, we only considered
the second stage of labor (stage of active pushing) and within the final minute preceding the second contraction as rest
a range of 45 minutes before birth. period. Values of HRV features during contractions were
calculated based on R-R values that were recorded during
C. HRV features contractions. Similarly, values of HRV features during rest
For each 10 minute FHR signal, conventional HRV fea- were calculated based on R-R intervals that were recorded
tures were calculated that describe different properties of during rest periods. For each HRV feature, the average value
the heart rate. An overview of the used HRV features is of all contraction periods and all rest periods within the 10
presented in Table I. Due to limited space, we only briefly minute segments (indicated as f eatureuc and f eaturerest ,
summarize the HRV features used in this study. For a more respectively) is then used for classification. We also con-
thorough description of used HRV features, the authors refer sidered the ratio between HRV features calculated during
the interested reader to [6]. contractions and during rest periods (f eatureratio ).
To describe the statistical properties of the heart rate we Note that computation of LF , LFn , HFn , and α requires
used the standard-deviation (SD) and root mean square approximately one minute of consecutive FHR signal. Since

2951
TABLE I boundaries. For implementation of the SVMs, we used the
HRV FEATURES standard implementation fitcsvm in the Statistics and machine
Learning toolbox of Matlab R
.
HRV features entire FHR contractions rest ratio
SD x x x x Due to the imbalance in the distribution of our dataset
RM SSD x x x x over the classes, training an SVM with a similar cost for
LF x misclassification of both classes would result in a classifier
HF x x x x
TP x that tends to correctly classify the majority class (healthy
LFn x fetuses) but performs poorly on classification of the minority
HFn x class (fetuses with adverse outcome) [13]. To account for
SampEn x x x x
α x the imbalance in the distribution of the dataset we therefore
defined the cost for misclassification of the minority class
five times higher than for misclassification of the majority
during the second stage contractions and rest periods are class, in line with the class distribution.
often less than one minute, we could not extract these HRV An SVM rbf classifier requires optimization of two pa-
features separately during contractions and rest periods. We rameters; the width of the Gaussian kernel function (σ) and
thus limited our analysis during contractions and rest periods a penalty parameter for misclassification (C). Since feature
to SD, RM SSD, HF , and SampEn. selection by SFS uses the performance of the classifier to
In total, 21 HRV features were used: 9 calculated over score the candidate subsets, σ and C should ideally be
the entire FHR, 4 during rest, 4 during contractions, and 4 optimized for each candidate subset. However, this would
ratio features. The feature set that consists only of HRV fea- computationally not be feasible. Instead, for feature selection
tures calculated over the entire FHR, without distinguishing we therefore used the default settings of the SVM (σ = 1
contractions from rest periods, is defined as S1 ∈ R9 . The and C = 1). Then, after feature selection, σ and C were
feature set consisting of HRV features that are calculated optimized for the selected HRV feature subset.
over the entire FHR and separately during contractions and
G. Validation
rest periods is defined as S2 ∈ R21 .
From S1 and S2 , subsets of three HRV features were
E. Feature selection selected by SFS. Then, SVM parameters (σ and C) were
Many of the HRV features described in Sections II-C and optimized with the selected subsets as input. We used 10-fold
II-D are correlated and might contain redundant information. cross validation to find the optimal subset of HRV features
A feature is considered redundant if the information about the and for optimization σ and C.
two classes is already captured by other features. Moreover, Due to our relatively small dataset containing only 22
we have a small dataset with only 22 fetuses with adverse cases of fetuses with adverse outcome, feature selection and
outcome and there is a risk of overfitting when too many classification could be influenced by the location where the
HRV features are used for classification. dataset was split into ten fold. To obtain a more objective
To select the best combination of HRV features we used result, the above procedure was repeated ten times. For both
a sequential feature selection algorithm. Since the number S1 and S2 we thus generated ten subsets of three HRV
of HRV features in S1 and S2 is relatively small, we use a features and ten combinations of σ and C.
In case of an imbalanced dataset, the overall classification
Sequential Forward Selection (SFS) algorithm. Starting from
accuracy is not the best metric to evaluate the performance
an empty feature set, SFS creates candidate feature subsets
of the classifier [13]. Instead we used the geometric mean
by sequentially adding features that have not been selected,
(g) as metric. The metric g provides a balance between the
such that the new candidate subset maximizes an objective
specificity (Sp, classification accuracy for the majority class)
function. For implementation of SFS, we used sequentialfs
and sensitivity (Se, classification
√ accuracy for the minority
in the Statistics and Machine Learning toolbox of Matlab R

class), and is defined as g = Sp · Se.


(The Mathworks, Inc. Natick, MA).
SFS can be categorized as a wrapper method for feature III. R ESULTS
selection. Wrapper methods typically use the performance of Figure 3 shows all HRV features that were selected in the
a classifier as objective function to score candidate feature ten runs by SFS. From S1 (only HRV features that are calcu-
subsets. In our study we used the performance of an SVM lated over the entire FHR) α, SD, LFn , and HFn were most
classifier that is explained in Section II-F. frequently selected, as shown in Fig. 3a. In case S2 was used
(also including HRV features that were calculated separately
F. Classification during contractions and rest periods), SDratio , RM SSDuc ,
In this paper, SVMs were used to classify healthy fetuses and SampEnratio were most frequently selected, as shown
and fetuses with adverse outcome. The SVM constructs in Fig. 3b.
a set of hyperplanes in feature space that minimizes the Results of the classification performance with the selected
misclassification error while maximizing the margin between subsets as input are shown in Table II. The average value of
the hyperplanes and the data points. A Gaussian radial the Se, Sp, and g for the ten runs is presented. The average
based function (rbf) was used to enable non-linear decision g for S1 and S2 were 69.0% and 79.6%, respectively.

2952
10
classifier was trained to perform well on a balance between
# selected Se and Sp. Depending on how the classifier is used in
5
clinical practice, it might be better to define a different
0
penalty for mis-classification. This different penalty will
SD LFn HFn LF TP SampEn
α
increase either Se or Sp. It is difficult to compare our results
(a) to other work in the literature due to differences in the used
10
dataset. We only included FHR signals that were recorded
# selected

5 during the second stage of labor. Moreover, we also included


several types of medications that could influence fetal HRV.
0
SDratio RMSSDuc SampEnratio SDuc HF
For future work it would be interesting to evaluate the
performance of the classifier for different types of medication
(b)
that are prescribed during labor. Furthermore, we selected
Fig. 3. Results feature selection by SFS. (a) Selected HRV features from FHR segments near birth, but it might be interesting to
S1 . (b) Selected HRV features from S2 .
examine the classification performance over time. Moreover,
TABLE II inclusion of clinical information such as gestational age or
R ESULTS CLASSIFICATION maternal temperature might also improve the classification.
Feature set g (%) Se (%) Sp (%) ACKNOWLEDGMENT
S1 69.0 62.7 76.4
S2 79.6 77.7 81.6 This research was performed within the framework of the
IMPULS perinatology program. The authors would like to
IV. D ISCUSSION thank A. Kwee (University Medical Center Utrecht, The
Analysis of fetal HRV provides information on fetal dis- Netherlands) and K.G. Rosén (Boras University, Sweden)
tress and could assists clinicians in their decision making. for provision of their STAN registrations.
However, during labor the fetal cardiovascular system is R EFERENCES
strongly influenced by contractions. In this paper we showed [1] J. van Laar, C. Peters, R. Vullings, S. Houterman, J. Bergmans, and
that separating contractions from rest periods improves HRV S. G. Oei, “Fetal autonomic response to severe acidaemia during
analysis for the detection of fetal distress during labor. labour.” BJOG, vol. 117, no. 4, pp. 429–437, Mar 2010.
[2] H. Gonçalves, A. Rocha, D. Ayres-de Campos, and J. Bernardes,
We used SFS to select the best combination of HRV “Linear and nonlinear fetal heart rate analysis of normal and acidemic
features for detection of fetal distress. Although SFS usually fetuses in the minutes preceding delivery,” Med Biol Eng Comput.,
provides an optimal subset for a particular classifier, it uses vol. 44, no. 10, pp. 847–855, 2006.
[3] J. Spilka, V. Chudáček, M. Kouckỳ, L. Lhotská, M. Huptych, P. Janků,
the classification performance to score the candidate subsets. G. Georgoulas, and C. Stylios, “Using nonlinear features for fetal heart
As a consequence, selected HRV features might perform rate classification,” Biomedical Signal Processing and Control, vol. 7,
differently for other classifiers. Furthermore, the best subset no. 4, pp. 350–357, 2012.
[4] A. Georgieva, S. J. Payne, M. Moulden, and C. Redman, “Artificial
was determined by 10-fold cross validation. Since splitting neural networks applied to fetal monitoring in labour,” Neural Com-
the dataset into ten folds influences the feature selection, SFS puting and Applications, vol. 22, no. 1, pp. 85–93, 2013.
was repeated ten times. The influence of splitting the dataset [5] P. Abry, S. Roux, V. Chudácek, P. Borgnat, P. Goncalves, and
M. Doret, “Hurst exponent and intrapartum fetal heart rate: Impact
is reflected by the number of HRV features that SFS selected of decelerations,” in Proc. of the IEEE 26th Int. Symp. on Computer-
at least once in the ten runs. As can be seen from Fig. 3, Based Medical Systems. IEEE, 2013, pp. 131–136.
feature selection from S2 was influenced less by splitting [6] G. Warmerdam, R. Vullings, J. Van Laar, J. Bergmans, L. Schmitt,
S. Oei et al., “Using uterine activity to improve fetal heart rate
the dataset than feature selection from S1 . Moreover, the variability analysis for detection of asphyxia during labor,” Physiol
most frequently selected HRV feature from S1 (α) was only Meas, vol. 37, no. 3, pp. 387–400, 2016.
selected 8 times, while the most frequently selected HRV [7] M. Romano, P. Bifulco, M. Cesarelli, M. Sansone, and M. Bracale,
“Foetal heart rate power spectrum response to uterine contraction,”
feature from S2 (SDratio ) was selected in all runs. Med Biol Eng Comput., vol. 44, no. 3, pp. 188–201, 2006.
As shown in Fig. 3b, HRV features calculated over the [8] A. Sundström, D. Rosén, and K. Rosén, “Fetal surveillance,” Gothen-
entire FHR are generally not selected from S2 . This indicates burg: Neoventa Medical AB, 2000.
[9] U. Schneider, E. Schleussner, A. Fiedler, S. Jaekel, M. Liehr,
that HRV features calculated separately during contractions J. Haueisen, and D. Hoyer, “Fetal heart rate variability reveals dif-
and during rest periods contain more information about the ferential dynamics in the intrauterine development of the sympathetic
two classes then HRV features calculated over the entire and parasympathetic branches of the autonomic nervous system,”
Physiological Measurement, vol. 30, no. 2, p. 215226, Jan 2009.
FHR. Similar to results obtained in [6], our result suggest that [10] C. Peters, R. Vullings, M. Rooijakkers, J. Bergmans, S. Oei, and
in particular ratio features are informative on fetal distress. P. Wijn, “A continuous wavelet transform-based method for time-
Further study using a larger dataset and other classifiers frequency analysis of artefact-corrected heart rate variability data.”
Physiol Meas, vol. 32, no. 10, pp. 1517–1527, Oct 2011.
is required to gain insight in which combination of HRV [11] J. Richman and J. Moorman, “Physiological time-series analysis using
features is the most informative. approximate entropy and sample entropy,” Am J Physiol Heart Circ
The average classification performance improved from Physiol., vol. 278, no. 6, pp. H2039–H2049, 2000.
[12] E. Ihlen, “Introduction to multifractal detrended fluctuation analysis
g = 69.0% for HRV features calculated over the entire in matlab,” Front. Physio., vol. 3, 2012.
FHR to g = 79.6% when HRV features were also calculated [13] H. He and E. A. Garcia, “Learning from imbalanced data,” IEEE
separately during contractions and rest periods. Note that the Trans. Knowledge and Data Eng., vol. 21, no. 9, pp. 1263–1284, 2009.

2953

You might also like