Professional Documents
Culture Documents
Advances in Electrodermal Activity Processing With Applications For Mental Health
Advances in Electrodermal Activity Processing With Applications For Mental Health
Greco · Gaetano Valenza
Enzo Pasquale Scilingo
Advances in
Electrodermal
Activity Processing
with Applications
for Mental Health
From Heuristic Methods to Convex
Optimization
Advances in Electrodermal Activity Processing
with Applications for Mental Health
Alberto Greco • Gaetano Valenza
Enzo Pasquale Scilingo
Advances in Electrodermal
Activity Processing
with Applications
for Mental Health
From Heuristic Methods to Convex
Optimization
123
Alberto Greco Gaetano Valenza
Department of Information Engineering Department of Information Engineering
Bioengineering and Robotics Bioengineering and Robotics
Research Center ‘E Piaggio’ Research Center ‘E Piaggio’
University of Pisa University of Pisa
Pisa, Italy Pisa, Italy
This book presents a critical review of methodological studies for the analysis of
the electrodermal activity (EDA), one of the most powerful noninvasive peripheral
measures of the autonomic nervous system (ANS) neural pathway. Through the
book, the author leads the reader from a thorough description of electrodermal
physiological phenomena to an advanced introduction and discussion of recom-
mended techniques for correct data collection and effective data analysis. Several
experimental setups are also presented.
Although the EDA signal is fairly easy to acquire and very informative, powerful
methodologies and efficient models are required to make meaningful inferences
on the dynamics at the central nervous system level. The book introduces and
emphasizes a novel computational model for EDA analysis that I have personally
promoted and coauthored. The method relies on rigorous mathematical techniques,
such as convex optimization, to provide an effective window on the ANS dynamics,
and it has been successfully applied in several experimental scenarios. EDA is a
source of many sensitive psychophysiological markers and it finds application in
several fields of research, such as psychology and medicine, as a viable indicator in
emotion assessment and pathological mood state recognition. Remarkably, the book
presents several experimental applications exploring different sensory channels
for emotion stimulation in both healthy subjects and bipolar patients with very
promising results.
I am confident the reader will find useful information on proper characterization
of EDA dynamics and how this can be applied to the rising fields of affective
computing and psychophysiology. The high technical content makes the book
attractive to anyone interested in signal processing, statistics, applied mathematics,
and physics.
The book is a valuable reference for active research scientists and postgraduate
students interested in methods at the interface of bioengineering and statistics. I
expect that this book will stimulate and encourage the use of such methods in
different fields of applied science.
Electrodermal activity (EDA) can be considered one of the most common perceptual
channel, of the autonomic nervous system (ANS) dynamics and manifests itself as
changes in electrical properties of the skin. Several previous studies have shown
how EDA can be a very informative biomedical sign with high discriminant
power between different psychophysiological states, although in this case many
methodological issues arise. This book fervently shows how to retrieve much
reliable information from EDA, to investigate also the assessment of emotional
responses in healthy subjects and patients with pathological mood/mental states.
Throughout the chapters, in-depth methodological and applicative studies involving
EDA are described, including a critical review on the current state of the art.
Since continuous deconvolution analysis (CDA) has been recognized as one of the
mostly used methods for EDA analysis, we first show how to apply this model to
discern different affective states in healthy volunteers. Emotions were evoked using
multimodal standardized sets of pictures, sounds, caresses, and smells. Valence and
arousal levels of such emotions were identified as the principal dimensions of the
affective responses. The achieved results are consistent with the hypothesis that it is
possible to objectively study ANS dynamics involved in the emotional processing
by properly processing the EDA.
Furthermore, this book reports on a novel computational model for the EDA
analysis based on convex optimization methods. This model, hereinafter called
cvxEDA, describes the EDA as a sum of the phasic component, the tonic com-
ponent, and an additive white Gaussian noise term incorporating prediction errors,
as well as measurement errors and artifacts. CvxEDA is physiologically inspired
and overcomes the limitations of the heuristic solutions and post-processing steps
of the conventional approach. It is based on a rigorous methodology grounded on
Bayesian statistics, mathematical convex optimization, and sparsity. Building on our
previous CDA-based experimental results, outcomes of cvxEDA often demonstrate
higher accuracy than CDA while discerning elicited emotional states in healthy
subjects. When applied to EDA from psychiatric patients suffering from bipolar
vii
viii Preface
We would like to express our deepest and sincere gratitude to all the people who
contributed to data acquisition and analysis: Dr. Antonio Lanatá, Dr. Andrea Guidi,
Dr. Mimma Nardelli, Dr. Matteo Bianchi, Prof. Claudio Gentili, Dr. Nicola Vanello.
A special mention to Dr. Luca Citi for his fundamental contribution to our research
and for his foreword.
ix
Contents
xi
xii Contents
Bibliography . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . . 123
Acronyms and Symbols
The following table shows the most used and important acronyms.
Statement Acronym
Autonomic nervous system ANS
Central nervous system CNS
Electrodermal activity EDA
Electrodermal response EDR
Electrodermal level EDL
Skin conductance SC
Skin conductance response SCR
Skin conductance level SCL
Non specific skin conductance response NsSCR
Heart rate variability HRV
International Affective Picture System IAPS
Circumplex model of affect CMA
International Affective Digital Sounds IADS
Continuous deconvolution analysis CDA
Signal-to-noise ratio SNR
Finite impulse response FIR
Infinite impulse response IIR
Impulse response function IRF
Auto-regressive moving average ARMA
Auto-regressive AR
Moving average MA
Area under the curve AUC
Respiratory volume RV
Respiration activity RSP
Skin temperature ST
Pupil diameter PD
xv
xvi Acronyms and Symbols
xvii
xviii Introduction
response function (IRF) and a sparse signal representing the sudomotor nerve
activity, which is part of the ANS. The IRF is modeled as an IIR filter allowing a
much more compact and non-banded matrix representation increasing the accuracy
and reducing the computational cost. Unlike previous algorithms in the literature,
this model incorporates the intrinsic physiological characteristics of EDA without
necessarily resorting to heuristics and ad hoc solutions, thanks to the presence
and definition of prior probabilities for the phasic and tonic signals. Results were
compared to those obtained through the continuous deconvolution analysis (CDA)
model [1], a method that performs a deterministic inversion of the peripheral
model. The proposed method showed good performance confirming a promising
applicability in the field of affective computing as well as of mental health.
In the second part of the book, we report on several EDA application sce-
narios, especially related to two specific research fields: emotion recognition and
assessment of mood/mental disorder. Indeed, emotions and mental disorders are
strictly and intrinsically interrelated; therefore, when emotions are dysregulated,
mental health is not guaranteed. Affective experiences accompany all cognitive
processes and social activities even in the case of psychopathologies [2, 3].
Moreover, prevalent theories affirm that the emotional processes can have primacy
over cognition [4]. As an example, the regulation process of emotions is crucial
in the occurrence and control of major depressive episodes, and some theoretical
views of depression are based on emotion changes which have implications in the
assessment, treatment, and prevention of the pathology [5]. Another well-known
relationship between emotions and mental disorders regards anxiety [6] as well as
brain damages of emotional processing areas and decision-making process [7].
In this part of the book, several experimental results gathered from testing EDA
models to robustness to noise, ability to separate and identify exogenous stimuli,
and capability of properly describing the activity of the autonomic nervous system
in response to specific affective elicitation are reported in detail. Concerning the
affective elicitation paradigm, we show exemplary applications of EDA modeling
on data gathered from healthy subjects undergoing multimodal affective elicitation,
where visual, auditory, olfactory, and tactile stimuli were investigated. Concerning
the mental health scenario, EDA analysis was employed to assess patients with
bipolar disorder [8–10], who experienced depressive and manic or hypomanic
episodes. Data used for this study were acquired in the frame of a European
collaborative project called PSYCHE (personalized monitoring systems for care in
mental health) [8, 11].
Chapter 1
Electrodermal Phenomena and Recording
Techniques
Electrodermal activity (EDA) is the general term used to define autonomic changes
in the electrical properties of the skin. One of the most frequently used measures of
EDA is skin conductance (SC), which can be quantified by applying an electrical
potential between two points of skin contact, usually the medial or distal phalanxes
of the non-dominant hand, and measuring the resulting electric current between
them. Electrodermal signals are a manifestation of the activity in eccrine sweat
glands that are innervated by the sympathetic branch of the autonomic nervous
system (ANS), mainly by the sudomotor nerves [12]. Indeed, when the sudomotor
nerves stimulate the production of sweat, the conductivity measured on the skin
surface changes as a result of sweat secretion and variations in ionic permeability of
sweat gland membranes [13–15]. The EDA is comprised of two main components,
having different time scales and relationships with the triggering stimuli: tonic and
phasic. The tonic EDA is given by the skin conductance level (SCL) which repre-
sents the slow-varying baseline level of the SC. Variations in the SCL are thought
to reflect slow changes in the ANS dynamics. The phasic EDA is represented by
a fast changing component, called skin conductance response (SCR), reflecting the
evoked response of the eccrine sweat glands to an external stimulus. The SCR is
defined as the SC transient arising within a predefined window (1–5 s) after the
stimulus onset and satisfying a minimum amplitude criterion (0.05 S) [16]. Recent
evidences suggest that these two components rely on different neural mechanisms
[17] and, consequently, that both convey relevant and non-redundant information
about the ANS activity.
The SC can be easily measured by applying a constant 0.5 V potential across two
skin-contact points, usually on the surface of hands or more specifically of fingers,
where there is a high concentration of eccrine sweat glands [18, 19]. EDA is used in a
wide range of experimental setups because it is a relatively straightforward measure
providing valuable information about the ANS response to a broad range of external
stimuli. In particular, SC analysis is commonly used to quantify the levels of arousal
associated with emotional and cognitive processes [17, 20, 21].
Although sweating is primarily a means of thermoregulation, sweat glands
located on the palmar and plantar (glabrous) surfaces possibly evolved to increase
grip and enhance sensitivity, and may be more responsive to psychologically
significant stimuli than to thermal ones [13, 15]. This relationship between EDA,
ANS, and psychological stimuli—together with the relative ease of measurement—
makes this physiological signal widely popular in neuroscience research, including
information processing, quantification of arousal levels during emotional and cogni-
tive processes, and clinical research examining predictors and correlates of normal
and pathological behaviour [17, 20–22], such as psychopathology, personality
disorders, conditioning, and neuropsychology.
The main role of the skin is to protect the body from environmental threats such
as temperature, chemical, mechanical and infectious agents. It acts as a selective
barrier, and provides sense functions thanks to the mechanoreceptors, nociceptors,
and thermoreceptors. As a selective barrier the skin acts an other important role: the
regulation of perspiration both to prevent the body from drying out and to control
the emission of fluid thanks to the sweat glands.
The skin is not a single organ but consists of a complex set of organs and it is
possible to distinguish several layers (see Table 1.1).
The cutis is comprised of two sub-layers, the dermis and the epidermis (see
Fig. 1.1). The epidermis is the outermost layer of the skin and consists of the
epithelial tissue, which becomes progressively hornier closer to the surface. At
its base there are numerous layers of cells that reproduce continuously and move
towards the outer layers in order to replace those that die and fall off. The
epidermis does not contain blood vessels, and it is nourished by diffusion through
the underlying dermis. In particular, the stratum corneum plays an important role in
the electrothermal phenomena: generally this layer is dry, but becomes wet in the
presence of sweat.
1.2 Anatomy of the Skin 3
Stratum corneum
Stratum lucidum
Stratum granulosum
Stratum spinosum
Stratum germinativum
Stratum
papillare
Dermis
Stratum
reticulare
Subcutis
(Hypodermis)
Fig. 1.1 Layered composition of the glabrous human skin. An eccrine sweat gland, encircled by
its glomerulus, together with its straight dermal and irregularly coiled (helical) epidermal duct
(labeled acrosyringium), is shown in cross section. A part of the reticular layer has been omitted
due to its size in relation to the rest of the skin. Courtesy of [23]
The epidermis is relatively thin in comparison with the deeper-lying dermis. This
is located under the layer of the epidermis and consists of a connective tissue, which
acts as a cushion for the mechanical trauma of the skin. The dermis is tightly
connected to the epidermis by a basement membrane and contains many nerve
endings (mechanoreceptors) that provide the sense of touch and heat. It also contains
blood vessels, lymph vessels, sebaceous glands, sweat glands, hair follicles and
apocryphal [24–27]. The hypodermis (subcutis) separates the cutis from the deeper-
lying tissue and is composed of loose connective tissue. It contains the secretory
part of the sweat glands, appearing as a glomerulus (Fig. 1.1), as well as fatty tissue,
and the larger vessels which supply the body surface [23].
4 1 Electrodermal Phenomena and Recording Techniques
There are regional differences not only in the skin’s vertical layering, but also
in its horizontal structure. In the early stages of embryonic development, different
patterns of skin are formed by either ridge formation or folding into polygonal
structures [24]. Thus, the different types of skin are referred to as ridged skin
and polygonal skin. Specifically, ridged skin is seen only on the palms and soles.
The sweat gland ducts usually enter the epidermis at the nadir of the ridges. The
ridged skin is glabrous (hairless) and has no sebaceous nor scent glands. The rest
of the body is covered by polygonal skin patterns. Unlike ridged skin, the ducts of
sweat and scent glands enter the epidermis at the higher parts of the skin. Hairs and
sebaceous glands, however, are located in the channels of the polygonal skin.
All these layers have different properties depending on body sites. Concerning
the EDA, palms and fingers of hands are very important for their specificity
for emotional sweating. They are characterized by a very thin epidermis, i.e.
approximately 1 mm while it is ordinarily 50–200 mm.
The sweat glands cover almost the entire surface of the human body; they amount
to more than three millions, increasing by a factor of 7 from birth to adulthood.
They are present in high concentrations in specific areas such as palms, forehead
and soles, and in low concentrations on arms, trunk, legs [28], whereas there are
areas in which they are totally missing, such as lips or the inner ear channel [29].
Sweat glands are considered as exocrine glands because they secrete directly onto
the skin’s surface. Concerning the characteristics of the sweat secretion, sweat
glands are classified into eccrine and apocrine. Apocrine sweat glands play only
a negligible role with respect to the total amount of sweating [30] and respect to
the electrodermal activity too, and they are distributed especially on the breast, the
axillary, circumanal and genital regions [24, 31]. The eccrine sweat glands, instead,
play the most important role in the electrodermal activity phenomena. They are
the majority of sweat glands in human beings and are present in almost all the
surfaces of the body. Their secretions, i.e. eccrine sweat, do not contain noticeable
amounts of cytoplasm from the glandular cells, and they are not continuous, but
depend on various stimuli that affect also the regions in which they take place.
Heat, for example, causes sweating mainly on forehead, neck, back, chest and
back of hands [24]. Instead, emotional stimuli affect armpit regions, sides of the
torso, palms of hands and soles of feet. In 24 h the human body can produce
a huge amount of eccrine sweat, which can be up to 10–12 l [32]. The gland
activity is controlled by the hypothalamus and, as stated above, the sweat glands
are innervated by the fibers belonging to the sympathetic nervous system. They
are highly vascularized and innervated by a dense network of nerves, which are
both cholinergic and adrenergic. The secretory part of eccrine glands is innervated
only by the sympathetic cholinergic nerves: that means that they use exclusively
acetylcholine as a synaptic transmitter, which is produced and used on site.
1.4 Physiology of the Electrodermal System 5
Although the local processes underlying the EDA in the skin are well known, the
central origins of EDA are still under study. EDA is the result of the interaction
between local processes in the skin and sympathetic nervous system activity. At least
two different CNS sources have been identified as areas controlling the sudomotor
activity that leads to the electrodermal changes (see Fig. 1.2).
However, they use the same peripheral sudomotor efferents to the sweat glands
as a common final pathway. Specific central sudomotor pathways are not well
established, and even with the aid of brain imaging, no final breakthrough has
been reached in clarifying the central elicitation or inhibition of electrodermal
phenomena [23].
The hypothalamus is the area of the brain that directly regulates the secretion
of sweat; it is the main center of thermoregulation, then it is supposed to play
an important role in the origin of the EDA signal. The hypothalamus functions
are influenced by different brain structures, which are part of the limbic system.
Moreover, also the cortical area interacts with the limbic system, which can also
affect the hypothalamus activity and, consequently, the EDA. As a matter of fact,
previous studies [33, 34] have shown a decrease or the disappearance of the skin
conductance response to an emotional stimulus in subjects with lateral-frontal
lesions, comparing with other cortical areas lesions. Therefore, the sudomotor reflex
seems to be under the complete control of the limbic system and brain structures
associated with it. Specifically, the amygdala and the hippocampus are the limbic
structures mostly involved in the control of the hypothalamic functions related to
6 1 Electrodermal Phenomena and Recording Techniques
Premotor
cortex (Area 6)
Ci
Limble System
Ci = Cingulate gyrus C Basal Ganglia
A = Anterior thalamus C = Caudate nucleus
Fo = Fornix P = Putamen
A Fo
M = Medial part of the
Hi = Hippocampus
Hy = Hypothalamus P }
L = Lateral pallidum
L
M
Hy
Hi
1 2
3
RF = Reticular RF
formation Medulla
Sympathetic
anterolateral
pathway
Spinal cord
Fig. 1.2 Central nervous system elicitation of EDA in humans. 1: ipsilateral influences from the
limbic system via hypothalamic thermoregulatory areas (EDA1); 2: contralateral influences from
premotor cortical and basal ganglia areas (EDA2); 3: reticular influences. Courtesy of [23]
the sudomotor nerve activity. The amygdala should play an excitatory function,
whereas the hippocampus plays an inhibitory function. Specifically, the amygdala
plays a key role in sweating control as response to emotional stimuli. This because
of the involvement in the behavioral pattern memory (emotions behavior, social
behavior, endocrine and autonomous functions). In 1965 Bagshaw [35], removing
the amygdala from the brain of some monkeys, found out that the activation of
the EDA does not terminate with the limbic-hypothalamic activity, but also occurs
as a result of phenomena such as deep breathing and movements. This suggested
the presence of at least two other sources: the premotor cortical and basal ganglia
1.4 Physiology of the Electrodermal System 7
and the reticular system. The first consists of premotor cortical areas, whose fibers
for the transmission of pulses are found in skeletal muscle connections close to
the fibers that control the secretion of sweat. In fact, when these cortical areas
are naturally or electrically stimulated, or removed, an intense sweating can be
observed. The reticular system is a nuclear and fiber network in the inner part of
the brain. It plays a very important role in various phenomena, such as keeping the
alertness, processing of sensory stimuli, adjustment of spinal reflexes. Therefore,
the reticular system can trigger and modulate the EDA.
There are lots of empirical evidences that electrodermal activities are gener-
ated by sweat glands in conjunction with epidermal membrane processes. Sweat
secretion leads to two phenomena: the filling of sweat ducts and the moistening of
the relatively dry upper epidermal layer, the stratum corneum. When the ducts are
filled, the skin conductance increases due to shunts through the epidermal barrier
that connect the surface of the skin with the highly conductive dermal tissue. On
the other side, the moistening of the corneum increases skin conductance thanks
to salty sweat. In the literature, there are several simple electrical models that
can describe those purely resistive properties of the electrodermal system. In 1983
Edelberg proposed a resistive model of EDA, which resembled the simplified form
of the Montagu-Coles model [36] as depicted in the right-hand part of Fig. 1.3. He
regarded the corneum and the sweat duct as resistors in parallel, connected in series
with a resistor which includes some corneal and all subcorneal structures, except
the sweat gland lumen. Edelberg [37] also did not include the frequently discussed
active epidermal membrane, because of its hitherto uncertain role in contributing
to conductance changes [38]. In case of exosomatic recording, such a membrane
would act as a capacitor lying in parallel to the resistors of the corneum and sweat
duct.
Fig. 1.3 Left-hand panel: Electrical equivalent circuit for the skin, according to Montagu and
Coles [36]. R1: resistance of the dermis and the body core. R2: resistance of the stratum corneum.
r1, . . . , rn: connectable resistances of the sweat gland ducts. C: capacitive element. Right-hand
panel: Simplified Montagu-Coles model. R: variable resistance resulting from sweat gland ducts.
Left-hand panel from [36]. Courtesy of [23]
In fact, the membranes can store electric energy and become potential generators.
The membranes that behave as polarized capacitors are presumably mainly localized
in the secretory activity of the sweat glands, in the transition area between the dermis
and the epidermis and into the epidermis. In conclusion, the skin can be interpreted
as an electrical network formed by RC circuits in series and parallel. We can assume
that the capacitive elements can be loaded very quickly, but once that they are fully
charged, they can not scroll other current and therefore they can not further influence
the EDA.
The sequence of events that occur during the electrodermal response in the
epidermal duct of a sweat gland is shown in Fig. 1.4.
• at the beginning of the response, if the surface layer of the stratum corneum is
well hydrated, the pore sudoriparo and the distal part of the ductus sudoriparo are
closed under the effect of pressure exerted by the surrounding stratum corneum
(approximately 20 mmHg). In fact, the conductance does not increase.
• If the sweat fills the duct, it occurs higher conductivity of the duct itself and thus
measuring an increase in conductance in the horny layer (composed of dead cells
now dried).
1.5 Recording Systems 9
Three standard methodologies are usually employed to record the EDA. The first
one is called endosomatic measurement. This is rarely used and will not be detailed
in this book. It consists in measuring, directly on the skin, the potential difference
between two skin sites, in a passive way. It does not need special amplifiers and
coupling electrical circuits. Although it is a quite unknown bioprocess, it is accepted
that changes in skin potential during sympathetic activity may be provoked by the
sodium reabsorption across the duct walls and the consequential change of the ionic
potential in the sweat ducts [15].
10 1 Electrodermal Phenomena and Recording Techniques
2
1
0
6
5
μS
4
3
2
1
0
0 50 100 150 200 250 300 350 400 450 500
t [s]
Fig. 1.5 Representative SC signal during affective stimulation. (Of note, the first 150 s correspond
to a resting phase)
The other two methods are based on an exosomatic approach, i.e., a small exter-
nal current is directly injected into the skin. In this regard, two different methods are
used, specifically, the direct current method (DC) and the alternating current method
(AC) at different frequency levels. Generally, they perform a measure of resistance
(impedance) or conductance (admittance: “conductance C j susceptance”), where
j is the imaginary unit. When DC source is used resistance and conductance are
inverse, while when we use AC source the inverse paradigm is not valid, due to
changes in the equivalent electrical model of the skin. In fact, it changes from a
parallel resistor-capacitor circuit (where, in DC regime, resistance and conductance
are inverse) to a series resistor-capacitor circuit [39].
Although AC methods allow measuring capacity changes in the electrodermal
responses, DC procedures are the most implemented [40]. Within DC methodology,
even if much effort has been spent to standardize, an agreement concerning
the use of either constant-voltage or constant-current scheme has not yet been
achieved. Therefore, different designs can be found. In constant-voltage sources the
conductance of the skin can be directly measured as an output of the circuit without
the need of any further transformation. However, constant-current sources provide
more stability and exhibit less tolerance, but in this method much attention must be
payed to possible damage to the sweat ducts due to the injected current through a
small area of the skin [39].
Unfortunately, a very low number of studies have been published on the
difference among AC and DC stimulation in EDA measurement. One of the
most interesting study [40] showed that using an AC source at 88 Hz, the major
contribution to EDA is given by the conductance term and not by the susceptance.
More specifically, the authors found that, at the instant of the conductance response,
there were no susceptance responses, and this could indicate the absence of a not
significant capacitance in the sweat ducts.
1.6 Exemplary Electrodermal Activity Monitoring Devices 11
C6 Hypothenar
Thenar eminence
eminence Active electrode
C8 Dermatomal
Crease
distribution
Unipolar
placement
(SP measurement)
Reference
electrode
Abraded
site
Elbow
Electrodes
vo AC2
vDC
0.5 V AC1 +
Fig. 1.7 Simplified circuit for the acquisition of the DC electrodermal activity
A device for AC recording of EDA was proposed by Boucsein et al. [38], pointing
out that this method is preferred for medical applications to prevent nonlinearities
that may result from uncontrolled current densities when using a constant effective
voltage source.
With this method both impedance and phase angle are obtained as analog output
signals. Specifically they used an oscillator capable of generating a sine wave with
a continuously adjustable voltage between 1 Hz and 1 kHz. The sine wave was
converted by a voltage-to-current circuit into a constant current in the range between
0 and 10 mA of peak value, which is delivered to the subject.
The terminal voltage from the subject’s skin was preprocessed in two different
ways in order to compute both the impedance value and the phase angle. After a
pre-amplification step, the signal was amplified and rectified. Then, the impedance
was calculated after further low-pass filtering stage with a cutoff frequency of either
0.1 or 1 Hz and sent to an digital display. For calculating the phase angle, the pre-
amplified signal was multiplied in a phase sensitive detector with the oscillator
signal—which was phase shifted with the possibility of adjusting the phase angle
continuously—acting as a zero-offset for the phase signal. The output signal of the
phase sensitive detector was also rectified, low-pass filtered with the same frequency
limit as the impedance signal, and delivered to an other digital display.
A block diagram of the AC circuit for EDA recording is shown in Fig. 1.8.
14 1 Electrodermal Phenomena and Recording Techniques
calibration
100 100
4.73 nF output (2)
kW kW
phase angle[deg]
CAL 1 CAL 2
sub- digital
pre- phase low-pass
rectifier display
recording amplifier detector filter phase angle
.1 Hz
1 Hz
constant
current
source
peak- output (1)
current
0-10mA amount of
impedance
ject digital
phase voltage low-pass
rectifier offset display
shifter amplifier filter voltage
.1 Hz
0-100° 1 mV 1 Hz
10 mV
sinus 0° 100 mV
oscilator 90°
180° 1V
1Hz-1kHz 270°
x0... 10 10 v
1 kHz
10 Hz
1 Hz
Fig. 1.8 Block diagram of an analog front-end for simultaneous recording of impedance and phase
angle (courtesy of [38])
Several devices and circuit designs for the acquisition of the exosomatic EDA can
be already found in the literature and on the market. Here, we discuss the most
commonly used among those which need to be connected to a remote computer
[52].
• ProComp. It is a multi-channel and multi-modality acquisition system used by
a number of researchers to record skin conductance signal in addition to other
physiological data used in clinical observation and biofeedback (e.g., EEG, ECG,
EMG . . . ) [53] as well as other derivated measures such as heart rate, blood
volume pulse, respiration, goniometry, force. It has 8 protected pin sensor inputs;
2 channels that read data at 2048 samples/second, and 6 channels that read it
at 256 samples/second. The connection between the remote computer and the
ProComp system is performed by means of a fiber-optic cable.
• BioPac. The BioPac MP [54] system is a data-acquisition board that can be
connected to a remote computer by USB or Ethernet cable. It is widely used in the
scientific community and also in this book has been used in several experimental
studies (see Chaps. 3 and 5). The BioPac system is able to collect a large variety
of physiological multi-modal data, including skin conductance, with adjustable
sampling frequency. Moreover, it is possible to set a digital preprocessing stage
including variable gain and different kinds of filter.
1.6 Exemplary Electrodermal Activity Monitoring Devices 15
In the field of DC-devices, in the last decades, many wearable devices have been
developed in order to have the possibility of acquiring the EDA in daily-life
scenarios. They usually need a host computer to stream and memorize EDA data.
Most of them are designed as “a glove-like wearable device” in order to acquire the
EDA signal from the palm of the hand [56, 57]. An example is given by the so-
called Galvactivator [56] developed by the MIT Media Lab group. It is “a glove-like
wearable device” that senses the wearer’s skin conductivity and maps its values to a
bright LED display”. The galvactivator device also provides a data port from which
an analog to digital converter can sample. The sensor is comfortable, but requires
that the wearer be cabled to a host computer to transmit EDA data.
A commercial wearable DC device is the Brainquiry [58]. As a maker of “neuro-
feedback, biofeedback and psychophysiological measuring equipment”, Brainquiry
sells a compact skin conductance sensor which uses Bluetooth to communicate with
a host computer. However, little information is provided by the manufacturer about
the proprietary design of the biofeedback amplifier.
Finally, as mentioned before, in one of our previous study [57], a novel EDA
glove based on textile-integrated electrodes has been developed. The novelty
consisted mainly in the use of integrated textile electrodes placed at the fingertips.
The system is able to acquire and process the DC skin conductance in order
to discriminate affective states. The textile electrodes have been compared with
Ag/AgCl electrodes demonstrating comparable performance. More specifically,
reported results on electrode characterization, performed by means of the voltage-
current characteristics, and its electric impedance showed that textile electrode
achieves a good electrical and thermal coupling with biological site. Moreover, a
dedicated experiment where 35 subjects were enrolled and aiming at discriminating
different affective states using only EDA was designed and realized. A new set
of features extracted from non-linear methods was used, improving remarkably
successful recognition rates. Results were, indeed, very satisfactory and promising
in the field of affective computing.
16 1 Electrodermal Phenomena and Recording Techniques
As described in the previous chapter, EDA broadly refers to any alterations in the
electrical properties of the skin. The most frequently used measure of EDA is the
SC. The SC signal can be decomposed in two components, tonic and phasic, which
have different time scales and relationships to exogeneous stimuli. Tonic phenomena
include slow drifts of the baseline skin conductance level (SCL) and spontaneous
fluctuations (SF) in SC [15]. The phasic component, i.e., the skin conductance
response (SCR), reflects the short-time response to the stimulus. The typical shape
of SCR is comprised of a relatively rapid rise from the conductance level followed
by a slower, asymptotic exponential decay back to the baseline.
When the inter-stimulus interval (ISI), i.e., the temporal gap between two
consecutive stimuli, is shorter than the recovery time of the first response, the
two SCRs overlap. This occurrence is observed in many experimental paradigms,
particularly in cognitive neuroscience where common values of ISI (1–2 s) are
generally shorter than the recommended minimum ISI to avoid such an overlap,
which is around 10–20 s [17, 61]. The overlap issue is probably the main limitation
in the treatment of the decomposition of SC into its phasic and tonic components.
Despite the wide use of EDA measurements and related research, the generation of
SCR via skin sympathetic nerve fibres is still unknown.
In the past two decades, several mathematical solutions have been developed to
decompose the phasic signal into individual SCRs associated with each stimulus,
even during short ISI experimental paradigms, and to model how ANS activity
(and, in particular, the sudomotor nerve activity) causes SCRs. This process allows
estimation of ANS activity with potentially better time resolution than using the
raw SCR signal. Many of the early methods, whose primary aim was to overcome
the overlap issue, required visual inspection and introduced subjective elements
into the analysis. For example, Barry et al. [62] attempted to correct the baseline
by subtracting each SCR from an extension of the preceding SCR using graphical
tools. Lim et al. [63] proposed a model based on a response function made of 4–
8 parameters optimized for each single response to obtain a response-by-response
variation in SCR shape. This method also required visual inspection to select the
best model setting.
Further assumptions have been related to the description of the peripheral ner-
vous system as a linear time-invariant (LTI) system [64]. In addition to decomposing
the phasic signal into individual SCRs, these models often attempt to estimate the
ANS activity by searching for the most likely input signal which could explain the
observed output (the measured SC). The first LTI model for EDA analysis was
presented by Alexander et al. [65]. Their method allowed the estimation of the
sudomotor nerve activity (SMNA) using a model where the SC is the result of a
convolution between discrete bursting episodes of the SMNA and a biexponential
impulse response function (IRF) assumed to be known a priori and time invariant.
Benedek and Kaernbach criticized some aspects of Alexander’s model and
developed two new models in which the LTI assumption was modified to take into
account the variability in SCR shape. In addition, they considered mathematical
constrains to significantly improve the reliability of the physiological modeling.
These models are known as the non-negative deconvolution model [16] and the
continuous deconvolution model [1]. Both models split the SMNA into two parts,
one describing the actual phasic activity and the other representing EDA variations
of different origins (e.g., noise). Both models assume a pharmacokinetic model of
the dynamic law of diffusion of sweat. They adopted a biexponential IRF, called the
Bateman function. Although observation noise is not formally modeled in any of
these methods [1, 16, 65], all of three assume its existence. They estimate a noisy
SMNA and then recover a filtered phasic component using a low-pass filter and a
subsequent heuristic and prefixed peak-detection scheme.
Recently, Bach et al. presented the SCRalyze toolbox (now incorporated into
PsPM and available online at: pspm.sourceforge.net), which comprises several
models that assume a linear time-invariant system [66]. Note that Bach’s model also
imposes mathematical constrains to improve the physiological mimicking. These
models and that of Alexander et al. use a heuristic IRF whose parameters have been
optimized on large datasets. SCRalyze algorithms try to estimate the model input
(SMNA) or parameters that best explain the observed SC data based on optimization
methods. Moreover, they include a noise term, which also accounts for possible
violations of the assumption of time invariance.
More recently, Chaspari et al. [67] proposed a sparse representation of EDA
but their use of overcomplete dictionaries leads to a non-convex problem with no
guarantee of finding the globally optimal solution. Since an great variety of practical
problems can be cast in the form of a convex optimization problem, mathematical
optimization has become an important tool in many disciplines and the list of its
applications is steadily growing [68].
2.2 EDA Analysis 21
EDA is a widely used measurement in, e.g, the psychophysiology research field.
In this context, one of the main goal is to infer on perceptual affective states
from physiological, peripheral, may be non-invasive measurements such as SCR.
However, sometimes psycho-physiological processes (e.g., sympathetic arousal) can
have higher time resolution than the observed variables (e.g., SCRs). This shortcom-
ing dramatically impact on the use of conventional analysis, which does not employ
any a-priori modeling. Consequently, recent years have seen an increased interest
in the design of causal models, aiming to estimating unobservable processes from
observable ones such that inference can be drawn from the unobservable variable
directly [64, 66].
In the following text, we take into account two different approaches: the
conventional and the model-based approach.
The aim of conventional analysis is to extract features from the observable variable,
in this case the SC signal, that can closely represent a psychological central state.
A conventional data analysis algorithm can be described as follows: (1) data filtering
to reduce the observation noise, (2) definition of a time response window in order to
identify only the peaks that are stimulus-evoked, and (3) definition of some criteria
to detect peaks within this window (e.g., a threshold beyond which the SC peak can
not be considered significant, see Fig. 2.1). After the identification of peaks, feature
extraction can be performed. These features are usually defined through qualitative
or semi-quantitative models.
22 2 Modeling for the Analysis of the EDA
Peak
Feature
Data filtering Segmentation detection
extraction
criteria
The model-based approach describes how the observable process, i.e., EDA, is
generated by the central process, (e.g., sympathetic arousal) using mathematical
equations that formulate psychophysiological assumptions. Therefore, the model
predicts the SC time series, and the independent variable consists in the unobserv-
able psychological status. However, in the analysis of experimental data we have to
consider the opposite situation: we know the observed SC data but not the central
state, and we try to estimate the time series of the central process that generated
these SC data. To this extent, the forward model has to be turned backwards,
to find the relation between SC and central state. In statistics, this mathematical
process is often termed “model inversion”. In conclusion, both conventional and
model-based analysis try to infer the CNS state from the peripheral signals, but the
difference is that model-based methods use a more stringent mathematical language
and computational methods to do so, while the general aim is the same [64].
Usually, the model-based methods distinguish two steps in the relationship
between the central state and the SC data: the neural model that specifies how the
central state, in terms of event-related or spontaneous sympathetic arousal, elicits
SMNA, and the peripheral model, from the SMNA to the SC, that specifies how
SMNA generates SC usually in the form:
SC D SMNA IRF
2.3 CDA: Continuous Deconvolution Analysis 23
where is the convolution operator, and IRF is a skin conductance impulse response
function. This model is, in a basic version, deterministic. Through this approxima-
tion, the SC time series is only influenced by SMNA and not by other confounding
factors such as noise (note that the deterministic deconvolution enhances noise [16]).
Different model inversions schemes treat this problem differently.
The evaluation of a model and the comparison between two or more methods is
hard to be performed. Each method returns an output that can be an index or the
central state, but we have to measure which method estimates the central state at
best. A possible evaluation process uses an experimental paradigm in order to create
two central states that are known to be different. Thus, the method can be evaluated
investigating the ability to detect this difference. This is achieved using a statistical
approach on the features extracted that quantify the central activation.
In this book, we first present a conventional modeling approach, the Continuous
Deconvolution Analysis (CDA) [1], and a recently proposed model proposed by
Greco et al. based on rigorous mathematical definitions: the convex optimization
approach (cvxEDA). In the next chapters, performance of this model will be
empirically evaluated and compared.
2.3.1 Preprocessing
In the preprocessing stage, the detection of movement artifacts was carried out
by visual inspection. Artifact-free signals exclusively were taken into account for
further analysis. In order to limit the frequency bandwidth of the EDA signal, it was
filtered with a low pass zero-phase forward and reverse digital filter [71, 72] with a
cutoff frequency of 2 Hz, having Buttworth approximation.
24 2 Modeling for the Analysis of the EDA
EDA is produced by changes in the skin conductivity as major effect of the sweat
gland activity. Specifically, sweat is released to the sweat duct, passes to the stratum
corneum, and finally is brought out of the skin. Accordingly, the dynamics of the
variation of concentration of sweat in the stratum corneum can be represented by
a two-compartment pharmacokinetic model in which the sweat concentration is
assumed to change only by diffusion [14, 73]. The first compartment represents
the sweat duct and the second compartment the stratum corneum. Being the two
compartments different in dimension (i.e. the stratum corneum is much larger
than the sweat duct), the diffusion can be considered as a one way-diffusion.
Solving the two coupled first-order differential equations of each compartment, the
solution is the Impulse Response Function IRF.t/ which is also known as Bateman
function [74]:
t t
IRF.t/ D .e 1 e 2 / u.t/ (2.1)
The Bateman function is characterized by a steep onset and a slow recovery. The
steepness of onset and recovery is determined by the time constants 1 and 2 .
EDA can be divided into tonic (SCL: Skin Conductance Level) and phasic
components (SCR: Skin Conductance Response). The tonic electrodermal com-
ponent represents the baseline level of the signal whereas the phasic component
indicates a direct response to a specific stimulus. However, there are often phasic
parts of EDA which cannot be related to any specific stimulus, and hence, they are
called spontaneous or nonspecific SCRs [23]. When the time interval between two
consecutive stimuli is shorter than the recovery period of SCR, the stimuli responses
in the SCR are overlapped. In this case, the typical shape of the SCR is lost and this
could be one of the main issue for the extraction of the correct information from
the electrodermal signal. In order to overcome this issue, the EDA signal process is
modeled as a convolution process between the SudoMotor Nerve Activity (SMNA),
as part of the sympathetic nervous system, and IRF [1] under the hypothesis that
EDA is controlled by SMNA resulting in a sequence of distinct impulses which
regulate the eccrine sweat glands dynamics (see Fig. 2.2).
Formally, it is possible to write:
Pre-processing Decomposition
Raw signal • Segmentation (Deconvolution) SMNA
(EDA) • Filtering • IRF=(exp(-t/τ1)-exp(-t/τ2))u(t)
Fig. 2.2 Electrodermal acquisition and decomposition process. The EDA is filtered to reduce the
noise and then decomposed in tonic and phasic components by means of a deconvolution with an
impulse response function (IRF) called Bateman function
2.4 CvxEDA: A Convex Optimization Approach to Electrodermal Activity. . . 25
2.3.3 Optimization
Starting from fixed values, the parameter set of the IRF (i.e. 1 and 2 ) was optimized
according to criteria evaluating the quality of the model, through the minimization of
a specific cost function given by the sum of the number of points of the DRIVERphasic
component that have negative value and the number of points above a predefined
threshold (equal to 5 % of the maximum of DRIVERphasic ). This procedure aims at
having a signal with a zero baseline a peaks as distinguishable as possible. More
details can be again found in [1].
CvxEDA has been inspired by some aspects of the CDA model, such as the
Batman function. This novel algorithm is based on three main concepts: maximum
a posteriori probability, convex optimization, and sparsity.
x C .1 /y 2 K (2.3)
26 2 Modeling for the Analysis of the EDA
8 x; y 2 K and 2 Œ0I 1. The meaning of the inequality (2.4) is that, for any
two points x and y in the domain of the function, the segment between .xI f .x// and
.yI f .y// lies above the graph of the function. Equivalently, we can define a convex
function as a function whose epigraph is a convex set [68].
Considering a standard optimization problem:
minimize f0 .x/
(2.5)
subj. to fi .x/ 0 i D 1; : : : ; m ;
the optimal choice is the one minimizing the objective function f0 .x/, which
represents the cost of choosing x, while simultaneously satisfying the constraints
fi .x/ 0. An optimization problem is convex when both the objective and the
constraint functions are convex. In the context of mathematical optimization, the
most important consequence of convexity is that necessary conditions for local
optimality are also sufficient for global optimality. Moreover, important categories
of convex optimization problems can be solved efficiently (this is rarely the case for
general nonconvex problems).
A special subclass of convex optimization problems is represented by least-
square problems where the goal is the unconstrained minimization of a quadratic
objective function kAx bk22 . For this class of problems—frequently arising in
regression analysis, parameter estimation and data fitting methods [68]—an ana-
lytical solution exists. An important statistical interpretation is that the least-square
solution coincides with the maximum likelihood estimation in the case of a linear
model corrupted by additive Gaussian noise. Regularization, e.g. adding a norm of
the optimization variable x as an extra term to the cost function, can be applied to
least-squares problems to prevent overfitting (L2 -norm) or to favour sparse solutions
(L1 -norm). While in the former case an analytical solution exists, in the case of the
L1 -regularization the problem can be cast as a quadratic program (QP), i.e. a convex
problem with quadratic cost function and affine constraints:
1
minimize xtrP x C qtrx C r
2 (2.6)
subj. to Hx g 0 and Ux v D 0 :
In this model, the EDA generation process based on the following assumptions:
2.4 CvxEDA: A Convex Optimization Approach to Electrodermal Activity. . . 27
A1) SCRs are preceded by bursts from the sudomotor nerves controlling the sweat
glands. These bursts are temporally discrete episodes [75, 76], i.e. SCRs are
generated by a neural signal that is sparse and non-negative because of the
nature of a nerve activity.
A2) The relationship between the number of sweat glands recruited and the
amplitude of a firing burst is linear [76]. Moreover, the output response of
the system depends only on the instant where the nerve input is applied. Stated
otherwise, the timecourse of a single SCR induced by a neural burst is not
influenced by previous ones, even when their SCRs overlap [77]. In the light
of these considerations it is reasonable to characterize the system as linear
time-invariant.
A3) The sweat diffusion process has a subject-specific impulse response function
(IRF) which is relatively stable for all SCRs from the same subject [1].
A4) This phasic activity is superimposed to a slowly varying tonic activity with
spectrum below 0:05 Hz [78], i.e. whose information content can be repre-
sented by samples spaced every 10 s (e.g., by 10-s averages in [15]).
A given N-sample long SC signal (y) is modeled as the sum of a tonic (t) and a
phasic (r) component plus an additive noise term ():
y D r C t C ; (2.7)
where y, t, r, and are N-long column vectors. The noise term is an iid (inde-
pendent and identically distributed) sequence of zero-average Gaussian random
variables with variance 2 , representing measurement and modelling errors.
The tonic component is represented as the sum of cubic B-spline functions with
equally-spaced knots every 10 s (assumption A4), an offset and a linear trend term:
t D B` C Cd; (2.8)
where B is a tall matrix whose columns are cubic B-spline basis functions, ` is the
vector of spline coefficients, C is a N 2 matrix with Ci;1 D 1 and Ci;2 D i=N, d is
a 21 vector with the offset and slope coefficients for the linear trend.
Within r, the shape of a single phasic response (under assumptions A2 and A3)
is modelled using a biexponential impulse response function, called the Bateman
function:
h. / D .e 0 e 1 / u. /; (2.9)
where 0 and 1 are, respectively, the slow and fast time constants while u. / is the
unitary step function. The Bateman function is the output of a bi-compartmental
28 2 Modeling for the Analysis of the EDA
pharmacokinetic model representing the diffusion of the sweat through the gland
ducts [73]. The Laplace transform of (2.9) is simply:
˚ 1 1
L h. / D 1
; (2.10)
s C 0 s C 11
where 01 and 11 are the poles of this second-order LTI system. Its discrete-
time approximation, obtained using central differencing (bilinear transform) s D
2 z1
ı zC1
with sampling interval ı, is the following ARMA model:
2
1 C z1
H.z/ D
C z1 C z2
D .11 ı C 2/.01 ı C 2/=.11 ı 2 01 ı 2 / (2.11)
D .2 11 01 ı 2 8/=.11 ı 2 01 ı 2 /
D .11 ı 2/.01 ı 2/=.11 ı 2 01 ı 2 /:
q D A1 p; r D M q; (2.12)
where: p represents the sudomotor nerve activity; q is an auxiliary variable that will
be used to find p indirectly; M is a tridiagonal matrix with elements Mi;i D Mi;i2 D
1, Mi;i1 D 2, 3 i N; and A is a tridiagonal matrix with elements Ai;i D ,
Ai;i1 D , Ai;i2 D , 3 i N.
Finally, the observation model (2.7) can be written as:
y D Mq C B` C Cd C : (2.13)
Given the observation model (2.13), the goal is to identify the maximum a posteriori
(MAP) spike train (p) and tonic component (t) parametrized by Œq; `; d, for the
measured SC signal (y):
Assuming independence between q, ` and d (i.e. between the phasic activity, the
slowly varying tonic component and the drift) and applying Bayes’ theorem, we
obtain:
pi Pois.ı/; (2.16)
where ı is the expected firing rate per bin, i.e. is the average number of spikes
per unit time. To keep the analysis tractable, the Poisson distribution is replaced with
an exponential distribution of the same mean [79]. In this way the constraint pi 2 N
can be relaxed to pi 0. Finally, since p and q are related by (2.12), the prior PŒq
becomes:
YN 1 1 pi YN 1 .Aq/i
PŒq D e ı / e ı : (2.17)
iD1 ı iD1
Concerning the tonic component, the authors make use of assumption A4 and
consider a uniform frequency spectrum in the band 0 0:05 Hz. Because equally-
spaced knots every
D 10 s are used, the sampling frequency is exactly twice the
upper band limit and the elements of the vector ` can be assumed iid. In particular,
a normal distribution is adopted for the amplitude at each knot `i N .0; `2 /. As a
result the prior PŒ` is:
YQ
1 1 `2i
PŒ` D p exp 2 ; (2.18)
iD1 2 ` 2 `
Replacing (2.17), (2.18) and (2.19) in (2.15) and taking the logarithm:
1 XN
ln PŒq; `; d j y D .Mq C B` C Cd y/2i
2 2 iD1
1 XN 1 XQ 2
.Aq/i 2 ` C const; (2.20)
ı iD1 2` iD1 i
with .Aq/i 0. Maximizing (2.20) yields the MAP solution to (2.14). After
multiplying by 2 and substituting ˛ D 2 =.ı/ and D 2=`2 , (2.20) is rewritten
as a constrained minimization problem in matrix form to obtain a more compact
notation. This optimization problem, that is termed cvxEDA, represents the core of
the algorithm presented in this manuscript:
1
minimize kMq C B` C Cd yk22 C˛ kAqk1 C k`k22
2 2 (2.21)
subj. to Aq 0:
After some matrix algebra, this optimization problem can be re-written in the
standard QP form and solved efficiently using one of the many sparse-QP solvers
available. After finding the optimal Œq; `; d, the tonic component t can be derived
from (2.8) while the sudomotor nerve activity driving the phasic component can be
easily found as p D Aq.
Although solving (2.21) is strictly equivalent to maximizing (2.20), the former
has a different interpretation. In the optimization problem, the objective function
to be minimized is a quadratic measure of misfit between the predicted and the
observed data. Prior knowledge is accounted for by means of additive regularizing
terms. For example, the spiking nature of the driving input (assumption A1) is
enforced by means of the l1 -norm penalization which is an effective way to sparsify
a signal while maintaining convexity [80–82]. Smoothness of the tonic curve
(assumption A4) is enforced by the choice of the basis (B) and through the l2 -
norm penalization of the spline coefficients. The two parameters ˛ and control
the strength of the penalty for the phasic and tonic components, respectively. A
large ˛ (stronger l1 regularization of p) yields a sparser estimate with most noise-
induced spurious spikes suppressed but also more signal distortion (i.e. attenuation
of genuine activations). Conversely, a small ˛ produces a less distorted but noisier
solution. Concerning , higher values mean a stronger penalization of `, i.e. a
smoother tonic curve.
Of note, CvxEDA algorithm is implemented in Matlab language and the soft-
ware is available online (www.mathworks.com/matlabcentral/fileexchange/53326-
cvxeda).
2.5 Feature Extraction 31
Regardless of the model or the type of analysis used to perform the decomposition
of the EDA signal, several features are extracted from tonic and phasic signals in
order to assess the sympathetic system activity.
Typically, time-domain features of EDA are widely used to quantify the overall
activation of the ANS. Features extracted from the phasic signal are usually calcu-
lated into time windows of 5 s after the onset of the external stimulus (according
to the knowledge that SCRs arise within 1–5 s after the stimulus onset [16, 83]).
Features extracted from the tonic component express the sympathetic tone and are
often computed within time windows of 20 s, since the upper cut-off frequency of
the tonic component is about 0:05 Hz [84]. Moreover, within the group of tonic
measurements, nonspecific skin conductance responses are included, by definition.
They are often characterized by their frequency and mean amplitude within the time
window analysis.
In Table 2.1, the features set is summarized along with the corresponding
description.
Table 2.1 List of the features extracted from the EDA phasic and tonic
components
Feature Description
nSCR Number of significant SCRs within
the time response windows (WTRW) of 5 s
MAX-Tonic Maximum value of the tonic curve
within the time window
MAX-Phasic Maximum value of the phasic curve WTRW
AUC-Tonic Area under the tonic curve over time
AUC-Phasic Area under the phasic curve WTRW
Mean-Tonic Mean value of the tonic component over time
Mean-Phasic Mean value of the phasic component WTRW
STD-Tonic Standard deviation of the tonic component
STD-Phasic Standard deviation of the phasic component WTRW
NsSCR freq Frequency of the NsSCRs
Mean-NsSCR Mean value of the NsSCRs component over time
32 2 Modeling for the Analysis of the EDA
Fig. 2.3 Diagram of the signal processing procedure to extract EDA and HRV parameters.
Courteously from [85]
Chapter 3
Evaluation of CDA and CvxEDA Models
Each synthetic SC time series lasted T D 90 s and was generated as the sum of three
terms: the first one, representing the phasic component of the EDA, was obtained
as the result of a convolution between a synthetic SMNA and a biexponential IRF
(1 D 0:7 s, 0 Unif.2:0; 4:0/ s); the second one was a slowly varying signal
representing the tonic component, obtained as a linear trend plus a sinusoid with
a period Tt Unif.45:0; 90:0/ s; the third term was an additive white Gaussian
noise (AWGN). The sudomotor nerve activity driving the phasic component was
simulated by placing 10 pulses of unit area (modelling neural bursts) at random
times with a minimum 1-s distance between them and from the two ends. To test the
ability of the two methods to recover partially overlapping SCRs in the presence of
noise, two sets of 100 time series were generated with different levels of signal-to-
noise ratio (SNR): 33 dB and 13 dB (defined as 10 log10 .a2 =N2 / where a is the foot
to peak amplitude of a single SCR and N2 is the AWGN variance).
In the first experiment, 15 healthy subjects (aged 18–35 years; 7 females) performed
a forced maximal expiration task [89], in which they were asked to breathe out with
the maximum possible intensity in order to trigger the ANS-mediated expiration
reflex. All subjects gave written informed consent prior to taking part in the study,
which was approved by the local Ethics Committee. A Biosemi Active II system
was used to acquire the SC signal and the respiratory effort (by means of a thoracic
respiration belt). The protocol started with the subjects breathing normally and
resting in front of a grey monitor for 3 min in order to record their baseline levels.
This was followed by three stimulus sessions in which subjects had to perform a
deep expiration whenever the colour of the screen background changed to black.
Each session consisted of six forced expirations with a variable ISI chosen randomly
among 4, 8 and 12 s. Consecutive sessions were separated by a 30-s recovery
interval.
This experimental paradigm was chosen to obtain SC signals in which the
presence of an autonomic response to the stimulus was as objective and reliable
as possible. In fact, previous studies have shown that the forced expiration protocol
is a valid method of evoking SCRs unaffected by emotional change with more stable
waveform patterns, less habituation and better reproducibility than other means
of stimulation (including electrical) [89]. In this way, the presence of at least one
SCR after each stimulus was ascertained, allowing determination whether the new
methodological approach was able to separate and identify each phasic response
even when stimuli were close to each other and their SCRs overlapped.
3.4 EDA Processing and Analysis 37
Table 3.1 Arousal rating of Session Arousal rating Arousal range Arousal level
IAPS images used
N 2:81 ˙ 0:24 2.42–3.22 VL
A1 3:58 ˙ 0:30 3.08–3.98 L
A2 4:60 ˙ 0:31 4.00–4.99 L-M
A3 5:55 ˙ 0:28 5.01–6.21 M-H
A4 6:50 ˙ 0:33 5.78–6.99 H
In the second experiment, 15 healthy subjects (aged 22–26 years; 7 female) different
from the previous ones were stimulated by viewing affective images from the
official IAPS database [90] to assess algorithm’s predictive validity, i.e. its ability
to distinguish stimulations with different arousal content and provide meaningful
information about ANS activation. All subjects gave written informed consent
before taking part in the study, which was approved by the local Ethics Committee.
Subjects were comfortably seated in an acoustically insulated room watching the
slideshow on a computer screen while their SC was recorded using a BIOPAC
MP150 physiological acquisition system. The affective elicitation consisted of four
arousal sessions alternated with four neutral sessions: N, A1, N, A2, N, A3, N, A4;
where N sessions are sequences of 6 very low arousal (VL) images while Ai (with
1 i 4) are sets of 20 images eliciting increasing levels of arousal. Details about
arousal rating values are reported in Table 3.1. Arousal sessions were classified as
Low (L), Low–Medium (L–M), Medium–High (M–H) and High (H) according to
the IAPS score criteria. Each image was presented for 10 s.
For each dataset, the continuous deconvolution analysis and the convex-
optimization-based EDA model described in the previous chapter were applied
to each SC time series. As per assumption of linearity and time-invariance system, a
subject-specific IRF was considered for this study. Concerning the CDA algorithm
1 and 2 parameters were optimized according to criteria for reduction of the
number of points of the phasic driver component that have negative value and the
number of points above a predefined threshold. Concerning the cvxEDA model,
while 1 D 0:7 s was used for all subjects, the optimal 0 was determined on a per-
subject basis as the value 0 2 Œ2:0; 4:0 s that minimized the l2 -norm of the residual
after fitting the cvxEDA model. Fixed values ˛ D 0:4 and D 0:01, chosen during
previous exploratory tests on separate data, were employed throughout this analysis.
The accuracy of the algorithms on the synthetic dataset was assessed by
measuring its ability to recover the neural activations in the phasic driver from
noisy SC time series. For each time series, the set of occurrence times T e of pulses
38 3 Evaluation of CDA and CvxEDA Models
with area exceeding a 0:5 threshold was compared to the set of times T s of the
impulses in the synthetic SMNA using an algorithm modelled after the AAMI/ANSI
EC38:1998 standard. Briefly, times in T e and times in T s were considered as
“matching” if they were within a match window of ˙0:15 s. Each impulse from
either signal could only match a single impulse from the other one. Times in T e not
matching any element of T s were considered false positives (FP) while times in T s
not matching any element of T e were considered false negatives (FN). Finally, the
performance of the algorithm was measured in terms of sensitivity, computed as the
fraction of matched elements of T s , and positive predictive value (PPV), computed
as the fraction of matched elements of T e .
In the respiratory stimulation dataset, the presence of an estimated burst of
SMNA activity was verified in each 5-s time window following a stimulus onset,
in order to prove the model’s ability to correctly detect real SCRs.
In the last study, to verify that the recovered components represented meaningful
information regarding ANS activity, we investigated whether the amplitude of the
phasic driver p increased in response to affective stimulation with increasing levels
of arousal, as previously reported in the literature [91, 92], [15, Chap. 2.2.2]. An
intersubject analysis compared the responses to the four arousal levels through a
non-parametric Page test [93, Chap. 7.2], under the alternative hypothesis of increas-
ing phasic responses with increasing levels of arousal (we used non-parametric
tests because the hypothesis of Gaussianity was rejected by a Kolmogorov–
Smirnov test, p < 0:05). In post-hoc analysis, each pair of arousal sessions was
compared using a one-tailed Wilcoxon signed-rank test with Bonferroni correction
to determine significant differences between arousal levels in the expected direction.
We computed the adjusted p-value, i.e. the original p-value multiplied by 6 (the
number of pairwise comparisons among 4 conditions), to allow direct comparison
to the standard significance levels (e.g., 0:05). In the following,
p we also report the
Z-scores (from which the measure of effect size ZN D Z= N can be computed,
where N D 15 is the sample size). Finally, the slow tonic component was analyzed
comparing mean values of each arousal session with the preceding neutral session,
using a one-tailed Wilcoxon signed-rank test.
For all EDA datasets analyzed, the cvxEDA model produced the expected results:
the SC data (Fig. 3.1a) was decomposed into two signals, a sparse component p
and a smooth component t, that we interpret as the activity of the sudomotor nerve
(Fig. 3.1b) and the tonic level (Fig. 3.1c).
3.5 Experimental Evaluation Results 39
2.0 (a)
y [n.u.]
1.0
0.0
–1.0
1.0
(b)
0.8
p [a.u.]
0.5
0.2
0.0
2.0 (c)
t [a.u.]
1.0
0.0
–1.0
0 100 200 300 400 500 600 700 800 900 1000
τ [s]
Fig. 3.1 Application of the cvxEDA decomposition procedure to the SC signal recorded during
the forced maximal expiration task for a representative subject. (a) Raw SC signal, Z-score
normalized. (b) Estimated sparse phasic driver component p. (c) Estimated slow tonic component t
Application of the cvxEDA model to the synthetic dataset highlighted the sparsity
of the p term, as well as the smoothness of the tonic component, even with low SNR.
Qualitative visual inspection analysis of Fig. 3.2 was sufficient to determine that the
algorithm worked properly on these data. A quantitative proof was also pursued in
terms of detection performance while recovering the neural activations in the phasic
driver from the noisy SC time series. In the high-SNR test the algorithm achieved
99:3 % sensitivity and 100:0 % PPV on average, whereas it scored 96:7 % sensitivity
and 91:3 % PPV in the low-SNR condition. Concerning the analysis performed
through the Ledalab software, a preliminary visual inspection analysis showed
a clear underestimation of both the tonic and the synthetic SMNA input signal
Fig. 3.3. All peaks, visually detectable after a zoom of the estimated phasic driver
component Fig. 3.4, results under the applied threshold. Despite all, by applying a
different threshold in the high-SNR test, the algorithm achieved 46:57 % sensitivity
and 67:03 % PPV on average, whereas no statistical consideration can be made about
the low-SNR condition. Furthermore, the sparsity of the phasic CDA signal is not
remarkable. While cvxEDA shows a robust behavior, the performances of Ledalab
software are extremely dependent from the additive noise power level.
40 3 Evaluation of CDA and CvxEDA Models
4.0
simulated EDA, y
3.0 t
simulated
tonic
[a.u.]
2.0 simulated
SMNA
1.0 p
0.0
4.0
3.0
[a.u.]
2.0
1.0
0.0
0 10 20 30 40 50 60 70 80 90
τ [s]
Fig. 3.2 Solution of EDAcvx model applied to a synthetic signal with different levels of additive
white Gaussian noise (top: 33 dB; bottom: 13 dB)
4.0
3.0
[ a . u].
2.0
1.0
0.0
4.0
3.0
[ a . u].
2.0
1.0
0.0
0 10 20 30 40 50 60 70 80 90
τ [s]
Fig. 3.3 Phasic SMNA estimation using a CDA model applied to a synthetic signal with different
levels of additive white Gaussian noise (top: 33 dB; bottom: 13 dB)
3.5 Experimental Evaluation Results 41
0.3
0.2
[a.u.]
0.1
−0.1
0 10 20 30 40 50 60 70 80 90
τ [s]
0.3
0.2
[a.u.]
0.1
−0.1
0 10 20 30 40 50 60 70 80 90
τ [s]
Fig. 3.4 Solution of CDA model applied to a synthetic signal with different levels of additive
white Gaussian noise (top: 33 dB; bottom: 13 dB)
A visual inspection analysis of time series recorded during the forced maximal
expiration protocol confirmed the effectiveness of the paradigm in eliciting strong
SCRs that were partly overlapped because of short ISIs (Fig. 3.5). After applying
the cvxEDA algorithm, we considered peaks in the p signal within a 5-s time
window post-stimulus (considering the latency of a typical SCR [15]). Inter-subject
analysis indicated that the algorithm was able to identify the corresponding phasic
peak after 96:5 % of the stimuli and overcome the overlap issue. Furthermore, a
visual inspection analysis of the raw SC data in the time windows after stimuli
that were not identified by the algorithm showed the almost complete absence
of a SCR, probably because of incorrect performance of the task by the subject.
Results gathered from the CDA show a percentage of identification of the respiratory
stimulus of 93:01 %. The difference was due to some consecutive overlapped
stimuli, which were not recognized by CDA.
42 3 Evaluation of CDA and CvxEDA Models
10
8
y [μS]
2
0
8
6
p [μS]
2
phasic driver [μS]
0
8
0
0 50 100 150 200 250 300 350 400
τ [s]
Fig. 3.5 Example of SC raw data (top) and its estimated phasic component by cvxEDA (middle)
and CDA (bottom) models during the forced maximal expiration task. Dotted lines mark the onset
of the visual cue triggering a forced expiration
Statistical analysis of the tonic and phasic driver components confirmed the
ability of the cvxEDA algorithm to characterize the ANS activity. Page-test results
comparing the four arousal sessions indicated a strong significant (p D 106 ,
L D 428, L D 4:74) relationship between the arousal level and the phasic driver
peak amplitude (see also Fig. 3.6(left)), which is the most appropriate parameter to
quantify ANS activity [15]. Post-hoc Bonferroni-corrected pair-wise comparisons
revealed significantly larger phasic responses to A2 than to A1 (p D 0:040,
Z D 2:47), to A3 than to A1 (p D 0:018, Z D 2:75), to A4 than to A1 (p D 0:002,
Z D 3:32), and to A4 than to A2 (p D 0:004, Z D 3:21). Concerning the tonic
component, the Wilcoxon test showed that the tonic mean values were significantly
higher during arousal than during neutral sessions (p D 0:001, Z D 3:147,
ZN D 0:813).
3.5 Experimental Evaluation Results 43
cvxEDA CDA
Mean ± SE Rank 4.0
3.0
2.0
1.0
A1 A2 A3 A4 A1 A2 A3 A4
* **
* *
**
Arousal Level
Fig. 3.6 Within-subject ranks of the peak amplitudes of the phasic component obtained by
cvxEDA (left) and CDA (right) for the four arousal levels. Dots mark the across-subject average
rank for each level while the wiskers indicate the standard error. The hypothesized effect of arousal
level on the phasic component was confirmed by the Page test (p D 106 for cvxEDA, p D 0:001
for CDA). Post-hoc Bonferroni-corrected pair-wise comparisons of the peak amplitudes found
significant differences in the cases indicated by asterisks ( W p < 0:05I W p < 0:01)
CDA estimated a phasic driver whose peak amplitude within each arousal
session increased with increasing levels of arousal (Page test p D 0:001, L D
409, L D 3:04). However, a post-hoc analysis revealed a significant difference
(p D 0:010, Z D 2:93) only between the most extreme sessions, A4 and A1
(see Fig. 3.6(right)). Overall, the new approach provides a stronger correlation
and augmented discriminant power, with respect to the elicited arousing session,
than CDA.
Chapter 4
Emotions and Mood States: Modeling,
Elicitation, and Recognition
Emotions are usually defined through features related to facial expressions, lan-
guage, gestures or posture, rather than their essential nature. This is due to the
enormous complexity of emotional phenomena that are often interrelated [94–98].
Moreover, they are the result of the continuous adaptation of individuals to changes
in the social environment. Indeed, the simultaneous coexistence of multiple positive
or negative emotions makes developing an accurate system of recognition of
emotions very complicated.
Nowadays, a remarkable theory of emotion belongs to the neurobiologist Joseph
LeDoux, who believed that the perception of an affective stimulus activates two
neural information paths that are independent but interconnected: an unconscious
path activating the peripheral reactions, and a conscious one producing the experi-
ence cognitive awareness of emotion [99]. According to LeDoux, one of the neural
centers related to the unconscious way is the amygdala. This region makes a first
process, of the stimulus perception, even with absence of awareness. Therefore, the
peripheral activation could be emotion-specific even before the complete conscious
experience. This mechanism is important in an evolutionary contest: considering
a potentially dangerous stimulus, the unconscious processing allows to implement
adaptive responses such as we have time to prepare the most appropriate response.
After a more detailed analysis of the stimulus that occurs through the sensory
cortices and the conscious processing, in case the stimulus is not considered really
dangerous, the system falls rapidly to the basal state, otherwise the activation of
the emotional response is supported and retained. The presence of a direct visual
pathway that overcomes the visual cortex has recently been demonstrated in blind
patients due to a partial or total destruction of the primary visual cortex. Although
they could not consciously see anything, they had a significant activation of the
amygdala in MRI, when they presented images of affective face expressions [100].
In LeDoux’s theory, the conscience of the stimulus does not result from the periph-
eral response, and the awareness of emotion does not cause a peripheral change.
Rather, the perception of an affective stimulus determines both the peripheral
change (through a subcortical region that is inaccessible to consciousness) and the
awareness of emotion. The unconscious and conscious systems affect each other
increasing or decreasing their activity.
What are emotions? A definition that partly reflects LeDoux’s theory describes
them as a quick, automatic, and stereotyped response of the organism to a
potentially significant stimulus to the survival of the individual or the species.
Certain situations can be unpleasant or dangerous to the survival of every individual.
For this reason it is necessary for the body to react as quickly and effectively as
possible. For example, in a very dangerous situation we do not often have time to
think, or rather the body must be prepared to react before we start thinking. To
make this possible, our physiological response must be automatic and consequently
stereotyped, identical or very similar in different situations in which we experience
the same emotion: fear of a bear can not be different from the fear of a lion or a
robber. The last aspect is probably related to the fact that emotions (at least those
considered primary ones as anger, surprise, happiness, sadness, fear and disgust) are
not only present in humans but are universally expressed in the same way. Studies
on facial expressions by the American psychologist Paul Ekman confirm this idea:
we can feel fear for different things, but always in the same way [101]. Of note, the
time profile of the emotions changes rapidly both in the onset, and during the phase
of extinction. In fact, when the emotive situation is concluded, typically the emotion
ends in both the cognitive and peripheral psycho-physiological component.
In the literature, several theories for modeling emotions have been proposed.
Discrete, dimensional, appraisal and dynamical models are the most interesting, but
one cannot exclude the others.
In discrete models, emotions can be seen as the result of a selective adaptation
that ensures survival [102]. This survival concept could be illustrated by the
following relation: danger D> fear D> escape D> survival. The result of this
selection is a small set of basic, innate and universal emotions. For instance, Ekman
proposed 6 basic emotions which are identified on the basis of facial expressions:
anger, disgust, fear, joy, sadness and surprise [103, 104]. Besides, in the literature
other discrete models have been proposed and they include more or less basic
4.2 Modeling Emotions 47
It has been demonstrated that ANS dynamics reflects measurable changes according
to subjects’ emotional experience [126, 127].
ANS is a control system in charge of the regulation of peripheral functions
such as heart rate, digestion, respiratory rate, pupillary response, urination, and
sexual arousal [128]. This system comprises two components: the sympathetic
and parasympathetic nervous systems. Due to its aspecific nature, the ANS is not
involved only in emotion regulation, but includes a wide variety of other functions
related to stress, attention and so on [129]. As a matter of fact, several physiological
ANS signs (e.g. HRV, respiration activity, EDA, pupil size and eye movement
variation) correlate with subject behavior or emotional status [130–141]. The most
commonly used indexes of activation of the ANS are based on EDA (i.e., sweat
glands) or cardiovascular dynamics [142]. As mentioned in the previous chapters,
EDA is typically quantified in terms of skin conductance and primarily reflects
sympathetic activity.
In 1884, James was the first psychologists to claim that different emotional
states (such as sadness, anger or fear) involve specific parameters of activation
of the autonomic nervous system [143]. This principle has been very important
in many theories of emotions [144, 145], and most of the researches inspired by
4.3 Autonomic Nervous System Correlates of Emotions 49
Arousal
Alert
Tense
ACTIVATION
s Ex
ou ict
ed
N erv
ed
El
ss
ate
re
d
St
Happ
Upset
y
UNPLEASANT PLEASANT
Valence
nted
Conte
Bor
DEACTIVATION
ed
e
D
ren
ep
re Se
ss
ed d
la xe
Sad Re
Calm
Fig. 4.1 A graphical representation of the circumplex model of affect with the horizontal axis
representing the valence dimension and the vertical axis representing the arousal or activation
dimension
James’s theory have focused on ANS measures. The strong scientific interest in the
specificity of the ANS is due to popular thought that emotions involve discrete types
of activation of the ANS (e.g., the supposed link between anxiety and increased
heart rate [146]). However, there are conflicting positions in the scientific society.
Although some works have reported evidences for specificity of the ANS [147–149],
a recent meta-analysis has featured such effects as inconsistent [150]. In this meta-
analysis, only a small part of the analyzed ANS correlates reliably distinguished
discrete emotions, highlighting the lack of ability to support the hypothesis of
specificity of ANS [150].
Given these considerations, Lang et al. [90] have shown in some studies that
the level of skin conductance increases systematically and linearly depending on
the general arousal level of some emotional stimuli. In addition, the relationship
between affective stimuli and EDA is note independent from the valence, from the
kind of stimulation, and certainly from the specific emotion that was originated.
These findings were consistent with theories that argue that ANS activities indicate
the level of excitement of the emotional state rather than its basic emotion
[151, 152]. However, not all measurements of the ANS can be mapped in a single
dimension. In accordance with the principle of “directional fractionation” [153],
different measurements of ANS activity can operate independently or contrariwise.
50 4 Emotions and Mood States: Modeling, Elicitation, and Recognition
For example, the heart rate decreasing can contribute to an increase in sympathetic
activity as assessed by other ANS correlates [154]. In order to explain such
fractionation of the ANS, at least two dimensions should be considered, e.g., the
valence. For example, Cacioppo et al. [150] revealed that cardiac output, blood
pressure, heart rate and skin conductance respond to the emotional valence.
Although the ANS seems sensitive to dimensional rather than to discrete emo-
tional states, taking in account many ANS correlates can help achieving a greater
degree of autonomic specificity [155]. For example, anger and fear, despite their
alignment in terms of valence and arousal, can be differentiated by a combination
of cardiovascular and respiratory measures [142]. Thus, combinations of several
measures ANS can provide a better (automatic) detection of discrete emotional
states.
Table 4.1 Performance of the peripheral biosignal based emotion recognition methods reported in
the literature of last decade
Authors Signals Elicitation Emotion classes Accuracy Best results (%)
Yoo et al. ECG, EDA Video clip Sad, calm ANN 80
[166] pleasure,
Interesting
pleasure, Fear
Choi & Woo BVP, EDA Music and Joy, anger, and ANN 74.5
[167] image sadness
choosen by
subject
Healey & EMG, ECG, Driving 3 Stress levels LDA 97
Picard [168] EDA, RSP
Li & Chen ECG, BVP, Film clips Fear, neutral, and CCA 93.33
[169] EDA, ST joy
Rani et al. ECG, BVP, Cognitive Engagement, SVM 86
[170] EDA, EMG tasks (i.e. anxiety,
Anagrams boredom,
and Pong) frustration and
anger
Rainville ECG, RSP, Self induction Anger, fear, SDA 49
et al. [171] EDA, EMG happiness,
sadness
Zhai & EDA, BVP, Stroop test 2 stress levels SVM 90
Barreto [172] PD, ST game
Leon et al. ECG, EDA, IAPS Neutral, negative, ANN 71
[173] BVP positive
Liu et al. ECG, ICG, Cognitive Anxiety, SVM 83
[174] BVP, HS, tasks (i.e. engagement,
EDA, EMG, anagrams and liking.
ST Pong.)
Katsis et al. EMG, ECG, Car-racing High stress, low SVM 79.3
[159] RSP, EDA drivers stress,
disappointment,
euphoria
Yannakakis & ECG, BVP, Interactive 2 fun levels SVM, 70
Hallam [175] EDA games ANN
Kim & André EMG, ECG, Music 4 musical LDA 70/95
[132] EDA, RSP listening emotion
Katsis et al. BVP, ECG, IAPS Relaxed, neutral, ANN, 84
[133] EDA, RSP startled, SVM
apprehensive,
very
apprehensive
Gouizi et al. EMG, RV, IAPS 6 discrete SVM 75–83
[176] SKT, EDA, emotion
BVP, ECG
(continued)
52 4 Emotions and Mood States: Modeling, Elicitation, and Recognition
How emotions can be elicited to humans is a crucial issue still open. Many
researches have turned their attention to sensory stimuli able to elicit emotions.
The difficulty associated with the elicitation is related to a complex interaction
between cognition and neurophysiological changes. Several modalities and percep-
tual channels could be used for this purpose, which can be thought as affected by
several “noisy” factors, including psychophysiological processes such as attention,
social interaction, and body-to-biosensors connections. In the literature, a wide
4.6 Emotions and Mood Disorders: Bipolar Disorder 53
range of elicitation methods have been applied: introspection, movements, lights and
colors [188], set of actions, images (e.g IAPS described below) [90, 189], sounds
(e.g., music and IADS described below) [132, 190–193], (fragments of) movies
[194, 195], speech [196], commercials [197], games, agents/serious gaming/virtual
reality [173], reliving of emotions [198], real world experiences [168, 199] along
with using personalized imagery stimuli [131].
In order to evoke affective states in a laboratory setting, some authors have
assembled sets of pictures [90], sounds [200], odorants [201], and words chosen
to elicit a range of positive, neutral or negative affective states. In such studies,
subjects had to rate pictures, sounds, odorants or words in terms of pleasure and
arousal. Results indicated that the shape of the distribution is very similar across all
sensory stimulations [142].
In this perspective, the International Affective Picture System (IAPS) [90] and
the International Affective Digital Sounds system (IADS) [202] are two of the most
frequently used tools in the area of affective elicitation. They consist of hundreds of
images and sounds with associated affective scores. A commonly used approach is
to have a collection of stimuli where each one is slightly varied in terms of intra-
individual standard deviation of affective ratings.
In several experiments reported in this book, a set of images gathered from the
IAPS was chosen [203]. IAPS is a set of 944 images having a specific emotional
rating, in terms of valence, arousal, and dominance. The emotional ratings are
based on several studies previously conducted where subjects were requested to
rank these images using the self assessment manikin [204]. The elicitation by
IAPS is able to activate segregated neural representations of the different emotion
dimensions in different prefrontal cortical regions [205, 206]. Touch is another
sense capable of inducing emotions. Specifically, affective haptics is the science
referring to the ability of haptic systems to communicate emotions, possibly
affecting social behavior and interactions [207]. This is possible because of the
action of a specialized kind of tactile sensors in the skin, i.e., the unmyelinated
CT tactile fibers [208, 209], whose activity is linked to the controlateral primary and
bilateral secondary somatosensory area, as well as contralateral middle and posterior
insula cortex [210]. Previous studies demonstrated how these fibers are sensitive to
changes in the physical characteristics of the haptic stimulus. Specifically, changes
in contact force and velocity of human caresses can vary the valence perception
(pleasantness/unpleasantness) of the stimulus [211, 212].
including loss of appetite and sleep are also present. Depressed patients might also
experience thoughts of ruin, guilt or death including suicidal thoughts that might
lead to suicide attempts. During manic episodes, patients are hyperactive, and often
experience a reduction of the need to sleep. Mixed states are characterized by both
depressive and hyperactivity symptoms. In the intervals between these episodes,
patients typically experience periods of relatively good emotional balance (labeled
as euthymia). Moreover, mood swings are also usually accompanied by anxiety,
which is associated with bipolar disorder either as a symptom of the bipolar disorder
itself or as a separate pathological condition [214].
Despite the great impact of bipolar disorder on the population and healthcare
costs, current clinical practice still relies only on the physician expertise, rating
scales and questionnaires, such as the Bauer Internal Mood Scale, the Hamilton
Scale for Depression and the Young Mania scale [215]. Physiological parameters
(e.g., biological markers, physiological signals, etc.) are not taken into account
for diagnosis or follow-up purposes [216–218]. As a matter of fact, there is the
need of more objective parameters for the diagnosis of mental disorders. These are
long-term illnesses and may remain undetected for years before they are properly
diagnosed and put under treatment. Moreover, patients are extremely heterogeneous
with respect to the phenomenology and severity of symptoms, number and duration
of episodes, as well as time interval between them. Finally, other disorders may also
be present (i.e., comorbidity).
Previous researches have shown a link between Autonomic Nervous System
(ANS) dysfunctions and bipolar disorders [8, 219–222]. Specifically, studies on
sleep [223], voice analysis [10], and circadian heart rate rhythms [224, 225] showed
to be sensitive to changes in clinical state, suggesting that these parameters may be
considered as markers of clinical change. Moreover, it is known that electrodermal
hypoactivity is present during depression in both unipolar and bipolar patients
[226, 227]. This condition is stable over time, and does not appear to depend on
experimental conditions or stimulus characteristics [228].
Since changes on EDA are directly related to the sympathetic activity [23], EDA
analysis could serve as an effective ANS marker for characterizing different mood
states.
The study on bipolar patients presented in this book was carried out in the
frame of the European project PSYCHE, which stands for personalized monitoring
systems for care in mental health. Within such a project, a personalized, pervasive,
cost-effective, and multi-parametric monitoring system based on textile platforms
and portable sensing devices was devised for the long-term and short-term analysis
of mood disorders [8–10].
Chapter 5
Experimental Applications on Multi-Sensory
Affective Stimulation
assessment of pathological mood states in bipolar disorder. EDA features were also
inputed to an automatic classifier for mood state recognition, achieving an accuracy
> 80 % for the inter-subject analysis while discerning between the state of good
affective balance (euthymia) and severe mental states such as depression and mixed
state. Therefore, experimental evidences on the correlation between pathological
mood disorders and EDA were found, and the obtained results are promising for an
effective and objective biosignal-based mood recognition [9, 11].
In all of the experimental applications described in this book, each participant took
part to all the different arousal and valence level sessions. Consequently, to compare
the arousal and valence stimuli, a paired dataset has to be considered. Therefore,
a non linear classifier for paired data was implemented and used. This classifier,
called paired within-rank (PWR) K-NN, acts in two steps: first, transforms the
features set in a within-subject rank matrix based on the Friedman test statistic; then,
the ranked matrix is applied as input of a k-Nearest Neighbor algorithm (K-NN)
classifier.
In detail, the dimension of the feature space was given by the number of selected
EDA parameters. The final features set was achieved identifying the group of
parameters that performed a higher recognition accuracy in the training set. For the
5.2 Classification Procedure 59
Decomposition process
Phasic cvxEDA
and or EDA
Tonic CDA
Feature Extraction
and Statistical Analysis
Training Set
Feature
of N-1
Selection
subject
Fig. 5.1 Overall block scheme of the proposed emotion recognition system. The EDA is processed
in order to extract the phasic and tonic components. According to the protocol timeline, several
features are extracted and, then, statistically compared. The PWR-KNN algorithm is engaged to
perform pattern recognition by adopting a leave-one-subject-out procedure
arousal and valence recognition, the number of samples (i.e., rows of the dataset)
in such a space was related to the number of subjects multiplied by the number
of arousal or valence levels to be classified. Before the classification procedure,
the values of each feature (i.e. a column of the dataset) were transformed ranking
the data within each subject. Finally, the ranked feature set was used as input to
a K-NN Classifier [252, 253], which was validated through the LOSO procedure
[254], as described at the beginning of this section. A block diagram of the proposed
recognition system is illustrated in Fig. 5.1.
The K-NN is considered one of the simplest machine learning procedures. An
example is classified considering the majority vote of its neighbors K, which
is a positive integer typically not very large. The choice of K depends on the
characteristics of the data. Generally the increase of K reduces the noise that affects
the classification, but the criterion of choice for the class becomes more labile.
The choice is made by means of heuristic techniques such as the cross-validation.
The space is partitioned into regions based on the positions and characteristics of the
training examples. For the calculation of the distance, examples are represented by
60 5 Experimental Applications on Multi-Sensory Affective Stimulation
The SVM method was used as an alternative method for recognizing the valence
and arousal levels in case of unpaired dataset (as in the tactile paradigm,
see Sect. 5.5.5.4). It replaces the ranked transformation function and the K-NN
method.
The classification problem can be viewed as a task of finding a separating
hyperplane that divides the examples belonging to each different classes. Also in this
case, the classification procedure consists of two phases: training and testing. Firstly,
the dataset is divided into two groups: the training set and the test set. During the
training, the algorithm estimates a hyperplane that separates the examples contained
in the training set according to their labels. The decision function that has been
learned from the training data then can be used during the testing phase to predict
the class of a new test example. More details about SVM can be found in [255].
Of note, all classification results are expressed in this book as recognition
accuracy in form of confusion matrices [256]. A generic element cij of a confusion
matrix indicates the percentage of how many times the feature set belonging to the
class i, was recognized as belonging to the class j. This means that a higher average
of the values on the matrix diagonal corresponds to a better degree of classification.
The most widely used affective stimulation is the presentation of images taken
form the IAPS database. They consist of hundreds of images, with associated
standardized affective values (in terms of arousal and valence). In the literature
there are several studies where IAPS database is used to estimate the affective
state (especially when the arousal level changes) through the study of physiological
signals, such as electroencephalogram [257–260], blood oxygen level dependent
signals [261], facial electromyograms [92, 262, 263], EDA [264–266] and HRV
[248, 260, 267]. Concerning EDA, many studies over the past years have demon-
strated that the amplitude of the skin conductance changes and the intensity of
emotional experience is almost linearly associated with the arousal dimension,
[92, 268, 269].
5.3 Affective Visual Elicitation 61
In the previous chapter (see Sect. 3.3), the relationships between the arousal
levels of the IAPS and the phasic component features coming from CDA and
cvxEDA models, have been already statistically compared. CvxEDA has shown a
statistical discriminant power higher than CDA. Indeed, all the four arousal levels
were significantly different and showed a trend with a high correlation with the
arousal scale. CDA significantly discriminated only the extreme levels but not the
middle ones, see Fig. 3.6.
Furthermore, a multivariate pattern analysis using the PWR KNN classifier was
performed in order to automatically recognize the four different levels of arousal
and two levels of valence.
Valence
Neutral
Fig. 5.2 Timeline of the experimental protocol in terms of arousal and valence levels. The vertical
axis relates to the valence score, whereas the horizontal axis relates to the time
62 5 Experimental Applications on Multi-Sensory Affective Stimulation
Concerning the arousal levels, the multivariate pattern recognition analysis showed a
strong difference between the accuracy of the cvxEDA feature (71:67 %) set and the
CDA feature set (36:66 %) (see Tables 5.1 and 5.2). Instead, concerning the valence
levels, the two models demonstrated the same performance, discerning pleasant and
unpleasant images with a similar accuracy of 68:33 % (see Tables 5.3 and 5.4).
5.4 Affective Sound Elicitation 63
One of the computational study proposed in this book aims to recognize emotions
through the identification of the elicited arousal and valence levels of standardized
acoustic stimuli gathered from the International Affective Digitized Sound system
(IADS) [270]. Likewise the IAPS database [202, 271], IADS is a database of
affective sounds characterized in terms of valence and arousal dimensions [270].
The auditory stimulus is one of the most powerful means to induce and
communicate emotions to people. It is easy to understand the role that voice tone
plays in conveying speaker affect [272], or in eliciting hearer emotions. The music
is another way to enhance orally expressed affective messages [273, 274]. Speech
and music seem to be only a portion of the sounds that we hear. There are also
non-musical and nonlinguistic sounds [275], which carry affective information in
the audio environment around a listener [276].
In the literature there are studies dealing with the relationship among physi-
ological signals and pleasant and unpleasant sounds or music-induced emotions.
PET and fMRI studies on emotions evoked by auditory stimuli have found that
pleasant sounds lead the activation of brain areas such as the orbitofrontal cortex
and the anterior insula [277]. Sad music or unpleasant noises instead lead to the
activation of regions involved in negative emotional states and anxiety-related,
such as the hippocampus, the amygdala and the areas of the medial temporal
lobe [278, 279]. All this brain areas are directly involved in the control of the
human affective system. Other studies take into account EEG and ANS dynamics
[192, 240–242, 280, 281], also to automatically recognize four types of music-
induced emotions [192]. In general, it is well known that there are specific peripheral
activation patterns associated with the emotional valence of sounds. These changes
consist of a larger heart rate deceleration in response to unpleasant stimuli and
higher electrodermal reactions in response to emotionally valenced stimuli (pleasant
or unpleasant) compared to neutral stimuli [92]. However, a characterization of
the affective state in terms of arousal and valence using EDA signal had not been
performed yet.
Also in this computational study, the proposal is to automatically recognize
arousal and valence levels of the standardized affective acoustic stimuli, gathered
from the IADS dataset, using only the EDA analysis. IADS sounds have been
64 5 Experimental Applications on Multi-Sensory Affective Stimulation
already used in the literature (often jointly with IAPS picture) showing changes
in ANS dynamic [282, 283] and a relationship with EDA variations [284, 285].
In order to perform this research we designed a new experiment where IADS
sounds were administered to a group of young participants. During the experimental
sessions, EDA signal was continuously monitored. In this application, we demon-
strated that the convex optimization-based EDA model is suitable for affective
computing on IADS elicitation. After a monovariate and multivariate analysis,
CvxEDA, in fact, allows a better discrimination than CDA.
Several features were computed from the tonic and phasic components as outputs
of both the cvxEDA and CDA models. The mean value of the tonic component
was calculated in order to estimate the general psychophysiological status of the
5.4 Affective Sound Elicitation 65
AROUSAL VALENCE
10 2
(Pos)
0
5
(Neg)
-2
REST (N) (N) (N)
0
L M H
(AR1) (AR2) (AR3)
Fig. 5.3 Timeline of the experimental protocol in terms of arousal and valence levels. The vertical
axis relates to the IADS score, whereas the horizontal axis relates to the time. The neutral sessions,
which are marked with blue lines, alternate with the arousal ones, which are marked with red
staircases. Along the time, the red line follows the four arousal sessions having increasing intensity
of activation. The dotted green line indicates the valence levels within an arousing session. The
neutral sessions are characterized by lowest arousal and medium valence scores. Yellow line relates
to the resting state
Table 5.6 List of features extracted from EDA phasic and tonic compo-
nents
Feature Description
Npeak Number of significant SMNA SCR wrw
AUC Area under curve of SMNA signal wrw (S s)
Peak Maximum amplitude of significant peaks of
SMNA signal wrw3 (S)
Stdphasic Standard deviation of SMNA signal wrw (S)
MeanTonic Mean value of the tonic component
within session windows (S)
wrw within response window (i.e., 5 s after stimulus)
subjects. The characterization of the stimulus response was achieved from the sparse
phasic signal. Specifically, we calculated the number of the peaks, their maximum
amplitude and the area under curve within a time response window of 5 s after each
stimulus onset (see Table 5.6).
66 5 Experimental Applications on Multi-Sensory Affective Stimulation
For each feature, we statistically compared the two levels of valence, i.e.,
positive and negative, and the three arousal levels. Each statistical comparison was
performed using the nonparametric Wilcoxon tests for paired samples, given the
non-gaussianity of samples (p<0.05 from Shapiro-Wilk test with null hypothesis
of having a Gaussian sample). The multiple pairwise comparisons among the
four arousal levels were corrected with the Bonferroni’s method, multiplying the
p-values by the number of comparisons, i.e., 3. Furthermore, the multivariate pattern
analysis was performed in order to classify different levels of arousal and two levels
of valence (Tables 5.7, 5.8, 5.9, and 5.10).
5.4 Affective Sound Elicitation 67
1.65
2.3
1.6
2.2
mean ± SE rank
mean ± SE rank
2.1 1.55
2 1.5
1.9 1.45
1.8 1.4
1.7 1.35
A1 A2 A3 Pos Neg
Fig. 5.4 Within-subject ranks of the area under curve of the phasic component obtained by
cvxEDA for the three arousal levels (top) and the two valence levels (bottom). The dots mark
the across-subject average rank for each level while the wiskers indicate the standard error. The
three arousal levels (top) and the two valence levels (bottom) were statistical compared by the
Bonferroni-corrected Wilcoxon test that did not show significant differences
All the features extracted from the phasic component with both methods, did not
show any significant differences among the three levels of arousal and the two
levels of valence (as example, Figs. 5.4 and 5.5 plotted the statistical description of
the AUC feature comparisons). Instead, the cvxEDA mean tonic value statistically
discriminates the three arousal and the two valence levels, showing a monotonic
trend, although inverse (Figs. 5.6 and 5.7).
Concerning the arousal levels, the multivariate pattern recognition analysis showed a
strong difference between the accuracy of the cvxEDA feature (77:33 %) (Table 5.8)
set and the CDA feature set (45:33 %) (Table 5.7). In the same way, concerning the
valence levels the convex-optimization approach showed a good performance, dis-
cerning the pleasant and unpleasant sounds with an accuracy of 84 % (Table 5.10),
whereas the CDA achieved only 54:5 % of mean successful recognition (Table 5.9).
68 5 Experimental Applications on Multi-Sensory Affective Stimulation
2.15
1.6
2.1
mean ± SE rank
mean ± SE rank
2.05 1.55
2 1.5
1.95
1.45
1.9
1.4
1.85
A1 A2 A3 Pos Neg
Fig. 5.5 Within-subject ranks of the area under curve of the phasic component obtained by CDA
model for the three arousal levels (top) and the two valence levels (bottom). The dots mark
the across-subject average rank for each level while the wiskers indicate the standard error. The
three arousal levels (top) and the two valence levels (bottom) were statistical compared by the
Bonferroni-corrected Wilcoxon test that did not show significant differences
2.8
1.8
2.6
1.7
2.4
2.2 1.6
mean ± SE rank
mean ± SE rank
2 1.5
1.8
1.4
1.6
1.4 1.3
1.2 1.2
1
1.1
0.8
1
A1 A2 A3 Pos Neg
Fig. 5.6 Within-subject ranks of the mean tonic value of the cvxEDA model for the three arousal
levels (top) and the two valence levels (bottom). The dots mark the across-subject average rank for
each level while the wiskers indicate the standard error. Post-hoc Bonferroni-corrected pair-wise
comparisons of the peak amplitudes found significant differences in the cases indicated by asterisks
( W p < 0:05I W p < 0:01I W p < 0:001)
Affective touch plays an important role in social behavior and interactions [207].
Physiologically, affective haptic perception is mediated by the unmyelinated C
tactile (CT) fibers [208, 209], whose activity is linked to the contralateral primary
and bilateral secondary somatosensory area, as well as contralateral middle and
posterior insula cortex, contralateral posterior parietal cortex [210]. Since such
cortices are known to be responsible for crucial homeostatic functions involving
Autonomic Nervous System (ANS) signaling from the whole body, several previous
studies have demonstrated the correlation between ANS dynamics and affective
elicitation [156].
5.5 Affective Touch Elicitation 69
1.7
2.25
2.2 1.65
2.15 1.6
mean ± SE rank
mean ± SE rank
2.1
1.55
2.05
2 1.5
1.95
1.45
1.9
1.85 1.4
1.8 1.35
1.75
1.3
A1 A2 A3 Pos Neg
Fig. 5.7 Within-subject ranks of the mean tonic value of the CDA model for the three arousal
levels (top) and the two valence levels (bottom). The dots mark the across-subject average rank for
each level while the wiskers indicate the standard error. The three arousal levels (top) and the two
valence levels (bottom) were statistical compared by the Bonferroni-corrected Wilcoxon test that
did not show significant differences
Tactile stimulations were used in several studies where EDA was monitored
and analyzed, specifically in relation with the study of nociceptive pain. EDA
resulted useful in a fast and continuously detection of nociceptive pain with higher
sensitivity and specificity than other available objective methods [286]. Moreover,
tactile stimulation of the preterm newborn infants was found to produce significantly
higher increases in EDA than nociceptive stimulation [287].
Few studies report on the link between EDA and affective haptic stimulation.
A work by Olausson et al. [288] examined the effects on EDA dynamics of the
affective touch applying a tactile stimulation by means of a soft brush to the radial
side of the forearm. The authors performed a comparison between the effects of
pleasant touch on 4 healthy subjects and 2 patients with sensory neuronopathy
syndrome. In both patients with neuropathy, a strong sympathetic response was
reported with a latency significantly longer than in healthy subjects, suggesting a
different response between the two groups. However the small samples did not allow
a robust statistics.
In this research, we reports a case study, which is part of the European
project “WEARHAP” (WEARable HAPtics for humans and robots). During the
experimental sessions, EDA signal was continuously monitored during affective
haptic stimulation. Specifically, we had two main goals: first, to assess the physical
characteristics of the caresses (i.e. force and velocity) in terms of the arousal and
valence scores; second, we aimed to process the EDA in order to extract features
correlated to force and velocity levels of affective touch stimulation and perform an
automatic affective recognition.
In this view, we used a haptic device able to mimic the human caress [247]
(Fig. 5.8). Specifically, a layer of fabric is stretched by two motors, whose position
and velocity determine the force and velocity of the simulated caress. In this way, the
artificial caresses can be controlled and standardized in terms of force and velocity
[247].
70 5 Experimental Applications on Multi-Sensory Affective Stimulation
Previous works have already investigated about the perception of affective touch
as a function of the velocity and force of the stimulus. As an example, a peak
in pleasantness perception was found for stimuli having velocity of 3 cm/s [211].
Of note, this velocity was found to be optimal for the activation of CT fibers
[289]. Moreover, low force of caressing was considered more pleasant [212]. Other
physical parameters, e.g., roughness, were found to affect the affective perception
of the tactile stimulus [290].
In this research the haptic stimuli were reproduced by means of an affective touch
display [247] that simulates the human caress.
The device uses a layer of elastic fabric. After having tested several materials
(including commercial lycra, latex layer, and silicon rubber), we selected the elastic
Superbiflex HN by Mectex S.P.A for its high resistance to traction and the good
elastic behavior exhibited in a large range of elasticity.
More specifically, the extremities of a rectangular-shaped fabric (60 mm
160 mm) are connected by means of screws to two rolls, each of them can be
independently moved by one motor (HITEC digital DC servomotor HS-7954 H
with an input voltage of 7.4 V). The choice of this kind of motors is motivated by the
fact that they provide a good trade-off between velocity and torque, thus enabling
fast changes in the stretching state of the fabric. The motors are also suitably
modified to allow a continuous rotation of the motor shafts, which are connected
directly to the rolls (see Fig. 5.9 for further details). The motor positions and rotation
velocity are controlled by a customized electronic board (PSoC-based electronic
board with RS–485 communication protocol), which reads motor positions by using
two absolute magnetic encoders (12 bit magnetic encoder by Austrian Microsystems
AS5045 with a resolution of 0:0875ı ).
Finally, the system is endowed with a load cell (Micro Load Cell [0–5 kg] with
a resolution of 5 g by Phidgets) placed at the basis of the forearm support to record
5.5 Affective Touch Elicitation 71
Fig. 5.9 The system: exploded draw without the cover. Each component of the system is reported.
The total dimensions of the system can be inscribed in a parallelepiped of dimensions 150 150
80 mm
the normal force exerted by the fabric on the forearm. The device was realized in
ABS plastic material and it is encapsulated in a plastic cover to protect motors.
The system exploded draw is reported in Fig. 5.9.
The system is designed to be wearable, as it can be applied to the forearm of a
subject without discomfort (see also Fig. 5.8). The subject places her/his forearm
on the forearm support (half hollow cylinder, radius 45 mm and length 60 mm, see
Fig. 5.9), under the fabric. When the device is operating, two different phases can
be individuated (see Fig. 5.10):
Calibration Phase: in this phase it is possible to calibrate the force exerted
by the fabric on the subject forearm. There are two modalities of control: when
motor 1 rotates in a counter-clockwise direction and motor 2 rotates in a clockwise
direction, the fabric is increasingly wrapped around the forearm thus pressing it with
an increasing force (as it is reported in Fig. 5.10), that can be recorded by the load
cell. Conversely, when motor 1 starts to rotate in a clockwise direction and motor
2 in a counter-clockwise direction the force exerted by the fabric on the forearm
decreases. In this manner it is possible to modulate the “strength of the caress” by
suitably controlling the positions of the motors. The maximum level of force that
can be applied is 20 N.
Movement Phase: when the desired level of force exerted by the fabric is
achieved and both motors are in the reference positions, they start to coherently
72 5 Experimental Applications on Multi-Sensory Affective Stimulation
Fig. 5.10 The scheme of the working principle of the system. The Movement Phase is displayed
in green while the Calibration Phase is displayed in blue: in the example of the figure, the level
of force exerted by the fabric must be increased to reach the desired one. The radius of each roll
(15 mm) is reported
rotate and the fabric moves forward and backward over the subject forearm,
simulating a caress. The velocity of the caress can be modulated by regulating the
velocity of the motors. More specifically, since the electronic board is endowed with
a built-in motor position controller, we can feed the motors with a sinusoidal input
reference trajectory, whose frequency and amplitude can be set, thus controlling the
velocity and the amplitude of motor rotation, respectively. The maximum angular
displacement of the motors from the reference positions is set to ˙90ı . An entire
control cycle lasts 1 ms and the fastest sinusoidal input (frequency of 5 Hz) can be
followed with a phase delay less than 2 ms.
N
mm
s
0.7 6
0.4 Velocity
Force
2
0.1
Fig. 5.11 Example of protocol scheme of one subject. Velocity and force combination is
randomized
the right forearm horizontally placed on the support of the haptic device. For all
trials, participants wore earplugs in order to prevent any auditory cues. Six different
combinations of stimuli among 2 levels of force (2 N, 6 N) and 3 levels of velocity
(9.4, 37, and 65 mm/s) were used. During the interval between two stimuli, the
motors were stopped and the force was set to 0 N, in this case the fabric was only
lightly in contact with the forearm.
The length of the experiment was divided into 3 consecutive sessions (see
Fig. 5.11):
• resting state session of 3 min;
• automatic caress session of 12 min: the participants are subject to haptic
stimulations by the affective touch display on the right forearm. All the 6
combinations of stimuli (two force and three velocity) are administered;
• final resting state session of 2 min.
Of note, all the six combinations of velocities and forces were suitably random-
ized among subjects, with a pre-stimulus and a post-stimulus interval of 35 s each.
After each post-stimulus, the subjects were asked to assess the stimulus in terms of
arousal and valence scores [291], within a time window of 20 s. The evaluation
of the emotions elicited is performed adopting the simplified version of CMA
[247, 248, 292] that allows taking into account the two main dimensions: valence
and arousal.
74 5 Experimental Applications on Multi-Sensory Affective Stimulation
Table 5.11 List of features extracted from EDA phasic and tonic
components
Phasic feature Description
Npeak Number of significant SCR wrw
AUC Area under curve of SMNA signal wrw (S s)
Peak Maximum amplitude of significant peaks of
SMNA signal wrw6 (S)
Stdphasic Standard deviation of SMNA signal wrw (S
Tonic feature Description
MeanTonic Mean value of the tonic component wrw (S)
MeanTonic difference between
diffTonic post/pre rest sessions (S)
Difference of AUC of spontaneous SCRs
NsAUC between post/pre rest sessions (
S s)
wrw within response window (i.e., 5 s after stimulus)
Once the tonic and phasic components were estimated from both cvxEDA and
CDA approaches, several features could be extracted to investigate the sensitivity
to changes in caressing, along the force and velocity dimensions. Proposed features
in this study are summarized in Table 5.11.
Through these features, the following analyses were performed:
• Event-related phasic analysis, through which EDA was studied within a time
response window of 5 s after the affective stimulus;
• Non specific fluctuation and tonic analysis, comprised of (i) the tonic value,
averaged within the time-response window, and the nonspecific electrodermal
fluctuations, and (ii) differential tonic value between the post- and pre-stimulus
session.
The differences between the two levels of force (i.e., F1 D 2 N and F2 D 6 N) and
among the three velocities (i.e., V1D9.4 mm/s, V2D37 mm/s and V3D65 mm/s)
were studied using Wilcoxon signed-rank tests and Friedman tests, respectively,
due to the non-Gaussianity of samples (as confirmed by a preliminary analysis
performed through Shapiro-Wilk tests). In case of rejection of the Friedman test
null-hypothesis, a post-hoc analysis, using Wilcoxon signed-rank with Bonferroni
correction, was also carried out.
Finally, to test the suitability of the proposed algorithm in embedded computing
systems, computational performance analysis was also performed. This is related to
the execution time of the algorithm (processing time), and the memory usage of the
CDA and cvxEDA models output vectors. Estimates of these metrics were obtained
on the same personal computer, with processor 1.7 GHz Intel Core i7, RAM memory
8 GB 1600 MHz DDR3.
5.5 Affective Touch Elicitation 75
Arousal−Velocity Arousal−Force
p < 0.02 p < 0.001
2.5 1.9
1.8
1.7
Ranks
Ranks
1.6
2
1.5
1.4
1.3
1.5
9.4 37 65 2 6
Velocity (mm/s) Force (N)
Valence−Velocity Valence−Force
p < 1e−06 p < 0.005
1.9
2.6 1.8
2.4 1.7
Ranks
1.6
Ranks
2.2
2 1.5
1.8 1.4
1.6 1.3
1.4
9.4 37 65 2 6
Velocity (mm/s) Force (N)
Fig. 5.12 Multiple comparisons of Arousal (top figures) and Valence (bottom figures) values
correspondent to three levels of velocities (9.4, 37, and 65 mm/s) and two levels of forces (F D 2 N,
F D 4 N)
In this section, results from the statistical analysis performed on features from CDA
and cvxEDA algorithms are reported. Such results are shown considering EDA non-
specific fluctuations and tonic components analysis, as well as event-related phasic
components analysis (Fig. 5.13).
Only one feature, diffTonic, extracted from both the CDA and cvxEDA models
showed a significant difference (p < 0:005, Fig. 5.14) between the two levels
of force while caressing. In particular, the higher diffTonic values, the higher
the caressing force. Note that diffTonic calculated through cvxEDA showed a
higher discerning power, i.e., lower p-value, as compared with CDA modeling (see
Tables 5.12 and 5.13).
Concerning the three levels of caress velocity, among the features extracted using
the CDA model, significant differences (p < 0:05) were found in diffTonic while
discerning V1 vs. V2, and V1 vs. V3, and in diffNSAUC while discerning V1 vs.
V3 and V2 vs. V3 (see Table 5.14 and Figs. 5.16 and 5.17). On the other hand,
using the cvxEDA model, a significant difference in discerning V1 vs. V2 was found
through meanTonic (see Table 5.15 and Fig. 5.16). CDA showed a higher number of
statistical differences among the velocities through the tonic features than cvxEDA,
though a complete discrimination of all of the three levels is not achieved by both.
This analysis revealed no significant differences between the two caressing forces,
being consistent between the CDA and cvxEDA approaches (see Tables 5.12
and 5.13, and Fig. 5.15).
Concerning differences between the three caressing velocities, CDA modeling
showed significant differences (p < 0:002) on the number of significant peaks
(Npeak), and the area under the phasic signal curve (AUC) while discerning V1
vs. V2, and V1 vs. V3. No significant differences were found otherwise, including
V2 vs. V3 (see Table 5.14 and Fig. 5.17). Using the cvxEDA approach, instead,
we found significant differences in all of the pairwise comparisons, with p < 0:005
(see Table 5.15 and Fig. 5.17). Importantly, a monotonically increasing trend among
the three velocities was found for each of the considered features (see Fig. 5.17).
Figure 5.13 shows exemplary EDA phasic responses from CDA and cvxEDA
models, for each combination of caressing force and velocity. Note that, considering
sparse outputs from cvxEDA, amplitude values are consistent with the caressing
velocity level.
5.5 Affective Touch Elicitation 77
[a.u.]
[a.u.]
8
4
6
4 2
F1, V1 2
0 10 20
t [s]
30 40
0
0 10 20
t [s]
30 40
−3
x 10
12 12
10 10
8
8
[a.u.]
[a.u.]
6
6
4
4 2
F1, V2 2
0 2 4 6 8
0
0 2 4 6 8
t [s] t [s]
0.035 15
0.03
0.025 10
[a.u.]
[a.u.]
0.02
0.015 5
0.01
F1, V3 0.005
0 1 2
t [s]
3 4 5
0
0 1 2
t [s]
3 4 5
0.02 6
5
0.015
4
[a.u.]
[a.u.]
0.01 3
2
0.005
1
F2, V1 0
0 10 20
t [s]
30 40
0
0 10 20
t [s]
30 40
0.025 15
0.02
10
0.015
[a.u.]
[a.u.]
0.01
5
0.005
F2, V2 0
0 2 4
t [s]
6 8
0
0 2 4
t [s]
6 8
0.035 15
0.03
10
0.025
[a.u.]
[a.u.]
0.02
5
0.015
F2, V3 0.01
0 1 2
t [s]
3 4 5
0
0 1 2
t [s]
3 4 5
Fig. 5.13 Example of CDA (left) and cvxEDA (right) phasic responses to each combination
of caressing force (F D 2 N, F D 6 N) and velocity (V1D 9.4 mm/s, V2 D 37 mm/s and
V3D 65 mm/s) level, within the stimulus time window a.u. stands for arbitrary unit.
78 5 Experimental Applications on Multi-Sensory Affective Stimulation
meanTonic meanTonic
1.7 1.7
mean ± SE rank
mean ± SE rank
1.6 1.6
1.5 1.5
1.4 1.4
1.3 1.3
F1 F2 F1 F2
diffTonic diffTonic
1.8
mean ± SE rank
mean ± SE rank
1.6 1.6
1.4 1.4
1.2 1.2
F1 F2 F1 F2
diffNSAUC diffNSAUC
mean ± SE rank
mean ± SE rank
1.7
1.6 1.6
1.5 1.5
1.4 1.4
1.3
F1 F2 F1 F2
Fig. 5.14 Within-subject ranks of the tonic feature set obtained from CDA (left) and cvxEDA
(right) models between the two force levels (F1D 2 N, F2D 6 N). Values represent average rank
˙ standard error (SE) across subjects. Asterisks indicate significant differences between velocities:
./p < 0:05; ./p < 0:01; . /p < 0:001
Tables 5.16 and 5.17 show the confusion matrix relative to the classification process
of the three velocity levels. In this case, even if the statistical results showed a
better discrimination for the cvxEDA features set, the CDA features have a better
accuracy (67:7 %). Otherwise, concerning the force level classification, the accuracy
of the cvxEDA (68:75 %) model was more than 10 % compared to CDA (58:33 %)
(Tables 5.18 and 5.19).
Independently form the model used in the analysis, these no satisfactory results
of the classification procedure suggested the idea to perform a classification in
the arousal and valence dimensions without taking into account the physical
characteristics of the stimulus.
The arousal and valence scores assigned by the recruited subjects were grouped
into two main classes (Tables 5.20, 5.21, 5.22 and 5.23). Concerning the arousal
levels, we identified a neutral class (arousal < 2), and an aroused class (arousal > 2).
Concerning the valence levels we divided the stimuli in a pleasant class (valence >
0), and in an unpleasant class (valence < 0), In this case due to the unpaired nature of
the samples, a SVM classifier was applied to the dataset. The confusion matrixes for
both the arousal and valence problems showed a better recognition for the cvxEDA
method. The accuracy of the arousal classification was 75:14 % (Table 5.21) against
63:35 % of the CDA (Table 5.20). Regarding the valence classification, the cvxEDA
reached an accuracy of 77:96 % (Table 5.23) whereas the CDA model showed only
66:11 % (Table 5.22).
80
p-value
Feature V1 V2 V3 V1-V2 V1-V3 V2-V3
Npeak 2.0 ˙ 1.0 3.0 ˙ 1.5 4.0 ˙ 2.0 0:00409 5.36e-07 2.82e-04
Peak 0.256 ˙ 0.256 4.36 ˙ 4.03 7.61 ˙ 6.16 8.55e-06 5.68e-07 0:261
AUC 6.89 ˙ 6.59 32.4 ˙ 24.8 72.6 ˙ 42.0 7.61e-08 1.11e-09 1.34e-05
Stdphasic 0.0356 ˙ 0.0355 0.469 ˙ 0.406 0.752 ˙ 0.577 4.25e-07 2.01e-08 0:0215
MeanTonic 0.527 ˙ 0.471 0.261 ˙ 0.429 0.292 ˙ 0.568 0:0364 0:111 1
diffTonic 0.122 ˙ 0.335 0.0251 ˙ 0.303 0.0417 ˙ 0.221 0:0689 0:0666 1
diffNSAUC 2.1 ˙ 6.61 2.13 ˙ 9.22 5.43 ˙ 9.37 1 1 0:344
Values were averaged among the subjects. Last three columns shows p-values from Wilcoxon non-parametric
tests, with null hypothesis of equal median values between three velocity levels
81
82 5 Experimental Applications on Multi-Sensory Affective Stimulation
Npeak Npeak
1.7
mean ± SE rank
mean ± SE rank
1.6
1.6
1.5 1.5
1.4
1.4
1.3
F1 F2 F1 F2
peak peak
1.7
mean ± SE rank
mean ± SE rank
1.6 1.6
1.5 1.5
1.4 1.4
1.3
F1 F2 F1 F2
AUC AUC
mean ± SE rank
mean ± SE rank
1.7 1.7
1.6 1.6
1.5 1.5
1.4 1.4
1.3 1.3
F1 F2 F1 F2
stdphasic stdphasic
mean ± SE rank
mean ± SE rank
1.6 1.6
1.5 1.5
1.4 1.4
F1 F2 F1 F2
Fig. 5.15 Within-subject ranks of the phasic feature set obtained from CDA (left) and cvxEDA
(right) models between the two force levels (F1=2 N, F2=6 N). Values represent average rank ˙
standard error (SE) across subjects. Asterisks indicate significant differences between velocities:
./p < 0:05; ./p < 0:01; . /p < 0:001
meanTonic meanTonic
mean ± SE rank
mean ± SE rank
2.2 2.2
2 2
1.8
1.8
1.6
V1 V2 V3 V1 V2 V3
diffTonic diffTonic
2.4
mean ± SE rank
mean ± SE rank
2.2
2.2
2
1.8 2
1.6 1.8
1.4
1.6
V1 V2 V3 V1 V2 V3
diffNSAUC diffNSAUC
mean ± SE rank
mean ± SE rank
2.5 2.2
2 2
1.5 1.8
V1 V2 V3 V1 V2 V3
Fig. 5.16 Within-subject ranks of the tonic feature set obtained from CDA (left) and cvxEDA
(right) models between the three velocity levels (V1D9.4 mm/s, V2D37 mm/s and V3D65 mm/s).
Values represent average rank ˙ standard error (SE) across subjects. Asterisks indicate significant
differences between velocities: ./p < 0:05; ./p < 0:01; . /p < 0:001
than 1300 s, CDA processing time tends to grow superlinearly. Concerning memory
usage, a significant difference was found between the two models. As expected,
given to the intrinsic sparse nature of cvxEDA phasic components, lower storage
values were associated with the cvxEDA model.
Emotional experience and hedonic judgment are principal aspects of the olfactory
sense [293]. The popular belief reports the ability of fragrances to affect emotional
states. In fact, odor perception can influence our daily life in many ways such as
by modulating our behavior, our autonomic nervous system parameters, and our
cerebral activity [294–296].
84 5 Experimental Applications on Multi-Sensory Affective Stimulation
Npeak Npeak
2.4
mean ± SE rank
mean ± SE rank
2.5
2.2
2 2
1.8
1.6 1.5
1.4
1.2 1
V1 V2 V3 V1 V2 V3
peak peak
2.4
mean ± SE rank
mean ± SE rank
2.5
2.2
2
2
1.8 1.5
1.6 1
V1 V2 V3 V1 V2 V3
AUC AUC
mean ± SE rank
mean ± SE rank
2.5 2.5
2 2
1.5 1.5
1 1
V1 V2 V3 V1 V2 V3
stdphasic stdphasic
2.4
mean ± SE rank
mean ± SE rank
2.5
2.2
2
2
1.5
1.8 1
V1 V2 V3 V1 V2 V3
Fig. 5.17 Within-subject ranks of the phasic feature set obtained from CDA (left) and cvxEDA
(right) models between the three velocity levels (V1D9.4 mm/s, V2D37 mm/s and V3D65 mm/s).
Values represent average rank ˙ standard error (SE) across subjects. Asterisks indicate significant
differences between velocities: ./p < 0:05; ./p < 0:01; . /p < 0:001
The brain limbic areas activated by a hedonic olfactory stimulus are also known
to be responsible for crucial homeostatic functions of the whole body involving
the autonomic nervous system (ANS) activity. More specifically, prior state of
the art reports a strong correlation between ANS dynamics, as estimated through
electrodermal activity (EDA) processing, and affective elicitations [57, 156].
The ANS controls heart rate variability (HRV) and EDA [135, 160, 299], which
are modulated by the perception of an odorant [250, 300–302].
In particular, EDA was found to be associated with the odorant concentration:
weak concentrations of odorants evoked lower EDA response than higher con-
centrations [303]. The correlation between odor intensity, arousal, hedonic tone
86 5 Experimental Applications on Multi-Sensory Affective Stimulation
Table 5.24 Median ˙ MAD intervals for CDA and cvxEDA perfor-
mances
CDA cvxEDA p-value
Processing time [s] 8.171 ˙ 1.239 7.038 ˙ 1.657 0.550
Memory [Kb] 359.004 ˙ 2.868 136.65 ˙ 95.384 2.163e-4
p-values are gathered from the Mann-Whitney non-parametric tests with
null hypothesis of equal medians between models.
Values were calculated for each recording, and averaged among the
subjects
The bold value represents the significant statistical value
and familiarity has been already addressed in the literature. Henion et al. [304]
considered the intensity and hedonic tone as a single feature, whereas other authors
did not shared this idea [305–307]. Other studies showed that the EDA signal
could be modulated by odor intensity, valence, arousal or familiarity [250, 308].
Brauchli et al. also showed that the mean tonic value varied according to the
smell pleasantness, but not to arousal [302]. Findings concerning HRV are similar:
generally, unpleasant smells evoke an increase of the mean HRV value, and
viceversa [295, 302, 308].
Several papers reported gender differences in the emotion area [309–313]. There-
fore, gender differences should be taken into account when emotional paradigms
are used [314]. However, possible gender differences in physiological responses
to odorants have been rarely studied. In childhood, gender differences were found
only in response to unpleasant odors [315]. Yousem et al. [316] examined the gender
effects on odor-stimulated fMRI and evidenced a greater fMRI activation in women
than in men.
Considering the evidences of the relationship among olfactory emotion stimuli
and ANS, we analyzed the effect of olfactory stimulation on EDA. In addition to
the automatic pattern recognition system to classify the valence level of affective
olfactory stimuli, and the gender effect was investigated.
Considering these evidences of the relationship among olfactory emotion stimuli
and ANS, we studied the characterization of the physiological response to olfactory
affective stimuli in terms of arousal and valence analyzing the EDA variations.
88 5 Experimental Applications on Multi-Sensory Affective Stimulation
20
CvxEDA
18
CDA
16
14
Processing Time [s]
12
10
0
0 200 400 600 800 1000 1200 1400 1600 1800
Time Window lenght [s]
Fig. 5.18 Processing time of the CDA and cvxEDA algorithms at different length of signal input,
with sampling time of 60 s
In this experiment we used five different smells, which were synthesized in the
laboratory. These odorants were selected due to their different hedonic tone and
their safety for the panel members [317].
• O1 D Vanillin (C8 H8 O3 , 152:15 g=mol)
• O2 D Benzaldehyde (C6 H5 CHO, 106:12 g=mol)
• O3 DN-butanol (CH3 CH2 CH2 CH2 OH, 74:12 g=mol)
• O4 D Isovaleric acid ( .CH3/2 CHCH2 COOH, 102:13 g=mol)
• O5 D Butyric acid (CH3 CH2 CH2 CO2 H, 88:11 g=mol)
The first two smells were considered pleasant, the last two unpleasant, a priori.
The N-butanol was considered neutral. The solutions with the specific smell were
obtained by mixing each of the chemical compounds with 500 mL of distilled water.
The final concentration of all solutions was such as to appear isointense.
Thirty-two subjects were enrolled in the experiment. In order to have the most
homogeneous subject sample as possible, we determined the olfactory perception
5.6 Affective Olfactory Elicitation 89
Fig. 5.19 Timeline of the experimental protocol: each stimulus was randomized and between two
resting sessions. After the post-stimulus rest the subject scored the stimulus in terms of arousal and
valence levels
Following the same approach used in the affective touch protocol, we extracted
the features as summarized in Table 5.35. Likewise the affective touch, a statistical
analysis among the five smells were performed on the arousal and valence scores,
and on the EDA phasic and tonic features. We distinguished the event-related phasic
analysis, i.e., EDA was studied within a time window of 5 s correspondent to
the affective stimulus session, and the non specific fluctuation and tonic analysis,
comprising the tonic level comparison and the differential value of the tonic features
between the post- and pre-stimulus resting session.
The differences among the smells were studied using the Friedman tests, and
in case of rejection of the Friedman test null-hypothesis, a post-hoc analysis was
performed by means of a Bonferroni corrected Wilcoxon signed-rank.
AROUSAL VALENCE
3.2 1.5
3.1 1
3
0.5
mean ± SE rank
mean ± SE rank
2.9
0
2.8
−0.5
2.7
−1
2.6
2.5 −1.5
2.4 −2
2.3 −2.5
O1 O2 O3 O4 O5 O1 O2 O3 O4 O5
Fig. 5.20 Multiple comparisons of Arousal (left figure) and Valence (right figure) values corre-
spondent to the five smells. Legend: O1 D Vanillin; O2 D Benzaldehyde; O3 DN-butanol;
O4 D Isovaleric acid; O5 D Butyric acid. Asterisks indicate significant differences between
smells: ./p < 0:05; ./p < 0:01; . /p < 0:001
acid were the unpleasant smells. In conclusion, each smell elicited the same intensity
sensation but confirmed the a priori consideration regarding the valence level.
The subsequent EDA analysis was based on the SAM results in order to find a
relationship with the response to the pleasant and the unpleasant stimuli.
The statistical analysis of all the feature extracted from both CDA and cvxEDA did
not show any significant differences among the smells. Neither features extracted
from the phasic or the tonic components were not able to find any differences in the
relationship between EDA and the five odors. However, considering the multivariate
analysis, we grouped the two positive and two negative smells, without taking
into account the arousal dimension due to the fact that no statistical differences
were found in the SAM analysis. Both method showed the same recognition
performances. CDA and cvxEDA features were able to classify the pleasant and
unpleasant stimulus with an accuracy of 68.75 % (see Tables 5.25 and 5.26).
92 5 Experimental Applications on Multi-Sensory Affective Stimulation
Considering the results obtained in the classification and in the SAM analyses, we
decided to reduce the stimulus set to two odorants. According to the SAM results
and the literature review [317], we selected the smells with the two intermediate
median levels of arousal among five smells of the previous analysis.
O1 : Benzaldehyde C6 H5 CHO, (concentration 106:12 g=mol);
O2 : Isovaleric acid .CH3/2 CHCH2 COOH, (concentration 102:13 g=mol).
The described classification procedure for paired data was applied for valence
recognition of the three following datasets (outputs of the cvxEDA model):
x: the whole subject dataset,
z: the reduced dataset of all the male subjects
w: the reduced dataset of all the female subjects
Considering the three datasets, results from the self-assessment-questionnaire are
shown in Fig. 5.21. No significant differences were found among the arousal scores
in the three datasets (x;y;z). Concerning the valence dimension, we can statistically
discern a pleasant smell and an unpleasant smell in the three groups of participants
(p < 106 ). Both smells elicited the same intensity sensation but confirmed the
a priori consideration regarding their different valence level, both in males and
females.
Results of the classification procedure on the three datasets are shown in the form
of a confusion matrix in Table 5.27. Considering data from all of the subjects (x),
i.e., men and women, cvxEDA features were able to recognize the two valence levels
with an average accuracy of 68.76 %. Given the poor classification performance, we
hypothesized that gender could have significantly affected the system accuracy, and
then split the dataset into two sub-sets according to gender. With the male dataset
(z), recognition accuracy was still poor 62.5 %, whereas an accuracy of 78.13 % was
achieved with data from females (w).
5.7 Assessment of Mood States in Bipolar Patients Using EDA 93
AROUSAL
AROUSAL AROUSAL
5
4 4
3.5 3.5
4
Mean ± STD
Mean ± STD
Mean ± STD
3 3
2.5 2.5 3
2 2
2
1.5 1.5
1 1 1
O1 O2 O1 O2 O1 O2
VALENCE VALENCE VALENCE
3 3 2
2 2 1
Mean ± STD
Mean ± STD
Mean ± STD
1
1 0
0
0 −1
−1
−2 −1 −2
−3 −2 −3
O1 O2 O1 O2 O1 O2
Fig. 5.21 Comparisons of Arousal (top row) and Valence (bottom row) values correspondent to the
two smells for the three datasets: whole subjects (column 1), male subjects (column 2) and female
subjects (column 3). Asterisks indicate statistical significant differences between Benzaldehyde
and Isovaleric acid (O1, O2)
The results gathered from the multi-sensory applications suggested the possibility
to investigate psychiatric pathologies, which involve emotion disorders by means
of EDA analysis. In this research, we study the bipolar disorder. Bipolar patients
are characterized by a pathological unpredictable behavior, resulting in fluctuations
between states of depression and episodes of mania or hypomania. In the current
clinical practice, the psychiatric diagnosis is made through clinician-administered
rating scales and questionnaires, disregarding the potential contribution provided
94 5 Experimental Applications on Multi-Sensory Affective Stimulation
by physiological signs. The aim of this research was to investigate how changes in
the autonomic nervous system activity can be correlated with clinical mood swings.
More specifically, a group of ten bipolar patients underwent an emotional elicitation
protocol to investigate the autonomic nervous system dynamics, through the EDA,
among different mood states. Physiological signals were analyzed by applying both
the CDA and the cvxEDA methods to decompose EDA into the tonic and phasic
components, from which several significant features were extracted to quantify the
sympathetic activation.
Ten patients affected by bipolar disorder I or II were selected for this study. None of
them had suicidal tendencies, delusions or hallucinations. Patients were admitted to
the psychiatric unit of the hospital and periodically screened through a psychiatric
interview. Before each acquisition a mood label among “euthymic”, “depressed”,
“maniac” and “mixed-state” was associated to each patient/acquisition. As a control
group, a group healthy subjects were enrolled and participate to the study. In
particular, ten healthy subjects (5 females, age ranged from 20 to 32), i.e. not
suffering from both cardiovascular and evident mental pathologies, was asked to
fill out the Patient Health QuestionnaireTM (PHQ). All participants showed score
lower than 5. Such a cut-off value was chosen in order to avoid the presence of
either middle or severe personality disorders [318].
An ad-hoc affective elicitation experimental was administered to both the healthy
and bipolar patients group. In particular, such an experimental protocol, graphically
shown in Fig. 5.22, was structured as follows:
• 5-min at rest with closed eyes;
• 5-min at rest with open eyes;
• 6-min slideshow of IAPS pictures with high arousal and negative valence;
• up to 4 min of pictures gathered from TAT.
Fig. 5.23 Example of EDA signal and related components during euthymic state, extracted
through deconvolutive method of analysis. On the top panel, the black signal representing the raw
EDA signal along with the DRIVERtonic (red) are shown. On the lower panel, the DRIVERphasic is
shown. Rest phases lasted for the first 600 s. Afterwards, IAPS and TAT emotional stimulation is
performed
As described above, the protocol is split into two sessions: rest and emotional
elicitation. The latter session is divided, in turn, into two stages, both of which are
intended to elicit a variation of the ANS response. Specifically, IAPS pictures lasted
for 2 s presenting negative emotional contents (high arousal and negative valence).
The same IAPS pictures were presented to all patients and healthy subjects and
nobody was asked to score the elicited level of arousal and valence. The images
were chosen according to the following characteristics: arousal score > 6:7; valence
< 4:5. Afterwards, patients were invited to tell a story based on the input coming
from the TAT pictures. However, in order to avoid biased results related to the IAPS
and TAT sequential order, IAPS-TAT and TAT-IAPS session order was randomly
interchanged. The hypothesis of this study is that the ANS differentially reacts to
such emotional stimuli upon different pathological mood states. During the whole
duration of the protocol, the EDA signal was acquired using the BIOPAC MP150
system with a sampling frequency of 1000 Hz (Fig. 5.23). EDA sensors were placed
on the distal phalanx of the second and third finger of the non-dominant hand,
imposing a DC voltage of 0.5 V. The protocol was run for a follow-up period up
to 75 days. Patients repeated the protocol at each mood change, whereas healthy
subjects repeated the experiment twice within 2 weeks in order to investigate
96 5 Experimental Applications on Multi-Sensory Affective Stimulation
Skin conductance data related to the TAT sessions was excluded from the analysis
since patients’ voice could affect the EDA acquisition. Therefore we considered
only the IAPS stimulation.
Both cvxEDA and CDA model were applied to each EDA time series and several
features were extracted from both phasic and tonic components.
In Table 5.29 the whole set of features is reported along with a corresponding
description. Each feature was normalized by subtracting its correspondent value at
rest.
5.7 Assessment of Mood States in Bipolar Patients Using EDA 97
For each of the seven subjects (i.e. Pz01, Pz02, Pz04, Pz07, Pz08, Pz09 and Pz10)
who changed their mood state and performed the experiment twice, the feature-sets
extracted from the two different acquisitions were compared by using a Wilcoxon
test for paired data [319] (i.e., intra-subject statistical analysis).
Moreover, an inter-subject analysis was performed. For each features, all values
associated to the same mood label were grouped. The three different groups corre-
spondent to the three mood states (i.e. depression, mixed-state and euthymia) were
compared by means of a Kruskal-Wallis test to evaluate whether they statistically
belonged to the same population. In case of rejection of the null hypothesis a Mann-
Whitney post-hoc analysis [320] with Bonferroni adjustment was carried out.
Of note, the statistical inference analysis was performed by means of non-
parametric tests due to the non-gaussianity of the samples (p < 0:05 given by
Kolmogorov-Smirnov test with null hypothesis of Gaussian distributed samples).
In this section, the experimental results performed on both groups of bipolar patients
and controls (i.e., healthy subjects) are shown in detail. Further statistical analyses
pointing out differences between phasic and tonic features, for each EDA model and
for each acquisition, as well as results on intra- and inter-subject evaluations follow
below (see Tables 5.30 and 5.31).
Results on Bipolar Group A summary of the clinical evaluations of the patients
recruited for this study, expressed as mood labels, is shown in Table 5.28.
Concerning the CDA results, for each acquisition, we found significant dif-
ferences (p < 0:03) for all of the considered EDA features but the STD-Tonic.
98 5 Experimental Applications on Multi-Sensory Affective Stimulation
Table 5.30 Results from the bipolar patients dataset expressed as statistical significance
for each EDA feature extracted from the CDA model outputs
IAPS Pz01 Pz02 Pz04 Pz07 Pz08 Pz09 Pz10
MAX-Tonic < 106 > 0:05 > 0:05 < 106 < 106 < 106 < 106
MAX-Phasic < 104 < 106 < 106 < 106 < 106 < 106 < 106
AUC-Tonic < 106 > 0:05 > 0:05 < 106 < 106 < 106 < 106
AUC-Phasic < 104 < 106 < 106 < 106 < 106 < 0:05 < 106
Mean-Tonic < 105 > 0:05 > 0:05 < 106 < 106 < 106 < 106
Mean-Phasic < 104 < 106 < 106 < 106 < 106 < 0:05 < 106
STD-Tonic < 0:05 > 0:05 > 0:05 > 0:05 > 0:05 < 0:005 < 104
STD-Phasic < 104 < 106 < 104 < 106 < 106 < 106 < 106
p-values are from the Wilcoxon test
Samples are estimated during IAPS elicitation sessions of the two acquisition/mood states
The bold values represent the significant statistical values
Table 5.31 Results from the bipolar patients dataset expressed as statistical significance
for each EDA feature extracted from the CvxEDA model outputs
IAPS Pz01 Pz02 Pz04 Pz07 Pz08 Pz09 Pz10
MAX-Tonic > 0:05 > 0:05 > 0:05 > 0:05 > 0:05 > 0:05 > 0:05
MAX-Phasic < 106 < 106 < 106 < 106 < 106 < 106 < 106
AUC-Tonic > 0:05 > 0:05 > 0:05 > 0:05 > 0:05 > 0:05 > 0:05
AUC-Phasic < 106 < 106 < 106 < 106 < 106 < 103 < 106
Mean-Tonic > 0:05 > 0:05 > 0:05 > 0:05 > 0:05 > 0:05 > 0:05
Mean-Phasic < 106 < 106 < 106 < 106 < 106 < 106 < 106
STD-Tonic < 103 < 103 < 106 < 103 < 103 < 0:01 < 103
STD-Phasic < 106 < 106 < 106 < 106 < 106 < 106 < 106
p-values are from the Wilcoxon test
Samples are estimated during IAPS elicitation sessions of the two acquisition/mood states
The bold values represent the significant statistical values
CvxEDA algorithm showed similar results about phasic components, instead the
tonic features were not significant except for the STD-Tonic (Tables 5.31 and 5.32).
Results Using CDA Model Wilcoxon test for paired data was applied on patients
with two acquisitions, i.e Pz01, Pz02, Pz04, Pz07, Pz08, Pz09 and Pz10. Statistical
analysis results show that all the phasic features resulted to be statistically different
for all subjects. Patients Pz02, Pz04 showed a non-significant tonic features set
between the two acquisitions. More in detail, patients Pz01, Pz07, Pz08, Pz09
and Pz10 exhibited significant increase in the mean value, in the area under the
curve and in the maximum value of both DRIVERphasic and DRIVERtonic components
during second acquisition (see an example in Fig. 5.24). Pz02 showed no statistical
difference in tonic features, but an increasing significant trend of the phasic features
was found. As all of five patients clinically improved (i.e. change into an euthymic
state) their status, this results could be due to an increased sympathetic activity
during the emotional stimulation session [227]. On the contrary, Pz04 showed a
5.7 Assessment of Mood States in Bipolar Patients Using EDA 99
1
acquisitions
40 50 60 70 80 90 100 Ranks110
(b) AUC TONIC DRIVER
1
acquisitions
5 10 15 20 25 Ranks 30
Fig. 5.24 Pz01’s statistical analysis for IAPS elicitation. Results of Pz01’s CDA-AUC of
DRIVERphasic (a) and DRIVERtonic (b) features
significant decrease for all phasic features in the second acquisition as compared to
the first one, whereas tonic features were not statistically different (see an example
in Fig. 5.25). Yet this result can be interpreted as a reduction of sympathetic activity
when moving from a mixed state, where hypomanic symptoms could be present,
to an euthymic condition [227]. The standard deviation of both DRIVERtonic and
DRIVERphasic components showed similar trend between the two acquisitions for all
of the seven patients having two observations. In particular, STD-Tonic and STD-
Phasic decreased in the second acquisition, i.e. euthymic state.
Furthermore, an inter-subject statistical analysis was performed including also
the patients with one acquisition only. Data were not considered as coming from
100 5 Experimental Applications on Multi-Sensory Affective Stimulation
1
acquisitions
30 40 50 60 70 80 90 100Ranks110
AUC TONIC DRIVER
(b)
1
acquisitions
10 12 14 16 18 20 22 Ranks 24
Fig. 5.25 Pz04’s statistical analysis for IAPS elicitation. Results of Pz04’s CDA-AUC of
DRIVERphasic (a) and DRIVERtonic (b) features
Depressed
Mixed−State
Euthymic
Depressed
Mixed−State
Euthymic
Fig. 5.26 IAPS stimulation: inter-subject statistical analysis. CDA-AUC of DRIVERphasic (a) and
DRIVERtonic (b) features
(see example in Fig. 5.29b). In the phasic AUC features results confirmed the CDA
considerations, but with an increased level of the statistical significance (i.e., a
lower p-value), especially in the comparisons between depressed or mixed-state and
euthymic state (see Fig. 5.29a). A different contribution of the cvxEDA model was
shown for the inter-subject statistical analysis of the maximum value of the phasic
component. In this case, the features was able to distinguish the three mood states
and not only the depressed (p < 103 ) from the group mixed-state plus euthymic
state.
Results on Healthy Controls We performed statistical analyses based on the
Wilcoxon test for paired samples to investigate whether differences on the EDA
feature patterns of healthy subjects are statistically significant between multiple
affective elicitation protocols over time. Likewise the analysis performed on the
bipolar patients group, features reported in Table 5.35 were extracted form both
the phasic and tonic series. We report that the inter-subject statistical analysis
independently performed considering data from IAPS and TAT sessions showed
no statistically significant differences between the two acquisitions on each of the
considered EDA features (p > 0:05).
Classification Results In order to verify whether the proposed methodologies were
able to recognize changes in the ANS dynamical patterns associated to different
mood states, the capability of the k-NN algorithm for solving the 3-class inter-
subject pattern recognition problem was also tested. The extracted features were
grouped in three sets. Specifically, we defined the feature set ˛ as the set extracted
from driver tonic exclusively; feature set ˇ as the set extracted from driver phasic
102 5 Experimental Applications on Multi-Sensory Affective Stimulation
(a)
1
acquisitions
(b)
1
acquisitions
Fig. 5.27 Pz01’s statistical analysis for IAPS elicitation. Results of Pz01’s CvxEDA-AUC of
phasic (a) and tonic (b) features
(a)
1
acquisitions
(b)
1
acquisitions
Fig. 5.28 Pz04’s statistical analysis for IAPS elicitation. Results of Pz04’s CvxEDA-AUC phasic
(a) and tonic (b) features
5.8 Changing Source Oscillations of Skin Admittance: A Study in the. . . 103
(a)
Depressed
Mixed−State
Euthymic
1600 1700 1800 1900 2000 2100 2200 2300 2400 2500 2600
(b)
Depressed
Mixed−State
Euthymic
Fig. 5.29 IAPS stimulation: inter-subject statistical analysis. CvxEDA-AUC of phasic (a) and
tonic (b) features
exclusively; and feature set as the set obtained as union of ˛ and ˇ sets. The
classification procedure was performed using all the three feature sets, in order to
assess the contribution of the tonic and phasic components of the EDA. Results are
expressed in form of confusion matrices. Results achieved using the feature set
coming from the CDA analysis are very satisfactory and reported in Tables 5.33
and 5.34. Indeed, confirming the evidences on the inter-subject statistical analysis,
when using both tonic and phasic information the system is able to achieve
accuracies always greater than 80 %.
The cvxEDA model confirmed an accuracy over the 80 % with the whole dataset,
but increased the accuracy of the pattern recognition with both the phasic and tonic
dataset taken separately.
There are two main methods for measuring EDA: endosomatic (internal electrical
source) and exosomatic (external electrical source). Even though the exosomatic
approach is the most widely used, differences between alternating current (AC)
and direct current (DC) methods and their implication in the emotional assessment
field have not yet been deeply investigated. This section aims at investigating how
104 5 Experimental Applications on Multi-Sensory Affective Stimulation
Forty healthy subjects were enrolled in the experiment, aged 26˙ 4 (18 females).
All subjects gave written informed consent before taking part in the study, which
was approved by the local Ethics Committee. The experiment was designed as
following:
5.8 Changing Source Oscillations of Skin Admittance: A Study in the. . . 105
For each dataset, the cvxEDA model was applied to each time series. Concerning
the IRF parameters considered for this study, values of 1 D 0:7 s, 0 D 0:7 s, ˛ D
0:4 and D 0:01 were employed throughout this analysis, according to previous
exploratory tests on separate data.
In the respiratory stimulation dataset, the presence of an estimated burst of
SMNA activity was verified in each 5-s time window following a stimulus onset,
in order to prove the model’s ability to correctly detect partially overlapped phasic
responses.
As summarized in Table 5.35, we segmented each signal in correspondence to
each IAPS image time window and we extracted several features from both the
tonic and phasic component:
The feature set, extracted from each single IAPS image, was used as input of a
pattern recognition algorithm in order to classify the two arousal levels, according
to the IAPS rates. The supervised classification of the feature set was implemented
following a Leave-One-Subject-Out procedure (LOSO) applied to a KNN-based
classifier. For each of the N iterations (where N is the total number of participants)
the whole dataset was split into a training set including .N 1/ subjects and
a test set including the cvxEDA feature values of the remaining subject Nth.
Moreover for each iteration of the LOSO scheme, a feature selection procedure was
performed in order to identify the combination of parameters that resulted in the
highest recognition accuracy within the training set examples. Each selected feature
constituted a single dimension of the feature space. The LOSO pattern recognition
procedure is illustrated in Fig. 5.30.
Table 5.35 List of features extracted from EDA phasic and tonic compo-
nents
Feature Description
Npeak Number of significant SMNA peaks wrw
AUC Area under curve of reconstructed phasic signal wrw (S s)
Peak Maximum amplitude of significant peaks of
SMNA signal wrw9 (S)
MeanTonic Mean value of the tonic component
within each image time window (S)
wrw within response window (i.e., 5 s after stimulus onset)
5.8 Changing Source Oscillations of Skin Admittance: A Study in the. . . 107
Decomposition
N-1 Selection
classification
Fig. 5.30 Overall block scheme of the proposed valence recognition system. The EDA is
processed in order to extract the phasic and tonic components using the cvxEDA algorithm.
According to the protocol timeline, several features are extracted. The KNN classifier is engaged
to perform the pattern recognition by adopting a leave-one-subject-out procedure
As we expected from the specifics of the cvxEDA model, all EDA data (Fig. 5.31a)
were decomposed into two signals, a sparse component p and a smooth component
t, that we interpret as the activity of the sudomotor nerve (Fig. 5.31b) and the tonic
level (Fig. 5.31c).
Maximal Expiration Task Results We performed both a visual and a statistical
inspection of time series to verify whether the effectiveness of the experimental
protocol in eliciting phasic responses was confirmed for all different kinds of
acquisition method (DC and AC).
After the application of the cvxEDA model, we considered a time windows of
5 s after the onset of each expiration task, and we looked for peaks of the SMNA
signal (in fact, the phasic response is defined as the part of the signal arises within a
predefined response window of 1–5 s [15, 16]) Of note, due to the stimulus intervals
of about 20 s no overlap between consecutive responses occurred.
Results of an inter-subject analysis showed that cvxEDA was able to correctly
detect the corresponding phasic peak response over 97:5 % of the respiratory stimuli.
Moreover, a visual inspection of the small percentage of cases that were not
correctly identified revealed a very low signal-to-noise ratio of that segments of
signal.
Automatic Arousal Recognition Results Results of the arousal-level-classification-
procedure on the four datasets, namely, DC, AC 10 Hz, AC 100 Hz, AC 1 kHz, are
shown in Tables 5.36, 5.37, 5.38, and 5.39. The recognition accuracy is reported in
the form of a confusion matrix. An element rij of the confusion matrix indicates a
percentage of mismatches, i.e. how many times a pattern belonging to class i was
erroneously classified as belonging to class j. Terms rij on the main diagonal of the
confusion matrix correspond to correct classifications.
108 5 Experimental Applications on Multi-Sensory Affective Stimulation
1.5
0 50 100 150 200 250 300
time [s]
(b) SMNA
30
20
[a.u.]
10
0
0 50 100 150 200 250 300
time [s]
(c) Tonic
2.4
2.2
[a.u.]
1.8
1.6
0 50 100 150 200 250 300
time [s]
Fig. 5.31 Application of the cvxEDA decomposition procedure to the EDA signal recorded for a
representative subject. (a) Raw EDA signal, Z-score normalized. (b) Estimated sparse phasic driver
component p. (c) Estimated slow tonic component t
Both DC and AC measures did not show very high average recognition accuracy.
However, it is worthwhile noting that using 100 Hz of frequency current source,
we obtain an average accuracy significantly higher than in the other cases. More
specifically, using DC, 10 Hz and 1 kHz, the average accuracy was in the range
of 62.5–63.34 %, whereas at 100 Hz the pattern recognition system showed an
accuracy of 71.67 %.
5.8 Changing Source Oscillations of Skin Admittance: A Study in the. . . 109
This book has unveiled the strong relationship between Electrodermal Activity
(EDA) signal and autonomic nervous system (ANS) dynamics, and how EDA
could be source of reliable and effective markers for the characterization of the
physiological response to different emotional stimuli and for the automatic affective
and mood state recognition.
In the literature, many studies have demonstrated the link between EDA and
ANS. It is worthwhile mentioning that until the early 1990s, most of the analyses
of the EDA relied, almost exclusively, on heuristic methods, such as the visual
inspection. In the last two decades, a model-based approach has emerged and several
mathematical models have been developed in order to automatize the decomposition
and the processing of the EDA signals as well as the feature extraction stage. In fact,
many studies have examined automatic ways to count spontaneous SCRs, to extract
amplitude or other measures of a single causal SCR, and to deal with motion artifacts
and superposition on the SCRs (there are also publicly- available toolboxes for these
tasks) [16, 66]. Such automatic methods have brought the opportunity to estimate
the unobservable processes (e.g. the sudomotor nerve activity, SMNA) underlying
the EDA phenomena, along with the relationship between SMNA and the sweat
diffusion process. Nevertheless, they still relied, in part, on the use of post- and
pre-processing stages and ad-hoc solutions. Using the model-based approach, the
extracted features have demonstrated to be reliable enough while inferring on the
central nervous system. For example, a recent study proposed a feature that allows
to quantify the sympathetic activity from EDA [85]. This has overcome issues that
are still present in other physiological signals, such as HRV.
Throughout the book, conventional methodologies of EDA processing and
models have been described, also aiming to perform a fair comparison with our
recently proposed EDA processing through convex optimization approach, whose
use has been indeed emphasized in some chapters. Such a modeling approach, called
cvxEDA, was based on maximum a posteriori probability, convex optimization,
and sparsity. The model describes the recorded skin conductance signal as the sum
of three terms: the phasic component, the tonic component, and an additive white
Gaussian noise term incorporating model prediction errors as well as measurement
errors and artifacts. The new algorithm models the IRF (2.9) as an ARMA model
(i.e. an IIR filter) instead of a MA model (i.e. a FIR filter). This allows a much
more compact representation of the IRF by means of two tridiagonal matrices
instead of a banded matrix, thus increasing the accuracy and significantly reducing
the computational cost. In fact, the sparsity and structure of the problem (2.21)
can be effectively exploited by the state-of-the-art sparse-QP solvers. The main
difference between this model other methods in the literature lies in the presence
and definition of the prior probabilities for the phasic and tonic signals. Positiveness
and burstiness of the sudomotor nerve activity driving the phasic component
is modelled through a first order description of spike trains, i.e. assuming a
Poisson distribution approximated by an exponential distribution. This form of
the prior probability translates into a non-negative inequality constraint and an l1 -
norm regularizer in the final optimization problem. Although one could impose
a stronger regularization—e.g. l0 -“norm” [82]—on the phasic driver, this would
render the problem non-convex, i.e. computationally more demanding, and would
significantly deviate from the physiological explanation in terms of Poisson spike
trains. Physiologically-plausible temporal scale and smoothness of the tonic input
signal are achieved by means of an adequate choice of the spacing between the
knots of the spline and through a Gaussian prior on the values at the knots, which
ultimately translates into an l2 regularization of the spline’s coefficients in the
optimization problem. Thanks to the ARMA observation model and to this choice
of priors, physiologically sound constraints on the signals can be imposed to be
estimated and yet be able to obtain the globally optimal solution by solving a
standard quadratic-programming problem. The proposed cvxEDA model shares
some major limitations with most state-of-the-art algorithms, mainly by relying on
the strong assumptions of linearity and time-invariance of the system. In reality,
physiological systems—especially those involving neural dynamics—are likely
to show nonlinear and complex dynamics. Furthermore, such a dynamics and
its statistical properties can be different among subjects and further depend on
environmental and experimental conditions. Within the proposed EDA modeling
framework, inter- and intra-subject variability can be accounted for by choosing
a customized IRF function for each subject/condition. This problem was partially
addressed in our experimental analysis by performing an outer optimization step to
tune the slow time constant of the IRF for each specific subject.
The new algorithm was evaluated in three ways to test its robustness to noise,
its ability to separate and identify each phasic response (even when they overlapped
because of short ISIs) and its capability of properly describing the activity of the
autonomic nervous system in response to strong affective stimulation. The results
of the three analyses confirmed the proprieties of the model. On a synthetic dataset,
the algorithm proved to be robust to different levels of noise. When applied to real
data from a forced maximal expiration protocol, the algorithm demonstrated strong
ability to reliably detect phasic responses to eliciting stimuli, also overcoming the
problem of overlapping SCRs encountered in experimental paradigms involving
6 Conclusions 113
short ISIs. In the affective stimulation paradigm, the mean tonic level estimated by
the model was significantly different in arousal and neutral sessions. Analyzing the
phasic response, we found a consistent statistical relationship between the arousal
levels and the peak amplitude of the estimated phasic driver, thus confirming the
model’s predictive validity. These results were compared to those obtained using
Ledalab implementation of the CDA [1], a method that performs a deterministic
inversion of the peripheral model. The trends found using the CDA confirmed those
obtained from cvxEDA model. However, CDA only found statistically significant
differences between the lightest and the strongest levels of arousal while cvxEDA
model allowed a finer discrimination.
Because it can be implemented in few lines of code and does not depend
on external libraries (except for a conventional QP solver), the new algorithm
has a wide applicability and can be readily integrated in existing open-source
psychophysiological modeling software. Given all these features and the low
computational cost of the proposed algorithm, the cvxEDA model can be employed
in further affective computing applications.
In order to evoke changes in the human affective states in a laboratory setting, we
assembled sets of pictures, sounds, caresses and smells chosen to elicit a range of
aroused, positive, neutral or negative emotions. Valence and arousal were identified
as the principal dimensions of affective response to the environment. In this case,
valence is defined as the degree to which one has favorable feelings towards a
situation, while arousal is defined as the degree to which one feels excited in the
situation. EDA and its ability to characterize the emotion reaction in a multi-sensory
scenarios is studied in four different configurations.
Affective Visual Scenario Concerning the affective visual stimulation, in addition
to the statistical analysis among different levels of arousal, described above, we
have shown a multivariate pattern recognition analysis considering both tonic and
phasic features. More specifically, the set of features were extracted using cvxEDA
and CDA and used as input to a classifier to automatically recognize four arousal
classes and two valence levels.
EDA features allowed achieving a greater sensitivity to changes in the arousal
levels than in the valence levels of the stimulus. Using the cvxEDA approach, results
were very satisfactory and all the four classes of arousal and the two classes of
valence could be discriminated with an acceptable error, i.e. we obtained over 71 %
of successful recognition for the arousal problem and over 68 % for the valence
one. The CDA showed similar percentages in case of valence classification, but its
performance remarkably dropped in the arousal recognition problem. Therefore,
looking at these results, we could assume that the contribution provided by the
convex optimization approach is essential to attain much better results.
Affective Auditory Scenario Concerning the auditory stimulation, we applied the
cvxEDA approach used for the visual scenario to automatically recognize emotions,
as elicited by affective sounds, in young healthy subjects. Even in this case emotions
were expressed in terms of arousal and valence levels according to the circumplex
114 6 Conclusions
Performance of the new cvxEDA technique was compared with the ones
obtained through the CDA model [15]. Comparisons were performed on statistical
significance in discerning affective stimuli along the force and velocity dimensions,
the time of execution of the algorithm, and memory usage.
Concerning the study of the statistical power of the EDA features, experimental
results demonstrated that, along the caressing force dimension, the cvxEDA and
CDA models have the same discriminant power. Only the diffTonic feature, in
fact, was significantly different between the two caressing force levels, with higher
caressing force associated with higher feature values. This means that the higher the
intensity of caressing, the higher the tonic level of EDA after such a cutaneous
stimulus. This is a reasonable behavior, being in line with typical physiological
dynamics associated to EDA [15]. However, it is worthwhile noting that the cvxEDA
provided tonic features with more discriminant power (i.e., lower p-values) than the
CDA approach.
Along the caressing velocity dimension, experimental results demonstrated that
cvxEDA modeling outperforms classical CDA approach. Features from the sparse
phasic components of EDA, in fact, were able to discern all of the differences
between caressing velocity levels. Phasic components estimated from CDA, instead,
were not able to discern between V2 vs. V3, and were always associated with higher
p-values than the cvxEDA ones (see Tables 5.14 and 5.15). Importantly, increasing
monotonic trends among caressing velocities were associated to cvxEDA-related
phasic features. Therefore, it is possible to conclude that cvxEDA modeling
approach provides feature values able to automatically assess caressing stimuli in
a force-velocity space.
We also demonstrated that the cvxEDA approach is particularly suitable for
implementations in embedded computing systems. Computational performance
analysis, in fact, demonstrated that the execution time of the cvxEDA algorithm
linearly increases with the length of the acquisition, whereas processing time of
CDA model tends to grow superlinearly (see Fig. 5.18). This is reasonably due to
the CDA optimization stage [15]. Moreover, taking advantage of the sparse nature of
its phasic components, cvxEDA-derived outputs needed significantly lower storage
values than the CDA model, thus being more suitable for the implementation in
wearable monitoring systems than CDA. Furthermore, it is worthwhile noting that
the cvxEDA approach needs to solve a convex optimization problem, thus always
guaranteeing to find the globally optimal solution. Moreover, it is worth to notice
that the degree of sparsity of the cvxEDA-based phasic components depends on the
number of peaks, i.e., number of stimulus responses, occurring in a given recording.
Therefore, differences in memory storage between the CDA and cvxEDA models
could be minimized even in other experimental protocols involving high frequency
stimuli.
Concerning the pattern recognition analysis, the two models showed opposite
performance with respect to the velocity and the force classification. The CDA
features showed greater accuracy than cvxEDA in the velocity classification,
whereas the cvxEDA parameters performed better accuracy in the classification of
the force levels. We can find an explanation of that, looking at the affective analysis
116 6 Conclusions
of caressing stimuli used in this study. We demonstrated how caressing force and
velocity levels relate to perceived emotional arousal and pleasantness levels of
emotions [157]. Specifically, subjects’ self-ratings revealed that caresses performed
at low force and low velocity are perceived as more pleasant and less arousing than
others [157, 211, 212]. However, while concerning the forces, we could make a
clear distinction between what force level was more pleasant and aroused, regarding
the velocities, the distinction was not so clear cut. As a consequence of these
considerations, we performed a new classification in terms of arousal and valence
scores of the participants. We identified two groups of arousal (i.e., aroused and
neutral stimuli) and valence scores as input of a SVM classifier. CvxEDA showed
a discrimination accuracy over the 75 % (10 % more than CDA) for both the
recognition problems, confirming the good ability in the emotion identification,
defined in terms of valence and arousal.
Findings of this study can be profitably exploited in the field of affective
haptics or, more in general, wearable haptic devices [321]. These systems, in
fact, require processing algorithms with low- computational cost and low-memory
consumption, in order to effectively augment communication, interaction, and
cooperation between human and robots.
Affective Olfactory Scenario The last experimental application concerns the
affective olfactory stimulation. In this study, an automatic valence recognition of
affective olfactory stimuli was performed. Specifically, we studied ANS dynamics
through the analysis of EDA in 32 healthy subjects (16 males). The experimental
protocol foresaw five different smells: Vanillin, Benzaldehyde, N-butanol, Isovaleric
acid, and Butyric acid. All the participants to the experiment had the same olfactory
threshold. Subjective ratings on SAM questionnaires confirmed that the Vanillin and
the Benzaldehyde were perceived as more pleasant, whereas the isovaleric acid and
the Butyric acid were assessed as unpleasant smells. No significant differences were
found among the arousal scores of the five smells.
EDA statistical analysis was carried out firstly studying the general physiological
state analyzing the tonic component (not directly related to the stimulus) [23] during
the resting states before and after the olfactory stimulus. Secondly, we studied
the stimulus-related responses in the time response window correspondent to the
presentation of each smell. Both analyses carried out on features extracted from the
CDA and cvxEDA models did not show any significant results.
Considering the challenging task of discerning valence levels from EDA signal
only, experimental results were quite satisfactory. We obtained an overall recogni-
tion accuracy of 68.75 % using a LOSO procedure on PWR-KNN classifier for both
the cvxEDA and the CDA models. The quite satisfactory classification accuracy
were obtained grouping the two pleasant smells and the two unpleasant smells as
assessed by the subjects.
Although the not statistical significant results, the classification problem suggests
remarkable electrodermal variations occurring during affective olfactory elicitation.
Accordingly, previous studies highlighted strong correlations between the effects of
odorous stimuli on EDA and valence, intensity and familiarity of the smells.
6 Conclusions 117
and starts very early at very limited areas on the electrode surface (e.g., It has
been showed that very weak non-linearity is measurable at voltages than 100 mV).
Hence, it may be difficult to differentiate between the non-linearity of the electrode
processes and the tissue processes [323].
We are aware that works stated that the role of the susceptance is less important
with respect to the conductance at low frequency [40], but our results seem indicate
that a significant difference in EDA results are frequency dependent even more when
they are not mechanical but emotional evoked.
In other words, we assume that it could be feasible that emotional stimuli may
involve a capacitive component in the medium under investigation that has a bigger
contribution at 100 Hz.
Moreover, we should take into account that the Ohm’s law, given by J D E,
in such a medium could be not valid and it may be useful to treat as a complex
quantity in order to incorporate dielectric losses and frequency dependence [324],
therefore defining as: D 0 C j 00 (where j is the imaginary unit).
Future works will investigate the real and imaginary component of the admittance
in the analysis of the EDA dynamics by involving time varying method that could
highlight the nonlinear nature of the electrodermal response.
The applicability of cvxEDA approach is not limited to EDA analysis but can be
extended to other domains requiring the deconvolution of pulse trains from the
output of systems that can be represented as ARMA models, for example in calcium
imaging [79] or hormone secretion analysis [82].
Moreover, to achieve a more complete description of the ANS and CNS activity,
both cvxEDA tonic and phasic outputs could be used to develop multivariate
models. For instance, the estimated SMNA component and EEG signals could be
related to each other via certain joint probabilities in order to assess joint CNS-
ANS dynamics especially related to sympathetic activity during, e.g., emotional
elicitation. Moreover, the tonic component, which can be considered as source of
reliable information on the sympathetic tone [85] (see also Sect. 2.5.2), could be
modeled using bivariate system of equations along with series of heartbeat dynamics
to provide information related to the sympatho-vagal balance.
Envisioned future challenging applications can be related to assistive devices
and rehabilitation, e.g. for patients with severe brain damages, which can be in
one of several states collectively known as Disorders Of Consciousness (DOC).
Indeed, the treatment of these patients is often driven by subjective experience, and
self-intuition of the clinicians. Nevertheless, there is not a standardized approach
to investigate if some perceptual channels (e.g., touch, which was proven to
communicate distinct emotions as discussed in the Introduction) are still active
in DOC. DOC assessment and rehabilitation could benefit from affective elicitation
6.1 Future Challenges 121
and recognition systems, like the ones described in this book. Finally, the cvxEDA
algorithm could be also profitably used in conjunction with wearable/portable
sensing systems (e.g., Empatica tools [325], or like the one in [42, 57]) to assess
mental and physical stress [326], which are key parameters to be monitored during
the course of a rehabilitation intervention.
Bibliography
[1] Benedek, M., & Kaernbach, C. (2010). A continuous measure of phasic electrodermal
activity. Journal of neuroscience methods, 190(1), 80–91.
[2] Matu, S., Cristea, I., Coteţ, C., Valenza, G., Gentili, C., Scilingo, E., et al. (2012). Are
socially anxious individuals less empathic? a psychophysiological investigation of facial
mimicry for emotional expressions. International Journal of Psychophysiology, 85(3), 375–
376.
[3] Cristea, I., Valenza, G., Gentili, C., Tatar, A., Scilingo, E., & David, D. (2012). Cognitive
reappraisal and acceptance distinctively impact heart rate variability in socially anxious
individuals. International Journal of Psychophysiology, 85(3), 339.
[4] Zajonc, R. (1984). On the primacy of affect. American Psychologist, 39(2), 117–123.
[5] Gross, J., & Muñoz, R. (1995). Emotion regulation and mental health. Clinical Psychology:
Science and Practice, 2(2), 151–164.
[6] Lazarus, R., & Averill, J. (1972). Emotion and cognition: With special reference to anxiety.
Anxiety: Current Trends in Theory and Research, 2, 242–284.
[7] Damasio, A. (2000). Descartes’ error: Emotion, reason, and the human brain. New York:
Quill.
[8] Valenza, G., Gentili, C., Lanata, A., & Scilingo, E. (2013). Mood recognition in bipolar
patients through the psyche platform: preliminary evaluations and perspectives. Artificial
Intelligence In Medicine, 57(1), 49–58.
[9] Greco, A., Lanata, A., Valenza, G., Rota, G., Vanello, N., & Scilingo, E. (2012). On
the deconvolution analysis of electrodermal activity in bipolar patients. In 2012 Annual
International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC)
(pp. 6691–6694). IEEE.
[10] Vanello, N., Guidi, A., Gentili, C., Werner, S., Bertschy, G., Valenza, G., et al. (2012).
Speech analysis for mood state characterization in bipolar patients. In 2012 Annual
International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC)
(pp. 2104–2107). IEEE.
[11] Greco, A., Valenza, G., Lanata, A., Rota, G., & Scilingo, E. P. (2014). Electrodermal activity
in bipolar patients during affective elicitation. IEEE Journal of Biomedical and Health
Informatics, 18(6), 1865–1873.
[12] Fowles, D., Christie, M., Edelberg, R., Grings, W., Lykken, D., & Venables, P. (1981).
Publication recommendations for electrodermal measurements, Psychophysiology, 18(3),
232–239.
[13] Edelberg, R. (1972). Electrical activity of the skin: Its measurement and uses in psychophys-
iology. Handbook of psychophysiology (Vol. 12, p. 1011). New York: Holt.
[39] Martínez-Rodrigo, A., Zangróniz, R., Pastor, J. M., & Fernández-Caballero, A. (2015).
Arousal level classification in the ageing adult by measuring electrodermal skin conductivity.
In Ambient intelligence for health (pp. 213–223). New York: Springer.
[40] Martinsen, Ø., Grimnes, S., & Sveen, O. (1997). Dielectric properties of some keratinised
tissues. part 1: Stratum corneum and nail in situ. Medical and Biological Engineering and
Computing, 35(3), 172–176.
[41] Hanson, M. A., Powell, Jr. H. C., Barth, A. T., Ringgenberg, K., Calhoun, B. H., Aylor, J. H.,
et al. (2009). Body area sensor networks: Challenges and opportunities. Computer, 42(1), 58.
[42] Carbonaro, N., Greco, A., Anania, G., Dalle Mura, G., Tognetti, A., Scilingo, E., et al.
(2012). Unobtrusive physiological and gesture wearable acquisition system: A preliminary
study on behavioral and emotional correlations. Global Health, 1, 88–92.
[43] Lee, Y., Lee, B., & Lee, M. (2010). Wearable sensor glove based on conducting fabric using
electrodermal activity and pulse-wave sensors for e-health application. Telemedicine and
e-Health, 16(2), 209–217.
[44] Patel, S., Park, H., Bonato, P., Chan, L., & Rodgers, M. (2012). A review of wearable
sensors and systems with application in rehabilitation. Journal of Neuroengineering and
Rehabilitation, 9(1), 1.
[45] Garbarino, M., Lai, M., Bender, D., Picard, R. W., & Tognetti, S. (2014). Empatica e3—
a wearable wireless multi-sensor device for real-time computerized biofeedback and data
acquisition. In 2014 EAI 4th International Conference on Wireless Mobile Communication
and Healthcare (Mobihealth) (pp. 39–42). IEEE.
[46] Martinsen, O., & Grimnes, S. (1998). On using single frequency electrical measurements for
skin hydration assessment. Innovation et Technologie en Biologie et Médecine, 19, 395–400.
[47] Martinsen, Ø. G., & Grimnes, S. (2001). Facts and myths about electrical measurement of
stratum corneum hydration state. Dermatology, 202(2), 87–89.
[48] Martinsen, Ø. G., Grimnes, S., Nilsen, J. K., Tronstad, C., Jang, W., Kim, H. et al.
(2008). Gravimetric method for in vitro calibration of skin hydration measurements. IEEE
Transactions on Biomedical Engineering, 55(2), 728–732.
[49] Picard, R. (2010). Emotion research by the people, for the people. Emotion Review, 2(3),
250.
[50] Marieke van Doorena, J.J.G. (Gert-Jan) de Vriesa, & Janssena, J. H. (2012). Emotional
sweating across the body: Comparing 16 different skin conductance measurement locations.
Physiology & Behavior, 106(2), 298–304.
[51] Poh, M.-Z., Swenson, N. C., & Picard, R. W. (2010). A wearable sensor for unobtrusive,
long-term assessment of electrodermal activity. IEEE Transactions on Biomedical Engineer-
ing, 57(5), 1243–1252.
[52] Strauss, M., Reynolds, C., Hughes, S., Park, K., McDarby, G., & Picard, R. W. (2005).
The handwave bluetooth skin conductance sensor. In International Conference on Affective
Computing and Intelligent Interaction (pp. 699–706). Springer.
[53] Thought Technology Ltd: ProComp Infinity Encoder and amplifiers. http://
thoughttechnology.com/index.php/hardware/flexcomp-system-with-biograph-infiniti-
software-t7555m.html (2016).
[54] Biopac Systems, Inc. http://www.biopac.com/ (2016).
[55] BioSemi Instrumentation. http://www.biosemi.com/ (2016).
[56] Scheirer, J., & Picard, R. (2001). The Galvactivator: A glove that senses and communicates
skin conductivity. In Proceedings of the 9th International Conference on Human-Computer
Interaction.
[57] Lanata, A., Valenza, G., & Scilingo, E. (2012). A novel EDA glove based on textile-
integrated electrodes for affective computing. Medical and Biological Engineering and
Computing, 50, 1163–1172.
[58] Brainquiry, B. V. (2005) PET-GSR Wireless. http://www.brainquiry.com/.
[59] Analog Device: AD9833 Low Power, Programmable Waveform Generator. http://www.
analog.com/en/products/rf-microwave/direct-digital-synthesis-modulators/ad9833.html
(2016).
126 Bibliography
[82] de Rooi, J., & Eilers, P. (2011). Deconvolution of pulse trains with the L0 penalty. Analytica
Chimica Acta, 705(1), 218–226.
[83] Levinson, D., & Edelberg, R. (1985). Scoring criteria for response latency and habituation
in electrodermal research: a critique. Psychophysiology, 22(4), 417–426.
[84] Ishchenko, A., & Shev’ev, P. (1989). Automated complex for multiparameter analysis of the
galvanic skin response signal. Biomedical Engineering, 23(3), 113–117.
[85] Posada-Quintero, H. F., Florian, J. P., Orjuela-Cañón, A. D., Aljama-Corrales, T.,
Charleston-Villalobos, S., & Chon, K. H. (2016). Power spectral density analysis of elec-
trodermal activity for sympathetic function assessment. Annals of Biomedical Engineering,
44, 1–12.
[86] Grassi, G., & Esler, M. (1999). How to assess sympathetic activity in humans. Journal of
Hypertension, 17(6), 719–734.
[87] Bauer, A., Malik, M., Schmidt, G., Barthel, P., Bonnemeier, H., Cygankiewicz, I., et al.
(2008). Heart rate turbulence: Standards of measurement, physiological interpretation, and
clinical use: International society for Holter and noninvasive electrophysiology consensus.
Journal of the American College of Cardiology, 52(17), 1353–1365.
[88] Crider, A., & Lunn, R. (1971). Electrodermal lability as a personality dimension. Journal of
Experimental Research in Personality, 5(2), 145–150.
[89] Kira, Y., Ogura, T., Aramaki, S., Kubo, T., Hayasida, T., & Hirasawa, Y. (2001). Sympathetic
skin response evoked by respiratory stimulation as a measure of sympathetic function.
Clinical Neurophysiology, 112(5), 861–865.
[90] Lang, P., Bradley, M., & Cuthbert, B. (2005). International affective picture system iaps):
Digitized photographs, instruction manual and affective ratings. Technical Report A-6.
University of Florida.
[91] Cook, E. W., Hawk, L. W., Davis, T. L., & Stevenson, V. E. (1991). Affective individual
differences and startle reflex modulation. Journal of Abnormal Psychology, 100(1), 5.
[92] Lang, P., Greenwald, M., Bradley, M., & Hamm, A. (1993). Looking at pictures: Affective,
facial, visceral, & behavioral reactions. Psychophysiology, 30(3), 261–273.
[93] Hollander, M., Wolfe, D. A., & Chicken, E. (2014). Nonparametric statistical methods. New
York: Wiley.
[94] Frijda, N. (1986). The emotions. Cambridge: Cambridge University Press.
[95] Ekman, P. E., & Davidson, R. J. (1994). The nature of emotion: Fundamental questions.
Oxford: Oxford University Press.
[96] Russell, J., & Carroll, J.M. (1999). On the bipolarity of positive and negative affect.
Psychological Bulletin, 125(1), 3–30.
[97] Watson, D., Wiese, D., Vaidya, J., & Tellegen, A. (1999). The two general activation systems
of affect: Structural findings, evolutionary considerations, and psychobiological evidence.
Journal of Personality and Social Psychology, 76(5), 820–838.
[98] Watson, D., & Clark, L. (1992). On traits and temperament: General and specific factors
of emotional experience and their relation to the five-factor model. Journal of Personality,
60(2), 441–476.
[99] LeDoux, J. (1998). The emotional brain: The mysterious underpinnings of emotional life.
New York: Simon and Schuster.
[100] Pegna, A. J., Khateb, A., Lazeyras, F., & Seghier, M. L. (2005). Discriminating emotional
faces without primary visual cortices involves the right amygdala. Nature Neuroscience,
8(1), 24–25.
[101] Ekman, P. (1993). Facial expression and emotion. American psychologist, 48(4), 384.
[102] Darwin, C. (1872). The expression of the emotions in man and animals; with an introduction,
afterword, and commentaries by Paul Ekman. New York: Oxford University.
[103] Ekman, P. (1974). 1. universal facial expressions of emotion. In R. A. LeVine (Ed.), Culture
and personality: Contemporary readings (pp. 8–15). Chicago: Aldine.
[104] Ekman, P. (1999). Basic emotions. In T. Dalgleish, & T. Power (Eds.), The Handbook of
cognition and emotion, (pp. 45–60). Sussex: Wiley.
128 Bibliography
[105] Tompkins, S. (1962). Affect Imagery Consciousness: Volume I: The Positive Affects.
New York: Springer Publishing Company.
[106] Izard, C. (1971). The face of emotion (Vol. 23). New York: Appleton-Century-Crofts.
[107] Plutchik, R. (1984). Emotions: A general psychoevolutionary theory. Approaches to Emo-
tion, 197–219.
[108] Ekman, P. (1973). Cross-cultural studies of facial expression. In P. Ekman (Ed.), Darwin and
facial expression: A century of research in review, (pp. 169–222). New York: Academic.
[109] Watson. J. (1997) Behaviorism. New Brunswick: Transaction Publication.
[110] Ortony, A., & Turner, T. (1990). What is basic about basic emotions. Psychological Review,
97(3), 315–331.
[111] Wundt, W. (1905). Grundriss der psychologie [Fundamentals of psychology]. (7th rev. ed.).
Liepzig: Engelman.
[112] Schlosberg, H. (1954). Three dimensions of emotion. Psychological Review, 61(2), 81–88.
[113] Osgood, C. (1975). The measurement of meaning. Champaign: University of Illinois Pr.
[114] Davitz, J. (1969). The language of emotion. New York: Academic.
[115] Lang, P., Bradley, M., & Cuthbert, B. (1998). Emotion, motivation, and anxiety: Brain
mechanisms and psychophysiology. Biological Psychiatry, 44(12), 1248–1263.
[116] Panskepp, J. (1998). Affective neuroscience: The foundations of human and animal emo-
tions. Oxford: Oxford University Press.
[117] Breazeal, C. (2003). Emotion and sociable humanoid robots. International Journal of
Human-Computer Studies, 59(1–2), 119–155.
[118] Russell, J. (1980). A circumplex model of affect. Journal of Personality and Social
Psychology, 39(6), 1161–1178.
[119] Russell, J., & Mehrabian, A. (1977). Evidence for a three-factor theory of emotions* 1.
Journal of Research in Personality, 11(3), 273–294.
[120] Arnold, M. B. (1950). An excitatory theory of emotion. In M. L. Reymert (Ed.), Feelings
and emotions (pp. 11–33). New York: McGraw-Hill.
[121] Ortony, A., Clore, G., & Collins, A. (1990). The cognitive structure of emotions. Cambridge:
Cambridge University Press.
[122] Scherer, K., & Ekman, P. (1984). Approaches to emotions. Londres et New Jersey: Lawrence
Erlbaum Associates, Publication.
[123] Lisetti, C., & Gmytrasiewicz, P. (2002). Can a rational agent afford to be affectless? A formal
approach. Applied Artificial Intelligence, 16, 1–33.
[124] Scherer, K., Schorr, A., & Johnstone, T. (2001). Appraisal processes in emotion: Theory,
methods, research. Oxford: Oxford University Press.
[125] Egges, A., Kshirsagar, S., & Magnenat-Thalmann, N. (2003). A model for personality and
emotion simulation. In Knowledge-based intelligent information and engineering systems
(pp. 453–461). Berlin: Springer.
[126] Posner, J., Russell, J., & Peterson, B. (2005). The circumplex model of affect: An
integrative approach to affective neuroscience, cognitive development, and psychopathology.
Development and Psychopathology, 17(03), 715–734.
[127] Nasoz, F., Alvarez, K., Lisetti, C., & Finkelstein, N. (2003). Emotion recognition from phys-
iological signals using wireless sensors for presence technologies. Cognition, Technology
and Work, 6, 4–14.
[128] Janig, W. (1989). Autonomic nervous system. In R. F. Schmidt, & G. Thews (Eds.), Human
physiology, (2nd ed.). Berlin: Springer.
[129] Lewis, M., Haviland-Jones, J. M., & Barrett, L. F. (2010). Handbook of emotions. New York:
Guilford Press.
[130] Picard, R. (2000). Affective computing. Cambridge: MIT.
[131] Picard, R., Vyzas, E., & Healey, J. (2001). Toward machine emotional intelligence: Analysis
of affective physiological state. IEEE Transactions on Pattern Analysis and Machine
Intelligence, 23(10), 1175–1191.
Bibliography 129
[132] Kim, J., & André, E. (2008). Emotion recognition based on physiological changes in music
listening. IEEE Transactions on Pattern Analysis and Machine Intelligence, 30(12), 2067–
2083.
[133] Katsis, C. D., Katertsidis, N. S., & Fotiadis, D. I. (2010). An integrated system based on
physiological signals for the assessment of affective states in patients with anxiety disorders.
Biomedical Signal Processing and Control, 6(3), 261–268.
[134] Valenza, G., Greco, A., Citi, L., Bianchi, M., Barbieri, R., & Scilingo, E. (2016). Inhomoge-
neous point-processes to instantaneously assess affective haptic perception through heartbeat
dynamics information. Scientific Reports, 6(28567), 1–14.
[135] Valenza, G., Greco, A., Gentili, C., Lanata, A., Sebastiani, L., Menicucci, D., et al. (2016).
Combining electroencephalographic activity and instantaneous heart rate for assessing
brain–heart dynamics during visual emotional elicitation in healthy subjects. Philosophical
Transactions of the Royal Society A , 374(2067), 20150176.
[136] Valenza, G., Nardelli, M., Gentili, C., Bertschy, G., Kosel, M., Scilingo, E. P., et al.
(2016). Predicting mood changes in bipolar disorder through heartbeat nonlinear dynamics.
Biomedical and Health Informatics, 20(2), 1034–1043.
[137] Valenza, G., Citi, L., Gentili, C., Lanata, A., Scilingo, E. P., & Barbieri, R. (2014). Point-
process nonlinear autonomic assessment of depressive states in bipolar patients. Methods of
Information in Medicine, 53(4), 296–302.
[138] Valenza, G., Lanatà, A., Scilingo, E. P., & De Rossi, D. (2010). Towards a smart glove:
Arousal recognition based on textile electrodermal response. In 2010 Annual International
Conference of the IEEE Engineering in Medicine and Biology (pp. 3598–3601). IEEE.
[139] Mazzei, D., Greco, A., Lazzeri, N., Zaraki, A., Lanatà, A., Igliozzi, R., et al. (2012). Robotic
social therapy on children with autism: preliminary evaluation through multi-parametric
analysis. In 2012 International Conference on and 2012 International Confernece on Social
Computing (SocialCom) Privacy, Security, Risk and Trust (PASSAT) (pp. 955–960). IEEE.
[140] Betella, A., Zucca, R., Cetnarski, R., Greco, A., Lanatà, A., Mazzei, D., et al. (2015).
Inference of human affective states from psychophysiological measurements extracted under
ecologically valid conditions. Using Neurophysiological Signals that Reflect Cognitive or
Affective State, 8, 66.
[141] Lanatá, A., Valenza, G., Mancuso, C., & Scilingo, E. (2011). Robust multiple cardiac
arrhythmia detection through bispectrum analysis. Expert Systems with Applications, 38(6),
6798–6804.
[142] Valenza, G., & Scilingo, E. P. (2014) Autonomic nervous system dynamics for mood and
emotional-state recognition. Springer.
[143] James, W. (1884). II.—what is an emotion? Mind, os-IX (34), 188–205. doi:10.1093/
mind/os-IX.34.188.
[144] Ellsworth, P. C. (1994). William James and emotion: is a century of fame worth a century of
misunderstanding? Psychological Review, 101(2), 222.
[145] Lang, P. J. (1994). The varieties of emotional experience: A meditation on james-lange
theory. Psychological Review, 101(2), 211.
[146] Scherer, K. R., & Wallbott, H. G. (1994). Evidence for universality and cultural variation
of differential emotion response patterning. Journal of Personality and Social Psychology,
66(2), 310.
[147] Stemmler, G., Heldmann, M., Pauls, C. A., & Scherer, T. (2001). Constraints for emotion
specificity in fear and anger: The context counts. Psychophysiology, 38(2), 275–291.
[148] Ekman, P., Levenson, R., & Friesen, W. (1983). Autonomic nervous system activity
distinguishes among emotions. Science, 221(4616), 1208–1210.
[149] Christie, I., & Friedman, B. (2004). Autonomic specificity of discrete emotion and
dimensions of affective space: A multivariate approach. International Journal of Psy-
chophysiology, 51(2), 143–153.
[150] Cacioppo, J. T., Berntson, G. G., Larsen, J. T., Poehlmann, K. M., Ito, T. A., et al. (2000).
The psychophysiology of emotion. Handbook of Emotions, 2, 173–191.
[151] Arnold, M. B. (1960). Emotion and personality. New York: Columbia University.
130 Bibliography
[152] Hillman, J. (1960). Emotion: a comprehensive phenomenology of theories and their meaning
for therapy. Evanston: Northwestern University Press.
[153] Campos, J. J., & Johnson, H. J. (1967). Affect, verbalization, and directional fractionation
of autonomic responses. Psychophysiology, 3(3), 285–290.
[154] Bradley, M. M., & Lang, P. J. (2000). Measuring emotion: Behavior, feeling, and physiology.
Cognitive Neuroscience of Emotion, 25, 49–59.
[155] Stemmler, G. (2004). Physiological processes during emotion. In P. Philippot & R. S.
Feldman (Eds.), The regulation of emotion (pp. 33–70). Mahwah: Erlbaum.
[156] Calvo, R., & D’Mello, S. (2010). Affect detection: An interdisciplinary review of models,
methods, and their applications. IEEE Transactions on Affective Computing, 1(1), 18–37.
[157] Greco, A., Valenza, G., Nardelli, M., Lanata, A., Bianchi, M., & Scilingo, E. P. (2015). Elec-
trodermal activity analysis during affective haptic elicitation. In 2015 Annual International
Conference of the IEEE Engineering in Medicine and Biology Society (EMBC). IEEE.
[158] Swangnetr, M., & Kaber, D. B. (2012). Emotional state classification in patient–robot
interaction using wavelet analysis and statistics-based feature selection. IEEE Transactions
on Systems, Man and Cybernetics, 43(1), 63–75.
[159] Katsis, C., Katertsidis, N., Ganiatsas, G., & Fotiadis, D. (2008). Toward emotion recognition
in car-racing drivers: A biosignal processing approach. IEEE Transactions on Systems, Man
and Cybernetics, Part A: Systems and Humans, 38(3), 502–512.
[160] Lanatà, A., Valenza, G., Greco, A., Gentili, C., Bartolozzi, R., Bucchi, F., et al. (2015).
How the autonomic nervous system and driving style change with incremental stressing
conditions during simulated driving. IEEE Transactions on Intelligent Transportation
Systems, 16(3), 1505–1517.
[161] Chakraborty, A., Konar, A., Chakraborty, U., & Chatterjee, A. (2009). Emotion recognition
from facial expressions and its control using fuzzy logic. IEEE Transactions on Systems,
Man and Cybernetics, Part A: Systems and Humans, 39(4), 726–743.
[162] Chanel, G., Rebetez, C., Bétrancourt, M., & Pun, T. (2011). Emotion assessment from
physiological signals for adaptation of game difficulty. IEEE Transactions on Systems, Man
and Cybernetics, Part A: Systems and Humans, 41(6), 1052–1063.
[163] Lisetti, C., & Nasoz, F. (2004). Using noninvasive wearable computers to recognize human
emotions from physiological signals. EURASIP Journal on Applied Signal Processing, 2004,
1672–1687.
[164] Haag, A., Goronzy, S., Schaich, P., & Williams, J. (2004). Emotion recognition using bio-
sensors: First steps towards an automatic system. In Tutorial and research workshop on
affective dialogue systems. Lecture Notes in Computer Science (Vol. 3068, pp. 36–48).
Berlin/Heidelberg: Springer.
[165] Kim, K., Bang, S., & Kim, S. (2004). Emotion recognition system using short-term
monitoring of physiological signals. Medical and Biological Engineering and Computing,
42(3), 419–427.
[166] Yoo, S., Lee, C., Park, Y., Kim, N., Lee, B., & Jeong, K. (2005). Neural network
based emotion estimation using heart rate variability and skin resistance. In International
conference on natural computation. Lecture Notes in Computer Science (Vol. 3610,
pp. 818–824). Berlin/Heidelberg: Springer.
[167] Choi, A., & Woo, W. (2005). Physiological sensing and feature extraction for emotion
recognition by exploiting acupuncture spots. In International conference on affective
computing and intelligent interaction. Lecture Notes in Computer Science (Vol. 3784,
pp. 590–597). Berlin/Heidelberg: Springer.
[168] Healey, J., & Picard, R. (2005). Detecting stress during real-world driving tasks using
physiological sensors. IEEE Transactions on Intelligent Transportation Systems, 6(2),
156–166.
[169] Li, L., & Chen, J. (2006). Emotion recognition using physiological signals. In Advances
in artificial reality and tele-existence. Lecture Notes in Computer Science (Vol. 4282,
pp. 437–446). Berlin/Heidelberg: Springer.
Bibliography 131
[170] Rani, P., Liu, C., Sarkar, N., & Vanman, E. (2006). An empirical study of machine
learning techniques for affect recognition in human–robot interaction. Pattern Analysis &
Applications, 9(1), 58–69.
[171] Rainville, P., Bechara, A., Naqvi, N., & Damasio, A. (2006). Basic emotions are associated
with distinct patterns of cardiorespiratory activity. International Journal of Psychophysiol-
ogy, 61(1), 5–18.
[172] Zhai, J., & Barreto, A. (2006). Stress detection in computer users based on digital signal
processing of noninvasive physiological variables. In 28th Annual International Conference
of the IEEE Engineering in Medicine and Biology Society (pp. 1355–1358).
[173] Leon, E., Clarke, G., Callaghan, V., & Sepulveda, F. (2007). A user-independent real-time
emotion recognition system for software agents in domestic environments. Engineering
Applications of Artificial Intelligence, 20(3), 337–345.
[174] Liu, C., Conn, K., Sarkar, N., & Stone, W. (2008). Physiology-based affect recognition for
computer-assisted intervention of children with Autism Spectrum Disorder. International
Journal of Human-Computer Studies, 66(9), 662–677.
[175] Yannakakis, G., & Hallam, J. (2008). Entertainment modeling through physiology in
physical play. International Journal of Human-Computer Studies, 66(10), 741–755.
[176] Gouizi, K., Bereksi Reguig, F., & Maaoui, C. (2011). Emotion recognition from physiolog-
ical signals. Journal of Medical Engineering & Technology, 35(6–7), 300–307.
[177] Valenza, G., Lanatà, A., & Scilingo, E. (2011). The role of nonlinear dynamics in affective
valence and arousal recognition. IEEE Transactions on Affective Computing. doi:10.1109/T-
AFFC.2011.30.
[178] Jang, E.-H., Park, B.-J., Kim, S.-H., & Sohn, J.-H. (2012). Emotion classification based on
physiological signals induced by negative emotions: Discriminantion of negative emotions
by machine learning algorithm. In 2012 9th IEEE International Conference on Networking,
Sensing and Control (ICNSC) (pp. 283–288). IEEE.
[179] Ren, P., Barreto, A., Gao, Y., & Adjouadi, M. (2011). Comparison of the use of pupil
diameter and galvanic skin response signals for affective assessment of computer users.
Biomedical Sciences Instrumentation, 48, 345–350.
[180] Torres, C. A., Orozco, Á. A., & Alvarez, M. A. (2013). Feature selection for multimodal
emotion recognition in the arousal-valence space. In 2013 35th Annual International Con-
ference of the IEEE Engineering in Medicine and Biology Society (EMBC) (pp. 4330–4333).
IEEE.
[181] Zhaofang Y., & Guangyuan, L. (2013). Emotion recognition based on nonlinear features
of skin conductance response. Journal of Information & Computational Science, 10(12),
3877–3887.
[182] Maaoui, C., Abdat, F., & Pruski, A. (2014). Physio-visual data fusion for emotion recogni-
tion. IRBM, 35(3), 109–118.
[183] Kukolja, D., Popović, S., Horvat, M., Kovač, B., & Ćosić, K. (2014). Comparative
analysis of emotion estimation methods based on physiological measurements for real-time
applications. International Journal of Human-Computer Studies, 72(10), 717–727.
[184] Hariharan, A., & Adam, M. T. P. (2015). Blended emotion detection for decision support.
IEEE Transactions on Human-Machine Systems, 45(4), 510–517.
[185] Khezri, M., Firoozabadi, M., & Sharafat, A. R. (2015). Reliable emotion recognition system
based on dynamic adaptive fusion of forehead biopotentials and physiological signals.
Computer Methods and Programs in Biomedicine, 122(2), 149–164.
[186] Khan, A. M., & Lawo, M. (2016). Recognizing emotion from blood volume pulse and
skin conductance sensor using machine learning algorithms (pp. 1291–1297). Cham:
Springer International Publishing. Available: http://dx.doi.org/10.1007/978-3-319-32703-
7_247 [Online].
[187] Rukavina, S., Gruss, S., Hoffmann, H., Tan, J.-W., Walter, S., & Traue, H. C. (2016).
Affective computing and the impact of gender and age. PloS One, 11(3), e0150584.
132 Bibliography
[188] Bialoskorski, L. S., Westerink, J. H., & Broek, E. L. (2009). Mood swings: An affective
interactive art system. In International conference on intelligent technologies for interactive
entertainment. Lecture Notes of the Institute for Computer Sciences, Social Informatics and
Telecommunications Engineering (Vol. 9, pp. 181–186). Berlin/Heidelberg: Springer.
[189] Lisetti, C. L., & Nasoz, F. (2004). Using noninvasive wearable computers to recognize
human emotions from physiological signals. EURASIP Journal on Advances in Signal
Processing, 2004(11), 1–16.
[190] Nardelli, M., Valenza, G., Greco, A., Lanata, A., & Scilingo, E. (2015). Recognizing
emotions induced by affective sounds through heart rate variability. IEEE Transactions of
Affective Computing, 6(4), 385–394.
[191] Janssen, J., Van den Broek, E., & Westerink, J. (2009). Personalized affective music player.
In 3rd International Conference on Affective Computing and Intelligent Interaction and
Workshops, 2009 (ACII 2009) (pp. 1–6.). IEEE.
[192] Lin, Y., Wang, C., Jung, T., Wu, T., Jeng, S., Duann, J., & Chen, J. (2010). Eeg-based
emotion recognition in music listening. IEEE Transactions on Biomedical Engineering,
57(7), 1798–1806.
[193] Wagner, J., Kim, J., & André, E. (2005). From physiological signals to emotions: Imple-
menting and comparing selected methods for feature extraction and classification. In 2005
IEEE International Conference on Multimedia and Expo (pp. 940–943). IEEE.
[194] Van den Broek, E., Schut, M., Westerink, J., & Tuinenbreijer, K. (2009). Unobtrusive
Sensing of Emotions (USE). Journal of Ambient Intelligence and Smart Environments, 1(3),
287–299.
[195] Van den Broek, E., & Westerink, J. (2009). Considerations for emotion-aware consumer
products. Applied Ergonomics, 40(6), 1055–1064.
[196] Zeng, Z., Pantic, M., Roisman, G., & Huang, T. (2008). A survey of affect recognition
methods: Audio, visual, and spontaneous expressions. IEEE Transactions on Pattern
Analysis and Machine Intelligence, 31(1), 39–58.
[197] Poels, K., & Dewitte, S. (2006). How to capture the heart? reviewing 20 years of emotion
measurement in advertising. Journal of Advertising Research-New York, 46(1), 18.
[198] Chanel, G., Kierkels, J., Soleymani, M., & Pun, T. (2009). Short-term emotion assessment
in a recall paradigm. International Journal of Human-Computer Studies, 67(8), 607–627.
[199] Healey, J. (2009). Affect detection in the real world: Recording and processing physiological
signals. In 3rd International Conference on Affective Computing and Intelligent Interaction
and Workshops, 2009 (ACII 2009) (pp. 1–6). IEEE.
[200] Lang, P., & Bradley, M. (1999). International affective digitized sounds (IADS): Stimuli, 802
instruction manual and affective ratings (Technical Report no. b-2) (Vol. 803). Gainsville.
[201] Bensafi, M. (2001). Le traitement affectif des odeurs: aspects implicites et explicites. Ph.d
Thesis in, Lyon 2 University.
[202] Lang, P., Bradley, M., & Cuthbert, B. (1997). International affective picture system
(IAPS): Technical manual and affective ratings. Gainesville: The Center for Research in
Psychophysiology, University of Florida.
[203] Lang, P., Greenwald, M., Bradley, M., & Hamm, A. (1993). Looking at pictures: Affective,
facial, visceral, and behavioral reactions. Psychophysiology, 30(3), 261–273.
[204] Lang, P. (1980). Behavioral treatment and bio-behavioral assessment: Computer applica-
tions. Technology in Mental Health Care Delivery Systems, 119–137.
[205] Grimm, S., Schmidt, C., Bermpohl, F., Heinzel, A., Dahlem, Y., Wyss, M., et al. (2006).
Segregated neural representation of distinct emotion dimensions in the prefrontal cortex–an
fMRI study. Neuroimage, 30(1), 325–340.
[206] Hariri, A., Mattay, V., Tessitore, A., Fera, F., & Weinberger, D. (2003). Neocortical
modulation of the amygdala response to fearful stimuli. Biological Psychiatry, 53(6),
494–501.
[207] Chatel-Goldman, J., Congedo, M., Jutten, C., & Schwartz, J.-L. (2014). Touch increases
autonomic coupling between romantic partners. Frontiers in Behavioral Neuroscience, 8, 95.
http://doi.org/10.3389/fnbeh.2014.00095.
Bibliography 133
[208] Liljencrantz, J., & Olausson, H. (2014). Tactile c fibers and their contributions to pleasant
sensations and to tactile allodynia. Frontiers in Behavioral Neuroscience, 8, 37. http://doi.
org/10.3389/fnbeh.2014.00037.
[209] Zotterman, Y. (1939). Touch, pain and tickling: An electro-physiological investigation on
cutaneous sensory nerves. Journal of Physiology, 95(1), 1–28.
[210] Rolls, E. T. (2010). The affective and cognitive processing of touch, oral texture, and
temperature in the brain. Neuroscience & Biobehavioral Reviews, 34(2), 237–245.
[211] Triscoli, C., Olausson, H., Sailer, U., Ignell, H., & Croy, I. (2013). CT-optimized skin
stroking delivered by hand or robot is comparable. Frontiers in Behavioral Neuroscience,
7, 208.
[212] Löken, L. S., Wessberg, J., McGlone, F., & Olausson, H. (2009). Coding of pleasant touch
by unmyelinated afferents in humans. Nature Neuroscience, 12(5), 547–548.
[213] Kessler, R., McGonagle, K., Zhao, S., Nelson, C., Hughes, M., Eshleman, S., et al. (1994).
Lifetime and 12-month prevalence of dsm-iii-r psychiatric disorders in the united states:
Results from the national comorbidity survey. Archives of General Psychiatry, 51(1), 8.
[214] Kauer-SantAnna, M., Kapczinski, F., & Vieta, E. (2009). Epidemiology and management of
anxiety in patients with bipolar disorder. CNS Drugs, 23(11), 953–964.
[215] Young, R., Biggs, J., Ziegler, V., & Meyer, D. (1978). A rating scale for mania: Reliability,
validity and sensitivity. The British Journal of Psychiatry, 133(5), 429–435.
[216] Vieta, E., Reinares, M., & Rosa, A. (2011). Staging bipolar disorder. Neurotoxicity Research,
19(2), 279–285.
[217] Andreazza, A., Kauer-Sant’Anna, M., Frey, B., Bond, D., Kapczinski, F., Young, L., et al.
(2008). Oxidative stress markers in bipolar disorder: A meta-analysis. Journal of Affective
Disorders, 111(2), 135–144.
[218] Phillips, M., & Vieta, E. (2007). Identifying functional neuroimaging biomarkers of bipolar
disorder: Toward dsm-v. Schizophrenia Bulletin, 33(4), 893–904.
[219] Cohen, H., Kaplan, Z., Kotler, M., Mittelman, I., Osher, Y., & Bersudsky, Y. (2003). Impaired
heart rate variability in euthymic bipolar patients. Bipolar Disorders, 5(2), 138–143.
[220] Henry, B. L., Minassian, A., Paulus, M. P., Geyer, M. A., & Perry, W. (2010). Heart rate
variability in bipolar mania and schizophrenia. Journal of Psychiatric Research, 44(3),
168–176.
[221] Levy, B. (2013). Autonomic nervous system arousal and cognitive functioning in bipolar
disorder. Bipolar Disorders, 15(1), 70–79.
[222] Valenza, G., Nardelli, M., Lanata, A., Gentili, C., Bertschy, G., Paradiso, R., et al., Wearable
monitoring for mood recognition in bipolar disorder based on history-dependent long-term
heart rate variability analysis. IEEE Journal of Biomedical and Health Informatics, 18(5),
1625–1635.
[223] Iverson, G., Gaetz, M., Rzempoluck, E., McLean, P., Linden, W., & Remick, R. (2005). A
new potential marker for abnormal cardiac physiology in depression. Journal of Behavioral
Medicine, 28(6), 507–511.
[224] Taillard, J., Lemoine, P., Boule, P., Drogue, M., & Mouret, J. (1993). Sleep and heart rate
circadian rhythm in depression: The necessity to separate. Chronobiology International,
10(1), 63–72.
[225] Taillard, J., Sanchez, P., Lemoine, P., & Mouret, J. (1990). Heart rate orcadian rhythm
as a biological marker of desynchronization in major depression: A methodological and
preliminary report. Chronobiology International, 7(4), 305–316.
[226] Iacono, W. G., & Tuason, V. B. (1983). Bilateral electrodermal asymmetry in euthymic
patients with unipolar and bipolar affective disorders. Biological Psychiatry, 18(3), 303–315.
[227] Iacono, W. G., Lykken, D. T., Peloquin, L. J., Lumry, A. E., Valentine, R. H., & Tuason,
V. B. (1983). Electrodermal activity in euthymic unipolar and bipolar affective disorders: A
possible marker for depression. Archives of General Psychiatry, 40(5), 557.
134 Bibliography
[228] Sponheim, S., Allen, J., & Iacono, W. (1995). Selected psychophysiological measures
in depression: The significance of electrodermal activity, electroencephalographic asym-
metries, and contingent negative variation to behavioral and neurobiological aspects
of depression. The behavioral high risk paradigm in psychopathology (pp. 222–249).
New York: Springer.
[229] Damasio, A. R. (1998). Emotion in the perspective of an integrated nervous system. Brain
Research Reviews, 26(2), 83–86.
[230] Coan, J. A., & Allen, J. J. (2007). Handbook of emotion elicitation and assessment. Oxford:
Oxford university press.
[231] Ruiz-Padial, E., Vila, J., & Thayer, J. (2011). The effect of conscious and non-conscious
presentation of biologically relevant emotion pictures on emotion modulated startle and
phasic heart rate. International Journal of Psychophysiology, 79(3), 341–346.
[232] Beissner, F., Meissner, K., Bär, K.-J., & Napadow, V. (2013). The autonomic brain: An
activation likelihood estimation meta-analysis for central processing of autonomic function.
The Journal of Neuroscience, 33(25), 10,503–10,511.
[233] Heller, A., Johnstone, T., Shackman, A., Light, S., Peterson, M., Kolden, G., et al. (2009).
Reduced capacity to sustain positive emotion in major depression reflects diminished
maintenance of fronto-striatal brain activation. Proceedings of the National Academy of
Sciences, 106(52), 22,445–22,450.
[234] Picard, R. W. (2003). Affective computing: Challenges. International Journal of Human-
Computer Studies, 59(1), 55–64.
[235] Rottenberg, J., Ray, R. R., & Gross, J. J. (in press). Emotion elicitation using films. In J. A.
Coan & J. J. B. Allen (Eds.), The handbook of emotion elicitation and assessment. New York:
Oxford University Press.
[236] Nasoz, F., Alvarez, K., Lisetti, C. L., & Finkelstein, N. (2004). Emotion recognition
from physiological signals using wireless sensors for presence technologies. Cognition,
Technology & Work, 6(1), 4–14.
[237] Pecchinenda, A. (1996). The affective significance of skin conductance activity during a
difficult problem-solving task. Cognition & Emotion, 10(5), 481–504.
[238] Scheirer, J., Fernandez, R., Klein, J., & Picard, R. W. (2002). Frustrating the user on purpose:
A step toward building an affective computer. Interacting with Computers, 14(2), 93–118.
[239] Ilves, M., & Surakka, V. (2012). Heart rate responses to synthesized affective spoken words.
Advances in Human-Computer Interaction, 2012, 14.
[240] Yang, Y.-H., & Chen, H. H. (2011). Music emotion recognition. Milton Park/Abingdon-on-
Thames/Oxfordshire: Taylor & Francis Group.
[241] Yang, Y.-H., Lin, Y.-C., Su, Y.-F., & Chen, H. H. (2008). A regression approach to music
emotion recognition. IEEE Transactions on Audio, Speech, and Language Processing, 16(2),
448–457.
[242] Kim, J., & Andre, E. (2008). Emotion recognition based on physiological changes in
music listening. IEEE Transactions on Pattern Analysis and Machine Intelligence, 30(12),
2067–2083.
[243] Ramakrishnan, S. (2012). Recognition of emotion from speech: A review. In S.
Ramakrishnan (Ed.), Speech enhancement, modeling and recognition–algorithms
and applications (p. 121). ISBN: 978-953-51-0291-5, 2012. InTech, Available from:
http://www.intechopen.com/books/speech-enhancementmodeling-and-recognition-
algorithmsandapplications/recognition-of-emotion-from-speech-areview.
[244] Levenson, R. W. (1988). Emotion and the autonomic nervous system: A prospectus for
research on autonomic specificity. Social psychophysiology and emotion: Theory and
clinical applications (pp. 17–42). Oxford: Wiley.
[245] Andreassi, J. L. (2000). Psychophysiology: Human behavior and physiological response.
New York: Psychology Press.
Bibliography 135
[246] Lanata, A., Greco, A., Valenza, G., & Scilingo, E. P. (2014). A pattern recognition approach
based on electrodermal response for pathological mood identification in bipolar disorders. In
2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)
(pp. 3601–3605). IEEE.
[247] Bianchi, M., Valenza, G., Serio, A., Lanata, A., Greco, A., Nardelli, M., et al. (2014). Design
and preliminary affective characterization of a novel fabric-based tactile display. In 2014
IEEE Haptics Symposium (HAPTICS) (pp. 591–596). IEEE.
[248] Valenza, G., Lanata, A., & Scilingo, E. (2012). The role of nonlinear dynamics in affective
valence and arousal recognition. IEEE Transactions on Affective Computing, 3(2), 237–249.
[249] Bradley, M., & Lang, P. J. (1999). The International affective digitized sounds (IADS)
stimuli, instruction manual and affective ratings. NIMH Center for the Study of Emotion
and Attention. Gainesville: University of Florida.
[250] Bensafi, M., Rouby, C., Farget, V., Bertrand, B., Vigouroux, M., & Holley, A. (2002).
Autonomic nervous system responses to odours: The role of pleasantness and arousal.
Chemical Senses, 27(8), 703–709.
[251] Gibbons, J. D., & Chakraborti, S. (2011). Nonparametric statistical inference. Berlin:
Springer.
[252] Jain, A., & Zongker, D. (1997). Feature selection: Evaluation, application, and small sample
performance. IEEE Transactions on Pattern Analysis and Machine Intelligence, 19(2),
153–158.
[253] Wang, J.-C., Wang, J.-F., He, K. W., & Hsu, C.-S. (2006). Environmental sound classification
using hybrid svm/knn classifier and MPEG-7 audio low-level descriptor. In International
Joint Conference on Neural Networks, 2006 (IJCNN06) (pp. 1731–1735). IEEE.
[254] Kearns, M., & Ron, D. (1999). Algorithmic stability and sanity-check bounds for leave-one-
out cross-validation. Neural Computation, 11(6), 1427–1453.
[255] Duda, R. O., Hart, P. E., & Stork, D. G. (2012). Pattern classification. New York: Wiley.
[256] Kohavi, R., & Provost, F. (1988). Glossary of terms. Machine Learning, 30, 271–274.
[257] Lemke, M. R., Fischer, C. J., Wendorff, T., Fritzer, G., Rupp, Z., & Tetzlaff, S. (2005).
Modulation of involuntary and voluntary behavior following emotional stimuli in healthy
subjects. Progress in Neuro-Psychopharmacology and Biological Psychiatry, 29(1), 69–76.
[258] Schaaff, K., & Schultz, T. (2009). Towards emotion recognition from electroencephalo-
graphic signals. In 3rd International Conference on Affective Computing and Intelligent
Interaction and Workshops, 2009 (ACII 2009) (pp. 1–6). IEEE.
[259] Frantzidis, C. A., Bratsas, C., Papadelis, C. L., Konstantinidis, E., Pappas, C., & Bamidis,
P. D. (2010). Toward emotion aware computing: an integrated approach using multichannel
neurophysiological recordings and affective visual stimuli. IEEE Transactions on Informa-
tion Technology in Biomedicine, 14(3), 589–597.
[260] Palomba, D., Angrilli, A., & Mini, A. (1997). Visual evoked potentials, heart rate responses
and memory to emotional pictorial stimuli. International Journal of Psychophysiology,
27(1), 55–67.
[261] Junghöfer, M., Schupp, H. T., Stark, R., & Vaitl, D. (2005). Neuroimaging of emotion:
Empirical effects of proportional global signal scaling in fMRI data analysis. NeuroImage,
25(2), 520–526.
[262] Jackson Davis, W., Rahman, M. A., Smith, L. J., Burns, A., Senecal, L., McArthur, D.,
et al. (1995). Properties of human affect induced by static color slides (IAPS): Dimensional,
categorical and electromyographic analysis. Biological Psychology, 41(3), 229–253.
[263] Sloan, D. M., Bradley, M. M., Dimoulas, E., & Lang, P. J. (2002). Looking at facial
expressions: Dysphoria and facial EMG. Biological Psychology, 60(2), 79–90.
[264] Gavazzeni, J., Wiens, S., & Fischer, H. (2008). Age effects to negative arousal differ for
self-report and electrodermal activity. Psychophysiology, 45(1), 148–151.
[265] Norris, C. J., Larsen, J. T., & Cacioppo, J. T. (2007). Neuroticism is associated with
larger and more prolonged electrodermal responses to emotionally evocative pictures.
Psychophysiology, 44(5), 823–826.
136 Bibliography
[266] Greco, A., Lanata, A., Valenza, G., Scilingo, E. P., & Citi, L. (2014). Electrodermal activity
processing: A convex optimization approach. In 2014 36th Annual International Conference
of the IEEE Engineering in Medicine and Biology Society (EMBC) (pp. 2290–2293). IEEE.
[267] Pollatos, O., Herbert, B. M., Matthias, E., & Schandry, R. (2007). Heart rate response
after emotional picture presentation is modulated by interoceptive awareness. International
Journal of Psychophysiology, 63(1), 117–124.
[268] Lang, P. (1995). The emotion probe. Studies of motivation and attention. The American
Psychologist, 50(5), 372.
[269] McCurdy, H. G. (1950). Consciousness and the galvanometer. Psychological Review, 57(6),
322.
[270] Bradley, M. M., & Lang, P. J. (2007). The international affective digitized sounds (iads):
Affective ratings of sounds and instruction manual. Technical Report B-3, University of
Florida, Gainesville.
[271] Lang, P. J., Bradley, M. M., & Cuthbert, B. N. (2005). International affective picture system
(IAPS): Affective ratings of pictures and instruction manual. Cuthbert: NIMH, Center for
the Study of Emotion & Attention.
[272] Gobl, C., & Ni, A. (2003). The role of voice quality in communicating emotion, mood and
attitude. Speech Communication, 40(1), 189–212.
[273] Goldstein, A. (1980). Thrills in response to music and other stimuli. Physiological Psychol-
ogy, 8(1), 126–129.
[274] Johna, S. (1991). Music structure and emotional response: Some empirical findings.
Psychology of Music, 991(9), L120.
[275] Marcell, M., Malatanos, M., Leahy, C., & Comeaux, C. (2007). Identifying, rating, and
remembering environmental sound events. Behavior Research Methods, 39(3), 561–569.
[276] Gaver, W. W. (1993). What in the world do we hear? An ecological approach to auditory
event perception. Ecological Psychology, 5(1), 1–29.
[277] Blood, A. J., & Zatorre, R. J. (2001) Intensely pleasurable responses to music correlate
with activity in brain regions implicated in reward and emotion. Proceedings of the National
Academy of Sciences, 98(20), 11,818–11,823.
[278] Koelsch, S., Fritz, T., Müller, K., Friederici, A. D., et al. (2006). Investigating emotion with
music: An fMRI study. Human Brain Mapping, 27(3), 239–250.
[279] Anders, S., Eippert, F., Weiskopf, N., & Veit, R. (2008). The human amygdala is sensitive to
the valence of pictures and sounds irrespective of arousal: An fMRI study. Social Cognitive
and Affective Neuroscience, 3(3), 233–243.
[280] Roque, A. L., Valenti, V. E., Guida, H. L., Campos, M. F., Knap, A., Vanderlei, L. C. M., et
al. (2013). The effects of auditory stimulation with music on heart rate variability in healthy
women. Clinics, 68(7), 960–967.
[281] Orini, M., Bailon, R., Enk, R., Koelsch, S., Mainardi, L., & Laguna, P. (2010). A method
for continuously assessing the autonomic response to music-induced emotions through HRV
analysis. Medical & Biological Engineering & Computing, 48(5), 423–433.
[282] Anttonen, J., & Surakka, V. (2005). Emotions and heart rate while sitting on a chair.
In Proceedings of the SIGCHI Conference on Human Factors in Computing Systems
(pp. 491–499). ACM.
[283] Ivonin, L., Chang, H.-M., Chen, W., & Rauterberg, M. (2013). Unconscious emotions: Quan-
tifying and logging something we are not aware of. Personal and Ubiquitous Computing,
17(4), 663–673.
[284] Verona, E., Patrick, C. J., Curtin, J. J., Bradley, M. M., & Lang, P. J. (2004). Psychopathy and
physiological response to emotionally evocative sounds. Journal of Abnormal Psychology,
113(1), 99.
[285] Hariharan, A., & Adam, M. T. P. (2015). Blended emotion detection for decision support.
IEEE Transactions on Human-Machine Systems, 45(4), 510–517.
[286] Storm, H. (2008). Changes in skin conductance as a tool to monitor nociceptive stimulation
and pain. Current Opinion in Anesthesiology, 21(6), 796–804.
Bibliography 137
[287] Hellerud, B., & Storm, H. (2002). Skin conductance and behaviour during sensory stimula-
tion of preterm and term infants. Early Human Development, 70(1), 35–46.
[288] Olausson, H., Cole, J., Rylander, K., McGlone, F., Lamarre, Y., Wallin, B. G., et al. (2008).
Functional role of unmyelinated tactile afferents in human hairy skin: Sympathetic response
and perceptual localization. Experimental Brain Research, 184(1), 135–140.
[289] Löken, L. S., Evert, M., & Wessberg, J. (2011). Pleasantness of touch in human glabrous
and hairy skin: Order effects on affective ratings. Brain Research, 1417, 9–15.
[290] Klöcker, A., Oddo, C. M., Camboni, D., Penta, M., & Thonnard, J.-L. (2014). Physical
factors influencing pleasant touch during passive fingertip stimulation. PloS One, 9(7),
e101361.
[291] Bradley, M. M., & Lang, P. J. (1994). Measuring emotion: the self-assessment manikin and
the semantic differential. Journal of Behavior Therapy and Experimental Psychiatry, 25(1),
49–59.
[292] Posner, J., Russell, J. A., & Peterson, B. S. (2005). The circumplex model of affect: An
integrative approach to affective neuroscience, cognitive development, and psychopathology.
Development and Psychopathology, 17(03), 715–734.
[293] Herz, R. S., & Engen, T. (1996). Odor memory: review and analysis. Psychonomic Bulletin
& Review, 3(3), 300–313.
[294] Van Toller, S. (1988). Emotion and the brain. In Perfumery (pp. 121–143). Berlin: Springer.
[295] Alaoui-Ismaili, O., Robin, O., Rada, H., Dittmar, A., & Vernet-Maury, E. (1997). Basic emo-
tions evoked by odorants: Comparison between autonomic responses and self-evaluation.
Physiology & Behavior, 62(4), 713–720.
[296] Lorig, T. S., Huffman, E., DeMartino, A., & DeMarco, J. (1991). The effects of low
concentration odors on eeg activity and behavior. Journal of Psychophysiology, 5(1), 69–77.
[297] Aggleton, J. P., & Mishkin, M. (1986). The amygdala: sensory gateway to the emotions.
Emotion: Theory, Research and Experience, 3, 281–299.
[298] Price, J. L. (1987). The central olfactory and accessory olfactory systems. In T. E. Finger, &
W. L. Silver (Eds.), Neurobiology of Taste and Smell (pp. 179–203). New York: Wiley.
[299] Greco, A., Valenza, G., Nardelli, M., Bianchi, M., Citi, L., & Scilingo, E. P. (2016). Force-
velocity assessment of caress-like stimuli through the electrodermal activity processing:
Advantages of a convex optimization approach. IEEE Transactions on Human-Machine
Systems, PP(99), 1–10.
[300] Van Toller, C., Kirk-Smith, M., Wood, N., Lombard, J., & Dodd, G. (1983). Skin con-
ductance and subjective assessments associated with the odour of 5-˛-androstan-3-one.
Biological Psychology, 16(1), 85–107.
[301] Robin, O., Alaoui-Ismaili, O., Dittmar, A., & Vernet-Maury, E. (1999). Basic emotions
evoked by eugenol odor differ according to the dental experience. a neurovegetative analysis.
Chemical Senses, 24(3), 327–335.
[302] Brauchli, P., Rüegg, P. B., Etzweiler, F., & Zeier, H. (1995). Electrocortical and autonomic
alteration by administration of a pleasant and an unpleasant odor. Chemical Senses, 20(5),
505–515.
[303] Uryvaev, Y., Golubeva, N., & Nechaev, A. (1986). Differences in human involuntary
reactions to perceptible and imperceptible odors. In Doklady Akademii Nauk SSSR, 290,
501–504.
[304] Henion, K. E. (1971). Odor pleasantness and intensity: A single dimension? Journal of
Experimental Psychology, 90(2), 275.
[305] Moskowitz, H. R., Dravnieks, A., & Gerbers, C. (1974). Odor intensity and pleasantness of
butanol. Journal of Experimental Psychology, 103(2), 216.
[306] Moskowitz, H. R., Dravnieks, A., & Klarman, L. A. (1976). Odor intensity and pleasantness
for a diverse set of odorants. Attention, Perception, & Psychophysics, 19(2), 122–128.
[307] Doty, R. L. (1975). An examination of relationships between the pleasantness, intensity, and
concentration of 10 odorous stimuli. Perception & Psychophysics, 17(5), 492–496.
138 Bibliography
[308] Alaoui-Ismaili, O., Vernet-Maury, E., Dittmar, A., Delhomme, G., & Chanel, J. (1997).
Odor hedonics: Connection with emotional response estimated by autonomic parameters.
Chemical Senses, 22(3), 237–248.
[309] Kring, A. M., & Gordon, A. H. (1998). Sex differences in emotion: Expression, experience,
and physiology. Journal of Personality and Social Psychology, 74(3), 686.
[310] Bradley, M. M., Codispoti, M., Sabatinelli, D., & Lang, P. J. (2001). Emotion and motivation
ii: Sex differences in picture processing. Emotion, 1(3), 300.
[311] Nardelli, M., Valenza, G., Bianchi, M., Greco, A., Lanata, A., Bicchi, A., et al. (2015).
Gender-specific velocity recognition of caress-like stimuli through nonlinear analysis of
heart rate variability. In 2015 37th Annual International Conference of the IEEE Engineering
in Medicine and Biology Society (EMBC) (pp. 298–301). IEEE.
[312] Grossman, M., & Wood, W. (1993). Sex differences in intensity of emotional experience: A
social role interpretation. Journal of Personality and Social Psychology, 65(5), 1010.
[313] Robin, O., Rousmans, S., Dittmar, A., & Vernet-Maury, E. (2003). Gender influence on
emotional responses to primary tastes. Physiology & Behavior, 78(3), 385–393.
[314] Wrase, J., Klein, S., Gruesser, S. M., Hermann, D., Flor, H., Mann, K., et al. (2003).
Gender differences in the processing of standardized emotional visual stimuli in humans:
a functional magnetic resonance imaging study. Neuroscience Letters, 348(1), 41–45.
[315] Soussignan, R., & Schall, B. (1996). Children’s facial responsiveness to odors: Influences of
hedonic valence of odor, gender, age, & social presence. Developmental Psychology, 32(2),
367.
[316] Yousem, D. M., Maldjian, J. A., Siddiqi, F., Hummel, T., Alsop, D. C., Geckle, R. J., et
al. (1999). Gender effects on odor-stimulated functional magnetic resonance imaging. Brain
Research, 818(2), 480–487.
[317] Naudin, M., El-Hage, W., Gomes, M., Gaillard, P., Belzung, C., & Atanasova, B. (2012).
State and trait olfactory markers of major depression. PLoS One, 7(10), e46938.
[318] Kroenke, K., Spitzer, R., & Williams, J. (2001). The PHQ-9. Journal of General Internal
Medicine, 16(9), 606–613.
[319] Wilcoxon, F. (1945). Individual comparisons by ranking methods. Biometrics Bulletin, 1(6),
80–83.
[320] Siegel, S. (1956). The Mann-Whitney U test. Nonparametric statistics for the behavioral
sciences (pp. 116–127). New York: McGraw-Hill.
[321] Meli, L., Scheggi, S., Pacchierotti, C., & Prattichizzo, D. (2014). Wearable haptics and hand
tracking via an RGB-d camera for immersive tactile experiences. In ACM SIGGRAPH 2014
Posters (p. 56). ACM.
[322] Mørkrid, L., & Qiao, Z.-G. (1988). Continuous estimation of parameters in skin electrical
admittance from simultaneous measurements at two different frequencies. Medical and
Biological Engineering and Computing, 26(6), 633–640.
[323] Sawan, M., Laaziri, Y., Mounaim, F., Elzayat, E., Corcos, J., & Elhilali, M. (2007).
Electrode–tissues interface: Modeling and experimental validation. Biomedical Materials,
2(1), S7.
[324] Pavšelj, N., Préat, V., & Miklavčič, D. (2007). A numerical model of skin electroper-
meabilization based on in vivo experiments. Annals of Biomedical Engineering, 35(12),
2138–2144.
[325] Empatica: Human data in real time. https://www.empatica.com/ (2016).
[326] Fernandez, R., & Picard, R. W. (2003). Modeling drivers’ speech under stress. Speech
Communication, 40(1), 145–159.