Professional Documents
Culture Documents
2017-Thermal Facial Signatures
2017-Thermal Facial Signatures
2017-Thermal Facial Signatures
net/publication/317874224
CITATIONS READS
0 41
8 authors, including:
Matthew Sherwood
Wright State University
15 PUBLICATIONS 31 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
Hemodynamic Response to Hypoxia - a Naval Medical Research Unit grant View project
All content following this page was uploaded by Matthew Sherwood on 20 July 2018.
Abstract ering around 50 percent [3]. Some deceivers will have concerns
This study explored emotional and physiological states in re- that they will be found out, leading to high levels of anxiety [20].
sponse to stressful situations involving deception, and their rela- Complicating the accurate detection of deception is that some de-
tion to facial imaging. Male participants were evaluated (i.e., ceivers will be unconcerned with discovery and have low emo-
appraised) impending stressors. Stressor appraisals ranged from tional reactions, whereas those innocently accused may have high
threat (i.e., appraisals that there are a lack of sufficient resources levels of anxiety. Consequently, we assume that stress reactivity
to meet the demands of a stressor) to challenge (i.e., appraisals varies under deception conditions, because the act of interrogation
that there are adequate coping resources to manage stressor can be a stressor itself.
demands). We used appraisals to discriminate two appraisal The stress process begins with a persons evaluation or ap-
groups: threat versus challenge. The capacity to detect state praisal of an event, and people appraise situations differently,
changes of the human face was examined using several standoff causing their stress responses to differ [18], [19]. Appraisals
sensing technologies, including 2D visible (VIS) and mid-wave range on a continuum from challenge to threat. Challenge ap-
infrared (MWIR) cameras. Psychophysiological determination of praisals result when people evaluate an impending situation as
the human state was achieved through self-assessment and phys- personally relevant and demanding, but one with which they may
iological measures which formed the ground truth for the clas- be able to cope and even gain mastery. Threat appraisals result
sifier systems. Two deception studies, a false opinion study and when people evaluate the demands of the situation as outweigh-
a false behavior study, served as high-stakes stressors to under- ing their ability to cope; they may become overwhelmed. Re-
stand facial changes with respect to human stress states. The search shows that challenged participants experience more pos-
methodology extracted MWIR statistical features from facial re- itive and less negative affect than threatened participants [18],
gions in response to changes in stress state. Using the statistical [19]. These two groups have also been distinguished by their
features from MWIR sensor and implementing Eigen analysis al- physiological pattern differences. During a stressor, challenged
lowed classification of the threatened and challenged participants participants have increased cardiac output into a more accept-
for the false behavior study with an accuracy of 85%. ing vasculature, whereas threatened participants have somewhat
more blood pumped into a more resistant, constricted vasculature
Introduction [18],[19],[12]. High-stakes situations are stressors that engage
In ancient China, lie detection included putting dry rice in stress responses in participants. Research indicates that threat and
the mouth of the accused. Upon spitting it out, those with rice challenge appraisals initiate the stress process and are robust pre-
sticking to their tongue were considered guilty. Stress causes a dictors of various stress responses. Stressor appraisals are appro-
dry mouth, preventing enough saliva to help spit out all of the priate for discerning the human state and emotional activity. We
rice. Presumably, those guilty would suffer more stress than inno- examined stress responses such as appraisals, affect, and phys-
cents. Assessment of human emotional states is a complex multi- iology and investigated the validity of remote sensing of facial
disciplinary field with theoretical roots in psycho-physiology. To activity to do the same.
empirically investigate emotional states, scientists trained in psy- The human state and emotional assessment can be defined as
chology (e.g., clinical, social, neuroscience and cognitive) utilize a participants stress response to specific stimuli that includes be-
different assessments including self-reports, physiological mon- havioral, physiological and visual components. Behavioral mea-
itoring and observation, to name a few. To study and catego- surements in stress studies are subject to modification, based on
rize emotions, for example, self-reports assess emotional expe- what an individual is able and motivated to display, whereas phys-
rience, physiological assessments assess blood flow and sympa- iological measurements are more implicit, where individuals are
thetic nervous system or amygdala activity, and behaviors indi- often not aware of and furthermore cannot control their physio-
cate approaching or avoiding a stimulus. Our research aims to logical responses to any great degree. Visual facial observations
analyze the face during a high-stakes deception situation, using (facial expression, head gesture, eye movements, gaze, etc.) can
past-validated psycho-physiological metrics as well as advanced provide quality indicators of stress [7]. The domain of human
imaging and pattern recognition techniques to predict the human state assessment and emotion recognition from a psychological
emotional state via non-contact methods. perspective has been expanding over several years. Ekman and
The use of non-contact sensing for detecting intentions to de- Friezen [5] have shown that facial expressions may be linked to
ceive, if valid, can be of great value. People often rely on nonver- specific (i.e., basic) emotions, and that these basic emotions are
bal cues of deception (eye contact patterns, nervous laughter, and relatively universal across human cultures. Stemming from his
hurried speech) and are rarely accurate, with success rates hov- work with Tomkins, Ekman has developed a systematic method of
Study 1 (n=20) Study 2 (n=23) Most challenged person Most Threatened person
Mean SD Mean SD Study 1 Study 2 Study 1 Study 2
Appraisal Ratio 0.75 0.24 Appraisal Ratio 1.82 2.30
Appraisal Ratio 1.10 0.32 Appraisal Ratio 1.06 0.55
Positive Effect 2.50 4.40 Positive Effect 2.30 3.50
Positive Effect 2.75 0.61 Positive Effect 3.25 0.78 Negative Effect 1.00 1.00 Negative Effect 2.90 4.40
Negative Effect 1.62 0.59 Negative Effect 1.66 0.80 Physiology -1.02 1.58 Physiology 0.54 0.26
Physiology -0.11 0.80 Physiology 0.25 0.84
ysis. The research and findings have shown that an appraisal ra-
the first 30 seconds of the task for all 29 segments of the face. tio distinguishes important outcomes for people along the threat-
130 seconds of data (10 seconds of baseline and 120 seconds of challenge continuum [18],[19], [13],[2]. Our preliminary facial
task data) yield 3,600 frames for analysis. Analyzing just a single pattern analysis for both studies indicated that Study 2 uncovered
feature from each facial segment results in over 100k features per interesting and discriminatory results.
subject. Data reduction was achieved by temporal sampling using
various schema such as sliding windows (details will be published Data partitioning using appraisal ratio
elsewhere) to yield both sub-sampled and time-averaged features. In Study 2, the appraisal ratio served as ground truth for the
Features from defined time epochs, such as slopes and differences classification task and led to the identification of three classes of
from baseline values, were included in the final feature pool of participants: threatened, challenged and neutral (Figure 3).
261 features per participant.
Classification
We employed Eigen analysis on the spatio-temporal data set
to facilitate dimensionality reduction and improve classification
results. We combined spatial segments to form six distinct re-
gions as shown in Figure 2, and then focused on the forehead and
nose regions to reduce the feature set to 300 time series features
(30 distinct frames * 10 segments) and 90 time-epoch-based fea-
tures (9 epochs * 10 segments) per participant. In this particular
study, statistical and histogram-based thermal features from the
false behavior dataset were input into a basic k-Nearest Neighbor
classifier wherein the appraisal ratio served as ground truth (GT). Figure 3: Using appraisal ratio as ground truth, participants were
divided into three categories: threatened (z-score > 0.5, n=10),
Results challenged (z-score < -0.5, n=10) and neutral (-0.5 < z-score <
0.5, n=28).
False opinion (Study 1) vs. false behavior (Study
2)
There appeared to be more variability in self-reports of ap-
praisal and emotional state for Study 2 (Table 1). In Study 2,
we also found that the most challenged person appeared to have
higher positive affect, lower negative affect, and challenge-like
physiology, compared to the most challenged person in Study 1
(Table 2). A similar examination of the most threatened persons
in Study 1 versus Study 2 revealed that Study 2s most threatened
person appeared to have higher threat appraisals, lower positive
affect, and higher negative affect, than the most threatened person
in Study 1 (Table 2). However, the physiology appeared to be sim-
ilar for the most threatened person in Studies 1 and 2. Overall, the
psychophysiological metrics appear to be more varied in Study 2,
and in the right direction for discriminating between challenged
and threatened participants. Physiology may have been less reli- Figure 4: Using appraisal ratio as ground truth, participants who
able for Study 1 given the non-continuous measurement of blood were being deceitful were divided into three categories: threat-
pressure. The physiological index for each study was not related ened (z-score > 0.5, n=5), challenged (z-score < -0.5, n=5) and
to self-reported stressor appraisals or affective states, as it has neutral (-0.5 < z-score < 0.5, n=13).
been in past research [18],[19], [13],[2]. Consequently, subse-
quent analyses focused on those variables that were related to one This analysis included all Study 2 participants with usable
another as expected from the theoretical and empirical literature. data and not just the subset in the deception condition. The com-
Given the above examination of the data for each study, we puted z-scores ranged from -2.0 to 3.0, where higher Z-scores
utilized the Study 2 appraisal ratio as our ground truth for anal- indicate threat and lower Z-scores indicate challenge conditions.
the k-NN classifier with k=3, we found that features from certain
regions of the face are more effective than others for our classi-
fication problem (Table 3). Specifically, the forehead and nose
regions were most useful for the discrimination of threatened and
challenged participants, a result that is consistent with other re-
search findings [17]. Combining the data from these segments
yielded a classification accuracy of 85% (Figure 6). This is in
contrast to an accuracy of 73% observed when inputting features
from the entire face.
Discussion
The goal of this study was to examine whether thermal imag-
ing is useful for detecting differences between specific human
states, which we identify as threatened and challenged states. We
applied Eigen analysis and a simple k-NN classifier to a ther-
Figure 10: Accuracy estimation along with confusion matrices mal feature dataset to successfully discriminate emotional states.
for the fusion of nose and forehead for lesser threat and lesser Our approach was tested with a minimum of ten threatened and
challenge participants. challenged participants whose spontaneous changes in thermal
signatures were captured and validated with psychological self-
The cross validation involves a previously trained system reporting ground truth information. From the above sections, the
with 20 extreme threat and challenge participants and testing in- accuracy rate drops considerably from 85% to 70% and further
volves using all the 40 participants. Finally, accuracy is calcu- drops to 52.5% as the lesser threatened and lesser challenged par-
lated for three different cases using k-NN classification as shown ticipants are included in the testing dataset as shown in Table 4.
in Figure 10. The accuracy calculated for each case indicates that The pattern analysis agrees with the psychological ground truth
PC 1, PC 2 and PC 3 together provide similar classification as PC where the participants close to zero z-score are the typical neutral
2 and PC 3. However, the accuracy dropped from 85% to 52.50% participants.
indicating that the lesser threat and challenge participants proba-
bly do not exhibit a characteristic threat or challenge response. Table 4: Accuracy estimation for all experiments calculated using
k-NN clasifier using leave one out cross validation technique
Deception classification using thermal features
We have reported results from Study 2 for the challenged Threat and Accuracy all Accuracy Accuracy
versus threatened classes (Figure 3), which are comprised of both challenge PCs PC 1, 2, 3 PC 2, 3
deceptive and non-deceptive participants. A similar analysis was participants
performed using only the deceptive Study 2 participants, who also 20 partici- 60% 60% 85%
populate the challenged-threatened continuum (Figure 4). pants
30 partici- 53.33% 56.67% 70%
pants
40 partici- 50% 52.50% 52.50%
pants
References [14] James A Levine, Ioannis Pavlidis, and Murray Cooper. The face of
[1] Jeremy N Bailenson, Emmanuel D Pontikakis, Iris B Mauss, James J fear. The Lancet, 357(9270):1757, 2001.
Gross, Maria E Jabon, Cendri AC Hutcherson, Clifford Nass, and
Oliver John. Real-time classification of evoked emotions using [15] Zhilei Liu and Shangfei Wang. Emotion recognition using hidden
facial feature tracking and physiological responses. International markov models from facial temperature sequence. In Affective com-
journal of human-computer studies, 66(5):303–317, 2008. puting and intelligent interaction, pages 240–247. Springer, 2011.
[2] J Blascovich and WB Mendes. Challenge and threat appraisals: The [16] Barbara L O ’ Kane, Philip Sandick, Todd Shaw, and Mike Cook.
role of affective cues. inj. forgas (ed.), feeling and thinking: The role Dynamics of human thermal signatures. In Proceedings of the In-
ofaffect in social cognition (pp. 59–82), 2000. framation Conference. Las Vegas, 2004.
[3] David B Buller and Judee K Burgoon. Interpersonal deception the- [17] Ioannis Pavlidis, Norman L Eberhardt, and James A Levine. Hu-
ory. Communication theory, 6(3):203–242, 1996. man behaviour: Seeing through the face of deception. Nature,
415(6867):35, 2002.
[4] Carl B Cross, Julie A Skipper, and Douglas Petkie. Thermal imag-
ing to detect physiological indicators of stress in humans. In SPIE [18] Tamera R Schneider. The role of neuroticism on psychological and
Defense, Security, and Sensing, pages 87050I–87050I. International physiological stress responses. Journal of Experimental Social Psy-
Society for Optics and Photonics, 2013. chology, 40(6):795–804, 2004.
[5] Paul Ekman and Wallace V Friesen. Constants across cultures in [19] Tamera R Schneider. Evaluations of stressful transactions: what’s in
the face and emotion. Journal of personality and social psychology, an appraisal? Stress and Health, 24(2):151–158, 2008.
17(2):124, 1971.
[20] Kamila E Sip, David Carmel, Jennifer L Marchant, Jian Li, Predrag
[6] Paul Ekman, Wallace V Friesen, and Silvan S Tomkins. Facial affect
Petrovic, Andreas Roepstorff, William B McGregor, and Christo-
scoring technique: A first validity study. Semiotica, 3(1):37–58,
pher D Frith. When pinocchios nose does not grow: belief regarding
1971.
lie-detectability modulates production of deception. 2012.
[7] Paul Ekman and Erika L Rosenberg. What the face reveals: Basic
and applied studies of spontaneous expression using the Facial Ac- [21] Y-I Tian, Takeo Kanade, and Jeffrey F Cohn. Recognizing action
tion Coding System (FACS). Oxford University Press, USA, 1997. units for facial expression analysis. IEEE Transactions on pattern
analysis and machine intelligence, 23(2):97–115, 2001.
[8] Irfan A Essa and Alex P Pentland. Facial expression recognition
using a dynamic model and motion energy. In Computer Vision, [22] Panagiotis Tsiamyrtzis, Jonathan Dowdall, Dvijesh Shastri, Ioan-
1995. Proceedings., Fifth International Conference on, pages 360– nis T Pavlidis, MG Frank, and P Ekman. Imaging facial physiology
367. IEEE, 1995. for the detection of deceit. International Journal of Computer Vi-
sion, 71(2):197–214, 2007.
[9] Ms SS Ghatge and VV Dixit. Face recognition under varying illumi-
nation with local binary pattern. International Journal of Advanced [23] Yaser Yacoob and Larry Davis. Computing spatio-temporal repre-
Research in Electrical, Electronics and Instrumentation Engineer- sentations of human faces. In IEEE Computer Society Conference
ing, 2(2), 2013. On Computer Vision And Pattern Recognition, pages 70–70, 1993.