Professional Documents
Culture Documents
Assessing Personality Virtual Agent
Assessing Personality Virtual Agent
Assessing Personality Virtual Agent
1, JANUARY-MARCH 2016
Abstract—Personality can be assessed with standardized inventory questions with scaled responses such as “How extraverted is this
character?” or with open-ended questions assessing first impressions, such as “What personality does this character convey?” Little is
known about how the two methods compare to each other, and even less is known about their use in the personality assessment of
virtual agents. We tested what personality virtual agents conveyed through gesture alone when the agents were programmed to display
introversion versus extraversion (Experiment 1) and high versus low emotional stability (Experiment 2). In Experiment 1, both
measures indicated participants perceived the extraverted agent as extraverted, but the open-question technique highlighted the
perception of both agents as highly agreeable whereas the inventory indicated that the extraverted agents were also perceived as more
open to new experiences. In Experiment 2, participants perceived agents expressing high versus low emotional stability differently
depending on assessment style. With inventory questions, the agents differed on both emotional stability and agreeableness. With
the open-ended question, participants perceived the high stability agent as extraverted and the low stability agent as disagreeable.
Inventory and open-ended questions provide different information about what personality virtual agents convey and both may be useful
in agent development.
1 INTRODUCTION
examined scores for all five Big Five traits, as many traits are Similar effects may be present in the work on virtual
often intercorrelated (e.g., those who are low on emotional agents. Participants may be hesitant to ascribe personality
stability may also be lower on agreeableness) [30], [31]. to agents [43], or may use the presence of trait-specific ques-
Using the Big Five Inventory, we confirmed prior tions to infer how they should judge an agent’s behavior.
researchers’ results using the TIPI. We also suggest that This may be particularly relevant in cases where personality
using open-ended measures can provide designers with a tests are trimmed to just those trait questions relevant to a
tool that may offer additional information as to how manip- specific manipulation [22], [44].
ulations of agents’ expressive behavior may change user An additional issue with questionnaire methods is that
perception of agent personality. asking about specific personality traits may under-represent
the breadth of personality types observed. In the case of the
2 PERSONALITY AND PERSONALITY ASSESSMENT Big Five, many have argued for other personality dimen-
IN HUMANS AND VIRTUAL AGENTS sions, either replacing existing Big Five dimensions or add-
ing more dimensions currently thought to be overlooked
2.1 Assessing Personality [45], [46], [47]. The influence of the questions asked on the eli-
The Big Five model of personality has widespread accep- cited personality ratings may be exacerbated for virtual
tance in psychology [33], [34], determining human personal- agents, as virtual agents are often less dynamic than humans
ities along five dimensions: Openness, Conscientiousness, in movement and expression. An agent may display incon-
Extraversion, Agreeableness and Neuroticism (n.b. Neuroti- sistent cues for personality, the effects of which may be
cism is also called Emotional Stability, such that high neu- masked by standard questionnaires [16]. Likert scales allow
roticism is equivalent to low emotional stability and vice for neutral or neither agree nor disagree ratings, but these argu-
versa). Models of agent behavior have followed suit with ably do not carry the same connotation as is inconsistent. Like-
assessing personality along the dimensions used for wise, it is also possible that an observer may think an agent
humans [16], [18], [22], [24], [35]. lacks personality. The scales also do not allow additional
descriptions such as robotic or creepy. As virtual agents are
2.2 Open-Ended versus Forced Choice Question likely to be used in contexts in which observers may not nec-
Methods essarily be prone to making in-depth personality judgments,
Human and virtual agent personalities are typically methods for assessing agent personality that tap into initial
assessed using questionnaires that involve either answers to impressions may provide useful information about the traits
questions or ratings on behaviors, including adjective rating that observers find salient enough to comment upon without
scales. Popular tools are the Revised NEO Personality worrying that they may be biased by trait-specific questions.
Inventory [36], the BFI [32] and TIPI [27]. These have been Open-ended response methods are not a replacement for the
shown to be reliable for self-evaluation or evaluation by thorough personality inventory but rather can convey com-
friends or family, but these methods may not accurately plementary data that give those designing virtual agents a
capture the first impressions and reactions that are quickly fuller idea of how observers view specific agents, particu-
formed by human observers in more naturalistic and situ- larly during short interactions.
ated contexts with strangers [37]. An inventory’s exhaustive
scope is part of its strength: observers are forced to consider 2.3 Contrast Effects
personality facets that might not have occurred to them. But The ability to create multiple identical virtual agents who
doing so may underestimate the significance of the thoughts differ only in highly specific behavioral cues allows for
that spontaneously arise on their own. If you verbally asked effective assessment of observers’ perceptions of personal-
somebody about the personality of a chatty stranger who ity, using a within-subjects or between-subjects design.
was blinking more because of dry contact lenses, that per- Many studies use a within-subjects design where partici-
son might tell you that the stranger was an extravert who pants are shown multiple agents with different target per-
enjoyed engaging others; he or she may or may not mention sonality traits, including those that are meant to be the
anything related to the blinking behavior. However, if you opposites of each other (e.g. extraverted and introverted)
gave him or her a personality inventory that required rating [6], [8], [19], [22], [44], [48]. While this method maximizes
the blinking stranger’s emotional stability, the blinking may statistical power and is more logistically efficient, a within-
be retrospectively cast as a nervous tic. In other words, ask- subjects design may introduce unintended contrast effects.
ing specific questions may bias an observer’s response, Differences found between agents might be driven more by
especially because the attribution of personality often hap- the immediate contrast effects rather than a determination
pens quickly [1]. of personality made based on anything an agent was pro-
This kind of trait or intent attribution has been observed grammed to do. As many agents are ultimately intended to
in a variety of communicative domains, both verbal and be viewed individually, and not contrastively, assessing
nonverbal. For example, when asked about what um means personality in the absence of contrast effects using a
without priming, people reported that ums have something between-subjects design (i.e., showing only one agent to
to do with speech production trouble [38]. However, when each participant) would be more in line with the context in
asked to consider honesty, people reported that those begin- which the agent is ultimately used.
ning a turn with um were more likely to be lying and less Evaluations of personality often rely on the agreement
comfortable than those beginning a turn without an um [39], between judgments of either na€ıve or trained observers. A
[40], [41]. Biasing effects have also been found with tone of common methodology within literature exploring the rela-
voice [42]. tionship between nonverbal behavior and personality is to
Authorized licensed use limited to: ULAKBIM UASL - Bilkent University. Downloaded on December 07,2020 at 08:59:58 UTC from IEEE Xplore. Restrictions apply.
96 IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, VOL. 7, NO. 1, JANUARY-MARCH 2016
Authorized licensed use limited to: ULAKBIM UASL - Bilkent University. Downloaded on December 07,2020 at 08:59:58 UTC from IEEE Xplore. Restrictions apply.
LIU ET AL.: TWO TECHNIQUES FOR ASSESSING VIRTUAL AGENT PERSONALITY 97
TABLE 1 TABLE 2
Differences in Animations for Expansive and Compact Agents Differences in Animations for Shaky and Smooth Agents
Parameter Compact Agent Expansive Agent Parameter Shaky Agent Smooth Agent
Stroke Slower, smaller, Shorter, larger, further Stroke Smaller and cross in Outwards; no jerk
Scale closer to body. out from body. Scale front of body; jerks added so smooth
Stroke Gestures lower along Gestures higher in added to motion path trajectory.
Position the vertical axis, closer vertical axis, further Stroke Retract position is Hands raised and
to centerline. from the center line. Position held out low, to the more in front of the
Duration Longer gesture Faster gesture side of the body. body for relaxed
durations. durations. appearance and swing
Arm N/A Elbows move away Arm Elbow rotated inward. N/A
Swivel from body during Swivel
gestures. Gesture Sharper More rounded
Body Decreased motion in N/A Phase
Motion torso and lower body. Connection
Stance Shoulders lowered, Shoulders raised, Stance Collarbones up and Collarbones down
leaning back. leaning forward. further back. and slightly back.
3.2 Procedure
For the emotionally stable and unstable clips, the quality Using a between-subjects design to avoid carry-over effects,
of the movement was changed so that the high emotional the participants watched either a Compact Agent clip or an
stability variant (Smooth Agent) had a more relaxed stance Expansive Agent clip. They were then immediately asked
and produced smoother trajectories while the low emo- the open-ended question, “What personality does this animated
tional stability variant (Shaky Agent) had a more tense stance character convey to you?” and given the option of watching the
and jerky gestures. The specific manipulations used are clip over again before answering this question. They were
summarized in Table 2. then presented with a modified version of the Big Five Inven-
tory (BFI [32]), a 44-item Big Five assessment that uses 5-point
2.6 Current Study Likert scales (1 = Disagree Strongly, 5 = Agree Strongly). While
The current study examined whether virtual agents’ gesture the original BFI prefaced each item with “I am someone
and body movement alone conveyed the personalities of who. . .,” we replaced it with “If I had to guess, I would describe
extraversion and emotional stability, manipulated sepa- the character in the video as someone who. . ..”
rately, and whether the personality perceived varied
depending on the method of collecting personality impres- 3.3 Coding and Scoring
sions: via surveys (closed, forced-choice questions) or first Open-ended responses that only mentioned that the agent
impressions (open-ended question). The specific open- appeared unnatural or robotic were counted but were not
ended method used in this study provides more nuanced coded further. Those focused purely on the physical quali-
information about an agent’s perceived personality that can ties of the agent (“gesturing”) or ones that seemed not to be
be used in addition to standard personality questionnaires. a description of personality (“like a round ball”) were also
Critically, we used a between-subjects paradigm for greater not coded to prevent the blind coders from over-interpret-
verisimilitude (i.e., agents are often not meant to be used in ing descriptions that were likely not meant to be about per-
contrastive pairs so testing them in contrastive pairs may sonality. Descriptions that were longer than one word (or
cause unintended carryover effects) and tested whether phrase) were split into multiple descriptors. Only half of the
agents successfully conveyed an intended personality in participants listed two or more descriptors that all fell under
absence of specific questions that could prime them to see a a single personality factor; some participants used words
trait they would not ordinarily see without prompting. that fell under as many as four personality factors. Descrip-
tors were randomized and two blind coders matched them
3 EXPERIMENT 1: EXTRAVERSION to one of the 339 trait adjectives found in Goldberg’s [60]
Big Five factor structure. If a description was already listed
Using a between-subjects design, participants saw one of as a Goldberg adjective loading onto a particular dimension,
two physically identical agents who were either designed to then the factor it was associated with was counted as the
portray the physical mannerisms of someone who is intro- personality factor that the participant found most salient
verted (Compact Agent) or someone who is extraverted (e.g., “assertive” is associated with high extraversion).
(Expansive Agent). They first described the agent using an Because few word descriptions matched exact adjectives
open-ended question, and then they assessed the agent’s found in Goldberg’s published list, coders chose up to three
personality using a Big Five inventory. of the closest synonyms amongst the 339 trait adjectives. To
be included in the analysis, adjectives chosen by both coders
3.1 Participants had to fall under the same personality factor or they
Fifty-nine Amazon Mechanical Turk workers completed were excluded from analysis. For example, the descriptor
both the open-ended question and the personality inven- “arrogant” was kept because one coder chose “boastful”
tory. Compensation was $0.75 for a task that usually took and the other, “pompous” (both low agreeableness).
about 5-7 minutes. However, for the description “has leadership qualities,” one
Authorized licensed use limited to: ULAKBIM UASL - Bilkent University. Downloaded on December 07,2020 at 08:59:58 UTC from IEEE Xplore. Restrictions apply.
98 IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, VOL. 7, NO. 1, JANUARY-MARCH 2016
TABLE 3
Examples of Trait Coding by Raters
Examples of the coders’ judgments for descriptions provided by participants that were not already listed by Goldberg (1990). Traits
were included if the raters agreed and excluded if they did not.
coder chose “dominant” (high extraversion) and another high in extraversion (17 out of 32 descriptions, or
chose “cooperative” (high agreeableness). Such instances of 53.13 percent). For the Compact Agent, a plurality of par-
lack of agreement were excluded from the final analysis, as ticipants suggested that they viewed the agent as high in
we could not be sure what personality dimension the agreeableness (11 out of 25 descriptions, or 44 percent).
description was intended to tap into (if they were meant to Table 4 shows the raw count by personality dimension of
tap into a conventional Big Five trait at all); thus, no conven- the 57 total descriptors that the two coders agreed on.
tional statistical evaluation of inter-rater reliability was A Fisher’s Exact Test was conducted to see whether there
undertaken. Table 3 shows examples of what counted as was at least one personality trait that was overwhelmingly
agreement or disagreement in this analysis; descriptors that associated with a particular type of agent. While there was
had agreement were included while those that had dis- an obvious plurality of descriptions that favored labeling
agreement were excluded from analysis. the Expansive Agent as Highly Extraverted and a slight
The BFI was scored according to John et al. [32] with per- advantage for labeling the Compact Agent as Highly Agree-
sonality scores calculated for all five dimensions – openness, able (or, to a lesser extent, Highly Extraverted), the Fisher’s
conscientiousness, extraversion, agreeableness, and emo- Exact Test came close to but did not exceed an alpha of .05
tional stability (which they call neuroticism, i.e. high neuroti- (p ¼ .054). However, these results do seem to suggest that
cism is equivalent to low emotional stability). there was a slight preference for describing the Expansive
Agent in terms related to High Extraversion and the Com-
3.4 Results pact Agent in terms related to High Agreeableness.
The 59 participants provided 102 personality-related words
that were then coded using the Goldberg Adjective List. 3.4.2 Big Five Inventory
Nineteen of the 59 (32 percent) provided single-word
BFI ratings of the Compact and Expansive Agents were
answers while six provided descriptions with four or more
compared. Because the Big Five Inventory scores for each of
relevant words. The remainder of the participants gave
the personality factor scales were moderately correlated to
descriptions that counted for two or three words. However,
each other, we used five separate t-tests, with the Big Five
of the 102 personality-related words coded, only 57 words
traits as dependent variables and Agent Video as a
(55.9 percent) were agreed upon by coders to load onto the
between-subjects factor. Of the Big Five traits assessed by
same personality-related dimension.
Nine of the 59 participants provided descriptions that TABLE 4
did not refer to the agent’s personality in recognizable terms Frequencies of Trait Descriptions for Exp 1
(e.g. “like a round ball”), including four of the 27 Expansive
Agent participants (14.8 percent) and five of the 32 Compact Open Consci. Extra. Agree. Emo St.
Agent participants (15.7 percent). The agents were equally Hi Lo Hi Lo Hi Lo Hi Lo Hi Lo
likely to be described in terms that referred to potential per- Expansive 1 0 1 0 17
2 5 6 0 0
sonality traits. Compact 1 0 1 0 6 4 11 2 0 0
3.4.1 Open-Ended Question Frequencies of Goldberg (1990) adjectives and adjective-equivalents produced
by participants by factor. Modal personality trait described is indicated with
For the Expansive Agent, a plurality of participants used an asterisk. Open ¼ openness, Consci ¼ conscientiousness, Extra ¼ extraver-
descriptions that suggested that they viewed the agent as sion, Agree ¼ agreeableness, Emo St. ¼ emotional stability.
Authorized licensed use limited to: ULAKBIM UASL - Bilkent University. Downloaded on December 07,2020 at 08:59:58 UTC from IEEE Xplore. Restrictions apply.
LIU ET AL.: TWO TECHNIQUES FOR ASSESSING VIRTUAL AGENT PERSONALITY 99
TABLE 5
BFI Ratings for Agents in Exp 1
Authorized licensed use limited to: ULAKBIM UASL - Bilkent University. Downloaded on December 07,2020 at 08:59:58 UTC from IEEE Xplore. Restrictions apply.
100 IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, VOL. 7, NO. 1, JANUARY-MARCH 2016
Authorized licensed use limited to: ULAKBIM UASL - Bilkent University. Downloaded on December 07,2020 at 08:59:58 UTC from IEEE Xplore. Restrictions apply.
LIU ET AL.: TWO TECHNIQUES FOR ASSESSING VIRTUAL AGENT PERSONALITY 101
TABLE 7
BFI Ratings for Agents in Exp 2
Mean (and standard deviation) BFI scores for agents by Agent Video.
2. Of the eight items that comprise the BFI Neuroticism scale, only Fig. 4. Boxplot for the distribution of BFI Agreeableness by Agent Video.
two showed significant effect of the video clip viewed using a Bonfer- Again, the grey area denotes the range where many responses were
roni-corrected p ¼ .006: Emotionally Stable and Can Be Moody. likely to be neither agree nor disagree.
Authorized licensed use limited to: ULAKBIM UASL - Bilkent University. Downloaded on December 07,2020 at 08:59:58 UTC from IEEE Xplore. Restrictions apply.
102 IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, VOL. 7, NO. 1, JANUARY-MARCH 2016
less accurately – formed when given only one of these what traits the participants thought were the most rele-
channels [62]. vant and visible. It reflects a naturalistic process for the
It is critical for the successful implementation of agents ascription of personality, which is far more free-form and
across applications that they be perceived as coherent indi- unstructured than the structured querying of an inven-
viduals, such that their behavior may be interpreted using tory. Personality inventories, on the other hand, necessar-
the same social strategies employed in social interaction ily allow probing of multiple personality dimensions,
with humans [12]. As such, designers need two things: which can be an advantage when observers forget to
models of how personalities are expressed across various mention something they may have genuinely observed.
modalities, which may be borrowed wholesale from the Thoughtful choice and interpretation of the method (or
psychological literature, and also measures to test whether methods) of assessment are necessary, as different meth-
observers indeed perceive the intended personalities. We ods can provide very different information.
compared two methods for exploring how manipulations of Together, the inventory and open-ended methods pro-
expressive nonverbal style lead to changes in the judgment vide evidence that opposite gestures are not equivalent to
of personality: questionnairestyle personality inventory and opposite poles of personality dimensions. For example,
an openended question. We used a between-subjects tech- both open-ended responses and BFI ratings demonstrated
nique that purposefully avoided contrast and carry-over that performing the opposite of a certain behavior does not
effects. Many agents are presumably not intended for use as necessarily lead to an equivalent judgment of an opposing
contrastive pairs of opposite personalities, making it neces- personality trait. The agent that used wide gestures (Expan-
sary to see if virtual agent personality can be conveyed sive) was seen as at least somewhat extraverted, but the
when each agent is viewed in isolation. agent that used narrow gestures (Compact) was not seen as
Similar to the current study, previous work on the attri- being particularly introverted.
bution of Big Five personality dimensions to virtual agents Each method, when used in isolation, had advantages
has also shown a relationship between the attribution of and disadvantages. While more labor-intensive, an open-
emotional stability and the attribution of agreeableness by ended question allows a more detailed picture of an
observers to agents under certain conditions [18]. Other agent’s personality to emerge. Although the individual
previous studies, however, focused exclusively on a single items of a Big Five scale can be examined on a longer
target personality dimension, to the point of reducing per- assessment such as the 44-item BFI, this may not always
sonality surveys to just those scales measuring that dimen- be that informative. For example, in Experiment 2, the
sion [22], [43], [44]. This confirmatory bias is in conflict with differences between the Smooth and Shaky Agents’ emo-
the generally accepted conceptualization of personality as a tional stability scores were driven by ratings on two
unified whole across the traits. This study, as well as others items: Is emotionally stable, not easily upset and Can be
(e.g. [6]), demonstrate that manipulation along a single moody. These are somewhat vague (and sometimes dou-
dimension may not be perceived as such by users. Under- ble-barreled) descriptions, yet people would probably
standing how these unintended perceptions are produced approach someone with a short fuse differently than they
and whether they are reliable should be a focus of future would a worrier. That is, with just the BFI, we don’t
work. In the creation and control of task- or user-specific know which of the more precise behaviors dominated an
agents, these effects are critically important. observer’s interpretation. With the open-ended question,
Virtual agents can lend themselves to some descriptions we do know. A few participants described the Shaky
that might be less likely to be ascribed to humans, such as Agent as anxious or nervous, but more described the agent
robotic. Yet there is reason to believe that methods used with as mean, angry, short-tempered, and aggravated. We also
humans, such as the BFI, are appropriate for the assessment know more than the BFI allows. Agents were commonly
of virtual agent personality. People not only ascribe person- described as confused, for example, which was not consis-
ality traits to other people based on very little information, tently coded into any Big Five category. In other words,
they ascribe human behaviors to inanimate things. We can qualitative data and closer scrutiny of score distribution
observe this facility even in the least human of stimuli such and a scale’s individual items (when possible) can help
as Heider and Simmel’s silent, black-and-white shapes flesh out numbers that might otherwise gloss over impor-
“chasing” each other [63]. Although a personality-related tant distinctions or even tap into alternate interpretations
description was not always spontaneously offered, these of intended manipulations.
shapes were ascribed the intentionality of animate, if not Open-ended measures are helpful when an agent’s per-
human, objects. In our work with virtual agents, we found sonality ratings on an inventory largely fall into “neutral”
that many of the responses often mentioned some sort of territory, which is what we saw in Experiment 2, where the
goal-directed, intentional behavior, even if there was no Shaky Agent was seen as less emotionally stable, on aver-
description of personality or distinct personality traits per age, than the Smooth Agent (which replicates previous find-
se: the agents were not infrequently described as explaining, ings by Neff and colleagues [18], [22]) but the spread of the
asking, or instructing. Smooth Agent’s ratings were mostly contained within
In two experiments, we showed that the standardized neither agree nor disagree on the BFI’s Neuroticism items.
personality inventory and open-ended question methods While researchers can (and have) shown that there are
of assessing personality are complementary. Open-ended mean differences in personality ratings when observers
questions allow participants to express their perceptions view multiple contrastive agents, it is also useful to examine
without priming them on any particular dimension of the open-ended descriptions of agents viewed in isolation. This
Big Five. This method is useful for obtaining a sense of gives those who design agents a clearer idea of how their
Authorized licensed use limited to: ULAKBIM UASL - Bilkent University. Downloaded on December 07,2020 at 08:59:58 UTC from IEEE Xplore. Restrictions apply.
LIU ET AL.: TWO TECHNIQUES FOR ASSESSING VIRTUAL AGENT PERSONALITY 103
agents are seen in the event that their agents are not yet hit- ACKNOWLEDGMENTS
ting intended personality notes hard enough to register con-
The authors wish to thank their research assistants, Jasmine
sistently on standardized inventories.
Norman, Gabe Taylor, and Megan Kostecka. This work was
More work, however, will need to be done to understand
supported in part by a grant from the NSF (IIS-1115742).
why participants chose to mention certain personality
Results of Experiment 2 were presented at Intelligent Vir-
dimensions, particularly when those dimensions are not
tual Agents 2013 [65].
thought to be particularly visible in gesture (such as agree-
ableness): Was this because our participants were used to
talking about whether others are friendly or unfriendly, REFERENCES
because the gestures we used really tapped into actual ges- [1] N. Ambady, M. Hallahan, and R. Rosenthal, “On judging and
being judged accurately in zero-acquaintance situations,” J. Per-
tural correlates of agreeableness, or because another trait
sonality Social Psychol., vol. 69, no. 3, pp. 518–529, 1995.
such as emotional stability mediated judgments of agree- [2] M. J. Levesque and D. A. Kenny, “Accuracy of behavioral predic-
ableness (i.e., those who come off as angry were also tions at zero acquaintance: A social relations analysis,” J. Personal-
thought to be less friendly)? There are also cultural consid- ity Social Psychol., vol. 65, no. 6, pp. 1178–1187, 1993.
[3] L. Ross, “The intuitive psychologist and his shortcomings: Distor-
erations that are outside the scope of this current study that tions in the attribution process,” Adv. Experimental Social Psychol.,
should be investigated in future work. Other work shows vol. 10, pp. 173–220, 1977.
that the relatively new ability of TTS engines to produce voi- [4] L. K. Kammrath, D. R. Ames, and A. A. Scholer, “Keeping up
ces with dialectical variations leads to stereotypical attribu- impressions: Inferential rules for impression change across the big
five,” J. Experimental Social Psychol., vol. 43, no. 3, pp. 450–457,
tions to agents with those voices: for example, an agent that 2007.
speaks with a regional accent has a sense of humor, whereas [5] E. Campana, M. K. Tanenhaus, J. F. Allen, and R. W. Remington,
an agent that speaks with Received Pronunciation is more “Evaluating cognitive load in spoken language interfaces using a
dual-task paradigm,” presented at the 8th International Conf. Spo-
educated and trustworthy [11]. ken Language Processing, Jeju Island, Korea, 2004.
Future work should also involve the direct comparison of [6] M. McRorie, I. Sneddon, G. McKeown, E. Bevacqua, E. de Sevin,
the perception of personality traits expressed by agents of and C. Pelachaud, “Evaluation of four designed virtual agent
different genders as the current study did not examine gen- personalities,” IEEE Trans. Affective Comput., vol. 3, no. 3, pp. 311–
322, Jul–Sep. 2012.
der differences. Previous work for example has found task- [7] A. Tapus and M. J. Mataric, “Socially assistive robots: The link
specific effects for gender, where female agent voices are between personality, empathy, physiological signals, and task
viewed as more competent at prototypically female tasks performance,” in Proc. AAAI Spring Symp.: Emotion, Personality,
[14]. There may also be additional physical cues, such as Social Behavior, 2008, pp. 133–140.
[8] E. Bevacqua, E. de Sevin, C. Pelachaud, M. McRorie, and
clothing or physique, that could influence personality per- I. Sneddon, “Building credible agents: Behaviour influenced
ception when gestures are added [64]. Finally, though we by personality and emotional traits,” presented at the Kansei
find it likely that additional contextual, physical, and Engineering and Emotion Research, Paris, France, 2010.
[9] I. Damian, T. Baur, and E. Andre, “Investigating social cue-based
expressive cues will change interpretations of agent behav- interaction in digital learning games,” in Proc. 8th Int. Conf. Found.
ior, as they change interpretations of human behavior, our Digit. Games, 2013, http://www.fdg2013.org/program/work-
study can make no claim regarding how additional factors shops/papers.html
would influence personality perception. [10] I. Damian, B. Endrass, P. Huber, N. Bee, and E. Andre,
“Individualized agent interactions,” in Motion in Games, New
The Big Five and its attendant personality inventories York, NY, USA: Springer, 2011, pp. 15–26.
are, arguably, most powerful when putting individuals into [11] B. Krenn, S. Schreitter, F. Neubarth, and G. Sieber, “Social evalua-
analyzable categories in order to describe the average tion of artificial agents by language varieties,” in Proc. 12th Int.
Conf. Intell. Virtual Agents, 2012, pp. 377–389.
behavior of a group [34]. Nevertheless, such inventories [12] B. Reeves and C. Nass, “The media equation: How people treat
may be less effective when trying to create an agent that computers, television? New media like real people? Places,”
expresses a particular personality, as their scales and Comput. Math. Appl., vol. 33, no. 5, pp. 128–128, 1997.
individual items are designed with wording that is inten- [13] F. Mairesse and M. A. Walker, “Towards personality-based user
adaptation: Psychologically informed stylistic language gener-
tionally terse and efficient. Open-ended descriptions do ation,” User Model. User-Adapted Interaction, vol. 20, no. 3,
not have to be highly consistent to offer guidance on how pp. 227–278, 2010.
to design agents in a future iteration to more strongly [14] D. Kuchenbrandt, M. H€aring, J. Eichberg, F. Eyssel, and E. Andre,
“Keep an eye on the task! How gender typicality of tasks influence
convey a target personality trait. Even when agents dif-
human–robot interactions,” Int. J. Social Robot., vol. 6, no. 3,
fered on many items on a given scale, as the Expansive pp. 417–427, 2014.
and Compact Agents did on Extraversion, it can still be [15] K. Isbister and C. Nass, “Consistency of personality in interactive
useful to know that Expansive Agent was seen by some characters: Verbal cues, non-verbal cues, and user characteristics,”
Int. J. Human-Comput. Stud., vol. 53, no. 2, pp. 251–267, 2000.
as excited and outgoing, though also bossy, angry, and [16] F. Mairesse and M. A. Walker, “Controlling user perceptions of
never friendly. The Compact Agent elicited many descrip- linguistic style: Trainable generation of personality traits,” Com-
tions like calm, laid-back, nonchalant, relaxed, and easy- put. Linguistics, vol. 37, no. 3, pp. 455–488, 2011.
going. Though a more laborious process, this qualitative [17] A. J. Gill, C. Brockmann, and J. Oberlander, “Perceptions of align-
ment and personality in generated dialogue,” in Proc. 7th Int. Nat-
type of information can help in the creation of agents ural Language Gener. Conf., 2012, pp. 40–48.
with a cohesive suite of expressive and stylistic behaviors, [18] M. Neff, N. Toothman, R. Bowmani, J. E. Fox Tree and M. A.
based on user perception of prescribed personality Walker, “Don’t scratch! Self-adaptors reflect emotional stability,”
in Proc. 10th Int. Conf. Intell. Virtual Agents, 2011, pp. 398–411.
parameters. Accurately applying such personality models [19] N. Bee, C. Pollock, E. Andre, and M. Walker, “Bossy or wimpy:
and testing them with appropriate measures allows for Expressing social dominance by combining gaze and linguistic
the successful implementation of expressive agents with behaviors,” in Proc. 10th Int. Conf. Intell. Virtual Agents, 2010,
cohesive, believable personalities. pp. 265–271.
Authorized licensed use limited to: ULAKBIM UASL - Bilkent University. Downloaded on December 07,2020 at 08:59:58 UTC from IEEE Xplore. Restrictions apply.
104 IEEE TRANSACTIONS ON AFFECTIVE COMPUTING, VOL. 7, NO. 1, JANUARY-MARCH 2016
[20] S. Kopp, B. Krenn, S. Marcella, A. N. Marshall, C. Pelachaud, H. [43] K. Van den Bosch, A. Brandenburgh, T. J. Muller, and A. Heuvelink,
Priker, K. R. Th orisson, and H. Vilhjalmsson, “Towards a common “Characters with personality!” in Proc. 12th Int. Conf. Intell. Virtual
framework for multimodal generation: The behavior markup Agents, 2012, pp. 426–439.
language,” in Proc. 6th Int. Conf. Intell. Virtual Agents, 2006, [44] D. Arellano, J. Varona, F. J. Perales, N. Bee, K. Janowski, and E.
pp. 205–217. Andre, “Influence of head orientation in perception of personality
[21] A. Egges, S. Kshirsagar, and N. Magnenat-Thalmann, “Generic traits in virtual agents,” in Proc. 10th Int. Conf. Auton. Agents Multi-
personality and emotion simulation for conversational agents,” agent Syst.-Vol. 3, 2011, pp. 1093–1094.
Comput. Animation Virtual Worlds, vol. 15, no. 1, pp. 1–13, 2004. [45] S. V. Paunonen and D. N. Jackson, “What is beyond the big five?
[22] M. Neff, Y. Wang, R. Abbott, and M. Walker, “Evaluating the Plenty!” J. Personality, vol. 68, no. 5, pp. 821–835, 2000.
effect of gesture and language on personality perception in con- [46] M. C. Ashton, K. Lee, M. Perugini, P. Szarota, R. E. de Vries, L. Di
versational agents,” in Proc. 10th Int. Conf. Intell. Virtual Agents, Blas, K. Boies, and B. De Raad, “A six-factor structure of personal-
2010, pp. 222–235. ity-descriptive adjectives: Solutions from psycholexical studies in
[23] C. Pelachaud, “Studies on gesture expressivity for a virtual seven languages,” J. Personality Social Psychol., vol. 86, no. 2,
agent,” Speech Commun., vol. 51, no. 7, pp. 630–639, 2009. pp. 356–366, 2004.
[24] F. Mairesse, M. A. Walker, M. R. Mehl, and R. K. Moore, “Using [47] M. Zuckerman, D. M. Kuhlman, J. Joireman, P. Teta, and M. Kraft,
linguistic cues for the automatic recognition of personality in con- “A comparison of three structural models for personality: The big
versation and text,” J. Artif. Intell. Res., vol. 30, pp. 457–500, 2007. three, the big five, and the alternative five,” J. Personality Social
[25] A. Cafaro, H. H. Vilhjalmsson, T. Bickmore, D. Heylen, K. R. Psychol., vol. 65, no. 4, pp. 757–768, 1993.
J
ohannsd ottir, and G. S. Valgar+sson, “First impressions: Users’ [48] J. Kim, S. S. Kwak, and M. Kim, “Entertainment robot personality
judgments of virtual agents’ personality and interpersonal atti- design based on basic factors of motions: A case study with rolly,”
tude in first encounters,” in Proc. 12th Int. Conf. Intell. Virtual in Proc. 18th IEEE Int. Symp. Robot Human Interactive Commun.,
Agents, 2012, pp. 67–80. 2009, pp. 803–808.
[26] C. Faur, C. Clavel, S. Pesty, and J.-C. Martin, “PERSEED: A self- [49] P. E. Gallaher, “Individual differences in nonverbal behavior:
based model of personality for virtual agents inspired by socio- Dimensions of style,” J. Personality Social Psychol., vol. 63, no. 1,
cognitive theories,” in Proc. Humaine Association Conf. Affective pp. 133–145, 1992.
Comput. Intell. Interaction, 2013, pp. 467–472. [50] M. R. Mehl, J. W. Pennebaker, D. M. Crow, J. Dabbs, and J. H.
[27] S. D. Gosling, P. J. Rentfrow, and W. B. Swann, “A very brief mea- Price, “The electronically activated recorder (ear): A device for
sure of the big-five personality domains,” J. Res. Personality, sampling naturalistic daily activities and conversations,” Behav.
vol. 37, no. 6, pp. 504–528, 2003. Res. Meth., Instruments, Comput., vol. 33, no. 4, pp. 517–523, 2001.
[28] G. Saucier, “Mini-markers: A brief version of Goldberg’s unipolar [51] K. Lee, B. Ogunfowora, and M. C. Ashton, “Personality traits
big-five markers,” J. Personality Assessment, vol. 63, no. 3, pp. 506– beyond the big five: Are they within the hexaco space?” J. Person-
516, 1994. ality, vol. 73, no. 5, pp. 1437–1463, 2005.
[29] Z. Callejas, D. Griol, and R. L opez-C ozar, “A framework for the [52] R. M. Tobin, W. G. Graziano, E. J. Vanman, and L. G. Tassinary,
assessment of synthetic personalities according to user “Personality, emotional experience, and efforts to control
perception,” Int. J. Human-Comput. Stud., vol. 72, no. 7, pp. 567– emotions,” J. Personality Social Psychol., vol. 79, no. 4, pp. 656–669,
583, 2014. 2000.
[30] A. Panter, J. Tanaka, R. Hoyle, C. Halverson, G. Kohnstamm, and [53] R. E. Riggio and H. S. Friedman, “Impression formation: The role
R. Martin, “Structural models for multimode designs in personal- of expressive behavior,” J. Personality Social Psychol., vol. 50, no. 2,
ity and temperament research,” in The Developing Structure of Tem- pp. 421–427, 1986.
perament and Personality from Infancy to Adulthood. East Sussex, [54] R. Lippa, “The nonverbal display and judgment of extraversion,
U.K.: Psychology Press, 1994, pp. 111–138. masculinity, femininity, and gender diagnosticity: A lens model
[31] J. M. Digman, “The curious history of the five-factor model,” in analysis,” J. Res. Personality, vol. 32, no. 1, 1998, pp. 80–107.
The Five-Factor Model of Personality: Theoretical Perspectives. New [55] M. Argyle, Bodily Communication. Evanston, IL, USA: Routledge,
York, NY, US: Guilford Press, 1996, pp. 1–20. 2013.
[32] O. P. John, E. M. Donahue, and R. L. Kentle, “The big five inven- [56] A. Campbell and J. P. Rushton, “Bodily communication and per-
tory—versions 4a and 54,” Berkeley: University of California, sonality,” Brit. J. Soc. Clinical Psychol., vol. 17, no. 1, pp. 31–36,
Berkeley, Institute of Personality and Social Research, 1991. 1978.
[33] O. P. John and R. W. Robins, “Determinants of interjudge agree- [57] P. Feyereisen and J.-D. de Lannoy, Gestures and Speech: Psychologi-
ment on personality traits: The big five domains, observability, cal Investigations. Cambridge, U.K.: Cambridge Univ. Press, 1991.
evaluativeness, and the unique perspective of the self,” J. Personal- [58] P. H. Waxer, “Nonverbal cues for anxiety: An examination of
ity, vol. 61, no. 4, pp. 521–551, 1993. emotional leakage,” J. Abnormal Psychol., vol. 86, no. 3, pp. 306,
[34] B. H. LaFrance, A. D. Heisel, and M. J. Beatty, “Is there empirical 1977.
evidence for a nonverbal profile of extraversion?: A meta-analysis [59] D. C. Funder and K. M. Dobroth, “Differences between traits:
and critique of the literature,” Commun. Monographs, vol. 71, Properties associated with interjudge agreement,” J. Personality
nos. 38–48, 2004. Social Psychol., vol. 52, no. 2, pp. 409–418, 1987.
[35] J. Allbeck and N. Badler, “Toward representing agent behaviors [60] L. R. Goldberg, “An alternative ‘description of personality’: The
modified by personality and emotion,” Embodied Conversational big-five factor structure,” J. Personality Social Psychol., vol. 59,
Agents AAMAS, vol. 2, pp. 15–19, 2002. no. 6, pp. 1216–1229, 1990.
[36] P. T. Costa and R. R. McCrae, “‘Normal’ personality inventories in [61] J. M. Digman, “Personality structure: Emergence of the five-factor
clinical assessment: General requirements and the potential for model,” Annu. Rev. Psychol., vol. 41, no. 1, pp. 417–440, 1990.
using the neo personality inventory’: Reply,” Psychological Assess- [62] P. Ekman, W. V. Friesen, M. O’Sullivan, and K. Scherer, “Relative
ment, vol. 4, pp. 20–22, 1992. importance of face, body, and speech in judgments of personality
[37] L. Albright, A. I. Cohen, T. E. Malloy, T. Christ, and G. Bromgard, and affect,” J. Personality Social Psychol., vol. 38, no. 2, pp. 270–277,
“Judgments of communicative intent in conversation,” J. Exp. 1980.
Social Psychol., vol. 40, pp. 290–302, 2004. [63] F. Heider and M. Simmel, “An experimental study of apparent
[38] J. E. Fox Tree, “Folk notions of um and uh, you know and like,” behavior,” Am. J. Psychol., vol. 57, pp. 243–259, 1944.
Text Talk, vol. 27, no. 3, pp. 297–314, 2007. [64] L. P. Naumann, S. Vazire, P. J. Rentfrow, and S. D. Gosling,
[39] J. E. Fox Tree, “Interpreting pauses and ums at turn exchanges,” “Personality judgments based on physical appearance,” Personal-
Discourse Processes, vol. 34, no. 1, pp. 37–55, 2002. ity Social Psychol. Bull., vol. 35, p. 1661, 2009.
[40] L. A. Hosman, T. M. Huebner, and S. A. Siltanen, “The impact of [65] K. Liu, J. Tolins, J. E. Fox Tree, M. Walker, and M. Neff, “Judging
power-of-speech style, argument strength, and need for cognition IVA personality using an open-ended question,” in Proc. 13th Int.
on impression formation, cognitive responses, and persuasion,” J. Conf. Intell. Virtual Agents, 2013, pp. 396–405.
Language Social Psychol., vol. 21, pp. 361–379, 2002.
[41] H. H. Clark and J. E. Fox Tree, “Using uh and um in spontaneous
speaking,” Cognition, vol. 84, no. 1, pp. 73–111, 2002.
[42] G. A. Bryant and J. E. Fox Tree, “Recognizing verbal irony in spon-
taneous speech,” Metaphor Symbol, vol. 17, no. 2, pp. 99–119, 2002.
Authorized licensed use limited to: ULAKBIM UASL - Bilkent University. Downloaded on December 07,2020 at 08:59:58 UTC from IEEE Xplore. Restrictions apply.
LIU ET AL.: TWO TECHNIQUES FOR ASSESSING VIRTUAL AGENT PERSONALITY 105
Kris Liu received the BA degree in cognitive Michael Neff received the PhD degree from the
and linguistic sciences from Wellesley College University of Toronto and is also a certified laban
and an EdM in Higher (Post-Secondary) Edu- movement analyst. He is an associate professor
cation from Harvard University Graduate in computer science and cinema & digital media
School of Education. She is currently working at the University of California, Davis, where he
toward the PhD degree in cognitive psychology leads the Motion Lab, an interdisciplinary
at the University of California, Santa Cruz. Her research effort in character animation and
research examines how friends and strangers embodied input. His interests include character
converse, collaborate and make use of polite- animation tools, especially modeling expressive
ness strategies across different communicative movement, physics-based animation, gesture
modalities. She also looks at how humans per- and applying performing arts knowledge to ani-
ceive and interact with virtual agents. mation. He received the US National Science Foundation (NSF)
CAREER Award, the Alain Fournier Award for his dissertation, a Best
Paper Award from Intelligent Virtual Agents and the Isadora Duncan
Jackson Tolins received the BA degree in lin- Award for Visual Design. He also currently serves as the co-director for
guistics from the University of Minnesota and the the Program in Cinema and Digital Media.
MA degree in linguistics and cognitive science
from University of Colorado, Boulder. He is cur-
rently working toward the PhD degree in cognitive Marilyn A. Walker received the PhD degree in
psychology at the University of California, Santa 1993 in computer science from the University of
Cruz. His research focuses on exploring the role Pennsylvania. She is a professor of computer sci-
of language in social interaction. This includes the ence and computational media, and the head in
study of discourse markers and backchannels, the Natural Language and Dialogue Systems
the coordination of multiple perspectives through Lab, Baskin School of Engineering at University
language, the use of embodied and iconic multi- of California, Santa Cruz (UCSC). From 2003 to
modal communication practices, and entrainment and mimicry between 2009, she was a professor of computer science at
humans, as well as humans and virtual agents. the University of Sheffield, where she was a
Royal Society Wolfson Research Merit fellow,
recruited to the United Kingdom under Britain’s
Jean E. Fox Tree received the AB degree from “Brain Gain” program. From 1996 to 2003, she was a principal member
Harvard University in linguistics, the MSc degree of Research Staff in the Speech and Information Processing Lab at
from the University of Edinburgh in cognitive AT&T Bell Labs and AT&T Research. While she was at AT&T, she devel-
science, and the PhD degree from Stanford Uni- oped a new architecture for spoken dialogue systems and statistical
versity in psychology. She is a professor of psy- methods for dialogue management and generation. She has served the
chology at the University of California, Santa ACL community as an ACL program chair, reviewer for dozens of confer-
Cruz, where she directs the Spontaneous Com- ences, as a senior area chair, and organized many workshops. She was
munication Laboratory. A member of the Cogni- a member of the founding board for the North American ACL, serving to
tive Area of the Psychology Department, her set up and orchestrate its first conference between 1998 and 2001. She
specialty is psycholinguistics, with a focus on was also the program a co-chair and a local organizer of the Intelligent
spontaneous communication, both spoken and Virtual Agents conference held at UCSC in 2012. She has supervised 16
written. She studies the production and comprehension of discourse doctoral students and 30 undergraduate researchers. She has published
markers, quotation devices, prosody, and gestures, among other topics. more than 200 papers, and has 10 granted US patents. Her H-index, a
measure of research impact and excellence is currently 49.
Authorized licensed use limited to: ULAKBIM UASL - Bilkent University. Downloaded on December 07,2020 at 08:59:58 UTC from IEEE Xplore. Restrictions apply.