Professional Documents
Culture Documents
Zlatev, Sonessen & Konderak (2016) - Meaning, Mind and Communication
Zlatev, Sonessen & Konderak (2016) - Meaning, Mind and Communication
Meaning, Mind
and Communication
Explorations in Cognitive Semiotics
Meaning, Mind and Communication
Jordan Zlatev / Göran Sonesson / Piotr Konderak (eds.)
Carlos A. Pérez
Chapter 2
Mutual Enlightenment: A Phenomenological Interpretation of the
Embodied Simulation Hypothesis........................................................................................31
Joel Parthemore
Chapter 3
A Cognitive Semiotic Perspective on the Nature and Limitations of
Concepts and Conceptual Frameworks.............................................................................47
Morten Tønnessen
Chapter 4
Agency in Biosemiotics and Enactivism...........................................................................69
Lorraine McCune
Chapter 7
Meaning, Consciousness, and the Onset of Language..............................................127
6 Table of Contents
Mutsumi Imai
Chapter 8
The “Symbol Grounding Problem” Reinterpreted from the Perspective
of Language Acquisition........................................................................................................145
Keith E. Nelson
Chapter 9
Key Roles of Found Symbolic Objects in Hominin Physical and
Cultural Evolution: The Found Symbol Hypothesis..................................................161
Alessandra Chiera
Chapter 11
From Conversation to Language: An Evolutionary Sensory-Motor
Account ........................................................................................................................................189
Serena Nicchiarelli
Chapter 12
Protolanguage as Formulaic Communicaction............................................................199
Cornelia Müller
Chapter 13
From Mimesis to Meaning: A Systematics of Gestural Mimesis for
Concrete and Abstract Referential Gestures.................................................................211
Jamin Pelkey
Chapter 15
Symmetrical Reasoning in Language and Culture: On Ritual Knots and
Embodied Cognition................................................................................................................239
Table of Contents 7
Štěpán Pudlák
Chapter 16
Cognitive Semiotics of Mental Disorders, with Focus on Hallucinations........251
Gisela Bruche-Schulz
Chapter 17
Pictorial Responses and Projected Realities: On an Elicitation
Procedure and its Ramifications.........................................................................................261
Todd Oakley
Chapter 19
Deonstemic Modals in Legal Discourse: The Cognitive Semiotics of
Layered Actions.........................................................................................................................299
Vlado Sušac
Chapter 20
Commutation of Cognitive Source Domains as a Semiotic Tool for
Paradigmatic Analysis............................................................................................................317
Maíra Avelar
Chapter 21
The Emergence of Multimodal Metaphors in Brazilian
Political-electoral Debates.....................................................................................................331
Marco Bagli
Chapter 22
“A Light in the Darkness”: Making Sense of Spatial and Lightness
Perception....................................................................................................................................349
Francesco-Alessio Ursini
Chapter 24
Objects and Nouns: An Account of the Vision-Language Interface...................379
References....................................................................................................................................409
List of Contributors..................................................................................................................473
Index: Subject..............................................................................................................................477
Jordan Zlatev, Göran Sonesson & Piotr Konderak
Chapter 1
Introduction:
Cognitive Semiotics Comes of Age
0. Introduction
A decade since the appearance of the journal Cognitive Semiotics, the establishment
of the International Association for Cognitive Semiotics (IACS) in 2013, and two
successful international conferences, in Lund in 2014 and Lublin 2016, cognitive
semiotics can hardly be characterized as an “emerging” discipline anymore. It is
already here. Yet, we who are involved with this field are often pressed to answer:
what is it really?
The chapters in this volume, which originate from the IACS conference in Lund,
are a kind of extensional answer to this question: this is what cognitive semiotics is
like! The reader may note that the editors of the volume and authors of this intro-
duction – who have served as “founding fathers” or, less pretentiously, as the main
organizers of the first two international conferences – do not participate as authors
in this anthology. Rather, we have taken up the role of “mid-wives”, for a proper mix
of metaphors, of this important project, which in effect is the first published volume
of explorations in cognitive semiotics.
This discipline of cognitive semiotics can be described as the study of meaning,
mind and communication, as reflected by the title of this book. Admittedly, this is
a broad object of study, but as this volume aims to show, there is both an internal
coherence to, and an important mission for, cognitive semiotics: to help in “mending
the gap between science and the humanities”, in the words of Stephen Jay Gould
(2003). Crucially, this is to be done through mutual respect and methodological
understanding, rather than a reductionist takeover from the side of natural science,
or a postmodernist relativism from the side of the liberal arts.
In previous works, we have highlighted a number of features of the new dis-
cipline of cognitive semiotics (Sonesson 2009, 2012; Zlatev 2012, 2015). All of these
features do not have to be fulfilled, but together they define a prototype-kind of
structure; as we can see, the chapters in this volume conform to this characterization
by displaying at least the first two features, and in many cases more.
The first feature is that cognitive semiotics focuses on the study of meaning,
and does so through a trans-disciplinary (implying tighter contact than “interdis-
ciplinary”) combination of methods and concepts from at least semiotics, cognitive
science, and linguistics. As these fields are interdisciplinary themselves, this opens
the doors to a number of related fields such as anthropology (e.g. Pelkey, this vol-
ume), graphic reasoning (Coppin, this volume), cognitive development (McCune,
10 Jordan Zlatev, Göran Sonesson & Piotr Konderak
this volume), language acquisition (Imai, this volume), and political discourse analy-
sis (e.g. Avelar, this volume, Sušac, this volume).
The second feature is what we have referred to as the conceptual-empirical loop
(Zlatev 2012) or more adequately: spiral (Zlatev 2015). One particular variant of
this is what we have called “the dialectics of phenomenology and experiments”
(Sonesson 2013). The point is that cognitive semiotics takes a keen interest in
the analysis of concepts, not unlike philosophy, starting with proverbially am-
biguous notions such as meaning, sign, language, culture, and consciousness. But
to do so adequately, it is necessary to plunge into empirical studies where these
phenomena are studied through scientific (in the broad sense) methods, and then
re-emerge, reinvigorated, on the conceptual side. Conversely, empirical inves-
tigations, for example on the difficult issues concerning how language emerges
in evolution and development (discussed in Part 2), inevitably become involved
with conceptual issues, since “what”-questions constrain the answers to “how
and why”-questions. This also has the advantage of making it possible to specify
psychological experimental paradigms in order to answer specifically semiotic
questions, and of integrating experimental results into discussions within se-
miotics, phenomenology, and other variants of philosophy (see Sonesson 2013,
Pérez, this volume).
A third feature, more or less explicitly manifested in cognitive semiotic research,
is to proceed in the study of the phenomenon in question – be it children’s symbolic
play, the use of metaphors, symmetrical reasoning, hallucination etc. – by com-
bining methods using a first-person perspective (of the analyst or the participant),
a third-person perspective of detached observation and experimentation (allowing
quantification in many cases), united by a second-person perspective acknowledg-
ing that every form of scientific exploration is an act of communication, between
experimenter and participant, analyst and informant, author and reader etc. The
chapters by Parthemore, Bruche-Schulz and Pudlak discuss this procedure overtly,
but it is present in many of the other chapters, especially in those influenced by
phenomenology, which is in itself important enough to be given as a fourth feature
of cognitive semiotics.
The “scientific study of consciousness”, or more adequately “the study of human
experience and of the ways things present themselves to us in and through such
experience” (Sokolowski 2000: 2), phenomenology is the philosophical tradition
inaugurated by Edmund Husserl over a century ago, and continued by Maurice
Merleau-Ponty and others up to the present day (e.g. Zahavi and Gallagher 2008).
In cognitive semiotics it is especially useful for the study of subjectivity (e.g. per-
ception, affect, sense-making) from the “inside” in a manner that is intersubjective,
providing results that are, in a sense, objective (see Sonesson 2015a). It is also a very
useful tool for analysing the difference between signs and other meanings (see
Sonesson 1989, 2012). Further, as phenomenologists have been consistently anti-
reductionist while open to science, concepts and methods from phenomenology
have been very productive for researchers searching for “mutual enlightenment”
(Gallagher 1997, see Pérez, this volume) between lived experience and detached
Introduction 11
tween biology and meaning (see Sonesson & Zlatev 2009), while enactivism has been
highly influential for cognitive semiotics (see Zlatev 2012; Parthemore, this volume;
Collazos, this volume). Tønnessen shows that neither biosemiotics nor enactivism
are internally unified frameworks, with single coherent conceptual systems, and
both harbor different conceptions of subjectivity and agency. The notion of agency is
intrinsically related to the conception of action, and the understanding of what con-
stitutes action varies. Thanks to the variation in views, however, specific similarities
between different approaches in biosemiotics and enactivism can be identified. Tøn-
nessen discovers a clear affinity between the Uexküllian approach in biosemiotics,
and the “mind in life” approach of Thompson (2007). In particular, both theories
find agency and thus (minimal) meaning in even the simplest organisms such as
bacteria. At the same time, both are capable of distinguishing between this and
the experience of agency, which presupposes self-awareness, and is only found in
“higher” organisms (cf. Zlatev 2009a, 2009b).
As far as cognitive semiotics is concerned, Tønnessen suggests that, since biose-
miotics and enactivism are two of the most innovative and integrative contemporary
approaches to the nature of life and living systems, cognitive semiotics, as a field
devoted to the study of cognition, should look for its foundations in them. Or,
as he formulates his point rather bracingly: “cognitive semiotics should therefore
be conceived of as a subfield of biosemiotics”. A different conclusion, however,
might be that cognitive semiotics, being an even newer and (hopefully) even more
integrative approach than either biosemiotics or enactivism, may learn from the
strengths and weaknesses of both, as well from other theoretical resources such as
phenomenology and cognitive linguistics, in order to avoid using notions such as
agency, semiosis, and meaning too broadly and, when necessary, to introduce clear
subdivisions and “semiotic thresholds”.
In chapter 5, Juan Carlos Mendoza Collazos presents an overview of a new
approach to signification, known as agentive semiotics, that “links achievements of
logic, phenomenology and cognitive sciences” (Niño 2015). As the author states, this
clearly aligns this approach with cognitive semiotics. Similarly, agentive semiotics
is influenced by enactivism, though it takes a more specific stand on the notion of
agency, defining an agent as a being that is animate, situated, and capable of pay-
ing attention. This means that artifacts for example only have derived agency, that
is, a kind of agency that has been assigned by agents proper, which in the case of
artifacts means designers.2
The bulk of the chapter applies the theory of agentive semiotics precisely
to the semiotics of artifact design. Unlike traditional semiotic design analysis,
the agentive approach implies focus not on the artifacts themselves, but on acts
erence during the first two years of life (McCune 2008), relating it to concepts in
cognitive semiotics. Consistent with Sonesson (2007) for example, according to
McCune “all experience of meaning (even sensation) can be considered semiotic
(i.e. meaningful), but only some special kinds of meaning are signs”. Also, in line
with both Piaget (1962) and many scholars in cognitive semiotics, McCune regards
mental representation as the capacity for “contemplation beyond the here and now”,
and thus crucially dependent on consciousness.
McCune argues that semiotic development starts with relatively holistic ex-
periences of perception and movement, and proceeds largely through processes
of differentiation, between self and other, and between expression and content.
McCune documents five levels of representational play, aligning these with stages
in Piagetian theory, and thus charts the transition from pre-sign meanings to
mental representation and sign use. Yet, human language is predominantly vocal,
and McCune proposes a theory for “embodying symbols in the vocal medium”
consistent with the approach of Werner and Kaplan (1963). She extends this con-
ception by hypothesizing that laryngeal vocalizations (“grunts”) are of particular
importance, as they undergo a transition from automatic responses to “personal
symbols”. The author reviews a study showing how such grunts co-occur with
effort, attention and communicative use. The third crucial prerequisite for the
transition to referential speech, according to McCune, are vocal motor schemes,
deriving from babbling.
In chapter 8, the well-known cognitive scientist Mutsumi Imai addresses the so-
called “symbol grounding problem” (Harnad 1990), i.e. how to connect meaningless
“symbols” to experience, from the perspective of language acquisition by children.
While initially presented as a challenge for “symbolic AI”, the author argues that it
is no less damaging for “connectionist” models, as children cannot acquire a lexicon
by associating perceptual experiences with word-tokens. Rather, a realistic model
of language development needs to account for the following cognitive-semiotic
capacities and achievements: (a) understanding that words refer to concepts, (b)
finding the referent in a particular context, (c) finding the semantic domain to
which the word belongs, (d) generalizing the meaning of the word in the context
of previous linguistic experiences and (e) acquiring “an adult-like representation
of the domain as a whole”.
Reviewing and combining much of her own research on these topics, Imai first
shows that sound symbolism, i.e. non-arbitrary mappings between expression and
content, may serve as a powerful “bootstrapping mechanism” for (a) and (b). Still,
“children have to learn to infer the meaning of words when the help from sound
symbolism is not available, and they have to do that in relation to other words they
have already learned”, i.e. proceed to (c-e). To account for this, the author outlines a
model grounded in empirical studies, in which fast-mapping (grasping rough word
meanings) and slow mapping (further adjustments of meanings of words in relation
to other words) processes combine in constructing lexical systems. Finally, Imai pro-
poses a cognitive function that may be crucial for these achievements: a uniquely
human capacity for a “bidirectional reasoning bias”, related to Peircean abductive
16 Jordan Zlatev, Göran Sonesson & Piotr Konderak
and the audience draws inferences on the basis of “mindreading”. Such a model
of language evolution has actually been proposed and argued for in some detail
by Scott-Phillips (2014).
However, Ferretti and Adornetti contend that “models such as these suffer from a
serious difficulty: the exclusive attention paid to the speaker’s intentions leads one
to exclude the temporal dimension from discourse”. In the remainder of the chapter,
they propose a remedy by outlining a model where priority is given to events, and
discourse is organized as a temporal chain of events. Such a revision requires ad-
ditional cognitive abilities underlying discourse comprehension and communicative
skills, which the authors link to navigation in space in time, and specifically to the
proposal by Corballis (2011) that the uniquely human capacity for “mental time
travel” underlies the ability to construct and interpret narratives. Research on this
hypothesis is in progress, e.g. concerning schizophrenia, possibly supporting the
authors’ conclusion that “the metaphor of navigation we have assumed as a key
explanation of human narrative capacities is more than a simple metaphor”.
In chapter 11, Alessandra Chiera continues the theme from the previous
chapter: the development of a “holistic model of language evolution”. Like Ferretti
and Andornetti, she rejects bottom-up, computational approaches to language in
linguistics and cognitive science. The motivations for the critique and the proposed
solution are, however, somewhat different. The main problem of “modular” ap-
proaches in the spirit of Fodor (1983), according to Chiera, is that they imply rigid
boundaries between semantics (“code”) and pragmatics (“context”, “inference”).
The reason this is problematic is that empirical studies in language comprehension
and production strongly suggest that contextual, top-down information influences
language use in real time: i.e. that “speakers build a cumulative representation of
the global message conveyed by the conversation and that such a representation
immediately constrains production and comprehension processes.”
Consequently, the author seeks to ground the evolution of language primarily
in the contingencies of online conversation (more than the coherence of narrative).
In the next step, she asks where human capacities for perceiving socially relevant
contextual and conversational features derive from, and proposes that these are
essentially not a matter of “mind reading”, but of affordance perception: “in order
to extract relevant contextual information, individuals rely on the same processes
by which they interact with the physical environment, that is action and percep-
tion processes”. Such processes allow what she calls pragmatic alignment, in which
communicative actions are coordinated, and successful conversations emerge. The
advantage of such an account is that it allows successful communication “despite
fragmentary and ambiguous coding”, while at the same time providing the frame for
the gradual evolution of a conventional-normative system, in a dialectical manner.
The author explicitly applies the conceptual-empirical spiral of cognitive semiotics
in several cycles, offering original and productive definitions: “In this framework,
conversation can be defined as a dynamic interactive exchange, situated within a
jointly determined – and constantly evolving – semiotic system.”
18 Jordan Zlatev, Göran Sonesson & Piotr Konderak
The final chapter in this section (chapter 12) by Serena Nicchiarelli adds one
more brick to the (putative) bridge between the linguistic and cognitive skills of
the common ape-human ancestor and our own: a blend of practical, goal-directed
actions and communicative acts that Nicchiarelli terms “communic-action”. The
chapter starts with a discussion of the concept of protolanguage (a hypothetic
semiotic system that helps span the gap between ape communication and human
language), outlining the two major models that have been proposed: a lexical,
“grammarless” protolanguage (e.g. Bickerton 2010) and a holistic protolanguage
with utterances expressing complex, but non-compositional, communicative acts
(e.g. Arbib 2005a). Nicchiarelli defends the latter view, presenting her specific
communic-action hypothesis, according to which protolanguage evolved from the
combination of (conventionalized) pantomimes and “vocal gestures”. This is con-
sistent with Arbib’s (2012) evolutionary model, but the author differs by emphasiz-
ing the pragmatic dimension of such formulaic utterances as performatives (Austin
1962) and facilitators of social interaction and online conversations.
Evidence for the hypothesis is twofold. First, by building on work by Wray
(2002), Nicchiarelli argues that the communic-action strategy serves today as a
“living fossil” from the holistic protolanguage stage. Second, the chapter reviews
neurolinguistic evidence that formulaic language, along with discourse and non-
verbal communication, is dependent on the basal ganglia and subcortical pre-
frontal circuitry in the right hemisphere – areas responsible for activities such
as planning, evaluating cues, acting in an environment. As a capstone of the
chapter, Nicchiarelli points out an intriguing dissociation: it is precisely these
areas that are damaged and these capacities, including formulaic language use,
that are impaired in Parkinson’s disease. On the other hand, formulaic language
and motor control are preserved in Alzheimer’s disease, while higher functions
such as coherence and navigation are damaged, as noted in chapter 10. If so, then
it appears that the cognitive-semiotic capacity targeted by this chapter (perhaps
along with the alignment mechanisms studied by Chiara) is more basic, and evol-
utionarily more ancient, than the one explored by Ferretti and Adornetti. This is
just one indication of how the ideas in the chapters of this section could possibly
be combined in a more synthetic account.
of a digital web app prototype in which participants are to observe and retrace
intertwining patterns, and subsequently are tested for “cognitive benefits”. While
pointing out that this is still ongoing research, Pelkey thus manages a bold foray
into answering the riddle of symmetrical design and thought, thereby showing the
potential of cognitive semiotics.
Štěpán Pudlák addresses in chapter 16 another, no smaller “mystery”: the
nature of mental disorders, with a focus on schizophrenia and more specifically
on hallucinations. He shows how these have been misrepresented in psychiatric
theories either as “misinterpretations of the inner speech” or as “especially vivid
images”, since such descriptions do not get at the core of their phenomenology:
they are experienced as indexical (directed to their object as something contiguous
in space and time) in the same manner as sensory perceptions. Using Peircean ter-
minology, he explains that the hallucinatory nature of, for example, the utterance
“There is a unicorn on the porch” does not have to do with the semantic content of
the dicisign or proposition (see Stjernfelt 2014), or its mode (verbal or visual), but
with the experience of indexicality: the (immediate) Object of the sign (the “utterer”
of the sentence, or the “vision” of the unicorn on the porch) is indistinguishable
from objects of perception in schizophrenic first-person experience.
Such an analysis is obviously very close to one that can be provided by phenom-
enology, where the indexicality in question corresponds to the most fundamental
kind of intentionality: the directedness of perception (cf. Sokolowski 2000). Accord-
ingly, Pudlak finds resonances between his Peircean analysis and a study of mental
illness by Gallagher (1997), but is critical toward the latter’s claim that hallucinations
may be attributed to failure in distinguishing between self and other. Still, to dis-
tinguish between a hallucination and a real worldly object, Pudlak uses the Peircean
division between Immediate and Dynamic object: the hallucinated unicorn may only
be the first (as it can be subjectively experienced) but not the latter, as it does not
exist in intersubjective reality. Failures in indexicality may thus be related to failures
in intersubjectivity. In any case, Pudlak’s chapter is another excellent illustration
of the integrative potential of cognitive semiotics to unlock (if not yet fully open)
the doors to long-standing mysteries.
In chapter 17, Gisela Bruche-Schulz analyzes the “pictorial responses” (see
below) produced by student-participants from five different cultures and languages
(English, Chinese, German, Russian and Turkish) when presented with a page of
Saint-Exupery’s Le Petit Prince, translated into their own language, and asked to
“jot down whatever comes to mind” in the margins. All five groups produced ap-
proximately as many responses, but interestingly, the “English group” (actually
Cantonese-English bilinguals living in Hong Kong) produced between 4 to 10 times
more pictorial responses than the other groups. However, the author is less inter-
ested in accounting for any (cultural) differences between the groups than in finding
common (experiential) structures. This is in part due to previous research (Bruche-
Schulz 2014), which showed that all the responses were distributed into text seg-
ments in a comparable fashion across the five groups, with peaks and valleys that
correlated with the content of the text, analyzed in terms of force gestalt patterns.
22 Jordan Zlatev, Göran Sonesson & Piotr Konderak
5. Concluding remarks
The reader that has had the patience to read through these summaries – on which
we have labored extensively – should be able to appreciate both the richness
of cognitive semiotics, and to notice challenges and potential disagreements.
For example, the notion of (mental) simulation is both criticized (chapter 2) and
28 Jordan Zlatev, Göran Sonesson & Piotr Konderak
utilized (chapters 18, 25). There are also differing interpretations of the notion of
mimesis (chapter 13 vs. 23). A possible criticism (and one which we have met at
conferences) of cognitive semiotics is the lack of sufficient internal coherence
(cf. the analysis of agency in biosemiotics in chapter 4). Our reply is, however,
that such a criticism would misunderstand the difference between a discipline
and a theory. Cognitive semiotics is the former and not the latter. While it would
indeed (most likely) be incoherent if a single theory used both a structural and a
Peircean sign concept, or a given theoretical term with different meanings, we
cannot and should not expect different theories in cognitive semiotics to always
agree. That would actually be stifling and “block the way of inquiry” (to quote the
famous Peircean principle on what is not to be done).
What we can and do expect is that disagreements are acknowledged, and dealt
with the necessary respect, asking questions such as: are the issues conceptual or
empirical? If the former, can the concepts be calibrated? If the latter, what kind of
evidence can be adduced to make an informed decision between the alternatives?
In sum, cognitive semiotics has created a platform for asking such questions –
with respect to what is arguably most central to our lives, and to the future of life
on our planet: meaning. With this we step back, and urge the reader and possible
beginner to cognitive semiotics to jump in, and participate in the discussion!
Part I.
Metatheoretical Perspectives
Carlos A. Pérez
Chapter 2
Mutual Enlightenment:
A Phenomenological Interpretation of the
Embodied Simulation Hypothesis
enology of Merleau-Ponty, who is regarded as “one of the few whose work was
committed to an exploration of the fundamental entre-deux between science and
experience, experience and world” (1991: 15).1
For Varela, Thompson and Rosch, it is only possible to understand the embodied
nature of human cognition and the situated character of the enactive engagement
with the environment if one takes into account the phenomenological findings
concerning the structure of human experience. And this is one of the points that
differentiates their work from that of George Lakoff and Mark Johnson who also
called attention to the role of the body in the constitution of human thinking. In
their most recent joint publication, Lakoff and Johnson (1999) declare that “the
peculiar nature of our bodies shapes our very possibilities for conceptualization
and categorization” (ibid: 19), an idea originally suggested in their famous book,
Metaphors We Live By (Lakoff and Johnson 1980) and developed later in Lakoff’s
Women Fire and Dangerous Things (Lakoff 1987) and Johnson’s The Body in the
Mind (Johnson 1987). Far from phenomenology, they build up their defense mostly
through linguistic analyses pointing to the fact that (a) the human conceptual reper-
toire is developed metaphorically, and (b) the grounding of our conceptual system,
which gives human thinking its definite structure, is realized by image schemas that
emerge from sensorimotor interactions with the environment.
Without ruling out consciousness from the cognitive science domain, neither
Lakoff nor Johnson take phenomenology as a part of their theoretical frame. Lakoff
has been recently concerned with linguistic analysis and its neurological ground-
ing (Dodge and Lakoff 2005), and Johnson has adopted a pragmatist philosophical
position, influenced mostly by John Dewey (Johnson 2007), stepping aside from
phenomenology and giving more importance to empirical findings in neurology
and cognitive psychology.2 This stance has been critiqued from the perspective of
cognitive semiotics (Sonesson 2007; Zlatev 2010), for reasons such as those discussed
in the rest of this chapter.
1 At the time, the authors were not familiar with Husserl’s later phenomenology,
and therefore expressed criticism: “The irony of Husserl’s procedure, then, is that
although he claimed to be turning philosophy toward a direct facing of experience,
he was actually ignoring both the consensual aspect and the direct embodied aspect
of experience. (In this Husserl followed Descartes: he called his phenomenology a
twentieth-century Cartesianism.)” (Varela et al., 1991: 17). Their attitude to Husserl
within enactivism, nonetheless, changed with the years, and finally turned into a
positive one (Thompson 2007).
2 Regarding phenomenology, Johnson declares: “We must keep in mind that phenom-
enological analysis alone is never enough, because image schemas typically operate
beneath the level of conscious awareness. That is why we must go beyond phenom-
enology to employ standard explanatory methods of linguistics, psychology, and
neuroscience that allow us to probe structures within our unconscious thought
processes.” (Johnson 2005: 21)
Phenomenological Interpretation of the Embodied Simulation 33
3 Against, for example, Dennett (2001), who claims: “First-person science of conscious-
ness is a discipline with no methods, no data, no results, no future, no promise. It
will remain a fantasy.”
34 Carlos A. Pérez
must decide as quickly as possible if it was mentioned or not. The key manipulation
was that, along with pictures of objects not mentioned in the sentence (an elephant,
for example), the screen also showed the picture of a nail, sometimes horizontally
oriented, and sometimes vertically oriented. The results showed that “people were
faster to answer correctly when the orientation of the object implied by the sentence
matched the orientation of the picture” (Bergen 2012: 54). That is, participants took
more time to identify the match between the linguistic expression and the picture
when the orientation of the nail was not the one implied by the sentence. So, Bergen
argues, “a very reasonable explanation is that when people read sentences, they
construct visually detailed simulations of the objects that are mentioned” (ibid:
54). He describes simulation mostly in terms of brain activity; it is our brain that
simulates, using the same areas involved in direct perception. Although Bergen
recognizes that simulations can sometimes be conscious, as in the case of mental
imagery, he claims that “many of the same brain processes are engaged, invisibly
and unbeknownst to you” when consciousness is not involved (ibid: 14). Since
“invisible” means inaccessible to consciousness, embodied simulation would have
nothing to do with first-person methodologies, and meaning would have nothing
to do with consciousness. But this is, precisely, the opposite of the position held by
phenomenology: meaning is potentially conscious meaning, and must be elucidated
by means of phenomenological analysis.
In fact, returning to our question about the relationship between phenomenology
and cognitive semiotics, Sonesson (2009) insistently remarks on the intimate link
between meaning and consciousness, not only for experimental design, but for
theoretical precision: “Without the elucidation offered by the phenomenological
method, semiotics and cognitive science risk indeed to end up forming an eclectic
patchwork.” (ibid: 28) and “semiotics takes meaning as its perspective on the world.
What this means, no doubt, will remain somewhat obscure until meaning has been
phenomenologically elucidated.” (ibid: 35). Thus, following Sonesson, the main task
for a cognitive approach to semiotics is to account for the way in which signs and
meanings are accessible to consciousness, as the task for phenomenology is pre-
cisely to understand the ways in which consciousness relates to meaning.
example, (…) a large, voracious cat, having already devoured the curtains, is now eating
the mat. Maybe the cat is a tiger in a cartoon, who has just lost a boxing match and
is lying unconscious on the canvas. Or suppose we are using a light-colored mat as a
makeshift screen for a slide show. To find where to place the projector and how to aim
it, you put in a slide with the image of a cat. When the projector is finally positioned
properly, I can let you know by saying OK, the cat is on the mat.
The sentence can take different meanings in different contexts, but in each case, the
utterer knows exactly what meaning she has conferred to the expression, and, if
communication is successful, so does the listener. In Husserl’s account, meaning is
linked to the act, not to the sentence, and referring is an activity carried out by the
subject in intentional acts, and not a property of signs themselves.
But this is only the first part of Husserl’s account. Crucial for my discussion is
the link between meaning-bestowing acts and meaning-fulfilling acts:
If we leave aside the sensuous acts in which the expression, qua mere sound of words,
makes its appearance, we shall have to distinguish between two acts or sets of acts. We
shall, on the one hand, have acts essential to the expression if it is to be an expression
at all, i.e. a verbal sound infused with sense. These acts we shall call the meaning-
conferring acts or the meaning-intentions. But we shall, on the other hand, have acts,
not essential to the expression as such, which stand in the logical relation of fulfilling
(confirming, illustrating) it more or less adequately, and so actualizing its relation to its
object. These acts, which become fused with the meaning-conferring acts in the unity of
knowledge or fulfillment, we call the meaning-fulfilling acts. (Husserl 2001 [1900]: 192)
When I say there is a hamster in the room next door, I express a meaning-intention,
according to which there is a furry little rodent in the room next door. Since the
object is not directly given, the meaning-intention is described by Husserl as an
empty intention, as is the case for any act of intending to use a sign – a signitive
intention – to direct consciousness to an object or state of affairs. But I can go to
the room next door and see the hamster; this perceptual act would, then, fulfill the
previous meaning intention.
Of course, meaning-fulfilling acts are not necessary to confer meaning to a sen-
tence. It is possible for a meaning-conferring act never to be fulfilled. The perceptual
act is not a part of the signitive, empty intention, but – only – its fulfillment, and
the fulfilling act is not the simple, direct presence of the perceptual situation, but
“the experience of coincidence between an empty intention and its fulfilling object”
(Moran 2012: 131). What Husserl is trying to show is that the meaning-conferring
act would remain phenomenologically unintelligible without taking into account
its relation to the meaning-fulfilling act.
Now, although secondary and “not essential”, meaning-fulfilling acts are important
for understanding the relationship between perception and linguistic meaning. Fol-
lowing Husserl, perceptual acts must be regarded as the confirmation of the meaning
intentions expressed through linguistic signs in speech acts. But, conversely, Husserl
himself claims that the content of an expression is the content of the act that fulfils it.
Phenomenological Interpretation of the Embodied Simulation 37
The word ‘expression’ is normally understood – wherever, that is, we do not speak of a
‘mere’ expression – as sense-animated expression. One should not, therefore, properly
say (as one often does) that an expression expresses its meaning (its intention). One
might more properly adopt the alternative way of speaking according to which the
fulfilling act appears as the act expressed by the complete expression: we may, e.g., say
that a statement ‘gives expression’ to an act of perceiving or imagining. (Husserl 2001
[1900]: 192)
If this is the case, (a) the specific structure of speech acts can be described indepen-
dently from its fulfillment in the act of perception (the fulfilling act is not essential
to the expression), but (b) in order to fulfill a meaning intention, perceptual meaning
must be related to the semantic characterization of speech acts. The problematic
relation between speech and perception is further addressed by Husserl at the be-
ginning of the 6th Logical Investigation:
I have just looked out into the garden and now give expression to my percept in the
words: ‘There flies a black bird!’ What is here the act in which my meaning resides? I
think we may say (…) that it does not reside in perception, at least not in perception
alone. It seems plain that we cannot describe the situation before us as if there were
nothing else in it – apart from the sound of the words – which decides the meaningful-
ness of the expression, but the percept to which it attaches. For we could base different
statements on the same percept to which it attaches, and thereby unfold quite different
senses. I could, e.g., have remarked: ‘That is black!’, ‘That is a black bird!’, ‘There flies
that black bird!’, ‘There it soars!’, and so forth. (Husserl 2001 [1900]: 195)
A perceptual scene, by itself, doesn’t mean anything. In simple perception, the ob-
jects are given in a direct, immediate, temporal manner. On the other hand, to see
that a state of affairs obtains, is to capture it in a perceptual judgment. A meaningful
perceptual act is, thus, founded in the sensuous content of perception, but is not
derived from it. Rather, epistemic perception, that is, a perceptual act conceptually
and propositionally framed, belongs to a different realm of intentional acts.
For example, the table in front of me is captured from a particular point of view,
and my perspective on the table changes with every movement I make. Nonetheless,
the object picked out through a meaning in what Husserl calls a nominal act, that
is, the object seen as a table, or the state of affairs intended by the perceptual judg-
ment This table is black, is one and the same, independently of my particular point
of view. We can say, thus, that one thing is to see x and another thing is to see that
x; one thing is to – simply – see the perceptual thing, and another one is to see that
this table is black. That perception, in this sense, has epistemic value, is clear when
we realize that I can be right or wrong about what I see or judge to see: maybe it is
not a table, but a really big chair, or maybe the table is not black, but brown, and, in
both cases, I would be mistaken. In any case, the epistemic value of the perceptual
act is independent of the sensuous perceptual act.
Intuition may indeed be allowed to contribute to the meaning of a perceptual statement,
but only in the sense that the meaning could not acquire a determinate relation to the
38 Carlos A. Pérez
object it means without some intuitive aid. But this does not imply that the intuitive
act is itself a carrier of meaning, or that it really makes contributions to this meaning.
(2001 [1900]: 196)
Perception as such, according to Husserl in the Logical Investigations, is deprived
of meaning. In order to be saturated with meaning, perception must ground (and
be accompanied by) another kind of act, either a propositional one (whose object
would be a state of affairs), or a nominal one (whose object would be a meaningful
presentation of an object).
we perceive the object as a whole; when we move ourselves, the previously hidden
profiles may appear, and there must be a temporal continuity between the intuited
profile and the other one that shows itself in correlation with the movements of
my own body. Such a thing is not possible for a two-dimensional representation
of an object.
However, let’s assume, for the sake of our discussion, that image-consciousness
can fulfill the meaning-conferring act linked with the verbal expression previously
shown. If this is the case, the verbal expression and the image are semantically
related. In the specific scenario designed in the experiment, what is shown on the
screen is not the image linked with the state of affairs that would fulfill the verbal
meaning-intention, but an image whose intentional object is picked out by a pre-
viously read sentence. The important issue here is that the distinctive orientation
of the nail is not constitutive of the nominal act related to the word, something that
is presupposed and accepted by the experiment, since incorrect responses were
dropped from the analyses (Stainfield and Zwaan 2001: 155). The relationship be-
tween conceptual meaning and its objects is one and the same, and is not affected
by differences in the object. The experiment only measures the reaction time, which
is something that falls outside the limits of Logical Investigations. As we will see,
time is the central problem in the genetic analysis developed by Husserl around
1920, and it is in this respect that the phenomenological approach to the experiment
may be illuminating.
The absent sides of the object are, for Husserl, presentifications. As we saw
earlier, there are different types of presentifications. Both memory and fantasy
make present to consciousness an absent object, the former a previously perceived
one, the latter an imagined one. But in perception, what is presentified is not an
object, but a profile, which is intended in the form of a retention or a protention.
When I (now) see the spine of the book, the previously present profile –the just-seen
one – remains present (although not directly given) in my retentional consciousness.
Correspondingly, the present, given profile, points to an upcoming profile, not yet
perceived but expected in my protentional consciousness. When I move my head
and the spine of the book shows itself, the protention is fulfilled, giving place to a
new protention and a new retention.
Primal impression, retention and protention configure an internal system of
referential implications that is dependent upon a system of lived body movements:
I can move my head and see the spine, I can manipulate the book and turn it around,
etc. The appearances are kinaesthetically motivated, and there is a correlation be-
tween the series of kinaesthetic sensations by which I’m aware of my corporal
movement:
Only as dependent upon kinaestheses can [the appearances] continually pass into one
another and constitute a unity of one sense. Only by running their course in these ways
do they unfold their intentional indicators. Only through this interplay of independent
and dependent variables is what appears constituted as a transcendent perceptual ob-
ject. (Husserl 2001: 52)
The temporal horizon that merges in the dynamic correlation between adum-
brations and kinaestheses is organized in and through perceptual schemata (Welton
2000: 332), which are entirely different from the conceptual meaning instantiated in
meaning-conferring acts. First, conceptual meaning can subsume different objects
independently of the differences between them. All the books in my library are,
precisely, books. Perceptual schemata, on the other hand, change along with the
object. Having in front of me three different books, my old edition of the Cartesian
Meditations, the 1550 pages of Tolstoy’s War and Peace, and my little son’s favorite,
Lost and Found, the perceptual sense of each object is not the same. Second, per-
ceptual schemata are supported by bodily knowledge, while conceptual meaning has
a referential function (whether something is the case or not) that has no relationship
with bodily-kinesthetic skills. The practical involvement with the world is possible
because of the mastering of my bodily movement. Third, linguistic meanings are
inherently intersubjective (Zlatev 2010), while perceptual schemata are subjective
and embodied.
Still, the question remains: how does the conceptual layer of meaning relate to
the perceptual, practical dimension of sense making? And this is precisely the ques-
tion that lies beneath Stanfield and Zwaan’s experiment, as I show in the following
section.
Phenomenological Interpretation of the Embodied Simulation 43
10 Although not spelled out by Husserl himself (he wrote only about far retentions)
the distinction between near and far protentions has been addressed by his com-
mentators (Rodemeyer 2006; De Warren 2009; Lohmar 2002)
44 Carlos A. Pérez
the sense of “protentional expectations” projected into the remote future, which are
“pre-given” as horizons on the outer limit of the living present, lurking and waiting
to be discovered.
Like Lohmar, De Warren is trying to account for the temporal dimension of percep-
tual experience. But again, this characterization may be extended to speech acts as
well: I anticipate in far protention the perceptual appearance of a situation. The situ-
ation appears as something that is about to happen, an appearance co-determined by
my practical and embodied engagement with the world. And this holds even when
the intended object is a past event, since it is possible to have in protention a past
or an imagined event. Both fantasy and memory are lived experiences, and, thus,
can be described according to the retention-protention structure. But protention
is not fantasy, nor memory, but a constitutive feature of temporal consciousness.
In other words, neither retention nor protention are intentional objectifying acts.
Rather, protention is an associative connection which arises passively, and not an
active anticipation of an event directed to the future, as in a prophecy. When I hear
my brother’s utterance, I’m passively associating my current ongoing situation with
the situation of the dog lying on the rug.
Returning to the experiment, the sentence The carpenter hammered the nail into
the wall, can be fulfilled, for example, either by a carpenter standing upright and
hammering a nail into the side a wall (Figure 1, A), or by a carpenter hammering a
nail into the top of a wall (Figure 1, B). That is, it can be fulfilled in epistemic per-
ception. In each case, the orientation of the nail is different, but this is irrelevant
for meaning-fulfilling acts.
However, when reading the sentence, I anticipate in protention the perceptual sense
of an intended state of affairs. This means that I anticipate a particular, subjective,
perceptual scene organized in terms of perceptual schemata, based on my own past
experiences. As stated above, past experiences do not disappear into nothingness; the
repetition of certain experiences becomes part of me through habitualization. This is
why, when I interpret the sentence The carpenter hammered the nail into the wall, I pro-
Phenomenological Interpretation of the Embodied Simulation 45
tend something like Figure 1A more than 1B. Such a phenomenological analysis can
account for the results of the experiment without assuming any kind of simulation.
3. Conclusions
Husserl’s account of meaning is rich and complex. My aim in this chapter was a
modest one: to illustrate, with a particular example, the way in which phenom-
enology can clarify the results of empirical cognitive research. In doing so, I high-
lighted the richness of the phenomenological description of the temporal dimension
of human experience. Besides, I hope to have shown the fertility of addressing the
question of the relationship between linguistic meaning and perceptual sense, and
for the evaluation of “embodied simulation hypothesis”.
In summary, to claim that meaning is something that you construct in your
mind, based on your own experiences, is phenomenologically misleading. Con-
ceptual meaning is accessible to consciousness through conscious acts, but it has
an intersubjective value. What you construct, based on your own experiences, is a
system of practical implications that shapes your involvement with the world. The
phenomenological description of such a system must take into account the insights
provided by Husserl regarding time-consciousness. And finally, it must be clear that
one of the most challenging questions for cognitive semiotics is the one about the
relationship between perceptual sense and conceptual meaning.
Acknowledgements
I would like to thank Catalina Palomino for her insightful observations. I would
also like to acknowledge the two anonymous reviewers for their careful reading of
the chapter and their helpful comments
Joel Parthemore
Chapter 3
A Cognitive Semiotic Perspective on the
Nature and Limitations of Concepts and
Conceptual Frameworks
1 Substance dualists are notoriously thin on the ground, but physicist Henry Stapp
(2007) is one, though his “quantum” interactive substance dualism is meant to avoid
the problems of earlier such accounts.
Concepts and Conceptual Frameworks 49
ing name of “Computing Machinery and Intelligence” (Turing 1950), aimed straight
for the heart of one of the most basic of questions: what does it mean to be a thinking
(conceptual) being? Turing was one of the first to see the powerful metaphorical
connections between the new digital computers – remembering, of course, that
the word “computer” originally applied to persons who performed calculations –
and the conceptual structure of human thought, raising important questions about
how far such thought could be reduced to some amount of computation, some set
of algorithms, and where any such explanation must forcibly stop. Although his
Imitation Game, introduced in that paper, has long been taken as providing some-
thing like a gold standard for establishing “true” intelligence (e.g. Whitby 1996),
Blay Whitby and I have argued (Parthemore and Whitby 2014) that Turing never
intended it as an operational test for intelligence but rather as a thought experiment
to get his readers to think about thinking: where does simply going through some
set of predefined motions end, and flexible conceptual thought begin? What is it
that sets a thinking agent apart from an automaton?
One further name bears mention with relation to the heady days of cognitive
science research in the 1970s and ’80s: John Searle, with his Chinese room thought
experiment (Searle 1980): a fascinating counterpoint to Turing’s Imitation Game.
The Chinese room can and has been taken as a lecture on the sin of disregarding
embodiment and its role in cognition (e.g. Preston and Bishop 2002) – a concern
I enthusiastically endorse. Nevertheless, I object to the unstated, yet critical,
intuition/presupposition that Searle quietly slips in: namely that, “if one looks
inside the Chinese room – or, by analogy, inside the skull – sees the operations
there, and ‘knows’ that those operations could not even in principle produce
intelligence/consciousness, then there is no intelligence/consciousness regardless
of any observable behavior or indeed any measurable differences whatsoever” (Par-
themore and Whitby 2013: 113). Consider the ethical minefield that this opens:
if one “knows” that the agent is “really” an automaton, then that might seem to
justify whatever one might do to it in terms of torture, abuse, and so on: after
all, one “knows” that it feels nothing, despite all its perhaps frantic claims to
the contrary.2 For Searle, there are no possible grey areas between being an
automaton and a thinking agent, but rather (as for Descartes, in his own way)
an absolute divide.
I am as convinced as Searle that intelligence and consciousness cannot be achiev-
ed by brute-force programming. Nevertheless, what Searle would place beyond
possibility of doubt I prefer to leave open in principle to empirical discovery – in
the end, my empiricist tendencies trump my rationalist ones. Like Sloman, I want
to try things out and see what happens; so the theory of concepts I am developing
(see Section 3) has been accompanied, from the beginning, by a software program
as a direct translation of that theory: one that shows both where the theory works
and where it still has gaps.
Logical fiats – such as those Searle appears to offer – should always be viewed
with suspicion. For all of their errors and omissions in other regards, the behav-
iourists had this much right: all one can go on, in the end, is what one observes.
Echoing Whitby’s (2003) sentiments, the take-home message I wish to deliver is
that, if cognitive science was never that day or week or month or year away from
achieving “true” AI – as its more naive or unprincipled advocates declared – it
was never quite so blind to situatedness or embodiment or neuroscience, nor so
impoverished of accomplishments, as many contemporary researchers have seen
fit, for their own rhetorical purposes, to portray it.3 Nevertheless, like most if not all
intellectual endeavours, cognitive science faces the need to periodically re-invent
itself – in light of the present age, in keeping with contemporary insights and dis-
coveries.
Cognitive semiotics – characterized by Zlatev (2012: 2) as “an emerging field
dedicated to the ‘transdiciplinary study of meaning’, involving above all researchers
from semiotics, linguistics, developmental and comparative psychology and phil-
osophy” – offers one tantalizingly encouraging opportunity to do so. Cognitive
science, he complains, has often been explicitly reductionist and physicalist; “[cogni-
tive semiotics] is… considerably more pluralist in its ontological and methodological
commitments” (ibid: 3).
For all I see as the enduring strengths of Turing’s insights, what is perhaps
most striking to me about the mind-as-computer metaphor is how badly so many
of both its fiercest advocates and opponents seem to misunderstand the nature of
minds and computers. Computers do not simply “do as they are told” – even as
existing models lack what could be called, in anything but the most metaphorical
of terms, intelligence, never mind free will. Despite what Roger Penrose (1994: 66)
claims to the contrary, they are not equivalent to Turing machines – which are
strictly idealized mathematical entities. The problem is not that computers are
not embodied, and embedded in an environment with which they richly interact:
they are. The problem is rather that they are embodied in the wrong way, not least
because of the seemingly unavoidable dependence of cognition on life – even if
that life is something different than traditionally biologically conceived (Zlatev
2001, 2002, 2009a).
Meanwhile, for all I disagree with her on other matters, Patricia Churchland
is right to observe – contra Penrose – that it is unclear whether anything in the
universe is not, in principle, algorithmically describable or “computable”, in the
very broad way of computability Penrose has in mind (Grush and Churchland
3 Again, if one thinks that they are erecting a straw man, one need only look at the
(growing) field of artificial general intelligence, which unabashedly attempts to revive
an explicit-rule-based GOFAI-style approach. To be fair to AGM advocates, including
Claes Strannegård, whom I cite below, they would certainly not describe themselves
as “blind”; they would simply say that “true” AI can be achieved without necessarily
needing to address one or more of these issues. See (Nizamani et al. 2015; Strannegård
et al 2013) to get a good feel for the general AGM style.
Concepts and Conceptual Frameworks 51
1995: 190). That does not mean that an algorithmic description is always desir-
able, or even possible. After all, some things may, and probably do, ultimately
outstrip the capabilities of human cognition – including, almost certainly, its
capacity to understand itself. Here again cognitive semiotics, with its openness
to methodological and theoretical pluralism, shows its strengths, in allowing the
possibility that there may be (and probably is) no one single “correct” way to look
at the mind, and that the mind-as-computer metaphor, properly understood, has
its role to play. The problem is not the metaphor; the problem, again, is the need
for periodic renewal. To echo a concern from elsewhere in cognitive semiotics (see
below), understanding how the human mind differs from a digital computer – and
the gulf is wide indeed! – requires first having an understanding of those things
we have in common.
In Section 2, I offer my interpretation of the field of cognitive semiotics. In
Section 3, I discuss the focus of my own research interests: namely, the nature of
conceptually structured thought. Section 4 offers a cognitive semiotic perspective
on concepts, looking at how cognitive semiotics improves over traditional cog-
nitive science in offering insights into the nature of concepts and conceptual
frameworks. Section 5 applies these ideas to metaphor theory. Throughout there
will be echoes of both the relationship between cognitive science and cognitive
semiotics, and the enduring power of the mind-as-computer metaphor. I close
with some thoughts about the way forward both for theories of concepts and for
cognitive semiotics.
4 Or sense making: see e.g. Thompson and Stapleton 2009; De Jaegher and Di Paolo
2007.
52 Joel Parthemore
semiotics, Göran Sonesson “has consistently argued for the primacy of perceptual
meaning over other kinds of meaning – including signs” (Zlatev 2012: 6).
Signs and sign use are the most obvious “outward” indicators of meaning. Cog-
nitive semiotics finds some significant part of its origins in semiotics. That said,
“it is not to be seen as a branch of the overall field of semiotics, defined either in
terms of ‘domain’ (in the manner of e.g. biosemiotics, semiotics of culture or social
semiotics), or ‘modality’ (e.g. visual semiotics, text semiotics)” (Zlatev 2012: 2).
In keeping with this relationship, cognitive semiotics tends, when it talks about
signs, to prefer a far narrower definition than offered by Charles Sanders Peirce
for example, for whom a sign may be taken to include nearly anything to do with
meaning (see e.g. Peirce 1981, in stark contrast to Sonesson 2007) – but a broader
definition than one that restricts signs to language or human-language-like com-
munication.
For myself, I prefer to take signs as conceptually mediated attempts at communi-
cation that are at least semi-conventionalized, of which language is a subset; the
intended target could be another agent, or it could be oneself (though the former
is presumably primary). Signs and sign use presuppose communication and social
context. Like concepts (see Section 3), signs have both expression and content,
each of which may be distinguished from the other. Unlike concepts, expression
and content must be distinguishable – at least normally – not just to an “outside”
observer but to the sign-using agent herself. On such a view, concepts are prior to
signs and sign use ontogenetically and phylogenetically, while signs and sign use
are prior to language.
Zlatev (2012, 2015a) describes cognitive semiotics as intrinsically interdis-
ciplinary – or, as he prefers, transdisciplinary. Cutting across rigidly delineated
academic subdomains is, of course, hardly new to cognitive semiotics. What is
new is the particular mix of disciplines it brings and, in particular, the way it
takes lessons from semiotics (regarding meaning), phenomenological philosophy
(regarding experience), and linguistics (regarding language) to re-invigorate dis-
cussions that have been going on for half a century in cognitive science. Of
course, interdisciplinarity is a two-edged blade: focus too narrowly on one’s
specialization, and one risks facing endless terminological turf wars when meet-
ing researchers who use the terms differently; focus too broadly, and one risks
losing oneself in terminological and conceptual vagueness. Different disciplines
use terminology differently for good reasons: because their needs for that ter-
minology, though overlapping, are also different. Nevertheless, a certain disregard
if not irreverence for traditional academic boundaries – as cognitive semiotics
clearly shows – is hard to see as anything but a good thing; as is its refusal to be
drawn either toward a theoretically strong but empirically weak perspective (all
too common in philosophy), or an empirically strong but theoretically weak one
(all too common in empirical science; see Zlatev 2012: 14).
Finally, taking both an onto- and phylogenetic perspective, cognitive semiotics
attempts to discover what is uniquely human through a better understanding of
Concepts and Conceptual Frameworks 53
what is not,5 with the understanding that what appears to be key, at first blush, may
not be. So far, language – in the sense of a rich integration of syntax, semantics,
and pragmatics and not merely innovative communication – seems to be. On the
other hand, concepts and conceptually structured thought arguably are not (see
e.g. Parthemore 2011a, 2013, 2014).
Cognitive semiotics is not an attempt to replace cognitive science. Neither is it,
as Zlatev (2012: 3) considers and rejects, “a new and fancier name” for it. It is rather
an attempt to shine new light on old phenomena: whether that be the origins of
proto-language and language or of consciousness, or the limitations of AI and so-
called machine consciousness – or the nature of conceptually structured thought.
Concepts are the framework underlying not just language but unspoken thought. They
may roughly be taken equally either as units of structured thought or as the ability/-ies
to possess and employ structured thought – I take the two formulations as equivalent;
such that that thought is (a) systematic… (b) productive… (c) compositional… (d) inten-
tional per Brentano … (e) re-identifiable… (f) “spontaneous” per Kant’s terminology…
and… (g) subject to revision… (Parthemore 2014: 193–194).
What this definition amounts to is that a concept of x is one’s structured under-
standing of some concrete or abstract entity x, such that one might, but need not, be
able to explain it even to oneself, let alone anyone else. After all, many of the skilled
basic activities we engage in we feel like we “just do”, without any need to think
about them (Felix 2015: Ch. 3). This claim – that concepts, in effect, bridge Gil-
bert Ryle’s (1949) knowledge how / knowledge that divide – will be controversial in
certain circles; as will the claim that concepts must be relatively stable but not too
stable: that they are not just intrinsically open to change but indeed to continuous,
if incremental, revision.6 Concepts do not stand still. They must be stable to be
reusable across unboundedly many contexts but not too stable so as to be unable
to adapt to each new context – in some way, large or small, different in a thousand
details from any context that preceded it and any that follow.
The other properties I list are, by contrast, all but universally accepted. I have
already explained systematicity and productivity. Compositionality – the idea that
concepts can be added together (and at least some concepts can be taken apart) –
falls out directly; as does re-identifiability: the ability to describe the same thing
as the same thing – from many different perspectives, in unboundedly many con-
texts, by various sensory modalities.7 Intentionality8 is the deceptively simple idea
that concepts are always concepts of: they always have some target (indeed, for
someone).
All of this, however, is to describe concepts as if they were carefully pinned out
lepidoptera. What are concepts in use?
Concepts simplify (and, by simplifying, distort) in pursuit of understanding,
reducing the complexity of the world to a level we can grasp. They allow us to ap-
proach the world in a distinctively flexible way at the price of removing us from
that world – in favour of a conceptual model of it.9 That flexibility of response is
the hallmark of both conceptually structured thought and consciousness. Indeed,
conceptual agency and consciousness may be seen as two sides of a single coin
(Parthemore and Whitby 2014), where the appropriate attribution of the one is the
appropriate attribution of the other. On my reading, such a link is assumed by most
contemporary theories of consciousness – Information Integration Theory (Tononi
2008; Balduzzi and Tononi 2008), Global Workspace Theory (Baars 1988, 1996), Dual
Aspect Theory (Chalmers 1997), to name just three – but rarely if ever made explicit,
an omission I wish to correct.
In keeping with arguments made by Douglas Hofstadter (2000 [1979]) with
respect to Gödel’s Incompleteness Theorem (see also Parthemore 2011: 192–195),
concepts logically entail their own limitations. What this means is – pace Penrose
(1994) – that any given conceptual framework cannot be complete and consistent
at the same time, as a conceptually well-structured argument derived from Gödel’s
two proofs can be made to show. Otherwise, the very expressive power of con-
ceptual frameworks allows the formation of paradox-inducing self-referential
conceptual structures that threaten to bring down the entire framework. Indeed,
as Hofstadter suggests, one might imagine – in a manner reminiscent of the
ancient Greek Pyrrhonists (Lind 2013) – deliberately exploiting the paradoxes to
achieve just that.
Putting this in another way, concepts portray the world as being one way and not
another, with no conceptual perspective privileged above all others: a conceptual
perspective is necessarily a limited one. That most pointedly applies, in paradox-
threatening fashion, to a conceptual view on concepts themselves. For a good vis-
ual equivalent, consider what happens if you train a camera on its own playback
monitor. If the focus is a bit off, one gets an amusing regress (see Figure 1); if it is
straight on, one gets an attractive but uninterpretable abstract pattern.
At least for human conceptual agents (simpler conceptual agents may be less en-
cumbered), concepts have a habit of getting very abstract very quickly, particularly
when we stop to contemplate ourselves contemplating them, as opposed to merely
getting on with using them non-reflectively. Consider the concept of grasp, high-
lighted by Vittorio Gallese and George Lakoff (2005) in making their claim that
all that any concept amounts to is nothing more than a specific sensorimotor en-
gagement, albeit with parts of that engagement typically suppressed. Their assertion
is oddly reminiscent of George Berkeley (1999 [1710]), with his claim that no one
has a general concept of “triangle” that is not the image of a specific triangle, albeit
with parts of that image ignored.
Their choice of such a seemingly concrete concept is oddly telling, since their
account is meant to apply to all concepts – even the most abstract. The problem is
that even the concept of grasp is not so clearly an “actual” sensorimotor grasping as
Gallese and Lakoff would have one believe. Pace their account, the concept of grasp
is at least as unlike a specific instance of grasping as it is like. The very power of
the concept – the power of concepts in general – is the way it abstracts away from
any specific instance of application at the same time that it applies back to every
56 Joel Parthemore
conceivable application: both the cases of physical grasping and those – such as
“eureka” moments – where the grasping is noticeably metaphorical/abstract.
way, borrowed from Inman Harvey (1992), according to which agent A intentionally
(with wilful motivatation) uses B to stand in place of C for herself or another agent D;
and, most importantly, provides a framework within which all of an agent’s (or, on
another level, society’s) many conceptual spaces, as discussed by CST, can come
together in a single, unified space of spaces.
UCST makes the following empirically testable claims (see Parthemore 2015):
1. All concepts can be assigned to one of three sub-categories (objects, events, and
properties) that derive from three innate proto-concepts (proto-objects, proto-
events, and proto-properties): so-called because they fail to fulfil all the properties
of “true” concepts (in particular, they are too few in number to be, of themselves,
productive; and, being innate,12 they are not under the agent’s endogenous con-
trol). These sub-categories align, more or less, with the English lexical categories
of noun/pronoun, verb, and adjective/adverb.
2. All concepts can be oriented with respect to one of three dimensions that de-
fine the unified space and that represent the integral dimensions (see Gärdenfors
2004: 24–26) in common to all concepts: namely, an axis of generalization from
most specific to most abstract, reprising the familiar “ISA” hierarchy whereby
e.g. a cat is a mammal is an animal is an organism, etc.; an axis of abstraction
from most concrete and seemingly non-conceptual/physical to most abstract and
straightforwardly (higher-order) conceptual/mental, where all points along the axis
represent one and the same target at different levels of abstractness; and, finally, an
axis of alternatives, along which one finds all the possible variations of a concept
at the same level of generality/specificity and concreteness/abstractness, derived
by varying the values of any one or more of the integral dimensions defining the
concept most specifically (e.g., hue, saturation, and brightness in the case of colour).
3. In addition to these proximal connections to “neighbouring” concepts within the
unified space, concepts relate to distal (“non-neighbouring”) concepts in three
possible ways:
a. Some concepts (more concrete object and event concepts, but not abstract ob-
jects or events, or properties in general) have one or more necessary subcom-
ponents of the same broad category (the subcomponents of object concepts
are always object concepts, that of event concepts always event concepts).
b. All concepts are defined by one or more integral dimensions.
c. All concepts exist within a context of other concepts with which they are
more or less routinely associated (co-occurring).
A useful metaphor is children’s building blocks, but a special kind where
blocks affect not only directly adjacent blocks but ones that may be consider-
ably removed in the pile.
12 In this way, UCST endorses a very modest nativism, in stark contrast to the radical
nativism commonly associated with the early Fodor, whereby all – or nearly all – of
our concepts are innate.
58 Joel Parthemore
artes for all non-human animals, extended to human beings by T.H. Huxley (1874;
see also Kim 2007). Once again, cognitive semiotics offers a contrasting perspective.
only function as concepts to the extent that they are able to change as needed
to fit each new context: to revise the old proverb, one cannot fit an unchanging
peg into a constantly changing hole; the essentially static nature of concepts is
an illusion, and meaning only appears fixed as a necessary convenience (Par-
themore 2014).
The question of whether concepts change (Woodfield 1990) and, if so, how intrinsic
is that capacity for change (do some change? all? must they change?) remains
one of no small debate. Fodor’s concepts cannot change because concepts, on his
account, are strictly public entities defined purely by their extension: my concept
of gold is exactly the same as the ancient Greeks’ concept of gold, despite our
differing understandings of and beliefs about gold, because the thing that gold
62 Joel Parthemore
refers to is the same. From a cognitive semiotic perspective, in contrast, beliefs are
partly constitutive of concepts and not just (as Fodor and everyone else allows)
concepts constitutive of beliefs: i.e., concepts are defined both by their extension
and intension.
Cognitive semiotics is open not just to the possibility but to the inescapable
reality of conceptual change because of the way it “studies meaning on all le-
vels – from perception to language, along with the various forms of ‘external’,
cultural representations (theatre, music, pictures, film, etc.) – primarily as dy-
namic processes rather than static products…. Nearly all [cognitive semiotics]
scholars have made the point that viewing meaning in purely static, structural
terms is insufficient for understanding the essentially relational, subject-relative,
and (often) interpretive nature of semiosis” (Zlatev 2012: 16). Cognitive semiotics
views meaning as fundamentally dynamic, because of the way it views not just
life but the lifeworld itself as fundamentally dynamic.
In alignment with enactive philosophy, which takes interaction as a fundamen-
tal entity and agent as continuous with (not ultimately separable from) environ-
ment, cognitive semiotics shows how concepts impose crisp boundaries on what
are, from a conceptual perspective, underlying continuities. In psychological
parlance, this is the phenomenon known as categorical perception. As Stevan
Harnad explains (1990: ix), “categorical perception occurs when the continuous,
variable, and confusable stimulation that reaches the sense organs is sorted out
by the mind into discrete, distinct, categories whose members somehow come
to resemble one another more than they resemble members of other categories”.
He continues (1990: 3): “a [categorical perception] effect occurs when… a set of
stimuli ranging along a physical continuum is given one label on one side of a
category ‘boundary’ and another label on the other side…. In other words, in CP
there is a quantitative discontinuity in discrimination at the category boundaries
of a physical continuum…”.
The implication is that, pace the natural kinds philosophers, so-called natural
kinds concepts do not, to borrow a well-worn phrase from Plato’s Phaedrus, “carve
nature at its joints”; if nature has joints at all, if the notion of natural “joints” is
even coherent, then these are not the joints that matter conceptually. This is not
to say that concepts (for the individual or the society) are free to draw lines where
they want: they are not, any more than agents are free to believe whatever they
wish about the world; only that even such basic things as our seemingly inescap-
able tendency to view the world in terms of objects, events, and properties may say
more about us than it does about the world: we literally cannot imagine seeing
the world any other way.
With its refusal to shy away from paradoxical or even seemingly contra-
dictory perspectives, cognitive semiotics shows how concepts hide the very world
they reveal, beneath a veil of interpretation. They both vastly extend cognitive
capacities and, at the same time, in fundamental ways limit them. On the one
hand, they facilitate an active choice of response: faced with the “same” circum-
Concepts and Conceptual Frameworks 63
stances on different occasions, the same agent can choose differently, based on
her reasoned consideration of what has gone before and what is yet to come.
Her mind is not trapped in the present moment but free for mental time travel
(see e.g. Suddendorf and Corballis 2007; Osvath and Gärdenfors 2007) into the
past and the future. On the other hand, they rob the agent of a certain essential
spontaneity: a fully in-the-moment experience that, perhaps, only strictly non-
conceptual agents can have; at the same time, by presenting the world in one
way rather than another, they blind agents to the range of possible alternative
perspectives. Conceptual abilities are neither a straightforwardly good nor bad
thing; they are something that sets us very much apart, though, from species
that lack those abilities.
Finally, through its theoretical and methodological pluralism, cognitive se-
miotics reveals how concepts are not one unified phenomenon. No single theory
will ever capture conceptual abilities and frameworks in all the ways one might
be interested in. Cognitive semiotics is in general open to the possibility that
there is no one, single, “correct” perspective on many if not most matters of
substance, including the endless tedious debates over representationalism and
whether concepts are best seen as representations (as Fodor or Prinz would
have it) or abilities (as Evans (1982) is commonly read, as Gottleb Frege (1951)
doubtless intends, and as Alva Noë (2004) would have it; see also Parthemore
2011a: 20–30). Given its rejection – with the enactivists and phenomenologists –
of unconscious representation but its openness to a kind of (conscious or reflec-
tively self-conscious) representationalism, cognitive semiotics is amenable to the
possibility that concepts both are (when we look at them) and are not (when we
simply use them) representations (see again Table 1). That is, cognitive semiotics
advocates a measured representationalism/anti-representationalism and rejects
the dogmatic anti-representationalism that many discern in radical enactivism
(see Zlatev 2012: 12).
Where does all this conceptual revisionism leave us in terms of the mind-
as-computer metaphor? Just as digital computers were named after their human
predecessors, because of a certain striking resemblance. In particular, the com-
puters’ capacity to perform calculations that previously only a human computer
could do; so, too, may the human mind be seen to reflect the operations of a digital
computer – in some ways – albeit a massively parallel one, with a storage and
computing capacity that, in certain key respects, continues to put computers to
shame. Moreover, certain living creatures, including at minimum human beings,
are conceptual agents – something that, given all available evidence, no existing
artefact can lay claim to.
At the same time, the take-home message that cognitive semiotics invites us to
draw from Brooks among others is that much of our vaunted intellectual capac-
ities – supposedly so flexible, so subject to our free will, so demonstrative of our
superiority over other species – are surprisingly mechanical, straightforwardly
algorithmic, and utterly dependent on the environment being a certain way. Much
64 Joel Parthemore
5. A
cognitive semiotic perspective on concepts: The case
of metaphor
Before I close, I would like to offer a more specific and concrete example of just
how cognitive semiotics can re-shape our thinking about concepts, by looking at
how it can help us re-conceptualize not just this one metaphor but metaphor in
general – not as a strictly linguistic phenomenon but, in keeping with Conceptual
Metaphor Theory [CMT] (Lakoff and Johnson 2008 [1980]; see also Fusaroli and
Morgnani 2013) first and foremost a conceptual one, underlying and making poss-
ible linguistic metaphor.15
CMT has undoubtedly contributed to the study of language and cognition with its
recognition that what is metaphorical in language is first metaphorical in thought:
again, language and systematically structured thought, though deeply intertwined,
pull apart. Nevertheless, CMT remains burdened with conceptual baggage from its
predecessors: most notably a tendency toward a strict ontological (as opposed to
pragmatic and conceptual) distinction between literal and metaphorical and a con-
sequent naive assumption that literal meaning can more or less be taken for granted,
while metaphorical meaning requires explanation. This leads to many a tortuous
debate over whether a given construction is or is not metaphorical and whether, if
it is, it qualifies as “dead” metaphor.
In keeping with CMT, and in alignment with the discussion above, I hold that
metaphor is not a strictly linguistic phenomenon. In contrast to CMT, and again
following on from the earlier discussion, I wish to suggest that metaphor is not
definable in opposition to literal meaning, at least without a full explanation of
what counts as literal meaning, which Allwood (1981) argues to require as much
explanation as metaphor; that it is not clearly demarcated from non-metaphor; and
that it is consequently not meaningfully separable into “living” and “dead”. Meta-
phor is not some type of natural kind, nor indeed any other reifiable entity. It is not
something one possesses a stable collection of so much as something one actively
and continuously engages with and helps to create.
What, then, is the relation between literal meaning and metaphor? The literal/
metaphorical distinction – often useful, perhaps necessary – is a conceptual rather
than a prior ontological one. Alternatively, and, I believe, equivalently, metaphor
may be defined as a special case of contextual meaning determination, based on
meaning potentials (Allwood 2003). From a UCST-inspired view, essentially all con-
ceptual meaning involves the mapping across domains meant to be characteristic
15 This is based on work I am currently doing with Jens Allwood at the University of
Gotherburg, Sweden.
Concepts and Conceptual Frameworks 65
16 Consider the English word haven, whose Nordic equivalent continues to retain its
earlier meaning of “port” or “harbour”. In English, what had been the metaphorical
66 Joel Parthemore
and images, from just another tool into something playful. When one’s view of the
world becomes too settled, too mundane, metaphor enters in.
Figure 3. “ This is your brain on drugs”: centerpiece of a popular and widely memed
campaign by the US government as part of its “war on drugs” in the late 1980s
and early ’90s. Public-domain image downloaded from Wikimedia Commons
(http://commons.wikimedia.org).
This perspective on metaphor eschews talk of the literal and sees the seemingly
sharp boundaries between the metaphorical and non-metaphorical in light of cat-
egorical perception. The upshot is that the sharp boundaries are a product of how
we use and interpret metaphor and not part of its underlying ontological nature,
whatever that might be. Such a move makes questions of what precisely is and is
not metaphor highly dependent on context, and arguments over such – whenever
some global perspective is assumed – ultimately meaningless.
This perspective on metaphor reveals how metaphor both tremendously extends
our cognitive and linguistic possibilities – allowing the juxtaposition of nearly any
conceptual domain with any other domain – and, at the same time, limits them. It
does so in part by opening the possibility to yet more confused communications
and confused thoughts (just because any two domains can be juxtaposed does not
mean that they should be: a good metaphor should not be “too” novel but just novel
enough, in part by lending the abstract – the most common target of metaphor – a
meaning has become the primary meaning, and the earlier primary meaning has
been all but completely forgotten.
Concepts and Conceptual Frameworks 67
seeming concreteness that it does not genuinely have). The lesson may be that much
of what we think of as paradigmatically concrete may be far more abstract than we
realize: not least our seemingly most concrete concepts.
Finally, through its theoretical and methodological pluralism, cognitive semiotics
shows how metaphor can usefully be considered through its own sets of contrasting
perspectives (Table 2).
6. Conclusions
This chapter began from the premise that GOFAI was never so “bad” FAI as it is
often made out to be but that, nevertheless, like any field of inquiry, cognitive
science is up for periodic renewal, and cognitive semiotics – with its focus on
the meaning of meaning; with its prioritizing of dynamic processes over stable or
static entities; with its inspiration from enactivism, phenomenology, and semiotics;
with its inclination toward theoretical and methodological pluralism – provides
one highly promising opportunity to do so. A proper understanding of concepts is
necessary for understanding both mind in general and consciousness in particular,
where concepts and consciousness are two sides of one coin. However, too much
of the thinking about concepts in “traditional” cognitive science remains mired in
discussions about (generally symbolic – indeed generally propositional, explicit as
opposed to implicit, etc.) knowledge representation.
Cognitive semiotics has much to inform theories of concepts regarding e.g. the
nature of phenomenal experience and the lifeworld but, even more basically, to break
discussion of concepts out of the traps into which it has all too frequently fallen:
allowing one to see concepts as dynamic rather than fixed, prior to and foundational
to language rather than part-and-parcel with, of one nature when we stop and reflect
on them but logically of another when we possess and employ them non-reflectively,
and so on. The hope is that a better understanding of concepts can, in turn, inform
68 Joel Parthemore
Acknowledgments
The author gratefully acknowledges the financial support and creative intellectual
environment of the Centre for Cognitive Semiotics at Lund University, where much
of the work for the present chapter was conducted; his colleagues in the Department
of Cognitive Neuroscience and Philosophy at the University of Skövde; as well as
useful feedback received during his presentation at the International Association
for Cognitive Semiotics (IACS) conference in Lund, Sweden, in September 2014, and
from the reviewers of earlier versions of this text.
Morten Tønnessen
Chapter 4
Agency in Biosemiotics and Enactivism
1 I will not attempt to define each of these four features. Some examples of their
common usage is provided by Tønnessen (2015). Precisely defining each of the four
features mentioned in this biosemiotic definition of agency would not necessarily
clarify matters much, given that each new definition would in turn call for yet
another definition, and so on ad infinitum. Most definitions are provisional and
indicitative, nothing more. However, let me say this much: The phrase “choice of
action”, as it is understood in biosemiotics, does not refer only to conscious choices,
since agency is taken to occur in all living systems. The “choices” being made refer
to the (sign-based) taking of different paths among several alternatives, depending
on “processing of semiosis” at any particular level of biological organization.
70 Morten Tønnessen
idea that “autonomous agents actively generate and maintain themselves thereby
enacting or bringing forth their own cognitive domains”.3
Biosemioticians differ in their attitudes to enactivism. While some biosemiotic
scholars see these two approaches as complementary and related to each other,
others view enactivism as incompatible with and inferior to biosemiotics. The latter
group of scholars will, unsurprisingly, typically criticize enactivism for not being
able to account for the semiotic aspects of life processes. If living systems are in-
trinsically of a semiotic nature, as biosemioticians hold, enactivism is of course at
a loss insofar as it does not relate to the semiotic nature of nature. But as we will
see, enactivism envelops a whole range of different perspectives, and such dis-
missive attitudes cannot possible apply to all existing versions of enactivism. As
Andreas Weber (2001: 11) states, central arguments or views in enactivism “can be
fruitful for a Biosemiotic approach to [the] organism”. Weber (ibid) further points
out that “Varela himself already applies concepts as e.g. “signification”, “relevance”,
“meaning” which are de facto Biosemiotic.” While Varela was personally “always
reluctant about semiotics” due to its formal design, Weber thinks his work “shows
a close association of the fields of embodied cognition, biological phenomenology,
and semiotics, conflating in what could best be called a biology of subjective agents”
(ibid: 12). Although Weber makes convincing points, it is worth remembering that
the varieties of enactivism advanced by him, as well as by Varela and Thompson,
only form one prominent kind. Nevertheless, this demonstrates that bridging en-
activism and biosemiotics is possible in principle.
The fact that enactivism is not a single theory can be illustrated by Daniel Hutto
and Erik Myin’s recent criticism of another central enactivist, Alva Noë. In general
terms, Hutto and Myin (2013: 3) state that it “is certainly true that there hasn’t
yet been a definitive articulation of the central and unifying assumptions of [en-
activism]”, and observe (ibid: 14) that “when Enactivists speak of “embodied action”
and their intellectual opponents talk of “action,” they are not operating with the
same notion of action.” Noë (2008: 663), on his part, holds that “[p]erceiving is an
activity of exploring the environment drawing on an understanding of the ways in
which one’s movements affect one’s sensory relations to things” (italics in the original).
In other words, his understanding of perception is tied to both understanding, or
knowledge, and movement. For Noë, there is no sharp distinction between con-
ceptual knowledge and practical ability – perception as such is conceived of as a
thoughtful activity. In Hutto and Myin’s eyes (2013: 23), Noë’s work is representative
of “sensorimotor enactivism”, which they regard as distinct from their own “radical
enactivism”. In their interpretation, Noë’s “Sensorimotor Enactivist claims can be
read as concerning essentially personal-level phenomena”, or in other words “inter-
active organismic engagements” (ibid), and this “version of Sensorimotor Enactivism
3 Cf. Varela’s statement in the Afterword of Maturana and Varela (1998: 255): “I have
proposed using the term Enactive to […] evoke the idea that what is known is brought
forth”.
72 Morten Tønnessen
4 I thank Paulo De Jesus for pointing out to me, in response to reading a draft of this
chapter, that “autopoietic enactivism […] speaks of sense-making which is” in De
Jesus’ opinion “almost equivalent to semiosis”. De Jesus holds that autopoietic en-
activism “provides a more encompassing framework” than sensorimotor enactivism
or radical enactivism, and that it can “subsume other strands of enactivism insofar
as these are compatible.”
5 The selection of biosemiotic scholars presented here is intended to indicate some
of the variation of biosemiotic perspectives, but as we will see the two perspectives
overlap and are on most points not mutually exclusive. In a similar manner, the
selected enactivist scholars have been chosen for study in order to indicate some of
the most important (but certainly not all) variation in views within enactivism.
Agency in Biosemiotics and Enactivism 73
what more extensive class of organisms) make choices. This discussion can only
be carried out in light of one’s understanding of the range of agency in nature: Are
all living organisms agents, i.e. endowed with agency? Are there non-living agents
as well? And if there are non-living agents too, are all these made by or controlled
by living agents? As we will see in the presentations of three perspectives within
biosemiotics, even though biosemioticians tend to operate with a wide range of
agency in nature, there are some important differences in the understanding of
agents even within the biosemiotic community. In this section, I outline biosemiotic
understanding(s) of agency, by presenting firstly the views of Hoffmeyer, followed
by those of Sharov (and Vehkavaara).
2.1. Hoffmeyer
Jesper Hoffmeyer, a Danish molecular biologist, is arguably the most emblematic
contemporary biosemiotician. His book Signs of Meaning in the Universe (Hoff-
meyer 1996) helped define biosemiotics as an emerging field, which since 2001 has
had annual conferences (“Gatherings in biosemiotics”) and since 2005 a society
(International Society for Biosemiotic Studies). Of the nine citations that were sur-
veyed by Tønnessen (2015: 129–130), the most popular was a citation from a text
by Hoffmeyer (1998: 38):
The question of how to account for the existence of natural autonomous agents […]
leads us to the question of the origin of semiotic competence, which according to the
American linguist and semiotician Thomas A. Sebeok is probably congruent with the
question of the origin of life.6
The understanding that life is fundamentally agential, which is typical for biose-
miotics, is consistent with Hoffmeyer’s (2009: 940) observation that
[i]t is indeed possible to explain non-human living systems as if they were deprived
of semiosis, agency and true interests. This is exactly what biology has been doing for
more than a century and often in ingenious ways. In the course of the last few decades,
however, it has become clear that the reductionist assumptions that lead scientist[s]
to pursue such an awkward approach are no longer supported by scientific evidence.
What Hoffmeyer is claiming here is that there is today overwhelming scientific
evidence for the existence of agency and semiosis in living systems in general.7
This understanding presupposes a simple conception of signs, and of action, which
can be applied widely. Hoffmeyer tends to make a distinction between semiosis –
sign exchange – as a phenomenon that occurs throughout the living realm, and
8 With Hoffmeyer’s perspective, we could say that action is tied to either perception
(in animals with a central nervous system) or simpler semiosis (in other animals,
and living systems).
Agency in Biosemiotics and Enactivism 75
2.2. Sharov
Alexei Sharov, another central biosemiotician, holds that agents are “systems with
goal-directed programmed behavior” (Sharov 2010: 1052), and that agents are “either
living organisms or their products” (ibid), since only such systems can pursue goals.
Notably, Sharov holds that humans are not the only species that recruit subordinate
agents, or “subagents”, as he calls them. Various agents therefore tend to be inter-
connected, in Sharov’s perspective, and agents “are always produced by other agents
of comparable or higher complexity” (ibid). Elsewhere, Sharov (2013: 345) defines
an agent as “a system with spontaneous activity that selects actions to pursue its
goals”. This reflects the aforementioned biosemiotic definition of agency as invol-
ving “choice of action” – and, relatedly, Sharov (2010: 1053) states that “agents select
specific actions out of multiple options”. He thus indicates that for agents in general,
whether these are living agents or abiotic agents controlled by humans, different
actions are possible, and being an agent amounts, among other things, to choosing
specific actions from the available options. Sharov acknowledges the existence of
very simple agents, e.g. ribosomes and viruses (Tønnessen 2015, Appendix: 4).
An article of particular interest in our context, “Protosemiosis: agency with re-
duced representation capacity” (Sharov and Vehkavaara 2015), has just appeared.
Here Sharov and Tommi Vehkavaara, a Finnish philosopher and biosemiotician,
distinguish between “protosemiosis”, where “agents associate signs directly with
actions without considering objects” (ibid: 103), and “eusemiosis”, where “agents
associate signs with objects and only then possibly with actions” (ibid). Protosemio-
sis thus refers to simpler forms of sign exchange than eusemiosis does. The term
“object” refers first of all to the object in Peirce’s triadic sign model, but simulta-
neously objects are also defined as “distinct components of the environment which
can be addressed selectively and repeatedly by agents for sensing and action pur-
poses” (ibid: 106). An implication of Sharov and Vehkavaara’s acknowledgement
of protosemiosis is that Peirce’s triadic sign model is at work in some places in the
living realm (namely, in eusemiosis), but not everywhere in the living realm. In
protosemiotics, Sharov and Vehkavaara (ibid: 105) state that “[the] Peircean logical
concept of sign should be substituted by a more general one”. Interestingly, Sharov
and Vehkavaara refers to “actions” even in protosemiosis – which means that their
notion of action is of wider application than their notion of proper signs.
From the two biosemioticians’ perspective (ibid: 103), “[p]rotosemiosis started
from the origin of life, and eusemiosis started when evolving agents acquired the
ability to track and classify objects”. Protosemiotic agents – such as bacteria – do
not have the capacity to recognize objects. Protosemiosis is thus “a primitive kind
of semiosis that supports ‘know-how’ without ‘know-what’ ” (ibid). “Our conclusion
that sensing in bacteria does not refer to objects”, Sharov and Vehkavaara (ibid: 113)
observe, “contradicts […] the views of Hoffmeyer and [Claus] Emmeche (1991) […]
who believe that categorization as well as Peircean triadic relationship between
76 Morten Tønnessen
sign vehicle, object, and interpretant are universal for semiotic processes in all
living organisms.”9
As we have seen, Sharov and Vehkavaara pinpoint specific disagreements with
Hoffmeyer with regard to the range of what the two call eusemiosis. In their ter-
minology, Hoffmeyer would hold that all biosemiosis (i.e., sign exchange in the
living realm) is eusemiosis, and that there is no protosemiosis (or at least this is what
they indicate as probable). Sharov and Vehkavaara (ibid: 106), on the other hand,
stress that “[o]rganisms with a low level of functional organization (e.g., with [a]
small number of sensors and/or weak information processing) tend to have reduced
representations of objects around them”, and claim that “it seems reasonable to ex-
pect that there is some threshold of functionality below which objects are no longer
represented by organisms”.
A difficulty in understanding how exactly Sharov’s perspective differs from Hoff-
meyer’s concerns the term “object” as it is applied by the two scholars. Notably,
while Hoffmeyer uses snakes’ perception as a case study on relating to objects,
Sharov and Vehkavaara discuss bacteria’s sensing – and thereby refer to a much
simpler organism. They can hardly refer to “objects” in the exact same sense. In
Sharov and Vehkavaara’s view, snakes must clearly relate to objects, in their sense
of objects – the two indicate that even ticks relate to objects (ibid): “We cannot
say that the object (deer) is entirely absent in this sign relationship, because after
dropping down the tick expects to land on a warm and furry surface of an animal
and is prepared for a certain sequence of actions.” In sum, this implies that, on one
hand, Sharov and Vehkavaara operate with a lower threshold for organisms’ relating
to objects than Hoffmeyer does, but, on the other hand, they claim that they might
differ from Hoffmeyer in acknowledging that Peircean semiosis (broadly speaking,
eusemiosis) is not universal in the living realm.
3.1. Gallagher
Based on a review of various theories and brain-imaging experiments, Gallagher
(2007d) claims, in “The Natural Philosophy of Agency”, that “there is no consensus
about how to define the sense of agency.” In Gallagher’s understanding, agency in
its proper sense depends on the agent’s “consciousness of agency”. The “sense” of
9 Sharov and Vehkavaara (ibid) acknowledge that “this disagreement possibly results
from differences in terminology”.
Agency in Biosemiotics and Enactivism 77
Gallagher underlines that “in most cases of intentional action, [we] engage in such
action without having an action plan” (ibid: 20), and acknowledges (ibid: 28) that
[w]e do not attend to our bodily movements in most actions. We do not stare at our
own hands as we decide to use them; we do not look at our feet as we walk; we do not
attend to our arm movements as we engage the joystick.10 Most efferent motor control
and body-schematic processes are non-conscious and automatic.
In attempting to explain how we relate to our own bodies, Tsakiris, Schütz-Bosbach
and Gallagher (2007: 645) suggest that “the coherent experience of the body depends
on the integration of efferent information with afferent information in action con-
texts”. Here, efferent information, manifested as efferent signals, is derived from
motor neurons, whereas afferent information, manifested as afferent signals, is
derived from sensory neurons. “Overall”, the authors write (ibid), “whereas afferent
signals provide the distinctive content of one’s own body experience, efferent signals
seem to structure the experience of one’s own body in an integrative and coherent
way.” Such an understanding, based on modern neurophysiology, should on the
whole be compatible with Jakob von Uexküll’s (1928) model of the functional cycle.
Reframed in Uexküllian terminology, we can in simplified terms state that efferent
(motor) signals effectively constitute the Innenwelt (inner world) whereas afferent
(sensory) signals effectively constitute the Umwelt (outer world) of an organism.
What Tsakiris, Schütz-Bosbach and Gallagher are claiming is in other words that
conscious animals relate to their own bodies by (continuously, we must assume)
integrating Umwelt and Innenwelt – and, as we might add, Merkwelt (perceptual
world) and Wirkwelt (effective world, world of action).
3.2. Thompson
While Gallagher’s treatment of agency has largely been devoted to examining the
phenomenon of having a sense of agency, Thompson’s approach to agency is a bit
harder to summarize concisely. In this chapter I limit myself to aspects of his mono-
graph Mind in Life: Biology, Phenomenology, And the Sciences of Mind (Thompson
2007). I generally agree with Thompson (ibid: x) that “we need scientific accounts of
mind and life” that are informed by rich phenomenological accounts of the structure
of experience, and that “[p]henomenology in turn needs to be informed by psychol-
ogy, neuroscience, and biology.”
From the perspective of a biosemiotician such as myself, whose chief source of
inspiration is Jakob von Uexküll’s Umwelt theory, it is noteworthy that Thompson
(ibid: 59) explicitly adopts Uexküll’s Umwelt concept: “This idea of a sensorimotor
world – a body-oriented world of perception and action – is none other than von
Uexküll’s original notion of an Umwelt.” While Thompson refers to sensorimotor
worlds only, however, it is worth keeping in mind that Uexküll did not limit the ap-
plication of the Umwelt notion to the sensorimotor worlds in which animals with a
central nervous system live, but extended the notion even to unicellular organisms
such as bacteria. At any rate, in his modified version of Uexküll’s functional cycle,
Thompson (ibid: 60, Figure 3.3) depicts the co-emergence of autonomous selfhood
and world, and refers to the “significance/valence” that results from the “domain
of interactions”, which is in turn affected by the “identity” of the animal and its
“operational closure”.
Later on in the book, Thompson (2007: 146–147) recounts how Varela in one
of his final essays revised his view on whether or not “autopoiesis involves any-
thing teleological” (ibid: 146). As we shall see, in this late text Varela also refers
to Uexküll’s Umwelt notion as a given. Weber and Varela (2002) “argue that auto-
poiesis entails immanent purposiveness in two complementary modes” (Thompson
2007: 146), namely identity and sense-making. As for the latter, in Thompson’s
words (ibid: 146–147) “an autopoietic system always has to make sense of the world
so as to remain viable. Sense-making changes the physicochemical world into an
environment of significance and valence, creating an Umwelt for the system.” In
both Varela’s and Thompson’s view, sense-making “is none other than intentionality
in its minimal and original biological form.”11 In the words of Weber and Varela
(2002: 117),
because there is an individuality that finds itself produced by itself it is ipso facto a
locus of sensation and agency, a living impulse always already in relation with its
world. There cannot be an individuality which is isolated and folded into itself. There
can only be an individuality that copes, relates and couples with the surroundings, and
inescapably provides its own world of sense.
This world of sense can be conceived of in terms of Uexküll’s Umwelt notion, and is
in Weber and Varela’s outlook (ibid) related to the organism’s intrinsic teleology as-
sociated with its “sense-creation purpose whence meaning comes to its surrounding”,
whereby the organism introduces a difference between environment and world. In
short (ibid: 117–118):
[b]y defining itself and thereby creating the domains of self and world, the organism
creates a perspective which changes the world from a neutral place to an Umwelt that
always means something in relation to the organism.
So far we have examined some similarities between biosemiotic thinking informed
by Uexküll’s Umwelt theory and Thompson’s and Varela’s enactivism. A key term
that has consistently appeared is that of “self”. In some sentences, Thompson ap-
pears to equate the term “self” with “agent”. For instance, he writes (2007: 260) that
“[k]nowledge implies a knower or agent or self that embodies this knowledge.”
12 Another statement in the Preface (ibid) is, despite its deliberately wide-reaching
content, far more accurate: “The roots of mental life lie not simply in the brain, but
ramify through the body and environment.”
Agency in Biosemiotics and Enactivism 81
4. Conclusion
We have seen in this chapter that there is considerable variation in views on
agency within both biosemiotics and enactivism. This makes direct comparisons
between the two fields difficult. However, a general tendency is that biosemiotics
most often operates with a notion of action, and thereby of agency, that is of a
wider range than enactivist notions of action and agency. In other words, biose-
mioticians tend to have a lower threshold for agency than enactivists do. In the
work of Thompson, Varela and Weber, however, there is considerable overlap with
the biosemiotic approach, since all these scholars make use of Jakob von Uexküll’s
Umwelt theory. Overall the treatment of Hoffmeyer’s, Sharov’s, Gallagher’s and
Thompson’s views on agency have shown that biosemiotics and enactivism are
to some extent compatible.
Within a biosemiotic perspective, I think it is very important to distinguish
between proper subjects and quasi-subjects (Tønnessen and Beever 2014). Proper
subjects stand out because their lives have another dimension – namely, unified,
cohesive experience of their surroundings – which quasi-subjects lack. Quasi-
subjects such as plants, fungi and animals with decentralized bodies also have
semiotic agency and quasi-experience, but only proper subjects have cohesive,
integrated experience. In consequence, while biosemiotics can refer to “subjec-
tive experience” and “subjectivity” everywhere in the living realm, we should
distinguish sharply between subjective experience proper and quasi-subjective
experience.13
As I promised in the introduction, I conclude by situating cognitive semiotics
within a relevant, thematically more extensive field. The field I have in mind is
none other than that of biosemiotics. Just as biosemiosis, i.e., sign exchange in
the living realm, is the study matter of biosemiotics, the proper study matter
of cognitive semiotics is cognitive semiosis, understood as biosemiosis related
to the operation of consciousness. Since cognitive semiosis is a special case of
biosemiosis, we can conclude that, in a similar vein, cognitive semiotics is a special
case of biosemiotics. Cognitive semiotics should therefore be conceived of as a
subfield of biosemiotics.
Contemporary cognitive semioticians and biosemioticians have different theor-
etical backgrounds, and to some extent different semiotic and philosophical sources
13 As we have seen, Hoffmeyer holds, in this terminology, that snakes and possibly
reptiles in general, plus even simpler organisms, are quasi-subjects. Sharov and
Vehkavaara could be said to hold that at least bacteria and plants and fungi are
quasi-subjects.
82 Morten Tønnessen
Acknowledgements
This work has been carried out thanks to the support of the research project Ani-
mals in Changing Environments: Cultural Mediation and Semiotic Analysis (EEA
Norway Grants/Norway Financial Mechanism 2009–2014 under project contract
no EMP151).
Juan Carlos Mendoza Collazos
Chapter 5
Design Semiotics with an Agentive Approach:
An Alternative to Current Semiotic Analysis
of Artifacts
agenda relationship, that is, the relation between an agent’s actions and the type
of outcome that the agent seeks to achieve through his or her actions (Niño 2015),
establishes the agentive approach.
It can be noted that agentive semiotics has principles in common with biose-
miotics, but some differences must be pointed out. Agentive semiotics focuses on
human agency and it is centered on the explanations of the conditions that enable
human beings to signify, whereas biosemiotics is centered on the biological basis
of semiosis with more of a stress on primeval organisms rather than complex,
high-level agents. According to agentive semiotics, the conditions of the agency are
animation (Sheets-Johnstone 2010, 2011), situatedness (Thompson 2007: 13; Hutchins
2010: 428) and attention (Goldstein and Naglieri 2014). These three conditions are
mandatory if an entity is to be considered an agent.
Conversely, within some approaches in biosemiotics (e.g. Sharov 2010), artificial
devices such as automated machines and robots are also regarded as agents, as they
may fulfill the three key conditions: “(1) agents select specific actions out of multiple
options, (2) these selected actions are useful in a sense that they help agents to reach
their goals, and (3) agents do not emerge by chance, they are produced only by other
agents of comparable or higher level of functional complexity” (Sharov 2010: 1053).
These conditions are sound but do not go far enough because they do not address
the situated, animated and attentive conditions of an agent, which are not present in
artificial devices. In agentive semiotics, an agent acts only if it is situated. Its action
must be embedded and embodied, and have a kineto-perceptual basis and a specific
place and time. An agent’s action is always affective; it tends to interact with other
agents and it is under several degrees of control (from unconscious to highly con-
centrated actions); thus, it is always attentive (Niño 2015: 92). The agentive approach
is centered on the agent/agenda relationship with a focus on the agent in context
(Zlatev 2002), where “context” is understood as a sociohistorical and cultural con-
struction regulated by agendas. The narrative character of a (complex) human agent
and his volition, memory and capacity for action are comprehensively explained by
agentive semiotics, which is based upon the following principles:1
a. Signification is an exclusive activity of agents.
b. The process of signifying is intrinsically apt to be assessed.
c. The distinction between significance vs. signification.
(a) Signification is an activity that is exclusive to agents, and this is the core idea
in agentive semiotics. Therefore, things, words, and signs do not intrinsically have
meanings, because they do not have inherent agency. In consequence, signification
is not a property of signs. If signs or things do not have signification, they are not
the starting point in semiotics. Understanding how agents construct meanings, as
well as the cognitive processes that make possible the construction of meaning, are
1 The original work Elementos de Semiótica Agentiva by Niño (2015) can be consulted
for further explanations.
Design Semiotics with an Agentive Approach 87
For example, in the design process for a cell phone, the designer assigns functions
to the artifacts. The cell phone acquires a derivative agency through the design
process. But the cell phone cannot act for itself, even if it executes automatic func-
tions. When users recognize the cell phone functions, they may attribute agency
to the cell phone. Derivative agency depends on agents’ actions in order to obtain
meaning. Only agents can detect derivative agency in the artifacts. For this reason,
strictly speaking, an artifact does not interact with users. A blender, for example, has
functions assigned to it by the designer (derivative agency). But a blender does not
have the capacity to act for itself, nor to recognize other artifacts or the agency of
users. The agency of a blender depends upon its users, and they alone can recognize
its functions. Artifacts are not agents, even if they work automatically. The capacity
to act requires enactive conditions such as animation, situatedness and attention. It
also requires cognitive capabilities such as perception, memory or volition (Prinz
et al. 2013), as well as special properties such as autopoiesis and an intrinsic value
system (Zlatev 2002: 289). Interaction is only possible between agents; however,
interaction between agents is often mediated by artifacts.
88 Juan Carlos Mendoza Collazos
Derivative agency may look similar to the notion of functional information intro-
duced by Sharov (2010). However, there is one notable difference: Sharov uses the
notion of functional information in order to argue that artificial devices are agents,
whereas in agentive semiotics only agents can recognize, detect, activate and benefit
from derivative agency in artifacts. Hence, agents attribute meanings to artifacts.
Artificial devices are not agents; they are only prosthetic incorporations that agents
use in order to improve efficiency and obtain their goals. Agents incorporate ar-
tifacts into their ongoing agendas. For this reason, artifacts do not have their own
agendas. They are just a means to an end, and the agendas belong exclusively to the
agents. For example, the agenda or goal of a comfortable divan is not that people sit
there; the divan merely exists with the properties and functions assigned to it by
the designers. It does not possess the will to persuade people to use it. Only agents
who see the divan recognize its functions and properties (e.g. comfortable, safe,
stable); thus, agents incorporate the divan into their agenda. Similar arguments can
be made for advanced automated devices such as robots or CNC machines. This kind
of artifact executes functions automatically, but it is not plausible to suggest that
robots have their own agendas. The meaning of their functions is only of benefit
to the agents.
The principles (b) and (c) are best explained jointly. Any kind of possible re-
sponse, grounded or justified, is comprised of a nodal network of potential re-
sponses, namely, significance. Signification, on the other hand, is an actual response
that an agent activates (see Figure 2).
For example, a bicycle has a complex nodal network of possible responses that are
justified or grounded by its significance conditions. Signification emerges when
an agent updates a response or a set of responses. The agent’s potential responses
prompted by the significance conditions of the bicycle can be actions, sensations,
opinions, memories, etc. Any kind of possible response (potential or actual) is called
responsivity. In brief, significance is a grounded semiotic responsivity, while sig-
nification is an active responsivity. In the case of the bicycle, the agent could execute
the action of using it, activating the responsivity of the action. Alternatively, the
agent could simply remember his childhood when he was starting to ride a bicycle,
activating the responsivity of remembering. In this way, the potential significance
of the bike becomes actual signification to the agent. The matching process between
significance and signification can be assessed in relation to the agenda of the agent.
Correction criteria can be introduced into the activity of signifying, but only within
the framework established by the agent’s agenda.
Taking into account these theoretical foundations, the main goal of this chapter
is to set out the applications of agentive semiotics in professional design practice.
The theoretical structure that was built upon the foundations of Niño (2015) is linked
with design semiotics, in order to analyze the significance conditions of artifacts.
In addition, issues taken from Elementos de Semiótica Agentiva are developed and
shown in the application proposal.
3. A
pplication of the agentive approach to design
semiotics
Describing and analyzing the artifact’s significance conditions enhances the design
process, because the designer will be able to identify the conditions that determine
the user’s response to an artifact. Therefore, the designer obtains tools to conceive
of better and more efficient artifacts based on more thorough knowledge of the
foundations of users’ signification processes.
The conditions of significance of artifacts can be divided into three axes of analy-
sis. The first axis is the conditions of the agency in order to analyze the user. Second,
the conditions of the agenda required for the analysis of the design. And third, the
conditions of the material world so as to analyze the artifact. Significance conditions
shape and determine the possible response of any agent.2
2 See Mendoza Collazos (2015) for a more detailed exposition of these conditions.
90 Juan Carlos Mendoza Collazos
biographical background, the description of the user’s self, that is, his personality
as it has been shaped by experience; (c) the bodily-experiential background, which
refers to the user’s kineto-perceptual patterns determined by the influence of the
material world and the environment; and (d) the memory systems, the information
stored in the user’s long term memory. This information constitutes a system com-
prised of episodic memory, i.e. the user’s events and experiences; semantic memory,
i.e. the user’s entrenched cognitive models and meanings; skill memory, i.e. the
user’s miscellaneous skills acquired by training; and finally, emotional memory,
i.e. the affective pattern that the user displays in his life span and for specific events
or circumstances.
The structure of the agendas is important for ascertaining the motivations for
the user’s actions. Agendas are driven by the agents’ actions and goals; as we have
seen, an agent is an entity with a capacity for action; and agenda is the goal or
type of outcome that an agent seeks to obtain. Agendas can be general and vague
(e.g. calling) or specific (e.g. calling the police on a cell phone for help). Agendas can
be complex (e.g. reducing the crime rate) or simple/basic (e.g. dialing a number on
a cell phone). Agendas are embedded into one another. The structure of an agenda
comprises the main agenda, sub-agendas, and goals.
For example, the main agenda in calling someone by cell phone constitutes
the reason for the call (e.g. asking for help); the sub-agenda is achieving effective
communication (e.g. managing to talk to someone); and the goals are the simple ac-
tions inherent in the act of calling (e.g. hold the cell phone, dialing, etc.), see Figure 3.
agendas are aims that the agent seeks to achieve through his or her actions, they
can be completely achieved, partially achieved, or not achieved at all.
In agentive semiotics there is a special kind of agenda known as per-agendas.
The accomplishment of per-agendas depends on actions performed by other agents.
For example, in product design, the designer has a per-agenda because his goals are
achieved by user’s actions. An agent’s per-agenda seeks to persuade other agents
to act in a certain way. For this reason, the technical term per-agenda employs the
Latinate prefix per, used in words such as persuasion, to persuade. Thus, if the
designer seeks to persuade users, for example, to dial a cell phone in a certain way,
that designer’s per-agenda could be to dial using voice recognition rather than
tapping the keys. Thus, the designer’s per-agenda is only accomplished if the user
dials using the voice recognition tool.
Further, agentive semiotics introduces the notion of agentive scene. The agen-
tive scene must be analyzed by designers in order to accurately evaluate the
artifacts or improve the designer’s choices in the design process. The agentive
scene is comprised of a base scene and a semiotic scene; a base scene is the frame
in which the agentive meaning emerges, while the semiotic scene is where the
agent focuses his/her attention in order to accomplish the agenda. It comprises the
environment or physical background, whereas the semiotic scene is the focus of
attention and signification.3 In the typical relationship between agent and environ-
ment, the base scene and semiotic scene are one and the same. That said, there are
events in which the base scene (background, material world) is inactive and the
agent focuses his/her attention on a “virtual” semiotic scene, for example when
an agent remembers, imagines, or engages with a movie. For example, when an
agent uses a hammer, the base scene and semiotic scene are fused and they are the
same, but when the agent stops hammering, say, to remember his girlfriend, the
agentive scene breaks into two scenes. Also, when an agent is watching a movie,
the base scene is inactive and the agent focuses his attention and signification on
the movie’s actions. The agentive scene is divided because the physical actions
of the agent (sitting on a cinema chair) in the base scene do not correspond to
actions that he is giving significations to. This division of agentive scene can be
useful for the analysis of all interface types.
The final point to be considered in this analysis of the conditions of agency is the
conditions of goal achievement, based on the notions developed by Austin and Searle
(Austin 1962; Searle 1969, 1983; Searle and Vanderveken 1985). In agentive semiotics,
these conditions are related to the agenda’s conditions and are expanded to all
type of acts/actions, not only applied to verbal communication. Details about these
adaptations are explained by Niño (2015: 96). The conditions of goal achievement
describe the acts and the agent’s actions to accomplish his agenda. The conditions
3 However, the agentive scene is a phenomenologically indivisible unit, and its division
into two scenes here is only for the purpose of facilitating the analysis and presenting
noteworthy distinctions.
92 Juan Carlos Mendoza Collazos
by using an artifact established by the designer, while the latter are necessary con-
ditions that must be fulfilled. The designer proposes the ideal way of accomplishing
the agenda of taking notes, for example, using an electronic tablet to fulfill the at-
tainment conditions; thus, he defines the time and resources required to accomplish
the agenda of taking notes.
lyzed with respect to the actions required to fulfill their functions. This is a novel
perspective on analyzing an artifact’s functions. According to agentive semiotics,
actions are broken down into kinetic actions, intellective actions and expressive
actions. These types of actions are present and interlink with one another in
any artifact, but any one of them may receive more emphasis depending upon
the main artifact’s function. Thus, there are artifacts that encourage intellective
actions, e.g. electronic tablets, books, and calculators. Others bring about kinetic
actions, such as locomotion, manipulation and exploration. Locomotive actions
are present in bikes, cars, skates; manipulative actions are prompted by joysticks,
keyboards, or hammers; explorative actions are evident in binoculars, flashlights,
maps, etc. Finally, expressive actions can be found in guitars, rings, speakers,
medals, etc. This taxonomy is useful for classifying artifacts according to their
functions.
Artifacts are described in the agentive approach on the basis of multimodal or
intermodal ways of obtaining information from the environment. The information
captured is related to the sensory organs and encompasses sense, sound, sight,
touch, and taste. For example, visual information allows an artifact to be described,
in terms of colour, shape, texture, position relative to body, position relative to
other objects, movement, etc. Thus, the agentive approach always focuses on the
agents and differs from conventional design descriptions that employ technical
terminology.
Finally, to establish the artifact conditions, a designer analyzes the usability
conditions. These are related to an artifact’s characteristics and features. Usability
conditions are composed of conditions of use and conditions of functioning. The
former are actions that the user has to take in order to make effective use of the
artifact, while the latter are required to achieve the artifact’s goal. If the agent
performs the tasks properly, but the conditions of functioning do not obtain,
the agent has not achieved the artifact’s function. For example, the conditions
of use for a washing machine include the following usage sequence: connecting
to electricity, pressing the on/off button, putting the laundry in, pressing the
wash cycle button, and so on. A washing machine’s conditions of functioning are
technical features and mechanisms that actually make the machine work. When
the conditions of functioning fail (the washing machine does not work) for any
reason, the user cannot achieve the agenda (washing clothes) even if the con-
ditions of use are followed correctly. It should be noted that the conditions of
attainment (Section 3.2) coincide with the conditions of use, but there is a subtle
difference: the conditions of attainment are the ideal way of fulfilling an agenda;
the designer proposes a method for doing so (time and resources including ar-
tifacts), but following this proposal is optional and users can possibly find a
better way based upon their own agendas. On the other hand, conditions of use
are the prescriptive way of using a given artifact. These conditions are stated in
the user manual or instructions.
96 Juan Carlos Mendoza Collazos
Figure 8. E
naction complexity as the inverse proportion of agentive resources and
cognitive economy (prepared by author based on Niño 2015)
properly, e.g. the user makes a call by means of cell phone in a concert hall when
the concert is in progress. Finally, axiologically inadmissible failure is when the
artifact is used, for example, to kill someone. This detailed classification of failures
is useful to the designer because he can establish the cause and type of failure and
can redesign the artifact accordingly.
Figure 9. F ailures in matching the significance of use (prepared by author based on Niño
2015)
4. Conclusions
Design semiotics with an agentive approach provides a fresh insight into users’
actions and experience of using artifacts. As shown in this chapter, the approach
offers semiotic tools that permit, for example, analysis of types of failures from
Design Semiotics with an Agentive Approach 99
semiotics. Once again it is necessary to acknowledge the work being done, because
it has established itself as the main approach involved in the empirical investigation
of educational processes from a semiotic point of view (Kress, Jewitt, Ogborn, and
Tsatsarellis 2014; Kress and Selander 2012). Our aim here is to indicate an alternative
approach within the framework of cognitive semiotics.
Cruse 2004). In a similar way, we should not equate the collective map of the Mole
concept with the scientific concept, because the process of gradually constructing
relevant conceptual relations from the first-person perspective of a learner is dif-
ferent from the established third-person perspective of a scientific community of
practice, and it will also be different from its gradual development within a dialogical
second-person perspective in the history of science. What is common to those
situations is not the “collective map” but the differential grounding of conceptual
relations in actual problem situations. How these situations might be related is an
empirical issue that can be investigated from the point of view of didactics, cognitive
semiotics and the history of science.
After the “cognitive turn” a secondary shift in pedagogical theories occurred
towards more dialogical and social dimensions of science learning, and this has led
to a renewed interest in John Dewey and Lev Vygotsky (1896–1934). It has also led
to a recognition of semiotic aspects of science learning such as the scientific literacy
problems associated with the acquisition and mastery of multiple representational
forms within the disciplinary discourses of, for instance, physics (Linder 2013)
and biochemistry (Schönborn and Anderson 2006). Educators and researchers find
themselves dealing with didactic difficulties beyond the purely disciplinary content
insofar as these difficulties require the linguistic forms of content knowledge (i.e. its
lexical, grammatical, narrative and rhetorical forms) to be analysed and planned
for in their own right. It is an important point that different representational forms
have different disciplinary affordances for supporting student learning (Ainsworth
2006; Fredlund, Airey and Linder 2015; Fredlund, Linder, Airey, and Linder 2014).
In the following section selected aspects of the history and didactics of physical
chemistry are introduced as a case study. After a historical introduction to the
conceptual development of chemical reaction kinetics, we will turn to the issues
of scientific language and imagination as conceptualized by the founding fathers
of physical chemistry.
Without assistance students do not always take notice of the literacy issues posed
by the multiple representational forms they encounter in science learning. As one
student (who did focus on the literacy issue) said in discussing the difficulties of
learning biochemistry: “It is like learning a new language”.
place as reversible processes, where forward and reverse reactions occur simulta-
neously, and when these processes find an equilibrium, it means that forward and
reverse reactions occur at the same rate. In introductory chemistry teaching, equi-
librium is sometimes misunderstood by students as a static state where no processes
occur, and this was in fact a widespread belief in earlier chemistry (Laidler 2000).
For a simple reaction with two substances A and B being transformed into the
products C and D with stoichiometric coefficients a, b, c, and d, a reversible reaction
can be written as in (1). The law of “mass action” would then predict an equilibrium
constant Keq for this process to be as in (2), where the brackets indicate concen-
trations of the involved substances.
(1) 𝑎𝐴 + 𝑏𝐵 ⇌ 𝑐𝐶 + 𝑑𝐷
(2)
The “law of mass action” is still taught in high school chemistry, because it correctly
states the equilibrium constant for simple reactions, but it is arrived at through an
argument about rate laws that is not generally valid (Laidler, Meiser and Sanctuary
2003). In order to explain this, we will go back to the original discovery of reversible
reactions and its relation to the concept of a chemical equilibrium.
The idea of a chemical equilibrium in reversible reactions had been introduced
at the beginning of the 19th century by Claude Berthollet (1748–1822), a professor
of chemistry at the École Normale Supérieure in Paris. Berthollet knew a one-way
reaction from the laboratory, where sodium carbonate and calcium chloride (both
dissolved in water) would react and form calcium carbonate and sodium chloride
(ordinary salt). This exothermic (heat producing) reaction can be written as in (3).
(3) 𝑁𝑎2𝐶𝑂3 + 𝐶𝑎𝐶𝑙2 ⟶ 𝐶𝑎𝐶𝑂3 + 2 𝑁𝑎𝐶𝑙
In 1798, Berthollet accompanied Napoleon as a scientific advisor on an expedition
to Egypt, and there he noticed large deposits of sodium carbonate (“soda”) around
the edges of local salt lakes. This caused him realize that these deposits of 𝑁𝑎2𝐶𝑂3
had been formed by a reverse process to the one he knew from the laboratory, and
this process was caused by the excessive concentration of salt in the lake exposed to
intense heat and evaporation of water. Concentrations could act as a kind of force
“pushing” the reaction in the reverse direction. According to Berthollet, chemical
“affinities” were a manifestation of a universal attraction, and in this way chemistry
was articulated conceptually according to the ideal of Newtonian physics (Califano
2012). Back in France, Claude Berthollet published his findings in Essai de statique
chimique (1803) (Holmes 1962; Weller 1999).
It was with a background in Berthollet’s discoveries that half a century later
Waage and Gulberg investigated the equilibrium constants of different reversible
processes and formulated the law of mass action. Conceptually, the idea of an
equilibrium constant arose from the proportional dependency of both a forward
and a reverse reaction. For both reactions viewed in isolation the reaction rate was
108 Michael May, Karen Skriver, Gert Dandanell
formulated as proportional to what Waage and Gulberg called “active masses” with
each concentration in the exponential power of its stoichiometric contribution to
the process, as in (4).
(4) rate of forward reaction = 𝑘𝑓[𝐴]a [𝐵]b
rate of reverse reaction = 𝑘𝑟[𝐶]c [𝐷]d
At the point of equilibrium, the rate of the forward and reverse reactions will be
exactly the same, and this is how the equilibrium constant Keq in (2) is derived as
a fraction, i.e. by isolating the two rate constants kf and kr on the left side of the
equation and isolating the “active masses” on the other side. There is a philosophi-
cally important point in the concept of chemical equilibrium with regard to the
separation of a macroscopic level of observable or measurable phenomena and the
microscopic level of the dynamics implied in explaining chemical reactions. Equi-
librium is, in a sense, static at the macroscopic level, i.e. when it is attained it does
not change unless something disturbs the chemical process, whereas it is dynamic
at the microscopic level. This is what is sometimes misconceived by students in
introductory chemistry, when they believe that equilibrium means that processes
have come to a halt.
The problem with the formulation of Waage and Gulberg is the so-called rate
laws in (4). They are true for simple reactions with no intermediate steps, where the
stoichiometric constants will be numerically identical to the exponents of the con-
centrations in the rate laws, but this will not generally be the case. Since students,
however, are introduced to the simple cases first, they are very likely to make the
false assumption that the rate laws can be seen directly in chemical equations. Learn-
ing that reaction rates have to be measured experimentally is therefore a critical
point, i.e. they cannot be inferred directly from the stoichiometric expressions. This
turns into a literacy issue that is aggravated by some textbooks, because they repeat
the familiar law of mass action (where reaction rates appear to be determined from
reaction equations) while at the same time stressing that reaction rates have to be
measured experimentally. If this discrepancy is not addressed explicitly, students
will be confused, i.e. they will not understand why the experimental procedure is
necessary.
Other contributions to the growing field of physical chemistry were made by the
French chemist Henry Louis Le Châtelier (1850–1936) who investigated the conser-
vative nature of chemical equilibrium under disturbances (expressed in “Le Châtelier’s
Principle”), and the Swedish chemist Svante Arrhenius (1859–1927) who investigated
the temperature dependency of chemical reaction rates (expressed in “the Arrhenius
equation”). Physical chemistry as a coherent and mathematically expressed domain
was, however, established on the basis of these developments as well as independently
discovered by the Dutch chemist Jacobus van’t Hoff (1852–1911) in collaboration
with the Latvian-German chemist Wilhelm Ostwald (1853–1932). Jacobus van’t Hoff
published his research in chemical kinetics in an influential volume on Études de
dynamique chimique (Studies in Chemical Dynamics) in 1884, and in 1887 he and Ost
wald jointly published the first journal in the domain, the Zeitschrift für physikalische
Cognitive Semiotics of Science 109
Figure 1. L
eft: Jacobus van’t Hoff (on the left) visiting Wilhelm Ostwald in his laboratory
in the Department of Physical Chemistry at Leipzig University (around 1900).
Right: van’t Hoff cardboard molecular models, Museum Boerhaave, the Dutch
National Museum for the History of Science and Medicine, Leiden. Creative
commons (Dragicevic and Jansen, 2012).
3 The Nobel prize in chemistry was given in turn to the founding fathers of physical
chemistry: first to van’t Hoff in 1901, then to Arrhenius in 1903, and finally to Ost-
wald in 1909. Ostwald not only created the first journal of physical chemistry, but also
headed a Department of Physical Chemistry at Leipzig University (Van Houten, 2002).
110 Michael May, Karen Skriver, Gert Dandanell
use of spatial models to represent molecular structures (Figure 1, right). His work
in La Chimie dans l’espace (1878) was harshly criticised by one of the important
contributors to organic chemistry, Hermann Kolbe (1818–1884), who found that his
speculative use of spatial models was “quite incomprehensible to the sober chemist”,
and Kolbe even ridiculed van’t Hoff’s teaching of applied chemistry to veterinary
students in Utrecht. Kolbe claimed that the theory of spatial structures did not count
as “exact chemical research”, but had to be the result of a preference for poetic
speculation: according to Kolbe, van’t Hoff had found it “more convenient to mount
Pegasus (borrowed, no doubt, from the Veterinary School) and to proclaim in his
‘La Chimie dans L’Espace’ how on his daring flight to the chemical Parnassus the
atoms appeared to be arranged in space” (Benfey 1960). This personal attack actually
contributed to making van’t Hoff famous in the European chemical community.
In an inaugural talk for a new position as professor of chemistry at the University
of Amsterdam in 1878, van’t Hoff chose to speak – not directly of his contributions
to physical chemistry – but of “The Role of Imagination in Science” (van’t Hoff
1878). This apparently surprising topic was motivated by Kolbe’s attack. Rather
than excusing his “poetic” exploration of spatial structures, he listed a number of
cases from the history of science, where the creative or even artistic production
of scientists had played an important role in their scientific discoveries. Imagin-
ation (in the Dutch original, “De Verbeeldingskracht”) should be seen as necessary
to scientific discovery, because observation of scientific facts is not even possible
without an imaginative anticipation of events and recognition of similarities with
past events, and neither without a posterior simplification and conceptualization
of the observed phenomena. Specifically, van’t Hoof addresses the role of imagin-
ation in explicating the relation of cause and effect in the sciences, and imagination
is understood as “the capacity to visualize a particular thing so clearly, that all its
properties can be recognized with the same certainty as if the object was directly
observed” (van’t Hoff 1878). He stresses what Charles S. Peirce would have called
abductive reasoning, i.e. the role of imagination in hypothesizing about possible
causes behind observed regularities.
One of van’t Hoff’s examples was the work of the German chemist Eilhardt Mit-
scherlich (1794–1863), who had noticed similarities between different salts such as
ammonium arsenate (NH4)3AsO4 and ammonium phosphate (NH4)3PO4. These are
different chemical compounds with different affordances and properties (e.g. am-
monium arsenate is highly toxic and ammonium phosphate is highly unstable), but
what was striking to Mitscherlich was that the structural analogy of the composition
of the two compounds seemed to be approximately reflected in a similarity of their
crystalline structures. The visual similarity of the salts as crystalline structures was
therefore hypothesized by Mitscherlich to be an effect of an underlying structure at
the molecular level, and he stated this as an (approximate) empirical law of isomor-
phism (Freund 1904; Ihde 1964). As stated by van’t Hoff, imagination was needed for
this discovery to be made: “if the crystal form of the first was not vividly present to
him during the observation of the second [compound], the correspondence between
them could not have occurred to him” (van’t Hoff, 1878). More importantly, how-
Cognitive Semiotics of Science 111
ever, imagination had to play a role in the formation of the fundamental analogy
between the microscopic level of structural organization and the macroscopic level
of observable crystalline forms. The discovery of Mitscherlich was important for
van’t Hoff, because it was a precursor to his own discovery of 1874, in which he ex-
plained the isomerism4 of certain organic compounds in terms of the stereochemical
properties of carbon bonds (Ramsay 1981).
With his emphasis on the importance of imagination in scientific discovery, van’t
Hoff implicitly recognized its importance for teaching and understanding chemistry.
He noted that imagination would not always have a constructive role, but could find
a “pathological” expression in superstition and hallucination5, and he also indicated
that its role in modern science had shifted with the extensive use of collaborative
laboratory work: “the imagination can now be replaced by the sacrifice of a great
amount of labour” (Benefit 1960).6 As noted above, van’t Hoff pioneered the use
of spatial models to represent molecular structures. He introduced diagrammatic
representations of molecular structures and also produced physical models as aids
to visualization (Figure 1, right). The latter was actually meant to assist his research
colleagues (van der Spec 2006), but they would have also played a didactic role in
teaching organic chemistry. The importance of molecular models in chemistry was
underestimated or even “forgotten” in classical philosophy of science, although they
played an important role in chemical scientific practice (Rancour, 1997), but this is
not the case in more recent cognitively oriented philosophy of science, where the
role of conceptual tools and material artefacts in scientific modelling – in different
ways – is seen as a central topic (Baird 2004; Gere 1992, 2006; Nersessian 2008).
Wilhelm Ostwald, on his part, was deeply involved in philosophical aspects of
natural science. He wrote a treatise on Natural Philosophy (Ostwald 1910), and he
defended a form of “energetics” that denied the material existence of atoms. Stipulat-
ing energy as more fundamental than matter was not unusual at the time, but his
4 Isomerism had been introduced as a concept by the Swedish chemist Jöns Jacob
Berzelius (1779–1848) to describe the existence of chemically different substances,
isomers, that have the same composition but different structures and different
chemical properties. A simple example is the chemical composition C2H6O which
has two isomers, the liquid we know as ethanol (or ethyl alcohol) with the structure
CH3CH2OH and the gas dimethyl ether with the structure CH3OCH3. Isomers are an
important concept in organic chemistry because of the spatial combinations made
possible by carbon bonds.
5 A more mundane expression would be the conceptual mistakes resulting from mis-
leading analogies such as the one discussed in Section 3.2 on the concept of an ac-
tivation energy barrier.
6 This could be read as a sarcastic reply to the criticism of Kolbe as an experimen-
tal scientist: a lack of imagination might be compensated by a massive amount of
laboratory work, i.e. investigating “in all directions and by trial and error” might
eventually produce results that could have also been arrived at through abductive
reasoning!
112 Michael May, Karen Skriver, Gert Dandanell
opposition to a mechanical world view went further into a unitary foundation of all
sciences including not only physics and chemistry, but also biology and sociology.
As a chemist he paradoxically wanted a “Chemie ohne Stoffe”, i.e. chemistry without
substances (Holt 1970). The “ideological” conception of an “energetics” opposed to
mechanical physics can be seen as a part of the perceived “crisis of physics” at the
end of the 19th century, but it was widely opposed by leading German physicists
including Ludwig Boltzmann (1844–1906) and Max Planck (1858–1947) (Krogh,
2015). His anti-atomism was effectively refuted by Albert Einstein’s famous 1905
paper on observable Brownian motion, where the jerky motion of small particles in
liquid suspension was interpreted as a consequence of molecular motion according
to the kinetic theory of heat (Ren 2005). From our present perspective it can be dif-
ficult to understand the position of Ostwald, but it should be noted that chemists
debated the existence of atoms throughout the 19th century. This is an expression
of a difference in the level of reality articulated by physics and chemistry: for the
chemist at the end of the 19th century it could still make sense to treat atoms and
molecules as mathematical fictions or conceptual artefacts, but this was at a time
when experimental physicists like J.J. Thomson (1856–1940), Ernst Rutherford
(1871–1937), Wilhelm Rontgen (1845–1923) and Marie Curie (1867–1934) were al-
ready doing things to atoms7 (Pays 1982).
As a chemist and natural philosopher Wilhelm Ostwald took a practical interest
in science as a language, an interest that also grew out of his work in translating
key papers in chemistry into German as he had done with Waage and Gulberg, and
later with the work of Gibbs on thermodynamics. Ostwald saw that science suffered
from a language barrier created by the national languages used by his colleagues in
chemistry, and he saw the need to establish an international language for science.
Beyond the language barrier, however, Ostwald identified another problem: natural
languages had evolved for practical communication in everyday situations, but they
were inconsistent in their grammar and inherently vague in their semantics. Science
needed a special language to supplement mathematics, i.e. a universal world lan-
guage (“Weltsprache”) that had to be constructed for the specific purpose of scientific
communication (Gordin 2015). This should remind us of the rationalist dream of in-
7 One should reference here the famous phrase of “entity realism” by Ian Hacking
recalling an experiment at Stanford University that convinced him of the reality of
electrons. He had observed the manipulation of a macroscopic object in an electrical
field through a process of “spaying” electrons on the object, and how this affected
the behaviour of the object: “So far as I’m concerned, if you can spray them, then
they are real” (Hacking 1983). Although electrons might have started out as con-
ceptual artefacts in physics, this changed when we began to use electrons as tools
in experiments and engineering: they are considered real when we can use them
instrumentally. This instrumental form of realism might, however, be too narrow
to account for scientific practice, and in another sense it could be illusory, because
manipulation without specific knowledge of what is being manipulated cannot be
sufficient for a sound scientific realism (Gelfert 2003).
Cognitive Semiotics of Science 113
8 Translation of science into national languages such as French, German, Spanish, Rus-
sian and Chinese is still an influential factor for the knowledge available within local
scientific communities, but the more equitable distribution of scientific languages
that existed at the beginning of the last century seems to have broken down after
World War 1, specifically with regard to scientific German and scientific French
(Gordin 2015). As pointed out by Gordin, the transition to scientific English was easy
in mathematics and natural sciences by the role of formal languages and notational
114 Michael May, Karen Skriver, Gert Dandanell
3. Th
e different perspectives of kinetics and
thermodynamics
The different approaches of Gibbs and van’t Hoff to physical chemistry are sometimes
portrayed as a matter of personal style (Deltete and Thorsell 1996), but this is a distortion
and simplification of the issue. Although they both contributed to chemical thermody-
namics in a form which is recognizable in current textbooks, the actual conceptual
content is divergent. This is not just a simple issue of terminology and nomenclature,
even though some interesting points can be made at a purely lexical level of analysis.
Figure 2. Network visualization9 of word frequencies in (Gibbs 1906) and (van ‘t Hoff 1896).
surface
9 The method used requires three levels of measurement, data manipulation and
visualization. Word counts are performed on digital copies of the texts using the
open source software TextSTAT (TextSTAT, Version 2.9, 2014, Matthias Hüning,
http://neon.niederlandistik.fu-berlin.de/textstat/) and normalized word frequencies
are calculated in order to arrive at relative frequencies within the individual docu-
ments. Singular and plural forms were combined in the count (e.g. “salt” stands for
“salt” and “salts”). Ordinary words in English were filtered from the sample, and an
116 Michael May, Karen Skriver, Gert Dandanell
The distribution and frequency of lexical units in technical English is just to be taken
as an indication of more conceptual differences in the organization and presentation
of topics. The status of physical chemistry in research and education is a key issue
in the philosophy of chemistry, and the potential “derivation” of chemistry from
physics is a relevant problem (Brakel 2000). Today, the frontline of research would
be in quantum chemistry, involving questions such as: can the observed chemical
properties of the elements of the periodic table be derived from quantum physics?
(Scerri 1994).10 In the present context of reaction kinetics and chemical thermody-
namics, the issue is not reductionism, however, but the difference in perspective of
the two approaches as conceptual constructions.
The didactic problem of the different perspectives is mainly that they are not
addressed explicitly. When students first learn about physical chemistry (in high
school), they are introduced to reaction kinetics as a discipline based on empirical
observations of reaction rates; they learn about the “mass action law”, and event-
ually they might also derive the integrated rate laws and become familiar with their
mathematical and graphical representations. Thermodynamics of chemical reactions
is introduced at more advanced levels of high school chemistry. For students who
choose to study chemistry, biochemistry, pharmacy, molecular biology or chemical
engineering at university, the order of presentation of these topics will be drastically
reversed: courses and textbooks on physical chemistry present topics deductively
starting from the abstract principles of thermodynamics and statistical mechanics,
i.e. from the point of view of Gibbs, and reaction kinetics appear as an application
of thermodynamics.11 It is well documented that this creates conceptual problems
arbitrary number of top level frequency words were selected (25 from each text).
These word lists were then used to produce node and edge files to import to the
Gephi visualization software (Gephi 0.8.2, 2012, http://gephi.org). Word frequencies
are approximately represented as line thickness of the edges. The selected layout
algorithm is Fruchterman Reingold.
10 This issue was raised originally by Paul Dirac (1902–1984) in 1929 in a famous paper
on “Quantum Mechanics of Many-Electron Systems”, when he optimistically claimed
that chemistry could – at least in principle – be derived from quantum physics.
The reservation of Dirac was mainly that the known mathematical physics might
be computationally too difficult: “the difficulty is only that the exact application of
these laws leads to equations much too complicated to be soluble” (Simões, 2002).
11 Even an author like Keith Laidler, who is engaged in the history of his discipline (cf.
Laidler 1985), follows this deductive approach in his textbook on physical chemistry
(Laidler et al. 2003): the basic principles of chemical kinetics are introduced only
after 360 pages of chemical thermodynamics. This is scientifically correct, of course,
although it might be a didactic advantage to approach the topic more inductively
(e.g. from experimental observations) and interwoven with the historical develop-
ment (which retroactively anchored the discipline in thermodynamics and quantum
chemistry). We should also be aware that there are disciplinary differences within
thermodynamics as presented in e.g. physics, physical chemistry and mechanical
engineering (Christiansen and Rump 2008).
Cognitive Semiotics of Science 117
Figure 3. A
mechanical analogy to explain the concept of an activation energy barrier
as seen in a textbook on general chemistry (Holtzclaw, Nebergall and Robinson
1984)
Cognitive Semiotics of Science 119
12 In some contexts there is even a reference to the myth of Sisyphus and his never
ending effort of pushing a stone uphill, see for instance the http://chemwiki.ucdavis.
edu on the Arrhenius law and activation energies.
13 In the theory of Cognitive Grammar, the concept of construal refers to the ability of
human speakers to construe the same basic situation in many different ways. That
is, the linguistic expression of a conceived situation can be structured and modified
by the focus on particular aspects of the situation, by the level of detail with which
it is described, and by the perspective from which it is viewed (Langacker 1987).
Perspective in turn includes a number of phenomena such as viewpoint, deixis and
the construal of objectivity and subjectivity.
120 Michael May, Karen Skriver, Gert Dandanell
The two steps of an enzymatic reaction are the binding of a substrate (S) to an
enzyme (E) forming an intermediate complex ES; the catalytic step produces the
product (P) and releasing the enzyme. Understanding the mechanism of catalytic
and enzymatic reactions involves an argument from thermodynamics about “free
energy” usually accompanied by a reaction coordinate diagram as shown in Figure 5.
The concept of free energy was proposed by Gibbs and by Helmholtz in two different
versions (for different constraints on volume, temperature and pressure), but with a
similar intention of explaining why reactions happen according to thermodynamics
and avoiding the allusive concept of chemical “affinity”.
van’t Hoff followed Helmholtz in considering the free energy as the potential
of a chemical system to do work: the free energy (not “wasted” as heat, that is
as an increase in entropy) is the maximal work realized in a reaction (at a fixed
temperature and volume). Chemical equilibrium can then be reconceptualized as
the dynamic state where this “work of affinity” is minimal (Kragh 2001). The per-
spective of Gibbs on free energy, on the other hand, was on the development of
entropy of a system under constant pressure. In the diagram in Figure 5, ∆ G++
denotes the change in “free energy” (G for “Gibbs energy”) for the two transition
states corresponding to the energy barrier with and without catalysis. The positive
free energy describes the energy barrier, whereas the negative free energy de-
scribes the difference (∆ G) of energy released in the reaction (notice that ∆ G is
not changed by catalysis).
In a biochemistry course at the University of Copenhagen14 we observed that
some students wondered why a given enzyme-enhanced reaction occurs at all con-
sidering that the energy barrier is not “removed” but only “lowered” by the action
of enzymes. Commenting on the reaction coordinate diagram, one student asked:
“Why does a reaction happen, when the activation energy barrier is only lowered?”
This reveals a misconception about the nature of the barrier as an absolute barrier
that can be “removed” rather than a statistical conception, although the question
itself is meaningful and the full answer quite complex, because the concepts of
“free energy” and “reaction coordinate” themselves are inherently quite complex
(Keeler & Wothers 2003).
Figure 5. “ Reaction coordinate” diagram used to explain the action of enzymes with
regard to the lowering of the activation energy barrier, simplified after
(Nelson and Cox 2008)
4. Conclusion
Future research will have to investigate the nature of different literacy issues in
science education and the interwoven conceptual issues in the history and didactics
of science. In physics education, the recurrent conceptual problems of students are
no longer seen as a sign of motivated and stable misconceptions, but rather as a
sign of a conceptual struggle of disambiguation. In other words, students will try
to figure out what the ontology of physical concepts could be across the multi-
tude of descriptions, equations, metaphors, analogies, visualizations and experi-
ments they encounter through the disciplinary discourse and practice of physics
(Brookes and Etkina 2009). It will accordingly be important not to reduce the role
of representations to a question of “language” in the narrow sense of terminology,
or a question of a “rhetoric” of science. In summary, a number of significant issues
for future research can be stated as follows:
• We need a semiotics of science rather than a purely linguistic approach because
of the multimodal nature of representational forms used by instruments, models
and theories in scientific practices (Giere 2006; Baird 2004),16 and because of the
importance of multiple representations in science learning (Ainsworth 2006).
Ronald N. Giere’s proposal of a “scientific perspectivism” is important, but the
foundation of scientific instruments and models as distributed cognitive systems
has not been worked out in detail (Brown 2009). Scientific perspectivism can be
seen as a “quasi-semiotic” theory in the way it stresses representational models
and distributed cognition, but this relation to semiotics should be explicitly
elaborated.
• We need a cognitive semiotics of science because of the transdisciplinary nature
of key problems (Zlatev 2015a). Issues such as scientific literacy, ontological
16 This is also stressed in the multimodal approach of social semiotics (Kress, 2010;
Kress, Jewitt, Ogborn, and Tsatsarellis 2014), but as mentioned earlier the conception
of semiotics is made problematic by the idiosyncratic nature of systemic functional
linguistics and the misrepresentation of foundational works in semiotics.
Cognitive Semiotics of Science 123
1. Introduction
Cognitive semiotic theory, as a systematic study of meaning-making, offers a rich
opportunity for considering children’s gradual development from early conscious-
ness of perceptual and movement experiences, to representational consciousness
and language. The present paper develops a cognitive semiotic theory of the tran-
sition into language based on empirical findings interpreted through both cognitive
(Piaget 1962; Werner and Kaplan 1963) and Peircean theory. For the latter I rely on
the work of colleagues who have made strong inroads in the process of integrating
developmental and semiotic approaches (e.g. Daddesio 1995; Sonesson 2007; Zlatev
2009a; 2013; Lenninger 2012). Semiotics can be defined as (a) the study of signs,
which is the historically more common description or, more broadly, (b) the study
of all meanings. Under (a) there is variation in what constitutes a sign, but only
sign-based meanings are included in semiotics. Under (b), favoured by many in
cognitive semiotics, all experience of meaning (even sensation) can be considered
semiotic (i.e., meaningful), but only some special kinds of meanings are signs. From
a developmental perspective, equating semiotics with meaning, while allowing
signs a developmental course of emergence is most useful and will be adopted here.
The field of cognitive semiotics requires a developmental perspective in order to
track the time and growth trajectory of children’s development of meaning. This
perspective is represented by a number of pioneers in the field including Lenninger
(2012), Andrén (2010) and Zlatev (2009a, 2013). My work has centred on the cog-
nitive basis of children’s transition into language, and initially focused on the under-
lying development of the capacity for mental representation as observed in play.
Since both language and representational play are symbolic modes, I hypothesized
a temporal relationship between developments across these modalities (McCune-
Nicolich 1981a). Children’s expressions in both play and language are interpreted
from a phenomenological perspective (Sonesson 2007). That is, for example, in iden-
tifying a play act at a particular level, evidence of the child’s state of mind is sought
in the accompanying behaviour and attitudes (cf. Bloom 1993; Searle 1992; McCune
2008). While research demonstrated that a given level of language did not appear
until the same time or later than the analogous representational play behaviour,
the temporal distance between these developments indicated that additional behav-
ioural developments, beyond mental representation contributed to the children’s
transition to linguistic productivity (McCune 1995). In fact, the missing element
128 Lorraine McCune
Prior to adoption by cognitive science, the terms representation and mental rep-
resentation were understood as describing conscious mental states. Contemporary
cognitive science employs the term representation to describe information stored
in the brain or utilized in computer simulation. I use these terms to refer to mental
states with the potential for contemplation beyond present perceptual reality. Rep-
resentational states are distinguished from perceptual mental states: the latter are
limited to meanings and experiences of present reality. In the view expressed here,
sensorimotor and perceptual meanings develop during the first year of life. These
initial meanings, considered broadly as semiotic, provide the foundation for the
representational development evident between one and two years of age in both
play and language (McCune 2008).
In semiotics, the use of the term sign usually involves one entity standing for
another in a one-way relationship; the term symbol is reserved for conventional
signs, such as those of a language or mathematics (cf. Sonesson 2007). A represen-
tational intention is the purposeful use of a sign/meaning pairing. Developmentally
the relationship between sign and referent varies based on extent of differentiation
or distancing between them (Werner and Kaplan 1963; Muller, Yeung and Hut-
chison 2013). Werner and Kaplan (1963), to accomplish their detailed analysis of
symbolic development, invented their own terminology. So what for many might
be a “sign/meaning relationship” is, in their terminology, a relationship between
a symbolic vehicle and a referent, while the term sign or signal is used for non-
representational material that cues an animal or human to action or avoidance (ibid:
14). The child’s capacity for building symbolic vehicle/referent relationships develops
and differentiation between the two is gradual, culminating in relationships between
conventional symbolic vehicles (words) and their meanings. It is only at this most
differentiated level of symbolic development that word meanings include syntactic
roles in utterances allowing their grammatical combination.
Peirce’s well-known division of signs into icon, index, and symbol is described
as, in its pure form, having the following relationships between sign and referent:
icon by similarity, index by contiguity, symbol, by convention. Real-world sign
relationships (according to Peirce) are not likely to be purely of one type (Jakobson
1965; Sonesson 1989).1 Any real-world sign-referent relationship is likely to be a
hybrid of two or all three of the basic types. Developmental cognitive semiotics has
the goal of encompassing all forms of meaning (as the study of semiotics in general
does). So the child’s sense of meaning prior to the development of sign relationships
is also of interest.
Both Piaget (1962) and Werner and Kaplan (1963) proposed that the capacity for
learning to use conventional symbols, such as the words of a language, is built on
the earlier development of sensorimotor and perceptual knowledge, and of personal
symbols. Piaget simply terms these “symbols” reserving the term “sign” for con-
1 This may explain why it can be hard for those of us in psychology, unfamiliar with
the nuances of semiotic theory, to deal well with these terms.
130 Lorraine McCune
ventional symbols. So Piaget’s sign is (more or less) a Peircean symbol. At first glance
the difference between them may appear to be one of terminology, but the Piagetian
personal symbol offers an important developmental distinction, also respected by
Werner and Kaplan, and not found in Peirce, as Peirce preferred to consider sym-
bols without reference to concrete minds in the stage of development that might
instantiate them (Daddesio 1995).
The term index also seems to have somewhat different meanings for Peirce
(cf. Sonesson, 2007) and for Piaget (1954; 1962). For Piaget, the index serves to
bring to mind the mental object of contemplation or action. The perceptual experi-
ence of a part of a hidden object, for example, facilitates the action of uncovering
the object. For Piaget there is neither mental differentiation between the part and
the whole object, nor confusion between the two. Rather, perception of the part
brings to mind (leads to a representation of) the whole. At an earlier developmental
point, the sight of a part of a covered object does not lead to expectation of finding
the object as a whole. For Peirce, indices, as all signs, necessarily involve several
independent instances. Using the Piagetian criterion of differentiation, Sonesson
(2007) suggests that the two notions of index can be made compatible, if we take
Piaget to be concerned with the “indexical ground”, or “indexicality”, which is a
prerequisite for something to be an index, but not a sign in itself before the dif-
ferentiation occasioned by the sign function.
Piaget (1962) and Werner and Kaplan (1963) provide parallel analyses of how
personal symbols initially emerge, following earlier capacity for indexical relation-
ships: Piaget through play, Werner and Kaplan through vocalization and gesture,
both influenced by imitation of persons and object motion in the environment. These
developments culminate in the use of conventional symbols: that is, those recognized
as symbols in Peirce’s view. In what follows I use the terms personal symbol and
conventional symbol (or simply symbol) to preserve the developmental distinction
and respect the Peircean terminology.
dimensions. So, important pre-sign meanings include the consciousness that change
is immanent and a simultaneous sense of the anticipated outcome. Early words refer
to events that are reversible over time or space, (e.g. up, out) as well as to names of
objects and people (McCune 2006).
The three major periods of cognitive development proposed by Piaget are termed
(1) sensorimotor, (2) concrete operations, and (3) formal operations. The critical
element in operations is reversibility, which characterizes all three levels. Perhaps
the simplest example of operations that comes to mind is the process of addition and
subtraction. An amount can be increased by adding elements. One can reverse this
operation of addition by one of subtraction: remove the same number of elements
and the total will be as it was before any operations were begun.2 Simple addition
and subtraction are considered to be concrete operations because they can be easily
demonstrated with real-world, concrete items. The systematic control of variables
and their influence typical of the scientific method is considered to involve formal
operations because the operations involved occur in the abstract, but this level is
also characterized by reversibility. Reversibility is first evident in the sensorimotor
period when the child is able to use motor action to go in one direction; then
return to the starting point. Children of one to two years of age are fascinated by
the possibility of reversibility as witnessed in their play involving dumping objects
from a container and refilling it, building a block tower and knocking it down, or
simply stepping up and down on a single step. These activities no doubt contribute
to their internal sense of meaning regarding physical reversibility.
In Piaget’s theory semiotic development is what allows the child to progress
from the sensorimotor period which culminates in operations restricted to revers-
ing literal actions or movements to eventual concrete and formal operations. The
sensorimotor and concrete operational levels are considered analogous with re-
spect to operations, but are separated by the long period from about two years
of age (when mental representation becomes initially available) until eight years.
This developmental period is required to internalize operations and build the con-
tent knowledge, or figurative knowledge (see Section 3.1) that is also required for
concrete operations. The further capacity for abstraction develops between 8 or 9
years of age and later adolescence, supporting the formal operations. These devel-
opments no doubt are integrated with physiological maturation of brain and other
neurological structures. Lenninger (2012) refers to “Piaget’s four-stage model of
cognitive development” (p. 119 et seq.) in an analysis emphasizing semiotics, includ-
ing the period of initial development of mental representation, that is symbolic
development (between sensorimotor and concrete operations), as a developmental
stage. For Piaget, stage change relates to operations only, but the period of initial
symbolic development is of equal importance because without the capacity for
internal mental representation no operations beyond sensorimotor action would
2 One can see the analogy of this “operation” with the earlier pre-sign meaning of
reversible transition in motion from up to down.
132 Lorraine McCune
be possible. As more advanced operations develop, they both utilize and contribute
to semiotic development.
actions that “represent” the real function of the object. They are considered “play”
because they are done for their own sake, and for the child’s own interest. At the
same time, they express early meanings, pre-sign, yet semiotic in the broader sense.
The action of using a comb to mime combing the hair shows comprehension of this
meaning, but the act showing the meaning is the act of combing itself, performed
out of context. Hence, from a semiotic perspective, sign and referent are very closely
aligned. That is, touching the comb to the hair is a sign representing combing the
hair, an act using a highly similar action. Still, the child may be unaware of the
representational correspondence. Hence these acts are transitional, as the child’s
internal sense of meaning is unknown. Further semiotic growth, the development
of explicit mental representation, recognized by observable evidence as described
in the following is an essential component of language acquisition, but it is not the
sole influence.
and simulated act. A look to mother with a smile seems to inquire whether she
recognizes the meaning as well. The action is nearly the same as Level 1, but a shift
toward the representational in consciousness seems to arise. At Level 3, the child
moves beyond his or her own action to include others’ typical activity, or make an
“other” the focus of the action. When the child extends the cup (Level 3) to give
mother a drink, the action embodying the meaning differs crucially from his own
simulated action: he extends the cup to mother, a different gesture from putting it
to his own mouth, adjusted to the representational situation. While these devel-
opments may seem small, they testify to different internal consciousness underlying
these acts and a differentiation of the representation of action (the play) from what
it represents (the real action). Empirical studies (McCune, 1995, 2008) have shown
that these developments occur fairly closely in time between 8 and 12 months of
age for most children. The term “pretend” often used informally seems applicable
to Level 3, but not earlier.
Level 4, the combinatorial pretend, shows greater variability, and is typically at-
tained at 13 to 18 months of age, but often later. Children’s play combinations may
begin as simply as the child pretending to drink, then offering the empty cup to mother
or doll: a combination of a self-pretend act that perhaps suggests the subsequent other-
pretend act. Often a single pretend scheme such as drinking or grooming is played
out with all available animates, child herself, mother, dolls, and experimenter, as if the
child’s goal is to express and experience a given meaning through varied but similar
acts. With development, however, broad elaboration continuing for several minutes
can ensue. A dress-up sequence may include donning adult shoes from the toy set,
putting the toy purse over an arm, and waving “Bye-bye”, simulating a shopping trip.
When play acts are combined in these ways it seems there is an ongoing mental stream
of meaningful consciousness, supported by the played acts. The inclusion of an act in
varied combinations of action and recipient shows growth in both the differentiation
of symbols expressed in play and their potential integration among themselves. It is
at this level that McCune (1995; 2008) identified referential words.
Level 5: hierarchical pretend exhibits further differentiation between the play
act and the meaning it represents as well as even greater variability in the age of
attainment (typically between 18–24 months of age). A significant shift is observed
at Level 5, when the child gives evidence of internally “planning” an act prior to
execution. Such play suggests that the child is now capable of an internal symbolic
or representational consciousness, rather than being dependent on enactment with
objects to support internal meaning. The representational intention can now arise
without external perceptual support, with internal plans guiding external action,
fulfilling Sartre’s definition of imaginal consciousness.
The hierarchical aspect of this behaviour rests on this prior designation. In object
substitution, a prior implicit internal designation guides play, e.g. “For the purposes
of this activity, this block is a telephone”. Prior designation, or planning, can some-
times be identified by the child announcing a play intention, by performing pre-
paratory actions indicating a plan, or by searching for objects needed to accomplish
the activity. When a doll is treated as a centre of its own action (e.g. placing the
Meaning, Consciousness, and the Onset of Language 135
cup in the doll’s hand rather than to her mouth for pretend drinking) the implicit
internal designation of “doll = animate agent” provides the hierarchical structure
for the ensuing activities. Once a play theme is established, at Level 4 or 5, one act
can suggest another and elaborate scenarios can ensue.
This is the first point at which a differentiated sign/meaning relationship that
would allow the development of conventional symbols (words) capable of entering
into grammatical linguistic structures seems plausible. However, iconicity remains
in force throughout all representational play acts, as the child’s actions mime those
she or others have performed with practical goals in the past and the play acts are
personal symbols, as the child chooses the actions or objects that become signs of an
internal meaning. Objects that resemble the real objects are always a first choice for
play action. The Appendix details Piaget’s original analysis of play levels and those
operationalized by Nicolich (1997), providing background for McCune-Nicolich’s
(1981) theoretical analysis of play/language relationships, and empirical study by
McCune (1995).
Where in this sequence does the sign/referent relation fulfil the criterion for
representation of the ability to have consciously in mind some entity or situation
not present to the senses? This capacity develops gradually. While clearly absent
at Level 1, and clearly present at Level 5, intermediate levels can be difficult to
read. Differentiation between the sign and what it represents is a gradual process
(McCune 1993, 2010).
3 Recall that Werner and Kaplan use the term symbolic vehicle for signs in sign/referent
relationships in general, while specifying various levels of differentiation.
136 Lorraine McCune
have a natural resonance with objects and people that surround them, prompting an
internal process of dynamic schematizing. The vocalizations and bodily movements
they describe also fulfil the definition of bodily mimesis (Zlatev 2008a, 2013), and such
internal dynamic schematizing can be understood as a process potentially under-
lying mimetic functions. Over the course of the first two years of life, observable
activities and, no doubt, underlying neurological processes bring about the ability to
create symbols both in play action as above, and in vocalization. Initial pre-symbolic
meanings become gradually differentiated from their close practical relationships
with entities and actions in the world (McCune 1993; 2008). The culmination of the
differentiation process is the integration of conventional verbal symbols (words)
into combinations exhibiting the grammar of their ambient language (Roy, Copley
and McCune 2015). At this point, symbol and referent are differentiated, as these
symbols (words in sentences) include, along with their somewhat directly accessible
definitions, the abstract grammatical properties that allow integration in sentences.
Werner and Kaplan traced the emergence of clearly referential words (those
recognizing classes of elements with the same vocal symbol) along a pathway be-
ginning with sounds produced automatically in certain circumstances (such as the
sounds experienced while eating) and culminating in sufficient differentiation to
allow grammatical combinations. The initial sounds exemplify pre-symbolic mean-
ings: that is, the sound and meaning are unified and experienced as such by the
child. Werner and Kaplan report that both Hildegard Leopold and a child studied
by Lewis (1936) derived their initial vocables, as Werner and Kaplan term these
initial meaningful vocalizations, from the sounds accompanying food ingestion
(Hildegarde: [m]; Ament’s niece, studied by Lewis: [mammam]). The vocable in-
itially “referred to a large sphere of events related to food-getting and food eating”
(Werner and Kaplan 1963: 111). In this example, rather than highly restricted, the
word’s application is diffuse, yet context-imbedded. In theory, the child first experi-
ences the sounds generated while ingesting food, then begins to be reminded of
these sounds in contexts related to eating, for example seeing a desirable food.
This leads to production somewhat separate from the actual eating activity that
originally generated the sounds. For Ament’s niece the vocable mammam began as
a context-limited word related to eating and food, but was gradually differentiated
in both form and meaning through interaction with the ambient language. Context-
limited words are similar in their symbolic status to the self-pretend acts of Level
2 described above. That is, differentiation between symbolic vehicle and referent
is partial.
Werner and Kaplan proposed that children form their initial symbols in the pro-
cess of gradually developing the ability to represent (i.e., consciously experience)
situations internally, in the absence of perceptual support. The steps in represen-
tational play mentioned above also demonstrate this process. Grounding in bodily
experience is seen as critical. An external symbol, such as a word form, is con-
structed in relation to the underlying meaning that it comes to express. As additional
words are learned, the reference of the initial vocable becomes restricted. Ament’s
niece initially (at 354 days of age) used [mammam] in reference to her mother and
Meaning, Consciousness, and the Onset of Language 137
sister as well as bread, cakes, and cooked dishes. By 597 days of age clearly referen-
tial words were produced: [mammam] was delimited to cooked dishes, while bread
and cakes were [brodi] and her mother was [mama], her sister [desi].
Werner and Kaplan, as does Piaget regarding play, attribute these developments
to an underlying process of differentiation. As the child differentiates the sound
produced while eating and producing it in the absence of actual ingestion, on-
going experience of the language accompanying various events of importance to
the child facilitates the development of additional meanings in relation to adult
words. The sounds of objects are also co-opted as early vocal signifiers (e.g. the
tick-tock of a clock coming to refer to all sorts of watches and clocks and similar-
appearing objects). As new meanings come to be used, the original vocalization
becomes restricted in its meaning and use. Then, having established the potential
for meaningful vocalization outside the natural source context, the child begins
to differentiate the wider variety of meanings expressed with various vocal forms
produced by speakers in her environment. That is, due to the developmental pro-
cess of differentiation, the child now recognizes sign/meaning correspondence in
conventional words presented by adults in typical situations.
Werner and Kaplan did not consider the form-meaning link to be arbitrary,
as word-form and meaning are co-constructed through dynamic schematizing.
Words become conventional symbols as they partake of the ambient language and
its grammatical properties. However, word-form (symbolic vehicle) and meaning
remain related at a neurological level throughout this developmental process and
beyond. Dynamic schematizing is defined as the underlying process that allows the
differentiation of varied aspects of meaning and form, for example, supporting the
transition from use of the original form mammam across varied entities related to
eating/food, to the development of separate forms in relation to each of these. At
the same time, specific forms become more integrated with specific meanings, and
can reach hierarchical integration among themselves when produced in sentences.
While from the beginning all of these examples suggest the sign function, a greater
level of differentiation is needed for both the personal symbol of Piaget and [con-
ventional] symbols recognized by both Piaget and Peirce.
Vihman and McCune (1994) distinguished context-limited expressions, such
as the initial mamam, from context-flexible expressions, exemplified by mamam,
when it refers to a variety of cooked dishes. The latter, termed referential words
by McCune (2008) show form/content differentiation by their application across
entities or situations sharing relevant content properties.
As referential words become differentiated, each becomes more fully integrated
with its internal meaning, and differentiated from context. According to Werner and
Kaplan, children’s single words must “shift function”, losing their close connection
with context and specific communicative goals to take on syntagmatic roles in the
sentences of the ambient language. Children experience the words of the language
from adults, but then construct form and meaning through their own internal pro-
cesses. Because of the joint developmental history of sound and meaning, hearing
a word instantiates an internal contentful state. For a child who has achieved the
138 Lorraine McCune
sign relationship. Grunts accompanying effort reflect the same biological processes
as those accompanying physiological stress. They occurred accompanying move-
ments that required energy, such as shifting from a crawling position to standing
or the reverse. An infant conscious of effort would experience vocal correspon
dence between this autonomic vocalization and the internal sense of effort. Focused
attention is, by definition, an intentional state, providing the opportunity for sign/
meaning construction once the child has the capacity for mental representation.
Communicative use of this vocalization signals the construction of a personal
symbol, based on the indexicality existing in the physiological process. Communi-
cative grunts differed in form from effort grunts, which continued to occur, but with
reduced frequency, during the same time period where communicative grunts were
used. The effort grunts, across the five children studied, utilized the central vowels
that require least shaping of the oral tract, suggesting involuntary production. In
contrast, communicative grunts exhibited form/meaning synergies with words,
such that children who had begun context-limited words produced communicative
grunts utilizing the vowels most prominent in their early words. This shaping of the
vocalization is evidence that communicative grunts are constructed as symbolic,
and based on their indexical history in relation to biological processes.
When grunts began to be used communicatively there was no doubt of their
voluntary character and specific meaning, the latter varying with context of use. If
the desired response was not forthcoming the grunt was repeated with increasing
intensity. In one case, of a 16-month-old extending a toy bottle for his mother to
open, his grunts finally graded into crying, a more emotional aspect of laryngeal
activation, before she complied with his request. Another example demonstrates
the specificity of meaning intended in some communicative grunt episodes. The
child, who showed referential language comprehension, but not production, wanted
his mother to name something for him. He was pointing at toy a truck, so she first
said truck. This led to more intense grunts and points. She then touched and named
parts of the truck until she finally said steering wheel, and he gave an expression
of satisfaction.
I propose that for each of the children studied the communicative grunt became
a personal symbol, where the referent is the child’s current intentional mental con-
tent. That is, the child seeks to convey his current internal mental state with each
production, so semantic content varies from occasion to occasion. Thus the sym-
bolic vehicle, the communicative grunt, is differentiated from its meaning, but it is
not conventional. The relationship between communicative grunt and meaning is
not one of similarity or contiguity, but it is fleeting and personal. The vocalization
thus symbolizes the internal state the child has in mind in the moment, including
both the sense of meaning (which varies freely from one situation to the next) and
the communicative goal.
In their analysis of diary findings, Werner and Kaplan identify vocalizations simi-
lar to communicative grunts as “call sounds” which tend to occur at the transition to
conventional language. Similarly, McCune et al. (1996) found that, for children who
were phonetically prepared (see the following section), the shift to referential words
140 Lorraine McCune
and a sharp increase in word production occurred within one session of productive
communicative grunt use, while for those lacking phonetic skill, referential com-
prehension was observed.
Children producing this simple vocalization implicitly recognize that a self-
produced sound can reference a thought. Experiencing this communicative goal
state and producing the simple communicative grunt may come to prompt more
frequent vocalization in relation to meaningful internal states, perhaps providing
the basis for acquiring more conventional symbol/meaning relations from the am-
bient language. This shift to conventional symbols of the ambient language requires
further phonetic ability, the subject of the next section. In learning referential words,
the child generalizes initially, based on her own associations, so these words are
also at least partially personal symbols, but with more stable meanings that are ap-
proaching the conventional. It is only when words enter into sentences that their
full conventional use is exploited.
5. C
onclusions: From pre-symbolic meaning to referential
language
McCune (1992; 2008) found that children made the transition to referential language
production (conventional symbols) when the following conditions were in place:
(1) Mental representation reached the level shown in play by combining pretend
acts,
(2) Communicative intent comes to be realized by the production of the “natural”
vocalization defined as a communicative grunt, and
(3) Phonetic development reached a critical point (defined in early talkers by
identification of at least two VMS).
The children studied all showed communicative gestures earlier than communi-
cative grunts (Lennon 1984), and gesture development may have provided the
communicative foundation for shifting from grunts of effort and attention to
communicative use. Children lacking the phonetic skill assessed by two VMS
but exhibiting the requisite communicative and representational skills showed
referential comprehension by gesture, in the absence of word production only in
comprehension. They used communicative grunts in combination with gesture
to solicit assistance and share meanings. The variables identified above are all
measures of underlying abilities and so might be assessed in other ways. The vocal
variables I have emphasized in this chapter, VMS and communicative grunts, share
the property that each facilitates some aspect of language development over time.
In addition, they both contribute “in the moment” to facilitating communicative
production. This dual behavioural and developmental role is typical of variables
within a dynamic system.
This chapter has provided a cognitive semiotic account of children’s development
of meaning, from the time before their first actions show any cultural knowledge
up to their capacity for expressing meaning in the conventional words of their
language. The several variables contributing to the transition, as is typical in a
dynamic system, need not become evident in a particular order. Importantly, the
reviewed studies showed that vocalizations in the behavioural repertoire for several
months were not co-opted for communicative use and word production until mental
representation, as identified in play, became available.
Through entering into the field of cognitive semiotics I have gained a new per-
spective on previous empirical findings and their theoretical interpretation. I can
only hope that the information I offer here can have a reciprocal effect in the cog-
nitive semiotics community.
142 Lorraine McCune
Appendix
Sequence of representational play levels according to McCune (1995) with examples,
aligned with developmental stages according to Piaget (1962), discussed in Sec-
tion 3.2.
1. Introduction
In a seminal paper Harnad (1990) defined the so-called symbol grounding problem,
referring to the well-known “Chinese Room Argument” posited by Searle (1980).
Suppose you had to learn Chinese as a second language and the only source of in-
formation you had was a Chinese/Chinese dictionary. The trip through the dictionary
would amount to a merry-go-round, passing endlessly from one meaningless symbol
or symbol-string (the definientes) to another (the definienda), never coming to a halt
on what anything meant. […] How can you ever get off the symbol/symbol merry-
go-round? How is symbol meaning to be grounded in something other than just more
meaningless symbols? This is the symbol ground problem. (Harnad 1990: 339–340)
Although Harnad (1990) raised this as a challenge for artificial intelligence em-
ploying the “classical” physical symbol systems approach, this problem may also
be seen as one faced by young children learning their first language, who have to
learn thousands of words to build up their lexicon. At the earliest stages, when their
vocabulary is still small, it is not possible for adults to use language to explicitly
teach children new words, since they would not be able to understand verbal ex-
planations of word meanings. So children have to find the elementary semantic
categories by themselves. Together with Cangelosi, Harnad imagines how language
could have evolved on earth from the view-point of a Martian anthropologist. First
beings would acquire “an entry-level set of categories the honest way, like everyone
else, but then assign them arbitrary names” (Cangelosi and Harnad 2001: 139). Then
as a next step: “Once the entry-level categories had accompanying names, the whole
world of combinatory possibilities opened up and a lively trade in new categories
could begin probably more in the spirit of barter than theft” (ibid: 139).
Harnad assumed that the elementary level categories can be easily and almost
automatically learned by trial and error because such basic-level categories are
already cut out by the world with natural gaps between them (Rosch and Mervis
1975; Rosch 1978). In this sense, Harnad may not have considered how the first set
of symbols arises to be a part of the symbol grounding problem. Harnad further
assumed that, once these categories are formed, the language learner’s task is just
to learn the associations between the learned categories and arbitrary strings of
146 Mutsumi Imai
sounds (“names”). However, these assumptions are far too simplistic, when con-
sidering how young children acquire meanings of words and build up a vocabulary
that contains thousands of words.
First, before they can map words to elementary level categories, infants have
to realize that the speech sounds they hear are associated with particular things
or other elements of the events they observe. In other words, to start learning the
meaning of individual words, they need to understand that the speech sounds can
be segmented into units (i.e., words) and that each word denotes a concept (Piaget
1962). How infants gain this referential insight remains yet not fully clear (Asano,
Imai, Kita, Kitajo, Okada and Thierry 2015; McCune and Zlatev 2015).
In order to learn the meanings of individual words, infants need to first establish
the indexical associations (Bates, Benigni, Bretherton, Camaioni and Volterra 1979)
between words and their referents. This is not an easy task for infants who have just
begun word leaning (e.g. Stager and Werker 1997; Imai, Miyazaki, Yueng, Hidaka,
Kantartzis, Okada and Kita 2015). But even when infants have succeeded at that
task, this is not sufficient to acquire the meaning of the word (Bates et al. 1979). And
since adults cannot teach it to infants using language, they need to infer it on their
own. However, inductive generalization from a single word-referent association
can go in any number of directions. That is to say, there are an infinite number of
meanings that can be generalized from a single association, unless the possibilities
are constrained in some way.
It is now known that the way infants generalize words when they just start word
learning indicates that basic-level (or elementary-level) categories do not emerge
naturally, in contrast to what was assumed by Harnad and Rosch. Children often
overgeneralize a word to form a chain-structured category (Bowerman 1978). For
example, a child who has learned that moon refers both to a full moon and to a cres-
cent moon may extend the word not only to other round things (e.g. a round clock,
a watch, a grapefruit) and to other crescent shaped things (e.g. a horn, a croissant)
but also to things that are yellow or shiny (e.g. shiny leaves). In other words, in
order to successfully map an object name to a basic-level category, children need
to realize that they should not generalize the word based on just any salient per-
ceptual property (such as color, texture, or size); instead, they need to pay attention
to shape similarity and ignore other perceptual properties (e.g. Imai, Gentner and
Uchida 1994; Landau, Smith and Jones 1988). Many developmental psychologists
have studied how young children use strategies such as shape bias to get around
this problem when inferring the meaning of a word (e.g. Carey 1982; Clark 1993;
Hollich, Hirsh-Pasek and Golinkoff 2000; Gleitman and Gleitman 1994; Golinkoff,
Hirsh-Pasek, Bailey, and Wegner 1992; Imai, Gentner and Uchida 1994; Imai and
Haryu 2001; Imai and Gentner 1997; Smith 1995; Tomasello 1997).
However, this problem is just a part of the story. The lexicon is not merely an
assembly of words each standing on its own; rather it is a complexly structured
system, in which words are contrasted with one another along multiple dimensions
at multiple levels (cf. Saussure 1916). In order to learn words so that they can use
them according to the conventions of the adult speakers of the ambient language
The “Symbol Grounding Problem” Reinterpreted 147
community, children need to know how a particular word differs from the other
words that surround it in the same semantic domain.
Take color words, for example. We cannot say that a child “has acquired” the
meaning of the word red if all he knows is that red is the color of apples and hence
cannot apply this word to other variants of red, such as the color of fresh blood,
red bricks, red autumn leaves, and red roses. Languages differ widely in the way in
which they divide the continuous visible spectrum by color names (e.g. Berlin and
Kay 1969; Cook, Kay and Regier 2005; Roberson, Davies, Corbett, and Vandervyver
2005), and boundaries for referents of color words are not available in the environ-
ment. Even among languages that seem to have highly comparable color vocabulary
(i.e., the same number of words and the same focal colors for the corresponding
categories), the boundaries can vary considerably. For example, although English
and Japanese both have the color terms brown / cha-iro and orange / daidai-iro, the
range covered by orange is much wider than daidai-iro, and consequently, the range
covered by cha-iro is broader than the range brown refers to. A natural consequence
of this is that children cannot acquire a color name from individual episodes of
word-to-world mapping alone; English-reared children cannot be said to have ac-
quired the meaning of the word red, unless they have learned how red contrasts
with pink, orange, purple and other color names surrounding red, and where the
boundaries are delineated between red and these neighboring words (Saji, Asano,
Ohishi and Imai 2015).
A riddle arises here. To understand the meaning of a particular color word, the
learner seems to need to know other color names surrounding that word. More
concretely, he or she cannot acquire the meaning of a single color word without
knowing the structure of the color lexicon as a whole — how many words there
are and which section of the visible color spectrum each word covers. Then, theor-
etically, children can never acquire a color word, because they do not already know
the other color words. How can children get off this “merry-go-round” (cf. Harnad
1990) to acquire the color lexicon?
This problem is not limited to the domain of color names, of course. Cross-
linguistic diversity is more of a rule than an exception for the majority of semantic
domains including names of household containers, color, causality, mental states,
motion, direction, and spatial relations (see chapters in Gumperz and Levinson
1996, Gentner and Goldin-Meado, 2003; Wolff and Malt 2010 for detailed illus-
trations; also see Kay, Berlin and Merrifield 1997; Malt, Sloman, Gennari, Shi
and Wang 1999; Wierzbicka 1999; Majid, Boster, and Bowerman 2008; Saji, Imai,
Saalbach, Zhang, Shu and Okada 2011; Malt, Gennari, Imai, Ameel, Saji and Majid
2014). The diversity arises because language is highly selective in what elements
of experience it encodes in words, and there are many possible ways to map be-
tween words and the world (Wolff and Malt 2010; Malt, Gennari, Imai, Ameel, Saji
and Majid 2014). This means that it is largely up to each language and culture to
choose how and on what basis to divide a given conceptual domain to form the
categories to be named.
148 Mutsumi Imai
In fact, finding the conceptual domain to allocate an incoming new word may not
be obvious, because languages differ not only in how a given conceptual domain is
divided, but also in how conceptual domains are formed from sensory experiences.
For example, Goddard and Wierzbicka (2013) point out that the concept of color,
which seems nothing other than universal to speakers of English, French, Ger-
man, Japanese, Chinese (and many other languages) is in fact not universal across
languages in the world. What we see on the surface of an object is a conjunction of
multiple properties such as patterns, visual textures, shininess, shape etc. When we
(i.e., speakers of a language with color names) talk about color, we abstract color by
segregating it from these other perceptible visual properties.
Thus, when we consider the diversity in the lexical systems, it is clear that
children cannot acquire the meanings of words by simply finding a way of hooking
sensory or perceptual experience to some entry level words. To be able to use a
word as adults do, children need to discover the range of the category that the
given word covers. This requires finding the boundaries among a cluster of words
belonging to the same lexical domain, which further means that children need to dis-
cover the lexical domain itself (Saji et al. 2011). Thinking this way, the real problem
(for children, at least) for understanding the mechanism of lexical acquisition is to
find out how children learn the meaning of a word without knowing the semantic
domain that the word belongs to, as well as without knowing the words surrounding
that word.
To recapitulate, in order to “ground” not just a symbol but a system of symbols,
children need to go through at least the following achievements (not necessarily in
this chronological order): (1) to find out that words refer to concepts; (2) to find the
referents of particular words in the particular context in which they are said; (3) to
find the semantic domain to which the incoming words belong; (4) to generalize
each word beyond the referent originally associated with the word in light of the
similar words already existing in the lexicon and, if necessary, modify the meanings
of the existing words; and (5) to acquire an adult-like representation of the domain
as a whole.
Developmental psychologists have been engaged in uncovering the detailed pro-
cess by which each of the above achievements takes place. But an equally important
challenge for researchers is to uncover the bootstrapping mechanism through which
children construct a vastly complex linguistic system, starting with virtually no
words and no knowledge about the structure of the lexicon.
In this chapter, I extend and reformulate the original “symbol grounding prob-
lem” (Harnad 1990) to address the problems children need to solve in the process
of lexical acquisition, which include symbol emergence, embodiment, and con-
struction of a system of symbols. For this goal, I review three lines of my work,
which I believe are crucially relevant to this reformulated version of the problem.
The first line of work concerns the question of how infants first hook sensory/
perceptual experiences onto language and realize that speech sounds refer and
have meanings. The second documents how young children infer the meanings
of new words, how they integrate these words into the vocabulary and how
The “Symbol Grounding Problem” Reinterpreted 149
they restructure the meanings of already learned words to construct the complex
system of lexicon. The third part explores the roots of the cognitive function that
makes flexible and complex inference possible. At the end, I revisit the original
symbol grounding problem, asking whether AI, with current computer hardware
and learning algorithms, can solve it.
2. H
ow children come to realize that speech sounds are
symbols
During the first year of life, infants start to map speech sounds onto meanings, and
in subsequent years they acquire a great number of words to build up a lexicon.
However, for children, at the beginning speech sounds may be no different from
other sounds in the environment, sounds which they hear while simultaneously
observing rich visual scenes. How do children come to know that the sounds people
make with their mouths are words, and that they refer to objects, actions or proper-
ties?
One possibility is that a biologically endowed ability to realise cross-modal map-
ping, particularly between auditory and visual percepts, scaffolds language learn-
ing in human infants (Maurer, Pathman, and Mondloch 2006; Imai and Kita 2014).
Human infants can already map information in different modalities in the way
adults would. For example, they can map size and numerosity (Lourenco and Longo
2010), and acoustic properties of speech and non-speech sounds onto properties
of visually presented objects (Yeung and Werker 2009; Walker, Bremner, Mason,
Spring, Mattock, Slater and Johnson 2010). Importantly, this cross-modal mapping
ability is likely to be a heritage from our non-human ancestors, as chimpanzees can
map auditory pitch and luminance (Ludwig, Adachi, and Matsuzawa 2011). Con-
sistent with this idea, 4-month-old infants appear to sense intrinsic correspondences
between speech sounds and certain features of visual input (Ozturk, Krehm, and
Vouloumanos 2013; Peña, Mehler, and Nespor 2011).
The question is whether infants’ ability for cross-modal mapping is linked to
language processing. We investigated this question with Japanese 11-month-olds
by measuring their brain activity using electroencephalograms (EEG) (Asano et al.,
2015). In each trial, infants were presented with a picture of a shape (either spikey
or rounded) followed by a novel word (kipi or moma, see Figure 1). The word-
shape pairs were either sound symbolically matching or mismatching, according
to Köhler’s original proposal (Köhler 1947)
The recorded EEGs were analyzed to explore (a) whether 11-month-old infants
detect sound symbolism through cross-modal perceptual binding; (b) how differ-
ent regions of the infant brain communicate while sound-symbolically matching
and mismatching words are processed; and (c) whether infants at this age process
novel sound-shape pairs over and above simple perceptual cross-modal binding and
respond to sound-symbolically mismatching words as a form of semantic anomaly.
150 Mutsumi Imai
Figure 1. E
xamples of the rounded shapes and the spikey shapes used in Asano et al.’s
study
During the same time window in which the infants’ brains showed a differential
style of communications across different brain regions, the N400 component was
significantly larger for sound symbolically mismatching than for matching pairs
in the analysis of ERP. Combined with the results from the amplitude change in
the earlier time window, the large-scale posterior-anterior synchrony observed
in the beta band over the left hemisphere in the N400 time window, and the N400
modulation in ERP suggest that the infants detected anomaly as the meaning of a
word when they encountered the sound-shape mismatch. To summarize, the re-
sults of Asano et al. (2015) suggest that 11-month-old infants who had not actively
started word meaning acquisition could spontaneously detect the inherent similarity
between speech sounds and visual referents in their attempt to bind visual shapes
and sounds, first perceptually and then conceptually.
However, this study did not provide direct evidence that sound symbolism ac-
tually helped infants in the context of novel word learning. As discussed earlier, in
order for children to acquire the meaning of a word, they need to start establishing
an association between a word form (sound) and its referent.
To examine whether sound symbolism indeed helps infants learn meanings of
novel words, we tested 14-month-old infants in a word learning task using their
looking pattern as the measure (Imai et al. 2015). The infants were repeatedly pres-
ented with two word-shape pairs. For half of the infants, the word and the shape
sound-symbolically matched; for the other half, they mismatched. After the infants
had been habituated, they heard either kipi or moma and saw the two shapes side by
side. Infants looked at the correct referent (i.e., the shape that had been associated
with the word during habituation) faster and longer when they had been trained
on the sound-symbolically matching word-referent pairs than when they had been
trained on the sound-symbolically mismatching pairs.
To initiate language learning, infants need to realize that each set of speech sound
denotes a concept and has a meaning. The two studies reviewed here (Asano et al.
2015; Imai et al. 2015) together suggest that infants’ ability to detect a non-arbitrary
iconic relation between vision and audition may help them gain this insight.
Smith and Jones 1988). When they hear a novel noun in association with a novel
object, children assume that the word refers to the entirety of the object (Mark-
man 1989), and spontaneously generalize it to other objects of similar shape (Imai
and Haryu 2001). At the same time, by 2 years of age, children also know that the
generalize-by-shape rule does not apply to names of substances (Imai and Gentner
1997; Imai and Mazuka 2007).
My collaborators and I have demonstrated that young children can combine these
sources of knowledge flexibly in various different situations to come up with the
most plausible interpretation of a novel word. Imai and Haryu (2001) demonstrated
that 2-year-old children can shift their interpretation of a novel name flexibly. When
children knew the basic-level name (e.g. cup), and the named object was an artifact
(a paper cup with a particular design), they interpreted the novel label as referring
to a subordinate category (that is, a particular kind of cup). However, when they
heard a novel label together with a familiar animal (a penguin), they interpreted the
label as the personal name of the animal. Importantly, when they did not know the
basic level name of the named object, they mapped the newly taught label to the
basic-level category.
Although it is more difficult to extract strong constraints for inferring meanings
of a new verb, children do their best to make as plausible an inference as possible
for verbs as well as by combining whatever sources of knowledge are available
to them. For example, they assume that an action word appearing with only a
single object word (i.e., intransitive verbs) tend to refer to people’s spontaneous
movement (e.g. walking, running, etc.), while an action word co-occurring with
two object names (i.e., transitive verbs) tend to refer to an action people do to an
object to affect the state or the nature of the object (e.g. Fisher and Song 2006;
Naigles 1990).
However, knowing the difference between transitive and intransitive verbs does
not narrow down the possible verb meaning all that strongly, as there are numer-
ous possible ways to generalize an single instance of action within the categories
of caused action or spontaneous action. Previous research has indeed shown that
preschool children in general have difficulty in generalizing a novel verb. In fact, it
has been reported that, across English, Chinese and Japanese, 3-year-old children
could not even generalize a novel verb to the very same action, when the agent (or
the theme object, or the instrument of the action) in the original scene is replaced
with a new one (Imai, Haryu and Okada 2005; Imai, Li, Haryu, Okada, Hirsh-Pasek,
Golinkoff and Shigematsu 2008). Thus, children need cues to further narrow down
the possible verb meanings within transitive or intransitive verbs.
We found that sound symbolism could be one such cue. We demonstrated that
sound symbolism could help Japanese- and English-speaking 3-year-olds find the
semantic invariance for a newly taught verb and help them generalize it (Imai,
Kita, Nagumo and Okada 2008; Kantartisz, Imai and Kita 2011). Children were
assigned to one of three conditions and were taught novel verbs while observing
a person walking in different manners. In the experimental condition, a novel
The “Symbol Grounding Problem” Reinterpreted 153
verb (which had been created by modifying an existing Japanese mimetic word,
i.e., a sound symbolic word) was paired with a manner of walking that matched
sound-symbolically. For example, to describe the action of walking quickly with
small steps, the novel mimetic choka-choka was created from the existing Japanese
mimetic choko-choko, and presented as a verb (choka-choka-shiteru in Japanese,
doing choka-choka in English).
Consistent with previous studies (Golinkoff, Jacquet, Hirsh-Pasek, and Nanda-
kumar 1996; Imai et al. 2005, 2008), Japanese and English 3-year-olds both failed
to generalize a newly taught verb to the identical action performed by a different
actor when the sound of the verb did not have sound symbolic correspondence to
the action it denoted. However, when the novel verb sound-symbolically matched
the action, not only Japanese 3-year-olds but also English-reared 3-year-olds who
were not familiar with the sound symbolic system of Japanese mimetics were able
to use this cue to generalize the verb to a new event.
4. C
onstructing the lexical systems by combining fast and
slow learning
As I reviewed in the previous section, young children can come up with the most
plausible meaning at the moment, and in so doing, they can recruit different sources
of cues and coordinate them flexibly. At the same time, the meaning they first as-
sign to a new word is only tentative. As I pointed out earlier, the acquisition of the
meaning of any given word requires delineation of the boundaries between it and
its neighboring words. Previous studies have shown that delineating boundaries
between words and building up an adult-like representation of the entire con-
ceptual domain is not an easy task for children (e.g. Ameel, Malt and Storms 2008).
Here, I review a study that investigated how children might construct a complex
lexical system of verbs, using a set of verbs denoting various actions of carrying in
Mandarin Chinese as a test case (Saji et al. 2011).
Mandarin Chinese makes fine distinctions for a variety of actions which are
all denoted by carry (or hold) in English. For example, carrying/holding an object
on one’s head is denoted by ding (顶), while carrying/holding an object on one’s
shoulder is kang (扛). Carrying/holding an object with two arms is denoted by
bao (抱), but if the object is held with one arm at the side of the body, the action
is called jia (夹). Several verbs like na (拿), ti (提), and lin (拎) denote carrying/
holding actions with one hand, and verb choice depends largely on the shape of
the hand holding the object. This description may give readers an impression that
these verbs are all mutually contrastive with clear gaps among them, but this is not
the case. Just like most semantic domains in any language, one part of the semantic
space is densely covered by several close synonyms with overlapping boundaries,
while other parts of the space are only sparsely covered with clear gaps between
verbs, as Figure 2 shows.
154 Mutsumi Imai
Saji et al. (2011) provided a comprehensive picture of the process Chinese children go
through to eventually acquire this complexly structured semantic system. The verbs
that tend to be included in children’s early lexicons are those that they hear most
frequently, and the words that are frequently used by adults are usually those which
are used broadly and polysemously, as is the case with Chinese verb na (or in the case
of English speakers, verbs such as go, make or run). Children thus start to use these
verbs early and to apply them to a broad range of events. When these frequent and
broad-covering verbs have close neighbors that cover a relatively narrow range of
referents, the more frequently used ones may be overextended to cover actions that
adults would denote with less frequent but more specific ones. As children’s lexical
knowledge of the domain develops with the inclusion of more verbs in the domain,
the boundaries of the originally overextended verb are gradually modified.
The process of fast-mapping and continuous adjustment of the meaning there
after provide sufficient grounds to make my point: that the symbol grounding -
problem for children is not simply a problem of mapping already divided elements
of the world to an elementary set of words; rather, it is a problem of constructing the
vastly complex system of symbols. It is important, in this process, that children can
use words when the system is still under construction, so that they can communicate.
To accomplish this goal, children combine fast learning and slow learning. To begin
to construct a system, one first has to have elements (i.e., the words). Children thus
attempt to build up a sizable vocabulary as fast as possible by learning rough (and
The “Symbol Grounding Problem” Reinterpreted 155
often inaccurate) meanings of words by fast-mapping. But they are open to the
possibility of modifying the meaning of the word they have learned, and in fact,
they continue to update their knowledge of word meanings over many years as they
learn new (contrasting) words and witness already learned words in new contexts.
5. W
hat cognitive functions make system construction
possible?
As argued so far, children learn words as symbols by inferring their meanings from
observing a single or a small number of referents. Developmental psychologists
have long characterized the problem of word meaning inference as a “problem of
induction” (e.g. Carey 1982; Carey and Bartlett 1978; Markman 1989; cf. Quine 1960,
1969). However, I maintain that this kind of inference should rather be seen as a
form of abduction (cf., Peirce, 1931–58). Children do make generalizations from an
observation of a single word-reference mapping, which can indeed be thought of
as inductive generalizations. However, inductive generalizations need to be con-
strained, because there are too many ways the single exemplar can be generalized
(Markman 1989). In inferring the word-meaning, children attempt to come up with
the best possible interpretation of the meaning of a new or an old word, given the
information available at that time, using whatever knowledge they have (cf. Hollich
et al. 2000; Imai 1999). This is just like the process of hypothesis construction in the
domain of science, which Peirce characterized as abductive reasoning.
Fast-mapping inference of word meanings allows children to learn words rapidly,
which bootstraps them into discovering distributional patterns and the hidden struc-
tures of language. Using this knowledge, children can further accelerate the learning
of new words. If flexible abductive inference is the key to solving the symbol ground-
ing problem, it may also be the key to the evolution of language — in other words,
to the question of why human beings, but not other animal species, have language.
There have been attempts to teach language to our closest evolutionary relatives,
i.e., chimpanzees. Although they were able to learn the associations between some
numbers of symbol-referent pairs, researchers were not successful in attempts to
get them to use the trained symbols spontaneously as a tool for communication
(e.g. Premack and Premack 1972; Terrace, Petitto, Sanders, and Bever 1979/1981). In
fact, the way the trained chimpanzees understand the symbol-referent pairs seems
to be fundamentally different from the way human children understand them.
A study that attempted to teach chimpanzees language (Asano, Kojima, Matsu-
zawa, Kubota and Murofushi 1982) tells us that, in order to learn the meaning of a
word, memorizing the association between a sound and an object is necessary but
not sufficient. In this study, a female chimpanzee was trained to associate a set of
symbols with color chips. During the training, she was shown a color chip and was
taught to choose the symbol that designated that color from a list of symbols. After
training, the chimpanzee seemed to have learned color-symbol parings as she was
able to choose the correct symbol when shown one of the color chips she had been
156 Mutsumi Imai
trained with. However, when the directionality of the contingency was reversed
(i.e., when she was meant to choose the correct color for a given symbol), she failed,
indicating that she did not understand that the relation between the color and the
symbol was intended to be bi-directional.
This situation is analogous to the following scenario: a human child has been
taught the name of a novel object, say dax, and she can now pronounce dax when that
object is shown; nevertheless, she cannot choose the object from an array of several
objects when you request that she hand the dax to you. This would be quite startling
from our (human) perspective, and it reveals the prerequisite for language learning:
for human infants to initiate language learning, the first thing they need to understand
is the bi-directional relationship between symbols (i.e., words) and their referents.
Note that from one point of view, the chimpanzee was making a logically sound
inference: with a premise that the A-then-B rule is true, one cannot deduce that B-
then-A is true. The assumption that bi-directional relation holds between the word
form and its referents thus requires logically unsound heuristic reasoning.
Humans have a strong tendency to generalize a learned contingency in the re-
verse direction. For example, having heard someone say “if X happens, I will come,”
then he actually appears, people naturally infer that he came because X happened,
although he may have come for another reason. Language learning seems to become
possible when children make a commitment to the non-logical assumption that bi-
directional relations hold true between symbols and their referents.
Importantly, bi-directional inference goes beyond the relation between A and
B. Let us think about the relationship among sound, referent, and orthography.
We now have three items. A child learns the bi-directional relationship between
the sound inu and its referent DOG. At a different time, she may also learn the
bi-directional relationship between the sound inu and the character 犬. Now, the
child does not need to make any further effort to learn the relationship between 犬
and DOG. Because she has learned that inu⇒DOG and inu⇒犬, she automatically
assumes that DOG⇒inu and 犬⇒inu; furthermore, 犬⇒DOG and DOG⇒犬. In
other words, the relation between the character and the referent comes for free.
Thus, if one has a bias to assume bi-directionality, learning the one-directional
relation between two elements (e.g. A→B and A→C) leads to the learning of four
other relations (B→A, C→A, B→C, C→B) for free. Obviously, this is an extremely
efficient learning mechanism, even though it could sometimes cause errors.
Numeral studies with various animal species have shown that non-human ani-
mals rarely generalize a learned contingency to the reverse direction (e.g. D’Amato,
Salmon, Loukas, and Tomie 1985, Lionello-DeNolf 2009 for a review, Sidman, Rauzin,
Lazar, Cunningham, Tailby and Carrigan 1982), as the above example by Chimpan-
zee Ai shows. Some other researchers tested whether two chimpanzees who had
been extensively trained on lexigram-referent associations in both directions (lexi-
gram⇒referent and referent⇒lexigram) could take advantage of this experience to
learn bi-directional contingencies for a new set of lexigram-referent pairs (Dugdake
and Lowe 2000). These two animals were trained to match a shape to a color, and they
were able to learn the associations in the direction they were trained. However, when
The “Symbol Grounding Problem” Reinterpreted 157
they were tested with the same shape-color associations in the reverse direction, their
performance was at the chance level. Thus, chimpanzees are not likely to generalize
the learned contingency in one direction to the other direction, even if they have been
trained on contingencies on both directions in other stimulus sets.
A question that is extremely important in considering the ontogenesis of language
is whether human infants possess the bi-directional reasoning bias prior to language
learning. If they do, the presence of a bias to assume a bi-directional relation may be
something unique to human beings and we might even be bold enough to maintain
that it is because of this bias that human children are able to learn language.
To investigate this possibility, in my laboratory we compared 8-month-old
human infants and adult chimpanzees with regard to the bi-directional reasoning
bias in a non-linguistic context, using the same stimuli and comparable experimental
procedure (preferential looking paradigm) across the two species (Murai, Miyazaki,
Tomonaga, Okada and Imai 2014; Imai, Miyazaki, Murai, Tomonaga and Okada,
in preparation). In this study, both species watched ‘A(object)⇒B(motion)’ con-
tingencies until they were habituated. Specifically, one object (a toy dog) appeared
on the monitor. It then shrunk into a dot and moved in a zigzag path. Likewise, an-
other object (a toy dragon) always moved in a curvy path. After participants (human
babies and chimpanzees) were habituated, they received test trials in the reverse
direction, i.e., a motion followed by an object. In half of the test trials, the motion-
object pairings were kept, e.g. B (zigzag motion) ⇒A (toy dog); in the other half,
the object followed the motion that was not paired with it during familiarization,
e.g. B (curvy motion) ⇒notA (toy dragon), see Figure 3). We expected that human
infants but not chimpanzees would show the bi-directional reasoning bias, detecting
anomaly in the ‘B⇒not A’ contingencies.
Figure 3. Th
e design of the comparative study testing the bi-directional bias in human
infants and chimpanzees
158 Mutsumi Imai
As expected, the human infants showed the bi-directional reasoning bias. At test
trials, they looked at the object longer when it followed the motion that was not
paired with it during the familiarization (B1⇒A2, B2⇒A1) than when it followed
the originally paired motion (B1⇒A1, B2⇒A2). In contrast, when the direction was
reversed, the chimpanzee did not show any difference in looking times across the
trained association and the switched association. We checked whether the chim-
panzees had learned the contingency at all, comparing the looking times for the
trained contingencies in the original order (e.g. A1⇒B1, A2⇒B2) and the looking
times for the pairs in which the pairing was switched (e.g. A1⇒B2, A2⇒B1). The
chimpanzees were able to detect the violation this time. Thus, although they indeed
had learned the contingencies during the training, they did not (over-)generalize
the learned contingencies to the other direction. These results suggest that the bi-
directional reasoning bias is present in human infants before they start initiating
active word learning but that this bias is not present in non-human animals.
As discussed, the bi-directional reasoning bias can produce fast and efficient
learning. Unlike chimpanzees who are only able to learn a few hundred lexigram-
referent pairs at most, human children learn thousands of words, words that are all
connected and integrated into the system of the lexicon. Constructing this lexicon
is only possible through a bootstrapping mechanism: a learner starts small, using
biologically endowed perceptual abilities such as multi-modal mapping ability, to
learn some initial elements, i.e., sound-referent associations. From there, children
infer extensions of individual words. At the same time, as soon as a certain number
of elements are learned, they begin to construct hypotheses about the nature and
structure of the lexicon to make further learning more effective.
I conjecture that the bi-directional reasoning bias may be crucially related to the
root of flexible heuristic thinking in human beings, and hence, to the ontogenesis as
well as phylogenesis of language. Heuristic thinking using a variety of sources of
knowledge for abductive inference further enables children to advance their knowl-
edge almost limitlessly through a continuous bootstrapping process.
6. Conclusions
Some developmental scientists, especially those who adopt a connectionist ap-
proach (e.g. Rogers and McClelland 2004; Hills, Maouene, Riordan and Smith 2010;
McClleland, Botvinick, Noelle, Plaut, Rogers, Seidenberg, and Smith 2010) char-
acterize abductive reasoning as association learning. However, although the as-
sociationist view might be able to explain how children could extract statistical
patterns in language, it is doubtful if it can explain the entire bootstrapping process
through which children not only ground individual symbols per se but integrate an
entire system of symbols within their internal representations. As I have shown in
this chapter, one should specify at least the following three points in re-thinking
“the symbol grounding problem”: (1) how children make reasonably constrained ab-
ductive inferences about the meanings of words, (2) how they discover subsystems
The “Symbol Grounding Problem” Reinterpreted 159
of language and how they integrate these subsystems into a vastly complex system
of the lexicon; and (3) what are the cognitive functions that make this possible.
To recap, Hanard raised the symbol grounding problem to criticize the so-called
“symbolic AI” approach, i.e. the view of intelligence as the manipulation of physical
symbols (e.g. Newell 1974), and expressed high expectations concerning the con-
nectionist approach. During the past two decades, machine learning technologies
have shown tremendous advancement to such an extent that machine intelligence
has now surpassed human performance in a number of domains (e.g. board games
such as chess, shogi, Othello; and search engines such as Google) and people are
even concerned about the “singularity problem” (Kurzwel 2006). However, it is not
likely that non-symbolic approaches with current technologies such as deep learn-
ing (e.g. Le 2012) can deal with the symbol grounding problem. Such approaches can
certainly excavate hidden structure by mining a huge amount of data that humans
can never handle. They can discover new knowledge, as long as the domain — the
problem space — is a priori defined (McCarthy and Hays 1969; Dennett 1987). The
question is whether machines can ever discover a domain or create a new domain
by themselves, as human children do so naturally.
Currently, researchers in natural language processing work in specific subsys-
tems of language such as speech recognition, speech production, morphology learn-
ing, structure building of sentences, semantic ambiguity resolution, interpretation
of word meanings, etc. However, children do not learn language separately within a
prefixed domain. They need to learn rules for various aspects of language, including
phonology, grammar, semantics, and pragmatics. Parents do not tell children in
advance that they need to find rules for this or that aspect of language. Children
discover the domains, learn elements and relations among elements within the
domains, and spontaneously integrate different domains by abduction. Thus, the
reformulated symbol grounding problem for AI now is how a machine can discover
domains when it is placed in an unbounded and unlimited environment as we are,
and how it can build up an integrated system of knowledge that continuously
changes and grows as it discovers new information in the world, all on its own.
Whether that is possible at all, or whether conscious human minds are needed for
this feat, as argued within cognitive semiotics (e.g. Zlatev 2002), remains to be seen.
Acknowledgements
This research was supported by MEXT KAKENHI (#15300088, #22243043, Grant-
in-Aid for Scientific Research on Innovative Areas #23120003). I would like to thank
the reviewers and Jordan Zlatev for their insightful comments on earlier versions
of the manuscript.
Keith E. Nelson
Chapter 9
Key Roles of Found Symbolic Objects in
Hominin Physical and Cultural Evolution: The
Found Symbol Hypothesis
1. Introduction
A fundamental, intriguing, but as yet unanswered evolutionary question is when
did any species first begin to use intentional symbols. The definition of a full symbol
here adopted is that one object or action or object-plus-action is used intentionally
to represent a referent that is distinct from the symbol. In this chapter, these full
symbols are distinguished from artefacts and materials that are non-utilitarian and
often termed “symbolic” in function in a broad sense, such as red or black ochre
blocks or beads with apparent decorative value (Bouzouggar et al. 2007; D’Errico,
Henshilwood, Vanharen, and Van Niekerk 2005; Vanhaeren et al. 2006; Marean et al.
2007; Marean 2011). Further, full symbols refer to categories of referent instances, as
when a nut that resembles a fish is used to refer symbolically to a diverse range of
fish. On the other hand, there is no requirement for these symbols to be “arbitrary”,
and as we will see, their reference is typically based on iconicity.
Figure 1. P ortable contemporary, potential symbols for “faces” all within hard black
walnut shells opened by squirrels and ground squirrels who ate the walnut
meat and left varied shapes through chewing on the shells.
162 Keith E. Nelson
In this chapter I propose the novel hypothesis that the first significant use of full
symbols referring to specific inanimate objects, animate beings, actions, scenes, reac-
tions, or events by any species occurred when members of the species began identify-
ing and sharing found symbols. Figures 1 and 2 provide examples of natural objects
which could readily serve as found symbols, and multiple additional examples occur
in later sections of the chapter.
Figure 2. P ortable contemporary, potential symbol for “vulva,” Cowrie (Cypraea) shell
with natural opening.
clans who had never made a symbol of any kind, and whose skills in signed or
spoken language or visual image creation were absent or very limited, were able
to use natural patterns as the gateway to symbolic communication using a variety
of externally available patterns. This way of entering into symbolic activity would
almost certainly require less sophisticated brains and culture than creating sym-
bolic art through careful planning and tool use. By this account, the creation of
explicit full symbols through tool use may have depended upon and followed after
a period of hominin use only of found external symbols.
If you and I were to reach agreement on “angry face” as the symbolic meaning
of the faces/nuts in Figure 1, we would need to have some back and forth negoti-
ation to establish that shared meaning. Some kind of similar negotiation would
also be required if two or more early hominins were to reach agreement on the
symbolic meaning of a found object. We can speculate on whether early-stage
communicative behaviour involving proto-symbols, imitation, pantomime, and
gesture were involved in such negotiations. Nevertheless, the mere presence of
a shared, transported object found in hominin settlement/activity sites, with no
transparent utilitarian value/function, but with a definite iconic (i.e. resemblance-
based) relation to other objects/beings/actions, raises the distinct probability that
shared, intentional symbolic communication was achieved using the found object
as a symbolic vehicle.
Bouissac (2004: 1) has highlighted the heavy reliance on inferences for interpre-
ting the functions of prehistoric artefacts:
While anthropologists usually can assess the functions of most [contemporary] ar-
tefacts by correlating them with specific, observable behaviours, prehistorian archae-
ologists must construct hypothetical behaviours which can never be verified.
This caveat applies to all prehistoric objects with potential interpretations as
symbolic (Bousissac 2000). In the case of Palaeolithic portable statuettes known
as “Venus figures” what is common for most modern commentators is a set of
inferences that such figures had value sufficient to warrant their manufacture and
transport. As modern authors readily “see” a representation of female human bodies
in these figures, they assume that they also had a similar symbolic meaning 20,000
to 40,000 years ago for the local clans. However, it is hard to demonstrate how the
statuettes fit into religious, planning, ritual, mate selection, or trading aspects of cul-
ture at the time (Sonesson 1994). Note, though, that if these statuettes do represent
female human beings the shared interpretation relies upon a modest set of visual
features that carry the symbolic interpretation – most either lack heads entirely or
incorporate heads with quite limited facial features, but emphasize breasts, hips,
and vulvas. Similarly, the oldest known “found symbol” candidate is a pebble (see
below) that many modern authors readily see as a representation of a person even
though it appears as a body-less face. Again, social negotiation of the particular
meaning-holding between symbol and referent determines whether an object with
only some of the features of a person refers to a whole person, some part/s of a
164 Keith E. Nelson
2. O
bjects with symbolic significance at archaeological
sites/layers dating to 200,000 or more years ago
Here we consider just three natural objects which have been interpreted as having
symbolic value because in their natural state they bear resemblance to human
figures or parts of human figures and because they were transported and valued
by early hominins at established sites for hominin activity. Before our first case of
this sort, it is essential to state again that if sense is to be made of the archaeological
record for symbolic activity it is crucial to use consistent criteria for assigning sym-
bolic values regardless of the age of an artefact/object. Bednarik (2003: 406) drives
home this point strongly.
The greatest schism that has developed in our concepts of hominid evolution concerns
the antithetical positions of “long-range” and “short-range” theories of the cognitive
development of humans. Sometimes called the gradualist and the discontinuist mod-
The Found Symbol Hypothesis 165
els, they perceive two entirely different paths of non-physical human evolution. The
currently still-dominant short-range model, which perceives the use of symbolism,
language, and paleoart to be limited to the last quarter of the Late Pleistocene, survives
by rejecting every instance of earlier evidence of this kind or by explaining it away as
a “running ahead of time.”
The prehistoric object shown in Figure 3 is the Makapansgat jasperite cobble from
South Africa, excavated in 1925. It is dated to 2–3 million years Before Present
(BP) (Bednarik 1998, 2003, 2008). It had been transported to the cave where it was
found, a site occupied at least by Australopithecines but perhaps also by early
hominins. Its resemblance to a human face is noted by virtually all commentators
on the find. But for the most part it has not been integrated into prominent ac-
counts of hominin evolution, instead being either ignored completely or treated
as a strange “curiosity.”
This second prehistoric object (Figure 4) is seen by modern commentators as
highly similar in appearance to a human penis. Dated at 200,000 to 300,000 yr. BP, it
was found near Erfoud, Morocco. This apparent penis symbol is a fossil (cuttlefish)
form that had been transported to a hominin site rich with stone tools. The TanTan
Venus figurine shown in Figure 5, from about 400,000 yr. BP and from Morocco, is
also among the earliest exemplars of objects that may be found symbols (Bednarik
2003). It is a naturally formed stone shape that bears 8 grooves, 5 of which have
been emphasized by impact, and it had been coated in bright red ochre. As with the
Venus of Willendorf and various other Venus figures from the Upper Palaeolithic
there are no facial features. Nevertheless, it readily serves as a potential symbol of
a human figure with a head, torso, arms, and legs (Bednarik 2003).
166 Keith E. Nelson
Figure 4. C
uttlefish fossil resembling a human penis
I propose that we treat these examples of probable symbols in the same way that
most commentators have treated the Venus of Willendorf, the Venus of Hohle Fels,
and other Upper Palaeolithic Venus statuettes: as potentially legitimate exemplars of
active, intentional and socially-shared use of visual patterning in a physical object to
represent an absent referent. The cognitive and social processes for shared reference-
making remain largely the same regardless of whether the origin of a symbol is a found
natural object or a crafted pattern in stone, antler, bone, or wood. Viewers of any such
symbol need only to make sense of the symbol-to-referent meaning; they do not need to
know how an object was procured from nature or created step-by-step by a craftsman.
Figure 5. Th
e Moroccan Tan Tan figurine
The Found Symbol Hypothesis 167
Figures 6 and 7 illustrate two additional objects with symbolic value from con-
temporary natural contexts. Together with all the above examples of found sym-
bolic objects they serve the point that in any local context in which hominins were
evolving there would be an abundance of varied natural objects with high potential
as symbolic vehicles.
that all the necessary conditions for moving from pre-symbolic thinking to rapid
symbolic thinking are: (1) lacking in contemporary nonhuman primates and in
human infants or in early hominins whose brain development had not yet reached
a 600 gram size, but (2) present both for early hominins and for children when brain
sizes (and accompanying sophistication) exceed 600 grams. Taking all the lines of
evidence together within a cognitive semiotics approach, we see then a powerful
basis for interpreting the above instances of symbolizing with found natural objects
as evidence for sophisticated referential thinking rather than some simpler level of
non-symbolic thinking. Further, the integrated lines of evidence lead to multiple
specific predictions about what new exemplars of visual symbolizing in prehistory
can be found if research programs systematically and without prejudice explore
existing and new archaeological collections of objects.
where experimental evidence of symbol acquisition (at quite slow rates compared
to children) has been provided by research, brain size is fairly comparable to that
of human infants at about 2 months, before even the earliest stages of children’s
symbol production or comprehension.
Fossil evidence gives some reasonable estimates of brain sizes for early hominin
species. Assuming that, as in modern humans, infants in such species reached 80 %
of their mature brain size by ~21 months, here are the relevant estimates for three
species using only fossils less than 500,000 BP: H. erectus, 21 mo. 960 cc and 1200
cc adult; H. Heidelbergensis, 21 mo. 1040 cc and 1300 cc adult; H. sapiens neander-
thalensis, 21 mo. 1120 cc. and 1400 cc. adult. In all three of these species, then,
the estimated neural readiness (if supportive conditions arose) for learning to use
symbols by 21 months of age (or later) as judged by brain size alone is remarkably
similar to the ~1000 to 1100 cc typical for extant human infants at 21–26 months.
As reviewed below, for such human infants there is strong and convergent evidence
that rapid learning of many kinds of symbolic forms and symbol/referent relations
is within their capabilities (cf. Imai, this volume).
Authors from many backgrounds have stressed that the above changes in the
size and functional complexity of the brain both across primate species (although
for non-primate species behaviour/brain relations are less clear) and across the
months and years of early human development serve to support increasingly greater
advances in cognitive complexity (Calvin 1989, 2006; Elman et al. 1996; Deacon
1997; Stiles 2001; Diamond 2002; Paterson, Heim, Friedman, Choudhury and Be-
nasich 2006; Schoenemann 2006; Gao et al. 2009; Herculano-Houzel 2009; Stiles and
Lernigan 2010; Sporns 2011; Stringer 2012). In part, this can be understood in terms
of more and more processing units in the brain interacting in far more combinations
and at greater speed to support increasing behavioural complexity in, among other
behaviours, language and other symbolic modes, social cooperative mindful behav-
iours, pattern-detection, planning and problem-solving.
A related important behavioural development by 1.5 to 2 million years BP, with
highly convergent evidence across sites and investigators, is the deployment of
sufficiently sophisticated pattern-detection processes to select rock cores which are
transported and then struck by selected smaller rocks to produce flakes which are
then selected/rejected as adequate for tool use as knifes, blades, and scrapers (Don-
ald 1991; Noble and Davidson 1996; Deacon 1997). Going beyond these intentional,
selective, and shared behaviours to also intentionally select and transport objects
lying about in nature interpreted as shared symbols rather than tools would not
necessarily require any new brain capacities. Instead, rapid symbolic advances could
rest upon the already-existing visual pattern abstraction and sorting, joint attention,
analogic, memory, and social-sharing capacities of early hominin clans.
We further propose that 8- to 12-month-old human infants as well as chimpan-
zees and gorillas have brain readiness sufficient to support the interpretation of
certain selected natural objects such as patterns in shells, rocks, fossils, and parts
of trees as “found” symbolic representations. For wild chimpanzees and gorillas,
it appears that their evolutionary path did not exploit their brain readiness for
170 Keith E. Nelson
4. E
xperimental documentation of symbol learning by
young children
In many different experimental studies children between 12 months and 26 months
of age have been shown to learn symbols for totally invented concepts (e.g. Blicket,
Fiffin, Bloop,) as well as for conventionally-named objects, actions, and events
(e.g. Rabbit, Cat, Lobster, Fish, Hedgehog, Chase). Furthermore, each of the following
kinds of symbols have been acquired by some children in this age range: the conven-
tional signs of signed languages, symbolic gestures, spoken words, written words,
photographs, artist illustrations, pictograms, and arbitrary geometric patterns on
communication boards (Söderbergh 1977; Nelson 1982, 2006: Nelson, Craven, Xuan,
and Arkenberg 2004). The brains of young children at 12 to 26 months are clearly
open to the development of successful symbolic communication through a wide
range of symbolic vehicles.
Consider those children whose first comprehension of certain symbols – such
as a photo of a lobster, understood as representing a lobster, or a photo of a blicket
understood as representing a novel 3-dimensional object – occurred for photo-
graphs (or artist illustrations) when the children reached 15 to 24 months of age.
Typically, children also display comprehension of a linguistic symbol/word for the
same referent. In these cases, it seems evident that both the spoken word and the
two-dimensional photo are serving as “found” symbols whose meaning emerges in
social discourse, but whose origin/manufacture is unclear to the infant. Although
this line of research has not been integrated well with evolutionary discussions
of symbol processes before, it is of high relevance (cf. Zlatev et al. 2013). In line
with other investigators, DeLoache and Ganea (2009) give a clear interpretation
of children’s symbolic behaviour under well-specified experimental conditions:
“some children as young as 15 months of age interpret pictures symbolically, and
children’s appropriate interpretation of the referential nature of pictures increases
gradually with age” (2009: 276).
Highly controlled experimental procedures with human infants and toddlers
are of special importance to the conclusions regarding early symbols and their
precursors in behavioural evolution. These procedures reveal whether the infants
and toddlers have the capability despite brain immaturity to acquire symbols when
given limited, specific sets of learning opportunities. When particular studies have
used invented concepts (e.g. Blicket, Fiffin, Bloop, Bandock, Weedle) with records
of all contexts/durations of exposure to concepts and their symbolic labels, if a
child does learn symbol production and comprehension then the entire experiential
base for that learning is well documented. Surprisingly, by 15 months or before
many children are able to learn a new concept from a single labelled exemplar (ob-
The Found Symbol Hypothesis 171
ject, picture) in an episode lasting just 5 to 15 minutes and are then able to show
knowledge of the concept and its symbolic label when a second or third example
is presented for the very first time many days later. Accordingly, children as young
as 15 months of age are able to abstract the characteristics of a concept along with
its spoken lexical name from a single brief occasion, and carry this information
in long-term memory across 4 days to 2 weeks, and then appropriately apply the
vocabulary item to other, newly-encountered examples (Nelson 1982; Nelson et al.
2004). Some studies only look at learning within one brief session and so do not
test long-term memory, but they also are able to demonstrate that children at 15
to 26 months can map a variety of symbols to novel new objects/concepts: spoken
words, gestures, and pictograms (DeLoache and Ganea 2009; Namy 2009). For some
well-documented children, their language growth at 15–26 mos. is bilingual and
equivalent in rate and complexity for two different modes of language, American
Sign Language and English (Bonvillian and Folven 1993; Prinz and Prinz 1981).
leading to the creation of new symbolic objects, and so potentially could have
occurred before brain sizes and organization in early hominins were adequate for
complex communication sequences in any sign, gesture, speech, art, or multimodal
forms. Individuals and groups with greater symbolic communication success would
be able to better avoid predator attacks, to better find needed shelter, better find
food and water and other resources, and in turn would have higher survival and
child-bearing rates.
To the extent that greater symbolic success rested upon biological differences
in brain size and complexity, then what may have ensued were cycles of co-
evolutionary increases in brain size and complexity accompanied by increasing
use of external symbols in more and more contexts (Deacon 1997; Bednarik 2006).
Across many generations, in certain lineages, the brain size and numbers of neur-
ons and brain organization for certain hominin groups could have reached the
level of modern human infants at approximately 15 to 18 months of age without
any cultural use of syntactically governed signed language, speech, or gesture
(cf. Donald 1991).
This line of argument leaves open the possibility stressed by Tomasello (2008)
that increases in social cooperation, in understanding mental states and social
grounds, and in complex gesturing events were fundamental at some point in the
evolutionary path to syntactically governed signed language and speech. Yet this
account is not inconsistent with the found symbol hypothesis, according to which
found natural symbols also contributed to the origins of complex human language.
It remains for new kinds of future evidence to clarify whether using natural objects
as symbols preceded, accompanied, or followed the origin of complex, socially-
shared gesturing.
One view related to these issues is that for the purposes of archaeology, it is con-
venient to define language as communication using symbols (Noble and Davidson
1996). Accordingly, one possibility is that communication using found symbols or
found symbols combined with symbolic gesturing may have constituted the first
such “language.” Furthermore, the found-symbol hypothesis has the heuristic value
of generating multiple new predictions. Putting the lines of evidence and cross-
disciplinary discussion presented in this chapter together, the following predictions
can be made about likely future findings on hominin physical and cultural evolution.
1. Review of objects in museum collections and discovery of new artefacts at ar-
chaeological sites will reveal many previously unreported objects of symbolic
value which were in no way produced /shaped by hominins, dating to the period
between 2 million yr. BP and 100,000 yr. BP. Thus, potential support for the found
symbol hypothesis could include new findings from multiple archaeological sites
in Africa, Asia, and Europe of portable natural objects which remained unworked
by tools but were transported and saved at locations with other evidence of
hominin activity. These findings would have a bias in the saved shells, stones,
fossils, tree nuts, bones and other objects toward those with particular features/
patterns that strongly resemble and thus can symbolically represent real-world
The Found Symbol Hypothesis 173
ever the outcomes, investigating the availability of found natural symbols through-
out prehistory will enhance evolutionary knowledge of hominin cognition and
communication. By the same token, the observed empirical patterns together with
other new archaeological investigations will allow an evaluation of a central tenet
of this chapter: that natural objects which attained (through social negotiation)
symbolic value were the first form of explicit and specific symbolic referring to
develop among our hominin ancestors.
Acknowledgments
The author thanks the School of Liberal Arts at Penn State University for monetary
and other assistance, and the reviewers and Jordan Zlatev for their helpful com-
ments. In addition, he appreciates the rich discussions with Jill Cook of the British
Museum, Mark Shriver in Anthropology at Penn State University, and Terrence
Deacon in Anthropology at The University of California, Berkeley. Figures 3, 4, and 5
are used with the permission of Robert Bednarik, owner of the copyrights. All other
figures copyrighted and used with the permission of Keith E. Nelson.
Francesco Ferretti & Ines Adornetti
Chapter 10
Mindreading, Mind-travelling and the
Proto-discursive Origins of Language
1. Introduction1
The idea that syntax is the essence of human language is a conceptual construct
that was strongly supported in the 20th century by the fathers of so-called clas-
sical cognitive science. According to Chomsky (e.g. 1980, 1986), the devices at the
foundation of the language faculty are those that elaborate the constituent struc-
ture of sentences, and according to Fodor (1975, 2008), the predicative structure
of the sentence reflects the propositional structure of the Language of Thought.
Underpinning the primacy of the sentence in these authors’ work is a way to
propose a specific conception of language and cognition. In fact, the idea that
language competence is a device that analyses the shape (syntax) of symbols
regardless of their content and the relationship between the uttered expression
and its context is part of a broader conception of how to analyse the study of the
mind in classical cognitive science.
Rejecting the view of cognition as computations on amodal symbols, indepen-
dent of the brain’s sensori-motor systems for the perception of an action, the
standard position of cognitive science has been criticized by action-oriented per-
spectives (e.g. Barsalou 2008; Clark 1997; Varela et al. 1991). These perspectives,
with their anti-cognitivism and attention to bodily experience, have strongly
influenced cognitive semiotics (cf. Zlatev 2012). Following these action-oriented
perspectives, we propose a model of language strictly tied to the paradigm of
embodied cognition. Against the syntax-centred view, in effect, we maintain that
the nature of language (i.e. its functioning and origin) needs to be analysed in
reference to the human pragmatic capacity to build coherent narratives rather
than to the ability to construct syntactic well-formed sentences. Specifically, our
proposal is that narrative abilities are dependent on the ability to (mind)travel
in space and in time and that the narrative foundation of human language pro-
vides important insights to suggest a proto-discursive model of the origins of our
communication skills.
1 This chapter is the outcome of a collaborative effort between the two authors. For the
specific concerns of the Italian Academy, we specify that F. Ferretti wrote sections 4
and 5 and I. Adornetti wrote sections 2 and 3 for the final draft. Both authors wrote
the Introduction and the Conclusion.
176 Francesco Ferretti & Ines Adornetti
theories of communication that are focused on context and the speaker’s intention
(cf. Sperber and Wilson 1986, Chapter 1).
The second thing to stress for our argument is that from the classical perspective,
the production and comprehension of discourse is just a by-product of the produc-
tion and comprehension of single sentences. Since discourse is in fact composed of
a set of individual sentences, from the perspective of classical cognitive science the
analysis of discourse coincides with the analysis of the microlinguistic dimension.
From this point of view, producing and understanding discourse is equivalent to
producing one sentence after another by means, for example, of the grammatical
devices (e.g. the use of pronouns) that provide the links between consecutive utter-
ances. The idea is that the mechanisms that regulate the structure in constituents
within the sentence are the same that also regulate the establishment of links be-
tween sentences in the external flux of speech. The global level of discourse is
attained starting from the analysis of the utterances of single sentences through
a sequential process of accumulation of information. In this sense, information
processing that underlies narrative abilities has a strong “bottom-up” character:
discourse analysis proceeds incrementally, from the local meanings of sentences to
the global meanings of discourse.2
Disputing the priority given to microanalysis, in the following section we pro-
pose that the ability to process discourse takes priority over the ability to pro-
cess sentences. In support of this, we discuss studies from neurolinguistics and
neuropsychology showing that 1) the ability to process sentences (i.e. the capacity
to construct well-formed utterances) is not a sufficient condition to communicate
efficiently and 2) that it is indeed possible to communicate when the syntactic
competence at the basis of sentence construction is disrupted.
3. F
rom microanalysis to macroanalysis: evidence from
the study of pathologies of language
Our argument is based on the analysis of a specific property of discourse and
narration: coherence. In general, coherence can be defined as the conceptual or-
ganizational aspects of discourse at the suprasentential level (Glosser and Deser
1990: 69). Even if Chomsky does not address the issue explicitly, scholars who are
inspired by generative linguistics and who are interested in the study of narrative
processing maintain that the building of the coherent flow of discourse (the basis of
any narrative ability) must be interpreted as a bottom-up process driven by syntac-
tic parser functioning. Kintch and van Dijk’s (1978) construction-integration model
is a good example in this regard. Specifically, the theoretical models that equate
language with grammar and linguistic processing with sentence processing ex-
plain discourse coherence in terms of the linear relations of cohesion between con-
secutive sentences (e.g. Halliday and Hasan 1976; Reinhart 1980; for a discussion,
2 For discussion and criticism on this point, cf. Cosentino et al. (2013).
178 Francesco Ferretti & Ines Adornetti
cf. Giora 2014).3 In a text, cohesive relations are accomplished through grammatical
and lexical elements (Halliday and Hasan 1976). Grammatical cohesion includes
elements such as reference (inside and outside the text, respectively endophoric
and exophoric reference), substitution, ellipse and conjunction; lexical cohesion is
based on reiteration (e.g. repetition, synonymy) and collocation (i.e. co–occurrence
of lexical item). An example of a cohesive text is the sample of discourse shown
in (1), in which the sentences (a), (b) and (c) are connected through the use of pro-
nouns (a case of grammatical cohesion):
(1) (a) They managed to catch him. It was an all-out abuse. They abused him, and
I don’t think something was done about it. (b) They put him in the toilet, I re-
member the soldier, I remember, he was a friend of mine, a friend from the com-
pany. (c) And he took pride in shoving the kid’s head into the toilet. (Anonymous
[Sergeant] 2000, in Giora 2014: 143).
For the purpose of our argument, it is important to highlight that from this per-
spective, cohesion is conceived as a pre-requisite for coherence (see Daneš, 1974,
1987). The basic idea is that for a discourse to be coherent, its sentences must be
cohesive. The coherence of a text, in fact, is not a given, but rather a product ob-
tained through cohesive ties. These ties help to ensure the unity of the text and act
as signals that the speaker offers to the listener marking the way the listener should
follow in order to interpret the verbal utterances in a coherent way. The idea, in
other words, is that discourse coherence relies on linguistic elements and capacities.
Now, although cohesive relations have an important role in the expression and
recognition of coherence, we argue that the cohesion between consecutive sen-
tences is not a necessary condition for narrative coherence. Our claim is rather that
cohesion is the superficial expression of a deeper level of coherence that concerns
cognition prior to language production (Adornetti 2015). A crucial distinction is
between global discourse coherence and local discourse coherence (cf. Glosser and
Deser 1990). Local coherence refers to the conceptual links between consecutive
sentences or propositions that maintain meaning in a text or discourse. Global
coherence refers to the overall conceptual organization of the flux of speech; it refers
to the manner in which discourse is organized with respect to an overall goal, plan,
theme, or topic. As shown by the text in (1), cohesion contributes to local coherence.
But is local coherence a necessary condition for global coherence? Theoretical ar-
guments and empirical evidence suggest that global coherence does not depend on
the local coherence. Consider the “text” shown in (2).
(2) I bought a Ford. The car in which President Wilson rode down the Champs Élysées
was black. Black English has been widely discussed. The discussions between the
3 Even if Halliday and Hasan (1976) do not equate coherence and cohesion, they
are among the scholars that agree that cohesion — a grammatical phenomenon,
and for this a surface structure phenomenon — b oth reflects and enables discourse
coherence.
Mindreading, Mind-travelling and Origins of Language 179
presidents ended last week. A week has seven days. Every day I feed my cat. Cats
have four legs. The cat is on the mat. Mat has three letters (Enqvist 1978: 110–111).
In this text the sentences are connected through the cohesive mechanism of repeti-
tion. However, the set of sentences is not perceived as a coherent whole because
the sentences do not hang together in any reasonable way. Simultaneously, it can
be argued that this text is just an artificial construction that does not reflect how
human beings communicate with each other. But let us consider the conversation
produced in an actual communicative situation shown in (3).
(3) C: I admit this government we’ve got is not doing a good job but the unions are
trying to make them sound worse than what they are
T: mm
C: they . they . cos I’m a Tory actually but I do vote . if there’s a . er . a communist
bloke there I will vote communist but . it all depends what his principles are but I
don’t agree . with the Chinese communism . and the Russian communism
T: right
C: but I believe every . should be equal but . I’m not knocking the royal family
because you need them
T: mm
C: and they they they bring people in to see take photos (from Perkins et al.
1995: 304)
Despite the local sequential links (and at least a degree of local coherence) between
trade unions/government, government/Tory, Tory/communist, communism/Chinese/
Russian communism, communism/ equality, equality/ Royal Family, Royal Family/
tourist attraction, C shows a form of “topic drift”: he is unable to monitor what has
already been talked about or to relate each individual utterance to some overall
coherent plan or goal.
As mentioned, the text in (3) is produced by a person in an actual communicative
situation. Specifically, it is produced by a subject with brain injury. Indeed, neur-
olinguistic research has shown that in several neurological populations, such as
schizophrenic patients, traumatic brain injury subjects and patients with Alzheimer
disease, there is a dissociation between the abilities that underlie sentence process-
ing (microanalysis) and those that underlie narrative processing (macroanalysis)
(e.g. Dijkstra et al. 2004; Glosser and Deser 1990; Davis et al. 1997; Marini et al.
2008; Marini et al. 2014). Specifically, these patients correctly connect sentences
by using cohesion ties (grammatical devices), but they are unable to construct and
maintain the global coherence of their verbal productions: they cannot relate the
individual sentences to a plan or to a more general purpose, and they often intro-
duce material that is irrelevant to the current context in their verbal productions.
Because of such deficits in coherence, these patients are unable to communicate
180 Francesco Ferretti & Ines Adornetti
in an effective way, despite the fact that their capacities to construct well-formed
sentences are relatively preserved. Interestingly, it has also been shown that aphasic
subjects with syntactic deficits and problems with the construction of well-formed
sentences, did not suffer such a pragmatic deficit and were able to produce coherent
discourse (Gloser and Deser 1990).
To conclude this part dedicated to language functioning, we underline two points
relevant to our argument: First, discourse coherence is an essential property of
language because it is a necessary condition for communicating in an efficient
way. Second, the construction of global coherence in a narrative is not reducible
to cohesion, that is to say, the macrostructure of a narrative discourse cannot be
formally derived by the microstructure of the sentence.
the clues model is coincidental with the analysis of the cognitive architectures that
allow the receiver to infer the content the speaker intends to communicate. Now,
a model of communication focused on the role played by the speaker’s intention
in production-comprehension processing conforms to the idea that the linguistic
processes are driven by a mindreading cognitive system. It is exactly this kind of
cognitive system to which Sperber and Wilson make reference in order to explain
the transition from animal communication (founded on the code model) to human
language (Sperber 2000; Origgi and Sperber 2000; Sperber and Origgi 2010; see also
Scott-Phillips 2014). So far, so good.
Because of the importance we attribute to the discursive nature of human lan-
guage, the point to analyse here is the question of whether the clues model and the
mindreading system can be considered as sufficient conditions to explain human
narrative abilities. The answer Sperber and Wilson give to the question is explicit
and peremptory: as relevance is the principle of human communication that can ex-
plain any feature of language functioning and origin, even global coherence has to be
interpreted in terms of relevance (global coherence is a derivate notion of relevance).
However, contrary to Sperber and Wilson’s hypothesis, Giora (1997, 1998) con-
vincingly shows that relevance cannot be the only principle that governs human
communication. The relevance principle, in fact “can by no means replace current
accounts of discourse coherence since it is neither necessary nor sufficient for
text well-formedness” (Giora 1997: 17). A useful example for understanding the
coherence-pertinence distinction is the case of situations in which it is possible to
distinguish between discourses characterized by different degrees of coherence (as
in 4a and 4b)
(4a) The first time she was married her husband came from Montana. He was the kind
that when he was not alone he would look thoughtful. He was the kind that knew
that in Montana there are mountains and mountains have snow on them. He had
not lived in Montana. He would leave Montana. He had to marry Ida and he was
thoughtful (taken from Ida by Gertrude Stein).
(4b) The first time she was married her husband came from Montana. He was the kind
who loved to be alone and thoughtful. He was the kind who loved mountains, and
wanted to live on them. He loved Montana. But he had to marry Ida and leave
Montana (Giora, 1997: 26).
Giora’s view is that the difference in coherence between (4a) and (4b) cannot be
explained in reference to the principle of relevance. In fact, while the segments of
discourse are both relevant (according to Sperber and Wilson’s definition) “they
nevertheless differ drastically in terms of coherence” (Giora, 1997: 26): (4b) is more
coherent than (4a). According to Giora (1997: 22), the general conclusion that can
be drawn from these considerations is that “coherence is not a derivative notion”.
The stance in favour of the explanatory autonomy of coherence is grounded in
the idea that the narrative dimension of language relies on the identification of the
causal links that regulate the segments of discourse: discourse coherence, which is,
Mindreading, Mind-travelling and Origins of Language 183
sation, but that the intentions themselves can change because of the reciprocal
relationship. Such a reciprocal relationship among intentions, guided by a principle
of coherence, represents the thorn in the side of the clues model. From these con-
siderations, it follows that the primary reason why discourse coherence cannot be
reduced to relevance is that the processing system that grasps the evidence of the
speaker’s intentions cannot account for the temporal dimension of conversation. If
such temporal dimension appears to be a necessary condition for the flux of speech,
then mindreading cannot be considered the only processing system on which to
base the functioning of human communication.
alone.5 But there is more: arguing that temporal navigation is involved in the origin
of narrative abilities inevitably means referring to the idea that spatial navigation
is involved too.
There are anatomical and functional reasons for the necessity of the involve-
ment of spatial navigation in temporal navigation. From an anatomical point of
view, the close link between space and time representation is well demonstrated
by brain structure (Corballis 2013). The discovery of place cells allowed O’Keefe
and Nadel (1978) to argue that the hippocampus is the basis of spatial cognition in
rodents and is the substratum for episodic memory of humans (Dudchenko 2010;
Assmus et al. 2005; Assmus at al. 2003; Oliveri et al. 2009; Parkinson et al. 2014).
The neuroanatomical connections between space and time are commonly used to
justify the close relationship between space and time also from a functional point
of view. According to the proponents of the “spatial representation account”, in fact,
as they “occupy an overlapping temporo-spatial representation” (Cai and Connell
2015: 269), space and time cannot be considered as separate entities (Stocker 2014).
The paradigm of reference is represented by the idea that the close connection
between space and time rests on a profound asymmetry where space is primary.
The starting point of the perspectives that refer to the “spatial metaphor” is Lakoff
and Johnson’s conceptual metaphor theory (1980; Gibbs 2006). When talking about
time, speakers of many languages use spatial metaphors, saying things like the
future is in front of us and the past is behind us (or vice versa in some cultures) as a
way to conceptually interpret abstract entities in reference to more concrete entities.
According to Lakoff and Johnson, the ability to talk about time using space is the
surface effect of a deeper phenomenon: the spatial metaphor is actually the product
of our ability to think about time by means of space. Considerable experimental
evidence supports the priority of space over time, and therefore an asymmetric
interpretation of their relationship (Casasanto and Boroditsky 2008; Merritt et al.
2010). From these considerations, it is possible to argue that the primary source
domain in order to analyse the human narrative capacity is the navigation in space.
5 For the role of time in the evolution of language see also Cosentino (2011) and
Gärdenfors and Osvath (2010).
Mindreading, Mind-travelling and Origins of Language 187
What happens in real navigation is never equivalent to the straight path drawn on
the map: the actual movement in space requires a continuous realignment of the
goal because of the difficulties posed by the harshness of the environment. In a very
similar way, the process of discourse construction also relies on the ability to identify
a goal (the content that the speaker intends to convey to the listener), to construct
a route to this goal and to stay on track. Like navigation in space, the achievement
of the communicative goal depends on the continuous realignments implemented
by speakers to rebuild the route in the face of continual digressions imposed by the
different points of view typical of verbal communication (Ferretti 2014; Ferretti and
Adornetti 2011; Ferretti et al. 2013). Building the route and maintaining the right
trajectory to the goal is equivalent, in narrative terms, to building and maintaining
the global coherence of discourse.
The hypothesis (at the foundation of spatial metaphor) that the more abstract
knowledge domains are interpretable in terms of more concrete knowledge domains
is of great value in order to understand the construction of a coherent discursive
flow. The idea that time navigation is grounded on space navigation, in effect,
allows a step toward the opportunity to understand the nature of the properties
required from the cognitive elaboration of the flow of speech. The description of the
temporal organization of event sequence that, as we said, forms the backbone of the
narrative, has to be guaranteed, not only on the level of the internal relationship
between discourse segments, but even on the level of the external relationship be-
tween the narrative plan and the flow of events that represent the core knowledge
of individual experiences.
The temporal relationship between the segments of discourse, in effect, cannot
be considered in abstract terms alone: if time represents the key element of the
narrative texture of the clues in the expressive speech flow, the spatial metaphor
helps make the speech flow congruent with the flow of events narrated (Ferretti
2014). For this reason, spatial and temporal navigation represent the basic meta-
phor of the discursive nature of human communication. Such arguments lead us to
propose that the extension of the clues model of communication must be linked to
mindtravelling systems in space and time. That said, what kind of evidence could
we offer to justify the involvement of navigational systems in the processing of
discourse coherence?
Schizophrenic derailment serves as a suitable reference point for an analysis that
examines one of the basic building blocks of human narrative capabilities in the
ability to stay the course of speech. Disturbances of the speech of schizophrenics
are a textbook case of the loss of coherence in discursive abilities (Marini et al.,
2008). Although to our knowledge there are no experimental data on the direct
causal relationship between navigational systems in space and time and deficits in
schizophrenic global coherence derailment, experimental data related to the dif-
ficulty in time projection (Peterburs et al. 2013; D’Argembeau et al. 2008) together
with data related to the difficulty with space projection (Weniger and Irle 2008) of
these individuals seem to support a causal link between the navigation systems
188 Francesco Ferretti & Ines Adornetti
and the construction of the flow of discourse (for a discussion on the relationship
between schizophrenia and MTT, cf. Cosentino 2011).
The analysis of the narrative foundation of our communication skills is a useful
tool to hypothesize the protodiscursive origin of human language. In fact, it is in
reference to a perspective of this kind that the intent to extend RT by linking the
clues model of communication with the narrative perspective of language shows
its explanatory power regarding the issue of the origins of language (Ferretti 2014).
If the ability to maintain the route in navigation can be seen as the condition for
the construction of the flow of discourse in human communication, we have good
reason to think that the clues model (and the mindreading system strictly tied to
it) must seek an ally in the navigation systems in space and time. It is only through
projections in space and time that the expressive clues produced by our ancestral
relatives earn a significant distinction from the signals produced in animal com-
munication. From this order of argument, it follows that the transition from the code
model to the clues model is not a sufficient condition to ensure the transition from
animal communication to human language: the reasons we used to maintain that
RT has to be extended in order to account for the functioning of language are the
same reasons that lead us to maintain that RT needs to be extended and integrated
in order to also explain the origin of language.
6. Conclusion
In this chapter, we have argued that the narrative foundation of human language is a
useful tool to investigate the functioning and the origin of our communication skills.
At the basis of our hypothesis is the idea that the production and comprehension of
sentences is not a sufficient condition for an effective communication and that the
primacy usually assigned to the study of sentence grammar must give way to the
investigation of discourse pragmatics. A confirmation of our hypothesis is the fact
that discourse coherence is a property reducible neither to cohesion nor to relevance.
While the relevance principle probably represents a necessary condition for under-
standing the birth of our communicative skills, nevertheless it is not a sufficient con-
dition to account for the narrative texture of the flow of speech, and hence nor can it
be a sufficient condition to explain the origins of language as a whole.
From the point of view of cognitive semiotics, the conclusion to be drawn from
these considerations is that discourse processing requires additional devices beyond
mindreading, as well as very different devices than those implied in the analysis
of constituents of the sentence. In line with an action-oriented perspective of cog-
nition, we have argued that the basis of the ability to produce and comprehend
discourse is located in cognitive-semiotic systems that allow individuals to navigate
through space and time. The experimental data from the pathologies of language
concerning the processing of global coherence lead us to propose that the metaphor
of navigation we have assumed as a key explanation of human narrative abilities
is more than a simple metaphor.
Alessandra Chiera
Chapter 11
From Conversation to Language:
An Evolutionary Sensory-Motor Account
1. Introduction
This chapter offers a holistic model of language evolution, namely a model in which
the specific dimension of conversation, rather than representing a late product of
evolutionary history, marks the first stages of human communication. On the basis
of this model, it is suggested that a specific pragmatic function characterizing the
conversational context, namely alignment (Pickering and Garrod 2004), might have
fostered linguistic communication. The concept of alignment refers to the coor-
dination of situation or mental models (Zwaan and Radvansky 1998) with under-
lying dialogue, and is believed to be achieved by a primitive mechanism. In the same
vein, Zlatev and colleagues (Zlatev and Andrén 2009; Zlatev 2013) pinpoint a specific
stage in the development of intersubjectivity close to the “proto-conversations”
capacities outlined by Trevathen (1979): proto-mimesis. Such a primary phenomenon
based on perception/action systems allows empathetic engagements that could be
considered as a first stage in semiotic development (Zlatev 2013); in other words, the
social-semiotic nature of language rests on several bodily abilities that are involved
in increasingly more complex forms of intersubjectivity (Zlatev 2008a).
A methodological claim at the base of this work is that an evolutionary theory of
language should be constrained by an empirical cognitive account of linguistic use.
To this extent, the question about the evolution of language and the question about
the functioning of language are deeply intertwined. Thus, the model of language
evolution provided here is strictly tied to an empirically plausible model of language
functioning. This methodological premise is deeply connected to the “conceptual-
empirical spiral” (Zlatev 2015a) according to which addressing what something is
requires addressing how something develops in ontogeny and evolves in the species.
Assuming this basic commitment, the reasons why the context of conversation
represents a driving force in semantic elaboration are first discussed. Later, these
arguments are linked to the evolutionary scenario, suggesting that the pragmatic
processes involved in the elaboration of holistic contextual factors were also im-
plicated in language evolution. In the last section, the focus is on the specific role
of conversational alignment, claiming that in our ancestors it could have been sup-
ported by action-perception mechanisms. Combining a top-down model of language
evolution with a bodily grounded account of cognition (Varela et al. 1991; Barsalou
2008), the chapter finally restates the necessity for a proto-conversational model
of language evolution.
190 Alessandra Chiera
In this respect, the two-step model appears to falter in explaining language pro-
cessing in real-time conversation. In opposition to its predictions, the conversational
context has a primary role in building meaning and, by means of a constant adjust-
ment process, drives comprehension among speakers (Altmann and Kamide 1999;
Nieuwland and van Berkum 2006). These indications suggest that meaning can be
defined as a constructive and context-dependent phenomenon (Gibbs 1994; Clark
1996). As predicted by the single-step model, there is not a priority encoding of local
semantic information but rather the global context conditions the interpretation.
From this point of view, bottom-up and top-down processes constantly interact,
thus closely intertwining semantics and pragmatics (Carston 2008).
Within the classical computational framework, the global level of conversation is
achieved by combining the sum of single propositions. On the contrary, according
to the reviewed empirical evidence, conversation is largely a top-down phenom-
enon, in which a situational interpretation at the global meaning level guides local
interpretation (Cosentino et al. 2013). If language use implies the construction of a
situational interpretation at the global meaning level which has effects on the inter-
pretation of a linguistic expression, then such an expression is not isolated but is
strictly tied to the joint actions performed by speakers. In this view, communication
is a collaborative process grounded in the interlaced work of speaker and listener,
entailing an integrated account of bottom-up and top-down perspectives.
Why should these results be relevant to the issue of the evolution of language?
The idea is that the implications of these studies can be used to make a more gen-
eral claim: they suggest that there is no stage of linguistic interpretation that is
completely independent of contextual information, that is, meaning is immediately
contextualized. This claim implies that in order to account for language evolution
we should focus on how context constrains linguistic interpretation, which in turn
implies focusing on the pragmatic abilities of our ancestors. In the following sections,
these abilities are analyzed, along with the possible mechanism that underlies them.
tors could have constrained the interpretative processes. Investigating this issue
implies looking into a crucial preliminary matter, that is, what initial (linguistic)
communication may have looked like. On this point, recent literature on inter-
subjectivity (e.g. Zlatev et al. 2008; Fusaroli et al. 2014) proposes that the linguistic
processes must be examined at the interpersonal level, where the complex social
multimodal interactions display specific dynamic properties. Similarly, Tomasello
(2008) argues that the early forms of communication arose from collaboration with
others in collective activities and, thus, can be interpreted as joint actions. From
this point of view, the problem of explaining how initial communication could work
can be rephrased as the problem of explaining how individuals begin to engage in
joint actions. This problem can be broken into at least two main sub-problems, one
related to the concept of “jointness” and the other to the notion of “action”. Starting
with the latter, the problem is to account for how we understand other people’s
actions. Once we have answered this question, the second problem to be dealt with
is analysing how we realize joint actions.
Thus, the main point so far has been that contextual information is immediately
used in a top-down fashion to interpret the meaning of linguistic expressions. The
proposal of the present chapter is that, in order to extract relevant contextual in-
formation, individuals rely on the same processes by which they interact with the
physical environment, that is action and perception processes. In support of this
claim, Cosentino and colleagues (2014) conducted an experimental study to evaluate
the processes involved in the semantic integration of a sentence in a discourse.
They showed that sensory-motor information is recruited during the process of
sentence meaning composition to extract “ad hoc affordances”, namely dispositional
properties of objects in a given situation that are based on a novel, context-specific
function (Barsalou 1993). For example, the researchers built a situation in which
an ad hoc affordance for funnel was induced such that a funnel could be used to
hang a coat. As ad hoc affordances are contextually induced, their immediate inte-
gration during the process of sentence meaning composition suggests that there
is no principled temporal or functional precedence of local constraints over global
contextual factors. Specifically, the authors link these top-down effects with a mech-
anism of affordance perception, that is, the mechanism by which an individual detects
an opportunity for action suggested by an object or the environment given her
particular bodily structure (Gibson 1979).
Accordingly, Amoruso and collaborators (2013) pinpointed a close link between
sensory-motor cognition and language. In particular, they emphasized that action
meaning and language meaning elicit similar N400 modulations. In their view, it is
plausible to interpret these outcomes within a grounded account of the N400, as the
retrieval of sensory and motor information modulates meaning-related processes
indexed by this component. In this respect, meaning is a situated phenomenon
embedded in prior experiences with the world and shaped by predictions derived
from contextual ongoing information and previous knowledge.
Within a motor account of social cognition, it is possible to fill the gap between
language and action by assuming that the ability of actively perceiving others’
From Conversation to Language 195
speaker’s intentions. In this respect, communication arises from the joint activity of
speaker and listener who are engaged in several coordination problems at many le-
vels (Levinson 2000). The central problem consists of coordinating what the speaker
means and what the addressee understands him to mean by trying to converge on
shared situational models. That is the so-called process of alignment (Pickering and
Garrod 2004). This is the framework wherein the initial communication has been
defined as a form of joint action that implies not only the coordination of practical
activities but also the convergence of meanings.
In the Sperber and Origgi account, language evolution can be explained in as-
sociation with the evolution of a sophisticated mindreading capability, characterized
by fast propositional attitude attributions. However, it is unlikely that the ability to
attribute propositional attitudes was already in place, particularly as present-day
children acquire communicative skills before developing such ability (Zawidzki
2013). A more parsimonious account is that the construction of a shared mental
space involved in building joint communicative actions is based on sensory-motor
mechanisms. More specifically, the affordance perception device described in the
previous section might have a specific role in leading to the key process of align-
ment. How does it complete this task?
Perceiving affordances represents a crucial step in the process of understanding
both one’s own intended actions and the potential actions of the agent with whom
one is interacting. Thanks to this process, people can make predictions and estimates
of the social consequences of those actions; hence, the following performed actions
become social clues which can confirm or not confirm the agent’s expectations
(Cosentino 2014). From this perspective, the affordance perception mechanism,
providing a tool for an immediate comprehension of other people’s intentions,
can serve as a basis for simple forms of alignment. Affordance perception permits
predictions of others’ intentions that in turn condition the planning of one’s own
actions: throughout this adjustment process, the interpretation may be refined
moment-by-moment, driving the convergence between interlocutors towards a
conceptual form of alignment.
In support of this hypothesis, Linkenauger and colleagues (2012) have recently
suggested that some of the social and motor impairments observed in many indi-
viduals with Autism Spectrum Disorders (ASD) may be attributable to an impaired
mechanism of affordance perception. Whether these impairments are different defi-
cits or have a common origin is very controversial. Despite that, some indications
achieved with well-established experimental paradigms provide interesting data in
support of the hypothesis that a difficulty in the perception of affordances is the
putative mechanism underlying both social and motor impairments. As previously
suggested, an impairment in perceiving one’s own affordances may also affect the
ability to perceive the action possibilities of other individuals; such a difficulty
might be a crucial obstacle in the comprehension of others’ actions, leading to
serious limitations in social domains. It is plausible that these limitations may have
effects on the ability to align with people, affecting the pragmatic skills essential
for communication. Indeed, despite the fact that strictly linguistic alignment has
From Conversation to Language 197
been found to be intact in people with ASD (Slocombe et al. 2013; Hopkins et al.
2015), converging with an interlocutor merely at the linguistic level (for instance,
at the syntactic level) does not ensure convergence at the level of situation models.
In accordance with a pragmatic inferential account, it is the situational alignment
which is crucial for successful communication.
Returning to the evolutionary scenario in light of the considerations developed
above, it is plausible that the ability to read others’ intentions based on the interpre-
tation of their actions is necessary for a structured code to develop. The development
of a shared space paves the way for sophisticated forms of mental attunement (Toma-
sello 2008) that might have first sustained communication with natural gestures such
as pantomime and then acted as an infrastructure on which conventional linguistic
communication could rest. To this extent, the simple mechanism of affordance per-
ception can provide an answer to some critical considerations raised by Origgi and
Sperber (2010) among others. The problem they address concerns the convincing claim
that for coded communication to work, speaker and listener have to share exactly the
same code. Any difference between the speaker’s and the listener’s code is likely to
cause some errors and, consequently, to compromise the success of communication
becoming counter-adaptive. In evolutionary terms then, in order to be advantageous
the code has to undergo modifications which do not structurally modify the pre-
existing code of expression. Within the present framework, the situation would be
very different. The alignment model based on affordance perception does not assume
that a code needs to be shared. Instead, the ability to build a shared space allows
communication to make predictions of the speaker’s meaning despite fragmentary
and ambiguous coding. A linguistic code can arise exactly “from affordances that are
brought forth by active engagement, and which enable further action and interaction”
(van Lier 2006: 146). Conversely, once such a poor and fragmentary code emerges, it is
likely to play a major role in semantic alignment. Fusaroli and colleagues (2012) used
an experimental design to investigate the influence of public language in performing
joint tasks. They asked pairs of subjects to cooperate using language in order to carry
out a perceptual test. The findings showed that the more the pairs aligned in their task-
relevant linguistic behaviors, the higher the level of their task performance. Overall,
these indications suggest that there is a close relationship between language and joint
actions: language meaning is built within the intersubjective space and this space, in
turn, develops through language, in a dialectical manner (Zlatev 2008a). This complex
bind represents a privileged key to the origin and evolution of human communication.
6. Conclusions
In this chapter, the evolution of human language was addressed alongside the devel-
opment of action-based abilities of alignment. The concept of alignment paves the
way for a model of language as a joint action that characterizes early communi-
cation in proto-conversational terms. Indeed – starting from the idea that the origin
of language has to be interpreted from a top-down perspective, that is, with respect
to the effects elicited by the situational context on the interpretation of an ex-
198 Alessandra Chiera
pression – the core of language does not rest on an isolated system or property but
rather is strictly tied to the joint activities that led speakers to coordinate both prac-
tical and conceptual actions. This suggests the need to study early communication
with regard to broader capacities that determine the ability to elaborate contextual
variables and to establish a common ground with other people. In this framework,
conversation can be defined as a dynamic interactive exchange, situated within
a jointly determined – and constantly evolving – semiotic system. Interestingly,
even at present in coded and well-structured language, conversation keeps similar
features in continuity with the first forms of communication.
It is proposed that a specific role might have been played by a basic mechanism
grounded in the sensory-motor experience of individuals: affordance perception.
Within the traditional computational model, the contextually varying top-down
influences of the conversational situation are hard to explain. On our account,
the mechanism of affordance perception mediates between the top-down aspects
of language and the sensory-motor nature of cognition, providing a tool for the
definition of key processes as conversational alignment. It is plausible to conceive
an evolutionary scenario where affordance perception started by guiding the co-
ordination of practical activities and then advanced to guiding the alignment of
more complex semiotic actions that involved the understanding of conventional
and normative elements. Language might have arisen within this context, making
us the extraordinary species that we are.
Acknowledgements
This work has been conceived and discussed with Erica Cosentino. I am really
grateful to her for her valuable comments. We presented an early version of this
paper at IACS 2014 in Lund; we would like to thank the audience of the conference
as well as the reviewers of an earlier version of the text for the constructive feed-
back provided.
Serena Nicchiarelli
Chapter 12
Protolanguage as Formulaic
Communicaction
1. Introduction
One of the major debates in evolutionary linguistics is concerned with the nature of
early protolanguage and its transformation into modern language. In that respect,
two competing models have given rise to a lively debate: the synthetic account (Bick-
erton 1990, 2010; Tallerman 2007, 2010), in which word-like units are eventually
composed into sentences, and the holistic account (Arbib 2005; Wray 1998, 2002), in
which sentence units are broken apart into words. The fundamental distinction be-
tween the two accounts lies in the initial conditions, namely in the nature and com-
plexity of the meanings associated with basic units of protolanguage (Smith 2008).
In Section 2, I introduce these two competing accounts and present a proposal that
relies on a notion of protolanguage that is more consistent with the holistic account.
However, a protolinguistic code in which every single signal is strictly associated
with only one atomic meaning, independently of the context, cannot be assumed
to be the evolutionary starting point. The emerging verbal communication could
not be useful unless initial linguistic expressions were contextually constrained.
Hence, protolinguistic abilities need to be explained with regard to those cognitive
systems that allow agents to act in their own environment: “Language can best be
understood as a device which refines an already complex system – it is to be ex-
plained as a ‘recently’ evolved refinement of an underlying ability to interact with
the environment” (Arbib et al. 2014: 62).
More specifically, my proposal is that hominin protolanguage was significantly
related to structures underlying the capacity to perform and recognize complex
communicative goal-oriented actions. In Section 3 I address the issue of a poss-
ible pantomimic foundation of protolanguage, in line with the models proposed
by Donald (1991) and Arbib (2005). In order to evaluate the plausibility of this
proposal, in Section 4 I present an analysis of formulaic language, in line with
the idea that modern holistic processing strategy represents a legacy of ancestral
holistic-protolinguistic ability. The analysis is centered on the description of the
functions and properties of formulaic language in actual dialogical interactions, thus
highlighting the role of formulaicity in pragmatic tasks (e.g. Gibbs, 2007; Kecskes
2014). Finally, I consider clinical data derived from studies on productive language
in Alzheimer’s and Parkinson’s diseases in support of the presented hypothesis of
a holistic communicaction protolanguage.
200 Serena Nicchiarelli
According to one specific stage-based evolutionary model (Arbib 2005a, 2005b, 2006,
2012), a “key neural missing link between the abilities of our non-human ancestors
[…] and the modern human capability for language” involves the mirror-neuron
system “with manual gestures rather than a system for vocal communication pro-
viding the initial seed for this evolutionary process” (Arbib 2006: 6). On this account,
the mirror system for grasping was crucial to the early stages of protolanguage
and its extension represents the decisive step that enabled the development of the
ability to imitate:
Imitation is seen as evolving via a so-called simple system such as that found in chim-
panzees (which allows imitation of complex “object-oriented” sequences but only as
the result of extensive practice) to a so-called complex system found in humans (which
allows rapid imitation even of complex sequences, under appropriate conditions) which
supports pantomime (the artless sketching of an action to indicate either the action
itself or something associated with it). (Arbib 2005a: 106)
According to Arbib, the “language-ready brain” is the result of the following steps
or stages: (1) grasping, an ability based on cerebral mechanisms that allow animals
to interact with objects, and (2) a mirror system for grasping, linked to the compre-
hension of actions performed by others. Such a system represents the foundation for
the development of a simple imitation system for object-directed grasping through
much repeated exposure. Thereafter, (3) evolution furnished the hominin line with
the emergence of complex imitation system, namely the ability to recognize an-
other’s performance as a set of familiar actions and then to repeat it. This is a crucial
step in the evolution of communication insofar as it paves the way to (4) a manual-
based communication system, initially founded on mimesis or pantomime (Donald
1991; Corballis 2011; Zlatev 2008a, 2014b). The transition from the ability of complex
imitation to pantomime is granted by a critical neural change: the activation of the
mirror-neuron system also in “intransitive” acts, in which the action is not oriented
toward (immediately present) objects.1 Similarly, in pantomime the audience needs
to infer the meaning of the action from observation of the movement in isolation.
Pantomime is typically performed with the intention of getting the observer to
think of a specific action or event; it does not require agree-on conventions and
is holophrastic. However, as pointed out by Arbib (2003: 13), it has its drawbacks:
…it may be quite long, limited, and energetically costly. So, the range of communication
can be greatly improved by the development of conventions on the use of gestures
that do not directly pantomime anything, but instead are developed by a community to
refine and annotate the more obvious forms of pantomime – forms which themselves
would became increasingly ritualized with use. The notion, then, is that the manual
1 Mirror neurons for grasping in the monkey brain will fire only if the monkey sees
both the hand movement and the object to which it is directed. A grasping movement
that is not made in the presence of a suitable object, or is not directed toward that
object, will not elicit mirror neuron firing (Umilta et al. 2001).
Protolanguage as Formulaic Communicaction 203
ment of voluntary movement, and at the same time a strongly reduced production
of formulaic sequences in spontaneous conversations. As reported by Altman and
Troche (2011), PD affects firstly the comprehension of non-verbal communication
and the identification of non-verbal meaning. Overall, people with PD show signifi-
cant impairment at the level of discourse (therefore, at the level of complex meaning-
making), resulting in reduced informative content, compromised discursive fluency,
substantial difficulties individuating main conversational topics, and considerable
incompetence in identifying contextual information (Altman and Troche 2011).
In sum, the findings show that the proper language functioning of both verbal
and non-verbal communicative competence is compromised by impairments in the
cognitive systems that support the production of formulaic language. Moreover, in
line with our hypothesis, these cognitive systems are the same mechanisms that
manage complex goal-oriented action.
6. Conclusions
In this chapter, I supported a version of the holistic protolanguage hypothesis which
sees it as (a) emerging from conventionalized pantomime and (b) consisting of con-
ventionalized vocal gestures, i.e. formulaic utterances. The combination of these
features is what I refer to as “communic-action”. Evidence for this hypothesis is that
such communic-action is still prominent in everyday human interaction.
Furthermore, clinical studies show an important dissociation: the linguistic
abilities in Alzheimer’s disease, where the basal ganglia and other structures that
support practical goal-oriented actions are spared, reveal a significantly higher
incidence of formulaic expressions. On the other hand, a pathology that involves
dysfunctional basal ganglia, Parkinson’s disease, is associated with significant defi-
ciencies in the use of formulaic language and non-verbal communication, as well
as discourse in general. Since formulaic language could be considered as a set of
goal-oriented verbal motor gestures, it is likely that it represents a legacy of the
ancient holistic processing strategy, supporting the hypothesis of a holistic proto-
language as communicaction.
Overall, the present work can be seen as contributing to cognitive semiotics by
suggesting how present day language, “a conventional-normative semiotic sys-
tem for communication and thought” (Zlatev 2008a), may be rooted in an action-
based account, which in line with enactivism sees meaning as arising from the
bi-directional relation between the organism and its environment (Thompson 2007).
Part III.
Meaning across Media, Modes and
Modalities
Cornelia Müller
Chapter 13
From Mimesis to Meaning: A Systematics
of Gestural Mimesis for Concrete and
Abstract Referential Gestures
1. Introduction
When the hands engage in gesturing, they are generally used as a mimetic medium.
Basically all types of gestures involve a transformation from instrumental actions to
non-instrumental, communicative ones. Such a transformation is based on gestural
mimesis, and calls for non-trivial cognitive capacities, which makes this topic an
important issue of cognitive semiotic reflections on (bodily) mimesis with regard
to a phylogenetic and ontogenetic development of gestures and language (Andrén
2010; Donald 1991, 1998, 2012; Zlatev 2008a,b, 2014a,b). Miming actions motivates
and grounds the meaning of referential, pragmatic and even pointing gestures and
involves iconicity and indexicality as cognitive-semiotic processes. Gestural mimesis
(in the sense of schematized enactments of bodily actions) extends Zlatev’s concept
of mimetic schemas (Zlatev 2005, 2007b, 2014a) to an experience-based cognitive
semiotics of gestural meaning. While Zlatev’s notion of mimetic schemas addresses
the ontogenetic development of iconic gestures, the concept of gestural mimesis
concerns the semiotic motivation of hand-gestures more generally.
Note, as an important aside, that Zlatev’s use of the term iconic gestures is not
equivalent with the McNeillian (McNeill 1992) notion of iconic gestures. The latter
is quite misleading, since it suggests that only gestures with a concrete referent
are iconic, whereas metaphoric gestures, e.g., gestures referring to abstract actions
and entities, are not. Furthermore, it implies that pragmatic gestures such as the
precision grip or the Palm-Up-Open-Hand (Kendon 2004, chapters 12 and 13; Müller
2004) are not iconic. The term “iconic gestures” conflates a semiotic motivation with
the specific semantics or pragmatics of gestures. Therefore, in the following, the
terms concrete and abstract referential gestures are used, and they are meant to re-
place McNeill’s iconic and metaphoric gestures while at the same time maintaining
the important semantic distinction he introduced (McNeill 1992).
From a cognitive-semiotic perspective, it is fairly obvious that iconicity (and
indexicality) are motivating processes of gestures more generally. The concept of
gestural mimesis thus not only addresses concrete and abstract referential gestures,
it also concerns the historical motivation of conventionalized gestures (for instance,
the ring gesture as “sign for love” or as “precision marker”, Müller 2014c). It thus
also applies to so-called pragmatic gestures, as for instance, the Palm-Up-Open-
Hand (the Open Hand supine in Kendon’s 2004: 264 terms), which derives its core
212 Cornelia Müller
The mimetic schemas involved in the group of away gestures are as-if actions of
mundane instrumental actions: the sweeping away of water from a table, the hold-
ing away of a swinging door, the throwing away of an apple core or the brushing
aside of crumbs from a sweater. Gestural mimesis implies the enactment of mimetic
schemas and involves cognitive-semiotic processes and mimetic techniques that ex-
Systematics of Mimesis for Referential Gestures 213
plain how the transition from a full-fledged action with an object to an “acting-as-if”
is achieved. Notably, gestural mimesis is not restricted to the mimetic enactment
of actions as mimetic schemas of iconic gestures, but as semiotic techniques of
creating abstract and also pragmatic meanings. Gestural mimesis as a capacity and
technique could also be argued to underly the levels of semiotic complexity Andrén
postulated for the ontogenetic development of gestures (Andrén 2010; Zlatev 2014a).
According to Andrén and Zlatev for a bodily action to count as gesture, Level 3 on
either communicative explicitness (#C3), semiotic complexity (#S3) or both – needs
to be reached, as shown in Table 2.
Table 2. A
ndrén’s developmental levels from action to gesture in social interaction (repro-
duced with permission from Andrén 2010: 68).
While gestural mimesis semiotically motivates all types of gestures, this chapter
concentrates on a presentation of the concept of gestural mimesis based on the
Aristotelian understanding of mimesis in the arts. It offers an illustration of how
mimesis motivates concrete as well as abstract referential gestures. It is argued that
the meaning of concrete and abstract referential gestures is experientially grounded
in different forms of bodily mimesis. In doing so, an important touchstone for an
experience-based cognitive semiotics of referential gestures is provided.
By offering a systematics of the experiential grounds of gestural mimesis for con-
crete and abstract referential gestures, the present chapter also aims to contribute to
some of the current evolutionary work on bodily mimesis. It appears that using the
hands not only for instrumental but also for communicative purposes demarcates
one of the most significant steps not only in ontogenesis but also in phylogenesis of
language (Arbib 2005a, 2008a, 2012; Corballis 2002, 2013; Donald, 1991, 1998, 2012;
Kendon, 2009; McNeill 2015, Zlatev 2008a,b, 2014b). Even our closest relatives, the
nonhuman primates, show a (restricted) capacity to use their hands referentially
(Liebal, Call and Tomasello 2004; Liebal, Müller, Pika 2007; Pika and Liebal 2012;
Müller 2014a).
214 Cornelia Müller
The capacity to create and use gestures develops in children along with their
cognitive, social and linguistic development (Andrén 2010; Volterra and Erting 1990;
Goldin-Meadow 2003; McNeill 1992; Zlatev 2007b, 2014a; Zlatev and Andrén 2009).
Zlatev provides a wealth of arguments for a fundamental role of bodily mime-
sis in children’s language development as well as in language evolution (Zlatev
2014b: 202): “[…] human nature – characterized by a consciousness that is uniquely
social and representational – rests on a specific pre-linguistic adaptation: bodily
mimesis (Donald 1991, 1998, 2012; Zlatev 2005, 2007a,b, 2008a,b, 2014b).” Zlatev has
put forward a definition of bodily mimesis, which implies a developmental hierarchy
from proto-mimesis, to dyadic and triadic mimesis, leading into the post-mimetic
stages in which language proper takes over. See Table 3.
Table 3. Z
latev’s Mimesis Hierarchy, with incremental features and corresponding cognitive-
communicative skills (reproduced with permission from Zlatev 2014b: 207)
tial gestures, and how mimesis governs the transformation of instrumental hand
movements into referential gestures.
It is the human capacity for mimesis that makes this transformation possible and
the Aristotelian concept of mimesis as conditio sine qua non of human nature sheds
light on the semiotic and cognitive principles governing gestural mimesis (Müller
2010a). When the hands are used for communicative purposes, i.e. when actions be-
come gestures, mimesis is what makes this happen. We may use our hands (and other
body parts too), to communicate about concrete as well as abstract actions and entities
in the world and this is what concrete and abstract referential gestures are about.
The mimetic processes that motivate gestures as much as the signs of signed
languages1 (etymologically) involve abbreviation, stylization and schematization.
Thus there is a significant formal difference between the instrumental and mimed
action, something that Goffman and Bateson have identified as a core characteristic
distinguishing play and fight in apes (Goffman 1980: 55; Müller and Haferland 1997).
Cohen and colleagues have described abbreviation as an inherent characteristic
of an iconic sign in a signed language: “To indicate an action the signer generally
performs an abbreviated imitation of a characteristic part of it: write, for example,
is signed by making such an imitative movement” (Cohen et al. 1977: 17). Now,
whereas in a signed language forms of imitation used to refer for instance to a
specific action will be limited by convention, (non-conventionalized) gestures allow
for variation in how to imitate one and the same action. Writing, for instance, might
be “gestured” by imitating the holding of a pen and moving in a horizontal plane
from left to right, or by representing the pen with an extended index finger and
moving rightwards (in Western European cultures).
Gestural meaning creation appears to go hand in hand with processes of abstrac-
tion or schematization and this is what differentiates a mimetic schema from a full-
fledged action. But gestural mimesis also involves perspective taking and thus can be
considered a form of embodied conceptualization. Mimetic forms thus come to be the
embodied touchstone of gestural meaning. Note that meaning is understood here in
cognitive-linguistic terms: as conceptual structure and as a form of conceptualizing
the world (Langacker 1987, 1991, 1998, 1999, 2008), which – notably – also includes
pragmatic meaning. In the following, a systematics of gestural mimesis will be pres-
ented, which is based on Aristotle’s concept of mimesis as a fundamental human
capacity. Applying Aristotle’s work on mimesis and poetics to gestural mimesis pro-
vides the framework for a cognitive-semiotic take on an embodied mimetic semantics
of gestures referring to concrete and abstract actions and entities.
1 Although for a long time it was not “politically correct” to make this point, in fact,
researchers of signed languages have over the years come to the conclusion that
mimesis, here dealt with in terms of iconicity, plays a core role in the motivation
of signs (Wundt 1921; Battison 1974; Cohen, Namir, Schlesinger 1977; Mandel 1977;
Kendon 1980a,b, 1986, 1988; Taub 2001; Wilcox 2004). Especially, in classifier predi-
cates, iconicity is seen as a major motivating force of their meanings (Cogill-Koez
2000; Emmorey 2003; Perniss 2007, see also Müller 2009).
216 Cornelia Müller
2. M
imesis and poetics: Aristotle’s systematics of
different forms of mimesis
The concept of mimesis has a long-standing tradition in Western philosophical
reflection upon the arts. In maintaining critical distance from his teacher Plato,
Aristotle attributes mimesis a core role in the arts and considers it as a fundamental
anthropological condition. The capacity for, and the pleasure of, mimesis distin-
guishes the human species from all other species, according to him. Human beings
are a miming species and, importantly, they enjoy the products of (good) mimesis.
Watching a tragedy is thus not only a cognitive but first of all an affective enterprise.
Aristotle discusses the question of mimesis prominently in his Poetics (Aristotle
1968) and uses it to discriminate different artistic and literary genres. A variable
use of media, objects and modes leads to different mimetic forms, e.g. to different
genres and more generally to different forms of art. Table 4 shows an overview of
Aristotle’s forms of mimesis.
Three fundamental aspects distinguish the different forms of mimesis: media, ob-
jects and the modes of mimesis. Media concern the means by which something is
being mimed: poetic language, rhythm, melody, music and dance. Mimetic objects
in an Aristotelian sense (and thus objects of arts) are the ethical and unethical
actions of human beings. The modes of mimesis explicate how mimesis is achieved.
The mimetic modes are critical for Aristotle’s distinction of tragedy and epos.
While epos (the “older” literary form) is characterized by a narrating mode (nar-
rator describes actions and events), tragedy follows an acting mode of mimesis:
actions are performed rather than described. The objects of mimesis in tragedy
are ethically valuable actions and the mimetic mode of tragedy is the depiction
of an action by action not by epic narration. Moreover, tragedy is characterized
by poetic language, temporal rhythm, and prosody or melody. Although Aristotle
discusses mimesis in the Poetics primarily in the context of poetic language and
particularly to distinguish tragedy from epos, it is aimed to apply to all mimetic
arts, including music and dance.
If Aristotle’s analysis is true and mimesis is indeed a basic human capacity,
it might help to shed some light on one of the probably most fundamental mi-
metic practices of human beings, namely, gestural mimesis. As mentioned above,
Systematics of Mimesis for Referential Gestures 217
precursors of gestural mimesis are found in the great apes (Müller 2014a, Zlatev
2008a,b), but only when it comes to miming very basic actions like giving or hit-
ting. The human capacity to use instruments in dealing with their environment in
conjunction with an upright posture has led to an explosion of manual practices
(Leroi-Gourhan 1980; Donald 1991, 1998, 2012) and it is these practices in the first
place that constitute an immense base for the emergence of gestures. Prototypical
gestures are hand-movements that mime actions and objects. In the following, Aris-
totle’s systematics is used as inspiration for arriving at a closer understanding of
gestural forms of mimesis.
3. F
orms of mimesis in gestures: The Aristotelian
systematics as framework for an embodied mimetic
semantics
Taking Aristotle’s concept of mimesis as a starting point for a systematics of
referential gestures offers a possibility to clarify the experiential grounds of ges-
tural meaning. It can even be considered a framework for an embodied mimetic
semantics of concrete and abstract referential gestures. Drawing upon Aristotle’s
concept of mimesis and its role for the arts, three aspects of mimesis can be
distinguished for gestural mimesis as well: the material, the objects of mimesis
and the mimetic modes (see Table 3). The material of gestures is the different
bodily articulators: fingers, hands, arms, shoulders, head, face, eyes, lips, trunk,
legs, feet etc.; the objects of gestural mimesis concern actions and entities that
a gesture may be used to refer to; the mimetic modes regard the techniques of
gesture creation.
The objects of gestural mimesis fall into six semantic subcategories. When
entities are the objects of gestural mimesis, gestures can depict: Entities as
entities, Properties of entities, Movement and localization of entities.
When, on the other hand, the objects of gestural mimesis are actions, gestures
may depict: Actions as actions, Aspects of actions, Actions with objects.
There are two basic techniques by which gestural mimesis is achieved in hand
gestures: acting or representing (Müller 2014a). In the acting mode the hands
enact manual actions and movements of the forelimbs; in the representing mode
the hands present something other than themselves. By acting as-if opening a
window, turning a car key or by moulding the roundness of an event, the hands
become gestures based on the acting mode. By taking on the shape of flat, round,
or hollow objects, they act as-if they were manual sculptures, often sculptures
in motion.2 In the following, it is illustrated how the meaning of concrete and
abstract referential gestures is grounded in the media, the objects and the modes
of gestural mimesis.
2 Zlatev’s notion of mimetic schemas would only apply to gestural mimesis based on
the acting mode. The representating mode primarily uses visual mimesis.
218 Cornelia Müller
Table 5. Th
e Aristotelian systematics as a framework for an embodied mimetic semantics
of referential gestures
kind of manual sculpture of it – while his head is bent forwards, his gaze is oriented
towards the hand. In Figure 1c we see somebody acting as-if his two spread fingers
were a pair of scissors (a manual sculpture in motion). This gesture includes the
hands and the forearm. Unlike in Figure 1b, gaze towards interlocutor is not part
of the semantics of this gesture but rather serves the pragmatic goal of addressing
the recipient of a multimodal utterance. Figure 1d depicts yet a different case of a
mimetic bodily medium: here the speaker’s right leg is the foregrounded articulatory
body-part. The dancer’s leg metaphorically presents a heavy anchor chain. This is a
metaphoric body gesture (Müller and Ladewig 2013) as a heavy leg is a pre-requisite
for the specific style of “walking” in Argentine Tango. It ensures stability and bal-
ance while having most of the weight on the other leg. Argentine Tango is based
on this specific walking technique and students have lots of difficulties learning
this. Now, the dance instructor uses the anchor-chain metaphor to explain how to
achieve this balance: the body of the dancer becomes a ship on the ocean and the
free leg is a heavy anchor chain pending into the water. The anchor chain becomes a
metaphor for stability in dancing. The full metaphor can be paraphrased as: “Balance
(in Argentine Tango) is feeling stable like a ship with a pending anchor chain”. The
hands on his leg have an indexing function, together with the gaze direction, they
direct the interlocutors’ gaze. The articulators involved in this gesture are: the leg,
the arms, head and gaze, maybe also the bent upper trunk area.3
3 For a detailed analysis of this example, see Müller and Ladewig (2013).
220 Cornelia Müller
can depict concrete as well as abstract mimetic objects. So, for instance, the two
women in Figure 2 gesturally describe the crashing of something against a wall. In
(Figure 2a) it is a car crash, and in (Figure 2b) a metaphorical “crash” of an effort
to solve something. In both cases, two entities are depicted: one hand represents
a dynamically moving entity (movement of entity) crashing into another one,
depicted by the other hand, representing a static entity (location of entity). The
speakers establish a manual scenario with one static entity (location of entity),
the wall in (Figure 2a), an abstract effort as movement of entity in (Figure 2b),
and a moving car also as movement of entity.
Figure 2. G
estures referring to movement and location of entity: driving something
against a wall, in (a) a literal, and (b) a figurative sense
When looking at the vivid gestural depiction of the literal and figurative events, one
could be tempted to assume that the event as a whole was presented. However, each
gestural depiction comes with a very specific perspective on the action. For example,
the gesture in Figure (2a) does not show how somebody held a steering wheel and
how her body was thrown back by the crash. The same holds for the metaphoric
gesture in Figure (2b): we do not see how the whole body was affected by an im-
agined crash. Rather the crash is conceptualized from a bird’s eye view, an observer
viewpoint, not from a character viewpoint (McNeill 1992; Stec and Sweetser 2013;
Sweetser 2012). In both cases, a flat laterally oriented hand represents a wall while
the other hand represents in a rather schematic way the energetic movement of an
object crashing into another one. In these gestures the speakers display their specific
view on the world; thus they can be characterized as embodied conceptualizations.
Figure 3 shows two examples of gestures referring to properties of entities.
The two women talk about the roundness of a bench in Figure 3a and of some ab-
stract issue in Figure 3b. In both cases, the hands mould an ephemeral round shape.
An interesting difference between the concrete use and the figurative use of the ges-
ture is the gesture space. In Figure 3a the location of the gesture depicts the location
Systematics of Mimesis for Referential Gestures 221
of the round object (the bench), while in Figure 3b the location of the gesture is not
semantically loaded, it is performed in the default gesture space. It is important to
note that in these examples the mimetic enactments are not used to mime the ac-
tion, e.g. the reference object is not an action of moulding or touching the surface
of an object, rather the mimed action functions as a technique of representation
for the shape of objects (Müller 1998a, b). Thus by acting as-if moulding a round
shape, properties are depicted and often via those canonical properties objects are
being gesturally referred to. As Mittelberg’s work on metonymy in gestures has
pointed out, those semiotic processes involve internal and external metonymy in
a Jakobsonian sense (Mittelberg 2014; Mittelberg and Evola 2014; Mittelberg and
Waugh 2009, 2014). Here internal metonymy (the as-if grasping hand) as well as
external metonymy (the indexing of an external object by as-if touching its surface)
are critical aspects of a semiotic chain.
Figure 3. Gestures depicting Properties of entities: the roundness of (a) a bench (e.g.)
literally and (b) to the “roundness” of an abstract issue e.g. figuratively
To sum up, objects of gestural mimesis can be concrete as well as abstract actions
and entities. Note that metaphorical reference and abstract reference are used syn-
onymically here, although, strictly speaking metaphor may be very concrete – as in
the case of the anchor chain for heavy tango leg metaphor. However, imagining
the leg as an anchor chain is certainly a context of use that is more abstract (or less
basic than how the Pragglejazz Metaphor Identification Procedure (MIP) would put
it, Pragglejaz-group, 2007) than referring to the heaviness of an anchor chain in the
context of a conversation about sailing ships.
tragedy. The narrating mode of mimesis (actions and events are described by a narrator)
characterizes epos, the acting mode tragedy. Notably, this opposition of perspective tak-
ing that the difference between performing actions and narrating actions is also present
in the two basic modes of gestural mimesis: acting and representing. In the acting mode,
the hands enact actions and movements of the bodily articulators. In the representing
mode, the body depicts something other than itself, it become a manual “sculpture”
(See Table 5). This distinction very often goes along with a specific viewpoint: most of
the acting gestures come with a character viewpoint, most of the representing gestures
with an observer viewpoint (McNeill 1992; Stec and Sweetser 2013, Sweetser, 2012).
In earlier accounts, I have termed the mimetic modes gestural modes of repre-
sentation and described them as manual techniques of depiction and as motivating
forces for the iconicity of gestures (Müller 1998a, b, 2014a). At the time four modes
of representation were distinguished: acting, moulding, drawing and representing.
In the acting mode, the hands act as-if performing an everyday action (acting as-if
opening a window). In the moulding mode, the hands act as-if touching a surface
(as-if moulding the round shape of a picture frame). In the drawing mode the hands
act as-if sketching lines and contours (as-if sketching the contour of a picture frame).
In the representing mode, the hands represent an object (as-if the hands were a
sculpture of a picture, an open book, etc.).
Arguably (Müller, 2009, 2010b, 2014a) three of the four modes of representation
are actually based on manual actions: the hands act, the hands mould, and the hands
draw. The difference between the three is not only that they are different types of
manual actions. While the acting mode captures enactments of everyday actions
of all kinds, mostly involving everyday instruments, such as a window handle or
a pitcher, moulding gestures are enactments of hands touching surfaces, which is
why they very often depict shapes of objects. Thus their mimetic object is not the
action, but the ephemeral shape of an object. In drawing gestures the extended index
is used as-if tracing in sand or mud. Drawing gestures are used to depict shapes of
objects too, but often they also outline “object” lines, e.g. the course of a river or the
winding path in a garden. Despite those semantic differences, these are all actions of
the hands and their meaning is grounded in those schematic actions, corresponding
to Zlatev’s mimetic schemas, which is why they are here grouped together under
the category of the acting mode of gestural mimesis.4
4 The term representation is used in the sense of Karl Bühler’s concept of Darstellung
(‘depiction’) (Bühler, 1934; Müller 2013, 2014a). As the English translation of Bühler’s
work uses the term “representation”, I have applied the same term. However, it is im-
portant to be aware that the German word does not involve the prefix re- that comes
with “re-presentation”. Bühler’s theory of representation, thus does not imply a re-
presentation of an objective outside world. His concept of Darstellung (depiction) is
free of any such connotations. So are my earlier accounts of gestural modes of repre-
sentation. By characterizing them as techniques of constructing conceptualizations of
perceived and conceived events in the world, the modes of representation open a door
to systematically re-construct the experiential grounds of a given gesture’s meaning.
Systematics of Mimesis for Referential Gestures 223
Both the acting and the representing mode can be used to refer to concrete as
well as to abstract actions and entities, as shown in Figure 4. The acting mode of
mimesis may be used to create a gesture referring to a literal (Figure 4a) and to a
figurative sense (Figure 4b). In Figure 4a the woman is grasping a window handle, in
4b the man grasps imagined houses and moves them from one city to another one.
Figure 4. Th
e acting mode of gestural mimesis used to create gestures that refer to (a) a
literal opening a window and (b) a figurative sense of moving houses
The examples in Figure 5 show how the hands become manual sculptures, and
represent something other than themselves. In Figure 5a the right hand represents
an opened book, in Figure 5b the left hand becomes the iron curtain which used to
separate the East and the West during the Cold War.
Figure 5. Th
e representing mode of gestural mimesis used to create gestures that refer
to (a) a literal (an open book) and (b) a metaphorical sense (the iron curtain
dividing East and West)
Table 6. Th
e relation between mimetic modes (acting and representing) and mimetic objects
(actions and entities) is flexible.
Mimetic Modes
Mimetic Objects Acting Representing
Actions Ex.: “opening window” Ex.: “driving car against the wall”
Ex.: “moving and placing Ex.: “driving enterprise ‘against
houses” the wall’ ” (figurative sense)
Entities Ex.: “roundness of bench” Ex.: “driving car against the wall”
Ex.: “roundness of abstract Ex.: “driving enterprise ‘against
issue” (figurative sense) the wall’ ”
Ex.: “moving and placing Ex.: “localizing open book”
houses” Ex.: “localizing and moving iron
curtain” (figurative sense)
Some actions that speakers refer to gesturally, such as, for instance, the driving
of a car against a wall, can apparently be depicted in both modes. Depending on
the chosen mode, they construct the event from either an observer or a character
viewpoint in the sense of McNeill (1992). Gestures performed from a character
viewpoint take the perspective of the actor, an example would be somebody acting
as-if holding a stearing-wheel. A gesture made from an observer viewpoint takes
a bird’s eye view on an event. So, for instance, in the driving of something against
the wall contexts described above, both the concrete and the abstract contexts were
gestured from an observer viewpoint.
A consequence of this independence of mimetic mode and mimetic object is that
the specific combination of the two creates a particular perspectivation, a specific
form of gestural conceptualization. Thus, the modes of gestural mimesis always
involve perspective taking on mimetic objects, be they figurative or literal. The dif-
ferent forms of gestural mimesis and the different possible combinations of media
and mode provide variable experiential grounds for the different mimetic objects,
e.g. for creating gestural meaning in concrete and abstract referential gestures.
Systematics of Mimesis for Referential Gestures 225
4. C
onclusion: Mimesis grounds gestural meaning in
experience
Aristotle holds that human beings are a mimetic species. In this chapter it has been
argued that this capacity for mimesis appears not only relevant in the evolution of
the human species, but is present and observable in the here and now of gestures in
use. The capacity for mimesis provides the experiential grounds for the creation and the
understanding of gestures. The different forms of mimesis presented in this chapter
offer variable semiotic resources for the emergence of meaning, in particular with
respect to the meanings of concrete and abstract referential gestures. This experi-
ential motivation of referential gestures opens up the path to an experience-based
cognitive semiotics of gestures, which could also be accounted for as an embodied
mimetic semantics of gestures. It lends support to Zlatev’s reflections upon mimetic
schemas in developmental contexts and indicates that a close analysis of the types
and processes of gestural mimesis might enlighten reflections upon the ontogenetic
and phylogenetic development of gestures and language. It also underlines Zlatev’s
hestiation concerning the explanatory role of image schemas in evolution and devel-
opment, as these are highly abstract and might be more important when it comes to
explaining more abstract linguistic meanings. What we see in gestures is very often
a semiotically dense enactment of motor patterns that the notion of mimetic schema
appears to account more appropriately for than the notion of image schematic
structures. However, this highly pertinent issue needs further and deeper discussion
from a gesture analysis point of view (cf., Mittelberg 2010).
From the point of view of methods for gesture analysis, reconstructing the con-
cept of gestural mimesis offers a key to a differentiated understanding of gestural
meaning construction. It does so because it avoids the pitfalls of projecting linguistic
categories onto gestural meaning as well as the contextual reading meaning into a
gesture. On the contrary, starting out from a close description of the mimetic pro-
cesses involved in a gestural movement proves to be an intersubjectively account-
able descriptive methodology for forms of gestural meaning construction. This is the
basic idea of a linguistic and form-based gesture analysis that can be summarized
as follows (see for more detail, Bressem, Ladewig, Müller 2013).
Starting from a close account of the gestural form, a gesture analyst may ask,
what are the gestural hands actually doing? What mimetic mode is employed and
what are the ephemeral shapes, movements, objects that are created, to arrive at
a first account of a form-based meaning. Take for instance someone enacting the
holding of a steering-wheel. It is possible to describe this dimension of the gestural
meaning without knowing its context. If then, in a second methodological step, the
context (semantic, pragmatic, syntactic, sequential) is taken into consideration, it
is possible to further specify this basic meaning of the gesture. For instance: as an
answer to a question how are you getting home? By driving! Or it could be a part
of a story: we were driving all night long to LA. Thus in gesture analysis there are
at least two steps from form to meaning in gestures: the first step spells out the
motivation of the gesture, the second one explicates the contextualized meaning of
226 Cornelia Müller
it. On the side of the speaker/hearer, these relate to two different cognitive-semiotic
processes: (a) one that ensures sign formation (motivation) and (b) one that specifies
local meaning (semantics and pragmatics) (Mittelberg and Waugh 2009, 2014; Müller
2004, 2010b).
Departing from an account of the particular type of mimetic mode involved
in the creation of a gesture provides a form-based, descriptive ground for a thick
description of the gestural meaning in a particular context. It therefore offers inter-
subjectively accountable descriptions of the particular form of conceptualization
involved in the creation of everyday gestures, be they concrete and tangible or
abstract ideas. Gestural mimesis opens a path towards an understanding of the
embodied roots of gestures referring both to concrete and abstract objects and
actions in the world and it also may shed further light on the specific processes
involved in the evolution as well as in the ontogenetic development of languages,
be they signed or spoken.
Acknowledgments
I would like to thank the artist Mathias Roloff for all the drawings in this chapter
(www.mathiasroloff.de). I am very grateful for the extremely insightful comments
and suggestions by the reviewers of this chapter. Moreover, this research could not
have been conducted without the generous grant of the Volkswagen Foundation:
“Towards a grammar of gesture. Evolution, brain and linguistic structures” (www.
togog.org) and the many wonderful people that have contributed to this research
enterprise. I’d like to mention specifically Silva Ladewig and Jana Bressem, Sedinha
Teßendorf and Susanne Tag, but also my interdisciplinary governing group Ellen
Fricke (Semiotics), Hedda Lausberg (Neurocognition) and Katja Liebal (evolutionary
anthropology). This research has been supported by the Russian Science Foundation
grant #14–48-00067.
Grigory Kreydlin & Lidia Khesed
Chapter 14
Verbal and Nonverbal Markers of
Impolite Behavior in Russian Language and
Non-Verbal Code
1. Introduction
In the present chapter we analyze and describe various aspects of impoliteness –
a linguistically and semiotically marked1 category that embraces some discourse
strategies that cause disharmony in people’s interaction and demonstrate a breach
of the existing norms of social behavior. Our aim is to discuss some cognitive and
linguistic properties of two classes of Russian lexical sign units. The emphasis is
placed on the features encapsulated not only in some Russian language signs (words
and word combinations) but also in some lexical signs of the Russian nonverbal
corporal semiotic code. By this we mean “gestures” in the wide sense of the word,
including meaningful movements of the head, shoulders, hands or feet, sign pos-
tures, facial expressions, meaningful bodily movements, glances and other types of
corporal signs. The model and mode of description chosen clarifies, expands and
elaborates some relevant aspects of more general models of impolite communicative
behavior presented in the linguistic and semiotic literature (Brown and Levinson
1987; Nikolaeva 1990; Post 1996; Wierzbicka 1999b; Kastler 2004; Bousfield 2008;
Сulpeper 2009; Rathmayr 2009).
In most previous research the category of impoliteness has been studied either
together with or in opposition to the category of politeness. It is only in the last
few years that several monographs and a small amount of articles have appeared
in which the category of impoliteness is regarded as a separate category.2 However,
many cognitive and semiotic aspects related to impoliteness and its representation
in language and gestures remain unexamined. We mean here the types of knowledge
displayed through the syntax and semantics of verbal and non-verbal markers of
the category of impoliteness, as well as the exploration and description of general
mechanisms and specific instruments of human interaction in impolite dialogues.
2. Methodology
Our exploratory approach differs in some important ways from previous ones. In
particular, we regard impoliteness as an independent category and not a ‘negative
pole’ within the category of politeness. For example, we show that to not be polite
does not automatically mean “to be impolite” and vice versa. The category of impo-
liteness is also explored through its representations in both verbal and nonverbal
codes, emphasizing the connections between verbal and nonverbal markers of
impoliteness.
The structure and semantic metalanguage of the explications that are presented
in the chapter follow the principles and methodology developed within the Mos-
cow Semantic School and used by Russian non-verbal semioticians as well. The
approach for analyzing language signs applied by the Moscow Semantic School
was developed by a number of Russian linguists including J. D. Apresyan, I. A.
Melchuk, A. K. Zholkovsky and others. The core principles of that approach are
an integral description of language and the systematic lexicographic treatment of
kindred linguistic phenomena. According to these statements, the full description
of the language signs includes not only an analysis of their meaning, but also an
3 The first term belongs to Efron (1941/1972), the second one – to Kendon (2004). The
two are not synonymous, as not all co-speech gestures are emblematic (cf. McNeill
1992).
4 Here we refer to the explanatory Russian language dictionaries including Dal’ (1867,
quotes from 1994 ed.); BAD (1951–1965); Ushakov (1940, quotes from 2000 ed.);
Ozhegov (1949, quotes from 2007 ed.); SAD (1983); Kuznetsov (1998); NEDRS (2004);
and those of gestures, such as DLRG (2001); Armstrong, Wagner (2007) and Akishina,
Kano (2010).
Impolite Behavior in Russian Language and Non-Verbal Code 229
analysis of their usage and formal structure. This description is based on a unified
model and aims to reflect the naïve world view of the native speaker. The meta-
language for the semantic description of the language signs developed at the Mos-
cow Semantic School, corresponds in some respects to Anna Wierzbicka’s theory
of semantic primitives (1999a, 1999b), but differs in the following aspects: it is eth-
nically specific rather than universal and includes signs which reflect the additional
semantic components that lie beyond the primitives. This meta-language is used
within this chapter while categorizing the impoliteness within the sentential form
X is impolite to Y in the situation Z.5
In 2001, G. E. Kreydlin, S. A. Grigorieva and N. V. Grigoriev published the Dic-
tionary of Russian Gestures where they demonstrated that the same model can be
used to describe non-verbal sign units. Each vocabulary entry includes the following
attributes:
• physical shape
• the active and passive somatic object in performing the gesture
• meaning
• conditions of usage
• cultural and historical context
• alternative nominations (including phraseology)
• text illustrations and
• images
The same multimodal and multidisciplinary approach to the study of Russian ges-
tures was presented in the monograph Non-verbal Semiotics (2002), by G. E. Kreydlin.
We use it in the present chapter as well.
The categorization of impoliteness in Russian language and culture is given
through a detailed analysis of its verbal and nonverbal markers and the links be-
tween them. This is in line with the general approach of cognitive semiotics, under-
5 The principles of the Moscow Semantic School were introduced through a number
of works such as Explanatory and Combinatorial Dictionary, by I. A. Melchuk and
A. K. Zholkovsky (1984), Lexical Semantics by Y. D. Apresyan (1995), New Explanatory
Dictionary of Russian Synonyms, ed. by Y.D. Apresyan (NEDRS, 2004), etc. These
principles also formed the basis for the National Corpus of Russian Language
(further – NCRL). Data from NCLR is actively used in this chapter as the main
source of material for semantic analysis. The semantic description methodology for
language signs developed by the Moscow Semantic School was inherited and suc-
cessfully adapted to the needs of Russian non-verbal semiotics. By this we mean the
tradition, methodology and meta-language developed by a group of Russian linguists
specializing in Russian gestures (including T. Akishina, N. Grigoriev, S. Grigorieva,
L. Iordanskaya, G. Kreydlin, T. Krylova, T. Larina, S. Paperno: Tumarkin, G. Zytseva
and others). Their work continues the tradition of study on non-verbal semiotics
developed through the work of specialists like G. Calbris, M. Johnson, A. Kendon,
A. Mehrabian, D. Morris, I. Poggi and A. Sheflen.
230 Grigory Kreydlin & Lidia Khesed
3. Th
e category of politeness vs. the category of
impoliteness: common and distinctive features
Before explicating the semantics and syntax of the adjectives considered we shall
explicate the common and distinctive features of the categories of politeness and
impoliteness and how they are represented in Russian language and gestures. The
common features of the categories are context sensitivity, estimative character of
polite and impolite sign units and the coordination of these signs with the norms of
the social behavior and etiquette existing in Russian culture
Examples (1) and (2) illustrate how the context sensitivity influences the es-
timation of human behavior as polite or impolite: In both cases the addressee of
the letter stays silent.
(1) В ответ на оскорбительные письма Пётр вежливо промолчал.
(V otvét na oskorbítel’noje pis’mó Petr vézhlivo promolchál).
‘In reply to the offensive letter Peter politely remained silent’.
(2) Я получила от Вас четыре письма. <…> Получив пятое, я поняла, что про-
молчать будет невежливо.
(Yá poluchíla ot vás chetýre pis’má. Poluchív p’átoje, ja pon’alá, chto promolchát’
búdet nevézhlivo).
‘I received four letters from you. After I received the fifth, I thought that it would
be impolite to remain silent’ (Stolitsa journal, 1997).
Impolite Behavior in Russian Language and Non-Verbal Code 231
By the estimative character of polite and impolite sign units we imply that one’s actual
behavior is often estimated as polite or impolite, and a person who is the subject
of the estimation is either involved in the actual communicative situation or not.
The semantics and usage of polite and impolite lexical signs must somehow reflect
these peculiarities of estimation.
In (3), the estimation of the actual behavior is given by the addressee who is
involved in the communication, and in (4) the estimation is given by the observer.
(3) Вы – тот самый кот, что садились в трамвай? – Я, – подтвердил польщён-
ный кот и добавил: – Приятно слышать, что вы так вежливо обращаетесь
с котом.
(Vý – tót sámyj kót, chto sadílis’ v tramway? – Yá, – podtverdíl pol’shchónnyj
kót I dobávil: – Priyátno slýshat’, chto vý tak vézhlivo obrasháetes’ s kotóm).
‘But, sir, <…> are you that same cat, sir, who got on the tram? – I am, – the flattered
cat confirmed and added: – It’s pleasing to hear you address a cat so politely’
(M. Bulgakov, The Master and Margarita).
(4) Если бы сейчас была дискуссия, – начала женщина, волнуясь и загораясь румян-
цем, – я бы доказала Петру Александровичу… – Виноват, вы не сию минуту
хотите открыть эту дискуссию? – вежливо спросил Филипп Филиппович
(Ésli by sejchás bylá diskússija, já by dokazála Petrú Alexándrovichu… – Vino-
vát, vý ne sijú minútu hotíte otkrýt’ étu diskússiju? – vézhlivo sprosíl Filípp
Filíppovich).
‘If there were a discussion now, – said the woman, flushing hotly, – I would prove
to Pyotr Alexandrovich… – I beg your pardon, but do you wish to open the dis-
cussion this minute? – inquired Philip Philipovich politely’ (M. Bulgakov, The
Heart of a Dog).
Polite and impolite types of behavior are both closely associated with the concept
of social norms. The norms of Russian social behavior and the system of Russian
etiquette have been analyzed and described in the works of Russian linguists such as
A. Bayburin and A. Toporkov, N. Formanovskaya, G. Kreydlin, M. Kronhaus, T. Kry-
lova, T. Larina, E. Morozova, R. Rathmayr and others. These norms vary according
to the cultural context, historical context, current communicative situation, etc.
If these norms are not explicitly fixed in Russian culture or, by contrast, if the
behavior of a person in an actual situation satisfies the exact and strict norms of the
communicative behavior, then Russian speakers never call the situation вежливая
‘polite’ or невежливая ‘impolite.’ For example, the behavior of small children can’t
be considered polite or impolite as they do not know the norms of social behavior.
Another example is with military communication when a soldier addresses the com-
mander. In this case, the norms are strictly fixed (even in written form), so there is
simply no option for any other type of behavior.
Speaking of the correlation between politeness, impoliteness and etiquette, we
need to mention that these categories do not stipulate each other. One can follow
232 Grigory Kreydlin & Lidia Khesed
the rules of etiquette without demonstrating any good will or sympathy towards
the addressee, and may even try to offend. We may note the Russian phrases
холодное рукопожатие ‘cold handshake,’ надменный поклон ‘arrogant bow,’ and
высокомерный кивок ‘haughty nod’ (as a welcoming gesture), referring to cat-
egories of etiquette.
Further, the differences between politeness and impoliteness are usually re-
garded as the two poles of one pragmatic discourse category called politeness. As
noted in the introduction, we regard politeness and impoliteness as two separate
categories. The reasons for this view are as follows. First, не быть вежливым (ne
být’ vézhlivym) ‘not to be polite’ does not mean ‘to be impolite’. Second, if one
cannot characterize the person’s actual behavior as impolite, it does not mean it’s
obligatory to call it polite. The situation described in (5) accounts for our position
and explicates it.
(5) Imagine a street in Moscow. An old woman is walking down it. At some distance
from her a young man is walking in the same direction. He outraces her, pushes
her and keeps walking without an apology.
The behavior of the young man is, of course, impolite. He broke a rule of etiquette,
according to which he should have apologised to the woman. But if he outraced a
woman without pushing her – just because he is younger and is walking faster, his
behavior could not be called impolite, without being polite either.
4. Th
e basic script of Russian impolite communicative
behavior
Taking into account the typical features of Russian impolite communicative be-
havior, we propose the basic script of the category of impoliteness peculiar to Rus-
sian culture. The entry of the script is the predication вести себя невежливо (vestí
sebyá nevézhlivo) ‘to be impolite’ that is part of the sentential form X is impolite to
Y in the situation Z where variables X, Y and Z show the arguments of the predicate
to be impolite. In formulating the script below, we used the semantic metalanguage
developed in the Moscow Semantic School, as outlined in Section 2.
X is impolite to Y in the situation Z =
‘In the situation Z and other similar situations the social norms prescribe Russians to
behave in a certain way and this behavior is considered good;
in spite of this the speaker thinks that person X is behaving differently now;
the speaker deems X’s behavior to be bad’.
In the present sentential form, X is a person who renders the estimation of being
‘impolite.’ This can either be a narrator (see (6), where an author calls Nekhludoff
impolite for not replying to Kolosoff), or a speaker who is involved in the communi-
Impolite Behavior in Russian Language and Non-Verbal Code 233
cation (see (7), where Margarita considers the behavior of Nikolay Ivanovich to be
impolite as he does not reply to her).
(6) Нехлюдов, рискуя быть неучтивым, ничего не ответил Колосову и, сев за
поданный дымящийся суп, продолжал жевать.
(Nekhlúdoff, riskúja byt’ neuchtívym, nichevó ne otvétil Kólosovu i, sev za pód-
annyj dymyáshyjsya sup, prodolzhál zhevát’).
‘Nekhludoff, at the risk of being impolite, did not answer Kolosoff, and, sitting
down in front of the steaming soup, continued to eat’ (L. Tolstoy. The Awaking).
(7) Левою рукою Маргарита провела по виску, поправляя прядь волос, потом ска-
зала сердито: – Это невежливо, Николай Иванович! Всë-таки я дама, в конце
концов! Ведь это хамство не отвечать, когда с вами разговаривают!
(Lévoju rukóju Margaríta Provelá po viskú, popravlyája pryád’ volós, potóm
skazála serdíto: éto nevézhlivo, Nikoláy Ivánovich! Vsó-taki ya dáma, v kontsé
kontsóv! Ved’ eto hámstvo ne otvechát’. Kogdá s vámi razgovárivajut!).
‘Margarita passed her left hand over her temple, straightening a strand of hair,
and then said crossly: That is impolite, Nikolai Ivanovich! I’m still a woman after
all! It’s boorish not to reply when someone is talking to you’ (M. Bulgakov. The
Master and Margarita).
Y is a person to whom this estimation is addressed. It can be either an addressee
involved in the actual communication, as in (7), a third party as in (8) or the speaker
himself, as in (9), which means that X and Y could indicate the same person.
(8) Ваш сын нагрубил преподавателю английского на уроке. За что и получил
двойку.
(Vásh syn nagrubíl prepodavátelju anglíjskogo na uróke. Za chto i poluchíl dvójku)
‘Your son was impolite to his English teacher today. That’s why he got a bad
grade for the lesson’.
(9) «Неучтиво, но не могу писать. Всё равно увижусь с ней нынче», – подумал
Нехлюдов и пошел одеваться.
(Neuchtívo, no ne mogú pisát’. Vsyo ravnó uvízhus’ s nej nýnche – podúmal
Nekhlúdoff i poshól odevátsya).
‘It is impolite, but I cannot write. But I will see her today – thought Nekhludoff,
and started to dress himself’ (L. Tolstoy. The Awaking).
This script of impolite behavior can be seen as a semantic invariant of all Russian
verbal and non-verbal lexical signs belonging to the field of impoliteness. Thus, the
semantic propositions within the script constitute the nucleus of the semantics of
the words грубый (grúbyj) ‘rude,’ дерзкий (dérzkij) ‘impudent’ and хамский (háms-
kij) ‘caddish’ that demonstrate the isomorphism between the semantics of the words
234 Grigory Kreydlin & Lidia Khesed
5. Th
e semantics of the Russian words грубый, дерзкий
and хамский
According to some Russian explanatory dictionaries7 the word грубый ‘rude’ has
several meanings that have two common semantic components:
a) ‘insufficient correspondence of the object, event or situation described to the
existing norms’
b) ‘negative estimation attributed to the object, event or situation described’.
Considering the semantic and pragmatic fields of impoliteness, one should reinter-
pret the first proposition as ‘insufficient correspondence to the norms of polite
etiquette behavior (as they are determined in Russian culture and etiquette)’. For
example, Russians call a person грубый if he/she usually behaves (or is behaving)
contrary to the norms of behavior etiquette. One of the syntactic functions of the
Russian adjective грубый is that of a modifier. It modifies both nouns referring to
people and nouns referring to verbal or non-verbal acts or their features, e.g. грубое
обращение (grúboje obrashchénije) ‘rude communication’, грубые сцены (grúbyje
scény) ‘rude scenes’, грубые жесты (grúbyje zhésty) ‘rude gestures’.
The adjective дерзкий ‘impudent’ corresponds to another type of the Russian
impolite behavior. It is polysemic and in one of its meanings it is close to the
words смелый (smélyj) ‘brave’ and храбрый (hrábryj) ‘courageous’: all these words
denote positive characteristics of the person or the person’s particular actions.8 For
example, the Russian collocation дерзкое поведение can be translated into English
as ‘brave behavior’ and it denotes behavior that breaks established cultural stereo-
types. In other words, дерзкое поведение is a non-standard type of human behavior,
which may be valuated positively, as in (10).
(10) В 1990 году трое журналистов «Алтайской правды» <…> совершили весьма
дерзкий по тем временам поступок: основали собственную независимую га-
зету.
(V 1990 godú tróje zhurnalístov «Altájskoy právdy» <…> sovershíli ves’má dérzkij
po tém vremenám postúpok: osnováli sóbstvennuju nezavísimuju gazétu).
‘In 1990 three Russian journalists from “Altayskaya Pravda” were brave to found
an independent newspaper’ (www.sky24.ru).
Here the estimation дерзкий ‘brave’ ascribed to the journalists’ action is caused
by the fact that in 1990 there were no independent newspapers in Russia and
founding the new democratic paper was a brave act.
But the word дерзкий also has another meaning, which is opposed to ‘brave’,
and in this meaning it is synonymous to грубый ‘rude’ and невежливый ‘impolite’.
Its explication holds the proposition ‘violate a border between polite and impolite
types of behavior’. Such violation of borders is estimated as breaking the social
norms of politeness, as in (11). The word дерзко ‘impudently’ means that one of
the participants of the dialogue violates the social distance between himself and
the interlocutor.
(11) – Что вам надо? – <…> дерзко спросил я. – Ты, слышь, помалкивай, – ответил
тот же голос.
(– Chtó vam nádo – dérzko sprosíl yá. – Ty, slýsh, pomálkivay, – otvétil tót zhe
gólos).
– ‘What do you need – I asked impudently. – Shut up and listen – replied the
same voice’ (E. Proshkin, The Mechanics of Eternity).
This kind of violation of social norms is typical of Russian young people com-
municating with the elderly people. The Russian verb дерзить (derzít’) ‘to be
impudent’ and most collocations with the verb such as дерзить старшему
(derzít’stárshemu) ‘to be impudent with an elderly person, дерзить родителям
(derzít’ rodítelyam) ‘to be impudent with parents’, дерзить учителям (derzít’
uchitelyám) ‘to be impudent with teachers’ are examples of this sort of impolite
behavior.
As for the adjective хамский (hámskij) ‘caddish’, it has a clear etymology in
Russian. Хам ‘Ham’ is a biblical name of one of Noah’s sons, who was cursed
for disrespecting his father (Bible, Genesis 9:20). The phrase хамское поведение
(hámskoje povedénije) ‘caddish behavior’ has an evident association with the
etymology of the word and with the biblical story. Hence, the component ‘dis-
respect’ is an evident semantic focus of the word хамский. It denotes a person
who is arrogant and very rude, and whose behavior can be called offensive to
others.
Хамское поведение is perceived quite negatively and rejected. This is why
people who behave хамски ‘caddishly’ usually experience social and personal
ostracism. A person who is behaving caddishly sets his or her personality above
the personalities of others, and in this he or she humiliates them; in fact, the
person is mocking or sneering at the interlocutor and demonstrates his/her
superiority.
236 Grigory Kreydlin & Lidia Khesed
9 This class of Russian non-verbal signs was described in detail by Kreydlin (2002);
Kreydlin and Morozova (2003); Morozova (2006); Morris (2010).
Impolite Behavior in Russian Language and Non-Verbal Code 237
inactive handshake when meeting the person, the gesturer gives the addressee not
the whole hand but only the ends of the fingers. In this way, the gesturer demon-
strates his/her disrespect to the addressee and thus behaves impolitely.
Russian gestures that are impolite in general and Russian gestures that are not
impolite themselves, but the usage of which can be considered impolite, can be
illustrated with examples involving pointing. The knowledge of a situation when a
particular pointing gesture may be considered impolite constitutes a significant part
of social education in Russia. For example, Russian children are taught not to point
at the addressee when standing close to him/her. Pointing in this case is considered
impolite. The parents often reprimand the children; Не показывай пальцем – это
невежливо, lit. ‘Don’t point your finger – it is impolite’. The Russian phrase is clearly
elliptical because there is no indication what finger is used here, though one means
the index finger. Children and adults are allowed to point with a small finger <at
somebody or something> or to point with a thumb <sidewise or over the
shoulder>, and both pointing gestures are not regarded as impolite. Using an index
finger to point at a close distance is considered impolite because by performing the
gesture the person breaks the communicative distance between him/her and the
addressee without any permission, thus invading the addressee’s private space.
A second class of impolite gestures are the gestures of bosses. In some situations,
a gesture that is not impolite itself is interpreted as showing off the gesturer’s
superiority over the addressee. Such are the body sign units that we may term the
gestures of bosses. These are to sit solemnly, to sit with hands on the nape of
the neck or to sprawl in a chair among others. This mode of non-verbal behavior
is always estimated as impolite.
A third class of impolite gestures is formed by gender-marked meaningful
corporeal signs. Sometimes the behavior of a man communicating with a woman
is considered indecent, vulgar and offensive, and thus impolite. For example, a man
clapping a woman on the buttocks, hugging her or giving her a pinch –
these types of non-verbal behavior are estimated as sexual harassment. The Rus-
sian verbs expressing these types of non-verbal behavior are the words ordered
from приставать (pristavát’) approx. ‘to put the make on’, which is a weak form
of harassment, and to домогаться (domogátsa) ‘to importune’, which is a strong
form of harassment. This kind of behavior is always judged by society as highly
indecent and impolite because demonstrating sexual desire for a woman in public
is forbidden.
7. Conclusions
In the present chapter we have observed some key verbal and non-verbal mani-
festations of the category of impoliteness in contemporary Russian language and
some non-verbal signs. We described the category of impoliteness and showed that
the estimation of human behavior as polite or impolite is highly dependent on the
situations in which the particular type of behavior is displayed. The estimation is rel-
evant only for those situations where standards of behavioral etiquette are defined.
238 Grigory Kreydlin & Lidia Khesed
1. Introduction
What are the origins of symmetrical reasoning? This is a salient question. Not only is
it focal to the research introduced in this paper, it also happens to be the final question
posed to the closing plenary lecturer, rounding out a recent high-profile gathering on
cognitive semiotics.1 Before addressing the question or its response, it will be helpful
to pose (and answer) a prior question: What is symmetrical reasoning? Technically
speaking, symmetrical reasoning appears to be the inverse reciprocal blending of a sign
vehicle and a semiotic object in some process of heuristic learning relevant to the ends
of a given (human) interpretant. To illustrate, consider Helen Keller’s famous awakening
to linguistic communication. Her breakthrough was not merely a realization that the
sensation of water on her skin was related to the arbitrary hand-signals of her caregiver
(a unidirectional mapping) but, more importantly, that the caregiver’s hand-signals
could also be generalized to evoke water, even when water was not physically required
or present to the senses.2 The latter mapping is dynamic and ambidirectional, invol-
ving inverse antisymmetries, or “chiasmus” patterning, that can be formally rendered
water : signal :: signal’ : water’. In short, Keller’s famous realization is not so much
an awakening to the existence of symbols as it is an awakening to the reflexive, sym-
metrical potential of symbolic activity for modeling possible worlds.
In what follows I suggest that that such modeling activities are manifest not only
in patterned processes of speech and writing but also in explicitly symmetrical designs
found in human material cultures around the world. The discussion zeroes in on symbolic
knots or intertwining lattice designs, using findings from cultural anthropology, archae-
ology and history. Insights from Peircean semiotics are used to frame the discussion and
cognitive semiotic perspectives are applied to suggest vital links between these patterns
and the later embodied phenomenology of Maurice Merleau-Ponty. The chapter ends by
outlining a preliminary agenda for cognitive testing of related hypotheses.
1 The gathering is the First Conference of the International Association for Cognitive
Semiotics, held at Lund University 25–27 September 2014. The verbatim phrasing of
the question was less formal: i.e., “Where does symmetrical reasoning come from?”
2 As Imai’s research shows (Imai 2014, Murai et al. 2014), the former relation can be
taught by conditioning, whereas the latter cannot.
240 Jamin Pelkey
First, it will be helpful to return to the salient question introduced above, a ques-
tion posed by Jordan Zlatev to Mutsumi Imai, the capstone plenary speaker of IACS
2014. Imai’s response to the question was simply, and wisely, stated: “I don’t know.”
Indeed, it may be some time before we can speak with confidence or agreement
on the origins and nature of symmetrical reasoning and related phenomena. In the
meantime, what we do know, thanks to Imai’s ground-breaking work (Imai, 2014,
Murai et al., 2014; Imai this volume) is first that symmetrical reasoning is a key
cognitive function underlying language acquisition and second that this function,
or bias, is apparently unique to human beings. As with the peculiarities of any
species, we can also be certain that such capabilities will necessarily be related to
the body type of the organism in question since, in the words of Deely (2009: 81),
“every Umwelt is a species-specific objective world shared across species lines only
to the degree that the bodily type of the organisms involved permits.” With this in
mind, we turn to the patterns in question.
Figure 1. B
asic lattice networks featuring figure-ground (X, ◊) oppositions from widely
diverse cultures: Detail from a) Blombos red ochre incisions, c. 71,000 BCE,
South Africa (Henshilwood et al., 2009); b) Navaho rug design, c.1890,
Southwestern US (Washburn and Crowe, 1988); c) Traditional Yombe mat
weaving pattern, Lower Congo (Gerdes, 2004a).
Symmetrical Reasoning in Language and Culture 241
The lattice motif appears and reappears across cultures and millennia (see Fig-
ures 1 and 2) as one of a highly selective set of salient patterns (Lewis-Williams
and Dowson 1988; Froese et al. 2013a). According to Froese et al. (2013a), this
poses “a kind of universal selection bias” which calls out for explanation “but
which has so far remained mysterious” (2013a: 208). From a semiotic perspective,
the systematic relations shared between such designs function at the level of the
Iconic Legisign, or diagram type. Although each token (or “Iconic Sinsign”) of
the pattern varies slightly, the sheer number of analogous repetitions congeals
via habit in the memory as a gestalt or generalized icon. More importantly, given
that such patterns arise independently around the world and across millennia,
in spite of their absence in the natural world, they may be identified as Indexical
Legisigns, or unwitting “symptoms”, presenting a stubborn problem in need of
diagnosis – posing a semiotic riddle to all who will pay attention. In short, the
origins of these patterns, and human motivations for producing and reproducing
them, remain unclear.
Figure 2. O
verlapping lattice networks featuring figure-ground blends from widely
diverse cultures. Detail from a) Han Dynasty tomb carving, c.100 BCE, Sichuan,
China (Dye 1937); b) Intarsia knitting design, 17th century, Argyll, Scotland; c)
Mbukushu traditional beaded apron pattern, Botswana (Washburn and Crowe
1988); d) Bora traditional twill-plaited basket weave, Peruvian Amazon (Gerdes
2004b).
Even less clear (and less remarked upon) than the recurring crosshatching lattice
network patterns featured in Figure 1 are the reasons why lattice repetitions of X
and the rhombus geometries they incidentally produce, as “figure and ground”, are
so often brought together to overlap or blend. Once again, this can be observed in
cultural designs that span the globe, throughout history. A sampling from diverse
societies originating on four continents is illustrated in Figure 2.
While such patterns are often repeated on two dimensional surfaces such as
cloth, basketwork or windows, they are just as commonly isolated as self-enclosed
networks, featuring a single path that reverses onto itself repeatedly to produce
symmetrical gestalts or symbolic knots such as those illustrated in Figure 3.
242 Jamin Pelkey
Figure 3. Intertwining lattice networks from widely diverse cultures: a) Endless knot,
contemporary rendering of an ancient symbol from Lama Buddhism, Tibet;
b) Traditional Jokwe cosmology network traced on the ground, Angola
(Zaslavsky 1999); c) Celtic knot embellishment, contemporary rendering of
a medieval pattern found in the Book of Kells, early 9th century, Ireland;
d) Bavarian printer’s ornament, 1569, Germany.
3 Although this is the same A. M. Turing who invented the Turing machine and what
became known as the “Turing Test” (see Parthemore, this volume), these are not
directly related to the pattern description.
244 Jamin Pelkey
others, coordinate with vision in the human perception and production of visual
patterns in space.
The enigma grows further following applications of “plane pattern analysis”, the
leading contemporary method in anthropology for the study of cultural symmetries
(Washburn and Crow 1988, 2004; Washburn et al. 2010; Washburn 2011). This prac-
tice approaches such patterns in terms of bilateral (left-right) contrasts rooted in
mathematical group theory, now the predominant approach to symmetry across the
disciplines. Using the form “p” as an instance of a basic shape for illustration, a given
design can be described in terms of translation (pp), rotation (pd) and reflection (pq)
and then classified according to a core set of relationship classes, including finite
designs, one dimensional patterns, and two dimensional patterns, that are useful for
tracking cultural differences, influences, dating periods and design changes, among
other uses. Figure 3a is reproduced in Figure 4 to illustrate. The Tibetan “endless
knot” is a finite design featuring basic dihedral symmetry (class d4), which combines
both left-right reflection and rotation relative to the centre of the left-right axis.
Figure 4. Finite design involving basic dihedral symmetry (d4, reflection plus rotation)
4 According to Hon and Goldstein (2008) the concept finds its first full articulation
in the work of Adrien-Marie Legendre in 1794. The important distinction to grasp
here is the difference between tacit and explicit conceptual awareness. Although it
is abundantly clear that bilateral symmetries and laterlized antisymmetries have
played a profound role in shaping human cognition and cultural evolution for more
than one million years, our ability to analyze and discuss such phenomena directly
is the result of a very recent conceptual revolution.
5 This reversal has broad implications, the most troubling of which are highlighted in
the work of neuropsychiatrist and cultural historian, Iain McGilchrist (2009).
246 Jamin Pelkey
robust foundations for on-going critical inquiry into the origins and significance of
these patterns, and (2) to begin testing for cognitive benefits that may result from
physically re-tracing visual exemplars of these patterns drawn from traditional
cultures around the world. Progress toward both objectives should help explain
why such designs emerge independently across cultures and millennia, in spite of
their absence in the natural world.
Cognitive testing is needed for the exploration of meaningful benefits that may
result from manually or visually retracing intertwining lattice patterns, including
possibilities for enhanced fluid cognition, analogic/linguistic pattern solving, gen-
eral attitudes toward ambiguity and paradox, and potential increases in symmetrical
reasoning bias. Should cognitive testing prove conclusive in any of these regards,
such results would not only help explain the grand riddle outlined above but might
also provide an additional tool for mindfulness-based therapies.
Figure 5. W
eb app prototype based on Figure 3d: Screen capture taken midway through
tracing cycle
Tracing exercises are to be carried out via a digital interface, using the individual
patterns listed in Figure 3 rendered as vector graphics, using an interactive web
app, as shown in Figure 5. The first four prototype designs are now complete. The
app is cursor driven and multi-layered, allowing interested individuals to retrace
the geometric design by hand (using a touchscreen interface) or visually (using
eye-tracking software). The pattern display is programmed to change in-sync with
the motion of the user’s finger, stylus or eye movement as the pattern unfolds
248 Jamin Pelkey
during the act of tracing. The current design concept is programmed to transition
from a light grey line through successive shades of grey following each layer of
tracing, culminating in a black line. The app is designed to be integrated with an
information-rich website providing further background on both the origins of the
individual patterns and the broader project to which they belong. The relative effec-
tiveness of each design, in terms of potential therapeutic and cognitive benefits, is
to be tested against the others, and against various controls, in a series of cognitive
trials. The results are expected to facilitate both objectives of the research outlined
above, and in the process to potentially help to explain some of the reasons for the
independent persistence of such patterns across cultures.
Cognitive training breakthroughs are often hailed with unmitigated enthusiasm
(see e.g., Doige 2007); but, as others are now reporting (e.g., Rabipour and Raz 2012;
Harrison et al. 2013) cognitive training exercises have mixed results. Most computer-
design interfaces that attempt to enhance “fluid intelligence” do so by testing work-
ing memory with binary puzzles and strategic decisions. Since such practices are
now known to be ineffective (Redick et al. 2013; Harrison et al. 2013), pattern tracing
exercises currently being planned are to be tested through such methods as inter-
view questionnaires (probing attitude and affect), paradigm-completion exercises
and solutions to novel scenarios (probing creativity and imagination). Experimental
design is planned following a thorough literature review of related testing methods
and best practices. Results are to be analyzed both quantitatively and qualitatively.
Since human visual processing is much better adapted to the recognition and repro-
duction of symmetrical patterns compared to manual processing (Humphrey 2004),
the results of manual tracing exercises are expected to differ significantly from the
results of eye-tracking tracings of the same patterns.
5. Conclusions
Returning to the riddle at the heart of this study, we are now prepared to consider
an earlier critical proposal. Could it be that the inherent constraints of geometric
space might explain (away) the phenomenon, making the patterns introduced above
more likely to emerge and re-emerge than others? This, in fact, is a conditioning
factor that can serve to enhance the embodied cognitive account the chapter goes on
to suggest. A pivotal distinction to consider in this connection, however – one that
keeps this account from being merely reductive – is the recognition that geometry
is not itself a set of disembodied abstractions. Two-dimensional Euclidean mappings
in particular are widely claimed to be manifestations of embodied human relations,
both experienced and remembered (Johnson 1987; O’Keefe 1993; VanLier 2003).
When such designs come to be reflected not only visually but also in patterns of
speech and social organization, we find potential common ground shared between
symmetrical visual designs and symmetrical reasoning.
Although relationships shared between linguistic symmetries and cultural dy-
namics are only beginning to be explored, the results of early explorations suggest
that careful reflection on intertwining relationships implied by cultural and lin-
Symmetrical Reasoning in Language and Culture 249
guistic symmetries can help make sense of structures, practices and transitions
that would otherwise seem illogical or contradictory (see Douglas 2007; Strecker
and Tyler 2009; Pelkey 2013a-d; Wiseman and Paul 2014; Strecker 2011, 2014; Paul
2014; Tyler 2014). In considering explanations for persistent symmetrical structures
found across languages, John Haiman argues that only a “creative aesthetic drive”
(2008: 47) could explain the on-going reproduction of such patterns across time
and space by widely diverse generations of speakers. As we have seen, the same
explanation also seems necessary to account for the intertwining gestalts described
above. In addition to helping us understand the origins of symmetrical reasoning,
then, further research at this intersection may eventually shed light on the origins
of imagination and the sources of conceptual blending (Fauconnier and Turner 2002;
Gall 2010; Abrahamson 2009).
As Lewis-Williams and Dowson (1988) suggest, the deliberate mimesis of such
designs is by no means a novel idea. In fact, there appears to be something of a
precedent for it in pre-historic cave art:
Although there is no room for communal activity, the vast number of marks on the
clay wall of this “sanctuary” suggests … that it has been repeatedly touched … [poss-
ibly by] novices who, having entered the caverns on a vision quest, reached out to
the existing depictions to absorb their power and to trace their own visual percepts …
(Lewis-Williams and Dowson 1988: 214).
Given the connections sketched out above, we may at least admit the possibility
that the prehistoric impulse in question is rooted in a more general capacity for em-
bodied modeling from whence symmetrical reasoning and other distinctive features
of language also originate. With this in mind, we may well expect that further
inquiry in this vein will provide answers to the question with which the chapter
began: “What are the origins of symmetrical reasoning?” Not only does cognitive
semiotics provide a natural atmosphere for asking such questions, it also opens up
a fitting habitat for finding answers.
Acknowledgements
This research is supported by the Social Sciences and Humanities Research Council
of Canada: SSHRC-IDG #430–2015-01226 “Steps to a Grammar of Embodied Sym-
metry.” Seedfunding was provided by a Social Sciences and Humanities Research
Council of Canada Institutional Grant (SIG), a Ryerson University Faculty of Arts
New Initiatives Award (NIA) and a Travel Grant from the Ryerson University Dean
of Arts. My appreciation goes to two anonymous reviewers, the editors and Stéph-
anie Walsh Matthews for their helpful comments on earlier versions of this paper.
The digital web app prototypes discussed were engineered by Rami Matloob.
Štěpán Pudlák
Chapter 16
Cognitive Semiotics of Mental Disorders, with
Focus on Hallucinations
1. Introduction
In this chapter I introduce the potential of a cognitive semiotic approach for the
study of mental disorders. The vast field of research of mental disorders is ap-
proached from several scientific perspectives. Each of these perspectives has its
benefits as well as its limitations. My aim is to integrate psychiatric accounts of
schizophrenia with semiotics as a methodology for the analysis of signs and sig-
nification. Such transdisciplinarity is one of the key features of cognitive semiotics
(Zlatev 2012: 14).
Psychiatric approaches to the symptoms of mental disorders are usually heuris-
tic, which means that they require an understanding of what these symptoms are.
Semiotics, on the other hand, offers theoretical concepts through which we can
analyse symptoms from a different perspective. We can then, for example, compare
hallucinations to other mental phenomena. In this chapter I shall argue that halluci-
nations are specific on the level of indexicality of signs and that their indexicality
is the same as in the case of sensual perceptions.
Section 2 outlines the current status of semiotic theories of mental disorders.
Then I focus on the discourse of psychiatry and specifically on how the symptoms
of schizophrenia are described. In Sections 4 and 5, a cognitive semiotic approach
informed by Peircean semiotics is presented and applied to hallucinations. I con-
clude by briefly comparing my cognitive semiotic analysis of hallucinations with
neuropsychiatric hypotheses.
The analysis of hallucinations should be a part of a wider semiotic study of
mental disorders. This approach doesn’t aim to be an exclusive interpretation of
mental disorders, but rather hopes to enrich the interdisciplinary study of mental
disorders with new viewpoints, concepts and hypotheses. If successful, the cognitive
semiotics of mental disorders may suggest theories to be validated by neurobiology,
new clinical methods for practical psychiatry or psychological treatment.
2. Th
e current status of theories of the semiotics of
mental disorders
Semiotics can assess mental disorders from several different points of view. It is
often used as a methodology for studying media content, e.g. the way the news
represent mental disorders (see Allen and Nairn 1997; Aarva and Tampere 2006).
252 Štěpán Pudlák
Although there are studies that bring together semiotics and psychoanalysis (see
Muller 2000), I will not include these perspectives in this study and instead focus
on a semiotic analysis of symptoms.
There are only a few studies that focus on mental disorders and schizophrenia
from the perspective of semiotics. In 1986, James Harrod published an article ar-
guing that schizophrenia can be understood as a “semiotic disorder” (Harrod 1986),
inducing discussion on whether it is primarily such, a “thought disorder” or a “lin-
guistic disorder”. This is a discussion that I will not contribute to in this chapter. In
contrast, my aim is to approach schizophrenia from the point of view of semiotics
rather than argue as to whether it is a “semiotic disorder” or not.
Among more recent studies we can indicate Chernygovskaya’s paper on the
semiotics of olfaction and hearing (Chernygovskaya 2004) or Elvevag’s, Wynn’s and
Covington’s paper Meaningful confusions and confusing meanings in communication
in schizophrenia (Elvevag et al. 2011), a study of a schizophrenic patient’s depiction
of his own concept of a time machine. Similarly to my approach, the authors propose
to analyze the specifics of the delusional expressions of a patient. In conclusion,
they state:
[…] the assertion of conjectures as if they were on the same level as established scien-
tific presuppositions is arguably similar mechanistically to delusional thinking in that
a distinctive thing about the delusion is not that it is false, but that the deluded person
does not question it. In this case, the resulting prose is incoherent science. (Elvevag
et al. 2011: 464)
What is common to most of these studies is that they use semiotics more as a back-
ground than as an actual approach or methodology. My aim is explicitly to use
semiotic terminology and theories for a more straightforward analysis of the cog-
nitive symptoms of mental disorders, rather in the manner of Gallagher (2007), on
which I comment later. First, I would like to present a brief account of the psychiatric
perspective on hallucinations.
scribe that the “voices” are theirs (sense of ownership), but something or someone
else is causing them (sense of agency).
But what is even more important for Gallagher is the distinction between a first-
order experience, which is a phenomenal, pre-reflective experience, and a higher-
order cognition, which is reflective and metarepresentational (Gallagher 2007: 33).
Gallagher argues for bottom-up accounts of hallucinatory experience against top-
down accounts. The latter explain hallucinations as misinterpretations of “normal”
experiences. For Gallagher, hallucinations are caused by impairment to the first-
order experience rather than higher-order reflective interpretation and he shows
that this perspective is coherent both with patients’ descriptions of their halluci-
nations and with neurobiological research.
This approach is fruitful, but I will try to integrate it with Peirce’s semiotic theory,
and use it as a methodological background for the study of hallucinations as one of
the symptoms of schizophrenia. Peirce’s semiotics offers a coherent theory of signs
and their features. Although the theory has major metaphysical and epistemological
dimensions, the most beneficial for this study is the semiotic dimension offering
sign classifications and accounts of aspects of signs.
The major aim is to analyze mental processes that are described as symptoms
of mental disorders in psychiatric discourse and compare these symptoms to other
mental processes. We have accounts of these symptoms based on psychiatric case
studies, but how can we analyse them? The key question here is: what is specific
about a hallucination and how is it different from other mental phenomena? How
can we distinguish hallucinations from products of the imagination, memories and
so forth? I suggest that mental processes can be understood and analyzed as signs
and processes of signification, and that the methodology for such study can be based
on Peircian semiotics. For Peirce, thinking is the interpretation of signs with other
signs, although in Peirce’s “mature semeiotic”, signs are also “outside’ the mind and
interpretation is not necessarily human (Short 2007: 289).
Through semiotic concepts we can analyze the specificity of mental processes
manifested by schizophrenics in much more detail. The psychiatric discourse focuses
mainly on the content and form of hallucinations, but also includes some semioti-
cally relevant elements, as can be seen in the following accounts.
The hallucinations consisted of a voice speaking in a clear, commanding tone with a
dominant manner and was of a man not known to the patient. It was experienced by
him as being outside himself although he did not believe that the voice was other than
an hallucination. (Glaister 1985: 214)
The most common hallucinatory experiences are voices talking to the patient or among
themselves. On many occasions, the voice, which can be identified as male or female, is
not associated with anyone known by the patient. The voice is usually clear, objective,
and definite, and experienced as coming from the outside. (Weiberger and Harrison
2011: 11)
Cognitive Semiotics of Mental Disorders 255
The semiotic study of schizophrenia should aim to analyze symptoms and their
specificity from a perspective different to the one provided by psychiatric discourse.
Where the psychiatric discourse focuses on “what” and “how” a voice in a halluci-
nation is telling (as in above quotations), semiotics is able to distinguish between
different levels of a sign and the relationships between the elements of a sign. As
I show in the following section, “what” a voice of a hallucination is telling is only
one of the levels that can be analyzed.
character of being a Rhematic Indexical Sinsign and its indexicality in this measure.
The indexical aspect of a hallucination draws attention to the “utterer”, which can
be in this case a specific person or an unidentified voice.
Indexicality, in Peirce’s sense of the word, is the connection of an experience to
something else, external. Short interprets Peirce as follows:
This idea of indexicality required a reconception of the category that in the ‘New List’
was known as ‘relation’. That reconception, under the new rubric, ‘2ndness’, was pro-
vided in Peirce’s phenomenological reinterpretation of the categories, beginning in
1885 (W5:235 ff.). In it, he identified phenomenological 2ndness as a two-sided aware-
ness of inseparable polarities, for example, force and resistance, action and reaction.
To feel oneself acting, one must feel a resistance thereto; to feel a resistance, one must
feel oneself exerting a force. Thus we experience something as being other than (‘ex-
ternal to’) our experience of it; both are particular. This experience is directional; hence,
spatio-temporal; self and other are by it experienced as located relative to one another.
In experience, 2ndness is itself indexical and accounts for our ability to think about
particulars, albeit thinking imports general terms. (Short 2007: 49–50)
The “external” aspect, to which the experience is connected to, does not necessarily
belong to “external reality”. It is external to the experience. Memory is external to
the experience of a memory itself. If we recall a certain memory, we exert a force
to recall it. The indexicality of a sign is thus its connection to a spatio-temporal
instance, i.e. something particular. It is directional in its nature. Thus a shout is
indexical in the sense that it directs our attention to the person who shouts and a
remembered voice is indexical in the sense that it directs our attention to memory.
The indexicality of a sign is its directedness towards its object when the object is a
specific, spatio-temporal instance. Nevertheless, even in the case of symbols, whose
character is that of a general law, an instance of such a symbol is always spatio-
temporal and so it is indexical in a certain manner (CP 2.262).
It is this indexical aspect of a hallucination as a sign that is being interpreted
as coming from the “outside”. Yet the specificity of a hallucination is not solely in
this interpretation. The indexicality of the “outside” or of “reality”, which focuses
attention on an “outside object/utterer”, is common to hallucinations and sensory
perceptions. In this aspect, a hallucination differs from imaginations or memories,
which have a different indexicality, for they focus attention on something internal
(“I’m imagining. I’m remembering.”).
In acute states of schizophrenia, a patient is usually not aware that he or she
is hallucinating. Yet in later stages of the disease, a patient could be aware of this.
When the hallucinations first appear, the patient is quite convinced of the external
reality of these experiences and may seek help in finding out who is transmitting the
messages and how this is achieved. […] In the later stages of the illness the patient will
learn that the “voices” are not real. Nevertheless, the experiences still have the quality
of “real” perceptions. (Frith 1992: 68)
Cognitive Semiotics of Mental Disorders 257
In this aspect, my description based on Pierce is coherent with the account given
by Gallagher (2007): hallucinations are not “misinterpretations” of higher-order
cognition, to use Gallagher words. However, I disagree with Gallagher that the
character of hallucinations can be explained by a disruption of “[s]ome neurological
component responsible for the differentiation between self and the other[…]” (Gal-
lagher 2007: 44), which is essential for a sense of self-agency. For example, imagining
and recalling something from memory may be voluntary as well as involuntary. I
can recall something as a result of my “intention” to recall it, but it’s not uncommon
to recall something without the sense that I wanted to recall it (e.g. by an association
to something that I see). In the latter case, I do not have a sense of self-agency, and
yet still understand it as something “internal”.
But then, how does a hallucination differ from a perception? What is different
is the object of a hallucination. In psychiatric discourse, a hallucination doesn’t
have an appropriate external stimulus (Frith 1992: 68). There has to be some other
person to tell the patient that the perception he or she has is not external, otherwise,
the patient cannot tell if he is hallucinating. So, does it require a second or third-
person perspective (feedback given to the patient regarding the “reality” of the
hallucination) to distinguish a hallucination? In other words, are hallucinations
definable only when some other person than the hallucinating patient is involved?
It is important to emphasize that for Peirce the object of a sign does not necess-
arily have to be a “real” object. Peirce’s Object may be perceptible, imaginable or
even unimaginable (CP 2.230). The key feature of an Object of a Sign is that the Sign
represents the Object. We have seen that in the case of a proposition (or Dicent Sym-
bol) it has to be on some level also a Rhematic Indexical Sinsign. But the Object of a
proposition on the level of a Dicent Symbol is different from the Object represented
by the same sign on the level of a Rhematic Indexical Sinsign. For example, when
someone on a street tells me: “Watch out for the car!” the Object of this sign as a
Dicent Symbol is the car (even though it doesn’t have to “really” be there), but the
Object of the same sign as a Rhematic Indexical Sinsign is the utterer of the sentence.
In order to understand what might be the Object of a hallucination as a Rhematic
Indexical Sinsign, we have to consider Peirce’s distinction between an Immediate
Object and Dynamical Object: “[…] it is necessary to distinguish the Immediate
Object, or the Object as the Sign represents it, from the Dynamical Object, or really
efficient but not immediately present Object.” (CP 8.343)
According to Peirce, the important difference here is that the Immediate Object
is an object of one instance of experience, when the Dynamical Object could be
an Object of more instances of experience (CP 8.183). When I catch a glimpse of a
landscape outside a window, the Immediate Object would be that of a landscape.
When I look more closely, I can see that it is but a realistic painting on a wall. The
Dynamical Object, which is the painting, was given differently in different instances
of my experience. The Dynamical Object was the same, yet the Immediate Objects
in those instances of experience were different. I have corrected my interpretation
because it was the same Dynamical Object that was the object of different signs
(Short 2007: 192).
258 Štěpán Pudlák
6. Conclusions
We may conclude that hallucinations are characterised by a distortion of indexicality
and so in the core of schizophrenia there must be a distortion of what gives dif-
ferent mental phenomena their indexicality. Whether hallucinations are closer to
memories or imagination (see Aleman 2001: 72), they are internal processes that
become signs given to consciousness. That suggests that there is a “neurological
component” in the sense of Gallagher (2007) where internal and external inputs
(sensory information, memory, imagination) are integrated as signs given to con-
sciousness where this shift between the indexical character of sensory modality is
wrongly associated with internal mental phenomena.
Cognitive Semiotics of Mental Disorders 259
Acknowledgements
The work was supported by grant SVV-2015-26024002.
Gisela Bruche-Schulz
Chapter 17
Pictorial Responses and Projected Realities:
On an Elicitation Procedure and its
Ramifications
1. Introduction
The pictorial responses to a narrative text that are focused on in this chapter raise
questions concerning their pictorial format, their semiotic status, and their underlying
reality-creating “acts of experience” (Thompson 2007: 268). Such acts of experience
open vistas on scenes and scenarios of a narrative that is continued into the readers’
own situation. The responses take the text that is read as contextual input,1 and exhibit
the readers’ “empathic perspective-taking” (de Waal 2008: 285) with the events and the
characters of a narrative world by filling it with detail from the readers’ own contexts
and perspectives. Thus, the readers’ abductive creations carry “the spiral of semiosis”
(Deely 2001: 39) into newly projected, pictorially framed, realities.
As for the notion of “reality”, I refer to Schütz (1962 [1945]). Schütz mentions,
with reference to Bergson, that “our conscious life shows an indefinite number of
different planes, ranging from the plane of action on one extreme to the plane of
dream at the other” (1962 [1945]: 212). He remarks that he prefers to talk about
“finite provinces of meaning upon each of which we may bestow the accent of real-
ity” (ibid: 230). Such bestowing of the accent of reality on a province of meaning is
accomplished by directing attention to it. When complying with the investigator’s
request to jot down what comes to mind, the readers apply their attention to this
task by using pen and paper. At the same their attention is directed towards a
world of “imagination”, but staying within the task-oriented “world of working”
(Schütz 1962 [1945]: 227) when producing material sketches, notes, and scribbles.
This doubly directed attention arises from an “intersection” of the “flowing modes
of givenness” (Husserl 1999: 312), and is explained by Schütz as follows.
In and by our bodily movements we perform the transition from our durée to
the spatial or cosmic time, and our working actions partake of both. In simultaneity
we experience the working action as a series of events in outer and in inner time,
unifying both dimensions into a single flux which shall be called the vivid present.
The vivid present originates, therefore, in an intersection of durée and cosmic time.
(Schütz 1962 [1945]: 216)
1 The text which is read is an excerpt from Saint-Exupéry’s Le Petit Prince (1943),
chapter XXV.
262 Gisela Bruche-Schulz
In this view, the material act of using pen and paper when complying with
the investigator’s request happens simultaneously in the intersection of inner
time (durée) and cosmic time.2 However, attention is focused (and backgrounded)
differently, and different sides of its directedness are applied when responses
are designed in the vivid presence which originates in the intersection of durée
and cosmic time. When the respondents choose their options of jotting down
what comes to mind they simultaneously engage in acts of unfettered imagin-
ation in inner time, but these remain, by the design of the elicitation events, “in
the world of working connected by communicative acts of working with the
Other” (Schütz 1962 [1945): 258). The elicitation events thus provide a working
time frame, so to speak, within cosmic time that is experienced in terms of
world time.
In Section 2, I address the question of how acts of imagination need to
rely on remembering and experiential concepts. I first sketch out briefly how
the term schema is used for addressing what is remembered and projected
onto current experience. I then turn to the question of how the conceptual
grounding of remembered and projected experience is accounted for from the
point of view of the basic experience of force gestalts (“balance – equilibrium”,
and “blockage – disequilibrium”). In Section 3, I provide a description of the
data that emerged from the elicitation procedure, and in Section 4 provide
illustrations and analyses of increasingly abstract pictorial responses, before
concluding by pointing out the experiential, first-person and second-person
nature of the methodology used.
2 Schütz makes use of Husserl’s perspective on the “double intentionality in the stream
of consciousness” (Husserl 1999: 218).
3 It might even happen that one of the readers enlarges a sample item of the pictorial
responses one day, frames it, and hangs it on a wall, thus turning it into an object of
fond admiration or a different form of appreciation. However, the original purpose –
for which it is used in this chapter – was to respond to an investigator’s request to
“jot down what came to mind”. The pictorial responses were produced as one-off
processual creations, not meant to outlive the life cycle of the paper onto which they
were jotted.
Pictorial Responses and Projected Realities 263
meaning impulse received from the text and the pictorial that is produced as the
response lies a common conceptual ground.4
3. The data
Five different groups of students were given the same one-page long excerpt of
Saint-Exupéry’s Le Petit Prince, each time in a different language (English, Written
4 The term “ground” is taken from Peirce who explains “[W]e cannot comprehend an
agreement of two things, except as an agreement in some respect, and this respect
is […] a pure abstraction. Such a pure abstraction, reference to which constitutes a
quality or general attribute, may be termed a ground.” (Peirce 1992 [1867]: 4).
5 Searle (1995) introduces his book by writing, “How does a mental reality, a world
of consciousness, intentionality, and other mental phenomena, fit into a world con-
sisting entirely of physical particles in fields of force? This books extends the inves-
tigation to social reality: … money, property, marriage, governments, elections, foot-
ball games, cocktail parties and law courts…”. (1995: xi)
Pictorial Responses and Projected Realities 265
Chinese, German, Russian, Turkish), on five separate occasions. The excerpt con-
sisted of the first page of chapter XXV. The student-respondents were asked to
“underline the words for which something came to mind” ad lib, and to “jot down
what came to mind in the margin of (the one-page-long) text” ad lib. Each group
was given ten minutes for the task. Other than this time limit, there were no special
constraints.6
Across the five languages, the students proved sensitive to underlying semantic
concepts such as affirmed or negated assertions, questioning senses, boundedness
or unboundedness of events, and the states of balance and equilibrium. These under-
lying semantic concepts are all to be explained as expressing force gestalts (Johnson
1987: Chapter 5). Affirmed assertions do not show a blockage, negated assertions
do. Questions open up moving lines or scales that await to be stopped. Unbounded
events move on and on, with no end in sight. Balance and equilibrium can be
depicted by compositions that use mappings of “symmetrical force vectors meeting
at a point onto a curved surface” (Johnson 1987: 86). That these observations are
not random or irrelevant is shown by the fact that the students’ responses are not
random.
The response numbers correlate, in all five language groups, with the textual
segments containing such semantic concepts (Bruche-Schulz 2014, 2013). It is true
that it is only the sum of the responses (written and pictorial ones) that contribute
to such correlations. I take it that this does not invalidate the fact that there exist
correlations. The particularity of the pictorial responses consists of the immediate
visibility of force patterns by the diagrammatic line compositions. I suggest that
the students, when responding in both written or pictorial formats, at a level not
accessed by conscious awareness, responded to a “meaning”. It is the meaning
that is preserved in the semantics of the languages, and stored in the memory of
a motive force that generates states of equilibrium and blockages thereof (e.g. by
affirming vs. negating an assertion, by an ongoing “unbounded” vs. a bounded
event). There was no mention of the possibility to respond by drawings. Drawings
and pictographs were produced all the same, either solely, or in connection with
words that situated a depicted space in some context. The page that was given to
the five groups of student-respondents consisted only of a text, an excerpt from the
beginning of chapter XXV without the accompanying drawings by Saint-Exupéry.7
6 I came across this particular design variation of the ethnographic method of self-
report by chance (Bruche-Schulz 2014, 2013). Different from the ethnographic
method in anthropology which uses interviews for self-reports, I cut short the time
given to ten minutes. In addition, I did not ask questions, but presented a text to let
the respondents freely associate when reporting on what came “to mind”.
7 The title page of Le Petit Prince has as its subtitle “Written and Drawn by Antoine de
Saint-Exupéry”. This subtitle is shown in all translations, and Saint-Exupéry’s draw-
ings are included in all translations. The text which was read during the elicitation
events did not show any of these drawings by Saint-Exupéry.
266 Gisela Bruche-Schulz
The student-respondents came from different countries, and lived different every-
day lives. Given the fact that the instructions did not impose any constraint on
the content of what was expected to be “jotted down”, naturally, their pictorial re-
sponses showed considerable differences in terms of motifs and preferred topics. In
a similar vein, the preference for either the pictorial or the written format seemed
to be due to personal dispositions, i.e. not to correlate with a specific parameter.
In Table 1 the total of all response instances, be they in written or pictorial format, is
mentioned first, e.g. 179 (159). The value in parentheses refers to those responses that
were clearly aligned with the text. The number in front of the parentheses includes
also UAV-responses (unaffirmed-view responses): responses for which no alignment
with specific words in the text (by arrow, underlining, circling) was shown.8 Next, the
percentages of responses in pictorial formats are shown. Close to half of the respon-
dents (40 % and 41 %) used pictorial formats in the groups reading the one-page-long
text in English and in Turkish.9 In the other groups, the numbers of pictorializers were
lower (25 % in the Russian and German groups, and 17.6 % in the Chinese group).
Table 1 also indicates the number of “pictorializers” within their respective
language groups. In the Chinese group, there are only three persons who jot down
responses in pictorial formats, in the German and the Russian groups there are 4
who do so. In the Turkish and the English groups there are 7 and 8 pictorializers
respectively. The group average of pictorializer-persons is the lowest in the Russian
group and the highest in the English group. When it comes to the peak value per
person, student P17E (P[erson17]) of the English group produces the highest peak
value. The lowest peak-value output of 4 pictorials per person is found both in the
German and the Russian group, produced by P9G (P[erson9]), and P9R respectively.
As can be seen from the peak values per individual pictorializer, only a small
number of respondents contribute to the high numbers of pictorial response items.
Higher numbers of pictorial items per person occur mostly when readers use pre-
dominantly sketchy pictographs. The sheer distribution of the numbers shows dif-
ferent preferences for either written or pictorial responses.
8 In this chapter, UAV responses are included in the presentation, since – unlike in my
previous attempts (Bruche-Schulz 2014, 2013) – the focus is not on the quantitative
relations between the underlined (also circled, or bracketed) words signalling a sem-
antic concept and the resulting response numbers.
9 I will use the shorthand form of ‘English, Chinese, German, Russian, Turkish
group(s)’. The number of students in each group ranged from 16–20. The ‘Eng-
lish group’ consisted of Cantonese-English bilinguals who were students of English
Language and Literature at a university in Hong Kong. The students of the ‘Chinese
group’ were in their final year of a secondary school in Hong Kong studying Re-
medial English. They were speakers of Cantonese (reading the Written Chinese) and
were preparing for university entry, or other further training. In the ‘German group’,
native Germans and four students from other European countries were taking part
in a class on semantics at a Berlin university (taught in German). The Russian and
the Turkish groups consisted of students of bilingual secondary schools in Berlin,
Germany, studying their last year at these schools and preparing for university entry.
Pictorial Responses and Projected Realities 267
The difference of time and place, as well as of language of translation of the French
original might have played a role during the elicitation events, the relation between
investigator and students across the groups perhaps as well. The English and the
Turkish groups welcomed me with a “let’s-have-fun” mood. The other groups be-
haved in a less spontaneous manner.10
As for the types of the responses, they can be identified according to the degree of
figurative detail. The student-respondents chose: (a) line drawings showing scenes
and scenarios with figurative detail, either visibly populated by living beings or
meant for them, (b) abstracted imagistic detail applied to motion in space, with
increasingly reduced figurativity and (c) pictographs, i.e. highly conventionalized
and abstracted pictorial symbols. In all three response types, the pictorial gist may
be complemented by written words that introduce a specific discursive quality. In
the following section, I first comment on how I see the readers’ responses as situated
in “provinces of meaning” (Schütz 1962 [1945]), and then present examples of the
three types of responses (a–c).
10 There were thus some differences concerning place and time, as well as the mood
towards the elicitation events. Altogether, however, the fact that the same text which
was read by five different groups at five different places and times and in five dif-
ferent languages yielded comparable quantitative values (distributions) should mean
something, and I try to explain this here.
268 Gisela Bruche-Schulz
4. P
rovinces of meaning, accents of reality, and response
types
When the readers produce their responses as drawn on paper with a pencil or ball-
point pen, they aim at the world of the “pragmatic motive” (Schütz 1962 [1945]:
234). By doing so, they follow the request to produce a jotting. When they focus
their attention on the thing experienced as a meaning relevant to themselves, they
enter the plane of imagination upon request rather than upon their own initial
impulse. Thus, when complying with the investigator’s request to jot down what
comes to mind, the readers apply their attention to the task by using pen and paper.
At the same time – while producing material sketches, notes, and scribbles – their
attention is directed towards a world of “imagination”, while staying within the task-
oriented “world of working” (Schütz 1962 [1945]: 227). Again, at the same time, the
request was framed as a playful event, outside the usual teaching situation11, i.e. not
bound with the usual student-teacher relation. They, thus, may even be compared to
children “who play together in their make-believe world, […] indulge with Others in
the same ritual [since still] connected by communicative acts of working with the
Other” (1962 [1945]: 258). From Schütz’s viewpoint, the readers thus remain in the
all-encompassing world of everyday life where imagination precedes the projecting,
or planning of some action, comparable to a scientist who does what she is doing
and likes it, i.e. emotionally confirms her work as being meaningful and fulfilling.
All these drawings use primarily schematic (diagrammatic) modes of presenta-
tion. Only the drawings shown in Figures 1–4 make use of figurative elements in
various degrees of intensity and purity. While these diagrammatic pictorial creations
mirror the textual input, they themselves may motivate textual output. Viewers may
talk about picture signs. In other words, there may result in “discursive effects of […]
the relationships exhibited in diagrams” (Pietarinen 2014: 118). Like the words of the
text which was read have an effect in that they “excite in the mind of the receiver
familiar images […] quite detached from the original circumstances of their first
occurrence” (Nöth 2008: 93), the structural parallelism of the resulting diagrammatic
design of the pictorial responses may be translated forward into typified discursive
signals of new contextualizations.
In the analysis, the drawings are shown and complemented with (projected)
discursive event types as aligned with the reality status of “everyday life”12: (a) Con-
versation: impacting on socio-cohesive relations, (b) analytical commentary: meant
11 The elicitation events took place before a lecture or teaching session started.
12 “[W]ith respect to the paramount reality of everyday life […] this reality seems to us
to be the natural one, and we are not ready to abandon our attitude toward it without
having experienced a specific shock which compels us to break through the limits
of this ‘finite’ province of meaning and to shift the accent of reality to another one.”
(Schütz 1962 [1945]: 231). – The discursive event types of conversation, analytical
commentary, and evaluation belong to the stock of frequently experienced and ex-
pected discourse types.
Pictorial Responses and Projected Realities 269
to help the pragmatic goal of shaping an environment for a purpose, and (c) evalu-
ation of state of affairs: assessing the chance of human intervention.
Figure 1. P 13E at s[9] “was not like the wells of the Sahara”. Discursive style:
a) Conversation: Don’t go there in summer. It will be unbearable. b) Analytic
description: The artist managed to give a rich impression of a difficult
environment with only a few strokes. c) Evaluation: Adversity of forces can
probably not be removed by the counter-force of human action.
Three single-word expressions that accompany this line drawing strengthen its con-
ceptual import. The force of the heat with the resulting types of sensory experience
is captured in the words “hot – thirsty – silent”. No specific entity is referred to as
being “thirsty”. P[erson]13E belongs to the English-language group who jotted down
their responses in Hong Kong. Every Hongkonger knows (from intense episodic
270 Gisela Bruche-Schulz
memory) that heat makes all mammalian bodies thirsty, and that heat disturbs the
bodily equilibrium. Human bodies are not displayed by figurative means, but are
invoked by the verbal commentary.
In Figure 2, it is also s[9] which is in focus. At s[8–9], the text reads: (s[8]
“Vardığımız kuyu s[9] çöl kuyularına benzemiyordu” – “s[8] The well that we had
come to s[9] was not like the wells of the Sahara”). P[erson3] of the T[urkish]
group (P3T) sketches out a space that, this time, includes a self in an explicit
manner (the larger stick figure), together with the “little prince” – the smaller
stick figure. She indicates the larger stick figure as being herself (ben “I”) in
Turkish. The smaller stick figure, standing under a tree (!), represents the little
prince (“kleiner Prinz”, the wording here is German). P3T engages in pretend-
play when expressing wonder and delight about the existence of the one tree in
the desert that is created by herself. She adds a bracket to the sketch of the tree,
and explains (in German): “in der Wüste nur ein Baum” “in the desert only one
tree”. In the process of delineating this imagistic perception (effected by a line
drawing!), the “tree” comes to symbolize an affordance which is “working as” a
shelter against the burning heat.
The line drawing projects a scenario that is meant to alleviate the effects of the ex-
treme desert climate. The reader-respondent creates a tree to rescue the situation,
i.e. for restoring a state of equilibrium. P3T thus gives the narrative sequence a new
turn. She plays on the sense of “care” when creating a tree-scenario that does not
exist in a desert, and involves herself as the larger creator stick-figure, thus engaging
in symbolic play (known from children’s play and works of art).
Next, in Figure 3, P8E designs a space in which motion is happening. At s[2]
(“[Men] set out in on their way in express trains”), the swishing sound of a fast-
moving train is depicted by lines running in parallel beneath and on top of a loco-
motive.13 At s[5] (“Then they rush about, ….”), three stick figures, shown with legs
moving and arms outstretched, “rush about” in the fast motion of the experience of
13 The lines in parallel belong to the inventory of schematic design features, the loco-
motive, with windows and wheels and a requisite shape, represents a figurative feature.
Pictorial Responses and Projected Realities 271
being in a “hurry”. The concept of motion in space is thus recorded and exemplified
by two different sources, the “speed” of the train and the humanly formed “hurry”.
An intense emotive element permeates the sensory experience of rushing (panic,
i.e. disturbed equilibrium).
In Figure 4, spatial space is to be read off the intersubjective space denoted by a stick
figure that is placed in quotation marks as raising her arms in despair. The quotation
marks are addressed to an audience. The written commentary “Where? to do what? a
lost life” complements and intensifies an emotive state; here, the genuine pretend-played
and quotable, feeling of despair. Like P3T in Figure 2, P13E engages in symbolic play
that rehearses both the skill of giving an expression to an affective state (despair), and
the quotative means of referring to an ever-present audience. The quotation marks are
placed around the stick figure, not around the words. Thus, an abstracted line drawing
presents a human being that is heard speaking in the world she inhabits.
Figure 4. P 13E at s[1–4]. “Where? to do what ? a lost life.” Discursive styles: a) Call for
help: A body is in disarray. Someone suffers! b) Analytic observation: A human
being may use quotative language when referring to a context other than the
present one. c) Evaluation: Oppressive force should be understood as a call for
help and should be taken seriously.
272 Gisela Bruche-Schulz
Figures 1–4 can be said to reflect abductions that generate imagistic creations with
various degrees of schematicity. The readers seem to engage in efforts to seriously
apply themselves to the task to jot down “what comes to mind” at the moment of
reading. The task was put to them in the context of the world of the “pragmatic
motive” that governs everyday life (Schütz 1962 [1945]: 208). Since all respondents
jotted down their responses in this context, they should be seen working in the
same “reality”, as defined by Schütz.
When put in the context of the accompanying line drawing (Figure 7), it may
connect with a scenario. Two human figures are walking under the stars, one of
them wearing a crown. The bucket tells of the water that was finally found in
the desert (S[egment 40] of the text reads, “I raised the bucket to his lips.”) Two
figures, dressed in robes, one of them with a crown, are walking under the stars.
The presence of the bucket filled with water on the physical page that was read,
“feels good” since water was finally found in the text printed on that physical page
(state of equilibrium). Words from the text world serve to elaborate the sensory
presence of darkness, and the “grass” on the ground radiates light. Motion, space,
and sensory feeling belong to an “I” who perceives intrinsic spatial properties.
Figures 6 and 7 make use of a mixed figurative and diagrammatic design as well
as of words that elaborate sketchy elements like stars and blades of grass, similar
to what is found in Figures 1–4.
The line drawings in S[40–45] above presents a scenario with an adverse force (dark-
ness). The restoring counterforces of togetherness, as well as achievement (finally
able to drink water in the desert) restore balance and equilibrium.
In Figures 8, 9, and 10, response items are shown that exhibit a further reduc-
tion of detail. In Figure 8, P19E underlines S[34] and provides a description of the
sunlight that shimmers in the still trembling water. The water in the bucket that
reflects the sunlight is presented by wavy dot-like lines. This abstracted style of
pictorializing still follows the text quite closely, and there is still a literal, visibly
iconized connection between the source and the receiver of the sunlight.
The dot-like lines convey a diagrammatic semblance with water when filling the
entire bucket; the shape of the bucket is sketched out, complete with its handle,
and the rays of the sun as streaky lines. The three separate parts of the sketch in
S[34] (Figure 8) together invoke a spatial space and the result of a particular kind
of motion, the transfer of heat that is to be read off the reflection of sunrays in the
water. Both figurative (sensory effects of sun rays) and diagrammatic line elements
are used in this drawing.
The sketch of Figure 9 demonstrates how the very same motif within the same
framework of perception (transpiring from S[34] “sunlight shimmer in the still
trembling water”) is undergoing a process of abstraction. The sketch seems to be
freed from an explicit spatial context. It is shrunk into a symbol. As above in Fig-
ure 8, the force of heat that impacts on water is captured. But now, the source of
the heat is no longer shown. The sun, the source of the heat, is shrunk into a dot on
the surface of the water that fills the bucket. A “shrunk dot on the surface” when
understood in connection to the sun reflects an analogy, and “[a]ny analogy is a
diagram because it represents a parallelism between the structures of two con-
ceptual domains” (Nöth 2008: 88).
Pictorial Responses and Projected Realities 275
4.2.2. A
nalogies and schematics: Parallelisms between conceptual
structures
In Figure 10, P[erson8] of the German group underlines the word “Kreis” (“circle”)
at S[egment 5]: (Nachher regen sie sich auf und drehen sich im Kreis… “Then they […]
get excited and turn round and round…”). She then jots down in the margin the un-
adorned shape of a circle, of the size shown by the letters of the written words. The
question arises whether or not the degree to which the circle is reduced to a naked
shape carries a message by itself. Technically, there is then again, like in Figure 9,
an analogy which connects the structure of two conceptual domains.
From Figure 1 to Figure 9, the drawings have been shown and complemented with
(projected) discursive event types as aligned with the reality status of “everyday
life”. The evaluative commentary for Figure 10, seems to give away some difficulty.
Does the degree to which the “circle” is reduced to a naked shape allow us to read
off some kind of mockery or boredom? Obviously, the question of whether there
is an underlying feeling that motivates the production of the readers’ pictorial re-
sponses is, for the analyst, a question of judgement concerning the felt impact of the
“halos” of things (Schütz 1962 [1945]: 224). In Peirce’s words, “[I]n formulating the
276 Gisela Bruche-Schulz
judgement [we perceive the image of our judgement and] we are, in the very act of
formulation, aware of a certain quality of feeling” (1998 [1903]: 247).
A last example demonstrates how the pictorial creations express the qualitative
aspects of a feeling. Figure 11 below covers a whole page. P17E seems to engage
in just having fun. I count altogether twenty items of pictographic depictions that
P17E jots down, obviously with great enjoyment and fervour. All pictorial creations
consist of abstracted line images that come across as sketchy drawings with degrees
of a more or less pictorial influence.
5. Concluding remarks
In this chapter, I focused on pictorial responses to a narrative text. The inclination
to respond either pictorially or verbally differs among the respondents. Among the
five groups of reader-respondents, the English-language group provided the largest
number of pictorial responses. Pictorials activate a visual scene which is not placed
in some past, present, or future, while linguistic discourse is more or less tied to
temporally anchored states and events. Both pictorial and written responses are
the products of “working […] in outer and in inner time, unifying both dimensions
into a single flux which shall be called the vivid present” (Schütz 1962 [1945]: 216).
The narrative text itself already presents scenes and scenarios whose conceptual
cores consist of gestaltist patterns of experience. This is not immediately visible, but
the distribution of the response numbers onto the segments of the text mirror the
force-gestaltist (diagrammatic) aspectual semantics of the narrative text (Bruche-
Schulz 2014). Even if not consciously aware of this, the respondents’ thoughts and
feelings appear to be shaped by it, and the pictorial creations express this type of
hidden awareness through line drawings directly.
When the reader-respondents engage in abductions that generate imagistic cre-
ations, two styles emerge. The first seems to be born from a joyful, but also serious
engagement with the abductive inspirations that “come to mind”. These pictorial
creations are composed of lines and non-conventionalized figurative elements. The
second style is marked by more and more conventionalized motifs. Such near-to
pictographic creations, especially when used in large numbers, seem to express a
mood of merriment and play. Singular instances seem to express irony. The present
author and analyst of the readers’ responses approaches an understanding, i.e. a
judgment on their meaning, on the basis of the quality of a feeling as well. Such
feeling is needed to convince us of an existing reality. In Schütz’s words, when the
attentional attitude towards the “world of everyday life [achieves the] suspension of
doubt” (Schütz 1962 [1945]: 230), a perceived givenness can be affirmed as real, con-
firmed by the process that needs “the emotional interpretant” (Peirce 1998 [1907]:
410) among others. The key notions of this analysis are therefore quality of a feeling,
experience of reality, and force-gestaltist diagrammatic patterns.
– 2006a. Küçük Prens. [Avunç, Yaşar, trans. Translation first published in 1987.]
Istanbul: Mavibulut Yayıncılık, 79–80.
– 2006b. Malen’kij Princ. [Gal’, Nora, trans., foreword]. Moscow: Detskaya Liter-
atura, 99–101
Peter Coppin, Ambrose Li & Michael Carnevale
Chapter 18
Iconic Properties are Lost when Translating
Visual Graphics to Text for Accessibility
1 Introduction
Imagine a blind or low-vision individual who needs to access a graphic represen-
tation, for example a financial chart (Figure 1a). Unlike a sighted individual, who can
see the actual chart, what the blind or low-vision individual accesses, usually aurally,
is often its text description. Both individuals are accessing a representation of rising
and falling stock prices over time. However, whereas the sighted individual sees
words and undulating shapes, all the screen reader user hears are words (Figure 1d).
Such is the state of the art in accessible graphics: Many blind and low-vision
individuals depend on approaches like the Web Content Accessibility Guidelines
(WCAG) to provide them with text descriptions – translations of the graphics into
text which these individuals can then access via a screen reader, usually aurally via
text-to-speech. Text descriptions are essentially interpretations meant to convey
the meaning intended by the author of the graphic representation. However, if a
text description can fully convey the meaning of the graphic image, then why did
the author create it in the first place?
In the foregoing scenario, parts of the chart, predominantly numerical values
and labels (Figure 1c), are already text and will naturally carry over to the text
description. However, we argue that the rest of the chart (Figure 1b) is not and
cannot be adequately1 translated into text because these parts that are originally
conveyed via shapes are fundamentally different from the parts originally conveyed
via text. Shimojima (1999) calls this fundamental difference the “graphic-linguistic
distinction,” while a semiotician might see this as Lessing’s dichotomy of “pictures”
versus “literature” (cf. Sonesson 1988); in either case, one might predict the parts
originally conveyed via shapes to be “lost in translation.”
Suppose a designer wants to improve the state of the art and design a system of
accessible graphics that minimizes what is lost in translation. Let us call this our
target design problem. Because the state of the art is often aural, we claim that a
sound-based solution is appropriate given our target design problem.
1 We are adapting Toury’s (as cited by Palumbo 2009: 8) definition of adequacy here,
so by adequately we mean being able to subscribe to the norms of the source culture
(of visual graphics) and to express the relations expressed in the source culture. This
definition is consistent with the understanding that text descriptions should convey
the author’s intent.
280 Peter Coppin, Ambrose Li & Michael Carnevale
Figure 1. D
econstruction of a financial chart. The original chart (a) comprises parts
conveyed via shapes (b) and parts conveyed via text (c), but in a text
description (d) parts originally conveyed via shapes are also conveyed via
text (e). Adapted from “Web Accessibility Best Practices: Graphs” by Campus
Information Technologies and Educational Services (CITES) and Disability
Resources and Educational Services (DRES), University of Illinois at Urbana/
Champaign. Copyright 2005 by University of Illinois at Urbana/Champaign.
Although touch (cf. Kennedy 1993) might be a more natural fit than sound, if the
state of the art is primarily aural, a sound-based solution is probably more im-
mediately applicable and aligns better with our target design problem. Also, al-
though screen readers can deliver text descriptions tactilely as braille, refreshable
braille displays are expensive in comparison to audio hardware. Furthermore, al-
though touch screens might be tactile, Klatsky, Giudice, Benntt, and Loomis (2014)
have shown that conveying information tactilely via touch screens suffers from
challenges related to haptic perception. Touch screens are also not as ubiquitous
and inexpensive as audio hardware.
Iconic Properties of Visual Graphics 281
Now, to solve our target design problem we will first need to conceptualize
what is actually lost, which means we need to identify distinct and common
properties of graphics relative to text. Theoretically, then, our true goal is not
our target design problem, but the conceptualization of what is lost during the
translation from visual graphics to text descriptions – in other words, a theor-
etical model. Practically, however, this model can then inform the design of new
approaches for conveying properties of graphically represented shapes in non-
visual perception modes, which in our target design problem would be sound.
While we will not discuss specific plans to test the model, since the translation
of graphics to text is a translation process, the concept of “back-translation” (cf.
Palumbo 2009: 14) can be adapted to serve as the basis of experiments to test and
challenge the model itself.
Our conceptualization of what is lost will be based on the science of perception
and cognition, but expressed in terms compatible with semiotics. Although our
destination is cognitive semiotics, we will begin, in Section 2, in an area of com-
puter science where research and practice relevant to our target design problem is
transpiring, but, in our view, lacks a means of describing what is lost. By reviewing
several leading accounts of the “graphic-linguistic distinction” from the field of dia-
grammatic reasoning, what will emerge is that charts and graphs designed for visual
perceivers are mostly composed of items labelled via text, but located in relation to
other labels through visually perceived spatial, geometric, and topological relation-
ships. This section will discuss how spatial properties of sound could be recruited
to convey spatial, geometric, or topological relations among labelled items that are
currently lost in translation.
Synthesizing the various accounts of the graphic-linguistic distinction results in
a pattern that suggests a more effective means of translating what is lost, but the
terms for conveying spatial-topological-geometric properties are unwieldy com-
pared to corresponding terms in semiotics. Recruiting well established terms and
concepts from semiotics will partially replace these unwieldy phrases. At this point,
diagrams can then be described as iconically conveyed relations among symbolically
conveyed items or objects. However, what principles could guide a designer in dis-
tinguishing between symbolic and iconic properties of a graphic image? Also, how
could a designer identify appropriate mappings from iconic properties of visual
graphics to those of sound to convey the same relations?
To answer these two questions, we introduce Coppin’s (2014) perceptual-
cognitive model, which distinguishes between what he calls the pictorial and sym-
bolic properties of graphics, and where he conceptualizes diagrams as pictorial
relations among symbolic objects. Pictorial and symbolic properties in this model
will be shown to be respectively akin to iconic and symbolic properties from se-
miotics. With this synthesis in place, the final section applies the synthesized model
to the target design problem to precisely identify what is lost in translation and
offers a solution to inform translation.
282 Peter Coppin, Ambrose Li & Michael Carnevale
2. Th
e graphic-linguistic distinction: Implications for
sonic interfaced design
Let us now examine the “graphic-linguistic distinction.” Out of seven candidate dis-
tinctions that Shimojima (1999) identifies, we discuss four here in relation to how
they extend the idea of this distinction into sound. Because Shimojima’s discussion
assumes that “generally, pictures, images, and diagrams are graphical represen-
tations” (ibid: 313), any characterization of graphical representations here applies
also to diagrams.
Figure 2. D
iagrammatic versus sentential representations via sonic and visual perceptual
modes
Because Larkin and Simon’s definitions do not require the information in the data
structures to be visual, text descriptions in their text-to-speech form are already by
Iconic Properties of Visual Graphics 283
definition sentential. Now note that forming a 2D space in the sonic domain will
require two properties of sound that can be independently manipulated and perceived.
Identifying such sonic properties should in principle enable designers to construct
sonic external representations that are diagrammatic (as defined by Larkin and Simon),
which, when used to translate visually perceived diagrammatic structures, should en-
able the conveying of topographical and geometric relations that cannot be conveyed
via text-to-speech translations. We can imagine, for example, how in Figure 2 (lower
right) a blind or low-vision user could press arrow keys to move an “audio cursor”
along the x-axis of an imagined 2D space to perceive the contours of the graph.
Figure 3. B
y scrubbing a “sonic cursor” along an axis, audiences could access sonically
conveyed relations through changes in pitch and via stereo.
284 Peter Coppin, Ambrose Li & Michael Carnevale
Diagrams are physical situations. They must be, since we can see them. As such, they
obey their own set of constraints . . . By choosing a representational scheme appropri-
ately, so that the constraints on the diagrams have a good match with the constraints
on the described situation, the diagram can generate a lot of information that the user
never need infer. Rather, the user can simply read off facts from the diagram as needed.
This situation is in stark contrast to sentential inference, where even the most trivial
consequence needs to be inferred explicitly.
To illustrate that “diagrams are physical situations,” consider how the fact that a
diagram conveys topological and geometric information through visual perception
enables the illustration in Figure 4a in Section 3 to describe many relationships.
However, each description conveys a different story about what is shown visually
and therefore affords different inferences. Barwise and Etchemendy (1990: 22) thus
note that a diagram can show “countless facts” (by which they mean one can con-
struct multiple sentences from a diagram).
When Barwise and Etchemendy (1990: 22) refer to diagrams as “physical situ-
ations,” they are referring to properties (and affordances) of diagrams that can inter-
act with a human perception system. Designers seeking to extend the affordances
of visual diagrams to the sonic domain are challenged to identify properties or
dimensions of sound that similarly make use of “physical situations” (that inter-
act with human perception) to enable multiple stories to be described about the
relationships that are shown sonically.
Extending the example in Figure 3, the hybrid stereo–varying frequency inter-
face should enable one to “hear the shape” of a contour. If text-to-speech labels are
indexed to the contour, then the user should be able to form multiple sentences
about the geometric and/or topological relations among the labelled elements.
of sound so that “countless facts” (Barwise and Etchemendy 1990: 22) can be
inferred from those sonically conveyed physical situations.
3. A provisional model
We now introduce a basic model of perception and action based on Coppin (2014),
with the goal of integrating these ideas into a coherent system to inform design
and aid analysis.
what could be picked up from the physical world through lower-level perceptual
categorization capabilities and simulators.
Note that in Coppin’s model, a perceptual system can still pick up the pictorial
properties of writing from an unfamiliar language, but the individual may be un-
able to produce simulations necessary for the symbolic properties (simulations of
the author’s probable intended meaning; see Coppin, 2014, Chapter 3). Because
symbolic properties make use of higher-level simulators at the convergence
of sensory modes and are more amodal, Barsalou (1999: 578) defines it as “in-
herently nonperceptual” (we interpret this to mean not easily mapped back
to what a perceptual system could pick up in the world); these simulations
are more likely to fall under conceptual categories that do not correspond to
what could be perceptually processed from the physical world. Instead, symbolic
properties typically relate to their objects through socio-cultural conventions
that derive their information relative to their objects through “memory traces
created during cultural learning” (Coppin 2014: 64). This coincides with how
symbols in semiotics are also understood in terms of relating to their objects
through convention.
Coppin’s perceptual-cognitive conceptualization of symbolic properties is
therefore quite consistent with the notion of symbolicity from semiotics. How-
ever, to prevent confusion, this integrated notion of symbolicity will be referred
to as symbolic´.
Figure 5. M
usic notations use a convention where marks at higher elevations represent
higher-frequency sounds. Note. Bach, J. S. (n.d.). Suite in G minor, BWV 995
[Lute score]. Retrieved January 15, 2015 from http://commons.wikimedia.org/
wiki/File:Bachlut1.png
Let us next recruit the iconic´ and symbolic´ definitions to more carefully con-
ceptualize sonic versions of visual charts and graphs.
Table 1. D
iagrams are composed of iconically´ represented relations among symbolically´
represented objects and sentences convey symbolically´ represented relations
among symbolically´ represented objects.
Diagram Sentence
Relations Iconically´ represented Symbolically´ represented
Objects Symbolically´ represented Symbolically´ represented
In a visual or sonic diagram, the relations among symbols′ are conveyed iconically′
(see Table 1) because they are picked up by sensory receptors via light/sounds re-
flected from a surface, and what is meaningful about what is perceived is the per-
ceptually processed relations among the symbols′. In contrast, sentential relations
are conveyed symbolically′ (see Table 1) because, although iconic′ properties of
visual graphics are picked up by sensory receptors via light reflected from a surface,
what is meaningful about what is perceptually processed are the author’s intended
simulation (meaning), and the conceptual category that the intended simulation is
intended to fall under.
Let us now compare this chapter’s notion of diagram developed thus far with
Peirce’s definition. In Peirce’s system, diagrams are a type of icon, and are defined
by their “skeleton-like sketch of relations between parts” (Stjernfelt 2000: 363). The
notion of diagram developed in this chapter thus far, where relations among objects
are conveyed via an iconic´ perceptual feature (e.g., visuospatial location, pitch,
tactile size, etc.), while objects are denoted symbolically´, is potentially synergistic
with the Peirce-Stjernfelt account because iconically´ conveyed relations could be
seen as akin to the aforementioned skeleton-like sketch of relations among parts.
The word “diagrammatic” could thus be a term that refers to iconically´ conveyed
relations among parts.
Iconic Properties of Visual Graphics 293
4. Conclusion
In this chapter we have proposed a provisional model to underpin the various
accounts of the graphic-linguistic and iconic-symbolic distinctions described in
the diagrammatic reasoning literature. The model allows us to extend the graphic-
linguistic and iconic-symbolic distinctions into aural and cross-modal domains.
In the process, we have attempted to integrate various definitions of key notions
from cognitive science and semiotics, making our project an exercise in cognitive
semiotics (cf. Zlatev 2015a).
The model distinguishes between two interrelated capabilities: Lower-level per-
ceptual capabilities to pick up and perceptually process concrete structures of an
environment, and higher-level capabilities to process and interpret how perceptually
processed structures fall under more abstract conceptual categories.
The model conceptualizes iconic´ properties of both visual graphics and sound
as what is picked up by sensory receptors and processed by lower-level perceptual
categories and simulators that develop over a lifespan to enable individuals to
perceptually process occluded surfaces and edges (topographical and geometric
relationships) of a physical environment, thus enabling the perceptual processing
of geometrical and topographical relationships that are other than the surface of
the external representation.
Iconic Properties of Visual Graphics 295
The model thus predicts iconic´ properties to afford the communication of con-
crete structures more effectively than symbolic´ or linguistic properties. Also, sym-
bolic´ properties are thought to emerge when an individual perceptually processes
iconic´ properties of an external representation that an author intentionally config-
ures to cause the perceiver to have a simulation that falls under the author’s intend-
ed conceptual category. The model thus predicts symbolic´ properties to afford the
communication of conceptual categories more effectively than iconic´ properties.
By reverse engineering the classic graphic-linguistic distinction to more fun-
damental perceptual principles, we have introduced a means to understand how
the distinction applies to sonic representations. The proposed model streamlines
definitions that distinguish diagrammatic from sentential structures: Text-sentences
comprise symbolically′ represented relations among symbolically′ represented ob-
jects, whereas diagrams are iconically′ represented relations among symbolically′
represented objects. A sonic diagram is thus conceptualized as iconically′ con-
veyed sonic relations among objects linguistically (symbolically′) conveyed via
text-to-speech.
This proposed model enables researchers and designers to generate testable pre-
dictions for converting visual graphics into non-visual perceptual modes (other
than the text-to-speech approach proposed by WCAG, which ignores the iconic
properties of graphics).
Acknowledgements
This research was supported in part by grants from the Centre for Innovation in
Data-Driven Design and the Graphics Animation and New Media Centre for Ex-
cellence. We would like to thank research assistant Damon Pfaff, Dr. David Stein-
man, the reviewers and the editors Piotr Konderak, Göran Sonesson, and Jordan
Zlatev. The reviewers’ and editors’ detailed feedback has been invaluable in making
this chapter a reality.
Part IV:
Language, blends and metaphors
Todd Oakley
Chapter 19
Deonstemic Modals in Legal Discourse:
The Cognitive Semiotics of Layered Actions
stitution of the Supreme Court of the United States of America (hereafter SCOTUS).
The history of SCOTUS is one of three branches of American government that, at
the time before the Civil War was considered the weakest of the three branches,
but, largely resulting from the slowly gaining momentum of judicial review es-
tablished in the case Marbury vs. Madison (1803), has come to be without argument
the strongest branch of the federal government. To exemplify this change, consider
the contrasting dispositions of presidents Andrew Jackson (1829–1837) and Dwight
D. Eisenhower (1953–1961). Jackson felt perfectly entitled to ignore SCOTUS’s in-
junction against the United States “Indian Removal Act” in Worcester vs. Georgia
(1832) while Eisenhower felt compelled to enforce their decision in Brown vs. Board
(1953) ending racial desegregation, even though it was politically the last thing he
wanted to do. Clinton (1989) provides an historical account of the Supreme Court’s
arrogation of judicial review, the powers to review and declare unconstitutional
statutes and orders originating from the executive and legislative branches of fed-
eral and state governments, as the sociological equivalent of incremental increases
of the Court’s power to push the recalcitrant and block the zealot; his narrative
recounts the morphing of the Court from an institutional featherweight contender
to heavyweight champion.
A study of the English modal verb must in the rigid institutional setting of
SCOTUS offers us a test case for cognitive semiotic research on language as an
institutional practice. After a brief discussion of modal verb types, I proceed to
discuss an inherently ambiguous instance of must that led me to posit the new,
institutionally specific, category of the deonstemic. I then present the findings of a
corpus analysis of must, must not, and mustn’t in 33 landmark majority opinions of
the Court. I identify 6 discourse layers in SCOTUS in relation to this newly identified
mode. On the basis of this model, I proceed to show that the institutionally specific
deonstemic mode arises when a statement’s meaning divides attention between the
impositional world-to-mind fit of the justices (layers 1 and 2) and the descriptive
mind-to-world fit constraints on their powers (layers 4–6).2
2 The notion of “direction of fit” between mind and world derives from the work of
Searle (1995, 2004), and is utilized in the model presented in Section 5.3.
Deonstemic Modals in Legal Discourse 301
3 (2´) releases the subject from an obligation rather than imposing an injunction on
him/her, thereby presuming a contrary state of affairs from (2). Nevertheless, the
focus of attention is on the deontic rather than epistemic power(s) of the speaker.
304 Todd Oakley
situation that renders (5´) and (6´) problematic. Thus, (5´) and (6´) can be rephrased
as follows.
(5´´) (It is logically necessary for me to consider) him to be at home (for his car is
parked in the driveway).
(6´´´) (It is logically necessary for me to consider) her to have been devastated (as a
result of the doctor’s diagnosis).
Hence, epistemic modality is explained as attributing reasons that are more often
than not made explicit, without, however, suggesting that the speaker has any ability
to affect the situation. The semantics of need highlight a potential deontic power of
the speaker to impose an obligation on the addressee.
Cases like (7), on the other hand, are ambiguous precisely because the reasons
are left unarticulated, inviting the audience to infer the reason, which is either based
on evidence or on deontic powers of the speaker. The use of the infinitive copula be,
however, harks back to prior evidence explicitly articulated in the discourse, which
is why such examples are “pure” instances of epistemic modality — their reasons
need to be made transparent, regardless of the speaker’s status.
Now, what about the proposed deonstemic modality? The necessity test high-
lights both the epistemic and deontic dimensions of the same utterance, as shown in
(8) and (8´). In the framework of Speech Act Theory (Searle 1969), court statements
have declarative force (because of its deontic powers), and therefore they have a
double direction of fit and causation: mind-to-world and world-to-mind. This being
the case, the Court considers that something needed to be considered as X causes
it to be of X-type, giving rise to (8´´) and (8´´´).
(8) Such a tax must be unconstitutional.
(8´) (Its is logically necessary for the Court to conclude) such a tax to be uncon-
stitutional.
(8´´) (By this statement, you Americans need to recognize) such a tax to be uncon-
stitutional.
(8´´´) (In America) Such a tax is unconstitutional.
Notice that in this analysis the epistemic force of the statement is blended with
the deontic powers derived from the Court. And the ontology described by (8´´´)
depends on the social ontology that the Court’s (rigid) deontology can create. In a
non-trivial sense, they are deonstemic.
3. C
ognitive semiotics and the dynamics of rigid
institutions
I wish to offer an initial study at this occasion of lining out the prospects and pro-
grams of cognitive semiotics of institutional discourse. That cognitive semiotics has
Deonstemic Modals in Legal Discourse 305
as one of its aims to try to account for the relation of language to its environment
of use provides leverage for better understanding of language as an institutional
phenomenon — p articularly written texts originating from institutions with rigid
protocols for their composition and dissemination. Such texts fit the prototype of
discourse that was described by the rhetorician Lloyd Bitzer in his famous essay,
“The Rhetorical Situation” (1968),4 and extends the range of linguist Herbert Clark’s
notion of layering (1996: 353–384). It is to these two notions that I now turn.
3.2. Layering
“People sometimes appear to say one thing when they are actually doing some-
thing quite different,” observes Clark in the opening sentence of the final chapter
of Using Language (1996: 353). This observation leads Clark to propose that the
discourses of fiction and other forms of pretense consist of multiple layers of joint
action and events, such that two children can be simultaneously digging in a back
yard in the present time (layer 1) and “prospecting for gold in the Dakota Terri-
tories circa 1876”, layer 2 (Clark 1996: 354–360). This joint pretense scenario has
two layers with distinct roles (prospectors, such as Wild Bill Hickok and Calamity
Jane) and values (a particular boy and girl), audiences or ratified participants (e.g.,
the boy can be the audience of the girl in layer 1; Wild Bill can be the audience of
Calamity Jane in layer 2), bystanders (e.g., the boy’s mother calling him to dinner
in layer 1, or the mother could “play along” and call Wild Bill to dinner, in which
case the command issued to Wild Bill in layer 2 imposes an obligation on the boy in
layer 1), constraints (e.g., language use in both layers is English but with contrasting
semantic, referential, and prosodic features endemic to layer 1 or layer 2), and joint
actions (e.g., digging up the flower bed in layer 1, while prospecting for gold in
layer 2). Clark’s (1996: 353–386) account of layering focuses almost exclusively on
situations of pretense in play, fiction, drama, irony and sarcasm, teasing, and other
rhetorical phenomena, but it does not cover rigid institutional discourses; his ac-
count leaves out explicit mention of audience/addressee types. A proper account of
the deonstemic modality comes sharply into focus when we examine these texts as
multilayered artifacts, with each layer potentiating actions of different addressees.
The upshot of my argument is that layering is essential to all textual ecosystems,
and built into the legal discourse of legal systems such as SCOTUS. This argument is,
in part, based on the intuition that a statement that is ambiguously deontic (impos-
ing) or epistemic (descriptive) is actually imposition at one layer of discourse and
descriptive at another; other contextual factors conspire to determine the attentional
salience of any illocutionary force of the utterance in question.
Layering is particularly critical for the interpretation of legal documents, with
SCOTUS opinions being perhaps the most elaborate, as they are the most influential.
Primary participants are to imagine events, actions, etc. at higher layers all the while
appreciating why the authors and actors at the primary level created them. In the case
of SCOTUS, one always has to appreciate the deontic power to perform declarative
speech acts, even in cases where those speech acts are themselves ambiguous. The
ensuing analysis can be regarded as a meshing of Bitzer’s elements of the rhetorical
situation with Clark’s account of layering and is illustrated in the next section.
I outline the classification criteria for each modal type in the sections below. To il-
lustrate each type, I present 8 examples, 1 from Marbury vs. Madison (1803), a decision
establishing the doctrine of judicial review, and 7 from Furman vs. Georgia (1972), a
decision placing a moratorium on the death penalty as cruel and unusual punishment
and eliminating the death penalty in cases of rape. (The first part of this decision was
overturned in Gregg vs. Georgia, 1976.) This choice is merely one of convenience, a
result of culling examples from a specific part of the master spreadsheet.
juries which have the actual responsibility — do not support the contention that
evolving standards of decency require total abolition of capital punishment.
Furman vs. Georgia, 408 U.S. 238 (1972)
5 Although traditional theories tend to underestimate the role the built environment
plays in scaffolding the “internal” mental operations that make such pretend play
possible.
310 Todd Oakley
1962), ecological psychology (Bateson 1972; Gibson 1979) and cultural-historical ac-
tivity theory (Vygotsky 1978; Wertsch 1985; Lave 1988), aligning them with cognitive
semiotics (cf. Zlatev 2015a).
The present model of discourse fits within this “cognitive ecology” tradition,
wherein symbolic environment is seen as scaffolding much of our thinking, com-
munication, and action. Take away the elaborate documentary modes of permanent
script, and the entire enterprise fails to function as a unit of cognition.
The multiplex layers of discourse manifest in any given SCOTUS opinion need
not be the property of any single justice (although something like it might very
well have been internalized by experienced practitioners), for it specifies the social
ontological properties of a textual ecosystem that constitutes jurisprudential reason-
ing. Once in place, the model at the very least offers heuristic value of revealing
the way language works to distribute attention and mental resources to different
facets of jurisprudence.
5.3.1 Layer 1: decision (rights). When an opinion is published, the press focuses on
its implications for future legislation and policy by executives. Thus, these are the
primary ratified audience, for they are the ones who can or are to be compelled to
effect change. Private interests may also be immediately affected, but this is only to
the extent that they are acting in anticipation of compelling force from legislators
by an elected executive (i.e., the President of the United States, governors, cabinet
level secretaries and agencies, law enforcement officials, and so on). Interested by-
standers include attorneys and other legal authorities, many of whom will serve as
ratified parties in future legal, legislative, policy, and academic contests. This layer
represents the naked power of the Court to mold social reality in a world-to-mind
direction of fit (Searle 2004: 118), such that the world comes to resemble the minds
of the justices. Temporally, layer 1 operates according to the present and future;
interpersonally, joint actions at this layer involve sitting Associate Justices and the
Chief Justice; linguistically, constructions operating unambiguously at this layer
include such commonplaces as the court rules that that this law violates the Equal
Protection Clause; this court; and no state can impose such a tax.
5.3.2 Layer 2: rationale (responsibility). Layer 2 focuses attention on the prudential
principle. The responsibility of the justices is to be reasonable and rational, for the
reputation of the Court is of abiding concern to all justices but especially for the
Chief Justice, whose name brands the Court during his tenure (e.g., the Warren
Court after Justice Earl Warren; the Roberts Court after the sitting chief justice, John
Roberts). The sense that other decisions, being that they are final, need to be given
due consideration and influence. The decision cannot come out of the blue, and
the ratified audience can potentially resist if the Court is seen as unreasonable and
overly cavalier. The latest ruling in the Affordable Care Act (a.k.a. Obamacare) in The
National Federation of Independent Businesses, Et. Al. vs. Sebelius, Secretary of Health
and Human Services, Et. Al. (2011) is interpreted as Chief Justice Roberts (the vote
312 Todd Oakley
that tipped the decision in Obama’s favor) recognizing that such a decision coming
in the wake of several conservative decisions that have broken with precedent is
likely to cast the Court as a rogue institution, ultimately undermining its authority in
the long-run. In short, layer 1 brings with it the discourse expectation that “what we
say goes”, while layer 2 highlights the fact that the power of the justices in layer 1 is
not absolute. Actors in layer one appreciate the obligation to reasonability specified
in layer 1 and are often entrained to imagine the consequences of abusing power.
At layer 2, the principal participants are the justices, but the ratified participants
include the general public and future general publics, thus explicit reference to the
Constitution, to the principle of stare decisis, and to other standards of reason are of
strategic importance at this layer. Temporally, the concern is preserving as much
continuity with the past as is morally and practically prudent, while at the same time
focusing attention on the long-term impact of the decision. Thus the temporal scope
is similar to layer 1, but perhaps with an extended scope of the distant future of the
Court. Linguistically, stock phrases like We must always remember that… are used as
admonishments to abide by certain rational and jurisprudential principles. At layer 2,
the discourse tends to focus on the possibility that the “final word” of the Court may
not be its “last words,” or that its “final words” can always become “unjust actions,”
as happened in the infamous Dredd Scott vs. Sandford (1857), regarded as the worst
opinion in the history of the Court and held liable for precipitating the Civil War.
5.3.3 Layer 3: enunciatory (address and assertion). Justices argue with one another.
This layer focuses on justices as personalities who differ, sometime very stridently
with the opinions of their contemporaries and predecessors. This layer dramatizes
the compelling nature of certain speech acts — often reluctantly. Deliberation and
disagreements are as uncomfortable and face-threatening as they are obligatory.
They ameliorate the face threatening dimensions of agonistic discourse. These in-
stances can be regarded as illocutionary force indicating devices (cf. Brown and
Levinson 1987). At this layer, the primary participants are the justices themselves,
each of whom is either speaking for himself, herself, or for the Court. Linguistically,
phrases such as I must admit/disagree/demur, etc., or It must be admitted that function
as pre-disagreements. The second example, which appears more about a half-dozen
times in the corpus fits with Sweetser’s (1990: 73) contention that enunciatory
modality keeps close company with epistemic modality.
5.3.4 Layer 4: epistemic. Justices are human beings, subject to the same powers of
reason as others, and they often must appeal to doxa (common opinion). Epistemic
modals that are unambiguous trade on this notion of inferred certainty common to
us all, especially in topics associated with common sense opinion. We take what is
certain as a basis for deliberating about topics of inherent uncertainty.
This layer legitimizes layers 1 and 2 by way of a fortiori reasoning; if the jus-
tice is acting reasonably by doxastic standards, then we must be confident that
he or she is being reasonable according to jurisprudential standards. To be sure,
this is a social ontological expectation, and virtual sparring over the justices’
own motives and competencies has become something just shy of contact sport
Deonstemic Modals in Legal Discourse 313
among American citizens and pundits. Skepticism aside, layer 4 trades on the
general appeal of reason and rationality as a basis for drawing conclusions. If a
justice is certain of something, we presume he or she is certain for a good reason
and thus is compelled by reason to draw a conclusion. Linguistically, stock epis-
temic constructions, such as I must conclude and It must have been the case, with
a temporal focus either on the present or on the present perfect, operate most
naturally at this layer.
5.3.5 Layer 5: deontic (order). Justices are also obliged to take courses of action that
meet standards of reasonability and rationality in the order of adjudication. Such
standards are often dictated by the order of adjudication followed by the lower
courts. Cases are complex with many dimensions and parts. The order of decision
cannot be seen as arbitrary but must flow from the logic of the case. Linguistically,
this layer is epitomized by such commonplace phraseology as We must now proceed
to decide the next question, and The court must subsequently decide that question only
after dispensing with this one. Temporally, this layer focuses attention on the here-
and-now of the opinion and the immediate future, exemplifying a genre-specific
type of discourse deixis.
5.3.6 Layer 6: narrative. The very reason for the Court’s existence is the fact that
some event or set of events happened in the past. There is a plaintiff and defen
dant, two parties in conflict. In most instances (except in the very small range of
case types in which the Court has original jurisdiction, such as treaties), cases
before SCOTUS were adjudicated by lower courts with original jurisdiction and
then subsequently by the lower appellate courts. With rare exception, plaintiffs
and defendants have sought remedy in other jurisdictions, with this as the final
contest. Thus, William Henry Furman can seek remedy from the State of Georgia
and the United States government can establish a National Bank free from excise
taxation by the State of Maryland. Linguistically, SCOTUS opinions are replete
with narrative of past events pertaining to the case and of its past proceedings of
the lower courts. In fact, for many opinions a majority of textual “real estate” is
given over to description of past events, actions, and states of affairs, all of which
provide the basis for the Court’s verdict. Layer 6 is perhaps the most voluminous,
but narrative particulars are rarely a substantial part of the quoted material in
other SCOTUS opinions. (Quoted material from Marbury vs. Madison (1803) The
most cited opinion, consists of a mere six sentences, none of which pertain to the
case itself (Oakley and Tobin 2014).
gives considerable weight to the epistemic layer (hence the capital markings of
layers 1 and highlighted marking of 2 and 4 in Figure 2).
(17) This tax must be unconstitutional.
Imagine the alternative construals in (17a) and (17b). With (17a), we get a very dif-
ferent implication, with attention focusing heavily on layer 4 and (perhaps) layer 2.
The illocutionary force of this utterance would be epistemic and descriptive, as if
the Court were merely categorizing past laws with little regard for subsequent law,
see Figure 3.
(17a) This tax must have been unconstitutional.
(17b) This tax needs to have been constitutional.
Applying the necessity test to (17a) yields the awkward (but still grammatical) con-
strual of (17b) (which requires change to the final adjective to make it coherent).
While (17a) focuses attention on layer 4, (17b) refocuses attention back to layer 1,
constituting an authoritative rebuke of a past legislative deed. This version might
even lead addressees to expect the court to issue a specific remedy, such as “But
the tax is clearly repugnant to the Constitution;6 thus, the State of Maryland must
never have enacted such an law in the first place.”
6 The phrase “repugnant to the Constitution” is a favorite of John Marshall, Chief Jus-
tice and author of this opinion, and appears in many of his most famous opinions.
Deonstemic Modals in Legal Discourse 315
We may thus conclude that the notion of a deonstemic emerges from the focal at-
tention to layers 1 and 2, two layers not fully present (or not in the same way) in
other governmental institutions in the United States. What is more, this is an in-
stitution with a rich textual archive constantly in play, but they are texts with rigid
procedures for their composition and dissemination. Thus, an appreciation of the
meaning of modal verbs, particularly the most forceful of modal verbs, must, helps
us understand the dynamics of force and counterforce at the interface of semiotics
and the social world. The pragmatic function of deonstemic modality is to package a
pronouncement that conforms to the epistemological expectations at layer 2, so as
to ameliorate the patina of the bald exercise of power. One must both appreciate the
forcefulness of the pronouncement and appreciate the reasonableness and prudence
thereof, even though, at base, (layer 1) the social ontology of SCOTUS boils down
to “because we say so”!
6. Conclusion
From a linguistic perspective, Sweetser (1990) correctly concludes that “modals can
be used either to impose or to describe (report) in both the content and speech-act
domains but can only describe in the epistemic domain” (ibid: 73). From a cognitive
semiotic perspective, Sweetser’s conclusion is not fully adequate. True, epistemics
describe while deontics impose, but deonstemics both impose and describe, and such
instances only become apparent in specific types of rigid institutions.
An analysis of 33 of the most influential Supreme Court opinions suggests that there
is a language-specific category of deonstemic modality for must whose purpose is to
blend our obligation with reason, and that such “modal blends” are critical for under-
standing how certain institutions build and maintain trust and predictability among
316 Todd Oakley
Acknowledgements
In addition to the two anonymous reviewers, the author thanks the following col-
leagues for their thoughtful comments and advice: Mihailo Antović, Line Brandt,
Per Aage Brandt, Peer Bundgaard, Seana Coulson, Anders Hougaard, Vladimir Figir,
Esther Pascual, Frederik Stjernfeldt, Eve Sweetser, Vera Tobin, Mark Turner, Đorđe
Vidanović, Jordan Zlatev, and Svend Østergaard.
7 The Court’s reputation as a reasonable body is never uniform, and its reputation has
suffered in recent years, but not so much so that the other branches of government,
national or local, can feel confident in explicitly refusing to obey its edicts, as An-
drew Jackson did in 1832.
Vlado Sušac
Chapter 20
Commutation of Cognitive Source
Domains as a Semiotic Tool for
Paradigmatic Analysis
1. Introduction
The theoretical postulate of this chapter is based on an attempt to unify the tradi-
tional structuralist approach in semiotics and the cognitive theory of conceptual
metaphor. The former is based on the definition of sign as a connection between the
signifier and the signified, realized through the richness of syntagmatic and para-
digmatic relations and possibilities within the message (Saussure 1916). The latter
represents a relatively new theory of metaphor, which at the end of the last century
transferred this seemingly trivial linguistic phenomenon from the field of stylistics
into the area of cognition and human ways of thinking (Lakoff and Johnson 1980,
1999) My aim is to test the possibility of replacing certain dominant metaphorical
concepts, primarily in political discourse, with alternative ones. It does not mean
giving preference to one type of conceptualization over another (e.g. viewing in-
stitutions or political organizations as PLANTS instead of MACHINES), but examin-
ing the conditions and mechanisms under which such exchanges are possible, if
possible at all. A precondition for such a possibility is the formal overlapping of
the target metaphorical domains as the first step of analysis. Smaller or more sig-
nificant differences in connotative meanings may be further evaluated by placing
lexical realizations deriving from such an alternative metaphorical concept into
the original context. Consequently, the contextualization of signs and their related
concepts inevitably introduces corpus analysis as an additional method of research,
which implies the usage of formerly published papers and classifications of political
discourse according to its source and target metaphorical domains.
Due to the complex structure of such an approach and in order to facilitate the
understanding of the theoretical assumptions it relies on, the argumentation is struc-
tured as follows. Section 2 summarizes the basics of the paradigmatic analysis with
an emphasis on commutation tests and pertaining semantic implications. Section 3
presents the theory of conceptual metaphors, primarily developed within cognitive
linguistics and later applied to cognitive semiotics through the shift from verbal to
pictorial signifiers, conceptually structured in a similar way. In doing so, special
attention is paid to metaphorical interactions, i.e. the possibility of diversification
and multivalency of conceptual domains, which opens the possibility of alternating
various tenors and vehicles. This is followed by the presentation of the research
corpus in Section 4, which provides the basis for the conceptual classification of
318 Vlado Sušac
2. Paradigmatic relations
Semiotic analysis as we know it today has had its own course of development,
scattered in many directions from purely structural to social semiotics. Still, we
cannot avoid the fact that its origins, at least in the European tradition, stem from
the Swiss linguist Ferdinand de Saussure, who suggested semiotics as a general
theory applicable to all sign systems. It was exactly the concept of system that
Saussure gave not only to linguistics, on which he based the majority of his theor-
etical assumptions, but also to other scientific disciplines. Structuralism in itself
involves two fundamental types of relationships between signs, which make the
sign itself a system of differences. These are syntagmatic and associative relations
(Saussure 1959: 123), the latter prevailingly known today as paradigmatic under
the influence of the Prague School (Vachek 1966). The first are also called relations
in praesentia because they include the basic characteristic of the combination of
elements present in the statement, while others are called relations in absentia
because they refer to the possibilities of a selection of elements from the system
and their incorporation into the syntagm. Although this is a logocentric approach,
since Saussure recognizes such relations as the essence of langue (language as a
system of signs), some semioticians have argued that it is applicable to other sign
systems (cf. Barthes 1967; Eco 1976). Syntagmatic relations are mostly recognized
and observed through spatiotemporal combinations, either in narrative procedures
in film and literature or in the relations of the peripheral and central in images or
in any other possible combinations applicable to the selected media, which could
have an effect on a certain shift in meaning (Chandler 2007). It is necessary to note
that syntagmatic relations do not necessarily have universal characteristics since
1 Ideology as a term, given its vagueness, can have numerous interpretations and any
attempt to elaborate them would exceed the limits of a single article and the purpose
of this work. Therefore, the term is used in its rather broad sense corresponding to
the definitions offered by Van Dijk (1998: 8) as “the basis of the social representations
shared by members of a group” and by Hodge and Kress (1988: 6) as “a level of social
meaning with distinctive functions, orientations and content for a social class or a
group”.
Commutation of Cognitive Source Domains 319
many of them are culturally conditioned, e.g. reading direction from left to right or
vice versa, or from the top down, etc. (Kress and Van Leeuwen 1996).
The paradigmatic relations are much more complex because the vertical axis of
selection includes significantly more choices. Structural semiotics has borrowed
from linguistics two fundamental procedures of paradigmatic analysis, namely,
binary oppositions and the commutation test. In terms of methodology, these pro-
cedures allow the examination of the impact on the meaning by replacing one
signifier with another. The paradigmatic axis of selection is handled by a very wide
range of possible and alternative signifiers with respect to gender, age, class, race,
ethnicity, and so on. The commutation test, however, is not limited to paradigmatic
analysis but may also be applicable to syntagmatic relations through the procedur-
es of addition and deletion of existing elements in the semiotically observed text
(Chandler 2007: 90).2
Particularly interesting is the relation between marked and unmarked forms
as part of paradigmatic analysis (Jakobson and Waugh 1979: 92). Its purely lin-
guistic application has spread to semiotics, whereby it is taken into account that
unmarked forms are the most common and dominant and therefore less visible as
an expression of dominant values within a particular culture. This is largely present
in the semiotic analysis of pictorial advertisements, where, for example, we can
test the connotations produced by replacing a female with a male person signifier,
young with old, etc. Especially provoking are the cases when an unmarked sign is
replaced with a marked sign, given the fact that the former is less noticeable and
therefore reveals the dominant values of a particular culture. In other words, very
often by questioning such forms it is possible to reveal particular myths within the
observed community that consider such signs unquestionable and thus “natural”,
which leads us to the most concise definition of myth as “naturalization of the
cultural” (Barthes 1991: 128).
Since, as already explained, the foundation of structural semiotic analysis is
based on replacing one signifier with another, and even one type of media with
another, which verifies the impact on the meaning of such alternative forms, it is
legitimate to ask whether the same procedure can be applied to the replacement of
one metaphorical vehicle or one metaphorical concept with another, that lead to
an approximately similar meaning. The starting point for this procedure we find
within the framework of Conceptual Metaphor Theory (CMT), which is applicable
to both linguistic and visual signs, where the latter can also be subjected to various
commutation tests or binary oppositions (male – female, black – white, etc), This
process is particularly known in the field of advertising where visual metaphors
frequently replace or supplement the verbal ones by applying the same process of
mapping from source to target domains (cf. Forceville 1996). The basic features of
CMT are presented in the next section, where we argue that the mentioned pro-
2 Sonesson (2009: 44) notes that commutation as a procedure in linguistic analysis can
be seen as a special case of the operation of ideation in Husserlian phenomenology.
320 Vlado Sušac
3 Linguistic relativity is a commonly used term that refers to the “weaker” form of
the so-called Sapir-Whorf hypothesis. There is growing consensus nowadays that
language does not determine, but can have impact on thoughts and cognition (cf.
Casasanto 2008).
4 For other references see the special issue on CMT in Fussarolli and Morgagni, eds.
(2013). For the overview and response to some critical views of CMT (Zinken, Ra-
kova, Gevaert, etc.) concerning methodology, direction of analysis, schematicity of
metaphor, embodiment, and the relationship between metaphor and culture, see
Kövecses (2008). Further useful insight into the CMT criticism is provided by Gibbs
(2013).
Commutation of Cognitive Source Domains 321
is the idea of HEAT, which in this case is understood as the source domain that will,
according to the original Greek meaning of metaphor, transfer to us their common
concept or target domain of EMOTION. Therefore, EMOTION IS HEAT is the label
of the conceptual metaphor which consists of the metaphorical mapping occurring
between a source and a target domain.
This rather simplified example still offers clear evidence how metaphors are
structured and how deeply they are rooted in our everyday language and our con-
ceptualization of the world. It also implies one of the principal questions asked here:
can the paradigmatic selection of metaphors that we use have an impact on the way
we understand the reality that surrounds us? If it can, how does it work? In order to
answer such questions, some interactive metaphor relations have to be explained.
The study of metaphor has shown that a metaphorical meaning is not limited by
the relation of mapping established between exclusively one source and one target
domain, but instead we deal with many complex relations resulting from the inter-
action of several different target and source domains. In this respect, the most
challenging inter-relations are those of multivalency and diversification (Goatly
2007: 12). Multivalency is described as the case when the same source domain is
applied to various target domains, as in examples (4) and (5) in Croatian.
(4) Dubrovnik je vrhunska destinacija (GOOD IS HIGH)
‘Dubrovnik is a top destination’
(5) Cijene nekretnina u Dubrovniku su otišle u nebo (MORE IS HIGH)
‘The prices of property in Dubrovnik are sky high’
Since these different targets GOOD and MORE share a multivalent source, they
may become associated into an equation MORE = GOOD. This concept, deeply
rooted in the Western economy and culture, reinforces patterns of excessive wealth
accumulation and consumption as part of the society ethic. In the opposite sense,
it is worth remembering the different religious attitudes that promote economic
modesty as a moral imperative (ibid: 168).5 Another pair of multivalent sources is
given in examples (6–7). This might suggest the equation CHANGE = SUCCESS,
which, according to Goatly (ibid: 173) is a pattern of consumer behaviour widely
recognized in Western culture, especially in terms of buying the latest and most
fashionable products even if we don’t need them, inducing us to frequently change
clothes, cars, mobile phones, etc.
(6) Attitudes to women shifted in the 19th century. (CHANGE IS PATH)
(7) She is intelligent and hardworking; I’m sure she’ll go a long way.
(SUCCESS IS PATH)
5 Of course, this equation explains only one particular aspect of culture concerning
the acquisition of wealth, while in many other fields of experience it would not be
appropriate. Example (5) can also be relativised when viewed from the position of
the seller or the buyer of property.
322 Vlado Sušac
For the objective of this chapter, more relevant is the case of diversification, the
opposite to multivalency, where different source domains lead to one target domain,
as shown in (8–10; 11–13).
(8) He is one of the few clean politicians. GOOD IS CLEAN
(9) It was an immaculate performance.
(10) She had a spotless reputation.
(11) I’m your number one fan. GOOD IS FIRST
(12) Saving children always takes priority over adults.
(13) She’s one of the foremost paediatric experts.
These examples of diversification are randomly selected from Andrew Goatly’s
Metalude database of metaphors accessible online and developed as a project of
contrastive analysis of English and Chinese conceptual systems. At first glance
we can realize that although two groups of metaphorical phrases share the same
target domain of GOOD, it is hardly possible to switch the lexical item from dif-
ferent source domains and use them in a different context. Still, in the afterword
to Goatly’s book Washing the Brain – Metaphor and Hidden Ideology (Goatly 2007),
Zoltan Kövecses suggests that “we need to uncover these ideologically-loaded
metaphors and look for alternative ones”. At the time, Kövecses had already been
aware of the phenomenon of conceptual diversification, particularly having in
mind the differences in conceptualizations between various cultures (Kövecses
2005: 83). However, if such alternative conceptualizations are indeed possible, it
is worth examining the conditions that satisfy such a possibility. After all, the
examples from (8–10) cannot be exchanged with the examples from (11–13) in
the registered contexts (e.g. ??“I am your clean fan”), although they may formally
share the same target domain GOOD.
The database itself offers various possibilities of corpus analysis and firm foun-
dations for either semasiological or onomasiological classification of metaphors
or, in other words, from expression to concept (semasiology) or from concept to
expression (onomasiology). The search engine provides manifold ways of research,
either by typing in lexical items, which are grouped under all the source domains
they appear in, or it is possible to insert a complete “root analogy”, which leads to
a whole list of language metaphors that correspond to the given mapping. It is also
possible to search separately for either source or target domains and the software
will offer a list of the corresponding mappings for the given domain, with all the
language metaphors they are derived from. The first step in our research was to
check the existence of diversification cases (one target and multiple source domains)
in the database; we managed to identify a list of conceptual pairs that share the same
source domain, as shown in Table 2 based on English examples.
Table 2. A
list of conceptual diversification mappings extracted from the Metalude database.
Source: Metalude: Metaphor at Lingnan University, Department of English, with
the permission of A. Goatly.
Further, we needed to choose the most appropriate conceptual pairs from the list
in order to test the possibilities of alternating two or more source domains for the
same target domain. We decided to choose and test the most frequent conceptual
pairs in the political discourse assuming that such a corpus is highly ideologically
motivated and therefore fits into the aforementioned idea of Kövecses’ concerning
“ideologically-motivated metaphors”. For that matter we used previous research
on political discourse in Croatia carried out through the corpus analysis of four
election campaigns registered in the Internet editions of three daily newspapers
in Croatia.6 The objective of the research (Sušac 2007) was bidirectional. Beside
the aforementioned test of the most dominant metaphorical concepts in political
campaigns, it also aimed to contrast the conceptual systems of English and Croatian.
It transpired that all the English conceptual pairs presented in the Metalude database
can also be found in the Croatian language, but not necessarily with the equivalent
lexical items.7 Therefore, the English database could be combined with examples
from the Croatian corpus in order to check for cases of metaphor diversification.
The corpus was strictly reduced to all the quoted speeches and utterances of
politicians involved in the campaign and excluded journalists’ comments. It spanned
a period of five years in the previous decade and more than one thousand extracted
examples have shown that there is evidence of significant inclination of conservative
and liberal parties towards different types of metaphors.
It turned out that two domains FIGHTING and PATH significantly prevail over other
concepts. Further analysis showed that the former seems to be visibly preferred by the
leading conservative party HDZ (Croatian Democratic Union), unlike the concept of
PATH preferred by the leading social-democrat party SDP (Social Democratic Party).
The latter is especially emphasized by the fact that the leading conservative party
dominantly outnumbers social-democrats in its overall number of metaphors used in the
election campaigns, and that the metaphors of PATH used by the social-democrats are
the only ones that outnumber any other type of metaphors used by the conservatives.8
The next step was to identify which target domains from the Metalude database
are shared by the previously mentioned most dominant source domains of FIGHT-
ING and PATH in the political discourse of Croatia. It turned out that the target
domain ACTIVITY was the only case of diversification. This is a rather general
notion, but within the corpus of analysis of political discourse it can be specified as
POLITICAL ACTIVITY, with all its objectives and instruments.
Further analysis was focused on the language metaphors or the lexical items deriving
from such concepts and being used in particular contexts. The objective was to discover
whether certain lexical items belonging to one source domain would fit into a different
metaphorical context that shares the same target domain or the idea conveyed through
the metaphor. It was found that such alterations were possible in a rather limited number
of cases that can be considered cases of contextual diversification (Table 3). Here, the
lexical items share the metaphorical meaning, unlike the cases where the same target
domain is just formally shared by two different source domains, but with a mismatch in
the metaphorical meaning of lexical items. Such cases of formal diversification are shown
in Table 4. Here, although ACTIVITY is the common target domain, the metaphorical
meaning is significantly different (reaction vs. development) and therefore it would be
impossible to switch lexical items from different source domains.
8 By coincidence or not, the previous name of the Social Democratic Party was the
Party of Democratic Change, and considering the frequently present multivalency
example CHANGE IS PATH, a possible pattern may be found that will support the
assumption of ideologically grounded metaphors (Sušac 2010: 563)
326 Vlado Sušac
In the case of contextual diversification (Table 3), it is possible to replace the lexical
item bash on, which belongs to the source domain of FIGHTING, with the lexical
item keep on trucking from the source domain of PATH. The reason why it is possible
is given in the column of metaphorical meaning, with emphasis on the idea of con-
tinuation shared by the two domains. This idea is definitely due to the grammatical
construction VERB + on present in both cases, conveying the aspect of an action in
progress. However, its metaphorical meaning is owed not to the preposition but the
verbs bashing and trucking belonging to two different conceptual domains. It is also
worth noting that the syntax of any language would rarely allow diversification by
simply replacing a lexical item from one domain with the lexical item from another
domain without some kind of syntagmatic adaptation. Of course, paradigmatic
selection of other lexical items is also possible as long as they contain or convey
the idea shared by the two source domains.
It is clear that such “conceptual synonymy” is not absolute, as there is no absolute
synonymy in language either, but the example clearly shows that the commutation
of source domains is possible, with all further semiotic connotations this may lead
to. The overlapping in metaphorical meaning within contextual diversification is
possible because of the metaphorical ground shared by the two domains in the par-
ticular context, which, as shown in Table 4, is not always the case.
The notion of ground belongs to the basic theory of metaphors introduced by
I. A. Richards (1971), where there is a distinction between tenor as the object that
the metaphoric word or phrase refers to, vehicle as the metaphoric word or phrase
and finally ground as the quality that one refers to when using a particular vehicle
in relation to the tenor. This rather rhetorical approach reduced to lexical items can
be easily transferred into the cognitive sphere of concepts, where the vehicles lead
to source and the tenors to target domains. Ground corresponds to the metaphorical
meaning and in case of diversification to those parts of qualities shared by two or
Commutation of Cognitive Source Domains 327
more source domains and one target domain, which in the example above is the idea
of continuation. Graphically, this may be represented as in Figure 1.
Figure 1. A
graphical representation of the metaphorical ground shared by one target (T)
and two source (S) domains
The corpus offers a number of cases with other conceptual grounds shared by the
observed source domains of FIGHTING and PATH. They are: difficulty, give up/
end/finish, start/begin, change, achieve, doubt/confusion, etc. These shared grounds
(as much as conceptual pairs) are present in both languages, English and Croatian,
and therefore it was possible to apply the results from the Croatian corpus to the
database for English. This was done not just for the purpose of text clarity and the
accessibility of the Metalude database, but to show that this kind of procedure can
be applicable to other languages as well.
These cases largely confirm the initial assumption that commutation of meta-
phorical source domains is possible as a semiotic tool for paradigmatic analysis
when, of course, certain preconditions are satisfied. In semiotic terms, it implies a
principal question about the effect or consequence of such alterations on the sig-
nifieds. In political discourse we can ask the same question; what is accomplished by
replacing the dominant source domain of PATH with FIGHTING (and vice versa)?
Let us consider the following examples in Croatian, and their English translations.
(14) Dobili smo najteže bitke u razvoju naše zemlje.
‘We have won the severest battles in the development of our country.’
(15) Prošli smo najteže prepreke na našem putu u EU.
‘We have overcome the most difficult obstacles on our way to the EU.’
Obviously, example (14) belongs to the conceptual mapping POLITICAL ACTIVITY
IS FIGHTING and (15) fits into the category POLITICAL ACTIVITY IS PATH, so
formally, we can recognize the case of diversification. The possibility for contextual
328 Vlado Sušac
5. Conclusions
The main objective of the research described in this chapter was to investigate
whether the commutation of signifiers, as a research procedure that primarily be-
longs to structural semiotics, can be applied to the field of conceptual metaphor.
Diversification, as one of many interplays of metaphor, was shown to be applicable
by extending the commutation test from simple signifiers to whole metaphorical
source domains. The research corpus of political discourse was appropriate for iden-
tifying two kinds of diversification, formal and contextual, where only the latter
provides a good basis for paradigmatic analysis. Contextual diversification can take
place only when metaphorical realizations deriving from two source domains can
be re-contextualized, but retain the generally intended meaning, as in (14+16) and
(15+17). For this, the two domains must share a conceptual ground. The question
of consequences of such alterations, whether they are ideologically grounded or
merely a matter of style, still remains open. However, the results of the political
discourse corpus analysis are rather indicative, given the clearly shown preference
of opposed political groups towards different conceptualizations of their political
objectives.
In conclusion, even if we do not fully support the idea of the need for “conceptual
correctness” so that we may get rid of “ideologically loaded metaphors”, it is still
worth knowing that there is the possibility of choice in creating different metaphori-
cal construals for the same communicative purpose, at least for the sake of style.
Maíra Avelar
Chapter 21
The Emergence of Multimodal Metaphors in
Brazilian Political-electoral Debates
1. Introduction
As there is a lack of studies on multimodality applied to the political domain, this
study analyzes the metaphors that gradually emerged in the political-electoral
debates between Brazilian presidential candidates that took place in 2010 and
2014, taking into consideration two semiotic resources: speech, belonging to the
auditory modality, and gestures, belonging to the visual modality. To analyze the
selected scenes from the debates, an adapted version of a cognitive semiotic model
(Brandt and Brandt 2005) was chosen, as it encompasses the enunciative scenario,
allowing for a dynamic analysis of the metaphors used by the candidates and,
most importantly, allowing for a detailed and integrated analysis of the gestural
content together with the linguistic content.
Based on Turner’s (2007) assumptions, the main hypothesis that guides this
research is that the more entrenched the metaphorical expression in our conceptual
system is, the harder it is to recognize such an expression as metaphorical. When
approaching the difference between literal and metaphorical meaning, Turner
states that these are not “cognitive operations [that are] fundamentally different”
(Turner 2007: 1). Rather, the judgment of an expression as literal or metaphorical
is intimately related to the degree of productive entrenchment – or, in Müller’s
(2009) terminology, to the degree of conventionality – of a conceptual connection.
Thus, the higher the degree of productive entrenchment/conventionality – the
lower the chance that an expression will be judged as metaphorical.
Regarding the meaning of political discourse and, more specifically, the ar-
gument developed in a political-electoral debate, it is relevant to highlight the
importance of compression processes (Fauconnier and Turner 2002). One of the
central benefits of such processes is their capacity to provide compressions of
diffuse event ranges to the human level. According to Conceptual Blending
Theory (Fauconnier and Turner 2002), mental spaces and the connections among
them are established because compression (and decompression) can provide a
global view and a human-level understanding of conceptual relations. Abstract
concepts such as space, time, cause, effect, identity, and change emerge through
blending. These are called vital relations, maximized and intensified by com-
pression. Taking the political debates into account, extremely complex political
matters (related to the conditions of public administration life, work, economy
332 Maíra Avelar
2. Th
eoretical Background: Metaphor theories and
multimodality
In this section, we present our adapted version of Brandt’s (2005) model, called
the Multimodal Semiotic Blending (MSB) model (Miranda and Mendes 2014) in the
context of better known theories of metaphor. We suggest that our model allows a
more detailed discussion of gesture analyses, bringing in categories such as multi-
modal metaphoricity in speech and gesture compounds (Müller and Cienki 2009),
gesture excursion (Kendon 2004), and the compression in speech turns (Hougaard
and Hougaard 2008).
As is well known, Conceptual Metaphor Theory (CMT) asserts that “metaphor
is pervasive in everyday life, not just in language, but in thought and in action.
Our ordinary conceptual system, in terms of which we both think and act, is
fundamentally metaphorical in nature” (Lakoff and Johnson 1980: 3). Further, con-
ceptual metaphors are crucial for the construction of abstract concepts, as their
target domain is anchored in the inferential patterns made from sensorimotor
experience (Johnson 2007). Inference is, therefore, “the heart of metaphor” (Lakoff
and Johnson 1980: 245). Each conceptual metaphor consists of the systematic
mapping of entities and relations from a sensorimotor source domain to a more
abstract target domain.
There are many examples that illustrate how metaphorical mapping works. For
instance, the conceptual metaphor intentional activities are journeys under-
lying example (1), presents a conceptual mapping from the spatial physical move-
Multimodal Metaphors in Brazilian Political Debates 333
The schema clarifies the directionality of the metaphor from source to target do-
main, and can be interpreted as the understanding of one concept (the more abstract)
in terms of another (the more concrete). This means concrete experiences can be
extended, through conceptual metaphors, to characterize non-spatial experiences
such as “falling into a depression” (cf. Rohrer 2007).
While CMT is a very powerful and influential theory, two main critiques
have been addressed to it, among others by authors who work on multimodality
(e.g. Müller and Cienki 2009): (a) the proposed metaphorical mapping is too
static; (b) examples are created by the analysts, instead of being collected in real
contexts of interaction. Such factors provoked the development of a theoretical
framework that brings more dynamicity to the metaphorical mappings: Con-
ceptual Blending Theory (Fauconnier and Turner 2002). Conceptual blending
(or integration) is assumed to be a basic mental operation that leads to global
insights and conceptual compressions that are useful to memory, as it organizes
diffuse ranges of meaning. It is a dynamic cognitive process that has a set of con-
stitutive principles: (i) a partial cross-spatial mapping that links some counter-
parts to the input mental spaces; (ii) a generic mental space that is mapped in
each of the input spaces and has the common elements of these input spaces;
(iii) the blending space; and (iv) a selective projection from the input spaces to
the blending space. The essence of the blending operation is the construction of
a partial correspondence between the two input spaces that will be selectively
projected in the blended space (see Figure 2).
334 Maíra Avelar
The blending products that are interpreted as literal at a specific domain can be
interpreted as metaphorical in another. As pointed out in the introduction, this dep-
ends on the degree of productive entrenchment, or conventionality of the structure
(Turner 2007). For example, in (2), used by a politician to undermine his opponent’s
argument, the conventional metaphor knowing is seeing is used, which is strongly
entrenched and the expression is not (typically) recognized as metaphoric. On the
other hand, in the utterance (3), winning ticket, conventionally related to the
lottery domain, emerges as a novel metaphor, related to the oil reserves domain,
and is easily recognized as such (Miranda and Mendes 2014). Consequently, meta-
phors are seen as emerging from a gradual process: expressions are interpreted as
more or less metaphorical depending on their degree of productive entrenchment,
or conventionality.
(2) As everyone can see, she couldn’t prove her point.
(3) He is going to hand over the winning ticket to foreign private companies.
An extension of this model was proposed by Brandt and Brandt (2005), which makes
it possible to analyze how subjects, in situated interactions, perform blending by
projecting an architecture of spaces (cognitive frames) on top of a semiotic base space
that can be understood as the participants’ shared representation of the situation of
communication. This semiotic base space is fundamental for meaning construction.
Some modifications have been made to the architecture of the spaces originally
presented by Brandt and Brandt (2005), mainly in the semiotic base space, which
was reformulated as a grounded space (Oakley 2009). Finally, to better suit our
analysis of multimodal data, we have proposed that the presentation space should
Multimodal Metaphors in Brazilian Political Debates 335
be unfolded into two kinds of signifiers: (a) gestural and prosodic resources and
(b) linguistic resources (Miranda and Mendes 2014).
In the final modified version (Figure 3), the architecture of the spaces consists of:
(i) grounded semiotic space (which unfolds into three spheres: semiosis as an instance
of language acts performed by the interlocutors; the communication situation, in
which the participants of the interaction are found; and the wider phenomenological
world, accessible to our experience); (ii) input spaces, called presentation spaces
(textual instance), which unfold into two dimensions: gestural/ prosodic resources
and linguistic resources, and reference space (object instance); (iii) virtual space
(blending), projected from the scenario element (frame) selection of the last two
spaces; and (iv) relevance space, which guides the emerging meaning of the virtual
space. This architecture of spaces is entitled Multimodal Semiotic Blending (MSB).
utterance. Consequently, the presentation space results in the merge of both what
is said and how it is said by the participants of the enunciative scene. The highlight
of the prosodic and gestural features is a relevant tool to analyze the specificities
of these bodily resources and their relation to the linguistic content, which can be
seen in the applications of the model described in Section 4. Before we proceed, we
should note that rather than speaking of “conceptual metaphors”, in terms of (static)
mapping across domains, the concept of metaphoricity highlights the emergence
of metaphor in interactions: metaphorical elaborations can be triggered in several
modalities and successively over time. The analysis of syntactic, semantic, prosodic,
and gestural contexts reveals that metaphoricity is a dynamic property, as will be
explicated in the empirical study described in the remaining parts of this chapter.
3. Methodology
3.1. Analytical categories
To analyze the gradual emergence of metaphors in a political debate, we selected
the following specific categories related to multimodality.
3.1.2. M
ultimodal metaphoricity in verbal-gestural compounds
In the context of multimodal metaphoricity, it is possible to describe two patterns
of the relation between gestures and speech (Mueller and Cienki 2009: 307):
• It is possible to find the same source and the same target in different modalities. In
these cases, the gesture embodies the source-domain of the verbal metaphorical
expression, indicating that the metaphoricity of that expression was activated
or was in the foreground of the speaker’s attention.
• It is also possible to find source and target in different modalities. In these cases,
we find a gestural metaphorical expression, together with a target that is ver-
balized in a non-metaphorical manner.
3.2. Material
The material analyzed includes scenes from two second-round debates exhibited
in 2010 and 2014 by Record, a Brazilian TV channel and collected from Youtube.
The generic structure of both debates consists of questions (45 seconds in length),
answers (2 minutes in length), replies (1 minute in length), and responses (1 minute
in length). We focus on four examples: two answers from Dilma Rousseff, from the
PT (Labor Party), one answer from the 2010 debate and one from the 2014 debate;
one answer from José Serra, from the PSDB (Brazilian Social Democracy Party),
from the 2010 debate; and one reply from Aécio Neves, also from the PSDB, from
the 2014 debate.
What the chosen sequences have in common is the fact that controversial issues
were raised by the candidates, especially when addressing the proposals made by
their opponents or by their opponent’s party. In the four sequences, the candidates
denounce contradictory moral stances assumed by the opposition. The controversial
issues raised in the 2010 debate demonstrated, at least in the discourse, the ideo-
logical differences between the PT (a left-wing party) and the PSDB (a right-wing
party). On the other hand, in the 2014 debate, Aécio, from the PSDB, presents himself
as a candidate from the opposition and declares that he was not a candidate from
a political party, but the candidate that embodies change. Dilma, at this time the
incumbent candidate, reinforces the social changes promoted by the PT and clearly
asks for the elector’s vote.
338 Maíra Avelar
4. Analysis
4.1. Dilma’s speech from the 2010 debate (Sequence #1)
The first sample was extracted from an answer given by Dilma about public security.
The candidate discusses the new oil reserves, and only in the end discusses public
security. Considering our coding methods, the gestural sequence is described and
analyzed as shown in Figure 4.1
Figure 4. Gestures and descriptions of Dilma’s speech from 2010 (Sequence #1)
1 Eu não posso conceber que alguém queira pegar um bilhete premiado. Tem dois bilhetes:
um num é premiado e o outro é. Entregar o bilhete premiado, do qual nós podemos ter
Multimodal Metaphors in Brazilian Political Debates 339
Hand over the winning ticket, from which we can win our passport for the future, to
foreign private companies
Closed right hand, folded fingers and half-stretched arm to the front
In this sample, it is possible to identify a Gesture Unit (GU) made up of four Ges-
ture Phrases (GP): when referring to pré-sal (the new deep-sea oil reserves), Dilma
depicts them as a bilhete premiado (‘winning ticket’) and stages the first gesture
of pegar o bilhete premiado (‘taking the winning ticket’). Through the use of this
metaphor, the candidate compares the recently discovered deep-sea oil reserves
to a winning ticket, explaining to the viewer that tem dois bilhetes: um num é pre-
miado e o outro é (‘there are two tickets: one isn’t a winning ticket and the other is’),
gesturally staging two objects that occupy opposite positions in space, which cor-
respond to the second and third gestures. She also uses the verbal-gestural metaphor
entregar o bilhete premiado (‘hand over the winning ticket’), which corresponds to
the privatization of the deep-sea oil reserves. The gesture that accompanies entre-
gar (‘hand over’) is the fourth one. Generally speaking, there is a target domain,
Oil Reserves, which is verbally and gesturally presented by the source domain
Winning ticket. Applying the Multimodal Semiotic Blending model, this can be
analyzed as shown in Figure 5.
o nosso passaporte pro futuro, pra empresas privadas estrangeiras. “I can’t conceive
that someone would want to take a winning ticket. There are two tickets, one isn’t
a winning ticket, and the other is. Hand over the winning ticket, from which we can
win our passport for the future, to foreign private companies.”
340 Maíra Avelar
The enunciative scene (in this and all other samples) is socioculturally situated in
the political-electoral discursive field, in which, from the argumentative relevance
point of view, the candidates use strategies to undermine their opponent’s credi-
bility. In this specific communicative situation, the political-electoral debate, Dilma
uses the fact that the PSDB, Serra’s party, is known for its privatizations. Predicting
her opponent’s strategy, the candidate foresees that the opponent will privatize the
deep-sea oil reserves and accuses him of this. When referring to pré-sal (‘the new
deep-sea oil reserves’), Dilma metaphorically presents it, both verbally and ges-
turally, as a bilhete premiado (‘winning ticket’). It is possible to identify a strongly
metaphorical process, compressed by some vital relations: analogy [petroleum =
winning ticket], identity-change [old oil reserves are not winning tickets → new
deep-sea oil reserves are], space-time [right=future x left=past], cause-effect [Serra’s
government → privatization of the deep-sea oil reserves], and so forth.
of an honest candidate and accuses his opponent of making up lies regarding his
political life, aimed at fooling the voters. The example is shown in Figure 6.2
2 Então ela vem e fica criando fantasias, fazendo espuma, fazendo mitos com o único
propósito de enganar as pessoas do ponto de vista eleitoral. “So she comes and creates
fantasies, makes foam, makes myths, with the sole purpose of deceiving the people
from an electoral point of view.”
342 Maíra Avelar
Figure 7. M
ultimodal Semiotic Blending model, applied to Serra’s speech from 2010
(Sequence #2)
3 Eu, se fosse funcionário do Banco do Brasil, da Caixa e do BNDES, eu (1) ficaria com
três pulgas (2) atrás da orelha (3). “Me, if I were a Banco do Brasil, Caixa, and
Multimodal Metaphors in Brazilian Political Debates 343
In this sample, in which the GU consists of two GPs, a new metaphor is used.
The conventional metaphor being suspicious is having a flea behind the ear
is amplified to having three fleas behind the ear. The first gesture points
to the ear and the last one, which is composed by a repetition, corresponds to a
gesture aimed at the television viewer that says “Beware!” There is also a com-
pression of the speech turn, in which the candidate tries to draw the television
viewer’s attention to the risk of the privatization of public banks. Thus, while the
source domain having three fleas behind the ear is verbally-gesturally staged
by the first GP (and the speech that occurs with it), the second GP not only com-
plements the first one, but also stages the target domain Being aware. Applying
the Multimodal Semiotic Blending model, we reach the analysis in Figure 9.
Me, if I were a Banco do Brasil, Caixa, and BNDES (public banks) employee, I
Left index finger pointing down to the right
BNDES (public banks) employee, I (1) would be with three fleas (2) behind the
ear (3). Note: We use a literal translation, to better clarify the gestures use. A similar
expression in English would be: “to smell a rat” (three rats).
344 Maíra Avelar
ultimodal Semiotic Blending applied to Dilma’s speech from 2014 (Sequence #3)
Figure 9. M
behind the ear, and depicts it pointing to her ear and, after, pointing up and down
in the television viewer’s direction. It is possible to identify some vital relations that
compress the metaphorical process, such as: analogy [suspicion = fleas behind the
ears], cause-effect [Aécio’s government → privatization and destruction of public
banks], and identity [PSDB = privatization party].
4 Aécio: Ela (a Petrobras) deixou as páginas econômicas (1) para frequantar as pági-
nas policiais (2). The company (Petrobras) left the economics section (1) to be
published in the police report pages (2).
346 Maíra Avelar
This sample can be considered as illustrating the metaphor time is space, unfolded
in the specific gestural metaphors: The past is on the left and The present is
on the right. As indicated by Calbris (2008) and Casassanto and Jasmin (2010), in
many cases past situations are located on the left and the present/future situations
on the right, reproducing the direction of writing in Western societies. Aécio draws
the television viewer’s attention to Dilma’s terrible economic management, compar-
ing Petrobras’ good development in the past to the corruption within the company
in the present. The repetition of the second gesture is a mechanism to highlight the
corruption problem. Using the MSB model, the analysis in Figure 11 can be given.
Figure 11. M
ultimodal Semiotic Blending model, applied to Aécio’s speech from 2014
(Sequence #4)
1. Introduction
This chapter reveals and confirms the implicit association between space and light-
ness by looking at the correlation between these two domains in two narratives
(Demian, by Hermann Hesse and The Rocky Horror Picture Show, hence TRHPS, by
Jim Sharman). In both texts there is a strong association between the two domains
used to create meaning in the unfolding of the plot. Space is described in terms
of lightness, and lightness is given a spatial connotation, following a seemingly
recurring pattern where the spatial semantic categories of in and out are associated
with the perceptual categories of light and dark. Both kinds of associations are
possible, as both are experientially motivated, i.e. the implicit association of in with
light and out with dark and vice versa; the association of in with dark and out
with light. For instance, the first association (in with light and out with dark)
is motivated by an experience such as a lit room at night, when it is light inside and
dark outside. It is not uncommon though that the opposite situation be activated
(in with dark and out with light). This can be the case of a daytime experience
looking out of a room into the sunlight, or the case of a tunnel, where we are in the
dark and can see the opening of light at the end. The impetus of the present research
stems from an apparent lack of understanding of the linguistic interplay between
space and lightness perception (see Sandford 2011 for an exception), even though
the topic of spatial conceptualization has been thoroughly discussed.
In the following, I provide the theoretical background for my findings (Section 2),
before proceeding to a discussion of the texts analyzed in Section 3, and the Implicit
Association Test (Section 4) that offers a conceptual explanation for the literary
mechanism employed. In Section 4.3 I provide a discussion of the results, and con-
clusions are given in Section 5.
2. Theoretical background
The theoretical concepts I describe here to facilitate an understanding of the re-
search paradigm this study follows include the concepts of spatial and lightness
understanding through image schemas, conceptual metaphors, and blending theory.
Spatial cognition plays a fundamental role in our everyday lives, not only in our
navigation through space, but also in our thinking and reasoning. Looking at spatial
semantics may unravel the conceptual structure at its base (see Zlatev 2007c; Evans
and Chilton 2010).
350 Marco Bagli
Table 1. S ource-target mapping for the love is a journey metaphor (Evans and Green
2006: 295)
A considerable amount of research has been produced in the CMT framework, ac-
counting for many metaphorical realizations, not only in language, but also in visual
arts, and other aspects of our everyday lives. Some linguistic examples of conceptual
metaphor regarding the source domains of light and seeing are: intelligence is
a light source (He is very bright; He can always shed light on the problem); hope
is light (She has bright hopes); impediments to awareness are impediments to
seeing (I was in the dark for a long time). The opposition of light/dark is also used in
conceptual metaphors that have a moral connotation: thus giving rise to goodness is
light (Look on the bright side!) and badness is darkness (The dark side of the force).1
However, the notion of CM is not sufficient in accounting for more complex
examples such as that given in (2), following Grady, Oakley and Coulson (1999: 103).
(2) This surgeon is a butcher.
1 All of the CMs and examples of their linguistic realization have been taken from
the Master Metaphor List, available online at http://araw.mede.uic.edu/~alansz/
metaphor/METAPHORLIST.pdf.
352 Marco Bagli
Figure 1. The surgeon as butcher blending model (adapted from Evans and Green,
2006: 406)
Making Sense of Space and Light 353
The first input space is the surgeon space, while the second is the butcher space.
The two share the general, abstract knowledge of “agent” (a surgeon in Input 1, and
a butcher in 2), the knowledge of “undergoer” (a person in Input 1, an animal in 2),
the knowledge of “work space” (“the operating room” and “the abattoir”). Both have
a goal (“healing” in one space, and “severing flesh” in the other), and a means (“sur-
gery” and “butchery”). This abstract knowledge is shared in the generic space. In the
blended space though we have an emergent structure, namely “incompetence”, as
a result of the mismatch between the goal (“healing”) and the means (“butchery”)
performed by a surgeon. In this chapter, BT is used to describe what happens in
the dominant conceptualization of light/dark and in/out. Before explaining the
model, I discuss two texts that express this conceptual construal: the conceptual-
ization of an experience communicated by a speaker for the understanding of a
hearer (Croft and Cruse 2004).
(6) Back home! Oh, the good, the blessed return to our house, to brightness and peace!
(p. 8, my emphasis)
Conversely, the dark world (7) is seen as surrounding the house and being every-
thing that is not light.
(7) All these beautiful, horrible, wild, cruel things existed all around – in the next
street over, in the house next door. Policemen and beggars ran around, drunks
beat their wives, gaggles of girls poured out of the factories after work, old women
could cast a spell on you and make you sick, bands of robbers were living in the
forest, arsonists were being caught by the country police – this powerful second
world welled up everywhere, its scent was everywhere, except in our rooms where
Mother and Father were. (p. 4, my emphasis)
As the plot develops, Emil seems to have forgotten this clear distinction of the two
worlds, especially after having met Demian, who seems a sort of “dark doppel-
ganger” of the main character. However, after a night out with his friends, Emil is
hungover and describes his feelings as follows in (8) and (9):
(8) Between headache, nausea, and unspeakable thirst, an image rose up in my soul
that I had not seen for a long time: I saw my parents’ house, my hometown,
Father and Mother, my sisters, the garden; I saw my quiet, comfortable bedroom,
the school and the market square, saw Demian and our confirmation classes – all
of it flooded with bright light, radiant, all of it wonderful, godly and pure, and I
now knew that everything, everything had still belonged to me the day before,
just a few hours ago had been waiting for me to return, but now, only now in this
moment, it had sunk forever under the waves, was cursed, was no longer mine. It
had thrown me out and now looked upon me with disgust! (p. 58, my emphasis)
(9) Once again I belonged entirely to the dark world – to the devil – and in that world
I was considered a splendid fellow. (p. 58–9, my emphasis)
Emil’s description of his feelings towards his family is made particularly vivid by the
number of metaphors and images based on the light/dark and in/out opposition,
which are consistent with the metaphor and image schemas of the whole text: light
has a positive value and is portrayed as radiating from the inside of a house into a
scary, dangerous and dark world outside of it.
Moreover, in TRHPS, the emergent structure is used to create irony. The two char-
acters (and the audience) are led to expect the inside of the castle to be “good” and
to be a safe shelter for the night during the storm. This is due to the elaboration
of the blend that occurs in the understanding of this scene. In fact, as soon as they
enter, this situation is reversed, as anticipated by the narrator and his rhetoric
question in (11).
(11) And so – after braving the inclement weather, and some not too little time – it
seemed that fortune had smiled on Brad and Janet and that they had found the
assistance that their plight required – or had they? – There was certainly some-
thing about this house (to which a flat tyre and a wet night had brought them) that
made the both of them uneasy – but, if they were to reach their destination that
night, they would have to ignore such feelings and take advantage of whatever
help was offered.
The director deliberately violates our metaphorical understanding of the scene to
create irony and a surprising effect around which the whole film revolves. Event-
ually, the two characters will experience self-emancipation from the stereotypes and
difficulties they had at the beginning of the film. This change can be considered as a
sophisticated way in which the director restores the “violated” emergent structure
of the blended space.
3.4. Discussion
Both the house in Demian and the castle in TRHPS are depicted as radiating “light”
or “goodness” in the respective narratives. This quality of the two buildings is used
to create meaning in the unfolding of the two plots, via elaboration of the blended
space that is represented in Figure 2. The conceptual metaphor that motivates the
emergent structure is the good is light metaphor (Forceville and Renckens 2013).
It may be regarded as a primary metaphor, as it expresses the correlation of the two
basic domains in experience (Grady 1997). In many cultures, day and night represent
good and evil respectively, in a symbolism that “probably goes as far back as the his-
tory of man” (Arnheim 1974 [1954]: 324). This positive value attached to light makes
biological sense, in that sight is restricted in the dark, making one more vulnerable
to potential threats and dangers (Forceville and Renckens 2013). In this sense, the
construal of light as good and dark as bad may be seen as universally motivated,
whether or not this motivation is conventionalized in a particular language/culture
or not (see Zlatev 2011). Hence, the Judeo-Christian tradition (which functions as a
background to the texts analyzed) can be seen as one possible cultural elaboration
of this primary metaphor, which dictates the emergent structure in the blend. This
of course does not mean that the domains of good and bad are universally con-
ceptualised in terms of light and dark nor, vice versa, that the domains of light
and dark structure only the domains of good and bad.
Crucially, the emergent structure in the blend allows for the understanding of
the interior of the two buildings as having a positive connotation. Light is given a
358 Marco Bagli
spatial connotation as it emerges from the interior of the buildings from which it
shines, while the spatial orientation of in/out is given a moral connotation, i.e. in
has a positive value, whereas out has a negative one. Duly, these observations and
analyses are consistent with the results described in the following section.
Table 2. Flower-Insects IAT overview (adapted from Greenwald et al. 1998: 1465).
In block 1, the participants are asked to categorize each stimulus presented on the
screen by hitting a corresponding response-key (usually E as left response-key and
I as right response-key). Block 2 is similar to block 1, but the other two categories
are practiced. In blocks 3 and 4, the two categories are overlapping: thus Flower
and Pleasant are associated with the left response-key, while Insect and Unpleasant
with the right response-key. In block 5 the categories of block 2 switch response-
key (i.e. Unpleasant is the left response-key and Pleasant is the right response-key).
Making Sense of Space and Light 359
Blocks 6 and 7 are similar to block 3 and 4, but the Pleasant and Unpleasant cat-
egories switch response keys.
Blocks 3, 4 and 6, 7 are called critical blocks, because the two categories are
overlapping and here is where the discrimination takes place. Furthermore, they
are divided into compatible critical block, when the categories sharing the same
response-key are implicitly associated (according to the researcher’s hypothesis),
and incompatible critical block when the categories sharing the same response-key
are not implicitly associated. The choice of which association is considered com-
patible or incompatible is determined by the research hypothesis. Blocks 1, 2 and 5
go under the name of practice blocks, because they serve to familiarize the partici-
pants with the testing procedure.
It goes without saying that the stimuli in each category need to be entrenched in
the category itself to represent an easy and automatic choice. The participant must
not spend too much time trying to categorize each stimulus; otherwise the results
are biased by reasoning, and would not provide an actual implicit attitude. In fact,
the distinctive feature of this testing procedure is its capacity to reveal implicit at-
titudes, beyond explicit awareness. Thus, results may sometimes seem unexpected
and surprising to the participant, who may not consciously think that they have
the sort of attitude that emerges from the test results.
One such case is represented by the results of the IATs that deal with social
prejudice, such as the black American vs. white American IAT. This experiment was
motivated by results in previous tests, in which there were “automatic expressions
of race-related stereotypes and attitudes that are consciously disavowed by the sub-
jects who display them” (Greenwald et al. 1998: 1473). Results of the test showed a
clear preference towards the category of white American that was not supported by
consciously elicited data that had been supplied through self-report questionnaire.
The IAT thus unmasked a subtle form of evaluative difference towards the two
categories.
The basic idea underlying the test is that the easier a mental task is, the quicker
its performance. Time is therefore an essential component: performance in the
critical blocks is assumed to be quicker when the two categories sharing the same
response-key are also implicitly associated. By calculating the time it took each
participant to perform each critical block, it becomes clear which association be-
tween categories is stronger.
The test was carried out at the Umbra Institute in Perugia (Italy), between No-
vember and December 2012. 31 native speakers of English with average age of 21
participated. Participants were reassured about the anonymity and the nature of
the test, which is not a pass or fail test. The test was carried out on a laptop in an
artificially lit room.
The category association of in and light sharing the same response key was
considered as a compatible block and the same for out and dark). Conversely,
the incompatible block was when the categories of in and dark shared the same
response-key (and the same for out and light).
Making Sense of Space and Light 361
4 According to the IAT script, D values lower than 0.15 indicate a “little to no” prefer-
ence, values between 0.15 and 0.35 indicate “a slight” preference; values between 0.35
and 0.65 indicate “a moderate” preference; values higher or equal to 0.65 indicate “a
strong” preference.
362 Marco Bagli
our embodied cognition and perception. The nature of human beings does not allow
us to see in the dark or during the night, thus exposing us to potential threats and
perils. This experience can potentially be at the basis of the positive value attributed
to light, as suggested by Forceville and Renckens (2013). The status of being “in”
something, and therefore of being contained, can also be seen as a protected status
against potential risks and dangers coming from outside. This experience can give
in a positive value. These shared positive values appear to be the basis for the cor-
relation between in and light. Of course, one can imagine a reverse situation in
which in has a negative value, because the container is a bad thing, and out of it
has a positive value, e.g. in trouble, out of jail. Still, the results of the test revealed
a default association of in with light (and out with dark), and therefore being
more likely to have a positive value, at least for English speaking participants. The
creation of meaning by human beings is a highly dynamic and complex process that
involves multiple steps and devices, regulated by imagination:
Imagination is at work, sometimes invisibly, in even the most mundane construction
of meaning, and its fundamental cognitive operations are the same across radically
different phenomena, from the apparently most creative to the most commonplace.
These operations are characteristics of the human species. Though taken for granted by
human beings, they are extraordinary by any other standard. (Fauconnier and Turner
2002: 89).
The cognitive devices employed to construe the meaning described in this chapter
are those of image schemas, conceptual metaphor, and conceptual blending. The
account of their interaction presented in this chapter may help us understand
how we construe the meaning of lightness in relation to space. On its topic, and
methodology, the study represents a bridge between different disciplines: cognitive
linguistics, literary studies, film studies, and semiotics, and thus engages in one kind
of cognitive semiotics.
Acknowledgments
Firstly, this chapter could have not been written without the participants of the
Implicit Association Test and the staff at the Umbra Institute, who have always
been very helpful and cooperative. Secondly, I would like to thank the reviewers
and Jordan Zlatev for their illuminating comments on the first draft of the chapter.
Lastly, I wish to express my gratitude to Prof. Jodi Sandford for her invaluable help
and guidance throughout these years of working together.
Katherine O’Doherty Jensen
Chapter 23
Performative Metaphor in
Cultural Practices
I cannot hear what you are saying – your actions speak too loud.
Chinese proverb
1. Introduction
I draw attention to a cognitive operation that has been overlooked, but nevertheless
has pervasive functions in the constitution of human cultures. This operation con-
cerns the capacity to discern and align gradient differences between entities, such
that quite different entities – including affects, actions, events and material objects –
are treated as analogues of each other. It mediates the performance of signifying
practices and the communication of un-verbalized meanings in many spheres of
everyday life.
Natural languages express and mediate discernments of categorial distinctions
between one kind of entity and another. They also mediate the shared appreciation
of gradient distinctions, that is to say, “more-or-less” differences – often expressed
by modifiers such as very, extremely, almost, including gradable adjectives such as
younger, higher, closer. Categories are of course also distinguished non-verbally.
Just as a gesture can signify “good-bye”, so manners of dress can signify gender
difference, while one composition of foods signifies “breakfast” and quite a dif-
ferent composition signifies “dinner”. Sometimes the point is made that categorial
distinctions tend towards ambiguity when communicated non-verbally, compared
with the relative precision of language. The point under consideration here, how-
ever, is that non-verbal communication differs from language insofar as it centrally
concerns the display, expression and discernment of gradient rather than categorial
distinctions. Moreover, the shared appreciation of gradient distinctions can become
entrenched in ways of doing things that make sense to practitioners, although they
may find it difficult or impossible to put into words exactly what such practices
mean. While the constitutive role of language in human culture is widely acknowl-
edged, the shared appreciation, normative governance and cultural entrenchment of
gradient differences, as expressed by humans in non-discursive practices, are not.
My objective is to redress this state of affairs.
The point in question is that while non-verbal communicative actions such
as waving, bowing, hugging and kissing, are different categories of social inter-
action, the difference between one instantiation and another of the same kind is
364 Katherine O’Doherty Jensen
quantitative and gradient in character. Even between the same interactants, a hug
given in one social context is likely to be closer, more long-lasting or more intense
than that of another. Such gradient differences are discerned as being meaningful,
although such meaning tends to remain implicit and is always deniable insofar as
it is not put into words. Nevertheless, any given hug is located at some point on
gradient continua of relative proximity, duration and intensity; it is more or less
close – distant, long – short, and high – low on a scale of intensity. Familiarity
with cultural norms can guide us as to whether a hug is expected in a given social
context, but the challenge of appropriate performance always remains. This is
the challenge of propriety, and it has been succinctly described in gradient terms
and in negative form as concerning the discernment of precisely “how far is not
going too far?” (Mayol 1998: 21) Appropriate performance, according to folk
descriptions, “hits the mark”; it is “fitting” or “spot on”. It is never “way out”,
“over the top” or “wide of the mark”. My topic concerns the cognitive operation
whereby this discernment is accomplished by social actors and interactants in
everyday life.
Section 2 introduces the concept of performative metaphor – the objective of
which, as seen from a first-person perspective, is to act appropriately in a given
social context by discerning which variant of a given non-discursive cultural
practice fulfils that intention. The concept developed here has its roots in earlier
accounts of conceptual metaphor theory (Lakoff and Johnson 1980; Lakoff 1987;
Lakoff and Turner 1989; Fauconnier and Turner 2002). While several media of
analogical communication can be at issue in performative metaphors, including
facial expression, voice tone, body movement and gesture, exemplification in
the following sections is mainly focused upon uses of material objects as the
medium of communication. Food practices serve as an excellent case in point,
not only because they involve the use of objects that are often consumed or
deployed in the interactive form of shared meals, but also because they are
well illuminated by empirical social research. These practices also serve as an
interesting point of departure for understanding non-discursive (mimetic) prac-
tices of our species since – in one form or another – they have preceded our
capacity for speech (Donald 1991). The following sections illustrate the relevance
of analysing cultural practices from the perspective of a cognitive theory of
metaphor that focuses upon what we do, rather than what we say. A case study
is presented regarding a puzzling issue that has emerged from anthropological
and sociological studies of food. It concerns differences between the eating prac-
tices of men and women and a corresponding tendency on the part of both to
perceive some foods and dishes as being “masculine” and others as “feminine”.
The ways in which social scientists have accounted for these differences hitherto
are then compared with an account from the perspective of performative meta-
phor theory. I conclude by indicating some implications of this account for the
field of cognitive semiotics.
Performative Metaphor in Cultural Practices 365
2. Performative metaphor
Metaphorical thinking is at issue in what we do. Lakoff and Johnson (1980) re-
vitalised this notion with the claim that conceptual metaphor is pervasive in every-
day life, not just in language but in thought and action, and they concluded their
argument with the suggestion that some of the metaphors we live by are partially
preserved in everyday ritual activities. Psychologists have introduced terms such
as enactive metaphor (Kirmayer 1992) and metaphorical act (Ogden 2002) to refer to
idiosyncratic meanings as presented in gestures, and more recently much research
has been devoted to examining the ways in which conceptual metaphors are in-
stantiated in gestures and multimodal discourse (Cienki and Müller 2008). Gesture,
posture, ritual activities and uses of material objects have long been conceived by
anthropologists as being metaphorical in character.
Douglas devoted decades of research to the observation and analysis of food prac-
tices, concluding that the way in which the British eat is as formally structured as a
Bach sonata (Douglas and Nicod 1974). She explicated the structure underlying these
and other cultural practices, and remained convinced that metaphorical thinking
(“this code-breaking, jigsaw-puzzle solving activity of the human mind”) must be
at play in what she termed the cognitive underpinnings of these practices (Douglas
and Isherwood 1996 [1972]: viii). Her view was that the exact mechanisms of meta-
phor, comparison and social grading of events and food which make for cultural
competence had not yet been established (Douglas 1984). Others have focused in-
stead on performance in their analyses of metaphors at play in everyday interactions
and social dramas (Fernandez 1986; Turner 1987), arguing, for example, that ritual
is the acting out of metaphorical assertions (Fernandez 1986). The concept of per-
formative metaphor introduced here, however, seeks to account for the character of
metaphorical thinking in what we do, quite independently of what we say.
Conceptual metaphor theory (CMT) (Lakoff and Johnson 1980; Lakoff 1987),
followed by conceptual blending theory (BT) (Fauconnier and Turner 2002), shifted
the focus away from the idea that metaphor is a matter of talking about something
in terms of something else, and towards the idea that it concerns conceptualising
something in terms of something else. In this light, accounting for the character of
conceptual metaphors was seen as contributing to a theory of human cognition,
while language practices merely provided an important source of evidence. The
account of performative metaphor presented here takes this line of thought a step
further with the suggestion that metaphorical thinking is also expressed in non-
discursive practices, in which something is treated in terms of something else. It
follows that these cultural practices also provide important sources of evidence with
regard to our cognitive operations.
The type of performative metaphor under consideration here can be defined as
comprising a voluntary, intentional act mediated by the discernment of the grade
of one entity on a gradient continuum as corresponding to the grade of a different
entity on another gradient continuum, such that one entity is treated in terms of
another in practice. This is a cognitive operation each of us performs many times
366 Katherine O’Doherty Jensen
in the course of the day. We do so, for example, when we peruse the contents of
our implicitly graded wardrobe (ordinary – special) with a view to choosing items
of clothing that correspond to the relative importance of events (unimportant –
important) to be attended that day. It is precisely because a hug signifies a level
of affection (little – much), the relative closeness of a relationship (distant – close)
and perhaps the length of time that has elapsed since last performed (short – long)
that its performance here and now can disappoint or delight, as well as deceive.
An example of performative metaphor in a more formal setting is the normatively
governed ritual we perform as members of an audience when a concert, opera or
play reaches its conclusion. Given the gradient characteristics of the noise made by
clapping – its relative level of volume (soft – loud) and its (short – long) duration –
this can be performed by each member of an audience in a manner that signifies at
which point on a gradient scale of relative appreciation or enthusiasm (low – high)
his or her response belongs. The meaning of thunderous and long applause or barely
audible and brief applause, or points between those extremes, is readily understood
by all participants in a given social context as representing the audience’s level of
enthusiasm and does not usually present any difficulties of interpretation.
Common to both CMT and BT is the idea that metaphorical statements “make
sense” when points of similarity are discerned between the elements in two concep-
tual domains. While Lakoff and Johnson sought to account for the process whereby
metaphorical conceptualisations become entrenched in language use in ways that
speakers perceive to be apt expressions of meaning, Fauconnier and Turner sought
to provide a general account of meaning constructions – including metaphorical
meanings. CMT remained closer to the more traditional accounts of metaphor in
linguistic theory by conceiving the generation of meaning as being unidirectional,
that is to say, as comprising the projection of image structure, conceptual structure
and inferences from a “source” domain to a “target” domain. They departed from
the traditional view by proposing that conceptual metaphors not only inform our
way of thinking, but also underlie our manner of expressing ourselves – even when
we do not put them into words in a direct fashion. For example, we do not say
IDEAS ARE FOODS (Lakoff and Johnson 1980: 46–7). Nevertheless, common speech
patterns clearly indicate that we frequently do employ images and concepts drawn
from the source domain of food and project these onto the target domain of ideas
when we want to express something about the character of the latter domain. Thus,
we “plant the seeds” of one idea and “give birth” to another. Ideas “crop up”, “grow”
and may prove “fruitful”. We academics work within “fields”, and while some of
our students “devour” our ideas others find them “hard to swallow” or difficult to
“digest”. It transpires that images and concepts drawn from the entire cycle of food
production, distribution, consumption and waste disposal can in fact be identified
in the ways we talk about ideas in practice (O’Doherty Jensen 2003). Evidence of
this kind lends credence to the view that the main function of conceptual meta-
phors is that of facilitating the process of conceptualisation. That is to say, we use
the images and concepts of a domain that is familiar to us in order to characterise
a domain that is more abstract or less familiar.
Performative Metaphor in Cultural Practices 367
one way rather than another is discerned as appropriate when these locations cor-
respond to each other and as being inappropriate when they do not. For example,
the ritual of applauding as seen from a first-person perspective is one in which
the relative volume and duration of the noise made by clapping hands together
is compared with a felt level of appreciation and enthusiasm. The performance of
clapping is discerned as appropriate and makes sense to the practitioner and to his
or her interactants precisely in the measure that its volume and duration are per-
ceived as corresponding to a felt level of enthusiasm. The meaning of applause can
thus be described in the language of BT as comprising a blend. That meaning is not
found in either input, as such. Had the level of appreciation been lower, we would
expect the volume of clapping to be correspondingly lower – unless, of course,
some practitioners are aiming to deceive. The ritual of applauding is a cross-cultural
convention, easily learned at an early point in our socialization, whereas the per-
formance of clapping rests upon the here-and-now discernment of each member
of a given audience.
The intervals on such gradient continua or scales are related to each other by
reason of contiguity, while mapping between one such continuum and another
rests upon the discernment of relative quantitative similarity. This is the operation
described elsewhere as the discernment of symmetrical analogy in which mapping
reveals a structural similarity between A and B that obtains in both directions
simultaneously (Itkonen 2005).
It can thus be said that a discernment of appropriate performance, when me-
diated by performative metaphor, always has the same generic form: the grade
of A is the grade of B. Likewise, any discernment of disparity between the grade
of A and the grade of B underlies the appraisal of a given performance as being
inappropriate. (In such performances, A does not “hit the mark of” B, and cannot
therefore be deemed as “fitting”.) Thus, enthusiastic members of an audience, who
are thunderously applauding the leading soprano as she takes her bows on the
first night, might well deem it inappropriate if the management of the opera house
were to present her with a small posy of violets as a token of their appreciation.
The reason being, that a small posy of humble violets occupies a location that is
too low on a scale of possible floral tributes to adequately convey a high level of
appreciation. It is to be noted that the floral tribute functions here as the signifier
of a grade of affect. As such, it signifies a gradient meaning, not to be confused with
the semantic content of any accompanying words such as congratulations (get well
soon, heartfelt sympathy or something else).
Given the ubiquity of analogy in cognitive operations, we may well wonder
whether all actions mediated by discernments of symmetrical analogy are instances
of performative metaphor. States of affect and the vocalisations of babies are anal-
ogously related, and the mother running to console her crying baby has doubtless
discerned just how distressed her baby is. But there is no good reason to categorize
this response as an instance of performative metaphor. Metaphoricity is itself a
scalar concept in which meanings are more metaphorical in the measure that the
entities compared are also dissimilar in relatively many respects. In the following
Performative Metaphor in Cultural Practices 369
sections of this chapter I shall be concerned with cultural practices that are largely
conventionalised and in which dissimilar entities – material objects, events, ac-
tivities and people – are treated as analogues of each other.
A wider range of entities can be at issue in performative metaphors including
affects, kinaesthetic feelings, sensory experiences, vocalisations, facial expressions,
body movements and gestures. The inputs at issue may be categorised or named, but
not necessarily so. In all cases, however, only entities ranged on gradient continua
are in question. A gradient perspective is imposed upon any given set of categories
by ranking exemplars according to one or more of their properties, such as relative
size or importance. Thus it can be said that the intervals on gradient continua are
ranged in the space between gradable antonyms (loud – soft, small – large, little –
much, unimportant – important). However, natural languages do not commonly
categorise or name these intervals, except when we number them. For this reason,
it is often difficult to put into words what it is that has occurred when the grade of
A is discerned as corresponding to the grade of B. An action is tacitly understood, a
given option feels right, the proper thing to do is recognized by oneself and others,
but we may well be at a loss to explain why this is the case. Performative meta-
phors are not explicit, declarative or reflective, but operate in the sphere of practical
understanding in everyday life.
3. Food practices
The consumption of food activates the senses of taste, smell and touch, each of
which in contrast to the modalities of vision and sound yields experiences that are
sequential and continuous, that is to say, non-discrete. The properties of a pleas-
urably sweet or salty taste and those of fragrance, warmth and texture are matters
of degree, such that these properties tend to be construed as belonging at some
point on a gradient scale. Scalarity, as pointed out by Popova (2005), dominates
perceptual experience in the so-called “lower” sense modalities of taste, smell and
touch, and is intrinsically normative in that it entails evaluation of instantiations
of graded properties. The consumption of any particular foods, dishes or meals are
thus experienced as being located at some point on a good – bad gradient, and as
being more or less pleasant – unpleasant, satisfying – unsatisfying. Appraisal of
gustatory experience concerns the discernment of just how good or pleasant or fill-
ing a particular item is. In this light, it is not surprising that foods are commonly
rank-ordered within the daily cycle of eating events.
The food practices of one region differ from that of another on many points
that are deemed crucial by practitioners. This observation underlies the view com-
monly held by anthropologists and sociologists that food cultures, like languages,
are unique. They not only differ between regions, but also to a greater or lesser
extent between tribes within a region or population groups within a larger society.
For this reason, distinctive food practices can signify nationality, religious affiliation,
ethnicity, social class, age or gender (cf. Bourdieu 1984; Warde 1997). If we are to
understand the cultural controls upon signifying practices of this kind, it is held,
370 Katherine O’Doherty Jensen
we must be able to interpret a given set of practices within its specific cultural
context. For the present purpose, however, I shall focus upon common features
of food cultures that obtain cross-culturally. On this point, fewer observations are
recorded in the literature.
The following are features that obtain cross-culturally (Twigg 1984):
• hot, cooked foods tend to be prized more than cold or raw foods
• animal fats and vegetable oils are prized more than water as a cooking medium
• animal products are prized more than vegetable products
• cereal products occupy a position at the lowest point of the culinary scale
Food products within these superordinate categories are also ranked. Meat, for
example, is generally prized more than other animal products, such as eggs and
dairy products, just as fruits are prized more than vegetables, leaf vegetables more
than roots and tubers, and some cereals are prized more than others. Red meats
generally rank above the white meats of poultry or fish, and within these categories
some variants, cuts and forms of processing are ranked more highly than others.
These appraisals are reflected in the analogical medium of food prices and in the
successive phases of global demand for specific kinds of food that follow upon rising
standards of living over time (Grigg 1999). They are also reflected in the ways in
which food products are mapped onto primary, secondary and tertiary meals and
courses within a given food culture (O’Doherty Jensen 2002, 2003, 2009).
Briefly, cereal products are generally deemed appropriate to the lowest-ranking
eating event within the meal system (breakfast). Cooked hot foods are deemed
appropriate to the highest ranking meal of the day (dinner), meat being the usual
centrepiece of the main course of this meal, while cereal products are often ex-
cluded as the staple ingredient of this course. Cereals, in the form of bread or pasta
for instance, are rarely excluded from a first course of the primary meal or from
the secondary meal of the day (lunch/supper), and its accompaniments are drawn
from products at the mid-point or higher locations on the culinary scale (veg-
etables, eggs, cheese, fish, meat). The culinary status of any meal is thus raised by
the introduction of products that occupy a higher position on the culinary scale,
by the inclusion of more courses or the addition of cooked dishes. The inclusion of
fruit or alcohol, two products that are consumed for the most part outside the meal
system, also have the function of raising the culinary status of courses or meals in
which they are included.
The manner in which food culture is constituted by aligning gradient scales and
attuning them to each other becomes clearer by considering ways in which grades
of meals are mapped onto grades of participants, the use of more or less household
resources of time, money and skill, as well as grades of events. A shared meal is
an occasion of joint attention, intention and action, in which the notion of having
something in common takes the concrete form of consuming a portion of the same
food. While the daily meal cycle serves to distinguish times of day (morning, noon
Performative Metaphor in Cultural Practices 371
and evening), the highest ranking meal is accorded the most household resources
and is consumed at the time of day when most household members can be assembled
together. There are many with whom a meal is never or rarely shared. The tramp is
given food or drink at the door, neighbours, colleagues, acquaintances and friends
meet for coffee, drinks or lunch, but a shared dinner tends to carry the inference or
prospect of a closer friendship or intimacy. Everyday meals are distinguished from
those of special occasions, which are constituted by including more people, using
more resources, including more courses, and selecting items of food and drink that
occupy locations on the culinary scale that are deemed appropriate to the grade
of both the occasion and its participants. The “dinner party” concerns the con-
sumption of material objects, but cognitively, constitutively and effectively it is an
event within the “world mediated by meaning” (Lonergan 1972). It is an institutional
fact (Searle 1995), insofar as the practices that canonically and normatively count
as a dinner party are performed by its participants (cf. Sinha and Rodriguez 2008).
Appropriate performance in this as in other social contexts rests on the cognitive
operations whereby gradient scales are attuned to each other by social actors, such
that the grade of food and drink, the setting, the status and attire of participants, as
well as the duration of the event are all perceived as “fitting”.
studies document men’s distaste for too much “greenery” and for “sissy” or “wimpy”
dishes in which vegetables or grains dominate (cf. Sobal 2005).
These trends prompted Lupton to conduct focus groups in which men and
women were presented with pictures of a range of different menu options and
asked to identify which were “masculine” and which “feminine” (Lupton 1996). It
was found that while both men and women readily agreed in their discernments of
“masculine” versus “feminine” dishes, they were unable to agree on any reasons why
this was the case and some maintained that neither the questions nor the answers
could be taken seriously. Although Lupton adhered to the view that there is a meta-
phorical relation between femininity and vegetables, and between masculinity and
meat, her study did not illuminate how such discernments are made by social actors.
A review of more than 300 social scientific studies undertaken in Australia, Japan,
North America and Western Europe clarified that a wider range of foods is im-
plicated in gendered preferences and practices (O’Doherty Jensen and Holm 1999).
Women express preferences for fruit as well as vegetables, and for poultry, fish, a
variety of sour-milk products and sweet items, while men have a pronounced prefer-
ence for alcohol, especially beer and spirits, and for all meat products, especially
red meat. Women enjoy “light” meals and regard salads, soups and omelettes as
options that can suitably comprise a dinner for their personal consumption, whereas
men prefer a “solid” meal, such as meat, potatoes and gravy. Moreover, deviations
from these patterns are negatively sanctioned in many settings. Gendered norms
in Japan, for example, are reported as being particularly marked in regard to sweet
foods, when consumed by men, and alcohol, when consumed by women (Loveday
and Chiba 1985). The same food products are also at issue in the norms governing
traditional domestic etiquette in Western societies whereby the woman should cut
the cake (and serve tea or coffee) and the man should offer alcoholic drinks (and
carve the meat).
It is likely that some trends noted in the 1980s and 1990s are no longer as marked
as they once were. There are clear indications for instance, that drinking habits
among Western women are beginning to resemble those of men, just as their smok-
ing habits developed in earlier decades. However, there is an exception to these
general trends that has been observed since the early 1980s. Men who have higher
levels of education and correspondingly higher incomes tend to have eating prac-
tices that resemble those of women rather more closely than they resemble the eat-
ing practices of other men (Murcott 1983; Prättälä, Berg and Puska 1992; Sweeting,
Anderson and West 1994). I will return to this issue presently.
does not rest on the assumption that social structure or consumption practices are
reproduced in the measure that social actors internalise given norms and roles and
act accordingly. In contrast, this account is based on the premise that norms and
roles will tend to be reproduced in the measure that particular practices continue
to “make sense” to social actors.
The manner in which grades of food (high – low) are mapped onto grades of
meals (high – low), and grades of meals are mapped onto grades of events (import-
ant – unimportant) as well as grades of people (close – distant) has been sketched in
Section 3 above. These patterns obtain cross-culturally and ground the assumption
that both men and women are familiar with the relative culinary status accorded
to different foods and beverages in their everyday lives. Whether both men and
women are also likely to be familiar with a pattern whereby a higher social status is
commonly accorded to men and a lower status to women calls for further comment.
A number of contributions to the field of gender studies in recent decades de-
scribe local settings in which men play “feminine” roles, women adopt “masculine”
roles, or in which both men and women are accorded multiple roles influenced
as much by social class, race or age as by gender. The concepts of “multiple mas-
culinities” and “multiple femininities” have been introduced, and the question has
arisen as to whether gender should be conceived as referring to a binary opposition
or to a continuum with feminine and masculine poles (Boydston 2009). These studies
might well be considered as undermining the view that women (still) have a lower
social status than men. On this issue, cross-cultural data are published annually
in Global Gender Gap Reports, which illuminate differential status by calculating
national ratios of women to men within the areas of health, educational attainment,
economic participation and political empowerment.
Among the 142 countries included in the most recent report (Hausmann, Tyson,
Bekhouche and Zahidi 2014), representing more than 90 % of the world’s population,
it is found that the health gap with respect to life expectancy has been now closed
in 35 countries, the educational gap with respect to literacy and enrolment has
been closed in 25 countries, while no country has yet closed the gender gaps with
respect to economic participation or political empowerment. Women remain sig-
nificantly underrepresented within the higher echelons of commercial, public, legal
and political organizations. It is found that while the gender gaps with respect to
rights, responsibilities and opportunities are slowly closing, significant differences
between countries remain. These data do not illuminate the extent to which relative
levels of gender equality differ between population groups within the same country.
On this point, it has been consistently found that educational level is the decisive
factor in the development of more equitable attitudes and practices among men, and
is also decisive with respect to the development of women’s expectations regarding
the division of unpaid domestic labour within households (Lorber 1994; Kitterod
and Lappegard 2012; Levtov, Barker and Contreras-Urbina 2012).
On this background, I am prepared to submit that the theory of performative
metaphor does yield an account of the data concerning gendered food practices that
is more coherent than previous accounts. The metaphor at issue, GRADES OF FOOD
Performative Metaphor in Cultural Practices 375
ARE GRADES OF PEOPLE, is one in which foods, beverages and dishes that have
a high status on the culinary scale are discerned as those appropriately consumed
by those who have a higher social status, men, while those located at lower points
on the culinary scale are appropriately consumed by women.
Men’s food preferences and practices do have the common characteristic just
noted. In each case, they occupy locations on the culinary scale that are, so to speak,
a step higher than those of women. Thus, men’s preferences concern: the main
course of the primary meal of the day (typically meat, vegetables and potatoes) as
compared to other courses (fish, soup, salads, fruit, sweet items); the centrepiece
of the main course (meat) as compared to its side dishes (vegetables, cereals/staple
foods); meat products in general as compared to other animal products (eggs and
dairy products), and alcohol as compared to non-alcoholic beverages. In this way,
distinctive, gender-specific consumption patterns function as constitutive elements
in the social construction of “masculinity” and “femininity”, respectively.
The reason why the same foods function as signifiers of gender in otherwise
disparate societies can be attributed to the character of the inputs at issue in the
performative metaphor GRADES OF FOOD ARE GRADES OF PEOPLE. As we have
seen, the attribution of a higher culinary status to animal as compared to vegetable
and cereal foods (see Section 3), and the attribution of a higher social status to
men as compared to women, are both cross-cultural patterns. If discernments of
gender-appropriate consumption are indeed mediated by this performative meta-
phor, it follows that the discernments and practices at issue are not generated in
an arbitrary fashion. In the measure such practices are recognised as appropriate,
they become conventionalised, regulated by social norms and the sanctions that
accrue to deviation from gendered roles. In this respect, the functionalist account
correctly attributes influence to social structure. By the present account, however,
the stability of that structure is dependent upon the stability of the discernments and
practices by which men and women are rank ordered on a social gradient, just as the
stability of a food culture is dependent upon the stability of the discernments and
practices by which foods are rank ordered on a culinary gradient. In this light, the
fact that men who have higher levels of education do not exhibit the food practices
common to most men is not an anomaly. This is the population group among men
that subscribes to an ideal of gender equality and whose members are therefore least
likely to perceive men and women as being located at higher and lower positions
respectively on a gradient of relative social status. The discernment of a distinctive
pattern of food consumption as being gender-appropriate would fail therefore to
“make sense” to this population group.
7. C
onclusion: the character of culture and the challenges
of cognitive semiotics
By this account, cultural competence lies in learning and recognising the patterns
whereby people, events, material objects and other entities are rank ordered within
376 Katherine O’Doherty Jensen
the social niche in which one operates, while agency is exercised in the discernment
of counterpoint connections. Cultural practices become and remain entrenched in
the measure that they “make sense” to practitioners. We may take such rankings
for granted or we may challenge them, as feminists have been doing with respect
to the relatively high status of men and climatologists and nutritionists have been
doing with respect to the relatively high status of meat. But conscious, intentional
activity is always involved in the discernment of patterns and connections and the
challenge of appropriate performance is ever present. The concept of performative
metaphor describes the cognitive process whereby this is achieved, when con-
nections between gradient phenomena are discerned and acted upon. This is a per-
vasive process in everyday life, and could have been illustrated by considering the
way in which social standing is mediated by the relative depth of bows of greeting
among Japanese geishas or the number of “likes” on Instagram posts among Nordic
teenagers. I hope this concept might prove useful to other analysts of cultural prac-
tices, whether their interest lies in stocks and shares or song and dance.
This pervasive process, I submit, is misunderstood when language practices are
taken as providing the model by which all cultural practices are to be analysed.
This is the viewpoint from which cultures are “texts”, agency is exercised in the
measure that alternative “cultural scripts” are available, gestures and objects are
“read” and signify other entities only in the measure that categories correspond,
while “status symbols” merely refer to goods consumed at the end point of a social
gradient. Language, however, is a relatively poor medium through which gradient
characteristics can be expressed, while, as I have sought to demonstrate, the use
of non-verbal signs achieves this purpose in a non-arbitrary fashion and with ex-
quisite precision in everyday life. For this reason, cognitive semiotics must embrace
anthropological, sociological and psychological studies of cultural practices, with
particular reference to gradient meanings of non-discursive practices, just as it
embraces linguistic studies of language use. This is no mere partisan plea to extend
or define the boundary of data relevant to the discipline. Rather, it is based upon rec-
ognition of the need to delineate and distinguish the cognitive capacities underlying
the signifying and representational practices of humans in their non-discursive as
well as discursive forms. This must be regarded as a core task of cognitive semiotics,
not least if we are to understand the ways in which human cultures have been and
continue to be constituted over time.
Ontogenetically, the capacity to discern and compare gradient phenomena
pre-dates our capacity for language. Our earliest communication, whether crying,
smiling, babbling or laughing, is gradient in character and our individual survival
and development depends upon the capacity of others to discern its significance.
According to Stern (1985), it is the gradient characteristics of sensory experiences
and bodily feelings – intensity, duration and hedonic tone – that enable the infant
to connect experiences in a coherent way, while the ability to perceive and compare
gradient phenomena is well within the infant’s capacity at three weeks of age. His
account of the manner in which affect attunement between mothers and babies is
achieved unreflectively in episodes of joint attention clearly exemplifies the char-
Performative Metaphor in Cultural Practices 377
Acknowledgement
The author wishes to thank two anonymous reviewers for their insightful criticism
of an earlier draft of this chapter and for their thoughtful queries.
Francesco-Alessio Ursini
Chapter 24
Objects and Nouns: An Account of the
Vision-Language Interface
1. Introduction
A core aspect of cognitive semiotics is the transdisciplinary analysis of how different
domains of meaning are interrelated. In this chapter, I propose that two goals are
necessary for such an analysis. First, one must identify core cognitive modalities
and their underlying ontologies, defined as the sets of basic categories and relations
on which cognitive processes operate (Smith 1995; Brandt 2004; Brier 2009; Durst-
Andersen 2011). Second, one must study the properties and relations underpinning
these ontologies and their governing processes. If successful, an integrated account
of how meanings are related hopefully becomes possible.
Recent works have pursued the first goal by proposing that building blocks for
a general cognitive ontology may include mental representations of objects, events
and places (Metzinger and Gallese 2003; Zacks 2010), beliefs and goals (Gallese and
Metzinger 2003), and social, cultural and historical relations (Brier 2009). Cognitive
linguistic analyses have suggested that similar categories can be found in language
and non-linguistic concepts (Jackendoff 1990; Evans 2009).
However, few works have pursued the second goal, mostly by analysing the
parallels between language and belief planning systems (Gallese and Metzinger
2003), or language and spatial cognition (Ursini 2011; Jackendoff 2012). By leaving
aside the study of possible connections between ontologies (e.g. how visual and lin-
guistic primitives of spatial cognition are connected), these works render the second
goal a still outstanding problem.
The aim of this chapter coincides with this second goal. I focus on two cog-
nitive modalities, vision and language, concentrating on a sub-set of processes
individuating and classifying visual objects (or just “objects”) and noun phrases
(henceforth NPs). By presenting a restricted set of data, I aim to offer a minimally
cross-modal analysis, as well as a transdisciplinary methodology, integrating results
from object recognition in the field of vision and semantic typology in linguistics. I
also implement a formal account based on the notion of infomorphism: a model of
how the different domains of vision and language, (conceived as distinct by related
cognitive-semiotic systems) exchange information (Barwise and Seligman 1997). In
particular, I discuss how this analysis can be used to account for three phenomena,
lexical variation, reference, structure of semantic memory, that involve connections
between these two systems.
380 Francesco-Alessio Ursini
2. Ontologies
2.1. Ontologies: objects
In order to define a core ontology for objects, we need to address some basic prob-
lems regarding the controversial notion of object. Earlier definitions have suggested
that visual objects can be individuated as “things” that (a) have a three-dimensional,
convex shape (Marr 1982), and (b) that can be grasped and touched (Spelke 1990;
Carey and Xu 2001). However, some cases suggest that this compound definition is
problematic. For instance, we can point to and grasp an edge of a table, and refer to
it via the definite description the edge of the table. However, an edge is not a convex
three-dimensional object, hence not an object under this definition.
Current definitions of visual object suggest overcoming this enpasse by studying
how visual sub-systems can individuate visual entities, each via different classifying
principles (e.g. Scholl 2007). Here I focus on four sub-systems out of those pro-
posed in the literature for two reasons. First, these sub-systems seem the most
broadly accepted as distinct but connected sub-systems of the object recognition
system. Second, the types of objects that these systems individuate seem to have
clear counterparts in linguistic types, facilitating our analysis of how cross-modal
connections can arise (Barwise and Cooper 1997: 43). I use the term type as in the
literature on object recognition (e.g. Scholl 2007) for a visual property, or mental
representation thereof, that can be used to classify a specific object, which can then
be a token or given instance of a type.
The first sub-system classifies objects via their types of motion patterns. We can
individuate shapeless, colorless entities such as dots and promote them to the status
of objects, insofar as they are moving. We can classify dots as distinct via the shapes
of the patterns they follow (Cavanagh et al. 2001), possibly also attributing to them
a form of agency. If a dot moves on a screen, we can classify it as an object moving
by its own volition (Choi and Scholl 2004). We can also follow these agent-like dots
by tracking their apparent goals as individual or groups of dots, thus classifying
them as different types of quantities (Gao and Scholl 2011). Thus, we can say that
if “something” moves, then we can classify it as an object.
The second sub-system individuates objects by discriminating attribute types,
usually labeled as textons (Julesz 1981). Texton types can include material composi-
tion, basic geometrical properties, color or texture as attributes of objects (Prasada,
Ferenz and Haskell 2002). We can look at a glass of beer and individuate beer as a
liquid via its color and (perhaps) material composition, even if its contingent shape
is determined by the glass, qua a container (Leung and Malik 2001). Thus, if “some-
thing” has properties corresponding to textons, then we can classify it as an object.
The third sub-system classifies objects via their types of geometrical structures
or shapes. The Recognition by Components (RBC) model suggests that objects’
shapes can be found within a library of geons (i.e. shape types), and combinations
thereof (Hummel and Biedermann 1992). The H-MAX model, instead, suggests that
more basic geometric features are computed together as “possible” partial images,
An Account of the Vision-Language Interface 381
1 This conclusion is in line with Meinong’s theory of objects (Marek 2013). I wish to
thank an anonymous reviewer for suggesting this apt point.
2 In most South-East Asian languages, most classifiers also have adjective-like proper-
ties, and the class of “pure” nominal adjectives is very small (Aikhenwald 2000). This
aspect is not crucial for our discussion.
382 Francesco-Alessio Ursini
Asher 2011: ch.2). However, I focus on four sub-types that correspond to the types
discussed for objects in the previous sub-section for two key reasons. First, this
choice renders our discussion of the relations between objects and NP types more
streamlined. Second, it allows us to discuss a restricted but clear set of English and
Mandarin examples, thus reaching a basic degree of cross-linguistic validity.
The first system classifies NPs via the animacy and gender features, which de-
scribe discourse referents as independent agents (Yamamoto 1999). Languages such
as Thai, Basque and Swedish have different gender-assigning morphological systems
(classifiers, case markers and suffixes, respectively, cf. Grinevald 2000). In each of
these languages, NPs are marked as belonging to animateness or gender categories
(Corbett 1991: 8–10), with inanimate NPs conceived as describing non-agentive
entities. In English, the tri-partite distinction among the pronouns he, she and it
can be used to illustrate this system. It is known that the semantic coherence of
a text can be established when a pronoun and an antecedent NP are assigned the
same gender (Kehler 2011). Gender assignment is dependent on many factors, but
the occurrence of the wrong pronoun usually renders a mini-text uninterpretable,
a fact we represent via the symbol “#”:
(1) The woman came to the party. She/#it had a good time.
(2) The ball rolled under the table. It/#he was kicked by the boy.
The NP the woman is assigned female gender (viz. she in (1)), while the ball (2) is
assigned non-animate or neutral gender (viz. it in (2)) rather than animate (female)
gender, lest the mini-text be incoherent. Thus, animacy/gender assignment systems
for NPs can mark NPs as denoting an animate or inanimate referent in discourse.
The second system classifies NPs via features that describe qualities of the dis-
course referents they denote. Classifier languages such as Mandarin include “bare”
NPs such as shui ‘water’, which individuates water as a certain type of liquid.
Mandarin NPs must occur with a classifier that assigns them to a “counting” class,
e.g. Yībēi shuǐ ‘cup of water’ (Grinevald 2000: 74–76). A similar pattern holds for
adjectives, which are part of the structure of Mandarin NPs, e.g. hóngsè yībēi shui
‘cup of red water’. In English, (concrete) mass nouns have similar properties, as
they individuate discourse referents principally via their defining qualities (Pelletier
2009), as shown in (3)-(4).
(3) Gold is rare these days.
(4) Cool water is always refreshing.
The nouns gold and water are mass NPs and, unlike their Mandarin counterparts,
can occur “bare”. Qua mass NPs, they cannot directly combine with many, and
individuate the substance that a discourse referent is made of (e.g. gold in (3)). Ad-
jectives can also combine with NPs to individuate a quality of a referent, e.g. cool
in (4). In such a case, the status of the phrase cool water depends on the syntactic
theory we adopt (cf. Aikhenwald 2000 vs. Baker 2003). Nevertheless, distinct syn-
An Account of the Vision-Language Interface 383
tactic theories equally acknowledge that this phrase denotes a physical quality of
a certain referent.
The third system classifies NPs via features describing the shape of the discourse
referents that NPs denote. Most classifier languages contain a sub-class of classifiers
that assign this type to the NPs they combine with (Aikhenwald 2000). For instance,
Mandarin tian is a classifier that can be used to describe trousers, sticks and fish:
objects with a cylindrical-like shape. Dozens of such classifiers can be identified
in this and other classifier-based languages (Grinevald 2000: 70–72). In English,
NPs such as the sphere and the box are assigned this type as they describe objects’
shapes, viz. (5)-(6):
(5) The metal sphere was fairly big.
(6) The box of chocolates contained some pralines.
The noun sphere is, by definition, a noun describing a shape, like dozens of similar
English nouns (Jackendoff 2012: 1136–1337). The noun box not only describes a
certain canonical shape of container, but also an object with certain functions. It
can combine with the count noun chocolates (a plural form of the singular, mass
noun chocolate) in a partitive construction, and describe a rectangular container for
sweets (Acquaviva 2008).
The fourth system classifies NPs via the features describing the number and
quantity of referents that NPs can denote (Harbour 2007; Acquaviva 2008). The
English system contains plural and singular morphemes, and partitive constructions
such as the box of chocolates in (6), which denote a so-called group NP (Chierchia
1998). English also has a complex system of quantified NPs, viz. (7)-(8).
(7) Every boy went to the park.
(8) At least three men bought an ice-cream.
The quantified NPs every boy and at least three men denote distinct quantities of
(types of) referents, although they do so in distinct manners. For instance, every boy
in (7) individuates quantities of referents that are context-sensitive. What counts
as “every” depends on a context of discussion: in a context with four boys, every
boy denotes such a quartet (Peters and Westertåhl 2006: ch.1). However, at least
three men involves a more complex syntactic structure and individuates a range of
quantities that goes from a minimum of three to an undefined maximum (Nouwen
2010: 245–246). More could be said about quantified NPs, as there is an ample
literature on this topic (cf. Peters and Westertåhl 2006). Thus, languages seem to
include a system that can assign a type to NPs via the quantity of referents they
individuate, mirroring the quantity type of objects.
Let me summarize, before proceeding to the analysis. I have discussed four of the
possible sub-systems that individuate object and NP types, basing my choice on the
close resemblances and established status of types between the vision and language
domains. I have also pointed out some differences between the classifying systems
of the two domains. While visual systems assign visual (semantic) types to objects
384 Francesco-Alessio Ursini
according to the concepts they instantiate, the equivalent assignment systems for
NPs are morpho-syntactic in nature (case markers, classifiers, plural morphology).
The case in point, the partitive NP the edge of the table seems to be assigned a sem-
antic type (count and singular, “edge” shape) that mirrors that of its corresponding
object. Thus, it seems that the classification principle(s) for tokens (via types), and
relations/interactions among types, work in parallel fashion across the two domains.
The questions underpinning our goal, then, are (a) whether we can define con-
nections between the two systems based on these parallels, and (b) whether these
connections indeed correspond to phenomena involving our two systems.
consider colour as a sub-type of the texton (a visual object) type. We can represent
this relation as a part-of/type-of relation, colour⊑texton. Note furthermore that we
can define colour as the union of more specific types of colour, one minimal example
being red⊔blue=colour. Since the relation red⊑colour holds as a consequence, we
can have type hierarchies, e.g. red⊑colour⊑texton (cf. Smith 1995; Asher 2011: ch.2–4
for NP types). We can now define the classification structures that describe object
and NP types.
In words, this definite NP instantiates the blue type (via the adjective blue’) and the
sphere’ shape type. Note that we could offer richer representations, and, for example,
represent quantity types for both the object and NPs, but we offer these simpler
classifications for space reasons.
We can now discuss how we can capture the possible relations and connections
that can emerge between these two systems. The classifications for objects and NPs
constrain the types of relations that can emerge between objects, NPs and their
types, since they are governed by the same underlying principles. This intuition
can be made precise via the notion of infomorphism. Given our two classifications
VO and NP, an infomorphism is defined as a pair of functions f’=<f, g> such that
g(n)⊨O* if and only if o⊨f(N*). In words, the classifications underpinning objects
and NPs can be connected via two symmetrical relations/functions. First, the func-
tion g maps NP tokens to their matching object tokens. We have g(n)=o for any n
and o belonging to N and O, respectively (Barwise and Seligman 1987: ch.3). Second,
the function f maps NP types to object types. We have f(n*)=o*, for any n* and o*
belonging to N* and O*, respectively. Consequently, objects “connected” to NPs
are classified via object types (the g(n)⊨O* part), and NPs can instantiate NP types
connected to object types (the o⊨f(N*) part).
Consider again our edge example. The NP the edge “describes” a certain object
as instantiating an object type: an edge shape, singular in number. This object, in-
stantiating the “edge” type, finds its “description” in an NP type that matches this
type: an NP describing a singular shape, such as the edge. We thus have two distinct
types of connections: one between NPs and the object they refer to, and one between
the types that objects and NPs instantiate. We thus may predict, although in a fairly
indirect way, that these constraints can offer models of cross-modal patterns and
processes involving objects and NPs. I discuss three phenomena involving cross-
modal patterns in the next section.
the object c can instantiate the “group” or the “plural” quantity type: c⊨(group⊔plu-
ral⊑quantity).
According to the definition of infomorphism, then, semantic NP types are related
to object types: the function f establishes the mapping f(type)=type’. In the case of
the union type (a⊔b), f can map this type onto the corresponding type (i.e. we have
f(a⊔b)=c’), or to the union of the two corresponding types (i.e. we have f(a⊔b)=-
a’⊔b’). Importantly, an infomorphism can involve surjective (“many-to-one”) or
injective (“one-to-one”) functions (Barwise and Seligman 1997: 100–101). In our
“lump of coins” case, we can either have f(group⊔plural)=amb’, (i.e. one NP type for
two object types) or f(group⊔plural)=group’⊔plural’. In other words, an infomor-
phism can either connect sum types from one classification to another, or it can
match the sub-types of one type to the sub-types of another type.
An empirical consequence is that two possible patterns emerge by which NPs
can describe this complex object type. In the first pattern, each of the object types
is described via a matching NP. The two resulting NPs belong to distinct semantic
types. This seems to be the case in English, where the pair coins and money in-
volve respectively a count, plural NP and a non-count (mass) NP (cf. how many
coins vs. how much money), and both can be used to describe our lump of coins.
We can represent the classifications of these NPs as coins⊨(plural’⊑quantity’) and
money⊨(group’⊑quantity’), respectively (cf. Pelletier 2009: ch. 1). In other words,
coins is assigned the semantic sub-type plural, within the type quantity, while money
is assigned the sub-type group. In the second pattern, one NP can describe both
types, thus making it semantically ambiguous. This is the case of the Mandarin NP
jīn-qián, since its meaning is ambiguous between a plural and a group interpretation.
This ambiguity can be simply represented as jīn-qián⊨amb’ (for ambiguous), as our
discussion suggests.
Overall, we capture this specific form of cross-linguistic lexical variation as a con-
sequence of the mappings between object and NP types being “flexible” (cf. Corbett
1991 2000: ch.2). Note that we can also expect intra-linguistic lexical variation to occur.
Mandarin NPs are generally ambiguous with respect to quantity (Chierchia 1998;
Acquaviva 2008: ch.4), but ambiguous NPs can also occur in English, often based on
abstract nouns (e.g. many experiences vs. much experience (Pelletier 2009: ch.1). Intra-
lexical variation may involve differences concerning whether the infomorphisms
defined between some sub-types of objects and NPs have different properties from
those defined over other sub-types (e.g. quantity vs. shape). Such “local” connections
can be established and accounted for (Barwise and Seligman 1997: ch.2).
We proceed to our second phenomenon: reference patterns. In the definition of
infomorphism, the notion of “reference” can be modeled as a relation between object
and NP tokens, conceived as extra-linguistic entities (cf. Abbott 2010: ch.1–2). The
different reference patterns emerging across these two languages can be illustrated
via (9)–(11). Each of these sentences can describe the location of our “lump” of coins.
(9) The coins are in my pocket
(10) The money is in my pocket
388 Francesco-Alessio Ursini
between these facts, learning the meaning of NPs and the objects they refer to can
be seen as a process of strengthening these connections over time.
5. Conclusions
In this chapter I have offered an account of the relations and connections between
the cognitive-semiotic systems of vision and language, as reflected in object recog-
nition processes and semantic properties of NPs. I discussed how each system clas-
sifies objects and NP tokens via a finite set of types, and proposed that the properties
and relations of these classifications constrain the types of relations that can emerge
between the two systems. Further, I offered an account of these connections based
on the notion of infomorphism: a mapping between the tokens and types of each
classification that preserves and connects their relations.
If we assume that such an infomorphism can be defined between these clas-
sifications, then lexical variation, reference patterns, and semantic memory patterns
can be accounted for, in part, as phenomena that involve the connection between
vision and language. This fact suggests that the cross-modal analysis presented in
this chapter can shed light on how to reach the second goal formulated at the onset
of the chapter: to account for the properties underpinning the ontologies of distinct
but connected cognitive-semiotic systems. As pointed out, the choice of types was
partial, as the aim was to offer a concise explanation of these phenomena rather than
a very thorough one. I hope that a more thorough account along these lines will offer
a broader perspective, and a more accurate answer to these and similar phenomena.
Per Aage Brandt
Chapter 25
Linguistic Theory in the Framework of
Cognitive Semiotics: The Role of Semio-Syntax
1 The sentence tree, or phrase marker, uses the format of Head Driven Phrase Structure
Grammar (HPSG) (e.g. Zhang 2009).
392 Per Aage Brandt
grammars, and their reluctant followers in cognitive linguistics, in fact all accept
the design proposed by Noam Chomsky in his early writings (Chomsky 1957, 1965),
namely the computer-inspired concatenative rewriting rules as in (2).
(2) S –> NP ˆAux ˆ VP
VP –> V ˆ NP
NP –> Det ˆ N
…
At least since the author’s iconic dedication in Lucien Tesnière’s Éléments de syntaxe
structurale (Tesnière 1965), sentence trees have occasionally been used jokingly, as
autonomous expressive units, to stress the intuition that the tree-like articulation of
grammatical components, the stemma, as Tesnière called it, does make sense as such.
Tesnière was maybe the first linguist to problematize the linearism of grammatical
thinking. He saw that grammatical tree structures are useful if they show the principles
that make word combinations meaningful and hence create sentence meaning. He saw
that stemmas, tree structures, are not just echoes of linear word combinations but that
they instead are autonomous organizers of meaning. Syntactic compositions are sem-
antic operations, he thought. Words already have meanings, either as categorial terms
(lexemes) or as relational terms (grammatical morphemes), but this does not fully ex-
plain the meaning or the principle of their integrations into meaningful sentences; the
syntactic operations that create meanings out of these meanings have to be understood
in their own right, as the real creators of sentence meanings. As I explained in my early
work (Brandt 1973), however, Tesnière did not succeed in finding the structural laws of
nodes and morpho-lexical integration in the sentence and utterance perspective, which
is admittedly a huge task. However, he managed to challenge the linearism of traditional
syntactic analysis, the linearist belief in the semantic miracle of meaning-making word
contact that still dominates linguistics all together and that often lives under the notion
of “compositional sentence meaning”. We must, however, ask what is meant by a “com-
position” of words. Is it just phonetic word-to-word contact?
Let us try to see language in a larger perspective. Language allows our mind-brain
to communicate with other mind-brains and to shape itself through expression. It
opens an access to knowledge, feelings, and thoughts that exist “out there” in the
community of minds we call culture. This cultural universe of knowledge, feelings,
and thoughts is the world of meaning. The aspect of language we refer to as seman-
tics thus concerns the meaningful exchanges between the individual speaker and
hearer and a given community of language speakers surrounding the individual
with equally meaningful discourse.
In individual minds, meaning emerges as “chunks” or “portions” of thought that
seem to form a rhythmic flow of ideational units, each equipped with both a mini-
mum of figurative features and some intentional traits, including the predicative
schema of attributing something to something (Brandt 2015).
According to common cognitive semantics, and in particular the current theories
of mental spaces and frames (e.g. Fauconnier and Turner 2002), our basic units of
394 Per Aage Brandt
6 This applies both to the original construction grammar (Goldberg 1995) and to later,
“radical” construction grammar (Croft and Cruse 2004).
Linguistic Theory in the Framework of Cognitive Semiotics 395
The “pairings” of construction grammar are really expanded word signs in Saussure’s
and Hjelmslev’s sense: signifier/signified or expression/content interdependencies. But
if we want to know how, in a given language, an infinity of contents and an infinity of
expressions can possibly be connected in an intersubjectively stable and fast-processed
way, we need to study the finite architecture of integrative syntactic structure that pre-
pares the mind-brains of communicating subjects to immediately, in real time, “code”
and “decode” the expressions involved. It is essential to notice that the rhythm and
tempo of thinking (at least our thinking-for-speaking) correspond rather directly to
the rhythms and speeds of the exchange of utterances, a phenomenon that makes
interlocutors experience oral dialogues as events of shared thinking (see Slobin 2003).7
In fact, there is no smooth transition from minimal linguistic signs (“pairings”)
to maximal units such as utterances. Instead, there are levels of organization, and
structures of one level are integrated into units of the next level by certain gram-
matical principles that might be in part language-specific but still comparable to
principles found to be functioning in other languages. In terms of bindings hold-
ing between expressions and contents, we may consider an integrative, concentric
semio-cognitive model such as the one shown in Figure 3.
Figure 3. S emio-linguistic grammar. Since the model is still organized in terms of Forms and
Meanings, that is, signifiers and signifieds, and since each level of organization
presents its own “pairings”, we might well consider this view as a manifestation of
a revised construction grammar – a semio-linguistic construction grammar.
7 Hermann von Helmholtz has already argued that thought moves at a finite rate. In
this century, research on speed of thought is centered on attention and working
memory (see René Marois and the many works of his laboratory at Vanderbilt). On
attention, see also Oakley (2009). It is generally believed that thinking-for-speaking
is approximately less than four times faster than speech.
396 Per Aage Brandt
Relations (I) between lexemes and cognitive, conceptual categories integrate into
clauses (II) structured by morphemes that are linked to cognitive schemas relating
the categories. Syntactic clauses and phrases composed of lexemes and morphemes
are integrated into sentences (III), which both constitute the semantic simulations of
3D thought needed for transmission, and provide the expressive 1D strings needed.
Sentences are further integrated into enunciative (person-oriented) and pragmatic
(situation-oriented) acts, which correspond to, and are thus semiotically linked
to, concrete single expressive acts of thinking, deciding, evaluating, wondering,
desiring, feeling etc. The signifiers of the expressive levels (RI – RIV) carry markers
of their type of content as their signifieds, and these again bind to the semantic
entities of the corresponding types and levels, in the linguistically configured minds
of speakers and hearers.
According to this model, the linearization of meaning happens step by step, or
level by level, in processes of “languaging-while-thinking”. Delinearization of utter-
ance strings correspondingly also happens step by step and level by level in pro-
cesses of decoding spoken, signed or written language. When words are integrated
into clauses, their meanings are thus specified and integrated into more compre-
hensive meanings; this phenomenon is active in all levels of transition from R I to
R IV. Semio-syntax – all that happens from words to utterances – is therefore mean-
ingful, literally: it performs an all-important semantic task; grammar reshapes and
simulates meaning for transmission. Grammar takes single structures of thought as
an input and rebuilds them, so to speak, into single syntactico-semantic structures.
These “rebuilt” simulatory structures have the extremely important advantage that
they can be linearized, due to their tree format, which means they have an output
going to phonetics and gesture, when we speak, sign or write. When listening to
speech in a known language, we inversely automatically delinearize the perceived
expressions and make grammatical sense of them, which creates semantic structures
that are input to thought as such.8
The linearizing and delinearizing functions thus refer to a semio-syntactic
sentence meaning, which is expressed by linearization and reconstructed by de-
linearization in language reception. The reconstruction in question refers to the
intersubjective situation in which a semio-syntactic construction takes place in the
speaker’s mind and a reconstruction of it in the hearer’s mind. When we write and
immediately read what we just wrote, we are performing both roles – construction
and reconstruction. This allows us to monitor and modify our sentence production.
Often, our first construction comes automatically, whereas the reconstruction in this
self-reading process is conscious, slower, and – most importantly – presents itself
8 So instead of considering the local structures as linear per se, we have to consider
them as compatible with a set of possible linear orders, hence we have to isolate
linearization and delinearization as specific processes, different from the structures
themselves. The term linearization was maybe first used by Willem Levelt (1982),
who analyzed discourse phenomena.
Linguistic Theory in the Framework of Cognitive Semiotics 397
9 All syntactic theories that allow trees with terminal strings, like the example in Fig-
ure 1, show deep negligence of the two core issues of syntax, namely how to under-
stand its integrative semantic mechanism and how to understand its mechanisms of
linear manifestation. However, James McCawley’s generative semantics (McCawley
1976) drafted a compelling view of the connection between grammar and meaning
that might be understood as a forerunner of what I am proposing here.
398 Per Aage Brandt
There is form, and there is meaning, and their pairing is called grammar. It is evident,
however, that this model, sporting only two instances, expression and content, falls
short of accounting for the relation of grammatical meaning to thought, as well as for
the relation of phonetics to writing; these relations have to be modeled as semiotic
functions in their own right. Furthermore, in a level superimposed on the level of
sentence-structured utterance, we will have to add the function of enunciation as a
suprasegmental expressive instance that inscribes the structure in an intersubjective
conversation frame by letting it be the content of a deictic context of interpersonal
address (‘I/we say to you that X’; ‘this (X) is what I/we intend to let you know or
think’). This function conveys the “temperature” of speech, irony, speech act value,
emotion, etc., and can be more or less lexicalized and grammaticalized.
Finally, the enunciated structure will rhetorically signify or select a pragmatic
frame determining its particular situational, social meaning – often different from
the literal conceptual meaning structured by grammar. The graph shown in Fig-
ure 5 exploits the Hjelmslevian principle of semiotic recursion10, whereby a semiotic
function can appear as embedded in the expression (signifier) of another function
11 This principle is also very useful in the analysis of complex semiotic architectures
connecting texts to music or to images or other semiotic sorts, or “semiologies”.
400 Per Aage Brandt
13 School grammar questions: Where is the verb? Where is the subject? Where is the
object? What does this SVO sentence mean?…
14 This is of course a postulate; it is only a working hypothesis, but it works in school
grammar. Note that if the operation itself is marked by a morpheme, a binary inte-
gration will comprise the modified, the modifier, and the morpheme marking the
integration.
15 The “school grammar” approach allows us to at least temporarily suppose we know
what we mean by verb, noun, proposition, adverb, and so forth. The problem of
relating syntactic functions to word classes is left open for the moment, because its
treatment depends on the syntax itself.
16 A proto-concept is a deliberately vague phenomenological idea that may correspond
to a set of actual concepts sharing some schematic core relational meaning. Aboutness
is a proto-concept in this sense. I suspect that the stagnation of generative seman-
tics was due to the uncertainty as to the conceptual or proto-conceptual status of
the semantic categories used for the characterization of case structures. The same
problems were present in Fillmore’s case discussions (Fillmore 1968), which gave
rise to a notion of frames and a frame grammar that now constitutes the core of the
semantics of construction grammar. However, frames do not explain the finiteness
and universality of sentence syntax. We still need a set of proto-conceptual cases
that can function across frames.
402 Per Aage Brandt
A dative complement will be taken by the preexisting, often transitive, cluster and
add the proto-concept of “going-somewhere”, of a “destination”, whether kinetic,
as a direction, or by finality, as a final cause: a directive proto-concept. The result of
this operation can again integrate items pertaining to a proto-concept of “coming-
from-something” or “emerging from something”, such as springing from a place, a
ground, a source, a material cause, a force, or an instrumental facilitator, a mental
disposition – a projective proto-concept.
The locative complement is a framing supplement that can be integrated into
the more or less scenario-like, or frame-like, cluster produced so far by some of the
above. It adds a sort of anchoring space-and-time determination to the scenario.
The heavily enriched cluster finally, before reaching its limit of saturation, “takes”
an epistemic framer that indicates the truth-value status or the modal and evidential
value of the entire set-up, which is now a saturated meaning, a representation of
some minimal thought, and forms a sentence. The total syntactic cluster can finally
integrate other syntactic clusters as complements. This elementary principle is es-
sential to the basic recursivity of syntax. Paratactic connections between sentences
are the last instance of integration: and, or, but, comma, and certain logical con-
nectors can appear here; there is always a certain analogy between the structures
of two connected sentences, which typically allows them to share local and modal
determinations, as in (3).
(3) The sun shines, and the birds are singing.
That is, both states are supposed to happen at the same time and location (C6), and
the sentences share evidentiality (C7) across the connector and. None of this is
explicitly stated, but the connector (C8) transfers these implicit values.
The cascade of proto-conceptual integrations considered so far can be summar-
ized as an ordered series of binary complementations to a cluster built on a proto-
conceptual head. We get a locally maximal and canonical sequence of operations
(<–) inserting a complement – C – of a certain rank x (Cx) to an already created
cluster, as shown in (4).
(4) (((((((((Head C0 <– subject C1) <–predicative C2) <– objective C3) <– directive
C4) <– projective C5) <– locative C6) <– epistemic C7) <– connective C8)
Most sentences contain only few of these complements; but the complement slots
not filled may still be semantically active, as C6 and C7 in (3). This cascade of proto-
conceptual complementations is not only active in building verbal phrases and
clauses of many sorts, but also appears to underlie nominal, adjectival and adverbial
phrases. A canonical sequence of possible proto-conceptual complements in all
phrase types could in fact explain the fast and automated processing in production
and reception of speech, and also the possibility of converting verb phrases to
noun phrases, active constructions to passive constructions, etc.17 A generalized
Figure 6. Semio-synactic construction for Jensen sneezed the napkin off the table.
Complement nodes (1–8) are types of available, ordered integrations and carry the
rank number of the operation of integration in the canonical cascade ((4), above).
As the example shows, the deictics, articles and demonstrative and possessive pro-
nouns (the, a; this, that, his, her…), are treated as nominal heads,20 which may be
18 Since both thinking and semio-syntax take place online in the mind-brain, we might
speak of a spontaneous self-simulation in the thinking mind.
19 Fauconnier and Turner (2002: 370) propose seeing this construction as a blend of
a sneezing scenario and a moving scenario. Even so, the intelligibility of the result
needs an underlying semio-syntax as the one presented here.
20 In constructions that leave the noun bare, and languages that do not have articles,
the nominal heads are noted as presenting a Ø article; this Ø filler is still going to be
significant. In general, complements that are not worded can still be anaphorically
active. See Obiamalu 2013 for an interesting discussion of determiners and Ø articles
in Igbo.
404 Per Aage Brandt
surprising but they do mean “look, something is going on”, and they do take a
subject-substantive (C1). The caused-motion construction in the example allows
the otherwise intransitive verb sneeze to “take” both an object (C3) and a directive
complement (C4) indicating where the napkin is going as a result of the activity.
We may accept the rather sketchy explanation by a mapping and a grammatical
“blending” of two constructions21 – one intransitive, the other motion-causative –
in order to account for sentences such as (5). Still, when constructions “blend”, if
they do, they need a design protecting the result from chaos – a design necessary
for possible constructions to be meaningful.
Let us reconsider the Danish sentence we initially discussed in Section 1, (1). Its
semio-syntactic analysis is given in Figure 7.
The lexical and phrasal integrations create the particular constructions. The iter-
ations of parts of the canonical cascade thus account for meaning construction in
complex utterances. However, the integration stemmas (trees, phrase markers),
using Tesnière’s term, should also make it possible to describe the process of ex-
pressive linearization. The alternative to the “terminal string”, in the perspective I
am proposing here, is built on a two step procedure:
21 The syntax of this example, as discussed by Goldberg and Turner and Fauconnier,
is thoroughly and critically reanalysed by Line Brandt, The Communicative Mind
(2013: 333–363).
Linguistic Theory in the Framework of Cognitive Semiotics 405
22 Note that there are often several different correct linear versions of the “same”
stemmatic sentence; inversion rules are therefore a major challenge for language
learners. Different linearizations can be motivated by discourse-based constraints.
These motives are picked up by competent hearers and readers while de-linearizing
sentences. The process of delinearization reverses the linearization rules and may
406 Per Aage Brandt
involve local strategies of partial stemmatization before reaching the level of global
stemmatization. Studying these processes and strategies of delinearization in specific
languages may help programming machine parsing of natural language overcome
the hurdle of not being able to reach the instance of meaning production in language.
Automated natural language parsing should target the reproduction of stemmas be-
hind strings. The same would be true of automated translation, which would thereby
be less semantically blind.
Linguistic Theory in the Framework of Cognitive Semiotics 407
Figure 8. Semio-syntactic structure of example (9), the first sentence of Hjelmslev (1948)
The details of such an analysis may, of course, be discussed, contested and put to a
test by extended grammatical research, and especially by systematic comparative
analyses of textual corpora in given languages.23 With their specific linearization
editing rules, and further analyses of the specific proto-conceptual integrative oper-
ations underlying constructions of different types, such analyses will of course
turn out to be somewhat more complicated than the few simple ones shown here.
23 The author has had the opportunity to apply, with his students, the stemmatic semio-
syntactic analysis to corpora of sentences in Arabic, Japanese, Mandarin Chinese,
and Lithuanian, with interesting preliminary results. Entire grammars should be
written in order to test the validity of the model.
408 Per Aage Brandt
Ameel, E., Malt, B.C., & Storms, G. (2008). Object naming and later lexical devel-
opment: From baby bottle to beer bottle. Journal of Memory and Language, 58,
262–285.
American Psychiatric Association et al. (2013). Diagnostic and statistical manual of
mental disorders, (DSM-5®). American Psychiatric Pub.
Amorim, M. A. (2006). Embodied spatial transformations: “Body analogy” for the
mental rotation of objects. Journal of Experimental Psychology: General, 135(3),
327–347.
Amoruso, L., Gelormini, C., Aboitiz, F., González, M.A., Manes, F., Cardona, J.F. &
Ibanez, A. (2013). N400 ERPs for actions: building meaning in context. Frontiers
in Human Neuroscience, 7, 57.
Anderson, J. R. (1996). ACT: A simple theory of complex cognition. American Psy-
chologist, 51(4): 355–365.
Andersson, S. (1980). Matens Roller – Sociologisk Gastronomi [The Roles of food.
Sociological Gastronomy]. Stockholm: Almqvist & Wicksell Förlag.
Andrén, M. (2010). Children’s gestures between 18 and 30 months. Lund: Mediatryck.
Anstis, S., Verstraten, F. A. J., & Mather, G. (1998). The motion aftereffect. Trends in
Cognitive Sciences, 2(3), 111–117.
Apresyan, J. D. (1995). Selected Works. Vol. 1 and 2. Moscow: Yazyki Russkoj Kul’tury.
Apresyan, J.D. (2004). Interpretation verbs: the semantic structure and properties
[Interpretatsyonnyje glagoly: semanticheskaja structura i svoystva]. Russian
language in the scientific perspective [Russkij jazyk v nauchnom osveshchenii],
1, 5–22. Moscow: Institute for Russian language named after V.V. Vinogradov.
Arbib, M. A. & Bickerton, D. (2010). The emergence of protolanguage. Philadelphia:
John Benjamins.
Arbib, M. A. (2003). An action-oriented neurolinguistic framework for the evolution
of protolanguage. Typescript, University of Southern California.
Arbib, M. A. (2005a). From monkey-like action recognition to human language:
An evolutionary framework for neurolinguistics, Behavioral and Brain Sciences,
28(02), 105–124.
Arbib, M. A. (2005b). Interweaving protosign and protospeech: Further devel-
opments beyond the mirror. Interaction Studies, 6(2), 145–171.
Arbib, M. A. (2006). Action to language via the mirror neuron system. Cambridge:
CUP.
Arbib, M. A. (2008a). From grasp to language: Embodied concepts and the challenge
of abstraction. Journal of Physiology-Paris, 102(1–3), 4–20.
Arbib, M. A. (2008b). Holophrasis and the protolanguage spectrum. Interaction
Studies, 9(1), 154–168.
References 411
Barthes, R. (1988 [1964]). Semantics of the object. The semiotic challenge, 179–190.
Oxford: Blackwell.
Barthes, R. (1991). Mythologies (transl. Annette. Lavers). New York: The Noonday
Press
Bartlett, F. C. (1995 [1932]). Remembering. A Study in Experimental and Social Psy-
chology. Cambridge: CUP.
Barwise, J., & Etchemendy, J. (1990). Visual information and valid reasoning. In
W. Zimmerman (ed.), Visualization in mathematics, 8–23. Washington, DC: Math-
ematical Association of America.
Barwise, J., & Etchemendy, J. (1995). Heterogeneous logic. In B. Chandrasekaran,
J. Glasgow, and N. H. Narayanan (eds.), Diagrammatic reasoning: Cognitive and
computational perspectives, 211–234. Menlo Park, CA: AAAI Press.
Barwise, J., & Seligman, J. (1997). Information flow: the logic of distributed systems.
Stanford, CA: CSLI press.
Bates, E., Benigni, L., Bretherton, I., Camaioni, L., & Volterra, V. (1979). The emergence
of symbols: Cognition and communication in infancy. New York: Academic Press.
Bateson, G. (1958). Naven: A survey of the problems suggested by a composite picture
of the culture of a New Guinea tribe drawn from three points of view (2nd ed.).
Stanford: Stanford University Press.
Battison, R. (1974). Phonological deletion in American Sign Language. Sign language
studies, 5(1), 1–19.
Beardsworth, A., & Keil, T. (2002). Sociology on the menu: Invitation to the study of
food and society. London: Routledge.
Bednarik, R. G. (1992). Paleoart and archaeological myths. Cambridge Archaeological
Journal, 2, 27–43.
Bednarik, R. G. (1998). The ‘australopithecine’ cobble from Makapansgat, Sout Af-
rica. South African Archaeological Bulletin, 53, 4–8.
Bednarik, R. G. (2003). A figurine from the African Acheulian. Current Anthropology,
44, 405–413.
Bednarik, R. G. (2008). The origins of symboling. Signs, 2, 82–113.
Beebe, L. (1995). Polite fictions: instrumental rudeness as pragmatics competence.
In Linguistics and education of language teachers. Ethnolinguistics. Georgetown
Univ Press. 154–168.
Belyaev I., Kapustyan V., & Rykov V. (2008). Semiotic systems and business pro-
cesses [Znakovye sistemy i bisnes-processy]. Russian Informative Resources [In-
formatsionnye resursy Rossii], №5.
Benfey, O. T. (1960). The role of imagination in science: Van’t Hoff’s inaugural ad-
dress. Journal of Chemical Education, 37(9), 467–470.
414 References
Bergen, B. K. (2012). Louder than words: The new science of how the mind makes
meaning. New York: Basic Books.
Berlin, B., & Kay, P. (1969). Basic color terms: Their universality and evolution. Ber-
keley, CA: University of California Press.
Bermúdez, J. L. (2003). Thinking without words. Oxford: OUP.
Bertamini, M., Makin, A., & Pecchinenda, A. (2013a). Testing whether and when
abstract symmetric patterns produce affective responses. PLoS ONE, 8(7), 1–1.
Bertamini, M., Makin, A., & Rampone, G. (2013b). Implicit association of symmetry
with positive valence, high arousal and simplicity. I-Perception, 4(5), 317–327.
Berwick, R., Friederici, A., Chomsky, N., & Bolhuis, J. (2013), Evolution, brain, and
the nature of language. Trends in Cognitive Sciences, 17(2), 89–94.
Bickerton, D. (1990). Language and species. Chicago: University of Chicago.
Bickerton, D. (1995). Language and human behavior. Seattle: University of Wash-
ington Press.
Bickerton, D. (2008). But how did protolanguage actually start? In M.A. Arbib and
D. Bickerton (eds), The Emergence of protolanguage: Holophrasis vs. compositional-
ity 167–174. Amsterdam: Benjamins.
Bitzer, L. (1968). The rhetorical situation. Philosophy and rhetoric, 1, 1–19.
Blomberg, J., & Zlatev, J. (2014). Actual and non-actual motion: why experientialist
semantics needs phenomenology (and vice versa). Phenomenology and the cog-
nitive sciences, 13(3), 395–418.
Bloom, L. (1993). The transition from infancy to language. New York: CUP.
Boden, M. A. (1977). Artificial intelligence and natural man. New York: Basic Books.
Boden, M. A. (2006). Mind as machine: A history of cognitive science. Oxford: OUP.
Bonvillian. J. D., & R. Folven, R. (1993). Sign language acquisition: Developmental
aspects. In M.D. Marschark & D. Clark (eds.), Psychological perspectives on deaf-
ness, 229–268. Hillsdale, NJ: Erlbaum.
Bosch, M., & Gascón, J. (2006). Twenty-five years of the didactic transposition. Bull-
etin of the International Commission on Mathematical Instruction (ICMI), 58, 51–65.
Bouissac, P. (2000). Steps toward evolutionary semiotics. Semiotica, 132, 317–342.
Bouissac, P. (2004). Criteria of symbolicity: Intrinsic and extrinsic formal properties
of artifacts. In Virtual Symposia Online, http://semioticon.com/, 1–8.
Bourdieu, P. (1984). Distinction. A social critique of the judgement of taste. Massa-
chusetts, MA: Harvard University Press.
Bousfield, D. (2008). Impoliteness in the struggle for power. In D. Bousfield, &
M. Locher (eds). Impoliteness in language. Studies on its interplay with power in
theory and practice, 127–155. Berlin: Mouton de Gruyter.
References 415
Bouzouggar, A., Barton, N., Vanhaeren, M., d’Errico, F., Collcutt, S., Higham, T.,
Hodge, E., Parfitt, S., Rhodes, E., Schwenninger, J.-L.,Stringer, C., Turner, E.,
Ward, S., Moutmir, A., & Stambouli, A. (2007). 82,000-year-old shell beads from
North Africa and implications for the origins of modern human behaviour. Pro-
ceedings of the National Academy of Sciences, USA, 104, 9964–9969.
Bove, C.F., Sobal, J., & Rauschenbach, B.S. (2003). Food choices among newly married
couples: Convergence, conflict, individualism and projects. Appetite, 40, 25–41.
Bowerman, M. (1978). The acquisition of word meaning: an investigation of some
current conflicts. In N. Waterson & C. E. Snow (eds.) The development of com-
munication, 263–287. New York: John Wiley and Sons.
Bowles, H. (2010). Storytelling and drama: Exploring narrative episodes in plays.
Amsterdam: John Benjamins.
Boydston, J. (2009): Gender as a question of historical analysis. A. Shepard &
G. Walker (eds.) Gender and change. Agency, chronology and periodisation. Chi-
chester, West Sussex: Wiley-Blackwell.
Brakel, J. v. (2000). Philosophy of chemistry: between the manifest and the scientific
image. Leuven: Leuven University Press.
Brandt, L. (2013). The communicative mind. A linguistic exploration of conceptual
integration and meaning construction. Newcastle: Cambridge Scholars Publishing.
Brandt, L., & Brandt, P.A. (2005). Making sense of a blend. In R. Mendonza Ib-
añez (ed.) Annual Review of Cognitive Linguistics, 216–249). Amsterdam: John
Benjamins.
Brandt, P. A. (1973). L’analyse phrastique. Introduction à la grammatique. Bruxelles:
AIMAV.
Brandt, P. A. (2004). Spaces, domains and meanings. Essays in cognitive semiotics.
Bern: Peter Lang.
Brandt, P. A. (2015). The rhythmic mind. http://papers.ssrn.com/sol3/papers.
cfm?abstract_id=2567815
Breheny, M. (2011). Experimentation-based pragmatics. In W. Bublitz & N. Norrick
(eds.) Handbook of pragmatics: Vol. 1 Foundations of pragmatics. Berlin: Mouton
de Gruyter.
Bressem, J., & Müller, C. (2014a). A repertoire of German recurrent gestures with
pragmatic functions. In C. Müller, A. Cienki, E. Fricke, S. H. Ladewig, D. McNeill,
J. Bressem, (eds), Body – language – communication. An international handbook
on multimodality in human interaction (Vol. 2, pp. 1575–1591). Berlin: De Gruyter.
Bressem, J., & Müller, C. (2014b). The family of away gestures. Negation, refusal and
negative assessment. In C. Müller, A. Cienki, E. Fricke, S. H. Ladewig, D. McNeill,
J. Bressem, (eds), Body – language – communication. An international handbook
on multimodality in human interaction Vol. 2, 1592–1604. Berlin: De Gruyter.
416 References
Bressem, J., Ladewig, S.H. & Müller, C. (2013). Linguistic annotation system for
gestures (LASG). In C. Müller, A. Cienki, E. Fricke, S. H. Ladewig, D. McNeill, &
S. Teßendorf, (eds), Body – language – communication. An international handbook
on multimodality in human interaction, Vol. 1, 1098–1124. Berlin: De Gruyter.
Bridges, A., & Van-Lancker Sidtis, D. (2013a). Formulaic language in Alzheimer’s
disease. Aphasiology, 27(7), 799–810.
Bridges, K., Van Lancker Sidtis, D., Sidtis, J. J. (2013b). The role of subcortical struc-
tures in recited speech: Studies in Parkinson’s disease. Journal of Neurolinguistics,
26, 591–601.
Brier, S. (2009). Levels of cybersemiotics: Possible ontologies of signification. Cog-
nitive Semiotics, 4, 28–62.
Brookes, D. T., & Etkina, E. (2009). “Force,” ontology, and language. Physical Review
Special Topics. Physics Education Research, 5(1).
Brooks, R. A. (1991b). Intelligence without representation. Artificial Intelligence,
47, 139–159.
Brooks, R.A. (1991a). Intelligence without reason. Proceedings, ICJAI-91, 569–595.
San Francisco: Morgan Kaufmann.
Brown, M. J. (2009). Models and perspectives on stage: remarks on Giere’s Scientific
perspectivism. Studies in History and Philosophy of Science, 40, 213–220.
Brown, P., & Levinson, S. (1987). Politeness: Some universals in language usage. Cam-
bridge: CUP.
Bruche-Schulz, G. (2013). On moving in conceptual space: Readers’ responses to a
parallel text in Chinese and English. In G. Cao & H. Chappell & R. Djamouri &
T.Wiebusch (eds.). Breaking down the barriers: Interdisciplinary studies in Chinese
linguistics and beyond, 49–77.Tapei: Academia Sinica.
Bruche-Schulz, G. (2014). Where semiosis begins when reading a text: On event
perception. Sign System Studies 42(2/3), 330–358.
Buccino, G., Lui, F., Canessa, N., Patteri, I., Lagravinese, G., Benuzzi, F., … & Rizzol-
atti, G. (2004). Neural circuits involved in the recognition of actions performed by
nonconspecifics: An fMRI study. Journal of cognitive neuroscience, 16(1), 114–126.
Bugnyar, T. (2011). Knower-guesser differentiation in ravens: Others’ viewpoints
matter. Proceedings of the Royal Society B, 278, 634–640.
Bühler, K. (1934). Sprachtheorie. Die Darstellungsfunktion der Sprache. Stuttgart:
Fischer.
Butler, C. S. (1988). Pragmatics and systematic linguistics. J. of Pragmatics, 12(1),
83–102.
Bybee, J, Perkins, R., & Pagliuca, W. (1994). The evolution of grammar: tense, aspect
and modality in the languages of the world. Chicago: University of Chicago Press.
References 417
Castro, L., & Toro, M. A. (2004). The evolution of culture: From primate social
learning to human culture. Proceedings of the National Academy of Sciences of
the USA, 101(27), 10235–10240.
Cavanagh, P., Labianca, A. T., & Thornton, I. M. (2001). Attention-based visual rou-
tines: Sprites. Cognition, 80, 47–60.
Chafe, W. (1987). Cognitive constraints on information flow. In R. Tomlin (ed.), Co-
herence and grounding in discourse, 21–51. Amsterdam: Benjamins.
Chafe, W. L. (1970). Meaning and the structure of language. Chicago: Chicago Uni-
versity Press.
Chalmers, D. J. (1997). The conscious mind: In search of a fundamental theory. New
York: OUP.
Chandler, D. (2007). Semiotics – The Basics. London: Routledge
Charles, N., & Kerr, M. (1988). Women, food, and families. Manchester: Manchester
University Press.
Chemero, A. (2009). Radical embodied cognitive science. Cambridge, MA: MIT Press.
Chernigovskaya, T. (2004). Cognitive struggle with sensory chaos: Semiotics of ol-
faction and hearing. Semiotica, 150, 61–76.
Chierchia, G., & McConnell-Ginet, S. (2000). Meaning and grammar: An introduction
to semantics. Cambridge, MA: MIT Press.
Choi, H., & B. J. Scholl. (2004). Effects of grouping and attention on the perception
of causality. Perception & Psychophysics, 66, 926–942.
Chomsky, N. (1957). Syntactic structures. The Hague/Paris: Mouton.
Chomsky, N. (1965). Aspects of the theory of syntax. Cambridge, MA: MIT Press.
Chomsky, N. (1980). Rules and representations. New York: Columbia University Press.
Chomsky, N. (1986). Knowledge of language: Its nature, origin, and use. New York:
Praeger.
Chomsky, N. (2010). Some simple evo devo theses: How true might they be for
language? In R.K. Larson, V. Dèprez, & H. Yamakido (eds.). The evolution of human
language. Cambridge: CUP.
Christiansen, F. V., & Rump, C. (2008). Three conceptions of thermodynamics: Tech-
nical matrices in science and engineering. Research in Science Education, 38(5),
545–564.
Christiansen, M. H., & Kirby, S. (eds.) (2003). Language evolution. Oxford: OUP.
Cid Jurado, A. (2002). El estudio de los objetos y la semiótica. Cuicuilco, 25, 1–19.
Cienki, A. (2005). Image schemas and gestures. In B. Hampe (2005), 21–442.
Cienki, A., & Müller, C. (2008). Metaphor, gesture and thought. In J. Gibbs (ed.),
The Cambridge handbook of metaphor and thought, 484–501) Cambridge: CUP.
References 419
Cienki, A., & Müller, C. (eds.) (2008). Metaphor and gesture. Amsterdam: John Ben-
jamins.
Clancy, P.M., Thompson, S.A., Suzuki, R., & Tao, H. (1996). The conversational use
of reactive tokens in English, Japanese and Mandarin. Journal of Pragmatics 26,
355–387.
Clark, A. (1997). Being there: Putting brain, body, and world together again. Cam-
bridge, MA: MIT press.
Clark, A., & Chalmers, D. (1998). The extended mind. Analysis 58, 7–19.
Clark, E. V. (1993). The lexicon in acquisition. Cambridge: CUP.
Clark, H. H. (1996). Using language. Cambridge: CUP.
Clinton, R. L. (1989). Marbury vs. Madison and judicial review. Lawrence, KS: Uni-
versity of Kansas Press.
Cogill-Koez, D., (2000). Signed language classifier predicates: Linguistic structures
or schematic visual representation? Sign Language and Linguistics, 3, 153–207.
Cohen, E., Namir, L., & Schlesinger, I. M. (1977). A new dictionary of sign language:
employing the Eshkol-Wachmann movement notation system. The Hague: Mouton.
Consigny, S. (1974). Rhetoric and its situations. Philosophy and rhetoric, 7, 175–186.
Cook, R. S., Kay, P., & Regier, T. (2005). The world color survey database: History
and use. Handbook of categorisation in the cognitive sciences. Amsterdam: Elsevier.
Coppin, P. W. (2011). Reconciling competing accounts of picture perception from art
theory and perceptual psychology via the dual route hypothesis. In B. Kokinov,
A. Karmiloff-Smith, & N. J. Nersessian (eds.), European perspectives on cognitive
science. Sofia: New Bulgarian University Press.
Coppin, P. W. (2014). Perceptual-cognitive properties of pictures, diagrams, and
sentences: Toward a science of visual information design (Doctoral dissertation,
University of Toronto, Toronto, Canada). Retrieved from https://tspace.library
.utoronto.ca/handle/1807/44108.
Corballis, M. C. (2002). From hand to mouth: The origins of language. Princeton, NJ:
Princeton University Press.
Corballis, M. C. (2009). Mental time travel and the shaping of language. Experimental
Brain Research, 192(3), 553–560.
Corballis, M. C. (2011). The recursive mind. The origins of human language, thought
and civilization. Princeton: Princeton University Press.
Corballis, M. C. (2013a). Mental time travel: a case for evolutionary continuity.
Trends in Cognitive Sciences, 17(1), 5–6.
Corballis, M. C. (2013b). Gesture as precursor to speech in evolution. In C. Müller,
A. Cienki, E. Fricke, S. H. Ladewig, D. McNeill, S. Teßendorf, (eds) Body – lan-
guage – communication. An international handbook on multimodality in human
interaction. Vol. 1, 466–480. Berlin: De Gruyter.
420 References
Corballis, M. C., & Lea, S. E. (1999). The descent of mind: Psychological perspectives
on hominid evolution. Oxford: OUP.
Corbett, G. (1991). Gender. Cambridge: CUP.
Corbett, G. (2000). Number. Cambridge: CUP.
Cosentino E., Adornetti I., & Ferretti F. (2013) Processing narrative coherence: To-
wards a top-down model of discourse. Open Access Series in Informatics, 32, 61–75.
Cosentino, E. (2011). Self in time and language. Consciousness and Cognition, 20(3),
777–783.
Cosentino, E. (2014). Embodied pragmatics and the evolution of language. Humana.
Mente. Journal of Philosophical Studies, 27, 61–78.
Cosentino, E., Adornetti, I., & Ferretti, F. (2013). Processing narrative coherence:
Towards a top-down model of discourse. Open Access Series in Informatics, 32,
61–75.
Cosentino, E., Baggio, G., Kontinen, J., Garwels, T., & Werning, M. (2014). Lexicon
in action: N400 contextual effect on affordances and telicity. In P. Bello et al.
(eds.) Proceedings of the 36th Annual Conference of the Cognitive Science Society,
2079–2084.
Couturat, L., Jespersen, O., Lorenz, R., Ostwald, W., & Pfaundler, L. (1910). Inter-
national language and science: Considerations on the introduction of an inter-
national language into science. London: Constable & Co.
Cowan, N. (2001). The magical number 4 in short-term memory: A reconsideration
of mental storage capacity. Behavioral and Brain Sciences, 24, 1–185.
Cowan, N. (2005). Working memory capacity. Hove: Psychology Press.
Croft, W., & Cruse, D.A. (2004). Cognitive linguistics. Cambridge: CUP.
Cross-cultural dictionary [Gesty i mimika v russkoy rechi. Lingvostranovedcheskij
slovar’]. Мoscow: URSS.
Csibra, G., Davis, G., Spratling, M. W., & Johnson, M. H. (2000). Gamma oscillations
and object processing in the infant brain. Science, 290(5496), 1582–1585
Culpeper, J. (1996). Towards an anatomy of impoliteness. Journal of Pragmatics,
25, 349–367.
Culpeper, J. (2009). Using language to cause offence. Cambridge: CUP.
D’Amato, M. R., Salmon, D. P., Loukas, E., & Tomie, A. (1985). Symmetry and tran-
sitivity of conditional relations in monkeys (Cebus apella) and pigeons (Columba
livia). Journal of the Experimental Analysis of Behavior, 44, 35–47.
D’Errico, F., Henshilwood, C., Vanharen, M., & van Niekerk, K. (2005). Nassarius
kraussianus shell beads from Blombos Cave: evidence for symbolic behavior in
the Middle Stone Age. Journal of Human Evolution, 48, 3–24.
d’Errico, F., Vanhaeren, M., Henshilwood, C. S., Lawson, G., Maureille, B., Gambier,
D., Niekerk, K. van. (2009). From the origin of language to the diversification of
References 421
De Jaegher, H. (2013). Rigid and fluid interactions with institutions. Cognitive Sys-
tems Research, 25/26, 19–25.
De Warren, N. (2009). Husserl and the promise of time. Subjectivity in transcendental
phenomenology. Cambridge: CUP.
Deacon, T. (1997). The symbolic species. New York: Norton.
Deely, J. (2001). Physiosemiosis in the semiotic spiral: A play of musement. Sign
System Studies, 29(1), 27–48.
Deely, J. (2009). Purely objective reality. Berlin: Mouton de Gruyter.
DeLoache, J. S., & Ganea, P. A. (2009). Symbol-based learning in infancy. In A. Wood-
ward & A. Needham (eds.), Learning and the infant mind, 263–285. London:
Oxford.
Deltete, R. J., & Thorsell, D. L. (1996). Josiah Willard Gibbs and Wilhelm Ostwald: A
Contrast in scientific style. Journal of Chemical Education, 73(4), 289.
Dennett, D. (1987). Cognitive wheels: The frame problem in artificial intelligence.
In Z.W. Pylyshyn (ed.), The robot’s dilemma: The frame problem in artificial intel-
ligence. Norwood, NJ: Ablex.
Dennett, D. (1991). Consciousness explained. Boston: Little, Brown and Co.
Dennett, D. (2001). The fantasy of first-person science. http://ase.tufts.edu/cogstud/
papers/chalmersdeb3dft.htm, accessed on 15 May 2015.
Derrida, J. (1967). De la grammatologie. Paris: Editions Minuit.
Derrida, J. (2003 [1967]). La voix et le phénomène. Paris: Editions Quadrige.
DeSilva, J., & Lesnik, J. (2006). Chimpanzee neonatal brain size: Implications for
brain growth in Homo erectus. Journal of Human Evolution, 51, 207–212.
Di Paolo, E. (2005). Autopoiesis, adaptivity, teleology, agency. Phenomenology and
the Cognitive Sciences, 4, 429–452.
Diamond, A. (2002). Normal development of prefrontal cortex from birth to young
adulthood: Cognitive functions, anatomy, and biochemistry. In D. G. Stuss & R. T.
Knight (eds.), Principles of frontal lobe function, 466–503. London: OUP.
Dijkstra, K., Bourgeois, M. S., Allen, R. S., & Burgio, L. D. (2004). Conversational
coherence: Discourse analysis of older adults with and without dementia. Journal
of Neurolinguistics, 17(4), 263–283.
Dingemanse, M, Torreira, F., & Enfield, N.J. (2013). Is “Huh” a universal word? Con-
versational infrastructure and the convergent evolution of linguistic items. PloS
ONE 8 (11), e78273.
Dingemanse, M. (2011). The meaning and use of ideophones in Siwu. Ph.D. disser-
tation, Max Planck Institute for Psycholinguistics/Radboud University Nijmegen.
Ditman, T., & Kuperberg, G. R. (2007). The time course of building discourse coher-
ence in schizophrenia: An ERP investigation. Psychophysiology, 44(6), 991–1001.
Dobrova, E. V. (2011). Body language [Jazyk zhestov]. Мoscow: AST.
References 423
Donald, M. (1991). Origins of the modern mind. Three stages in the evolution of culture
and cognition. Cambridge, MA: Harvard University Press.
Donald, M. (1998). Mimesis and the executive suite: Missing links in language evol-
ution. In J. R. Hurford, M. Studdert-Kennedy, & C. Knight (eds.), Appoaches to
the evolution of language: Social and cognitive biases, 44–67. Cambridge: CUP.
Donald, M. (1999). Preconditions for the evolution of protolanguage. In M.C. Cor-
ballis and S. E. Lea, (eds.), The descent of mind: Psychological perspectives on
hominid evolution. New York: OUP.
Donald, M. (2001). A mind so rare: The evolution of human consciousness. New York:
Norton.
Donald, M. (2012). The mimetic origins of language. In M. Tallerman, & K. R. Gibson
(eds.), The Oxford gandbook of language evolution, 180–183. Oxford: OUP.
Donald, M. (2013). Mimesis theory re-examined, twenty years after the fact. In
G. Hatfield & H. Pittman (eds.) Evolution of mind, brain, and culture. Philadelphia:
University of Pennsylvania Museum of Archaeology and Anthropology.
Douglas, M. (1984). Standard uses of food: Introduction. In M. Douglas & D. Hull
(eds.) Food in the social order. Studies of food and festivities in three American
communities, 1–39. New York: Russell Sage.
Douglas, M. (2007). Thinking in circles: an essay on ring composition. New Haven:
Yale University Press.
Douglas, M., & Isherwood, B. (1996 [1972]). The World of goods. Towards an an-
thropology of consumption. London: Routledge.
Douglas, M., & Nicod, M. (1974). Taking the biscuit: The structure of British meals.
New Society, 19, 744–747.
Doyle, R. A. (2013). Bodies and occlusion: Item types, cognitive processes, and
gender differences in mental rotation, Quarterly Journal of Experimental Psy-
chology, 66(4), 801–815.
Dragicevic, P., & Jansen, Y. (2012). List of physical visualizations (http://dataphys.
org/list/).
Dreyfus, H. L. (1972). What computers can’t do: A critique of artificial intelligence.
New York: Harper & Row.
Dreyfus, H. L. (1992). What computers still can’t do: A critique of artificial reason.
Cambridge, MA: MIT Press.
DuBois, J. (2007). The stance triangle. In R. Englebretson (ed.), Stancetaking in dis-
course: Subjectivity, evaluation, interaction, 139–182. Amsterdam, John Benjamins.
Dudchenko, P. A (2010). Why people get lost: The psychology and neuroscience of
spatial cognition. Oxford: OUP
424 References
Dugdake, N., & Lowe, C. L. (2000). Testing for symmetry in the conditional discrimi-
nations of language-trained chimpanzees. Journal of the Experimental Analysis
of Behavior, 73, 5–22.
Duncan, A. M. (1996). Laws and order in eighteenth-century chemistry. Oxford:
Clarendon Press.
Dunsworth, H. M., Warrener, A.G. , Deacon, T., Ellison, P. T. , & Pontzer, H. (2012). A
metabolic hypothesis for human altriciality. Proceedings of the National Academy
of Sciences, 109, 1–7.
Durst-Andersen, P. (2011). Linguistic supertypes: A cognitive-semiotic theory of
human communication. Berlin: de Gruyter.
Dutton, D. (2009). The art instinct: Beauty, pleasure, and human evolution. London:
Bloomsbury Press.
Dye, D. S. (1937). A grammar of Chinese lattice. Cambridge, MA: Harvard.
Eco, U. (1976). A theory of semiotics. Bloomington: Indiana University Press
Eco, U. (1995). The search for the perfect language. Oxford: Blackwell.
Efron, D. (1972 [1941]). Gesture and environment. New York: King’s Crown Press.
Elman, J. L., Bates, E. A., Johnson, M. H., Karmiloff-Smith, A. (1996). Rethinking in-
nateness: A connectionist perspective on development. Cambridge, MA: MIT Press.
Elvevåg, B., Wynn, R., & Covington, M. A. (2011). Meaningful confusions and con-
fusing meanings in communication in schizophrenia. Psychiatry research, 186(2),
461–464.
Emmorey, K. (2003). Perspectives on classifier constructions in sign languages.
Mahwah, N.J.: Lawrence Erlbaum.
Engel, A. (2008). How moving objects become animated: The human mirror neur-
on system assimilates non-biological movement patterns. Social Neuroscience,
3(3–4), 368–387.
Enqvist, N. E. (1978). Coherence, pseudo-coherence, and non-coherence. In J.-O
Östman. (ed.), Cohesion and semantics, 109–128. Åbo: Åbo Akademi.
Evans, G. (1982). Varieties of reference, In McDowell, J. (ed.). Oxford: Clarendon Press.
Evans, V. (2009). Semantic representation in LCCM Theory. In V. Evans and
S. Pourcel (eds.), New directions in cognitive linguistics, 50–80. Amsterdam: John
Benjamins.
Evans, V., & Chilton, P. A. (2010). Language, cognition and space: The state of the art
and new directions. London: Equinox.
Evans, V., & Green, M. C. (2006). Cognitive linguistics: An introduction. Edinburgh:
Edinburgh University Press.
Explanatory dictionary of contemporary Russian language [Slovar’ sovremennogo
russkogo literaturnogo jazyka], BAD (Big Academic Dictionary) (1951–1965) in
17 volumes, vol. 2, 3, 16. Moscow: Academy of Science of USSR.
References 425
Freund, I. (1904). The study of chemical composition: An account of its method and
historical development. Cambridge: CUP.
Freyd, J. J. (1992). Dynamic representations guiding adaptive behavior. In F. Macar,
V. Pouthas and W. Friedman (eds.), Time, action, and cognition: Towards bridging
the gap, 309–323. Dordrecht: Springer.
Friedrich, M., & Friederici, A. D. (2011). Word learning in 6-month-olds: fast
encoding–weak retention. Journal of Cognitive Neuroscience, 23(11), 3228–3240.
Frith, C. D. (1992). The cognitive neuropsychology of schizophrenia. New York: Psy-
chology Press.
Froese, T. (2007). On the role of AI in the ongoing paradigm shift within the cog-
nitive sciences. In L. Lungarella, F. Lida, J. Bongard & R. Pfeifer (eds.), 50 Years
of artificial intelligence, 63–75. Berlin: Springer.
Froese, T. (2011). Breathing new life into cognitive science. Avant: The Journal of the
Philosophical-Interdisciplinary Vanguard, 2(1), 113–129.
Froese, T. (2012). Cognitive science is not computer science. AISB Quarterly, 134, 7.
Froese, T. (2013). Altered states and the prehistoric ritualization of the modern
human mind. In C. Adams, A. Waldstein, D. P. Luke, B. Sessa, & D. King (eds.),
Breaking convention: Essays on psychedelic consciousness, 10–21. London: Strange
Attractor Press.
Froese, T., Iizuka, H., & Ikegami, T. (2013a). From synthetic modeling of social inter-
action to dynamic theories of brain–body–environment–body–brain systems.
Behavioral and Brain Sciences, 36(4), 420–421.
Froese, T., Woodward, A., & Ikegami, T. (2013b). Turing instabilities in biology,
culture, and consciousness? On the enactive origins of symbolic material culture.
Adaptive Behavior, 21(3), 199–214.
Fuller, G. (1997). Reading the readers: The role of semiotics in visitor research. Visitor
Behavior XII (3–4), 30–31.
Fürst, E. (1995). Mat – Et annet sprog. Rasjonalitet, kropp og kvinnelighet [Food – an-
other language. Rationality, body and femininity]. Oslo: Pax Forlag.
Fusaroli, R., & Morgagni, S., (eds.) (2013). Conceptual metaphor theory: Thirty years
after, Cognitive Semiotics, 5 (1–2.)
Fusaroli, R., Bahrami, B., Olsen, K., Roepstorff, A., Rees, G., Frith, C., & Tylén, K.
(2012). Coming to terms quantifying the benefits of linguistic coordination. Psy-
chological Science, 23, 931–939.
Fusaroli, R., Rączaszek-Leonardi, J., & Tylén, K. (2014). Dialog as interpersonal syn-
ergy. New Ideas in Psychology, 32, 147–157.
Fussell, S.R., & Kraut, R.E. (2004). Visual copresence and conversational coor-
dination. Behavioral & Brain Sciences, 27(2), 196–197.
428 References
Gabbay, D., & Woods, J. (2003). Agenda relevance. A study in formal pragmatics.
Amsterdam: Elsevier.
Gallagher, S. (1997). Mutual enlightenment: Recent phenomenology in cognitive
science. Journal of Consciousness Studies 4(3), 195–214.
Gallagher, S. (2005). How the body shapes the mind. Cambridge: CUP.
Gallagher, S. (2007a). Phenomenological and experimental contributions to under-
standing embodied experience. In Ziemke et al. (2007), 271–293.
Gallagher, S. (2007b). Phenomenological approaches to consciousness. In M. Vel-
mans & S. Schneider (eds.), The Blackwell companion to consciousness, 686–696.
Oxford: Blackwell.
Gallagher, S. (2007c). Sense of agency and higher-order cognition: Levels of ex-
planation for schizophrenia. Cognitive Semiotics, 1, 33–48.
Gallagher, S. (2007d). The natural philosophy of agency. Philosophy Compass, 2,
347–357.
Gallagher, S. (2009). Philosophical antecedents of situated cognition. In P. Robbins &
M. Aydede (eds.), The Cambridge handbook of situated cognition, 35–51. Cam-
bridge, CUP.
Gallagher, S. (2012). Multiple aspects in the sense of agency. New Ideas in Psychology,
30, 15–31.
Gallagher, S., & Zahavi, D. (2008). The phenomenological mind: An introduction to
philosophy of mind and cognitive science. London: Routledge.
Gallese, V. (2006). Intentional attunement: A neurophysiological perspective on
social cognition and its disruption in autism. Brain Research, 1079(1), 15–24.
Gallese, V., & Metzinger, T. (2003). Motor ontology: The representational reality of
goals, actions and selves. Philosophical psychology, 13, 365–388.
Gallistel, C. R. (1990). The organization of learning. Cambridge, MA: MIT Press.
Gao, T., & Scholl, B. J. (2011). Chasing vs. stalking: Interrupting the perception of
animacy. Journal of Experimental Psychology: Human Perception and Perform-
ance, 37, 669–684.
Gao, W. Zhu, H., Giovanello, K. S., Smith, J. K, Shed, D., et al. (2009). Evidence on
the emergence of the brain’s default network from 2-week-old to 2-year-old
healthy pediatric subjects. Proceedings of the National Academy of Sciences, 106,
20961–20966.
Gärdenfors, P. (2004). Conceptual spaces: The geometry of thought. Cambridge: Brad-
ford Books.
Gärdenfors, P., & Osvath, M. (2010). Prospection as a cognitive precursor to sym-
bolic communication. In Larson, R. (ed.), Evolution of language: Biolinguistic ap-
proaches, 103–114. Cambridge: CUP.
References 429
Gelfert, A. (2003). Manipulative success and the unreal. International Studies in the
Philosophy of Science, 17(3), 245–263.
Gentner, D., & Goldin-Meadow, S. (2003). Language in mind: Advances in the study
of language and thought. Cambrdige, MA: MIT Press.
Gerdes, P. (2004a). Symmetries on mats woven by Yombe women from the Lower
Congo: On the interplay between cultural values and mathematical-technical
possibilities. In D. K. Washburn & D. W. Crowe (eds.), Symmetry comes of age:
the role of pattern in culture, 81–99. Seattle: University of Washington Press.
Gerdes, P. (2004b). Basketry, geometry, and symmetry in Africa and the Americas.
Visual mathematics 24: Online: <http://bit.ly/1l1Bat1>.
Gibbs, J. W. (1906). The scientific papers of J. Willard Gibbs (Vol. 1 Thermodynamics).
New York: Longmans, Green and Co.
Gibbs, R. W. (1994). The poetics of mind: Figurative thought, language, and under-
standing. Cambridge: CUP.
Gibbs, R. W. (2005). The psychological status of image schemas. In B. Hampe (ed.),
From perception to meaning: Image schemas in cognitive linguistics, 113–135.
Berlin: Mouton de Gruyter.
Gibbs, R. W. (2006a). Embodiment and cognitive science. Cambridge: CUP.
Gibbs, R. W. (2006b). Metaphor interpretation as embodied simulation. Mind &
Language, 21(3), 434–458.
Gibbs, R. W. (2007), Idioms and formulaic language, In D. Geeraerts and H. Cuyck-
ens, The Oxford Handbook of Cognitive Linguistics, 697–725. New York: OUP.
Gibbs, R. W. (2008). Images schemas in conceptual development: What happened
to the body? Philosophical Psychology, 21(2), 231–239.
Gibbs, R. W. (2013). Why do some people dislike conceptual metaphor theory?
Cognitive Semiotics, 5, (1–2),14–36.
Gibson, J. J. (1960). Review of “Art and illusion”. The American Journal of Psychology,
73(4), 653–654.
Gibson, J. J. (1971). The information available in pictures. Leonardo, 4(1), 27–35.
Gibson, J. J. (1978). The ecological approach to the visual perception of pictures.
Leonardo, 11(3), 227–235
Gibson, J.J. (1979). The ecological approach to visual perception. New York: Psychology
Press.
Giere, R. N. (1992). Cognitive models of science. Minneapolis: U. of Minnesota Press.
Giere, R. N. (2006). Scientific perspectivism. Chicago: University of Chicago Press.
Giora, R. (1997). Discourse coherence and theory of relevance: Stumbling blocks in
search of a unified theory. Journal of Pragmatics, 27(1), 17–34.
Giora, R. (1998). Discourse coherence is an independent notion: A reply to Deirdre
Wilson. Journal of Pragmatics, 29(1), 75–86.
430 References
Grady, J. E., Oakley, T., & Coulson, S. (1999). Blending and metaphor. In R. Gibbs &
G. J. Steen (eds.), Metaphor in cognitive linguistics, 101–124. Amsterdam: John
Benjamins.
Greenwald, A. G., McGhee, D, & Schwartz, J. (1998). Measuring individual differ-
ences in implicit cognition: The Implicit Association Test. Journal of Personality
and Social Psychology, 74, 1464–1480.
Greenwald, A. G., Nosek A.B., & Banaji, M.R. (2003). Understanding and using the
Implicit Association Test: I. An improved scoring algorithm. Journal of Person-
ality and Social Psychology, 85, 197–216.
Grice, H. P. (1968). Utterer’s meaning, sentence-meaning, and word-meaning. Foun-
dation of Language, 4(3), 225–242.
Grice, H. P. (1975). Logic and conversation. In P. Cole & J.L. Morgan (eds.) Syntax
and semantics, Vol. 3, Speech acts, 41–58. New York: Academic Press.
Grigg, D. (1999). The changing geography of world food consumption in the second
half of the twentieth century. Geographical Journal, 165(1), 1–11.
Grigoriev, N. V., Grigorieva, S. A., Kreydlin, G. E. (2001). Dictionary of Russian ges-
tures [Slovar’ jazyka russkih zhestov]. Мoscow: Jazyki slavyanskoy kul’tury.
Grinevald, C. (2000). A morphosyntactic typology of classifiers. In G. Seft (ed.),
Systems of nominal classification, 69–84. Cambridge: CUP.
Grondin, S. (2010). Timing and time perception: a review of recent behavioral and
neuroscience findings and theoretical directions. Attention, Perception, & Psycho-
physics, 72(3), 561–582.
Groupe µ (1993 [1992]). Tratado del signo visual. Barcelona: Cátedra.
Grünbaum, B. (2004). Periodic ornamentation of the fabric plane: Lessons from
Peruvian Fabrics. In D. K. Washburn (ed.), Symmetry comes of age: the role of
pattern in culture, 18–64. Seattle: University of Washington Press.
Grush, R. (2004). The emulation theory of representation: motor control, imagery,
and perception. Behavioral & Brain Sciences, 27(3), 377–396.
Grush, R., & Churchland, P. (1995). Gaps in Penrose’s toilings. In Metzinger, T. (ed.),
Conscious experience, 185–214. Imprint Academic.
Gumperz, J. J., & Levinson, S. C. (eds.). (1996). Rethinking linguistic relativity. Cam-
bridge: CUP.
Hacking, I. (1983). Representing and intervening: Introductory topics in the philosophy
of natural Science. Cambridge: CUP.
Haiman, J. (2008). In defence of iconicity. Cognitive Linguistics, 19(1), 35–48.
Hakke, R. J., & Somerville, S. J. (1985). Development of logical search skills in in-
fancy. Developmental Psychology, 21, 176–186.
Halliday, M. A. K. (1975). Learning how to mean: Explorations in the development of
language. New York: Elsevier-North Holland.
432 References
Holt, E., and Drew, P. (1995). Idiomatic expressions and their role in the organization
of topic transition in conversation. Hillsdale, N.J.: Lawrence Erlbaum.
Holt, N. R. (1970). A note on Wilhelm Ostwald’s Energism. Isis, 61(3), 386–389.
Holtzclaw, H. F., Nebergall, W., & Robinson, W. (1984). General chemistry. Houghton
Mifflin.
Hon, G., & Goldstein, B. R. (2008). From summetria to symmetry: The making of a
revolutionary scientific concept. Berlin: Springer.
Hookway, C. (1985). Peirce. London: Routledge & Kegan Paul.
Hopkins, Z., Yuill, N., & Keller, B. (2015). Children with autism align syntax in
natural conversation. Applied Psycholinguistics, 1–24.
Hougaard, A., & Hougaard, G. (2008). Implications of cognitive metaphor and ges-
ture studies for ethnomethodology and conversation analysis and vice versa. In
A. Cienki & C. Müller Metaphor and gesture, 265–273. Amsterdam: John Ben-
jamins.
Housmann, R., Tyson, L. B., Bekhouche, Y., & Zahidi, S. (2014). The global gender
gap report, World Economic Forum. Retrieved from: http://www3.weforum.org/
docs/GGGR14/GGGR_CompleteReport_2014.pdf
Hummel, J. E., & Biederman, I. (1992). Dynamic binding in a neural network for
shape recognition. Psychological Review, 99, 480–517.
Hurford, J. R. (2014). The origins of language. The slim guide. New York: OUP.
Hurley, S. (1998). Consciousness in action. Cambridge, MA: Harvard University Press.
Husserl, E. (1969 [1929]). Formal and transcendental Logic. The Hague: Martinus
Nijhoff.
Husserl, E. (1970). Logical investigations, Volume 1. London: Routledge.
Husserl, E. (1973 [1939]). Experience and judgement. Northwestern: Northwestern
University Press.
Husserl, E. (1999). The essential Husserl. Basic writings in transcendental phenom-
enology. Edited by D. Welton. Bloomington: Indiana University Press
Husserl, E. (2001 [1900]). Logical investigations. London: Routledge.
Husserl, E. (2001). Analyses concerning passive and active synthesis: Lectures on tran-
scendental logic. Dordrecht: Kluwer Academic Publishers.
Hutchins, E. (2010). Cognitive ecology. Topics in Cognitive Science, 2, 705–715.
Hutchins, E. (2010). Enaction, imagination, and insight. In J. Stewart, O. Gapenne
and E. A. Di Paolo (eds.), Enaction: Toward a new paradigm for cognitive science,
425–450. Cambridge: The MIT Press.
Hutto, D. D. (2008). First communions: Mimetic sharing without theory of mind. In
Zlatev et al. (2008), 245–276.
References 435
Hutto, D. D., & E. Myin (2013). Radicalizing enactivism: Basic minds without content.
Cambridge, MA: MIT Press.
Huxley, T. H. (1874). On the hypothesis that animals are automata, and its history,
Fortnightly Review, 16, 555–580.
Iacoboni, M., Molnar-Szakacs, I., Gallese, V., Buccino, G., Mazziotta, J.C., & Rizzol-
atti, G. (2005). Grasping the intentions of others with one’s own mirror neuron
system. PLoS biology, 3(3), e79.
Ihde, A. J. (1964). The development of modern chemistry. New York: Dover Pub-
lications.
Imai, M. (1999). Constraint on word learning constraints. Japanese Psychological
Research, 41, 5–20.
Imai, M. (2014). Symbol grounding problem: How do children build up the lexicon?
Plenary address presented to IACS 1, Lund, Sweden, September 27, 2014.
Imai, M., & Gentner, D. (1997). A cross-linguistic study of early word meaning:
Universal ontology and linguistic influence. Cognition, 62(2), 169–200.
Imai, M., & Haryu, E. (2001). Learning proper nouns and common nouns without
clues from syntax. Child Development, 72(3), 787–802.
Imai, M., & Kita, S. (2014). The sound symbolism bootstrapping hypothesis for
language acquisition and language evolution. Philosophical Transactions of the
Royal Society B: Biological Sciences, 369(1651), 20130298.
Imai, M., & Mazuka, R. (2007). Language‐relative construal of individuation con-
strained by universal ontology: Revisiting language universals and linguistic
relativity. Cognitive Science, 31(3), 385–413.
Imai, M., Gentner, D., & Uchida, N. (1994). Children’s theories of word meaning: The
role of shape similarity in early acquisition. Cognitive Development, 9(1), 45–75.
Imai, M., Haryu, E., & Okada, H. (2005). Mapping novel nouns and verbs onto
dynamic action events: are verb meanings easier to learn than noun meanings
for Japanese children? Child Development, 76(2), 340–355.
Imai, M., Kita, S., Nagumo, M., & Okada, H. (2008). Sound symbolism facilitates early
verb learning. Cognition, 109(1), 54–65.
Imai, M., Li, L., Haryu, E., Okada, H., Hirsh-Pasek, K., Golinkoff, R., & Shigem-
atsu, J. (2008). Novel noun and verb learning in Chinese-, English-, and Japanese-
speaking children. Child Development, 79, 979–1000.
Imai, M., Miyazaki, M., Yeung, H. H., Hidaka, S., Kantartzis, K., Okada, H., & Kita,
S. (2015). Sound symbolism facilitates word learning in 14-month-olds. PloS
one, 10(2), e0116494.
Itkonen, E. (2005). Analogy as structure and process. Amsterdam: John Benjamins.
Jackendoff, R. (1990). Semantic structures. Cambridge, MA: MIT Press.
436 References
Jackendoff, R. (1994). Patterns in the mind: language and human nature. New York:
Basic Books.
Jackendoff, R. (2003). Foundations of language. Brain, meaning, grammar, evolution.
New York: OUP.
Jackendoff, R. (2012). Language as a source of evidence for theories of spatial rep-
resentation. Perception, 41, 1128–1152.
Jacobs, I.F., von Bayern, A., Martin-Ordas, G., Rat-Fischer, L., & Osvath, M. (2015).
Corvids create novel interventions after all. Proceedings of the Royal Society B,
282, 1806.
Jakobson, R. & Waugh, L (1979). The sound shape of language. Bloomington: Indiana
University Press.
Jakobson, R. (1965). Quest for the essence of language. Diogenes, 13(51), 21–37.
James, W. (1907). Pragmatism: A new name for some old ways of thinking. New York:
Longmans, Green, and Co.
Jansson, S. (1993). Maten och det Sociala Samspelet [Food and Social Interaction].
Stockholm: Agricultural University of Sweden.
Jervolino, D. (2001). Ricoeur et la pensée de l’histoire: Entre temps et mémoire.
Labyrinth, vol. 3, Winter.
Jespersen, O. (1992 [1924]). The philosophy of grammar. Chicago: University of
Chicago Press.
Johnson, M. (1987). The body in the mind: The bodily basis of meaning, imagination,
and reason. Chicago: University of Chicago Press.
Johnson, M. (2005). The philosophical significance of image schemas. In B. Hampe
(ed.), 15–34.
Johnson, M. (2007). The meaning of the body: Aesthetics of human understanding.
Chicago: The University of Chicago Press.
Julesz, B. (1981). Textons, the elements of texture perception and their interactions.
Nature, 290, 91–97.
Kantartzis, K., Imai, M., & Kita, S. (2011). Japanese Sound-Symbolism Facilitates
Word Learning in English-Speaking Children. Cognitive Science, 35(3), 575–586.
Kastler, L. (2004). Negative and Positive politeness [Negativnaja i pozitivnaya
vezhlivost’]. In Agression in language and in speech [Agressija v jazyke i rechi].
Мoscow: RSUH. 9–18.
Katz, J. J. (1973). Compositionality, idiomaticity, and lexical substitution. In S.R. An-
derson & P. Kiparsky (eds.), A festschrift for Morris Halle. New York.´: Holt, Re-
inhart & Winston.
Katz, J. J. (1980). Propositional structure and illocutionary force: A study of the con-
tribution of sentence meaning to speech acts. Cambridge: Harvard University Press.
References 437
Kay, P., Berlin, B., Maffi, L., & Merrifield, W. (1997). Color naming across languages.
In C. L. Hardin & L. Muffi (eds.) Color categories in thought and language, 21–58.
Cambridge: CUP.
Kecskes, I. (2014). Intercultural pragmatics. Oxford: OUP.
Keeler, J., & Wothers, P. (2003). Why chemical reactions happen. Oxford: OUP.
Kehler, A. (2011). Cohesion and coherence. In C. Maienborn, K. von Heusinger
and P. Portner (eds.), Semantics: An international handbook of natural language
meaning (vol. 2), 963–1987. Berlin: de Gruyter.
Kempson, R. (2001). Pragmatics: language and communication. In M. Aronoff &
J. Rees-Miller (eds.) The handbook of linguistics, 349–427. Cambridge: MIT Press.
Kendon, A. (1980a). A description of a deaf-mute sign language from the Enga
Province of Papua New Guinea with some comparative discussion. Part I: The
formational properties of Enga signs. Semiotica, 31(1/2), 1–34.
Kendon, A. (1980b). A description of a deaf-mute sign language from the Enga
Province of Papua New Guinea with some comparative discussion. Part II: The
semiotic functioning of Enga signs. Semiotica, 32(1/2), 81–117.
Kendon, A. (1986). Iconicity in Warlpiri sign language. In P. Bouissac, M. Herzfeld, &
R. Posner (eds.), Iconicity: Essays on the nature of culture. Festschrift for Thomas
A. Sebeok on his 65th Birthday, 437–446. Tübingen: Stauffenburg Verlag.
Kendon, A. (1988). Sign languages of Aboriginal Australia: Cultural, semiotic and
communicative perspectives. Cambridge: CUP.
Kendon, A. (2004). Gesture. Visible action as utterance. Cambridge: CUP.
Kendon, A. (2009). Signs for language origins? Public Journal of Semiotics, 2(1), 2–27.
Kennedy, J. M. (1974). A psychology of picture perception. San Francisco, CA: Jossey-
Bass Publishers.
Kennedy, J. M. (1993). Drawing and the blind. New Haven, CT: Yale University Press.
Kim, J. (2007). The causal efficacy of consciousness. In Velmans, M., &
Schneider, S. (eds.), The Blackwell companion to consciousness, 406–417. Oxford:
Blackwell.
Kim, M. H. (2014). Why self-deprecating? Achieving ‘oneness’ in conversation.
Journal of Pragmatics, 69, 82–98.
Kintsch, W., & Van Dijk, T. A. (1978). Toward a model of text comprehension and
production. Psychological review, 85(5), 363.
Kirmayer, L. J. (1992). The body’s insistence on meaning: Metaphor as presentation
and representation in illness experience. Medical Anthropological Quarterly, 6(4),
323–346.
Kitterod, R. H., & Lappegard T. (2012). A typology of work-family arrangements
among dual-earner couples in Norway. Family Relations, 61(4), 671–685.
438 References
Klatzky, R. L., Giudice, N. A., Bennett, C. R., and Loomis, J. M. (2014). Touch-screen
technology for the dynamic display of 2D spatial information without cision:
Promise and Progress. Multisensory Research, 27, 359–378.
Klüver, H. (1966). Mescal and mechanisms of hallucinations. Chicago: University of
Chicago Press.
Köhler, W. (1947). Gestalt psychology, 2nd ed. New York: Liveright Publishing.
Kono, T. (2009). Social affordances and the possibility of ecological linguistics. Inte-
grative Psychological and Behavioral Science, 43(4), 356–373.
Kosslyn, S. M., Ganis, G., & Thompson, W. L. (2001). Neural foundations of imagery.
Nature Reviews Neuroscience, 2(9), 635–642.
Kövecses, Z. (2005). Metaphor in culture: Universality and variation. New York: CUP.
Kövecses, Z. (2008). Conceptual metaphor theory – Some criticisms and alternative
proposals. Annual Review of Cognitive Linguistics, 6(1), 168–184.
Kövecses, Z. (2010). Metaphor: A practical introduction. Oxford: OUP.
Kragh, H. (2001). Van’t Hoff and the transition from thermochemistry to chemical
thermodynamics. In W. J. Hornix & S. H. W. M. Mannaerts (eds.), Van’t Hoff and
the emergence of Chemical Thermodynamcis, 191–211. Delft: DUP Science.
Kragh, H. (2015). The “New Physics”. In M. Saler (ed.), The fin-de-siècle world,
441–455. London: Routledge.
Kress, G. (2010). Multimodality. A social semiotic approach to contemporary com-
munication. London: Routledge.
Kress, G., & Leeuwen, T. van (1996). Reading images: The grammar of visual design.
London: Routledge.
Kress, G., & Selander, S. (2012). Multimodal design, learning and cultures of recog-
nition. The Internet and Higher Education, 15(4), 265–268.
Kress, G., Jewitt, C., Ogborn, J., & Tsatsarellis, C. (2014). Multimodal teaching and
learning. The Rhetorics of the Science Classroom. London: Continuum.
Kreydlin, G. E. (2002). Nonverbal semiotics. Body language and natural language
[Neverbal’naya semiotica. Jazyk tela i estestvennyj jazyk]. Мoscow: NLO.
Kreydlin, G. E., Morozova, E. V. (2003). Interlingual typology of the nonverbal units:
the common bows [Vnutrijazykovája tipológija neverbál’nyh jediníts: bytovýje
poklóny]. In Questions of linguistics [Voprósy jazykoznánija], 4. Мoscow: Nauka.
34–47.
Kronhaus, M. A. (2001). Changes in contemporary Russian etiquette [Izmenénija v
sovreménnom rechevóm etikéte]. In Life of the Language [Zhyzn’ jazyka]:collec-
tion of articles for the 80th anniversary of M.V. Panov. Мoscow: Jazyki slav’anskoy
kul’tury. 263–269.
Kronhaus, M. A. (2004). Russian verbal etiquette at the turn of the century [Rússkij
rechevój etikét na rubezhé vekóv]. In Russian linguistics, 28 (2), 163–187.
References 439
Krylova, T. V. (2006). Naïve linguistic explanation of the politeness and its lexical
units [Naivno-jazykovyje predstavlenija o vezhlivosti i obsluzhivajushaja ih
leksika]. In Linguistic world image and system lexicography[ Jazykovajz kartina
mira i sistemnaya leksikografija]. Moscow: Jazyki slavyanskih kul’tur. 241–400.
Kull, K., Deacon, T., Emmeche, C., Hoffmeyer, J. & Stjernfelt, S. (2009). Theses
on biosemiotics: Prolegomena to a theoretical biology. Biological Theory, 4(2),
167–173.
Kulvicki, J. (2010). Knowing with images: Medium and message. Philosophy of
Science, 77(2), 295–313.
Kuperberg, G. R. (2013). The proactive comprehender: What event-related potentials
tell us about the dynamics of reading comprehension. In B. Miller, L. Cutting, &
P. McCardle (eds.) Unraveling the behavioral, neurobiological, and genetic com-
ponents of reading comprehension. Baltimore: Paul Brookes Publishing.
Kurzweil, R. (2006). The singularity is near: When humans transcend biology. New
York: Penguin.
Kutas, M., & Federmeier, K. D. (2011). Thirty years and counting: finding meaning
in the N400 component of the event-related brain potential (ERP). Annual Review
of Psychology, 62, 621–647.
Kuznetsov, S. A. (1998). The big explanatory dictionary of the Russian language
[Bol’shój tolkovyj slovar’ rússkogo jazyka]. Saint-Petersburg: Norint.
Ladewig, S. H. (2014). Recurrent gestures. In C. Müller, A. Cienki, E. Fricke,
S. H. Ladewig, D. McNeill, J. Bressem, (eds), Body – language – communication. An
international handbook on multimodality in human interaction, Vol. 2, 1558–1574.
Berlin: De Gruyter.
Laidler, K. J. (1985). Chemical kinetics and the origins of physical chemistry. Archive
for History of Exact Sciences, 32(1), 43–75.
Laidler, K. J. (2000). Van’t Hoff and the Scientific Imagination. Chemical Intelligencer,
4, 37–41.
Laidler, K. J., Meiser, J. H., & Sanctuary, B. C. (2003). Physical chemistry (4. ed.).
Boston, MA: Houghton Mifflin.
Lakoff, G. (1987). Women, fire and dangerous things: what categories reveal about the
mind. Chicago: University of Chicago Press.
Lakoff, G. (2008). The neural theory of metaphor. In Gibbs, R. (ed.). The Cambridge
handbook of metaphor and thought, 17–38. Cambridge: CUP.
Lakoff, G., & Johnson M. (1980). Metaphors we live by. Chicago: University of Chi-
cago Press.
Lakoff, G., & Johnson, M. (1999). Philosophy in the flesh: The embodied mind and its
challenge to Western thought. New York: Basic Books.
Lakoff, G., & Turner, M. (1989). More than cool reason. A field guide to poetic meta-
phor. Chicago: University of Chicago Press.
440 References
Lakoff, G., Espenson, J., Goldberg, A., & Schwartz, A. (1991 [1989]). The master
metaphor list: University of California Berkeley, online: <http://araw.mede.uic.
edu/∼alansz/metaphor/METAPHORLIST.pdf>.
Landau, B., Smith, L. B., & Jones, S. S. (1988). The importance of shape in early lexical
learning. Cognitive development, 3(3), 299–321.
Langacker, R. (1987). Foundations of cognitive grammar: Vol. 1. Theoretical prerequi-
sites. Stanford: Stanford University Press.
Langacker, R. (1991). Concept, image, and symbol. The cognitive basis of grammar.
Berlin: Mouton de Gruyter.
Langacker, R. (1998). Conceptualization, symbolization, and grammar. In M. Toma-
sello (ed.), The new psychology of language: Cognitive and functional approaches
to language structure, 1–39. London: Erlbaum.
Langacker, R. (1999). Grammar and conceptualization. Berlin: Mouton de Gruyter.
Langacker, R. (2008). Cognitive grammar: A basic introduction. New York: OUP.
Larina, T. V. (2009). Category of politeness and the style of communication. Com-
parative analysis of English and Russian linguistic and cultural traditions [Kat-
egoria vezhlivosti i stil’ kommunicatsyji. Sopostavleniye anglijskih i russkih
lingvokul’turnyh tradutsyj]. Moscow: Jazyki slavyanskih kul’tur.
Larina, T. V. (2013). English and Russian. Language, culture, communication [Angli-
chane i russkie. Jazyk, kultura, kommunikatsyja]. Moscow: Jazyki slavyanskih
kul’tur.
Larkin, J. H., & Simon, H. A. (1987). Why a diagram is (sometimes) worth ten
thousand words. Cognitive Science, 11, 65–99.
Larkin, J. H., & Simon, H. A. (1989). Learning through growth of skill in mental
modeling. In H. A. Simon (ed.), Models of thought. Volume II (pp. 134–144). New
Haven, CT: Yale University Press.
Lave, J. (1988). Cognition in practice: Mind, mathematics and culture in everyday life.
Cambridge: CUP.
Le, Q. V. (2012). Building high-level features using large scale unsupervised learning.
Proceedings of the 29th international conference on machine learning, Edinburgh.
Lee, P. (1996): The Whorf theory complex: A critical reconstruction. Amsterdam: John
Benjamins.
Lenninger, S. (2012). When similarity qualifies as a sign: A study in picture under-
standing and semiotic development in young children. Doctoral Dissertation, Lund
University.
Lennon, E. M. (1984). Exploration, communication, and symbolization : Gestural devel-
opment in infancy. Unpublished Masters Thesis. New Brunswick, NJ: Rutgers
University.
References 441
Leroi Gourhan, A. (1980). Hand und Wort. Die Evolution von Technik, Sprache und
Kunst. Frankfurt am Main: Suhrkamp.
Lestrade, S. (2010). The space of case, PhD thesis, Radboud University Nijmegen.
Leung, T., & Malik, J. (2001). Representing and recognizing the visual appearance
of materials using three-dimensional textons. International Journal of Computer
Vision, 43, 29–44.
Levent, W. (1982). Linearization in describing spatial networks. In S. Peters & E. Saa-
rinen (eds.), Processes, beliefs, and questions. Amsterdam: Reidel.
Levinson, S. C., & Brown, P. (1994). Immanuel Kant among the Tenejapans: An-
thropology as empirical philosophy. Ethos, 22(1), 3–41.
Levtov, R.G., Barker, G. Contreras-Urbina, M. Heilman, B. & Verma, R. (2012). Path-
ways to gender-equitable Men: Findings from the international men and gender
equality survey in eight countries. Men and Masculinities, 17(5), 467–501.
Lewis, M. M. (1936). Infant speech. New York: Harcourt Brace.
Lewis-Williams, J. D. & Dowson, T. A. (1988). The signs of all times: Entoptic phe-
nomena in upper palaeolithic art. Current Anthropology, 29(2), 201–217.
Libet, B. (1985). Unconscious cerebral initiative and the role of conscious will in
voluntary action. Behavioral and Brain Sciences, 8(4), 529–539.
Liebal, K., Call, J. & Tomasello, M. (2004). Use of gesture sequences in chimpanzees.
American Journal of Primatology, 64(4), 377–396.
Liebal, K., Müller, C., & Pika, S. (eds.) (2007). Gestural communication in nonhuman
and human primates. Amsterdam: John Benjamins.
Lind, P. (2013). Pyrrhonean aporiagogics and the astrocognitive challenge. In
D. Duner, J. Parthemore, E. Persson, and G. Holmberg (eds.), The history and
philosophy of astrobiology: Perspectives on extraterrestrial life and the human mind,
93–128. Newcastle: Cambridge Scholars.
Linder, C. (2013). Disciplinary discourse, representation, and appresentation in the
teaching and learning of science. European Journal of Mathematics and Science
Education, 1(2), 43–49.
Linkenauger, S.A., Lerner, M.D., Ramenzoni, V.C., & Proffitt, D.R. (2012). A perceptual-
motor deficit predicts social and communicative impairments in individuals with
autism spectrum disorders. Autism Research, 5(5), 352–362.
Linnell, P. (2006). Towards a dialogical linguistics. In M. Lähteenmäki, H. Dufva,
S. Leppänen, & P.Varis (eds.), Proceedings of the XII international Bakhtin con-
ference Jyväskylä, Finland, 18–22 July, 2005, 157–172. Jyväskylä: University of
Jyväskylä Press.
Lionello-DeNolf, K. M. (2009). The search for symmetry: 25 years in review. Learn-
ing & behavior, 37(2), 188–203.
442 References
(eds.), The conceptual mind: New directions in the study of concepts. Cambridge
MA: MIT press.
Malt, B.C., Sloman S.A., Gennari S., Shi M., & Wang Y. (1999). Knowing versus nam-
ing: Similarity and the linguistic categorization of artifacts. Journal of Memory
and Language, 40, 230–262
Mandel, M. (1977). Iconic devices in American Sign Language. In L. A. Friedman
(ed.), On the other hand: New perspectives on American Sign Language, 57–107.
New York: Academic Press.
Mandler, J. M. (2004). Foundations of mind: Origins of conceptual thought. New York:
OUP.
Mandler, J. M. (2006). Categorization, development of. Encyclopedia of Cognitive
Science. Wiley Online Library.
Marean, C. W. (2011). Coastal South Africa and the coevolution of the modern
human lineage and the coastal adaptation. In N.F. Bicho, J. A. Haws &, L.G. Davis
(eds.), Trekking the shore: Changing coastlines and the antiquity of costal settle-
ment, 421–440. New York: Springer.
Marean, C. W., Bar-Matthews, M., Bernatchez, J., Fishser, E., Goldberg, P., Herries,
A. I. R., Jacobs, Z., Jerardino, A., Karkanas, P., Minichillo, T., Nilssen, P. J., Thomas,
E., Watts, I., & Williams, H. M. (2007). Early human use of marine resources and
pigment in South Africa during the Middle Pleistocene. Nature, 449, 905–908.
Marek, J. (2013). Alexius Meinong, The Stanford Encyclopedia of Philosophy (Fall
2013 Edition), Edward N. Zalta (ed.), URL = <http://plato.stanford.edu/archives/
fall2013/entries/meinong/>.
Margulis, L., & Sagan, D. (1995). What is life? New York: Simon & Schuster.
Marini, A., Spoletini, I., Rubino, I. A., Ciuffa, M., Bria, P., Martinotti, G., Banfi, G.,
Boccascino, R., Strom, P., Siracusano, A., Caltagirone, C., & Spalletta, G. (2008).
The language of schizophrenia: An analysis of micro and macrolinguistic abilities
and their neuropsychological correlates. Schizophrenia Research, 105(1), 144–155.
Marini, A., Zettin, M., & Galetto, V. (2014). Cognitive correlates of narrative impair-
ment in moderate traumatic brain injury. Neuropsychologia, 64, 282–288.
Markman, E. M. (1989). Categorization and naming in children: Problems of induction.
Cambridge, MA: MIT Press.
Marler, P., & Tenaza, R. (1977). Signalling behaviour of wild apes with special ref-
erence to vocalization. In T. Sebeok (ed.), How animals communicate, 965–1033.
Bloomington: Indiana University Press.
Marr, D. (1982). Vision: A computational investigation into the human representation
and processing of visual information. San Francisco: Freeman & Co.
Martin, J. R., & Veel, R. (1998). Reading science. Critical and functional perspectives
on discourses of science. London: Routledge.
444 References
Maturana, H., & Varela. F. (1980). Autopoiesis and cognition: The realization of the
living (2nd edition). Boston: Reidel.
Maturana, H., & Varela, F. (1998). The tree of knowledge: The biological roots of human
understanding (revised edition). Transl. by R. Paolucci. Boston and London:
Shambhala.
Maurer, D. (2002). Vegetarianism: Movement or moment? Philadelphia, PA: Temple
University Press.
Maurer, D., Pathman, T., & Mondloch, C. J. (2006). The shape of boubas: sound–shape
correspondences in toddlers and adults. Developmental science, 9(3), 316–322.
Mayol, P. (1998). Propriety. In M. de Certeau, L. Giard & P. Mayol (eds.) The practice
of everyday life. Minneapolis, MN: University of Minnesota Press.
McCarthy, J., & Hayes, P.J. (1969), Some philosophical problems from the standpoint
of artificial intelligence, In D. Michie & B. Meltzer (eds.) Machine intelligence 4,
463–502. Edinburgh: Edinburgh University Press.
McCawley, J. (1976). Grammar and meaning. New York: Academic Press.
McClelland, J. L., Botvinick, M. M., Noelle, D. C., Plaut, D. C., Rogers, T. T., Seiden-
berg, M. S., & Smith, L. B. (2010). Letting structure emerge: Connectionist and
dynamical systems approaches to Cognition. Trends in Cognitive Science, 14,
348–356.
McCune, L. (1992). First words: A dynamic systems view. In C. A. Ferguson,
L. Menn, & C. Stoel-Gammon (eds.), Phonological development: Models, research,
implications, 313–336. Parkton, MD: York Press.
McCune, L. (1993). The development of play as the development of consciousness.
In M. Bornstein & A. O’Reilly (eds.), New directions in child development. The
role of play in the development of thought, 67–79. San Francisco CA: Jossey-Bass.
McCune, L. (1995). A normative study of representational play at the transition to
language. Developmental Psychology, 31, 198–206.
McCune, L. (1999). Children’s transition to language: A human model for devel-
opment of the vocal repertoire in extant and ancestral primate species? In B. J.
King (ed.) Origins of language: What can nonhuman primates tell us? 296–306.
Santa Fe: SAR Press.
McCune, L. (2006). Dynamic event words: From common cognition to varied lin-
guistic expression. First Language, 26, 233–255.
McCune, L. (2008). How children learn to learn language. New York: OUP.
McCune, L. (2010). Developing symbolic activities. In B. Wagoner (ed.). Symbolic
transformations: The mind in movement through culture and society, 193–208.
Oxford: Routledge Academic Press.
McCune, L., & Vihman, M. (1987). Vocal motor schemes. Papers and Reports in Child
Language Development, 26, 72–79.
References 445
McCune, L., & Vihman, M. (2001). Early phonetic and lexical development: A
productivity approach. Journal of Speech, Language, and Hearing Research, 44,
670–684.
McCune, L., & Zlatev, J. (2015). Dynamic systems in semiotic development: The
transition to reference. Cognitive Development, 36, 161–170.
McCune, L., Vihman, M. M., Roug-Hellichius, L., Delery, D. B., & Gogate, L. (1996).
Grunt communication in human infants. Journal of Comparative Psychology,
110, 27–37.
McCune-Nicolich, L. (1981a). The cognitive basis of relational words. Journal of
Child Language, 8, 15–36.
McCune-Nicolich, L. (1981b). Toward symbolic functioning: Structure of early
pretend games and potential parallels with language. Child Development, 52,
785–797.
McGilchrist, I. (2009). The master and his emissary: the divided brain and the making
of the Western world. New Haven: Yale University Press.
McMahon, A., & McMahon, R. (2013). Evolutionary linguistics. Cambridge: CUP.
McNeill, D. (1992). Hand and mind. What gestures reveal about thought. Chicago:
University of Chicago Press.
McNeill, D. (2013). The co-evolution of gesture and speech, and downstream con-
sequences. In C. Müller, A. Cienki, E. Fricke, S. H. Ladewig, D. McNeill, S. Te-
ßendorf, (eds) Body – language – communication. An international handbook on
multimodality in human interaction. Vol. 1, 480–512. Berlin: De Gruyter.
McNeill, D. (2015). Why we gesture. Cambridge: CUP.
Meigs, A. (1984). Food, sex and pollution: A New Guinea religion. New Brunswick,
NJ: Rutgers University Press.
Mendoza Collazos, J. (2015). Semiótica del diseño con enfoque agentivo: Condiciones
de significancia en artefactos de Uso. Bogotá: Universidad Jorge Tadeo Lozano.
Merleau-Ponty, M. (1962 [1945]). Phenomenology of perception. London: Routledge.
Merleau-Ponty, M. (1968 [1960]). The intertwining — the chiasm. In C. Lefort (ed.),
A. Lingis (Trans.), Le visible et l’invisible: The visible and the invisible, 130–155.
Evanston: Northwestern University Press.
Merrell, F. (2010). Entangling forms: Within semiosic processes. Berlin: De Gruyter.
Merritt, D. J., Casasanto, D., & Brannon, E. M. (2010). Do monkeys think in meta-
phors? Representations of space and time in monkeys and humans. Cognition,
117, 191–202.
Metzinger, T., & Gallese, V. (2003). The emergence of a shared action ontology:
building blocks for a theory. Consciousness and Cognition, 12, 549–571.
446 References
Millikan, R. (1998). A common structure for concepts of individuals, stuffs, and real
kinds: More Mama, more milk, and more mouse. Behavioral and Brain Sciences,
21(1), 55–65.
Millikan, R. (2010). On knowing the meaning; with a coda to Swampman. Mind,
119(473), 43–81.
Miranda, M., & Mendes, P. (2014). A emergência de metáforas multimodais: análise
da metaforização e da compressão no debate político-eleitoral. Scripta, 18 (34).
Mittelberg, I. (2010). Geometric and image-schematic patterns in gesture space. In
V. Evans & P. Chilton (eds.), Language, cognition and space: The state of the art
and new directions, 351–385. London: Equinox.
Mittelberg, I. (2014a) Gestures and iconicity. In C. Müller, A. Cienki, E. Fricke,
S. H. Ladewig, D. McNeill, J. Bressem, (eds), Body – language – communication. An
international handbook on multimodality in human interaction, Vol. 2, 1712–1732.
Berlin: De Gruyter Mouton.
Mittelberg, I., & Evola, V. (2014) Iconic and representational gestures. In C. Müller,
A. Cienki, E. Fricke, S. H. Ladewig, D. McNeill, J. Bressem, (eds), Body – lan-
guage – communication. An international handbook on multimodality in human
interaction, Vol. 2, 1732–1746). Berlin: De Gruyter.
Mittelberg, I., & L. R. Waugh (2014b) Gestures and metonymy. In C. Müller, A. Cien-
ki, E. Fricke, S. H. Ladewig, D. McNeill, J. Bressem, (eds), Body – language – com-
munication. An international handbook on multimodality in human interaction,
Vol. 2, 1747–1766. Berlin: De Gruyten.
Mittelberg, I., & Waugh, L. R. (2009). Metonymy first, metaphor second: A cognitive-
semiotic approach to multimodal figures of speech in co-speech gesture. In
C. Forceville & E. Urios-Aparisi (eds.), Multimodal metaphor, 329–356. Berlin:
Mouton de Gruyter.
Moran, D. (2012). Husserl dictionary. Bloomsbury: Bloomsbury Academic.
Morozova, E. B. (2006). Correlation between verbal and non-verbal etiquette [Never-
bal’nyj etiket v jego sootnosheniji s verbal’nym]. PhD thesis. Moscow: RSUH.
Morris, C. (1938). Foundations of the theory of signs. International Encyclopedia of
Unified Science, 2.
Morris, D. (2010). Peoplewatching: The Desmond Morris Guide to Body Language.
Moscow: Eksmo.
Müller, C. (1998a). Redebegleitende Gesten. Kulturgeschichte – Theorie – Sprachverg-
leich. Berlin: Berlin Verlag Arno Spitz GmbH.
Müller, C. (1998b). Iconicity and gesture. In S. Santi (ed.), Oralité et gestualité: Com-
munication multimodale,interaction, 321–328. Montréal/ Paris: L’Harmattan.
Müller, C. (2004). The palm-up-open-hand. A case of a gesture family? In C. Müller
und R. Posner (eds.), The semantics and pragmatics of everyday gestures, 233–256.
Berlin: Weidler.
References 447
Murai, C., Miyazaki, M., Tomonaga, M., Okada, H., & Imai, M. (2014). The origin
of a uniquely human thinking bias: The symmetry inference bias in human in-
fants and chimpanzees. Proceedings of the 20th international conference on infant
studies, Berlin, Germany, July 3–5.
Murcott, A. (1983). The Sociological Significance of Food. A. Murcott (ed.), The Soci-
ology of Food and Eating. Aldershot: Gower.
Muth, C. (2013). The Aesthetic Aha: On the pleasure of having insights into Gestalt.
Acta Psychologica, 144(1), 25–30.
Naigles, L. (1990). Children use syntax to learn verb meanings. Journal of Child
Language, 17, 357–374.
Namy, L. L. (2009). Early word learning and other seemingly symbolic behaviors.
In A. Woodward & A. Needham (eds.), Learning and the Infant Mind, 249–262.
London: Oxford.
National Corpus of Russian Language, NCRL (2004–2014). www.ruscorpora.ru.
Neisser, U. (2008). Memory with a grain of salt. In H.H. Wood & A. S. Byatt (eds.),
Memory: an anthology, , 80–88. London: Chatto and Windus.
Nelson, D. L., & Cox, M. M. (2008). Principles of biochemistry. New York: W.H. Free-
man.
Nelson, K .E., Craven, P. L., Xuan, Y., & Arkenberg, M.E. (2004). Acquiring art,
spoken language, text, and other symbolic systems: Developmental and evol-
utionary observations from a dynamic tricky mix theoretical perspective. In J. M
Lucariello, J. A. Hudson, R. Fivush, & P. J. Bauer (eds.) The development of the
mediated mind, 175–222. Hillsdale, NJ: Erlbaum.
Nelson, K. E. (1980). Theories of the child’s acquisition of syntax: A look at rare
events and at necessary, catalytic, and irrelevant components of mother-child
conversation. Annals of the NY Academy of Sciences, 345, 45–67.
Nelson, K. E. (1982). Experimental gambits in the service of language acquisition
theory. In S. Kuczaj (ed.), Language development: Syntax and semantics, 159–199.
Hillsdale, NJ: Erlbaum.
Nelson, K. E. (2006). A dynamic tricky mix theoretical perspective on educational
interventions to support the acquisition of L1 and L2 spoken languages, sign
languages, text, and art. Studies in Language Sciences, 5, 91–102.
Nelson, K. E., & Arkenberg, M. E. (2008). Language and reading development reflect
dynamic mixes of learning conditions. In M. Moody & E. R. Silliman (eds.). Brain,
behavior, and learning in language and reading disorders. New York: The Guil-
ford Press.
Nersessian, N. J. (2008). Creating scientific concepts. Cambridge, MA: MIT Press.
New Explanatory Dictionary of Russian Synonyms, NEDRS (2006) [Novyj objas-
nitel’nyj slovar’ sinonimov]. ed. by Apresyan, J. D. Moscow: Jazyki slavyanskih
kul’tur.
References 449
Newell, A. (1974). A framework for representing knowledge. MIT Lab, Memo #306.
Newen, A. & Bartels, A. (2007). Animal minds and the possession of concepts.
Philosophical Psychology, 20(3), 283–308.
Nicolich, L. McCune (1977). Beyond sensorimotor intelligence: Assessment of sym-
bolic maturity through analyais of pretend play. Merril Palmer Quarterly, 23,
89–101.
Nieuwland, M.S., & van Berkum, J.J. (2006). When peanuts fall in love: N400 evidence
for the power of discourse. Journal of cognitive neuroscience, 18(7), 1098–1111.
Nikolaeva, T. M. (1990). About the principle of non cooperation and / or the cat-
egories of the social linguistic influence [O príntsype nekooperátsyji i / ili kat-
egórijah sociolingvistícheskogo vozdéjstvija]. In Logical analysis of the language:
inconsistency and abnormality of the text [Logícheskij analiz jazyka: protivore-
chivos’t’ I anomal’nos’t’ texta], 3. Мoscow: Nauka. 225–234.
Niño, D. (2015). Elementos de Semiótica Agentiva. Bogotá: Universidad Jorge Tadeo
Lozano.
Nizamani, A.R., Juel, J., Persson, U., & Strannegård, C. (2015). Bounded cognitive
resources and arbitrary domains. In J. Bieger, B. Goertzel & A. Potapov (eds.), Ar-
tificial general intelligence: 8th International Conference, 166–176. Berlin: Springer.
Noble, W., & Davidson I. (1996). Human evolution, language and mind. Cambridge:
CUP.
Noë, A. (2004). Action in perception. Cambridge, MA: MIT Press.
Noë, A. (2008). Précis of Action in perception: Philosophy and Phenomenological
Research. Philosophy and Phenomenological Research LXXVI no. 3, 660–665.
Norrman, R. (1999). Creating the world in our image: A new theory of love of
symmetry and iconicist desire. In M. Nänny & O. Fischer (eds.), Form miming
meaning: Iconicity in language and literature, 59–82. Amsterdam: John Benjamins.
Nöth, W. (1998). Symmetry in signs and in semiotic systems. Interdisciplinary Jour-
nal for Germanic Linguistics and Semiotic Analysis, 3(1), 47–62.
Nöth, W. (2008). Semiotic foundations of natural linguistics and diagrammatic ico-
nicity. K. Willems & L. DeCuypere (eds.). Naturalness and iconicity in language,
73–100. Amsterdam: John Benjamins.
Nouwen, R. W. F. (2010). What’s in a quantifier? In M. Everaert, T. Lentz, H. de
Mulder, O. Nilsen & A. Zondervan (eds.), The linguistic enterprise, 235–256. Am-
sterdam: Benjamins.
O’Doherty Jensen, K. (2002). Gradient blends: The art of discerning and doing the
appropriate thing. In A. Hougaard & M. Lund (eds.), The way we think, Vol. I,
245–265. Odense: University of Southern Denmark.
O’Doherty Jensen, K. (2003). The contribution of cognitive semantics to the devel-
opment of sociological theory of food culture and food practices. Frederiksberg:
The Royal Veterinary and Agricultural University.
450 References
Osvath, M. (2009). Spontaneous planning for future stone throwing by a male chim-
panzee. Current Biology, 19(5), R190-R191.
Osvath, M., Kabadayi, C., & Jacobs, I. (2014). Independent evolution of similar com-
plex cognitive skills: The importance of embodied degrees of freedom. Animal
Behavior and Cognition, 1(3), 249–264.
Otten, M., & van Berkum, J.J. (2007). What makes a discourse constraining? Compar-
ing the effects of discourse message and scenario fit on the discourse-dependent
N400 effect. Brain Research, 1153, 166–177.
Otten, M., & Van Berkum, J.J. (2008). Discourse-based word anticipation during
language processing: Prediction or priming? Discourse Processes, 45(6), 464–496.
Ozturk, O., Krehm, M., & Vouloumanos, A. (2013). Sound symbolism in infancy: evi-
dence for sound–shape cross-modal correspondences in 4-month-olds. Journal
of Experimental Child Psychology, 114(2), 173–186.
Pais, A. (1982). “Subtle is the Lord …” : the science and the life of Albert Einstein. Ox-
ford: Clarendon.
Palumbo, G. (2009). Key terms in translation studies. New York, NY: Continuum.
Panksepp, J. (2005). Affective consciousness: Core emotional feelings in animals and
humans. Consciousness and Cognition, 14(1), 30–80.
Parise, C. V., Knorre, K., & Ernst, M. O. (2014). Natural auditory scene statistics
shapes human spatial hearing. Proceedings of the National Academy of Sciences,
111(16), 6104–6108.
Parise, E., & Csibra, G. (2012). Electrophysiological evidence for the understanding
of maternal speech by 9-month-old infants. Psychological Science 23(7), 728–733.
Parkinson, C., Liu, S., & Wheatley, T. (2014). A common cortical metric for spatial,
temporal, and social distance. Journal of Neuroscience, 34, 1979–1987.
Parthemore, J. & Whitby, B. (2014). Moral agency, moral responsibility, and artifacts:
What existing artifacts fail to achieve (and why), and why they, nevertheless,
can (and do!) make moral claims upon us. International Journal of Machine Con-
sciousness, 6(2), 141–160.
Parthemore, J. (2011a). Concepts Enacted: Confronting the Obstacles and Paradoxes
Inherent in Pursuing a Scientific Understanding of the Building Blocks of Human
Thought. DPhil thesis, University of Sussex, Falmer, Brighton, UK. Available from
http://www.sussex.ac.uk/Users/jep25/papers/thesis/thesis-a4.pdf.
Parthemore, J. (2011b). Of boundaries and metaphysical starting points: Why the
extended mind cannot be so lightly dismissed. Teorema, 30(2), 79–94.
Parthemore, J. (2013). The unified conceptual space theory: An enactive theory of
concepts. Adaptive Behavior, 21, 168–177.
Parthemore, J. (2014). Conceptual change and development on multiple time scales:
From incremental evolution to origins. Sign System Studies, 42, 193–218.
452 References
Peirce, C.S. (1998 [1903]). What Makes a Reasoning Sound? In: Peirce Edition Pro-
ject. The Essential Peirce. Vol. 2, 242–257. Bloomington: Indiana University Press
Pelkey, J. (2011). Dialectology as dialectic: Interpreting Phula variation. Berlin: De
Gruyter Mouton.
Pelkey, J. (2013a). Analogy, automation and diagrammatic causation: The evolution
of Tibeto-Burman *lak. Studies in Language, 37(1), 144–195.
Pelkey, J. (2013b). Chiastic antisymmetry in language evolution. The American Jour-
nal of Semiotics, 29(1–4), 39–68.
Pelkey, J. (2013c). Cognitive chiasmus: Embodied phenomenology in Dylan Thomas.
Journal of Literary Semantics, 42(1), 79–114.
Pelkey, J. (2013d). Nonlinear process in Peirce: “The end second the middle third.”
In K. Haworth, A. Johnson & L. Sbrocchi (eds.), Semiotics 2012: Semiotics and the
New Media, 77–85. Ottawa: Legas.
Pelkey, J. (2014). Iconic legisigns and the embodied X. Semiotics 2013: Yearbook of
the Semiotic Society of America, 303–316. Ottawa: Legas.
Pelletier, F. J. (2009). A philosophical introduction to mass nouns. In F.J. Pelletier
(ed.), Kinds, things and stuff, 123–131. Oxford: OUP.
Peña, M., Mehler, J., & Nespor, M. (2011). The role of audiovisual processing in early
conceptual development. Psychological Science, 22(11), 1419–1421.
Penrose, R. (1994). Shadows of the mind: A search for the missing science of conscious-
ness. Oxford: OUP.
Perkins, M. (2007). Pragmatic impairment. Cambridge: CUP.
Perkins, M., Body, R., & Parker, M. (1995). Closed head injury: assessment and re-
mediation of topic bias and repetitiveness. In M. Perkins & S. Howard (eds.), Case
studies in clinical linguistics, 293–320. London: Whurr.
Perniss, P. M. (2007). Achieving spatial coherence in German Sign Language nar-
ratives: The use of classifiers and perspective. Lingua, 117(7), 1315–1338.
Perry, J. (1997). Indexicals and demonstratives. In C. Wright & R. Hale (eds.) Com-
panion to the philosophy of language, 586–612. Oxford: Blackwell.
Peterburs, J., Nitsch, A. M., Miltner, W. H., & Straube, T. (2013). Impaired repre-
sentation of time in schizophrenia is linked to positive symptoms and cognitive
demand. PloS ONE, 8(6), e67615.
Peters, S. & Westerståhl, D. (2006). Quantifiers in Language and Logic. London:
Clarendon Press.
Phrasal Verb Dictionary (2002). Glasgow: Harper Collins Publishers.
Piaget, J. (1954). The construction of reality in the child. New York: Basic Books.
Piaget, J. (1962). Play, dreams and imitation in childhood. New York: Norton.
Piaget, J., & Inhelder, B. (1969). The psychology of the child. New York: Basic Books.
454 References
Prinz, P. M., & Prinz, E. A. (1981). Acquisition of ASL and spoken English by a
hearing child of deaf mother and a hearing fater: Phase II, early combinatorial
patterns. Sign Language Studies 30, 78–88.
Prinz, W., Beisert, M., & Herwig, A. (2013). Action Science Emerging: Introduction
and Leitmotifs. In Prinz, W., Beisert, M., & Herwig, A. (eds.), Action science.
Foundations of an emerging discipline, 1–33. Cambridge, MA: MIT Press.
Proni, G. (2003). Outlines for a semiotic analysis of object. Milan: Milan Polytechnic.
Putnam, H. (1960). Minds and machines. In S. Hook (ed.) Dimensions of mind. New
York: New York University Press.
Pylyshyn, Z. (1989). The role of location indexes in spatial perception: A sketch of
the FINST spatial-index model. Cognition, 32, 65–97.
Pylyshyn, Z.W. (1999). Is vision continuous with cognition? The case for cognitive
impenetrability of visual perception. Behavioral & Brain Sciences, 22(3), 341–423.
Quine, W. V. (1960). Word and object. Cambridge, MA: MIT press.
Quine, W. V. (1969). Ontological relativity and other essays. New York: Columbia
University Press.
Rabipour, S., & Raz, A. (2012). Training the brain: Fact and fad in cognitive and be-
havioral remediation. Brain and Cognition, 79(2), 159–179.
Raby, C., Alexis, D., Dickinson, A., and Clayton, N. (2007). Planning for the future
by Western scrub jays. Nature, 445, 919–921.
Ramenzoni, V. N., Riley, M.A., Davis, T., Shockley, K., & Armstrong, R. (2008). Tuning
in to another person’s action capabilities: perceiving maximal jumping-reach
height from walking kinematics. Journal of Experimental Psychology: Human
Perception and Performance, 34(4), 919.
Ramsay, D., & Campos, J. (1978). The onset of representation and entry into stage
6 of object permanence development. Developmental Psychology, 52, 785–797.
Ramsay, O. B. (1981). Stereochemistry. London: Heyden.
Rathmayr, R. (2009). The pragmatic clichés of the ‘new Russian politeness’: their
usage and estimation [Pragmatícheskije klishé «nóvoj rússkoj vézhlivos’ti»: ih
upotreblénije i otsénka]. In Stereotypes in language, communication and culture
[Stereotipy v jazyka, kommunikatsyji i kul’ture]. Мoscow: RSUH.
Ravitch, D. (2003). The language police. New York: Vintage Books.
Redick, T. S., Shipstead, Z., Harrison, T. L., Hicks, K. L., Fried, D. E., Hambrick, D. Z.,
… Engle, R. W. (2013). No evidence of intelligence improvement after working
memory training: A randomized, placebo-controlled study. Journal of Experi-
mental Psychology: General, 142(2), 359.
Reinhart, T. (1980). Conditions for text coherence. Poetics Today, 1, 161–180.
Reinhart, T. (1981). Pragmatics and linguistics: An analysis of sentence topic. Philos-
ophica, 27, 53–93.
456 References
Reiss, N. (1989), Speech act taxonomy, chimpanzee communication, and the evol-
utionary basis of language. Studies in language origins, 1, 283–304.
Remmers, J. E. (1973). Extra-segmental reflexes derived from intercostal afferents:
Phrenic and laryngeal responses. Journal of Physiology, London, 233, 45–62.
Renn, J. (2005). Einstein’s invention of Brownian motion. Annalen der Physik, 14(S1),
23–37.
Rhodes, G., Peters, M., Lee, K., Morrone, M. C., & Burr, D. (2005). Higher-level
mechanisms detect facial symmetry. Proceedings of the Royal Society B: Biological
Sciences, 272(1570), 1379–1384.
Richards, I. (1971 [1932]). The Philosophy of Rhetoric. New York: OUP
Richardson, D., & Dale, R. (2005). Looking to understand: The coupling between
speakers’ and listeners’ eye movements and its relationship to discourse com-
prehension. Cognitive Science, 29(6), 1045–1060.
Riesenhuber, M., & T. Poggio. (2000). Models of object recognition. Nature Neur-
oscience, 3, 1199–1204.
Roberson, D., Davies, I. R. L., Corbett, G. G., & Vandervyver, M. (2005). Free-sorting
of colors across cultures: Are there universal grounds for grouping? Journal of
Cognition and Culture, 5, 349–386.
Rodemeyer, L. (2006). Intersubjective temporality. Its about time. Dordrecht: Springer.
Roe, P. G. (2004). At play in the fields of symmetry: Design structure and shamanic
therapy in the upper Amazon. In D. K. Washburn & D. W. Crowe (eds.), Sym-
metry comes of age: the role of pattern in culture, 232–303. Seattle: University of
Washington Press.
Rogers, T. T., & McClelland, J. L. (2004). Semantic cognition: A parallel distributed
processing approach. Cambridge, MA: MIT press.
Roland, J. (n.d.). Landmark Supreme Court Decisions. Retrieved January 1, 2011, from
http://www.constitution.org/ussc/usscdeci.htm>
Rosch, E. (1975). Family resemblances: Studies in the internal structure of categories.
Cognitive Psychology, 7, 573–605.
Rosch, E. (1978). Principles of categorization. In E. Rosch and B. B. Lloyd (eds.),
Cognition and categorization, 27–48. Hillsdale, NJ: Lawrence Erlbaum.
Rosch, E. (1999). Principles of categorization. In Margolis, E., & Laurence, S. (eds.),
Concepts: Core readings, 189–206. Cambridge, Massachusetts, USA: MIT Press.
Rosch, E., & Mervis, C. B. (1975). Family resemblances: Studies in the internal struc-
ture of categories. Cognitive Psychology, 7, 573–605.
Roy, I., Copley, B., & McCune, L. (2015). Merge and recursion as generative devel-
opmental processes: Analysis from 3 children’s early combinations. Paper pres-
ented at SRCD March, 2015: Philadelphia.
References 457
Rozin, P., Hormes, J.M., Faith, M.S., & Wansink, B. (2012). Is meat male? A quanti-
tative multimethod framework to establish metaphoric relationships. Journal of
Consumer Research, 39 (3), 629–643.
Ruff, H. A. (1984). Infant’s manipulative exploration of objects: Effects of early age
and object characteristics. Developmental Psychology, 20, 9–20.
Russell, B. (1923). Vagueness. Australasian Journal of Philosophy, 1(2), 84–92.
Ryle, G. (1949). The concept of mind. London: Penguin.
Saji, N., Asano, M., Oishi, M., & Imai, M. (2015). How do children construct the color
lexicon?: Restructuring the domain as a connected system. Proceedings of the 37th
annual conference of the Cognitive Science Society, 2080–2085.
Saji, N., Imai, M., Saalbach, H., Zhang, Y., Shu, H., & Okada, H. (2011). Word learning
does not end at fast-mapping: Evolution of verb meanings through reorganiz-
ation of an entire semantic domain. Cognition, 118(1), 45–61.
Saldert, C., Ferm, U., Bloch, S. (2014), Semantic trouble sources and their repair
in conversations affected by Parkinson’s disease, International Journal of Lan-
guage & Communication Disorders, 49 (6), 710–721.
Sánchez, M. (2005). Morfogénesis del Objeto de Uso: la Forma como Hecho Social de
Convivencia. Bogotá: Universidad de Bogotá Jorge Tadeo Lozano.
Sandford, J. L. (2011). The figure/ground conceptual and concrete spatial relation of
color metaphor. In M. Brdar, M. Omazic, V, Pavicic Takac, T. Gradecak-Erdeljic, &
G. Buljan (eds.), Space and time in language: Language in space and time, 69–78.
Frankfurt-New York: Peter Lang.
Sandford, J. L. (forthcoming) Cognitive entrenchment of color categories and im-
plicit attitudes in English. In M. Uusküla, G. Paulsen, & J. Brindle (eds.). Colour
language and colour categorization. Newcastle upon Tyne: Cambridge Scholar
Publishing.
Sartre, J. (1962 [1948]). The psychology of imagination. New York: Philosophical
Library.
Saussure, F. d. (1976 [1916]). Cours de linguistique générale. Paris: Payot.
Scerri, E. R. (1994). Has chemistry been at least approximately reduced to quantum
mechanics? PSA: Proceedings of the Biennial Meeting of the Philosophy of Science
Association, 1994, 160–170.
Schiffrin, D. (1987). Discourse markers. New York: CUP.
Schmidt, R.C. (2007). Scaffolds for social meaning. Ecological Psychology, 19, 137–151.
Schmitt, N. (2010). Researching vocabulary: A vocabulary research manual. Basing-
stoke: Palgrave Macmillan.
Schoenemann, P. T. (2006). Evolution of the size and functional areas of the human
brain. Annual Review of Anthropology, 35, 379–406.
458 References
Scholl, B. (2007). Object persistence in philosophy and psychology. Mind & Lan-
guage, 22, 563–591.
Schönborn, K. J., & Anderson, T. R. (2006). The importance of visual literacy in the
education of biochemists. Biochemistry and Molecular Biology Education, 34(2),
94–102.
Schütz, A. (1962 [1945]). On multiple realities. In Collected Papers I. The Problem of
Social Reality, 207–259. The Hague: Martinus Nijhoff.
Scott-Phillips, T. (2014). Speaking our mind. London: Palgrave Mcmillan.
Scott-Phillips, T. (2015). Nonhumans primate communication, pragmatics, and the
origins of language. Current Anthropology, 56(1), 56–80.
Searle, J. (1969). Speech acts: An essay in the philosophy of language. Cambridge: CUP.
Searle, J. (1980). Minds, brains, and programs. Behavioral and Brain Sciences, 3(3),
417–458.
Searle, J. (1983). Intentionality: An essay in the philosophy of mind. Cambridge: CUP.
Searle, J. (1992). The rediscovery of the mind. Cambridge, MA: MIT Press.
Searle, J. (1995). The construction of social reality. New York: The Free Press.
Searle, J. (2004). Mind: A brief introduction. Oxford: OUP.
Searle, J., & Vanderveken, D. (1985). Foundations of illocutionary logic. Cambridge:
CUP.
Sebeok, T. A. (1979). The sign and its masters. Austin: University of Texas Press.
Sebeok, T. A., & Danesi, M. (2000). The forms of meaning: Modeling systems theory
and semiotic analysis. Berlin ; New York: Mouton de Gruyter.
Semetsky, I. E. (ed.) (2010). Semiotics – Education – Experience. Boston: Sense Pub-
lishers.
Sereno, M. I. (2014). Origin of symbol-using systems: speech, but not sign, without
the semantic urge. Philosophical Transactions of the Royal Society. B 369: 20130303.
Setiffi, F. (2014). Society’s visible patrimony. A sociological approach to understand-
ing consumption and material culture. Italian Sociological Review, 4(2), 235–251.
Shannon, C., & Weaver, W. (1949). The mathematical theory of communication. Ur-
bana: The University of Illinois Press.
Shapiro, S. C. & Rapaport, W.J. (1987). SNePS considered as a fully intensional
propositional semantic network. In Cercone, N. & McCalla, G. (eds.), The knowl-
edge frontier: Essays in the representation of knowledge. Springer Science and
Business Media.
Shapiro, S. C. (1978). The SNePS semantic network processing system. De-
partment of Computer Science, State University of New York at Buffalo,
http://128.205.32.53/~shapiro/Papers/sha79d.pdf.
References 459
Sharman, J., & O’Brien, R. (1975). The Rocky Horror Picture Show (script), online:
http://www.imsdb.com/scripts/Rocky-Horror-Picture-Show,-The.html.
Sharov, A. (2010). Functional information: towards synthesis of Biosemiotics and
Cybernetics. Entropy 12, 1050–1070.
Sharov, A. (2010). Functional Information: Towards Synthesis of Biosemiotics and
Cybernetics. In Entropy, 12, 1050–1070.
Sharov, A. (2013). Minimal mind. In L. Swan (ed.), Origins of mind, 343–360. Dor-
drecht: Springer.
Sharov, A., & Vehkavaara, T. (2015). Protosemiosis: agency with reduced represen-
tation capacity. Biosemiotics 8(1), 103–123.
Sheets-Johnstone, M. (1999). The primacy of movement. Amsterdam: John Benjamins.
Sheets-Johnstone, M. (2008). The roots of morality. University Park, PA: Penn State
Press.
Sheets-Johnstone, M. (2010). Thinking in movement: Further analyses and vali-
dations. In J. Stewart, O. Gapenne & E.A. Di Paolo (eds.), Enaction: Toward a new
paradigm for cognitive science, 165–181. Cambridge, MA: MIT Press.
Sheets-Johnstone, M. (2012). Fundamental and inherently interrelated aspects of
animation. In A. Foolen, U. Lüdtke, T.P. Racine & J. Zlatev (eds.), Moving ourselves,
moving others. Amsterdam: John Benjamins.
Shimojima, A. (1999). The graphic-linguistic distinction: Exploring alternatives.
Artificial Intelligence Review, 13(4), 313–335.
Short, T. L. (2007). Peirce’s theory of signs. Cambridge: CUP.
Sidman, M., Rauzin, R., Lazar, R., Cunningham, S., Tailby, W., & Carrigan, P. (1982).
A search for symmetry in the conditional discriminations of rhesus monkeys,
baboons, and children. Journal of the Experimental Analysis of Behavior, 37, 23–44.
Simões, A. (2002). Dirac’s Claim and the Chemists. Physics in Perspective, 4(3),
253–266.
Simons, J. S., Owen, A. M., Fletcher, P. C., and Burgess P. W. (2005). Anterior pre-
frontal cortex and the recollection of contextual information, Neuropsychologia,
43(12), 1774–1783.
Sinclair, J. (1991). Corpus, concordance, collocation (Vol. 1). Oxford: OUP.
Sinclair, J. (2004). Trust the text: Lexis, corpus, discourse. London: Routledge.
Sinha, C., & Gärdenfors, P. (2014). Time, space, and events in language and cog-
nition: a comparative view. Annals of the New York Academy of Sciences, 1326(1),
72–81.
Sinha, C., & Rodríguez, C. (2008). Language and the signifying object: From con-
vention to imagination. In J. Zlatev, T.P. Racine, C. Sinha & E. Itkonen (eds.),
The shared mind: Perspectives on intersubjectivity, 357–378. Amsterdam: John
Benjamins.
460 References
Stec, K., & Sweetser, E. (2013). Managing multiple viewpoints: Coordinating em-
bedded perspective in multi-modal narratives. http://www.ualberta.ca/~iclc2013/
ABSTRACTS/Stec_et_al.pdf
Steen, G. J. (2011). The contemporary theory of metaphor — now new and improved!
Review of Cognitive Linguistics, 9(1), 26–64.
Stern, D. N. (1985). The interpersonal world of the infant. New York: Basic Books.
Stewart, J. (1995). Cognition = life: Implications for higher-level cognition. Behav-
ioural Processes, 35(1–3), 311–326.
Stewart, J., Gapenne, O., & Di Paolo, E. (eds.) (2010). Enaction. Toward a new para-
digm for cognitive science. Cambridge: MIT Press.
Stiles, J. (2001). Neural plasticity and cognitive development. Developmental Neur-
opsychology, 18, 237–272.
Stiles, J., & Jernigan, T. L. (2010). The basics of brain development. Neuropsychologi-
cal Review, 4, 327–348.
Stjernfelt, F. (2000). Diagrams as a centerpiece of a Peircean epistemology. Trans-
actions of the Charles S. Peirce Society, 36(3), 357–384.
Stjernfelt, F. (2007). Diagrammatology: An investigation on the borderlines of phenom-
enology, ontology, and semiotics. Dordrecht: Springer.
Stjernfelt, F. (2014). Natural propositions: The actuality of Peirce’s doctrine on dicisigns.
Boston: Decent Press.
Stocker, K. (2014). The theory of cognitive spacetime. Metaphor and Symbol, 29(2),
71–93.
Strannegård, C., Engström, F., and Nizamani, A.R. (2013). Reasoning about truth
in first-order logic. Journal of Logic, Language and Information, 22(1), 115–137.
Strecker, I. A. (2011). Ethnographic chiasmus: essays on culture, conflict, and rhetoric.
East Lansing: Michigan State University Press.
Strecker, I. A. (2014). Chiasmus and metaphor. In B. Wiseman & A. Paul (eds.),
Chiasmus and culture, 69–88. Oxford: Berghahn Books.
Strecker, I. A., & Tyler, S. A. (2009). Culture and rhetoric. New York: Berghahn Books.
Stringer, C. (2012). Lone survivors: How we came to be the only humans on earth.
London: Times Books.
Suddendorf, T., & Corballis, M. C. (2007). The evolution of foresight: What is mental
time travel, and is it unique to humans? Behavioral and Brain Sciences, 30(03),
299–313.
Sušac, V. (2007). Konceptualna metafora u političkom govoru (Conceptual metaphor
in political discourse). Unpublished doctoral thesis: University of Zadar.
Sušac, V. (2010). Cognitve corpus analysis – A semiotic approach to cultural identity.
In H. Shröder & U. Bock (Eds): Semiotische Weltmodelle. Berlin: LIT.
References 463
Thelen, E., & Smith, L. (1994). A dynamic systems approach to the development of
cognition and action. Cambridge, MA: MIT Press.
Thelen, E., Corbetta, D., & Spencer, P. (1996). The development of reaching during
the first year: The role of movement speed. Journal of Experimental Psychology:
Human perception and performance, 22, 1059–1076.
Thelen. (1989). Self-organization in developmental processes: Can systems ap-
proaches work? In M. R. Gunnar & E. Thelen (eds.), The Minnesota symposia
on child psychology: Volume 22. Systems and development, 77–117. Hillsdale, NJ:
Lawrence Erlbaum.
Thompson, E. (2007). Mind in life: Biology, phenomenology, and the sciences of mind.
Cambridge, MA: Harvard University Press.
Thompson, E. and Stapleton, M. (2009). Making sense of sense making: Reflections
on enactive and extended mind theories. Topoi, 28(1), 23–30.
Tinbergen, N. (1952). “Derived” activities: Their causation, biological significance,
origin, and mancipation during evolution. The Quarterly Review of Biology, 27,
1–32.
Tomasello, M. (1997). The pragmatics of word learning. Cognitive Studies: Bulletin
of the Japanese Cognitive Science Society, 4, 59–74.
Tomasello, M. (1999). The cultural origins of human cognition. London: Harvard
University Press.
Tomasello, M. (2003). Constructing a language: A usage-based theory of language
acquisition. Cambridge, MA: Harvard University Press.
Tomasello, M. (2008). Origins of human communication. Cambridge, MA: MIT Press.
Tønnessen, M. (2015). The Biosemiotic glossary project: Agent, agency. Biosemiotics
8(1), 125–143.
Tønnessen, M., & Beever, J. (2014). Beyond sentience: Biosemiotics as foundation
for animal and environmental ethics. In J. Hadley & E. Aaltola (eds.): Animal
ethics and philosophy: Questioning the orthodoxy, 47–62. London: Rowman &
Littlefield International.
Tononi, G. (2008). Consciousness as integrated information: A provisional mani-
festo, The Biol. Bull., 215(3), 216–242.
Treder, M. S. (2010). Behind the looking-glass: A review on human symmetry per-
ception. Symmetry, 2(3), 1510–1543.
Trevarthen, C. (1979). Communication and cooperation in early infancy: A de-
scription of primary intersubjectivity. In M. Bullowa (ed.), Before Speech, 321–347.
Cambridge: CUP.
Tsakiris, M., S. Schütz-Bosbach & Gallagher, S. (2007). On agency and body-
ownership: Phenomenological and neurocognitive reflections. Consciousness
and Cognition, 16, 645–660.
References 465
Tulving, E. (2002). Episodic memory: From mind to brain. Annual Review of Psy-
chology, 53, 1–25.
Turing, A. (1950). Computing machinery and intelligence. Mind, 59(236), 433–460.
Turing, A. M. (1952). The chemical basis of morphogenesis. Philosophical Trans-
actions of the Royal Society of London. Series B, Biological Sciences, 237(641), 37.
Turner, M. (1991). Reading minds: The study of English in the age of cognitive science.
Princeton, N.J.: Princeton University Press.
Turner, V. (1987). The anthropology of performance. New York, NY: PAJ Publications.
Twigg, J. (1984). Vegetarianism and the meanings of meat. In A. Murcott (ed.), The
sociology of food and eating, 18–30. Aldershot: Gower.
Tyler, C. W. (1995). Empirical aspects of symmetry perception. Spatial Vision, 9(1),
1–8.
Tyler, S. (2014). Chiasmi figuring difference. In B. Wiseman & A. Paul (eds.), Chias-
mus and culture, 116–123. Oxford: Berghahn Books.
Uexküll, J. von (1928). Theoretische Biologie (second edition). Berlin: Julius Springer.
Uexküll, J. von (1957). A stroll through the worlds of animals and men. In C. Schiller
(ed. and transl.), Instinctive behavior: The development of a modern concept, 5–80).
New York: International Universities Press.
Umilta, M.A., Kohler, E., Gallese, V., Fogassi, L., Fadiga, L., Keysers, C., and Rizzol-
atti G. (2001), I know what you are doing. A neurophysiological study, Neuron,
31(1), 155–165.
Ursini F.-A. (2011). Space and the language-vision interface: a model-theoretic ap-
proach. Biolinguistics, 5, 170–225.
Vachek J. (1966). The linguistic school of Prague: An introduction to its theory and
practice. London: Bloomington
Valkman, Y. (2012). Cognitive Semiotics in the processes of the Creative Thinking
[Kognitivnaya Semiotika v processah obraznogo myshleniya] // Problems of
Computer Intellectualization. 349–358.
van Berkum, J.J., Brown, C.M., Zwitserlood, P., Kooijman, V., & Hagoort, P. (2005).
Anticipating upcoming words in discourse: evidence from ERPs and read-
ing times. Journal of Experimental Psychology: Learning, Memory, and Cogni-
tion, 31(3), 443.
van Berkum, J.J., van Den Brink, D., Tesink, C.M., Kos, M., & Hagoort, P. (2008).
The neural integration of speaker and message. Journal of Cognitive Neuros-
cience, 20(4), 580–591.
van Berkum, J.J., Zwitserlood, P., Hagoort, P., & Brown, C.M. (2003). When and how
do listeners relate a sentence to the wider discourse? Evidence from the N400
effect. Cognitive Brain Research, 17(3), 701–718.
466 References
van der Spek, T. M. (2006). Selling a theory: The role of molecular models in J. H.
van’t Hoff’s Stereochemistry Theory. Annals of Science, 63(2), 157–177.
Van Dijk, T.A. (1988). Ideology: a multidisciplinary approach. London: Sage.
van Driel, J. H., & Gräber, W. (2002). The Teaching and Learning of Chemical Equi-
librium. In J. K. Gilbert, O. De Jong, R. Justi, D. F. Treagust, & J. Van Driel (eds.),
Chemical education: Towards research-based practice, 271–292. Kluwer Academic
Publishers.
van Houten, J. (2001). A century of chemical dynamics traced through the Nobel
Prizes. 1901: Jacobus van’t Hoff. Journal of Chemical Education, 78(12), 1570–1573.
van Houten, J. (2002). A century of chemical dynamics traced through the Nobel
prizes. 1909: Wilhelm Ostwald. Journal of Chemical Education, 79(2), 146–148.
van Lancker Sidtis, D. (2012a), Two-track mind: Formulaic and novel language sup-
port a dual-process model, Advances in the neural substrates of language: Toward
a synthesis of basic science and clinical research, 342–367, Blackwell Publishing
London, UK.
van Lancker Sidtis, D. (2012b), Formulaic language and language disorders, Annual
Review of Applied Linguistics, 32, 62–80.
van Lancker, D. (1987), Nonpropositional speech: Neurolinguistic studies, Progress
in the psychology of language, 3, 49–118.
van Lancker, D. (2006), Where in the brain is nonliteral language?, Metaphor and
Symbol, 21(4), 213–244.
van Lancker-Sidtis, D., and Rallon, G. (2004), Tracking the incidence of formulaic
expressions in everyday speech: methods for classification and verification,
Language & Communication, 24(3), 207–240.
van Lancker-Sidtis, D., Canterucci, G., Katnelson, D. (2009), Effects of neurological
damage on production of formulaic language, Clinical Linguistics Phonology,
23(4), 274–284.
van Lier, H. (2003). Around Homo in 80 theses: A fundamental anthropogeny. In
M. Lo (ed.), Pierre Lottefier (trans.). Anthropogénie Un Darwinisme Des Sciences
Humaines Homo Est Le Primate Anguleux. Brussels: Anthropogenie.com.
van Lier, L. (2002). An ecological-semiotic perspective on language and linguistics.
In C. Kramsch (ed.) Language acquisition and language socialization. Ecological
perspective, 140–164. London: Continuum.
van ‘t Hoff, J. H. (1878). The role of imagination in science. Translated in: Benfey
(1960).
van ‘t Hoff, J. H. (1896). Studies in chemical dynamics Amsterdam: Amsterdam Boek
en Steendrukkereij.
Vanhaeren, M., d’Errico, F., Stringer, C., James, S.L., Todd, J.A., Mienis, H.K. (2006).
Middle Paleolithic shell beads in Israel and Algeria. Science, 312, 1785–1788.
References 467
Zlatev, J. (2005). What’s in a schema? Bodily mimesis and the grounding of language.
In B. Hampe (ed.), From perception to meaning: Image schemas in cognitive lin-
guistics, 313–343. Berlin: Mouton de Gruyter.
Zlatev, J. (2007a). Language, embodiment and mimesis. In T. Ziemke et al (2007),
297–337.
Zlatev, J. (2007b). Intersubjectivity, mimetic schemas and the emergence of language,
Intellectica, 46/47, 123–152.
Zlatev, J. (2007c). Spatial semantics. In D. Geeraerts & H. Cuyckens (eds.), The Oxford
handbook of cognitive linguistics, 317–343. Oxford: OUP.
Zlatev, J. (2008a). The co-evolution of intersubjectivty and bodily mimesis. In J. Zla-
tev, T. Racine, C. Sinha, & E. Itkonen (eds.), 215–244.
Zlatev, J. (2008b). From proto-mimesis to language. Evidence from primatology and
social neuroscience. Journal of Physiology-Paris, 102, 137–152.
Zlatev, J. (2009a). The semiotic hierarchy: Life, consciousness, signs and language.
Cognitive Semiotics, 4, 169–200.
Zlatev, J. (2009b). Levels of meaning embodiment and communication. Cybernetics &
Human Knowing, 16, 149–200.
Zlatev, J. (2010). Phenomenology and cognitive linguistics. In S. Gallagher and
D. Schmicking, (eds.), Handbook of phenomenology and cognitive science, 415–443.
Dordrecht: Springer.
Zlatev, J. (2011a). From cognitive to integral linguistics and back again. Intellectica,
56(2), 125–147.
Zlatev, J. (2011b). What is cognitive semiotics? SemiotiX XN-6.
Zlatev, J. (2012). Cognitive semiotics: An emerging field for the transdisciplinary
study of meaning. Public Journal of Semiotics, 4(1), 2–24.
Zlatev, J. (2013). The Mimesis Hierarchy of semiotic development: Five stages of
intersubjectivity in children. Public Journal of Semiotics, 4(2), 47–70.
Zlatev, J. (2014a). Image schemas, mimetic schemas and children’s gestures. Cog-
nitive Semiotics, 7(1), 3–29.
Zlatev, J. (2014b). Human uniqueness, bodily mimesis and the evolution of language.
Humana.Mente. Journal of Philosophical Studies, 27, 197–219.
Zlatev, J. (2015a). Cognitive semiotics. In Trifonas, P. (ed.), International handbook
of semiotics, 1043–1067. Dordrecht, Netherlands: Springer.
Zlatev, J. (2015b). The semiotic hierarchy revised: From life to language. Plenary
lecture at The Tartu Semiotics Summer School, June 2015.
Zlatev, J., & M. Andrén (2009). Stages and transitions in children’s semiotic devel-
opment. In J. Zlatev, M. Andrén, C. Lundmark and M. Johansson Falck (eds.)
Studies in language and cognition, 380–401. Newcastle: Cambridge Scholars.
References 471
Zlatev, J., & McCune, L. (2014). Toward an integrated model of semiotic devel-
opment. In R. Chen (ed.), Cognitive development: Theories, stages, processes and
challenges, 59–76. New York: Nova Publisher.
Zlatev, J., Madsen, E.A., Lenninger, S., Persson, T., Sayehli, S., Sonesson, G., & Weijer,
J.v.d. (2013). Understanding communicative intentions and semiotic vehicles by
children and chimpanzees. Cognitive Development, 28, 312–329.
Zlatev, J., Racine, T, Sinha, C. and Itkonen, E. (eds.) (2008) The shared mind: Perspec-
tives on intersubjectivity. Amsterdam: Benjamins.
Zwaan, R.A., & Radvansky, G.A. (1998). Situation models in language comprehen-
sion and memory. Psychological bulletin, 123(2), 162.
List of Contributors
Ines Adornetti is Temporary Lecturer of Philosophy of Language at Roma Tre
University. Department of Philosophy, Communication and Visual Arts, Roma Tre
University, via Ostiense 234, 00146 Rome, Italy. ines.adornetti@gmail.com
Maíra Avelar is Professor in Linguistics, and coordinator of Cognition and Lan-
guage Studies Lab, with research focus on metaphor and mutimodality.Universidade
Estadual do Sudoeste da Bahia, Programa de Pós-graduação em Linguística, Estrada
do Bem-querer, km4,s/n, Bairro Universitário 45031–300 Vitória da Conquista-BA
Brasil. mairavelar@uesb.edu.br
Gisela Bruche-Schulz is currently an independent scholar, formerly at Hong Kong
Baptist University, with research focus on thought, language, and the validating of
meaning. Mueggelstr. 21A, 10247 Berlin, Germany. gibrushu@gmail.com
Marco Bagli is PhD candidate in Cognitive Linguistics at the University of Peru-
gia, whose research concentrates on metaphors and the conceptualization of the
senses. C/o the Umbra Institute, P.zza IV Novembre 23, 06123 Perugia (PG), Italy.
marbagli@gmail.com
Per Aage Brandt is Professor in Semiotics with a long career of research in lan-
guage and meaning, and one of the founders of cognitve semiotics. Department of
Cognitive Science, Case Western Reserve University, Cleveland, OH 44106–7063,
USA. pab18@case.edu
Michael Carnevale is Research Assistant with the Perceptual Artifacts Lab,
working with psychophysical methods, multisensory integration, and perceptual
illusions. OCAD Faculty of Design, OCAD University, 100 McCaul Street, Toronto,
Ontario, Canada, M5T 1W1.
Alessandra Chiera is a post-doctoral researcher, with interest in the psychological
basis of human communication and its relationship with consciousness and mental
imagery. Department of Philosophy, Communication and Visual Arts, Roma Tre
University, via Ostiense 234, 00146 Rome, Italy. ale.chiera@gmail.com
Peter Coppin is Assistant Professor whose research seeks to improve information
displays through a better understanding of how designs make use of perceptual-
motor capabilities. Faculty of Design, OCAD University, 100 McCaul Street, Toronto,
Ontario, Canada, M5T 1W1. pcoppin@faculty.ocadu.ca
Geert Dardanelle is Associate Professor of biochemistry in the Section for Biomol-
ecular Sciences at the Department of Biology, University of Copenhagen, Denmark.
dandanell@bio.ku.dk
474 List of Contributors
Centre for Languages and Literature, Lund University, Box 201 221 00 Lund, Sweden.
goran.sonesson@semiotik.lu.se
Vlado Sušac is Assistant Professor and Head of Department of Tourism and Com-
munication Studies with research in semiotics, advertising and English for specific
purposes. University of Zadar, Department of Tourism and Communication Studies,
ul. dr. F. Tudjmana 24 i, 23000 Zadar, Croatia. vsusac@unizd.hr
Morten Tønnessen is Associate Professor of philosophy at University of Stavanger,
and Co-Editor-in-Chief of Biosemiotics. Department of Social Studies, University of
Stavanger, 4036 Stavanger. morten.tonnessen@uis.no
Francesco-Alessio Ursini is Senior Lecturer in English Linguistics and Literature
with a specialization in spatial language and cognition. Department of Education
and Communication (HLK), Jönköping University, Gjuterigatan 5 Box 1026 551 11
Jönköping, Sweden. Francesco.Ursini@ju.se
Jordan Zlatev is Professor in General Linguistics, and editor of Public Journal of
Semiotics, with research focus on language, meaning and consciousness. Centre
for Languages and Literature, Lund University, Box 201 221 00 Lund, Sweden.
jordan.zlatev@semiotik.lu.se
Index: Subject
D F
Design: 71, 152, 240–241, 244, 246–248, Fossil: 18, 164–167, 169, 203
268, 240–243, 279–287, 295, 392–393,
404 G
– of artifacts: 13–14, 21, , 83–99 Geometry: 248
– experimental: 23, 25, 33–34, 157, Gesture: 16, 18–20, 24, 128, 130, 134,
197, 262–265 141–142, 163, 172, 201, 201, 213,
Diagram: 10, 115–117, 237, 270, 281, 217–226, 229, 242, 236–237, 255, 332,
281–282, 288, 291, 324, 331, 394, 402 336–346, 363–365, 396
Distributed cognition: 122 Graphic: 22, 247, 279, 281, 283, 284–295
Drawing: 218, 260, 265, 266–270 Ground: 130, 145, 149, 158, 192–193,
198, 206, 212, 240,-241, 248, 264,
E 301–302, 326,-329, 374, 408
Embodiment: 7, 27, 45–46, 81, 124, 144,
241–242, 316 H
Empathy: 249 Hierarchy: 27, 57, 59–60, 128, 214
Enaction: 27, 81, 92–95, 24 History: 12, 14, 40, 47–48,
Enlightenment: 6–7, 27, 31 70, 83,101,105, 109–110,
Episodic memory: see Memory 116–117,122–123,137–139, 189,
Evolution: 10–11, 14, 16–17, 26, 56, 59, 193,239, 241, 290,300, 312, 357,409
125, 155, 161, 164–165, 170–171, 180, Holophrasis: 200
186, 189, 193, 196, 200–202, 204, 212, Homo (genus): 16, 200
214, 226, 242–243
Index: Subject 479
I M
Icon (types): 129, 151, 239–241, 286, Memory: 38, 42, 44, 87, 97, 99, 132, 167,
292 181, 203, 207, 241, 243, 256–258, 263,
Iconic: 20, 22, 58, 163–164, 211–215, 265, 333, 358, 388
281, 286, 289–295, 398 – episodic: 90, 171, 186
Iconicity: 39, 135, 164, 173, 211, 215, – semantic: 90, 379, 386–389
222, 286, 289 – traces: 288–290, 388
Imitation: 128, 130, 163, 202, 212, 215 – working: 90, 248, 209, 395
Index: 130, 242, 286 Mental image: 381
Indexicality: 256–259 Metaphor: 64–68, 118, 319–329,
Intention 332–347, 350–362
– meaning: 34–38 – “computer metaphor”: 12, 48,
– representational: 129, 134 50–51, 58, 63–64
– communicative: 177–184, 202, 392 – conceptual: 24, 64–65, 117, 186,
– motor: 195, 30 317, 319–321, 332–336, 350–354,
Intentionality: 21, 40, 43, 54, 79, 262 357, 362, 364–365, 373
Intersubjectivity: 21, 35, 189, 194, – gestural: 219, 221, 338–346
263–264, 299 – linguistic: 64–65, 320–321
– performative: 25–26, 363–377
L – primary: 25, 357
Language – visual: 65
– acquisition: 10, 15, 127–141, Metonymy: 221
145–159, 170–171, 388 Mimesis: 23, 202, 215, 242, 377
– evolution: 16–18, 175–188, – Aristotle’s: 215–217
189–197, 199–208 – bodily: 19, 128, 136, 211, 213–214
– formulaic: 17, 199, 201, 203–209 – gestural: 19, 211–226
– institutional: 299–316 Mimesis hierarchy: 214
– meaning: 12, 23, 197, 283, 380–389, Mimetic: 164, 225
393 – practice: 364–377
– pathology: 177, 207–208 – process: 128, 136, 215, 225
– poetic: 216 – modes: 216–224
– signed: 164, 172, 215 – objects: 216–224
– syntax: 27, 176, 391–408 – schemas: 19, 211–213, 215, 217,
– system: 102–104, 318 222, 225, 263
– use: 34–35, 299, 305, 335 – words: 153
Lifeworld: 60, 240 Mind: 58, 300, 393–406
Linguistics: – computational: 190
– cognitive: 19, 23, 27, 35, 103, 317, – embodied: 31
320, 347, 393, 400 – philosophy of: 47, 56, 58, 408
– functional: 102–103, 117 – travel: 175
– generative: 177, 190, 393 Multimodal: 95, 122, 172, 229–230
– structural: 83–84, 103–104, – communication: 103, 365
318–319, 406
480 Index: Subject
Semiosis: 13, 62, 69, 72–76, 80–81, 85, 281–283, 285–286, 292–293, 301, 310,
261, 328, 335, 377, 399–400 332–333, 346, 349–350, 353, 355–356,
Semiotic development: 15, 128, 358–359, 361, 379, 391
130–132, 189 Structuralism: 318
Semiotic resources: 18–20, 225, 331 Symbol: 15–16, 20, 22, 27, 56, 84, 113,
Semiotics: 9–14, 16, 24, 31, 33–34, 128–130, 134–141, 145, 148–149,
39, 50, 52, 67, 70–71, 83–86, 89, 91, 154–156, 158–159, 161–175, 190, 200,
96, 98–99, 101–104, 122–123, 127, 239, 242, 255–257, 267, 274–275, 283,
129–131, 181, 229–230, 239, 251–252, 285–286, 289–290, 292, 376, 382, 384
254–255, 281, 286, 289–290, 294, 315, Symbol grounding: 15, 145, 148–149,
317–319, 362 154, 158–159
– of science: 14, 101–103, 105, 109, Symbolic: 10, 15, 20, 22, 58, 61, 67, 83,
122–123 127–131, 133–137, 139, 141–142, 150,
Sensory: 18, 21–22, 31, 54, 65, 71, 78, 153, 159, 161, 163–164, 167–174, 214,
95, 148, 190, 194, 256, 258–259, 263, 239–241, 270–271, 281, 289–290,
269, 271, 273,-274, 287, 289,-292, 294, 292–295, 299, 309–310, 316, 377
369, 376 – language: 128
Sensory-Motor: 189, 194–196, 198 – representations: 58, 169, 190, 292,
Sign: 10, 14–16, 19–21, 27–28, 34–38, 295
52, 59–60, 65, 69–70, 72–73, 75–76, Symbolicity: 290
80–82,84, 86, 102, 122, 127–133, 135, Symmetry: 19–20, 84, 243–246
137, 139, 153, 164, 170–172, 211, 213,
215, 226–231, 233, 236–240, 242, 251, T
253–259, 262, 268, 271–272, 286, Taxonomy: 95–97
317–320, 356, 376, 395–396, 400 Temporal: 17, 35, 37, 40–45, 74, 127,
Similarity: 110, 129, 139, 146, 151, 286, 183–187, 194, 216, 256, 289, 301, 310,
289, 366, 368, 373 313, 318, 392–394
Simulation: 11–12, 22–23, 27, 31, Text: 16, 21, 23, 25, 52, 97, 176,
33–35, 38–39, 44–45, 123, 129, 178–179, 182, 228–229, 261–262,
287–292, 295, 394, 396–397, 403, 408 264–268, 272–275, 277, 279–285, 288,
Situatedness: 31, 50, 86–87 293, 295, 304–306, 309, 315, 319, 376,
Skill: 16–18, 42, 90, 97, 140–141, 163, 382, 399, 400
175, 184, 188, 196, 200, 214, 271, 309, Time: 11, 14, 17, 21, 34, 40, 43, 63, 65,
370, 377 86, 94–95, 118, 127, 131, 138–139,
Sound symbolism: 15, 149–152 158, 167, 173, 175, 185–188, 191, 249,
Space: 17, 21, 57, 104, 110, 131, 138, 252, 261–262, 267, 277, 301, 331, 337,
153, 159, 175, 186–188, 196–197, 212, 340, 346–347, 359, 366, 392, 394–395,
220–221, 237–238, 244, 248–249, 397, 402, 406, 408
264–265, 267, 269, 270–274, 283–284, – travel: 14, 17, 63, 175
328, 331, 333–337, 339–340, 346–347, Translation: 22, 49, 113–114, 222, 244,
349–350, 352,-353, 355–357, 361–362, 265, 267, 275, 279, 281, 283–284, 293,
367, 369, 393–394, 402, 408 327, 343, 388, 406
Spatial: 25, 84, 110–111, 147, 171,
186–187, 261, 264, 269, 271, 273–274,
482 Index: Subject
Transdiciplinary: 52, 122, 230, 251, 253, 166–170, 207, 217, 240, 243–244,
379 246–249, 255, 263, 277, 279, 281–282,
Typology: 26, 379 285–286, 291,-295, 319, 331, 350,-351,
361, 379–381, 383–386, 388
U Verbal: 21, 24, 34–36, 40, 67, 84, 91, 136,
Umwelt: 60, 70, 74, 78–81, 240 145, 178–180, 187, 199, 207–208, 212,
Upper Palaeolithic: 165–166, 173 227–230, 233–238, 263, 270, 317, 319,
328, 332, 337–339, 347, 367, 381, 402
V Vocabulary:103, 146–148, 150, 154, 171,
Visual: 19–22, 25–26, 39, 52, 55, 65, 67, 229
95, 110, 138, 148–149, 151, 163–164, Vocalization: 15, 128, 130, 135–141, 201
Jordan Zlatev / Göran Sonesson /
Piotr Konderak (eds.)