Professional Documents
Culture Documents
Architecture of Narrative Memory
Architecture of Narrative Memory
Available at www.sciencedirect.com
ScienceDirect
RESEARCH ARTICLE
Department of Software Engineering and Artificial Intelligence, Computer Science Faculty, Universidad Complutense de
Madrid, Spain
Received 1 March 2016; received in revised form 31 March 2016; accepted 5 April 2016
KEYWORDS Abstract
Narrative; Narrative is ubiquitous. According to some models, this is due to the hypothesis that narrative is
Cognitive architecture;
not only a successful way of communication, but a specific way of structuring knowledge. While
Episodic memory;
most cognitive architectures acknowledge the importance of narrative, they usually do so from
Semantic memory;
Procedural memory; a functional point of view and not as a fundamental way of storing material in memory. The
Script presented approach takes one step further towards the inclusion of narrative-aware structures
in general cognitive architectures. In particular, the presented architecture studies how
episodic memory and procedures in semantic memory can be redefined in terms of narrative
structures. A formal definition of narrative for cognition and its constituents are presented,
and the functions that an implementation of the architecture needs are described. The relative
merits and the potential benefits with regard to general cognitive architectures are discussed
and exemplified.
Ó 2016 The Author. Published by Elsevier B.V. This is an open access article under the CC BY
license (http://creativecommons.org/licenses/by/4.0/).
http://dx.doi.org/10.1016/j.bica.2016.04.002
2212-683X/Ó 2016 The Author. Published by Elsevier B.V.
This is an open access article under the CC BY license (http://creativecommons.org/licenses/by/4.0/).
20 C. León
This work does not dig into the physiological aspects the corresponding examples are explained in Section ‘Exam
causing narrative behavior, not it does into the source of ples’. Sections ‘Discussion’ and ‘Conclusions and future
this phenomenon. Whether narrative happens in cognition work’ provide discussion and conclusions, respectively.
because of learning or because of innate capabilities is,
from the point of view of the functional description of the
average human cognition, not strictly relevant. This work Definition of narratives for the cognitive
focuses on the implications of the hypothetical narrative architecture
characteristics of some functions of human cognition and
not on what produces them. This narrative hypothesis has Along the years, several definitions of what a narrative is
strong implications in the way we understand cognition in have been proposed. All of them have some aspects in com-
humans, and as such a cognitive architecture accepting it mon, but they in general depart from each other when the
must describe its components including narrative properties focus is set on literary aspects. This work, while not leaving
not only with additional modules, but as an in-depth review literature and artistic behavior apart, is centered on a more
of the functional characteristics of more general aspects of general conceptualization of narrative, more in line with
memory and processing. cognitive structures.
To the best of our knowledge, a general architecture for For the sake of this work, the definition of narrative pro-
narrative cognition has not been yet proposed. This is prob- posed by Finlayson will be used (Finlayson & Corman, 2013).
ably due a number of issues. The fact that narrative behav- Briefly, this definition states that a narrative:
ior is certainly complex and many times assumed to be a
literary phenomenon puts it away from the focus of general 1. is a sequence of events,
cognition. Besides, achieving shallow narrative behavior is 2. that are causally related,
doable without explicit models of narrative, since procedu- 3. involving specific characters and times,
ral information about how to build a useful story is probably 4. and displays a certain level of organization beyond the
enough for most practical cases. basic coherence.
However, given the recent advances of cognitive archi-
tectures and the growing interest on narrative and its rela- The particular difference with regard to a plain sequence
tion to general cognition, proposing an architecture for of events is made by point 4, which does actually highlight a
narrative cognition can be relevant and useful in general fundamental property with regard to the proposed architec-
as a first step towards the study of the working hypothesis, ture. This account tries to describe the common activity of
and its influence on practical architectures both for Cogni- narrative behavior, and as such it identifies the particular
tive Science and Artificial Intelligence. characteristics that differentiate it from the simple descrip-
Most modern cognitive architectures are complex enough tion of an event list (which could also be a narrative).
as to consider different types of memory. This paper only The fact that narratives have a certain organization
addresses two main parts of explicit memory, namely episo- beyond pure coherence and order is what makes the defini-
dic and semantic. This is due to practical and theoretical tion useful for the proposed architecture that this work is
aspects. First, analyzing the narrative properties all pro- about to describe. This organization is commonly assumed
posed kinds of memory would require a more extensive to happen at several levels both from the point of view of
work, outside reasonable scope. Second, it is unlikely that structure (discourse, content) and the cognitive effect
all cognitive models of memory present narrative aspects, (focus, salience, inference).
at least as they are conceived in this paper. Therefore, it From here on, this study will be based on the assumption
has been considered illustrative and useful to propose a that:
model of narrative cognition restricted to two main, gener-
ally accepted types of memory where, hypothetically, nar- From a functional point of view, part of human cognition
rative properties can be clearly observed. is structured with narrative properties.
The current proposal makes no claims about the applica- Narratives have certain properties that make them dif-
bility of the proposed solution to other domains, for ferent from plain sequences of events.
instance, non-narrative messages. However, it is hypothe-
sized that some features of this architecture could be
adapted to fit other non-narrative sources of information,
Representation of narratives as cognitive
as discussed in Section ‘Conclusions and future work’.
This paper is organized as follows: Section ‘Definition of objects
narratives for the cognitive architecture’ proposes a formal
definition of narratives suitable as a unit of information in While this architecture for narrative memory offers a
the narrative memory and Section ‘Representation of narra- framework and not a final implementation, it is relevant
tives as cognitive objects’ specifies the constituents of nar- to describe the main properties that a narrative must
rative objects. Section ‘Episodic memory as a narrative address in order to be suitable for the model. This section
memory’ describes an architectural description of episodic proposes a formal definition of narrative information, while
memory as narrative memory, and Section ‘Procedural–se still leaving room for particular instantiations for specific
mantic memory and narrative’ does the same for processes implementations.
stored in semantic memory. As previously pointed out in Section ‘Definition of narra-
An overall architecture of narrative memory is described tives for the cognitive architecture’, the number of defini-
in Section ‘Overall architecture of narrative memory’, and tions of what constitutes a narrative is vast and some
An architecture of narrative memory 21
aspects bring conflict between authors. For the sake of The satellites, again using Chatman’s nomenclature, are
designing a useful architecture, it is mandatory to establish those constituents of a narrative object that are not central
the range of application of the model. In this case, the to the narrative itself, but are connected to the kernel and
architecture contemplates two subsystems of a general cog- needed or useful for a variety of reasons, namely causation,
nitive architecture: the episodic memory and the procedu- contextualization, explanation of the effects or
ral information in the semantic memory, the explicit Let ni be a narrative object describing the event i. Then,
‘‘know-how”, in contrast with the implicit information in ni ¼ ði; ki ; Si ; Ci Þ ð1Þ
the procedural memory. Given that this work focuses on a
subset of the semantic memory, From here on the text will where i is the information to be processed by the narrative
refer to the this subset as procedural–semantic memory. memory (event, procedure), ki is the kernel of the informa-
For simplicity, simple models of both kinds of memory tion Si is the set of satellites of the information, and Ci is the
will be assumed. Episodic memory stores events experi- set containing a number of labeled directed relations
enced by the cognitive entity, and procedural–semantic between the kernel and the satellites.
memory stores explicit information about how to perform Graphically, the relation between satellites and kernel
different activities. The focus is set on functional aspects creates a directed graph that resembles the work by Tra-
and their suitability for a computational implementation. basso et al. regarding narrative causality (Trabasso &
Detail about the particularities of how the model manages Sperry, 1985). Figure 1 depicts an example.
episodic and procedural–semantic memory will be given in
Sections ‘Episodic memory as a narrative memory’ and ‘Pro Narrative memory
cedural–semantic memory and narrative’ respectively. In
general, this paper will from here on refer to narrative A narrative memory, N, is then a set of narrative objects
memory, defined like this: and relations between them, as formally expressed in
Narrative memory is the subset of episodic and semantic Eq. (2).
memory in a cognitive storing information that presents N ¼ ðfo1 ; o2 ; . . . ; om g; fR1 ðoa ; ob Þ; . . . ; Ri ðoy ; oz ÞgÞ ð2Þ
narrative features.
where o1 ; . . . ; om are narrative objects and R1 ðoa ; ob Þ; . . . ;
In the architecture explained in this paper, the narra- Ri ðoy ; oz Þ are relations between these narrative objects.
tive memory is composed by the episodic memory and In Section ‘Narrative objects: Kernels and Satellites’ the
the procedural–semantic memory, as these will be used concepts of narrative kernel and satellite have been
to exemplify the relative improvements that the narrative defined. In the case of external material to be learnt, how-
memory provides. In order to make the model suitable for ever, information is input to our cognitive system which,
architectures in which episodic memory and procedural– ideally, stores everything. It is at this point where the nar-
semantic memory also include non-narrative information, rative memory starts playing a role in cognition.
narrative is assumed to be a subset of these parts of gen- A narrative-aware cognitive architecture takes raw infor-
eral memory. However, it would be possible to simplify the mation as input. This input must be converted to symbols
model and include the whole episodic and procedural and aggregated into events in order to be available for the
memory in it. rest of the cognitive system. Events, however, do not form
The narrative memory handles a certain type of object: full narratives. A sequence of plain events must be restruc-
the narrative object, which is defined next for the purposes tured into a narrative (i.e., narrative properties must be
of this paper: added to the events) to produce narrative objects and full
A narrative object is any cognitive object stored in the narratives from them.
narrative memory, therefor having narrative properties. The architecture for narrative memory defines a function
that accepts a linear sequence of events of processes and
According to the previous definition, all information
stored in or retrieved from the narrative subset of the epi-
sodic and procedural–semantic memory are assumed to be
a narrative object. With these definitions in mind, we can
take one more step and describe what a narrative object
contains. A narrative object includes the information to
be stored, but it is structured in terms of kernels, satellites
and narrative relations.
returns a set of narrative objects. Eq. (3) describes the this Trabasso, 1994). It seems that there are two different kinds
function formally. of narrative inferences: elaborative and necessitated. Elab-
orative inferences are those that, while augmenting the
nð½ev 1 ; ev 2 ; . . . ; ev n Þ ¼ ðfo1 ; o2 ; . . . ; om g;
contextual information of the world described in the narra-
fR1 ðoa ; ob Þ; . . . ; Ri ðoy ; oz ÞgÞ ð3Þ tive, are not strictly needed for pure comprehension. On the
where n is a narrative, ev 1 ; . . . ; ev n are events coming from other hand, necessitated inferences are created to satisfy
an external environment and o1 ; . . . ; om are narrative comprehension requirements. This architecture proposes a
objects that are produced by the function creating the nar- generic definition of the process of identifying kernels and
rative and correspond to the narrative-processed version of their associated satellites based on narrative focus and
the input. The second set in the output contains relations inference.
between different narrative objects, for instance Ri ðoy ; oz Þ A narrative kernel is a salient element of some narrative.
contains the existing relation between the narrative objects Therefore, any narrative event with a particularly high level
oy and oz . The number of events and the number of narra- of salience is a candidate for being encoded as a kernel.
tive objects, n and m in Eq. (3), can be equal but it is not Thus, the architecture assumes that the implementation
necessary, n > m, n ¼ m and n < m are perfectly valid val- will provide a function l assigning a given narrative event
ues. This is so because an episode can be divided in several a probability of being represented as a kernel that is directly
narrative objects, some events in the episode can be dis- proportional to the cognitive salience that the cognitive
carded, and some events can appear in more than one nar- entity experiments, as formalized in Eq. (4).
rative object. The function, then, creates content for a lðeventÞ / cognitive salience of event ð4Þ
narrative memory from external input.
After identifying the most salient events of the input
The function must be able to:
material in the form of narrative, the kernels for the narra-
tive objects are selected. In order to complete them it is
1. Build an appropriate set of kernels, filtering out irrele-
necessary to find the satellites. The model proposes narra-
vant content and identifying important material to build
tive inference to identify them and link them to their corre-
a coherent network of satellites for each kernel.
sponding kernel or kernels.
2. Identify relations between the different kernels and
In order to do so, the model defines a function m that,
build a full narrative using the particular narrative
given a narrative event and a kernel, returns both the prob-
objects as constituents.
ability that the event is linked to the kernel as a satellite,
and the kind of link that makes it part of the narrative
Sections ‘Building the narrative network between Kernel
object. The probability is directly proportional to the
and Satellites’ and ‘Creating relations between kernels’
‘‘strength” of the inference. The relation defines the kind
detail the architectural approach taken to address these
of the link and conveys the semantics. Eq. (5) shows a for-
objectives.
mal definition of the m function.
Building the narrative network between Kernel and mðevent; kernelÞ ¼ ðPlink ; Rðevent; kernelÞÞ
ð5Þ
Satellites Plink / strength of relation
The specific definitions of the l and m are left to the
One of the relative contributions of the proposed approach
implementation of the architecture. The feasibility of the
is the inclusion of narrative knowledge into the architecture
approach is evidenced by the existence of computational
as a cognitive process. Narratology, especially Cognitive
approaches implementing salience and inference (Niehaus
Narratology, can offer useful insight on how narrative phe-
& Young, 2014).
nomena is managed by human cognition. Given that this
work follows the hypothesis that there exist underlying nar-
rative properties in several cognitive processes, a model of Creating relations between kernels
identification of relevant information based on narrative
properties is proposed. The relations potentially present in narrative objects, those
Two cognitive processes of narrative are addressed: nar- linking satellites and kernels, must capture properties glu-
rative focus and narrative inference. Narrative focus is the ing the constituents in such a way that overall narrative
salience of some element or elements of the narrative in properties are explicitly stored and used by the architec-
some entity’s mind. Studies show that the salience of ele- ture. These relations are complex, and there is a high
ments impacts interpretation in such a way that salient ele- degree of coupling between them. The architecture defines
ments are more easily comprehended and understood by the the relations at a certain granularity, specific enough as to
human receiving the narrative. capture common narrative semantics and general enough
Narrative inference is a process by which humans tend to as to leave room for particular implementation particulari-
create both knowledge and relations between facts in order ties. The level of granularity has been chosen with com-
to give coherence to some discourse, which is usually con- putability in mind. That is, the proposed relation try to
veyed in an incomplete form (for instance, omitting serve the purpose of describing narrative structures that
details). For instance, there is evidence suggesting that can be processed by computational means.
readers perform narrative inference while they read. What The final implementation of the functions used to
is really interesting is that this process is carried out almost compute the relations and create the actual network of
no conscious effort by the reader (Graesser, Singer, & relations is not covered in this approach. Nonetheless, there
An architecture of narrative memory 23
exist computational approaches proposing partial solutions Causality is a very complex concept and it usually encap-
for every proposed relation. Restricting the architecture sulates many specific aspects of preconditions and effects,
to a fixed set of functions would narrow the general as exemplified in Figure 3. Most approaches, especially com-
approach of this architecture. References to existing putational ones, assume a straightforward definition basi-
approaches are, nevertheless, proposed on each case. cally accepting a broad sense of causal relations. These
are then aggregated into a single kind of causality link.
Time
According to the definition of narrative definition of narra- Location
tive used for this work (Section ‘Definition of narratives Locations are also fundamental in narratives because they
for the cognitive architecture’) narrative is a sequence represent the setting and context where related action hap-
linked by, among other things, time relations. It is hard to pens. Due to obvious physical aspects, events occurring in
think of a narrative not involving some sense of time. As the same place form the same scene. Entities performing
sequences of event usually happen over time, even the most together influence each other if they are in the same loca-
simple narrative requires time aware structures. For this, tion. Therefore, a full representation of narratives can
time relations are included in the architecture. never leave location out.
An in-depth study of temporal logics and formal time In the same way as with time, locations can be absolute
representation is outside scope, but many approaches exist or relative. Absolute locations refer to specific places (‘‘the
(Allen, 1991). It is however important to remark that time cinema”, ‘‘the table”), while relative ones actually link
can be absolute and relative. Absolute time relations hap- events together as satellites and kernels (see Figure 4).
pen when an exact time is given (i.e. ‘‘at 7 pm”) and rela- Most of the times, location relations are implicit in nar-
tive time occurs when events are related by its order (i.e. rative discourses. Scenes are described and actions happen-
‘‘later on”, ‘‘before”). ing in them are just assumed to be related by a relation
While relative time can be well represented with classi- expressing proximity. However, the underlying representa-
cal relations linking two events, absolute time is slightly dif- tion of a narrative (the fabula, as opposed to the sjuzhet)
ferent. An absolute time is a property of an event, and must include location information explicitly.
therefore it is modeled in the architecture as a special rela-
tion, as show in Figure 2. In this case, the absolute time Agency and objects
would be part of the event, and not a satellite. Graphically, Narratives are usually complex sequences of actions pro-
this is depicted by a dotted edge. While technically part of duced by agents and received by objects (objects can be
the event, it has been considered useful to include absolute agents too). Most stories are structured over a character
information in the graph. or a set of characters—many times humans, or entities play-
ing the role of humans. The events happening to or being
Causality produced by characters are the backbone of most
Causality is crucial in narrative. Causal relations are the narratives.
base of narrative understanding. Some authors claim that Objects also play a predominant role when structuring
narrative understanding happens, among other things, when narratives. A magical ring or the ball in a football game
a coherent causal network has been built (Trabasso & being told are examples of entities not able to produce
Sperry, 1985). Many computational approaches base their events but actually causing and receiving most actions in a
process on causality as the main narrative relation (Riedl, narrative.
2004). Agents and objects, then, are the constituents triggering
narrative relations. Again, they are usually properties of
events (as shown in Figure 5), and narrative relations as
Figure 2 Time relations can be absolute or relative. Relations Figure 4 Location relations in narratives. The kernel (bold)
between events and absolute time are represented with a has one satellite connected by a solid line. This satellite
dotted edge, while relations between satellites and kernels are happens at the same place. The dotted edge represents event
represented with a solid edge. information.
which gives useful insight on how what the underlying pro- composing the full narrative. A computational system
cesses are. This change of focus from the story itself to implementing the narrative architecture needs some source
the cognitive system suggests that both the creator and of knowledge to discover these relations. However, the
the audience of an episode play an active role (either con- probability of a link between two events seems to be
sciously or unconsciously) to build a representation of the directly proportional to the number of shared dimensions
narrative and to connect that narrative to other information of the two events (Zwaan et al., 1995).
in the cognitive system. An implementation of these ideas is proposed by Niehaus
There exist formal and computational models of narra- and Young (2014). In the implementation, a computational
tive understanding (Mueller, 2002). In general, most of them model of narrative discourse understanding was tested.
acknowledge the active role of the cognitive system in Results evidence the plausibility of the model.
understanding. Understanding is also generally assumed by
these models to consist on building a coherent network con-
necting the received information as a meaningful graph-like Producing episodic narratives
structure. This includes creating a structure linking all rele-
vant data of the perceived narrative and connecting parts of The task of narrative production has been addressed from
this new narrative with previous knowledge. many perspectives (Bringsjord & Ferrucci, 1999; Pérez y
Referring back to the descriptions of narrative focus and Pérez, 1999; Riedl & Young, 2010; León & Gervás, 2014).
narrative inference (Section ‘Building the narrative network Since the 1970s, computational story generation systems
between Kernel and Satellites’) it can be seen that the cog- have been created, all of them addressing different features
nitive narrative model left the final definition of l and m from a more or less cognitive perspective. Many times,
open (Eqs. (4) and (5) respectively). these systems have been created with an engineering objec-
Narrative salience is proportional to recency and related- tive in mind and they have not addressed general aspects of
ness (Zwaan, Langston, & Graesser, 1995; Kintsch, 1991; narratives. While from the point of view of Artificial Intelli-
Langston, Trabasso, & Magliano, 1999, cha). Recency refers gence story generation systems represent a relevant
to the evidence that an event that has been recently men- advance, they shed little light on the more general problem
tioned has a higher salience than other that was mentioned of formalizing narrative cognition.
in the past. Additionally, events semantically related to the Gervás et al. propose a model of the story generation
current event also have stronger salience. This information task, ICTIVS (Gervás & León, 2014; Gervás & León, 2015).
can be used to implement the function for identifying the In this model, the entity producing the story models the
kernel, l. audience and iteratively cycles through a set of steps, refin-
For the satellites, the implementation has to define the m ing the draft and updating the internal constraints. These
function. This must create semantic relations between the steps involve invention, composition, transmission, inter-
kernel and the potential satellites for each narrative object pretation and validation of stories, putting strong focus
26 C. León
but salience and inference are influenced by different happen for A to be possible. The probability that the m func-
aspects. tion finds a precondition between events e1 and e2 is com-
From here on we will borrow the term script from the lit- puted by the distance between those two events. It is
erature (Schank & Abelson, 1977; Bower, Black, & Turner, formally described in Eq. (7). The definition of the distance
1979). In short, a script is a specific narrative describing a is the same as the one used for Eq. (6). In this case, comput-
certain process. In the proposed architecture, a script is a ing the strength of the precondition takes into account the
narrative in the procedural–semantic memory and, in order average of the distances between e1 and e2 for all previously
to learn a script, a cognitive system must experience it. The learnt scripts. This tries to capture the fact that causality is
fact that a system receiving the script through language not script-specific, but the result of many instances of the
instead of through experience will be omitted, since that pair ðe1 ; e2 Þ. Therefore, previous experiences affect the
is considered a special case involving language, not in the way in which cognitive systems learn causal rules.
scope of the current study. For procedural–semantic mem-
mðe1 ; e2 Þ ¼ ðdistanceðe1 ! e2 Þ; preconditionðe1 ; e2 ÞÞ ð7Þ
ory, then, the script becomes the narrative, a set of narra-
tive objects. Other relations computed by m would be created in the
Analogously to what was described for the narrative same way as shown for the episodic memory. This would
properties of the episodic memory in the presented archi- include time relations, location, agency and so on.
tecture, we define the function l and m for the procedu- Analogously to the proposed model for generation in the
ral–semantic memory. The m function captures the episodic memory, the process for performing an action
salience of a certain event, thus making it more likely to using the procedural–semantic memory is based on a com-
form a kernel. In the case of the procedural–semantic putational approach, focusing on the functional aspects
memory, an objective is needed. When carrying out a num- rather than the structural ones.
ber of actions, there is an object to be fulfilled by the cog- The architecture models the execution of a script as a
nitive entity. The objective is assumed to be computed in an planning process. Instead of using a reader model, as the
external module, not necessarily part of the narrative mem- architecture does for episodic memory, the model relies
ory. The architecture, then, considers that there is an on a planner. Given that performing an action has an objec-
external module in the system computing a specific objec- tive and narrative objects can be used as operators, the
tive, and this objective is borrowed by the narrative mem- architecture can apply a more or less complex planning
ory, in this case by the procedural–semantic submodule. algorithm. This algorithm can be driven not only by the
Eq. (6) formalizes l. scripts but also by external information like the current
1 state of the cognitive entity.
lðeÞ ¼ ð6Þ
distanceðe ! objectiveÞ
where e is any element coming from the conceptualization
Overall architecture of narrative memory
module, the objective is the objective given by the external
module that computes the objective, and distanceðÞ is an Figure 10 shows a diagram of the overall narrative architec-
implementation-dependent function that computes the dis- ture. As it can be seen, it is strongly simplified with respect
tance between the objective and the current event. The to other more general cognitive architectures, due to the
distance can be semantic or structural. A semantic distance fact that this architecture for narrative memory only
could be the difference between buying and renting a addresses certain subsystems and procedures (Anderson
house. Both objectives have many aspects in common. et al., 2004; Cai, Goertzel, & Geisweiller, 2011; Snaider,
Structural distance would represent the relation between McCall, & Franklin, 2011). As such, only the relevant aspects
buying a house and applying for a mortgage loan. These for the narrative memory are covered in this architecture.
two things do not have much in common, but they are struc- The Input and Output nodes represent narrative content.
turally related since there is a preconditional link between They are assumed to be physical, raw content. For this con-
them and usually appear in the same narrative. tent to be usable in the cognitive architecture, it must be
The m function computes the strength and the category of promoted to symbolic information. This task is carried out
the relation between a satellite and a kernel. For procedu- by the Preprocessing module. Once the narrative informa-
ral–semantic memory, the most important family are the tion has been produced in the narrative memory of the cog-
causal relations. It is fundamental to note that, as we com- nitive system, it is taken out as raw output through the
monly use it, causality has several levels of meanings and Realization module.
strong implications in coherence and understanding. In nar- The Preprocessing module feeds the episodic memory
rative, this has been deeply studied (Goldman, Graesser, & and the procedural–semantic memory. These are the data
van den Broek, 1999). For example, if a glass falls and that will become the constituents of events, and therefore
breaks, the cause could be the existence of gravity, the rea- kernels and satellites as well.
son why the glass felt, or both. Analyzing the many semantic The intentions of the cognitive entity are summarized in
options is not the target of this research, it is just assumed the Objectives module. This module controls narrative pro-
that there is a global, general notion of causality and that duction and understanding by taking into account the
human cognition has the functional ability of learning and entity’s specific objectives for each process (generation of
inferring it. a narrative, understanding of a procedural narrative, and
In order to model this high level sense of causality, we so on).
define a particular narrative relation, the precondition. A The Realization module captures narrative objectives
precondition of an event A is a narrative event B that must and the specific output (tasks to be performed, stories to
28 C. León
Figure 10 Overall architecture of the narrative memory. PS stands for procedural–semantic memory and NG stands for narrative
generation.
be told) and realizes the narrations in several ways. This material available. Otherwise, the process would be analo-
realization is performed as the output of the system. gous, but several iterations would have to take place in a
The narrative memory is embedded in the general cogni- specific implementation.
tive system and connected with the preprocessed input and Before telling the story, A must have created a narrative
the objectives. The modules for the episodic memory and structure (connected kernels and satellites) storing the
the procedural–semantic memory (whose functionality has sequence of events in the episodic memory. We can assume
been described in Sections ‘Episodic memory as a narrative that the implementation would find the main kernel of the
memory’ and ‘Procedural–semantic memory and narrative’ events, probably corresponding to point 6: A hits a traffic
respectively) are its main constituents. A reader model is sign. Choosing point 6 as a kernel would be the result of
also included and used by the episodic memory. The output computing the value of the l function (Section ‘Building
of these two modules is connected to the Realization mod- the narrative network between Kernel and Satellites’).
ule, producing the actual output. The particular details of the implementation are left to a
valid implementation. The relevant part is identifying the
cognitive salience of each one of the events and the pick
Examples the most salient one. A function that computes salience
and takes unexpectedness into account would most likely
This section describes example interactions involving the choose point 5 as the most salient element. Then,
narrative memory and the narrative processing module. lð5Þ ¼ maximumðlð1Þ; lð2Þ; . . . ; lð8ÞÞ.
Although theoretical, it is meant to highlight both the role This would be the base for creating the network of satel-
and the need of each part of the architecture. lites. An abstraction relation in which the kernel would be
point 5, and the full story would be story in detail, as a ker-
nel, as show in Figure 11. The m function in this case would
Episodic memory: telling and understanding events return one single satellite (the whole narrative) and the
as a story relation abstraction. The implementation should assign it
a high probability.
Let us assume that a cognitive entity A (a person) tells a Temporal order could then be constructed. Since points
story about her morning to another entity B. For this exam- 1–8 happened in order, a simple next relation could be
ple, let us just assume that the sequence of events is the added. However, narrative usually convey broader informa-
next one: tion. First, all temporal information must be constructed
around the kernel. Events 1–5 happen before the kernel
1. A wakes up. and events 7 and 8 happen later on. Additionally, is usually
2. A has breakfast. relevant enough as to make it necessary to remember the
3. A gets out. exact time. Assuming that A would not be looking at the
4. A drives her car to the office. clock at the time of the accident, probably the only abso-
5. A looks at a bird. lute time available would be time at which the police arrive.
6. A hits a traffic sign. Figure 12 depicts the time relations graphically.
7. A calls the police. Causality relations would also be produced by the m func-
8. The police arrives. tion. This time, not all events take place in the creation of
the relations. Relevant satellites are looking at a bird, driv-
Events ½1; 8 are not fed into the system as proper narra- ing, calling the police and the arrival of the police. The fact
tive events, but as a low level physical stimuli. These phys- that A woke up does not really produce a car accident.
ical stimuli are received, discretized and made symbolic by Although it is possible to find a causal link (if A was asleep,
the Preprocessing module. The symbolic representation is no car accident would have happened), we can assume an
then passed to the episodic memory. Leaving sequentiality implementation of the m function omitting this aspect. Of
details out to make the explanation simpler, let us assume course, other implementation could have included it in
that the episodic memory has already received all the infor- the set of causal relations. Figure 13 shows the graphical
mation and the narrative can be constructed with the whole representation of the causality network.
An architecture of narrative memory 29
Procedural–semantic memory: telling and entity has successfully repeated the process many times
understanding a process (i.e. it has drank the coffee, which was the original objec-
tive), it makes sense to assume that there would be a high
In order to illustrate the way in which material stored in the probability that the steps were caused in order.
so-called procedural–semantic memory, the next script will Moreover, the definition would not only assign precondi-
be and used: tional or causal relations between consecutive events, but
to close ones. This means that points 5 and 6 are strongly
1. take a cup from the cupboard, connected, but points 3 and 6 would be strongly connected
2. put coffee in the cup, as well. The satisfaction of the objective is also part of the
3. add milk, narrative object, because it is actually part of the story.
4. add sugar, The causal network could be represented as depicted in
5. stir, Figure 16.
6. drink coffee. After learning the script and having it stored in the pro-
cedural–semantic memory, the cognitive architecture can
It is the script for ‘‘drinking a coffee”. As explained in use the information to satisfy the objective. Since the
Section ‘Storing and carrying out processes as narrative objective is included in the narrative, retrieval is simple.
scripts’, the kernel of the narrative object is set by the main Let us assume a computational implementation of the nar-
objective. Obviously, the objective (coming from other rative architecture. If the entity has a desire, the system
module of the general cognitive architecture) is to drink a must only search for a narrative in which the desire has been
coffee. In order to compute the l function, we would have satisfied, which can be done very efficiently (effectively
to measure the relative distance of each one of the events mimicking biological behavior). Then, a simple backtracking
of the script to the main objective. As such, we can hypoth- process could give the cognitive system the set of events,
esize that any reasonable implementation would provide a l now considered as instructions, to perform in order to sat-
function satisfying Eq. (8): isfy the objective.
Figure 16 Example causal network for a script describing how to drink a coffee.
An architecture of narrative memory 31
There are a number of aspects that are worth some dis- At this point, formalizing the actual process by which this
cussion. The architectural nature of this proposal leaves is carried out by humans is too difficult. However, the def-
certain aspects open, and in order to draw general conclu- inition of the narrative architecture makes it possible to
sions, implementations must prove the validity of the pro- hypothesize that the process implies a narrative. It makes
posed changes and relative benefits. Next sections sense to think of a cognitive entity that mentally reproduces
elaborate several of these aspects in detail. the steps, as if it would imagine itself tying a shoe, and
stores the mental image in the episodic memory. The human
(assuming that only humans can perform narratively in the
Procedural–semantic memory as narrative proposed sense) would then retrieve the narration from
behavior the episodic memory, abstract it as proposed in the previous
sections, and store it again in the semantic memory as a
The working hypothesis leading this design states that narra- script.
tives are not only the product of a certain generation pro- In this way, the narrative architecture could provide a
cess, but are rooted in the basic structures of our scaffolding for representing how narratives are produce
cognition, specially in the way we store sequences of events from the implicit procedural memory to the explicit proce-
(hence this proposal links it to the episodic memory). dural–semantic memory. The narrative architecture can
According to this, the inner structure of procedural– provide this behavior if embedded in a general cognitive
semantic memory, the way in which humans story not only architecture capable of internally reproducing and episode.
episodic memory but procedural–semantic memory as well, If this mental episode is fed back into the procedural–se-
has these narrative properties. That is, producing narratives mantic memory, it is possible then to describe an explicit
is not only an ability that is more or less natural, but a speci- analysis of information stored in the procedural memory.
fic topology for storing information. If the hypothesis is
true, a cognitive model of the narrative memory in humans Multiple narrative objects for the same events
would also have to include the procedural–semantic
memory. An in-depth study of other memory structures in The architecture has not explicitly addressed the possibility
humans could also inform about which ones show narrative of constructing more than one narrative for a single event
characteristics. sequence. This, however, is quite plausible from a cognitive
General architectures not specifically addressing point of view. As previously pointed out, experiments on
production and understanding of narrative objects can still narrative processing suggest that humans produce explana-
represent narrative behavior, probably in the form of proce- tions for narratives when trying to understand them, which
dural–semantic knowledge in the cognitive entity. While in could be modeled as creating many plausible narrative
functional terms an implementation of that family of archi- objects for a concrete event list.
tectures could be fully equivalent, the presented architec- The current state of the architecture allows for an imple-
ture proposes an explicit module for narrative memory mentation producing more than one narrative per event
and processing. sequence. However, it is not clear what the best approach
The presented approach can potentially allow for a more for relating these narratives is. Connecting them together
natural representation of behavior in terms of narrative. with a special kind of relation or creating a meta structure
Intentions, desires, objectives, constraints and outcomes containing every plausible narrative object, but the relative
have been long studied in Narratology as specific units and merits of each approach must be examined in detail before
not only as more or less inter-dependent parts of human proposing an improvement for the narrative architecture.
behavior. Abstract representation of narrations made expli- Additionally, it makes sense to consider on-line schema
cit in architectures like the one presented in this work can creation based on an ‘‘canonical” narrative object. Maybe
make it easier to explain, for instance, the hypothesis that storing one single narrative object per event sequence could
humans do not perform searches for each step of a long be enough if the architecture was added a module adapting
activity. Instead, the process can be formalized as steps the canonical object for different purposes, modifying it
of a short script of narrative. accordingly. Future work will contemplate this.
Procedural–semantic memory has been defined in terms
of abstract narratives used to learn and perform. While
More narrative relations
information vignettes have been proposed from a long time,
the fact that these have narrative properties enriches the
procedures and informs the algorithms for identifying these While the overall approach for defining narrative objects is
scripts. considered to be general enough as to provide valid descrip-
tions for most narrative schemas, the proposed relations
linking satellites and kernels could be insufficient for
Procedural memory and procedural–semantic describing all narrative aspects commonly used. This could
memory seem like a limitation of the current state of the narrative
architecture, but the model leaves the set of relations open
Procedural memory is assumed to store implicit functions to further refinement and particular implementation.
like tying a shoe. This is certainly implicit, but it is possible An exhaustive analysis of all types of relations exceeds
for humans to analyze the process and create a narrative the scope of this paper. Additionally, there are many layers
over it. This narrative is what the proposed architecture sit- of narrative relations. Whereas at the presented level of
uates in the procedural–semantic memory. detail the focus has been put on the most basic knowledge
32 C. León
Langston, M. C., Trabasso, T., & Magliano, J. P. (1999). Under- Schank, R., & Abelson, R. (1977). Scripts, plans, goals and
standing language understanding (pp. 181–226). Cambridge, understanding: An inquiry into human knowledge structures.
MA, USA: MIT Press, <http://dl.acm.org/citation.cfm?id= Hillsdale, NJ: L. Erlbaum.
305135.305144>. Sharples, M. (1999). How we write. Routledge.
León, C., & Gervás, P. (2014). Creativity in story generation from Snaider, J., McCall, R., & Franklin, S. (2011). The LIDA framework
the ground up: Non-deterministic simulation driven by narrative. as a general tool for AGI. Artificial General Intelligence, Lecture
In 5th international conference on computational creativity, Notes in Computer Science, 6830, 133–142. http://dx.doi.org/
ICCC 2014, Ljubljana, Slovenia. . 10.1007/978-3-642-22887-2_14, <http://www.springerlink.
Mueller, E. T. (2002). Story understanding. Encyclopedia of Cogni- com/content/g4563306506388k1/>.
tive Science, 238–246. Squire, L. R. (2004). Memory systems of the brain: A brief history
Niehaus, J., & Young, R. M. (2014). Cognitive models of discourse and current perspective. Neurobiology of Learning and Memory,
comprehension for narrative generation. Literary and Linguistic 82(3), 171–177. http://dx.doi.org/10.1016/j.nlm.2004.06.005.
Computing, 29(4), 561–582. http://dx.doi.org/10.1093/llc/fqu056, Szilas, N. (2015). Towards narrative-based knowledge representa-
<http://llc.oxfordjournals.org/content/29/4/561.abstract>. tion in cognitive systems. In Proceedings of the 6th workshop on
Pérez y Pérez, R. (1999). MEXICA: A computer model of creativity computational models of narrative (CMN’15) (pp. 133–141),
in writing Ph.D. thesis. The University of Sussex. <http://narrative.csail.mit.edu/cmn15/schedule_cmn15.html>.
Propp, V. (1928). Morphology of the folk tale.Leningrad: Trabasso, T., & Sperry, L. (1985). Causal relatedness and impor-
Akademija. tance of story events. Journal of Memory and Language, 24,
Reiter, E., & Dale, R. (2000). Building natural language generation 595–611.
systems. Cambridge University Press. Tulving, E. (1972). Episodic and semantic memory. http://dx.doi.
Riedl, M. (2004). Narrative planning: Balancing plot and character org/10.1017/S0140525X00047257, <~jorkin/generals/papers/
Ph.D. thesis. Department of Computer Science, North Carolina Tulving_memory.pdf" xlink:type="simple">http://web.media.
State University. mit.edu/jorkin/generals/papers/Tulving_memory.pdf>.
Riedl, M., & Young, M. (2010). Narrative planning: Balancing plot Tulving, E. (1983). Elements of episodic memory.Oxford, UK; New
and character. Journal of Artificial Intelligence Research, 39, York: Oxford University Press.
217–268. Zwaan, R. a., Langston, M. C., & Graesser, a. C. (1995). The
Ryan, M.-l., & Ryan, M.-l. (2010). Narratology and cognitive construction of situation models in narrative comprehension: An
science: A problematic relation. Style, 44(4), 469–496. event-indexing model. Psychological Science, 6(5), 292–297.
http://dx.doi.org/10.1007/s13398-014-0173-7.2. http://dx.doi.org/10.1111/j.1467-9280.1995.tb00513.x.