Professional Documents
Culture Documents
Current Topics in Cognitive Psychology, 2010
Current Topics in Cognitive Psychology, 2010
“Oh and BTW, the mind is a maximum hypersurface and thought a trajectory on it and the
amygdala and hippocampus are Hopf maps of it. No one knew this before me, and it seems no
one cares. So be it. My time will come in a hundred or a thousand years when the idea again
returns.”
- Chris McKinstry, AI researcher
(in his electronic suicide note, 20th of January 2006)
The facts I know about Chris McKinstry can be counted on one hand. What I know
mistaken in supposing that it would take “a hundred or a thousand years” for his idea
from 2007 the very same idea is presented – albeit in a more tentative and persuasive,
less grandiose, way. In this essay I will briefly summarize Spivey’s general perception
of the mind/brain, before applying this idea to a specific area of research: visual word
recognition in the lexical decision task. In order to do this I first outline the central
current findings in this area using traditional reaction time measures, before
suggesting how a novel approach based on more dynamic outcome measures (for
word recognition.
Continuity of Mind
Start by picturing, individually, every single neuron in your brain. If, like Spivey
dimension in a multidimensional state space, then the total brain activity at any single
point in time will corresponds to a single point in this immense space. Likewise,
changes in brain activity over time correspond to trajectories through the space. As far
as abstraction goes, the multidimensional state space is not far removed from the
266689, 1
Current Topics in Cognitive Psychology, 2010
reality it illustrates – in a sense, the neurons have simply been rearranged to cover a
huge, unwieldy and unimaginable space, rather than being compressed into your skull.
To help conceptualise the problem in a more practical way, Spivey (2007) suggests
another level of abstraction: attractor basins. Attractor basins are regions of the
multidimensional state space towards which many trajectories tend to gravitate. In the
percepts – for example a vase or two faces; a red object among many yellow ones; or,
Comparisons to Computers
systems provide the concept of choice for modelling its functioning. The use of
dynamic systems for this purpose can be contrasted to what is sometimes referred to
as ‘the computer metaphor’ of cognition. As Dale, Kehoe & Spivey (2007) point out,
While this dichotomy (like most dichotomies) overlooks the nuances of opinion that
lie between the two extremities, highlighting the areas where the dynamic systems
metaphor differs most from the computer metaphor can still be useful. In the
events, decision, etc) are seen as discrete, sharply bounded and symbolic. When
turn produce discrete outputs – a button press, a step left or right, a spoken word.
While the computational perspective has long served cognition research well, it has
not been without its critics. A full elucidation of these criticisms is beyond the scope
of this essay (but see Spivey, 2007); suffice to say the dynamic systems approach
266689, 2
Current Topics in Cognitive Psychology, 2010
discrete and symbolic, but rather as probabilistic and with fuzzy boundaries. Nor are
representations and their manipulations are “part and parcel of a system that blends
content and process in its operation” (Dale, Kehoe & Spivey, 2007). Furthermore, the
taken as support for the discreteness of processes leading up to that output (Spivey,
2007).
cognition. Not only ‘low level’ perceptions can be conceived of as attractor basins
space, but also higher level cognitions such as evaluation of the truth of statements
(McKinstry, Dale & Spivey, 2008). However, in order to move this essay on from a
focus on the tools used in cognitive research. With new metaphors come new
research. I return to the dynamic systems metaphor when discussing how visual word
266689, 3
Current Topics in Cognitive Psychology, 2010
Continuity of Measurement
some kind of stimuli, and then recording some kind of response. In the context of
word recognition, as in many other areas of research, reaction time (RT) is commonly
lexical access – is usually the focus of research, and the speed of response is thus a
The mental lexicon can be accessed through many modalities – for example through
sound, vision, or pictures – and responses that tap various aspects of the word’s entry
in the lexicon can be elicited through many different tasks – for example semantic
Balota et al. (2004) however, it is the lexical decision task which has become “the
gold standard” in research on lexical access, and it will thus receive the most attention
in this essay. The lexical decision task (LDT) involves participants being presented
with a string of letters (either a word or a non-word, for example glump), and making
quickly and accurately as possible. Mean response times are then compared for words
1
I could here draw parallels to the computer metaphor… but don’t want to set up more of a straw man
than necessary.
266689, 4
Current Topics in Cognitive Psychology, 2010
Spivey (2007) arranges outcome measurements and the techniques used for their
percentage correct) fall at the most discrete end. Mean reaction times compared across
different experimental conditions using t-tests or analysis of variance (as in the LDT)
is one step towards the more continuous end of the scale. Taking another step in this
direction, researchers can analyse the overall distribution of reaction times, a type of
processes underlying response times in specific tasks (Spivey 2007). Common to all
the measurements mentioned above is a reliance on the final output; for example the
button press in the LDT. Researchers use the time taken to reach this end point to
make inferences about the process leading up to it – an approach which is only fruitful
continuous measures (Spivey, 2007) – address this issue and attempt to go beyond this
more direct, online, look at the process of responding in an experimental task: eye-
tracking and (computer) mouse-tracking. I focus here on their use in two illustrative
266689, 5
Current Topics in Cognitive Psychology, 2010
Two Studies
Eye-tracking
Tanenhaus (1994), eye tracking was used to demonstrate that there is “continuous
2007). Participants were asked to pick up one object (for example a candle) among
many on a table, and one of the other objects has a similar onset to the target object
(for example candy). In this situation, eye-movement analysis reveals that participants
fixate a substantial amount on the ‘competing’ object during the time in which the
instructions are being read to them. This fixation occurs despite participants rarely
making a mistake by actually picking up the competing object, and also despite
participants denying that they ‘looked at’ the candy (Spivey, 2007). Spivey (2007)
takes this study (among others) as support for a view that words and sentences
continuously influence the smooth transition from input (participants being instructed
to “pick up the candle”) to the correct motor output, an influence reflected in the eye-
movement data.
Mouse-tracking
One drawback of eye-tracking is the ballistic nature of saccades (Dale, Kehoe &
Spivey, 2007); an issue which is overcome by mouse-tracking. Dale, Kehoe & Spivey
266689, 6
Current Topics in Cognitive Psychology, 2010
(presented as either a word or a picture) appeared at the lower edge of the screen, and
the correct higher order category (e.g. mammal for target whale) and a distractor
category (e.g. fish) appeared in the upper left and right of the screen. Participants were
instructed to click on the correct category, and the trajectory of the response was then
analysed to reveal the relative ‘pull’ of each of the categories. The crucial independent
variable was the typicality of the animal to be categorized: some were considered to
hypothesised, the trajectories in the atypical trials showed more attraction towards the
distractor categories, indicating that “multiple categories (attractor basins) are nearly
visited as the system eventually settles into a unique outcome-based response, much
debate surrounding this assumption is too comprehensive to delve into here, so I must
simply ask the reader to accept the argument that “continuous motor dynamics reveal
continuous mental dynamics” (Spivey, 2007), and leave this caveat for another essay.
So far this essay has outlined the idea of dynamic cognition, discussed outcome
measurements in cognitive research, and shown how a dynamic approach can involve
continuous measurements, through which new insights can be attained in areas such
266689, 7
Current Topics in Cognitive Psychology, 2010
recognition.
(Elman, 2004). Perfetti and Hart (2002) further break down each ‘dictionary entry’, or
lexical representation, into three separate components – one component for semantics
(the meaning of the word), one for phonetics (sound), and a third for orthography
(spelling). During reading (i.e. visual word recognition), it is the orthographic part of
the representation which first becomes activated. Perfetti and Hart (2002) propose that
the “quality” of this orthographic representation will influence the speed of lexical
access, and can thus be indexed by tasks like the lexical decision task (LDT) where
response times (RTs) are recorded. In turn, the quality of orthographic representations
the word, the age when the word was first learnt, the spelling-to-sound consistency, to
name a few. Two variables that have been subject to much study are frequency and
neighbourhood density (N) of a word is defined as the number of words which can be
created by changing just one letter in the original word (Coltheart et al., 1977). Thus,
sharp, share, shank, shack, spark and stark are all neighbours of shark, which has an
N of six. Frequency, on the other hand, is simply how often the word is encountered
in text. One common and uncontroversial finding is that high frequency words are
responded to faster in the LDT than are low frequency words (Andrews, 1997).
According to Perfetti and Hart (2002), this increased speed reflects the higher quality
266689, 8
Current Topics in Cognitive Psychology, 2010
representations) are slightly more complex, and I will now turn to a particular study
Forster, Davis, Schoknecht & Carter (1987) used a masked priming paradigm with the
LDT to investigate how the neighbourhood of a word might influence the quality of
presenting participants with two words – the first word is the prime, the second word
is the target to which participants respond (by pressing ‘word’ or ‘non-word’ buttons,
in the LDT). The prime is forward masked by a pattern (#####) and presented for a
very short amount of time (usually around 50ms), which means most participants are
Time
######
prime
TARGET
Figure 1: A schematic representation of the masked priming paradigm. To avoid perceptual continuity
in the identity priming condition, prime words are presented in lower case and target words in
uppercase.
When a target word is preceded by an ‘identity prime’ – i.e. the same word functions
as both prime and target – responses to the target word are facilitated. This is
presumably because the orthographic representation for the target word is ‘pre-
activated’ or ‘pre-opened’ by the prime. The primes used by Forster et al. (1987)
266689, 9
Current Topics in Cognitive Psychology, 2010
however, were not identical to the target: instead they were form primes. Form primes
are visually similar to the target word; they are typically created by changing one
letter in the target word. For example, tafget is a form prime of target. Because of
their similarity to the target word, Forster et al. (1987) hypothesised that form primes
and thus lead to faster RTs. Their findings, however, suggest that the benefit derived
from form priming is contingent on the neighbourhood of the target word. That is, in
the Forster et al. (1987) study, words that were visually similar to many other words
(i.e. high N words) benefited less from priming by a one-letter-different form prime
than did words which were visually unique. This effect is referred to as the
neighbourhood density constraint, and was proposed by Forster and Taft (1994) to be
due to what they called the lexical tuning hypothesis. Words from a high density
require their orthographic representations to be ‘finely tuned’ – which means they are
less flexible, and less able to be activated by non-identical stimulus. This is adaptive
experimental context, however, a form prime will be less able to pre-activate a finely
tuned representation.
The study by Forster et al. (1987) has been the inspiration for many subsequent
studies, however the researchers failed to control for a possible confound: the
presence of a higher frequency neighbour. If a word has many neighbours then it is also
2
“Competition” between words in the lexicon is important in many models of skilled reading,
particularly models based on the interactive activation model (McClelland & Rumelhart, 1981).
Briefly, it means that when the system is presented with any string of letters, many words in the lexicon
are going to be activated – not only the word that is a ‘perfect match’ to the input. These words have
inhibitory connections between them, and this competition is part of the process which eventually leads
to only one word being ‘uniquely identified’.
266689, 10
Current Topics in Cognitive Psychology, 2010
more likely to have many higher frequency neighbours, compared to a word with few
competition with its neighbours – response times to a target word primed by a higher
frequency word neighbour (sleet primed by sweet, for example) will be inhibited
relative to a non-word neighbour prime (Davis & Lupker, 2006). Thus, in the
experiment of Forster et al. (1987) the non-word form prime could have activated this
higher frequency neighbour (HFN) as well as the target. The competition from the
HFN would then have inhibited responses to the target, which would lead to slower
response times for the high N words. This explanation offers an alternative
Investigating this possibility, in a recent study by McKague, Teo and Watkins (2010)
all the target words had a HFN, and the primes were constructed to be as specific to
variable. When this was done, no neighbourhood density constraint was found.
Instead, frequency was found to interact with form priming – low frequency words
were primed more than high frequency words. This finding is inconsistent with the
lexical tuning hypothesis as presented by Forster and Taft (1994), which states that it
account of form priming driven by frequency is also hard to incorporate into current
3
Though competition and inhibition from primes and neighbours in the lexicon is also of interest, the
main aim of this study was to separate out the facilitatory influence of primes on words.
266689, 11
Current Topics in Cognitive Psychology, 2010
rapid (and restricted) excursion through the world of masked priming in visual word
recognition research, we now arrive at the intersection of these two areas. I believe
that some of the current contradictions in the field of visual word recognition, as
outlined above, may benefit from a dynamic systems approach. This benefit may
consist of two interrelated forms: one is conceptual, the other more pragmatic.
First, cast your mind back to the multidimensional state space, with attractor basins –
A portion of this space may be devoted to the mental lexicon, and words, then, are
but one that is supported by the spoken word recognition studies by Spivey-Knowlton
2007).
categorization study, one might propose a LDT using mouse-tracking, with ‘word’
and ‘non-word’ being options at the top of the screen. However, in the categorization
of atypical animals there would seem to be a ‘natural’ pull from the opposing category
266689, 12
Current Topics in Cognitive Psychology, 2010
– a whale really is like a fish, in many ways. Categorizing a string of letters as either a
‘word’ or a ‘non-word’ is by contrast a much more contrived task, and thus it is hard
response4.
Another option is to combine elements from both the Dale et al. (2007) mouse-
tracking study and the Spivey-Knowlton et al. (2007) eye-tracking study, and thus
arrive at an improved test of the lexical tuning hypothesis. For example, one might
flip the above ‘continuous LDT’ on its head, and instruct participants to ‘click on the
word’ in every trial. The word target would appear at either the top left or right corner
of a screen, along with a distractor non-word, and participants would move the mouse
cursor from its starting position at the centre of the lower edge of the screen.
neighbour of the target (e.g. clarp for clasp) or an unrelated non-word control. While
the frequency of the target word may still be an important factor in how quickly the
word is clicked (resembling the traditional lexical decision task), the ‘pull’ of the
might provide clues to the effects of neighbourhood density. If, for example, a high N
word experiences less ‘pull’ from its non-word distractor neighbour than a low N
word does, this might be taken as evidence of some sort of ‘tuning’5. On the other
hand, the path towards a high N word might diverge more, or be more irregular than
the path towards a low N word. Such a finding could be taken as support for a mental
4
However, a non-word response for a non-word with many neighbours might diverge more from the
ideal trajectory, if these word neighbours (attractor basins) exert an influence on the non-word
response. This seems to me more likely than the other way around, since encountering a non-word in
real-life reading usually means you have encountered a misspelling, and the system would presumably
still be able to settle on the ‘intended’ word in such a case.
5
…though this begs the question of why this ‘tuning’ would not show up in traditional lexical decision
tasks with masked priming (as measured by McKague, Teo & Watkins, 2010).
266689, 13
Current Topics in Cognitive Psychology, 2010
lexicon landscape where a neighbourhood is an area dense with attractor basins, thus
Until more research is done, the above three paragraphs remain mere conjecture.
Given the amount of time and effort invested in investigating the mental lexicon and
not be advised to simply abandon all the fruits of this labour. However, as I have
shown above, a move towards a dynamic systems metaphor might not be as drastic as
it first appears, and the benefits gained from more continuous measurements may
metaphor of mind/brain as space and the use of dynamic systems to model this space
have the potential to provide new insights in research areas throughout cognitive
visual word recognition would in particular benefit from adopting the tools relevant to
such an approach.
266689, 14
Current Topics in Cognitive Psychology, 2010
References
Balota, D.A., Cortese, M. J., Sergent-Marshall, S.D., Spieler, D.H., Yap, M. J. (2004). Visual Word
Recognition of Single-Syllable Words. Journal of Experimental Psychology: General, 113(2) 283-
316.
Coltheart, M., Davelaar, E., Jonasson, J. T., & Besner, D. (1977). Access to the internal lexicon. In S.
Dornic (Ed.), Attention and performance VI (pp. 535-555) New York: Academic Press.
Dale, R. Kehoe, C. & Spivey, M. J. (2007). Graded motor responses in the time course of categorizing
atypical exemplars. Memory & Cognition, 35, 15‐28.
Davis, C. J. & Lupker, S. J. (2006). Masked inhibitory priming in English: Evidence for lexical
inhibition. Journal of Experimental Psychology: Human Perception & Performance, 32, 668-687.
Elman, J. L. (2004). An alternative view of the mental lexicon. Trends inCognitive Sciences, 8, 302‐
306.
Forster, K. I., Davis, C., Schoknecht, C. & Carter, R. (1987). Masked priming with graphemically
related forms: repetition or partial activation? Quarterly Journal of Experimental Psychology, 39A,
211-251.
Forster, K. I., & Taft, M. (1994). Bodies, antibodies, and neighbourhood density effects in masked
form-priming. Journal of Experimental Psychology: Learning, Memory, & Cognition, 20, 844-
863.
McClelland, J. L., & Rumelhart, D. E. (1981). An interactive activation model of context effects in
letter perception: Pt. 1. An account of basic findings. Psychological Review, 88, 375-405.
McKague, M., Teo, N., Watkins, H.M. (2010). The role of word frequency and visual similarity in
determining the quality of lexical memory representations for reading. Unpublished Honours
thesis, The University of Melbourne, Australia.
McKinstry, C., Dale, R. & Spivey, M. J. (2008). Action dynamics reveal parallel competition in
decision making. Psychological Science, 19, 22‐24.
Norris, D. (2006). The Bayesian reader: Explaining word recognition as an optimal Bayesian decision
process. Psychological Review, 113(2), 327-357.
Perfetti, C. A. & Hart, L. (2001). The Lexical Basis of Comprehension Skill. In D.S. Gorfien (Ed.), On
the consequences of meaning selection: Perspectives on resolving lexical ambiguity (pp. 67-87).
Washington, DC: American Psychological Assoctiation.
Spivey-Knowlton, M., Sedivy, J., Eberhard, K., & Tanenhaus, M. (1994). Psycholinguistic study of the
interaction between language and vision. Proceedings of the 12th National Conference on Artificial
Intelligence: Workshop on the Integraton of Natural Language and Vision Processing (pp.189-
192). Menlo Park, Calif.: AAAI Press. (In Spivey, M. 2007).
Van Orden, G. C. (2002). Nonlinear Dynamics and Psycholinguistics. Ecological Psychology, 14(1–2),
1–4.
266689, 15