Professional Documents
Culture Documents
Rules of Indexing. A Critique of Mentalism in Information Retrieval Theory
Rules of Indexing. A Critique of Mentalism in Information Retrieval Theory
Journal of Documentation
VOLUME 46 NUMBER 2 JUNE 1990
RULES OF INDEXING:
A CRITIQUE OF MENTALISM IN INFORMATION
RETRIEVAL THEORY
BERND FROHMANN
81
JOURNAL OF DOCUMENTATION Vol. 46, no. 2
the more we study the two cognitive ends of the picture, that is, the
cognitive processes which produce information, and the cognitive
processes which occur on the receipt of information, the more we may be
able to improve and control the processes of information storage and
retrieval to attain desired results . . . in a large part of its scope
information science is a cognitive science, that is, it deals with thought
processes[6, p.75].
82
June 1990 INDEXING
[6, p. 76], but these rules are not to be confused with grammar and syntax, or
the rules of language, which vary from one language to another. Some of these
rules govern the translation of thought into language[6, p. 76]. But because we
are not aware of our mental processes, we 'do not yet, however, know all the
rules by which thought operates'[6, p. 76]. Their discovery and explicit
expression is the aim of theoretical inquiry. Farradane's relational indexing is
a case in point. It is 'an attempt to simulate the structure of thought'[6, p. 76],
bypassing language, he claims, as far as possible. Although it 'does not
eliminate language altogether'[6, p. 76], 'the structure of language is not being
directly used, and in fact is not useful'[6, p. 76]. Indeed, the study of language
in any theoretical investigation of mental processes is necessary only because
'human beings cannot communicate by direct thought transference, and need
some intermediate medium'[6, p. 76].
The mentalist picture brought into such clear focus by Farradane guides
theory in disciplines other than information science. Theory in linguistics, for
example, is thought to consist in making explicit the rules 'internalised',
'realised', or 'cognised' in the mind (or brain) of a competent speaker.
Although the rules are not consciously known, theorists take them to
represent the speaker's unconscious knowledge of language. They are invoked
to explain the speaker's ability to learn a language and to recognise hitherto
unencountered syntactically well-formed sentences.
In a very useful paper[8], Clare Beghtol applies mentalism in linguistics to
bibliographic classification theory. Van Dijk's discourse analysis offers a
theoretical description of the cognitive operation of understanding the subject
of a text in terms of unconsciously followed mental rules. The derivation from
a text of representations of its subject or 'aboutness', is of course highly
relevant to the first, most fundamental step of the indexing process. The
mentalist picture of the process is sketched in the following remarks:
83
JOURNAL OF DOCUMENTATION Vol. 46, no. 2
84
June 1990 INDEXING
The mentalist picture driving research proposals is taken for granted even
when not as explicitly foregrounded as by Farradane, Beghtol or Anderson.
Bivins, for example, asserts that 'What is needed is the simulation of the
cognitive processes of the user in the information system'[10, p. 36] and 'it is
probably necessary to start from this approach in order to improve retrieval
systems'[10, p. 37]. The point seems equally obvious to Artandi[11], who asks
'Do we know enough about how we behave when we say we are thinking that
we can describe the process explicitly for the machine?' The behaviour in
question is clearly not such characteristic 'thinking behaviours' as scratching
one's head, looking puzzled, saying something sotto voce, or the like. Her
recommendation for research and theory suggests that 'how we behave when
we say we are thinking' refers to activities carried out 'in the mind'. Progress in
theory for the young science of information is taken to lie in the direction of
'the understanding of the human processes that the machine is to duplicate'
[11, p. 235].
Other, less explicit formulations betray a more subtle influence of the
mentalist picture. How else to explain the egregious but common confusion of
85
JOURNAL OF DOCUMENTATION Vol. 46, no. 2
86
June 1990 INDEXING
such inquiries further; they are noted here merely to indicate the firm grip of
the mentalist picture.
2. WITTGENSTEIN'S CRITICISM
87
JOURNAL OF DOCUMENTATION Vol. 46, no. 2
Here I should like first of all to say: your idea was that that act of
meaning the order had in its own way already traversed all those steps:
that when you meant it your mind as it wereflewahead and took all the
steps before you physically arrived at this or that one.
Thus we were inclined to use such expressions as 'The steps are really
already taken, even before I take them in writing or orally or in thought.'
88
June 1990 INDEXING
We use expressions such as 'The right step is the one that accords with the
order - as it was meant', or 'But I already knew, at the time when I gave the
order, that he ought to write 1002 after 1000'. Wittgenstein writes in response:
Certainly: and you can also say you meant it then; only you should
not let yourself be misled by the grammar of the words 'know' and
'mean'. For you don't want to say that you thought of the step from 1000
to 1002 at that time - and even if you did think of this step, still you did
not think of other ones. When you said 'I already knew at the time . . . '
that meant something like: 'If I had been asked what number should be
written after 1000, I should have replied "1002"'. And that I don't
doubt[14, §187].
89
JOURNAL OF DOCUMENTATION Vol. 46, no. 2
What really comes before our mind when we understand a word?- Isn't it
something like a picture? Can't it be a picture?
Well, suppose that a picture does come before your mind when you
hear the word 'cube', say the drawing of a cube. In what sense can this
picture fit or fail to fit a use of the word 'cube'? - Perhaps you say: 'It's
quite simple; - if that picture occurs to me and I point to a triangular
prism for instance, and say it is a cube, then this use of the word doesn't
fit the picture'. - But doesn't it fit? I have purposely so chosen the
example that it is quite easy to imagine a method of projection according
to which the picture does fit after all.
The picture of the cube did indeed suggest a certain use to us, but it
was possible for me to use it differently[14, §139].
90
June 1990 INDEXING
like a picture, that forces a particular application on us; so that my mistake lay
in confusing one picture with another?'[14, §140]. Instead, the notion that
mental pictures, unlike ordinary pictures, are not subject to interpretation is a
superstition. Purported explanations of intellectual operations in terms of tacit
knowledge of internally represented rules are not false but incoherent. The
notion of mental rules generating their own applications is an illusion.
Wittgenstein writes that our grasp of rules does not consist in our associating
with them either pictures, mental representations, or interpretations, but is
exhibited in what we do: 'there is a way of grasping a rule which is not an
interpretation, but which is exhibited in what we call "obeying the rule" and
"going against it" in actual cases'[14, §210]. Our reaction to a rule, whether to
its written or spoken expression or to a picture, an interpretation, or any other
associated representation, is the criterion of our understanding it:
It is a fact about human beings that we act on orders and rules, in the
massive majority of cases without any deliberation or thought. Our actions are
as spontaneous as our reaction to a pointing gesture by looking in the
direction from shoulder to finger-tip rather than from finger-tip to shoulder.
Without these reactions, there would be no such thing as following a rule. But
following a rule would also be impossible if all our reactions were different.
Hence:
91
JOURNAL OF DOCUMENTATION Vol. 46, no. 2
One has a right to say they are playing a game only if the yells and stampings
play the same role in their life as making of moves ofgames play in our life. The
fact that their actions can be described by our chess notation, or translated
into rules of the game we call 'chess', does not constitute their playing a game
of chess. They do not follow the rules of chess unless the point, purposes,
intentions, aims, ends and strategies of their rules are the same or similar to
those of chess in our life. Wittgenstein makes the same point by another
example. If a tribe of people use what we call mathematical formulae to no
other purpose than decorating walls, they are not following rules of
mathematics, even if they derive results we call correct[26, pp. 39-40]. A final
example comes from a commentator on Wittgenstein[27]. If a tribe of people
move pieces like our chess pieces on a board like our chess-boards according to
rules we call the rules of chess, yet their activity is a solemn religious rite, they
do not play a game of chess. Their practice of the rules is embedded in their
social life in ways so different from ours that they do not even play a game.
Reminders such as these that following a rule is a practice, a custom, or a
technique, directs reflection to the social context of rules. The role in our form
of life of our practice of a rule is not accidental or contingent, but is instead one
of its necessary features. It settles the identity of the practice of the rule.
The upshot of Wittgenstein's remarks is that rule-following is a matter of
reacting to a sign, a symbol, a drawing, a gesture, a formula, or the countless
other members of the large family of rules, in ways we call, according to public
criteria, 'in accordance with the rule'. Among the various uses of rules is their
employment as standards of correctness; if someone goes on incorrectly (for
92
June 1990 INDEXING
example, by writing 1004 after 1000 given the ' + 2' rule), we cite the rule ('You
were meant to do this'). In addition, rules are used as justifications; asked for
the reasons for our actions, we cite our rule. Rules are also used as instruments
of training and instruction. Hence rules are properly seen as instruments of
practices. Since a rule is always a rule of a practice, to understand the rule
requires a description of the role of its practice in our form of life. Rules are
instruments used in public life, and not mysterious agents, operating silently,
invisibly and unbeknownst to us, far behind the fleeting scenes of our
consciousness.
3. RETRIEVAL PRACTICES
93
JOURNAL OF DOCUMENTATION Vol. 46, no. 2
94
June 1990 INDEXING
index The upshot is that there is as yet no consensus among experts
about the answers to even some of the most basic questions of what
indexers ought to be told to do or of how an indexer's performance
should be evaluated[32, p. 107].
95
JOURNAL OF DOCUMENTATION Vol. 46, no. 2
The uses of discourse analysis have been investigated by Dea and Belkin[41].
Research into these and other textual structures that might provide the basis
of rules for the derivations of indexing phrases need no defence against
mentalism's requirement that the structures investigated be capable of
description as our internal representations of the text.
Fourth, mentalism conceals relations between texts. Like intratextual
criteria of significance, intertextual criteria may also reveal structures
providing a basis for indexing rules. Intertextual criteria define significant
elements of one text in relation to some other text. Significant textual elements
defined by each set of criteria may not be identical. For example, a part of a
text may be neither a logically central theorem nor a hypothesis, neither the
premise nor the conclusion of a central argument, and it may not state
arguments or conclusions that are even peripherally related to the logically
central theme. Yet it may state a proposition that is significant in the context of
the author's other works or in the context of scholarship on the subject.
Bertrand Russell's use in one of his political papers of his concept of definite
descriptions merely as an illustration rather than as the subject of a central
argument is sufficiently significant for inscription in a subject statement in
indexing his Collected papers because the concept is central in other major
Russell texts. These intertextual criteria are concealed by mentalism's focus on
the supposed mental rules for 'processing' a single text. Intertextuality is
limited by mentalism to the membership of several texts in the same class by
virtue of their 'mental processing' yielding 'subjects' or 'aboutnesses'[8, p. 95].
But mentalism does not allow that the significance of a part of a text may be
constituted by another text. Research aimed at explicating intertextual criteria
of significance should also not be retarded by mentalism's unduly narrow
focus.
Fifth, mentalism's focus on processes occurring in minds conceals the
crucial social context of rules. Since we do not understand the rule we are
constructing without understanding its social context, or the way it is
embedded in the social world, its point, its purpose, the intentions and
interests it serves, in short, the social role of its practice, indexing theory
cannot avoid investigation into the historical, economic, political, and social
context of the rules in its domain. Mentalism, on the other hand, either erases
the social dimension altogether by conceiving rules as operating in disembodied,
ahistorical, classless, genderless, and universal minds, or else acknowledges it
only by expanding the set of rules of mental processing. Beghtol, for example,
following van Dijk, allows that the unconscious rules of text processing may
include some determined by individual differences and culture[8, pp. 98-101].
But these particular rules are conceived merely as additions to our mental
equipment. As mental effects, caused perhaps by the interaction of mind and
the world, they are not conceived as instruments we construct in order to
institute particular practices, and are therefore, like the rules of universal
minds, outside the realm of deliberate change and willed construction.
Many consequences follow from the disclosure of the social context of rules.
Questions of justification return to centre stage from the margins of etiquette
96
June 1990 INDEXING
97
JOURNAL OF DOCUMENTATION Vol. 46, no. 2
discrimination. Not only are text retrieval practices of the socially, economic-
ally and politically marginalised more difficult to discern, but also the
construction of indexing rules aimed at meeting their needs may quite
explicitly become part of the exercise of building new and alternative social
practices.
Some questions demand critical inquiry. Does text retrieval fulfil a need, or
does it satisfy a want? Indexing rules will look quite different depending upon
how this question is answered. Wants are explicitly recognised and admitted;
they reflect the agents' goals, purposes, and intentions. Not all needs are
known, and someflyin the face of wants. For example, not everyone knows
what they need to prevent AIDS, and not everyone wants what they need.
Identification of needs depends upon a conception of human nature and the
social world; wants can be identified by questionnaire. If want satisfaction
alone is considered the end of text retrieval, then most indexing rules will serve
the retrieval practices of the prevailing form of social organisation. Among the
important indexing rules for want satisfaction in consumer capitalism, for
example, are those which efficiently represent goods for consumption. On the
other hand, if text retrieval is taken to fulfil needs, then rules for its practice
may not only be inconsistent with the aims of the dominant social order, but
may also be antagonistic to them.
Other questions demand political analyses. What and whose aims, goals,
strategies, and intentions are fulfilled by text retrieval in the social world in
which indexes, abstracts, online databases, catalogues, thesauri, biblio-
graphies and the whole range of retrieval apparatus make their appearance? Is
the retrieval of truth a desirable (or even feasible) retrieval practice, as Patrick
Wilson[13] has suggested? If so, this purpose imposes a serious constraint on
the kind of indexing rules governing the derivation of indexing phrases. Is the
spread of disinformation in the service of ruling elites the purpose of text
retrieval? If so, this purpose imposes a rather different constraint on indexing
rules. Is text retrieval an economic transaction yielding a profit for the
database owner, and therefore a social practice whose participation is
constrained by ability to pay, or is it a fundamental right that ought to be
guaranteed by government? Again, differing conceptions of the social role of
text retrieval will determine the kinds of indexing rules we construct.
Some questions have already been raised in the literature. Is there a large
category of document uses for which the subject of a text, as identified by inter-
and intratextual criteria of significance, is quite irrelevant? Are certain other
features, such as those identified by Swift and Weinberg[43-46], more
significant than a text's 'aboutness'? Does investigation of the uses of
documents show that a text should be considered in light of 'the projects in
which it can be used, the decisions it can facilitate, the arguments it can
support, the predictions it can warrant?'[13, p. 22] If so, we need tofindways
of constructing rules yielding index phrases representing projects, decisions,
arguments, and predictions. Are the practices of document production
relevant for text retrieval? If so, then retrieval practices based on document
clusters mapped by bibliographic coupling and co-citation patterns can
98
June 1990 INDEXING
eliminate the need for rules for the derivation of indexing phrases altogether.
The implication of the Wittgensteinian shift in the conception of rules is that
questions such as these belong at the centre rather than at the margins of
theoretical reflection on information retrieval. There are doubtless many other
important questions; the sample presented here is not intended to be
exhaustive. It offers little more than a gesture in what, by Wittgenstein's
lights, is a direction of fruitful inquiry for information retrieval. The route
indicated travels from the assumption that retrieval systems depend upon
rules governing the derivation of indexing phrases from texts, through the site
of Wittgenstein's reminder that following a rule is a practice, to the
requirement that in order to construct rules we need to know the game we are
playing.
REFERENCES
1. SCHRADER, A.M. In search of a definition of library and information science.
Canadian Journal of Information Science, 9,1984, 59-78.
2. SCHRADER, A.M. The domain of information science: problems in
conceptualization and in consensus-building. Information Services and Use, 6,
1986, 169-205.
3. SCHRADER, A.M. In search of a name: information science and its conceptual
antecedents. Library and Information Science Research, 6,1984, 227-271.
4. FOSKETT, A.C. The subject approach to information. 4th ed. London: Bingley,
1982.
5. FARRADANE, J. The nature of information. Journal of Information Science, I,
1979, 13-17.
6. FARRADANE, J. Knowledge, information, and information science. Journal of
Information Science, 2, 1980, 75-80.
7. FARRADANE, J. Towards a true information science. Information Scientist, 10,
1976,91-101.
8. BEGHTOL, CLARE. Bibliographic classification theory and text linguistics:
aboutness analysis, intertextuality and the cognitive act of classifying documents.
Journal of Documentation, 42(2), 1986, 84-113.
9. ANDERSON, JAMES D. Indexing systems: extensions of the mind's organizing
power. Information and Behavior, 1, 1985, 287-323.
10. BIVINS, KATHLEEN T. Indexing and universals. The information age in
perspective: Proceedings of the ASIS Annual Meeting, 15, 1978, 35-37.
11. ARTANDI, SUSAN. Machine indexing: linguistic and semiotic implications.
Journal of the American Society for Information Science, 27, 1975, 235-239.
12. VICKERY, B.C. Knowledge representation: a brief review. Journal of
Documentation, 42,1986, 145-159.
13. WILSON, PATRICK. Some fundamental concepts of information retrieval. Drexel
Library Quarterly, 14, 1978, 10-24.
14. WITTGENSTEIN, LUDWIG. Philosophical investigations. Oxford: Basil Blackwell,
1953.
15. WITTGENSTEIN, LUDWIG. Remarks on the foundations of mathematics. Rev. ed.
Oxford: Basil Blackwell, 1978.
16. NEDOBITY, WOLFGANG. Concepts versus meaning as reflected by the works of
E. Wüster and L. Wittgenstein. International Classification, 16, 1988, 24-26.
17. McLACHLAN, HUGH V. Buchanan, Locke and Wittgenstein on classification.
99
JOURNAL OF DOCUMENTATION Vol. 46, no. 2
100
June 1990 INDEXING
101