Professional Documents
Culture Documents
(Studies in Language Companion Series 153) András Kertész, Csilla Rákosi - The Evidential Basis of Linguistic Argumentation-John Benjamins Publishing Company (2014)
(Studies in Language Companion Series 153) András Kertész, Csilla Rákosi - The Evidential Basis of Linguistic Argumentation-John Benjamins Publishing Company (2014)
(Studies in Language Companion Series 153) András Kertész, Csilla Rákosi - The Evidential Basis of Linguistic Argumentation-John Benjamins Publishing Company (2014)
Editors
Werner Abraham Elly van Gelderen
University of Vienna / Arizona State University
University of Munich
Editorial Board
Bernard Comrie Christian Lehmann
Max Planck Institute, Leipzig University of Erfurt
and University of California, Santa Barbara
Marianne Mithun
William Croft University of California, Santa Barbara
University of New Mexico
Heiko Narrog
Östen Dahl Tohuku University
University of Stockholm
Johanna L. Wood
Gerrit J. Dimmendaal University of Aarhus
University of Cologne
Debra Ziegeler
Ekkehard König University of Paris III
Free University of Berlin
Volume 153
The Evidential Basis of Linguistic Argumentation
Edited by András Kertész and Csilla Rákosi
The Evidential Basis
of Linguistic Argumentation
Edited by
András Kertész
University of Debrecen,
and MTA-DE Research Group for Theoretical Linguistics
Csilla Rákosi
MTA-DE Research Group for Theoretical Linguistics
The evidential basis of linguistic argumentation / Edited by András Kertész, Csilla Rákosi.
p. cm. (Studies in Language Companion Series, issn 0165-7763 ; v. 153)
Includes bibliographical references and index.
1. Linguistic models--Data processing. 2. Linguistic analysis (Linguistics) 3. Linguistics--
Research--Methodology. 4. Corpora (Linguistics) 5. Computational linguistics.
I. Kertész, András, editor of compilation. II. Rákosi, Csilla, editor of compilation.
P128.M6E95 2014
410.1--dc23 2013048427
isbn 978 90 272 5918 9 (Hb ; alk. paper)
isbn 978 90 272 7055 9 (Eb)
Chapter 1.
Introduction: The state of the art and the structure of the book 1
András Kertész & Csilla Rákosi
Introduction
The state of the art and the structure of the book
One of the major driving forces in the development of linguistics has been the
quest for its empirical foundations. For example, a prominent historian of linguis-
tics characterises the essence of this development as follows:
Modern theoretical linguistics cannot be understood if its struggle for scientific
status is not placed at the centre of its history. The main problem in this quest for
scientific status has always been that of empirical access, that is, some notion of
what constitutes reliable data, the formulation of one or more causal questions,
and some success at answering these questions by means of an explanatory
theory.(Seuren 1998: 140; bold emphasis added, italics as in the original)
From the seventies on, there have been two interrelated sets of methodological
assumptions which have been accepted by the overwhelming majority of linguists
András Kertész & Csilla Rákosi
from generativists to corpus linguists. The first set of assumptions is what we will
call the standard view of linguistic data and evidence and which we summarise as
follows:1
According to the standard view, the empiricalness of linguistic theories
depends crucially on the use of the appropriate data type. This means that the
origin of data is decisive: according to generativists, introspection is a reliable data
source, while corpus linguists are of the opinion that corpora are the only legiti-
mate data sources. There is general agreement on the view that the treatment of
data does not require special attention, because their handling can be sufficiently
controlled with the help of a few simple, general methodological rules.
The standard view of linguistic data and evidence was for the most part
shaped by a second set of assumptions, namely, the standard view of the analytical
philosophy of science.2 A central idea of the latter was that empirical theories must
consist of statements which can be tested on the basis of a particular subset of data,
namely, evidence. It is the requirement that evidence has to be directly control-
lable, thus objective and totally reliable that lies at the core of the interpretation of
the concept ‘evidence’. From these features it follows that evidence was supposed
to be capable of justifying the hypotheses of theories and providing a firm base for
the decision between rival theories.
From the second half of the nineties onwards, there was a turn in the meth-
odological debates on data and evidence in linguistics: a series of collections of
papers and conferences tackled aspects of linguistic theorising that had been
neglected and had previously remained unreflected.3 The quotation below illus-
trates the problem that has recently been in the focus of interest:
[…] there is a growing awareness among formal linguists that a sole reliance on
introspective data […] will no longer do. Not only should speakers’ intuitions be
collected in a systematic way, but also should the database of linguistic theories be
broadened as to include types of data that well go beyond introspective data as the
primary data of linguistic theorizing (such as e.g. psycholinguistic and historical
. See Kertész & Rákosi (2008a, b, 2012) for detailed overviews.
. A recent textbook characterises the standard view of the analytical philosophy of science
as follows: “The two philosophies, logical postivism and Popper’s falsificationism, are usually
taken together as forming what is known as the classical tradition. Sometimes the term
‘positivism’ is used. At other times, it is called the standard or the orthodox view. These latter
expressions are apt.” (Hung 2014: 311)
. See, among others, Schütze (1996), Borsley (ed.) (2005), Lehmann (2004), Kallmeyer &
Zifonun (eds) (2007), Kepser & Reis (eds) (2005), Kristiansen et al. (eds) (2006), Penke &
Rosenbach (eds) (2004, 2007), Stefanowitsch & Gries (eds) (2007), Sternefeld (ed.) (2007),
Featherston & Winkler (eds) (2009), Winkler & Featherston (eds) (2009).
Introduction
Against this background, raising questions regarding the useable types of data/
evidence and their treatment in linguistic theories are well-motivated method-
ological issues. Nevertheless, there are several reasons why it is extremely difficult
to answer such questions. One reason is that linguistics is not a homogeneous
discipline. Thus, both the diversity and the common aspects of the treatment of
linguistic data and evidence have to be accounted for. Another reason is that the
object of linguistic research is not directly available for the linguist, but has to
be reconstructed from the directly accessible manifestations of the knowledge of
language (see e.g. Kepser & Reis 2005: 1). Therefore, theoretical tools for the repre-
sentation and treatment of the indirectness and the uncertainty of linguistic data
resulting from them are needed, too.
Against these fundamental difficulties, it is understandable that the first
attempts in the current literature to solve methodological problems about linguis-
tic data and evidence are double-faced. On the one hand, contemporary views still
contain elements originating from the standard view of the analytical philosophy
of science. For instance, linguistic evidence is still interpreted as a specific subset of
data which are true with certainty and whose main task is to test (verify, confirm or
falsify) the hypotheses of the theory. On the other hand, the very same views have
aspects which are no longer compatible with the standard view of the analytical
philosophy of science and the standard view of linguistic data and evidence, and
which indicate the need to break with these norms. Every important characteristic
of the concept of evidence adapted from the analytical philosophy of science has
been questioned, and the new proposals raise a series of innovative ideas.4 These
progressive insights are, however, not acknowledged by all authors; and even if
they are accepted by several of them, they are accepted in different forms, so they
cannot be generalised. The most important new insights are the following:
. See Kertész & Rákosi (2008a, b, 2012) for detailed overviews.
András Kertész & Csilla Rákosi
–– Linguistic theorising cannot rely only on one data type but different data types
have to be combined in order to increase the reliability of the results and to
capture the complexity of linguistic behaviour.
–– All data types have their own weaknesses. Thus, the control of the factors
which might influence the reliability of the given data type always requires
special care.
–– Linguistic data do not secure a neutral basis for linguistic theorising because
they are problem- and theory-dependent.
–– The function of data cannot be reduced to the testing of hypotheses. Rather,
there is a cyclic relationship between them and the theory, which means that
data are used in several cycles of the raising, checking, modifying, refining,
and extending of hypotheses and hypothesis systems.
–– Data can function not just as tools of problem solving but may also lead to the
emergence of new problems because the application of different data types
often generates contradictions.
No doubt, the progressive insights may pave the way for a new solution to the data/
evidence problem. The question is: how? To find the answer, let us emphasise that
the discrepancy witnessed by the above double facedness is rooted in the unsatis-
factory nature of metascientific reflection. This means the following.
First, raising questions related to the reliability, combinability, complexity and
treatment of data presupposes the clarification of more fundamental and general
methodological problems: What is a datum? What is evidence? What is the struc-
ture of data/evidence? What is the function of data/evidence in linguistic theories?
etc. The current literature does not address these questions overtly and the latently
presupposed answers seem to be fragmentary and unsatisfactory.
Second, most linguistic theories cannot be interpreted as deductive systems
of hypotheses but rather as complex networks of more or less sophisticated argu-
ments and counter-arguments. These arguments and counter-arguments usually
rely on not perfectly reliable data and regard them as pieces of evidence. Thus, it
is indispensable to reveal the relationship between the structure and the function
of data and evidence on the one hand and the argumentation structure of theories
on the other.
Third, the main shortcoming of the literature on the data/evidence prob-
lem is that linguists are not in possession of an elaborated, systematic and solid
metatheoretical (methodological) background. As we have shown in Kertész &
Rákosi (2008a, b, 2012), most contributions to the data/evidence debate are based
primarily on the naive reflection of the linguists on their everyday research practice
and contain several metatheoretical prejudices. These metatheoretical prejudices
are mostly remnants of the standard view of linguistic data and evidence and lead
to apparent inconsistencies with the (accepted) practice of linguistic research.
Introduction
. According to the p-model, one never makes a decision between isolated hypotheses
but compares whole theory-variants built around the rival hypotheses – by examining, for
example, the plausibility values of the data and hypotheses included by them as well as their
completeness and consistency.
. As we have already indicated in Footnote 5, this is not automatically so. That is, for
example, if the acceptance of the less plausible hypothesis leads to a consistent theory-version,
while the more plausible hypothesis generates a great number of contradictions, then the less
plausible hypothesis can be a better choice. It may also happen that the less plausible hypoth-
esis is consistent with a set of hypotheses with very high plausibility value, while the more
plausible is incompatible with them, etc. That is, the decision is well-founded only if it takes
every aspect of the possible alternatives into consideration.
Introduction
The volume consists of three parts. Part I outlines the methodological frame-
work. Chapter 2 in Part I entitled The p-model of data and evidence in linguistics
by András Kertész and Csilla Rákosi summarises the line of reasoning that has
motivated the elaboration of the p-model, introduces its central concepts, infers its
central claims, illustrates them by a series of examples and puts forward a possible
metatheoretical solution to the data/evidence problem in linguistics.
The case studies in Part II systematically apply the p-model, as outlined in
Chapter 2, to object-scientific problems and show that its application can further
the solution of the particular problem raised.
Chapter 3 is Károly Bibok’s contribution entitled The plausibility of approaches
to syntactic alternation of Hungarian verbs. With the help of Kertész and Rákosi’s
notion of plausible argumentation, the author attempts to re-evaluate the p lausibility
of the tenets that several different approaches to syntactically alternating verb classes
András Kertész & Csilla Rákosi
Acknowledgements
Work on the present volume was supported by the MTA-DE Research Group for
Theoretical Linguistics, the project OTKA K 77823 as well as the T
ÁMOP-4.2.2
Introduction
References
Bibok, Károly. 2010. From syntactic alternations to lexical pragmatics. In The Role of Data at the
Semantics–pragmatics Interface, Enikő Németh T. & Károly Bibok (eds), 261–304. Berlin:
Mouton de Gruyter.
Borsley, Robert D. (ed.). 2005. Data in Theoretical Linguistics. Special issue of Lingua.
Featherston, Sam & Winkler, Susanne (eds). 2009. The Fruits of Empirical Linguistics, Vol. 1:
Process. Berlin: De Gruyter.
Holtgraves, Thomas & Ashley, Aaron. 2001. Comprehending illocutionary force. Memory &
Cognition 29: 83–90.
Hung, Edwin. 2014. Philosophy of Science Complete. A Text on Traditional Problems and Schools
of Thought. Wadsworth: Cengage Learning.
Juge, Matthew L. 2006. Morphological factors in the grammaticalisation of the Catalan “go” past.
Diachronica 23(2): 313–339.
Kallmeyer, Werner & Zifonun, Gisela (eds). 2007. New York: De Gruyter (Institut für Deutsche
Sprache – Jahrbuch 2006).
Kaplan, David. 1989. Demonstratives. In Themes from Kaplan, Joseph Almog, John Perry &
Howard Wettstein (eds), 481–563. Oxford: OUP.
Kepser, Stephan & Reis, Marga (eds). 2005. Linguistic Evidence. Empirical, Theoretical and Com-
putational Perspectives. Berlin: De Gruyter.
Kepser, Stephan & Reis, Marga. 2005. Evidence in linguistics. In Kepser & Reis (eds), 1–6.
Kertész, András & Rákosi, Csilla. 2008a. Conservatism vs. innovation in the debate on data in
generative grammar. In Kertész & Rákosi (eds), 85–108.
Kertész, András & Rákosi, Csilla. 2008b. Conservatism vs. innovation in the (un)grammaticality
debate. In Kertész & Rákosi (eds), 61–84.
Kertész, András & Rákosi, Csilla (eds). 2008. New Approaches to Linguistic Evidence. Pilot Stud-
ies / Neue Ansätze zu linguistischer Evidenz. Pilotstudien. Frankfurt: Peter Lang.
Kertész, András & Rákosi, Csilla. 2012. Data and Evidence in Linguistics. A Plausible Argumenta-
tion Model. Cambridge: CUP.
Kristiansen, Gitte, Achard, Michel, Dirven, René & de Mendoza Ibáñez, Francisco J. Ruiz
(eds). 2006. Cognitive Linguistics: Current Applications and Future Perspectives. Berlin: De
Gruyter.
Lehmann, Christian. 2004. Data in linguistics. The Linguistic Review 21: 175–210.
Penke, Martina & Rosenbach, Anette (eds). 2004. What Counts as Evidence in Linguistics?
Studies in Language 28(3): 481–747. Special Issue.
Penke, Martina & Rosenbach, Anette (eds). 2007. What Counts as Evidence in Linguistics?
[Benjamins Current Topics 7]. Amsterdam: John Benjamins.
Searle, John R. 1969. Speech Acts. An Essay in the Philosophy of Language. Cambridge: CUP.
Schütze, Carson T. 1996. The Empirical Base of Linguistics. Grammaticality Judgments and
Linguistic Methodology. Chicago IL: The University of Chicago Press.
Seuren, Pieter A.M. 1998. Western Linguistics. An Historical Introduction. Oxford: Blackwell.
András Kertész & Csilla Rákosi
Stefanowitsch, Anatol & Gries, Stefan T. (eds). 2007. Grammar Without Grammaticality [Corpus
Linguistics and Linguistic Theory 3(1)].
Sternefeld, W. (ed). 2007. Data in Generative Grammar [Theoretical Linguistics 33(3)].
Winkler, Susanne & Featherston, Sam (eds). 2009. The Fruits of Empirical Linguistics, Vol. 2:
Product. Berlin: De Gruyter.
part i
In the Introduction to the present volume, we put forward a brief analysis of the
current debate on linguistic data and evidence. Against this background, the prob-
lem to be solved is as follows:
(P) (a) What kind of metascientific framework is capable of providing a novel
and comprehensive model of linguistic theorising?
(b) How can the concepts of ‘linguistic data’ and ‘linguistic evidence’ be
defined in such a way that the shortcomings of the current literature
can be avoided?
. Our approach has been motivated by the ideas of George Polya and Nicholas Rescher (cf.
e.g. Polya 1948, 1954; Rescher 1976, 1987), and prepared e.g. in Kertész (1993, 2004a, 2004b)
and Kertész & Rákosi (2006, 2009).
The p-model of data and evidence in linguistics
Thus, in the present chapter we will confine the discussion of the basic notions
on which the p-model rests to the elucidation of the ideas motivating them with-
out striving for formalisation. For the more complete and more precise – but, for
the substantial reason mentioned, still informal – presentation of the p-model see
Kertész and Rákosi (2012).
–– The linguistic intuition of the linguist (in this case, that of Komlósy) may be
influenced and rendered unreliable by the theory he relies on. Thus, he may be
mistaken about the analysis of the sentences at issue and state incorrectly that
they are subjectless in this sense.
–– It might happen that other native speakers judge the sentences containing the
given construction differently.
Németh T. (this volume) shows that after involving further sources (arguments
relying on introspective data) this hypothesis has to be regarded as implausible.
can be deemed plausible, among others, on the basis of the frequency analysis
provided in Juge (2006) as a source. Nevertheless, Nagy C. (this volume) refers to
two factors which may diminish the reliability of this source: “First, in other cases
of analogical change the basicness of third persons is emphasised. Second, it is
questionable whether the frequency will remain the same, when investigating it
in a larger corpus.”
Plausibility can be represented numerically (cf. Rescher 1976). These values, how-
ever, do not have any exact meaning which could be applied in every context but
solely indicate different relative strengths of plausibility (reliability, supportedness,
acceptance) within one argumentation process (theory). The minimum value of
plausibility is indicated by 0, which means that the given statement is of neutral
plausibility on the basis of the source at issue. The maximal value is indicated by
1 meaning that the statement in question is true with certainty on the basis of the
given source.
To sum up, the plausibility value of the statement p on the basis of the source
S is such that
a. |The sentence Elements moving rightward cannot be moved out of the clause
in which they originate contains the cognitive metaphor Clauses are
containers.|S = 0
where S is the Theory of Government and Binding (GB Theory). This statement
has neutral plausibility according to the source S, because neither it nor its nega-
tion is supported by the source S. The reason for this is that GB Theory cannot say
anything about the acceptability of this hypothesis because it does not make use of
concepts such as ‘cognitive metaphor’.
If this statement has not been tested yet but is only a conjecture raised at this
moment, then it has to be assigned a low plausibility value on the basis of S which
is an intellectual source.
It may be the case that the hypothesis in (b) has undergone some testing with the
help of a thought experiment conducted by the linguist. That is, if he/she has tried
to think over its consequences and examined its tenability on the basis of a few
acceptability judgements as evidence, then this hypothesis can receive a somewhat
higher plausibility value on the basis of a thought experiment as source S than in the
former case. Nevertheless, this plausibility value cannot be high, since the testing
was unsystematic and relied solely on the linguistic intuition of the given linguist.
d. |In sentences containing both a degree word complement clause and a relative
clause extraposed from subject position, the relative clause has to precede the
degree word complement clause.|S = 0.6
Suppose that this hypothesis has been checked by a linguist with the help of several
pairs of sentences such as People were so angry who knew John that they refused to
participate vs. *People were so angry that they refused to participate who knew John.
In this case, the above hypothesis can be judged to have a rather high plausibility
value on the basis of the linguist’s linguistic intuition as well as his/her linguistic
András Kertész & Csilla Rákosi
analyses as a source S. Nevertheless, we must not forget that the judgements which
are rooted in this source may be controversial.
e. |The sentence People were so angry who knew John that they refused to par-
ticipate is grammatically correct, while the sentence People were so angry that
they refused to participate who knew John is ungrammatical.|S = 0.8
It has to be stressed that low plausibility values do not mean improbability but rather,
a relatively small, limited amount of plausibility (reliability, acceptance). In such
cases, the source votes expressly for the given hypothesis. If a source is against a
hypothesis then it makes its negation plausible and the given hypothesis implausible.
We distinguish between direct and indirect sources. In the case of direct
sources, the plausibility of the statement at issue is evaluated with respect to the
reliability of the source. With indirect sources, however, the plausibility value of
the given statement is determined with reference to the plausibility of other state-
ments – that is, with the help of an inference.
If she simply referred to Juge (2006) without presenting his argumentation, then
the plausibility value would be assigned to this statement on the basis of this paper
as a direct source. Nagy C., however, presents Juge’s argumentation in detail.
Therefore, the above hypothesis is supported by a series of plausible inferences as
an indirect source.
2.3 O
btaining new information from uncertain information:
Plausible inferences
Deductive inferences the premises of which are true with certainty are capable of
guaranteeing the truth of the conclusion. That is, if the premises of an inference are
true with certainty, and there is a logical consequence relation between them and
the conclusion, then we are totally justified in accepting the conclusion. However,
The p-model of data and evidence in linguistics
very often either there is no logical consequence relation between the premises
and the conclusion or at least one of the premises, instead of being certainly true, is
only plausible in the light of the given sources. Moreover, the combination of these
two cases may also occur. Although such inferences are not capable of securing the
certainty of the conclusion (assured hypothetically or actually by the premises),
they may – under appropriate circumstances – make the conclusion plausible.
Plausible inferences are mappings associating a set of plausible and/or true
statements (premises) with a plausible statement (conclusion). In such inferences
the connection between the premises and the conclusion cannot be reduced to the
relationship between their logical structures, as is the case with deductive infer-
ences. Plausible inferences take into consideration not only the logical structure of
the premises and the conclusion but their plausibility values and semantic struc-
ture as well. They rest on a semantic relation: for example, causality, analogy, simi-
larity, sign, necessary or sufficient condition, part-whole relation etc.
The first type of plausible inferences consists of cases in which there is a logical
consequence relation between the premises and the conclusion but at least one of
the premises is only plausible and not true with certainty.
Example 5. Let us illustrate this by an example taken from Bibok (this volume):
The first premise is a conditional sentence. It captures a lexical rule proposed for
the locative alternation of the verb ken ‘smear’ (cf. Pinker 1989: 79). This hypoth-
esis results from an inductive generalisation about the syntactic behaviour and the
semantic structure of a certain verb class. Although we cannot state that the first
premise is true with certainty, it is legitimate to deem it plausible on the basis of the
two sources mentioned. The second premise is plausible on the basis of theoretical
considerations. That is, it can be hypothesised but one cannot claim with certainty
that it is true. Due to the logical structure of the premises and the conclusion, there
is a logical consequence relation between the two premises of this inference.
It is clear that these premises are not sufficient to support the conclusion per-
fectly. If the premises were true with certainty, then the conclusion could also be
considered as certainly true. If the plausibility value of one of the premises were 0,
András Kertész & Csilla Rákosi
then the conclusion would be of neutral plausibility, too. It is also clear that if the
plausibility of the premises increases (decreases), then the plausibility of the con-
clusion will also move in the same direction. From this it follows that if both prem-
ises are plausible statements, then one is allowed to suppose that the plausibility
of the conclusion lies between the two cases mentioned – that is, it is a plausible
statement, too.
Example 6. With inductive inferences, the set of the premises has to be supple-
mented by the background assumption that the cases not examined also possess
the characteristics that could be found in the investigated ones; the conclusion
states the presence of these characteristics as a general rule. Let us examine the
following example from Rákosi (this volume):
|In Axininca Campa, it is the output-form [noŋkomati] that belongs to the
input /no-N-koma-i/.|K = 1
1
|In Axininca Campa, it is the output-form [noŋkomataati] that belongs to
the input /no-N-koma-aa-i/.|K = 1
1
|In Axininca Campa, it is the output-form [noŋkomatakoti] that belongs to
the input /no-N-koma-ako-i/.|K = 1
1
The p-model of data and evidence in linguistics
Since Kager never indicates that statements of this type could turn out to be
false and be in need of revision, he seems to assign a very high or even maximal
plausibility value to the premises; in this reconstruction, we will choose the lat-
ter interpretation. At this point of his argumentation, he does not mention fur-
ther arguments which could support or refute the latent background assumption
which covers all cases not mentioned in the premises. Therefore, it has neutral
plausibility on the basis of K1, that is, the relevant passages of Kager (1999) as a
direct source. At a later stage of his argumentation (abbreviated as source K2),
however, Kager presents data which indicate that the latent background assump-
tion is implausible:
|In Axininca Campa, it is the output-form [osampi] that belongs to the
input /osampi/ and not the form *[tosampi].|K = 1
2
|In Axininca Campa, it is the output-form [iŋkomati] that belongs to the
input /i-N-koma-i/ and not the form *[tiŋkomati].|K = 1
2
[|For every other input form which contains a syllable without an onset in
Axininca Campa in word-initial position, the corresponding output form
has no onset word-initially.|K = 0]
2
0 < |Word-initial syllables do not have an onset in Axininca Campa.|I < 1
2
Thus, if we take into consideration the information provided by this source too,
and evaluate the plausibility value of the premises on the basis of the sources K1
and K2 together, then the first inference is no longer capable of supporting the
plausibility of the conclusion. This is due to the circumstance that it relies on a
latent background assumption which is implausible instead of being at least of
neutral plausibility.
András Kertész & Csilla Rákosi
There is no logical consequence relation between the premises and the conclusion.
This inference can be transformed into the following inference which is deduc-
tively valid:
[|If to every input form belongs the output form on the surface which
is optimal according to an evaluation based on the constraint ranking
Onset >> Dep-IO, then Axininca Campa is characterised by the constraint
hierarchy Onset >> Dep-IO.|S = 1]
[|The output form [noŋkomati] is optimal according to an evaluation based
on the constraint ranking Onset >> Dep-IO.|S = 0]
0 < |It is the form [noŋkomati] that belongs to the input /no-N-koma-i/ at
the surface representation.|S ≤ 1
[|In every other case too, to the input form belongs the output form on the
surface which is optimal according to an evaluation based on the constraint
ranking Onset >> Dep-IO.|S = 0]
0 < |Axininca Campa is characterised by the constraint hierarchy Onset >>
Dep-IO.|I < 1
(H) Sentential complements are generated in the deep structure between the
degree word and the adjective in the case of too, so and after the degree
word in the case of enough as sisters of the degree word. Then they get
moved rightward and adjoined to S’ in the surface structure (i.e. they are
‘extraposed’).
From (H) it follows that in sentences containing a relative clause extraposed from
subject position and a degree word complement clause (sentential complement),
the relative clause precedes the degree word complement clause – supposing that
relative clauses are adjoined to S’. (H) was tested with the help of the following data:
(D) The sentence People were so angry who knew John that they refused to
participate is grammatically correct, while the sentence People were so angry
that they refused to participate who knew John is ungrammatical.
András Kertész & Csilla Rákosi
(D) can be regarded to be plausible – but not certainly true – on the basis of Baltin
(1987) as a source, since this source is basically nothing else but Baltin’s linguistic
intuition. From this we obtain the following plausible inference:
0 < |If (H), and relative clauses are adjoined to S’, then the sentence
People were so angry who knew John that they refused to participate is
grammatically correct, while the sentence People were so angry that they
refused to participate who knew John is ungrammatical.|B < 1
0 < |The sentence People were so angry who knew John that they refused to
participate is grammatically correct, while the sentence People were so angry
that they refused to participate who knew John is ungrammatical.|B < 1
0 < |(H), and relative clauses are adjoined to S’.|I < 1
There is no logical consequence relation between the premises and the conclusion
but there is a semantic relation between them, both premises are plausible on the
basis of the same source and the inference can be transformed into a plausible infer-
ence of Type 1. Relying on this inference as an indirect source one may assign only
a relatively low plausibility value to (H). This plausibility value can be increased, for
example, if one checks (H) with the help of further “linguistic data” like (D).
The three cases also witness that plausible inferences are fallible. First, at the out-
set their conclusion is not true with certainty but only plausible to some extent;
that is, the conclusion gets only partial support from the premises. Second, plau-
sible inferences with latent background assumptions are especially liable to mislead
because if one of the latent background assumptions turns out to be implausible or
false, then the inference will be insufficient to establish the plausibility of the conclu-
sion even if the premises are true or plausible. Moreover, taking into consideration
a wider scope of sources, one may come into possession of information that makes
the premises, the latent background assumptions or the conclusion implausible or
false. Therefore, it is always the case that the conclusion is plausible only relative to
the premises, the latent background assumptions and the sources supporting them.
forward, used and evaluated. The p-context includes, first, a set of sources in terms
of which the plausibility value of statements can be judged. Second, it covers a
set of statements together with their relevant characteristics (in particular, their
plausibility values with respect to the sources in the p-context, and their logical
and semantic structure). Third, the accepted methodological norms related to the
components of the p-context (for instance, the permissible type(s) of inference,
the criteria of the acceptability of the statements, the methods of the treatment
of inconsistencies, the criteria for judging the reliability of the sources etc.) also
belong to the p-context.
Plausible inferences are in several respects p-context-dependent. First, the
p-context facilitates plausible inferences only in cases in which the premises are
plausible or true according to some sources in the context. In addition, plausible
inferences have to be legitimate inference types in the given p-context. Second,
if the p-context changes so that new sources become available which may influ-
ence the plausibility value of the premises (by making them more plausible or less
plausible or even implausible), then the plausibility of the conclusion will change
as well. Third, if there is no logical consequence relation between the premises and
the conclusion, then the plausibility of the conclusion does not only depend on
the plausibility of the premises, but is also influenced by every piece of informa-
tion related to the latent background assumptions. From this it follows that any
new information which strengthens or weakens the plausibility of the premises
and that of the latent background assumptions influences the plausibility of the
conclusion as well. This also means that plausible inferences are dynamic. They
are indirect sources which are sensitive to changes in the p-context. Fourth, there
are often many (direct or indirect) sources that allow a plausibility value to be
assigned to a statement. In such cases, the plausibility of this statement is higher
than its plausibility on the basis of the individual sources.
Inference 1:
0 < |In English, there is no context-shift.|S < 1
1
0 < |In Hungarian, there is no context-shift.|S < 1
1
0 < |In German, there is no context-shift.|S < 1
1
[|In all other languages, too, there is no context-shift.|S = 0]
1
0 < |There are no context-shifting languages.|I < 1
1
The latent background assumption has neutral plausibility on the basis of the
sources within this p-context (S1), since they do not contain more relevant
information.
Thus, if we test (FT) on the languages mentioned, we obtain the following
plausible inference:
Inference 2:
|If (FT) is crosslinguistically applicable, then there are no context-shifting
languages.|S1 = 1
0 < |There are no context-shifting languages.|I < 1
1
0 < |(FT) is crosslinguistically applicable.|I < 1
2
If one extends the p-context by further data which indicate the presence of
context-shift in further languages, then the plausibility value of the second premise
increases, and, as a consequence, that of the conclusion increases, too. In contrast,
in this extended p-context the first inference is no longer capable of supporting
its conclusion. The reason for this is the circumstance that its latent background
assumption is not plausible on the basis of all available sources (S1 and S2). As a
consequence, the second inference loses its capability to support its conclusion,
The p-model of data and evidence in linguistics
because its second premise is not plausible on the basis of all available sources in
this extended p-context.
According to Vecsey (present volume), however, the situation changes again,
if we add the results of Stechow’s (2003) considerations to the p-context. He claims
that the argumentation presented by Schlenker (2003) and Anand & Nevins (2004)
is faulty and they cannot be regarded as reliable sources. In this p-context, the
source S2 loses its reliability and is no longer capable of supporting the premises of
Inference 3. Therefore, Inference 4 cannot be regarded as a reliable indirect source
either, which could make its conclusion plausible.
Example 10. As Németh T. (this volume) shows, the p-context of Komlósy’s (1994)
lexical-functional approach becomes p-inconsistent by adding further data to it.
Namely, in the p-context of Komlósy (1994) the following statement receives a high
plausibility value, since it is supported by several arguments (plausible inferences):
(H) Hungarian verbs of natural phenomena cannot occur with an explicit
subject and do not license a formal subject, either.
Németh T., however, presents corpus data which support the opposite standpoint:
(~H) There are verbs of natural phenomena in Hungarian that can occur with
explicit subjects.
Both (H) and (~H) are assigned a high plausibility value by their sources. Thus, the
extended p-context is p-inconsistent.
Example 11. As Nagy C. (this volume) points out, Detges (2004) cannot be
regarded as a useable source on the basis of which it would be possible to assign a
plausibility value to the statement
András Kertész & Csilla Rákosi
The Old French form (il) va dire is a present form [meaning ‘all of a sudden,
he says’].
or to its negation
The Old French form (il) va dire is not a present form [but it is in the
preterit, meaning ‘suddenly, he said’].
The reason for this lies in the circumstance that Detges refers in both cases to the
same source, namely to Gougenheim (1971 [1929]: 96f.). From this it follows that in
the p-context of Detges (2004), the above statement is of neutral plausibility; thus,
this p-context is informationally underdetermined, because it is p-incomplete.
Example 12. Nagy C. (this volume) presents the p-context consisting of investi-
gations into the morphological aspects of the history of the Catalan “anar ‘go’ +
infinitive” construction. This p-context is p-inconsistent, because the statement
(H) The Catalan GO-past comes from the construction “GO-verb conjugated in
present tense + infinitive”
as well as its negation are plausible according to some source. Nagy C. analyses
three solutions proposed in the literature:
(S1) Colon (1978a, b) presents arguments which unanimously support (H). He
states that the current meaning of “anar + infinitive” can be derived from
the present auxiliary variant. The reason for the loss of preterit forms in the
auxiliary was a historical present use which pushed the preterit forms of the
auxiliary into the background. In this p-context version, it is the periphrasis
with present auxiliary which undergoes the semantic change. The spread of
the present auxiliary forms is interpreted as being the result of an extension
of historical present usage.
(S2)
Detges (2004) deems (H) plausible and (~H) implausible and agrees with
Colon on the hypothesis that the current meaning of “anar + infinitive”
can be derived from the present auxiliary variant. He claims that the tense
of the auxiliary is optional, although the present-tense variant was used
more frequently at all times. He also maintains that the present auxiliary
periphrasis conventionalised as a foreground-marker due to its high
frequency.
(S3)
Juge (2006) rejects (H) and supports (~H). According to this p-context
version, it is not the periphrasis with present auxiliary which undergoes
the semantic change, but the periphrasis with preterit auxiliary. Further,
the spread of the present auxiliary forms is the result of a reinterpretation,
triggered by syncretism in one of the most frequent forms of the paradigm.
To sum up, while in the solutions provided by Colon and Detges a high plausi-
bility value is assigned to (H), and (~H) is rejected, Juge proposes the opposite:
giving up (H) and keeping (~H). Thus, all three proposals are consistent solutions
of the starting p-problem along the lines of point (a) mentioned above (which
says that the statement at issue is unanimously supported by the sources); that is,
according to these authors, one of the rival hypotheses has to be given up and only
the other one can be deemed to be plausible. With the help of the extension of
the p-context by further sources and data, Nagy C. argues that the first two solu-
tions are untenable, because the plausibility of their hypotheses is lost or at least
decreases considerably and the p-inconsistency of these p-contexts increases. In
addition, the solution proposed by Juge (2006) can be regarded as the resolution
of this p-problem because it does not contain p-inconsistencies and its hypotheses
have a high plausibility value on the basis of the sources presented by Nagy C.
Example 13. Nagy C. (this volume) mentions a solution of a p-problem that is based
on the method described in point (b) above, that is, on the systematic separation of
the conflicting statements. As she points out, in Colon’s account both the hypothesis
András Kertész & Csilla Rákosi
that the GO-periphrasis has a past tense value in Old Catalan and its rival, accord-
ing to which the periphrasis manifests a historical present use, are plausible. This,
however, leads to p-inconsistency. Namely, from the first hypothesis it follows that
the GO-verb does not refer to real motion but is grammaticalised to some degree.
If the second hypothesis holds, then the GO-verb should refer to real motion. Nev-
ertheless, we can maintain both hypotheses if we separate their field of application
and suppose that they refer to different periods in the development of Old Catalan.
a. the extension of the p-context by new sources, new statements, new methods
etc.;
b. the coordination of the extended p-context (for example, comparing the plau-
sibility values of the statements stemming from the old and the new sources,
collecting and summarising information about the reliability of the sources,
checking the consistency of the set of the plausible statements etc.);
The p-model of data and evidence in linguistics
c. the modification of the extended and coordinated p-context. This means that
one has to elaborate the p-context that can be regarded as a re-evaluated ver-
sion of the starting p-context. For instance, one has to determine the plau-
sibility value of the statements in the p-context on the basis of the totality
of the sources in the p-context, one has to decide which sources have to be
rejected as unreliable or accepted as new, which methodological norms have
to be abandoned, modified or accepted, etc.; and
d. the comparison of the rival solutions.
limits given – that is, the resolution of the starting p-problem; otherwise, a new
argumentation cycle has to be initiated.
Figure 1 illustrates the main stages of the argumentation process.
Argumentation cycle
Elaboration of the
p-context
Subcycle
Coordination of the p-context
Example 14. Kertész & Rákosi (this volume) and Cs. Rákosi (this volume) recon-
struct the argumentation cycles of the given author. Bibok (this volume), Németh T.
(this volume), Nagy C. (this volume), Koczogh (this volume), Gy. Rákosi (this
volume) and Vecsey (this volume) present and analyse rival solutions to a given
p-problem. Since in the first cycle it can only be shown that they are all problem-
atic and do not provide the resolution of the p-problem investigated, the authors
start a new argumentation cycle in order to find a resolution with the help of the
extension of the p-context with new sources.
The p-model of data and evidence in linguistics
a. The Contrastive Strategy. The essence of this strategy is that it treats the
p-context versions containing the contradictory statements as rival alterna-
tives, compares them and strives for a decision between them.
b. The Exclusive Strategy. This strategy is the continuation of the Contrastive
Strategy in cases in which a decision has been reached between the rival
p-context versions elaborated. It fulfils a kind of control function insofar as it
examines whether the p-context version chosen can provide an explanation
for all phenomena that could be explained by the rejected p-context version.
This is important because the explanatory power of the resolution should be
as high as possible; therefore, information loss resulting from the rejection of
p-context versions should be avoided or at least minimised.
c. The Combinative Strategy. It may be the case that one wants to keep both rival
p-context versions because they illuminate some phenomenon from different
points of view which are equally important and cannot be given up. There-
fore, the two p-context versions are no longer deemed to be rivals but are
András Kertész & Csilla Rákosi
Example 15. Nagy C. (this volume) first applies the Contrastive Strategy and strives
for a comparison of the plausibility of the hypotheses and latent background
assumptions and via this, of the tenability of the rival solutions investigated. As
a result, she comes to the decision that one of the three solutions is considerably
better than the other two. At this point, she changes to the Exclusive Strategy and
examines whether the solution of Juge (2006) can solve all problems which have
arisen within the two rejected solutions.
Example 16. Bibok (this volume) adopts first the Contrastive Strategy, too. But after
the comparison of the rival solutions to problems related to the Hungarian loca-
tive alternation verbs presented by the lexical and the constructional approaches,
he changes to the Combinative Strategy. The reason for this decision lies in the
need to keep both perspectives, because lexical and constructional factors seem
to play a role in syntactic alternations. Therefore, neither of the two proposals can
be given up because relevant phenomena would remain without an explanation.
Thus, by the exact specification of the role of the lexical and the constructional
factors in the model, Bibok proposes a lexical-constructional explanation that is
capable of unifying the explanatory power of the two approaches without leading
to proliferation of p-inconsistencies in his approach.
The p-model allows us to describe the different strategies for the treatment of con-
tradictions, considering the latter to be an integral part of linguistic theorising, that
is, of a cyclic and prismatic argumentation process. These strategies make it possi-
ble to deem inconsistency not a fatal failure. They do not compel us to treat all con-
tradictions in a unique way but license several possible methods for their handling:
–– With the help of cyclic argumentation based on plausible inferences, one may
try to resolve contradictions by giving up one of their members in that the
Contrastive and the Exclusive Strategy are made use of.
–– One may tolerate a contradiction with the help of their paraconsistent repre-
sentation based on the application of the Combinative Strategy. In this case,
they may be regarded not as temporary phenomena but as constitutive ele-
ments of the given theory.
–– In the absence of an acceptable solution, when both kinds of strategies seem
to fail, the inconsistency may be temporarily tolerated until one is in posses-
sion of new information on the basis of which new argumentation cycles can
be initiated.
The p-model of data and evidence in linguistics
Naturally, the next step is to show how the p-model defines the notions of linguis-
tic data and evidence.
3. A
possible solution to (P)(b): The p-model’s concepts of ‘data’
and ‘evidence’
3.1 Data
The p-model suggests that data are not identical to “examples” such as the follow-
ing Hungarian sentences:
Az anya vizet önt a virágra.
*Az anya vízzel önti a virágot.
Rather, their structure consists of two components: a statement with an informa-
tion content and a plausibility value supported by a direct source (see also Rescher
1979: 69):
(D1) A datum is a statement with a positive plausibility value originating from
some direct source.
where S is the linguistic intuition of the linguist as a direct source. A second pos-
sibility is the transformation to a datum with a more complex structure whose
plausibility value originates from a compound of the linguist’s linguistic intuition
and linguistic knowledge:
0 < |The sentence Az anya vizet önt a virágra is grammatical in Hungarian
while the sentence *Az anya vízzel önti a virágot is not, and the meanings
and grammatical properties of the words and parts of words in these two
sentences can be characterised as follows:
András Kertész & Csilla Rákosi
Accordingly, data are in most cases not claimed to be true with certainty, but they
are usually more or less plausible “truth candidates”. Their plausibility is usually
supported by the sources to some extent but the sources are not able to make them
certainly true (see also Rescher 1976: 8, 1977: 213, 1987: 307). Nevertheless, a datum
must possess a certain degree of initial plausibility, that is, it has to receive a plausi-
bility value from some reliable source. Statements which are of neutral plausibility
or implausible according to any sources in the p-context, are not data in this sense.
is a datum in Németh T. (this volume: Section 2.4) because it receives its initial
plausibility value on the basis of the linguistic intuition of the author of Tóth
(2001). This plausibility value is not maximal, because linguistic intuition cannot
be regarded as a totally reliable source. As Németh T. (this volume) remarks, the
evaluation of the correctness of these sentences is not unanimous, because another
native speaker, for example, Sárik (1998) judges the acceptability of -ván-participle
forms differently. That is, on the basis of this source, the following statement counts
as a datum:
Neither the sentence Hajnalodván, elindultunk hazafelé, nor the sentence
Hajnalodva, elindultunk hazafelé is grammatically correct in Hungarian.
. For a better understanding, we repeat the author’s glosses here. It is important to remark,
however, that they do not belong to the datum at issue (although it is possible to construct a
datum that involves this information, too):
From this it follows that the p-context is p-inconsistent because it contains two
incompatible data. Despite this, the initial plausibility value of the first datum can
be increased by collecting judgements from other native speakers, too.
The statement
The va-participle *hajnalodva formed from the weather verb hajnalodik
‘to dawn’ is ill-formed, but the ván-participle form of it hajnalodván is
well-formed in Hungarian.
looks similar to the above datum but it is not a datum according to the terminol-
ogy of the p-model in the argumentation conducted by Németh T. (this volume).
The reason for this is that its plausibility value does not originate from a direct
source (a linguistic paper or the linguistic intuition of native speakers) but from
an indirect one (that is, from a plausible inference):
0 < |The sentence Hajnalodván, elindultunk hazafelé is correct, while the
sentence Hajnalodva, elindultunk hazafelé is not.|S < 1
0 < |If the sentence Hajnalodván, elindultunk hazafelé is correct, then the
ván-participle form of the verb hajnalodik ‘dawn’ is well-formed.|S < 1
0 < |If the sentence Hajnalodva, elindultunk hazafelé is not correct, then the
va-participle *hajnalodva formed from the weather verb hajnalodik ‘dawn’ is
ill-formed.|S < 1
0 < |The va-participle *hajnalodva formed from the weather verb hajnalodik
‘to dawn’ is ill-formed, but the ván-participle form of it, hajnalodván is well-
formed.|I < 1
If, however, this statement obtained its plausibility value from a linguistic paper
simply referred to by Németh T., then it would count as a datum.
Example 18. Corpora are often used and reliable – but not completely reliable –
sources, too, which make it possible to assign a plausibility value to statements
such as
The phrase esteledik a nap is grammatical in Hungarian.
The phrase sötétedik a látóhatár is grammatical in Hungarian.
The phrase hajnalodik az idő is grammatical in Hungarian.
The phrase zuhog az eső is grammatical in Hungarian.7
It may be the case that there are native speakers who judge these phrases to be
ungrammatical. Therefore, the presence of the phrases at issue in the Hungarian
National Corpus cannot guarantee that all speakers of Hungarian, or at least the
majority of them, accept them.
. For the analysis of these data, see Németh T. (this volume).
András Kertész & Csilla Rákosi
Example 19. Data used in linguistic theories often have a complex structure in the
sense that the source on the basis of which their plausibility can be judged has to
be interpreted as a compound of several sub-sources. For example, the plausibil-
ity value of the following statements is not just based on the linguist’s linguistic
intuition, native speakers’ judgements or corpus presence, but also on linguistic
analyses with the help of a semantic/syntactic/pragmatic theory:
a. In the sentence Mas con foren prop d’Agda, noves los van venir con havia pres,
lo dia passat, a aquells from Muntaner’s Crònica, van venir ‘arrived’ is a gram-
maticalised form, conveying a past tense meaning.
b. In Axininca Campa, to the input /no-N-čhik-i/ belongs the output-form
[noɲčhiki].
c. Villámlik az ég ‘the sky is lightning’ and sötétedik az ég ‘the sky is getting dark’
are well-formed expressions in Hungarian in which villámlik as well as sötéte-
dik are weather verbs and ég is an explicit subject.
d. In the Amharic sentence ȷ̌on ȷ̌əgna nə-ññ y yl-all, the attitude operator modi-
fies the content of the character of the pronoun.
The structure of the datum in (a), for example, is “The unit S can be identified
in corpus C and it had the characteristics X.” From this it is clear that the source
of its plausibility value is not the corpus (Muntaner’s Crònica) alone. Rather, it
is a compound of the historical document itself, the linguist’s linguistic intuition
and his/her skills in the application of linguistic analysis. The reliability of each
of these (sub-)sources influences the plausibility of the datum. Therefore, it is not
only factors which are decisive for the originality of the historical document that
are relevant; difficulties or uncertainties in the identification and interpretation of
the given linguistic phenomenon must be taken into account as well.
From what we have said about plausible statements in Section 2.2, the following
characterisation of data can be highlighted:
–– Data are “given” in a specific sense: their initial plausibility is not determined
with the help of plausible inferences conducted within the given argumenta-
tion process, but directly on the basis of the reliability of their source. Thus,
they function as starting points: plausibility values enter the argumentation
process through them – and, of course, they can spread towards other hypoth-
eses of the theory by plausible inferences.
–– Since data are defined as statements possessing a positive plausibility value, and
the plausibility value of statements depends on the p-context (see S ection 2.4),
data are p-context-dependent, which means that they are theory-dependent
as well.
The p-model of data and evidence in linguistics
3.2 Evidence
The p-model defines three types of evidence in order to grasp the relationship
between data and other hypotheses of the theory.
Weak evidence for a hypothesis h simply means that we can build inference(s)
on the given datum that make(s) h plausible (in the extreme case true with cer-
tainty). Weak evidence against a hypothesis h means a datum on which we can
build inference(s) that make(s) h implausible (in the extreme case false with
certainty):
(D2) (a) A datum e is weak evidence for hypothesis h, if the p-context contains
statements that extend e into an indirect source on the basis of which a
positive plausibility value can be assigned to h.
(b) A datum e is weak evidence against hypothesis h, if the p-context
contains statements that extend e into an indirect source on the basis of
which a positive plausibility value can be assigned to ~h.
From this definition it follows that a datum can be weak evidence for a statement
and its rival simultaneously, although it may support them to different extents.
Example 21. As Bibok (this volume) shows, the lexical rule approach proposes the
following hypothesis for the description of the locative alternation of verbs:
(H) If there is a verb with the semantic representation ‘X causes Y to move
into/onto Z’, then it can be converted into a verb with the semantic
representation ‘X causes Z to change state by means of moving Y into/
onto it’.
(D1) is weak evidence for (H):
(D1) The sentence Az anya zsírral keni a kenyeret is grammatical in Hungarian.
First, (D1) is a plausible statement and its plausibility value originates from a direct
source (Bibok’s linguistic intuition). Second, we can construct a chain of plausible
inferences that make (H) plausible and one of them uses (D1) as a premise:
0 < |If (D1), then there is a verb ken with the semantic representation
‘X causes Z to change state by means of moving Y into/onto it’.|B < 1
0 < |(D1)|B < 1
0 < |There is a verb ken with the semantic representation ‘X causes Z to
change state by means of moving Y into/onto it’.|I < 1
1
|If (H), and there is a verb ken with the semantic representation ‘X causes
Y to move into/onto Z’, then there is also a verb ken with the semantic
representation ‘X causes Z to change state by means of moving Y into/
onto it’.|B = 1
András Kertész & Csilla Rákosi
0 < |If (D2), then there is no verb önt with the semantic representation
‘X causes Z to change state by means of moving Y into/onto it’.|B < 1
0 < |(D2)|B < 1
0 < |There is no verb önt with the semantic representation ‘X causes Z to
change state by means of moving Y into/onto it’.|I < 1
3
|If (H), and there is a verb önt with the semantic representation ‘X causes
Y to move into/onto Z’, then there is also a verb önt with the semantic
representation ‘X causes Z to change state by means of moving Y into/
onto it’.|B = 1
0 < |There is a verb önt with the semantic representation ‘X causes Y to
move into/onto Z’.|B < 1
0 < |There is no verb önt with the semantic representation ‘X causes Z to
change state by means of moving Y into/onto it’.|I < 1
3
0 < |(~H)|I < 1
4
Relative evidence for a hypothesis h also requires that the datum provides stronger
support to h than to its rivals:
(D3) (a) A datum e is relative evidence for hypothesis h, if
(i) e is weak evidence for hypothesis h;
(ii) the inference(s) connecting the premises and h provide(s) h with
a higher plausibility value than the plausibility values of h’s rivals
assigned to them by the inferences also using e as a premise.
(b) A datum e is relative evidence against hypothesis h, if
(i) e is weak evidence against hypothesis h;
(ii) the plausible inference(s) connecting the premises and ~h
provide(s) ~h with a higher plausibility value than the plausibility
value of h assigned to it by the inferences also using e as
a premise.
The p-model of data and evidence in linguistics
The reason for this is that (D) can be treated with the help of (H)’s rivals – as Bibok
(this volume) shows – only as an exception. From this it follows that in this case,
the plausibility of the rival hypotheses can only be relatively low, because they can
receive only very weak support from inferences making use of (D). In contrast, the
plausibility of (H) can be high on the basis of plausible inferences involving (D)
as a premise.
The third type is strong evidence which means that the datum makes only hypoth-
esis h plausible and does not provide any support to its rivals:
(D4) (a) A datum e is strong evidence for hypothesis h, if
(i) e is weak evidence for hypothesis h;
(ii) e is not weak evidence for any of h’s rivals.
(b) A datum e is strong evidence against hypothesis h, if
(i) e is weak evidence against hypothesis h;
(ii) e is not weak evidence against any of h’s rivals.
The following plausible inference making use of (D) as a premise indicates that
(H1) has to be regarded as implausible:
|If (H1), and the verb sötétedik is a verb of natural phenomena and ég is an
explicit subject, then (~D).|N = 1
|The verb sötétedik is a verb of natural phenomena and ég is an explicit
subject.|N = 1
0 < |(D)|N < 1
0 < |(~H1)|I < 1
1
András Kertész & Csilla Rákosi
(D) provides also weak evidence for all of (H1)’s rivals mentioned in Németh T.
(this volume), that is, for (H2) and (H3):
(H2) The verb sötétedik belongs to the group of verbs of natural phenomena that
have a subject argument position which can be filled optionally.
|If (H2), then (D).|N = 1
0 < |(D)|N < 1
0 < |(H2)|I < 1
2
does not provide strong evidence against (H1), because it is also weak evidence
against one of its rivals, namely (H2’), which is regarded as plausible by Keszler
(2000):
(H2’) The verb villámlik belongs to the group of verbs of natural phenomena which
do not have a subject argument position, while there are verbs of natural
phenomena which have a subject position that can be filled optionally.
It is easy to see that the concepts of weak, relative and strong evidence do not meet
the requirements laid down by the standard view of linguistic data and evidence.
First, evidence is interpreted by the p-model not as a special subset of data but as
a datum with a special function relative to some hypothesis of the theory. From this
it follows that evidence is not objective, immediately given, theory-independent
and completely reliable but source- and theory-dependent and reliable only to a
certain extent. Second, data which meet the criteria laid down in (D2)–(D4) in
most cases do not perfectly support or refute the given hypothesis. The connec-
tion between the datum and the hypothesis is established by plausible inferences
relying on plausible premises. Third, the function of evidence is not restricted by
the p-model to the testing of hypotheses, that is, to the justification of theories, but
data and evidence play a role in every stage of the process of linguistic theorising.
The p-model of data and evidence in linguistics
4. Conclusions
The solution to (P)(b) which we have obtained with the help of the p-model pro-
vides a radically new interpretation of linguistic theorising in general and of linguis-
tic data/evidence in particular. Its essence can be summarised as follows:
(SP)(b) Linguistic theorising is a retrospective, cyclic and prismatic plausible
argumentation process. The structure and the functions of linguistic data
and evidence emerge from, and are bound to, this argumentation process:
(i) The structure of linguistic data/evidence consists of a statement
(information content) and a plausibility value (relative strength of
acceptability) stemming from a direct source (linguistic intuition,
corpus, conjecture, experiment etc.). A datum is evidence for a
hypothesis if it is a premise of a plausible inference which makes this
hypothesis plausible.
(ii) The primary function of data is to supply the theory with plausibility
values. That is, data receive their initial plausibility value directly from
reliable sources. All other hypotheses obtain their plausibility value
indirectly (with the help of plausible inferences) from the data. The
function of evidence is that it contributes to the evaluation and the
comparison of the plausibility of rival hypotheses; thus, it is conducive
to the resolution of p-problems.
As we have seen, the peculiarities of the argumentation process can account for,
among others, the uncertainty, the variety and the combinability of linguistic data/
evidence as well as for the emergence and (provisional) existence of inconsisten-
cies triggered by data – that is, for those characteristics of data that could not be
explained with the help of other metatheoretical frameworks.
On the way to (SP)(b), our most important insights were as follows:
a. Data, evidence and the hypotheses supported by the latter are substantially
rooted in uncertainty.
b. Instead of linear progress, linguistic theory formation is built on the cyclic,
prismatic and retrospective re-evaluation of previously assumed data and
hypotheses.
c. There are several kinds of inconsistency, and not every kind is disastrous.
Among certain circumstances inconsistency may be tolerated, while the pro-
cess of plausible argumentation also governs the resolution of inconsistencies
as well as their temporary or long-term tolerance.
d. The pluralism of linguistic theories is fruitful because it allows linguists to
examine linguistic phenomena from different perspectives and must not give
way to the absolutistic defence of particular theories.
András Kertész & Csilla Rákosi
e. There are no final solutions to linguistic problems; any solution can be pris-
matically and cyclically re-evaluated during later argumentation cycles.8
f. The argumentation structure of linguistic theories largely determines the effi-
ciency of linguistic research.
g. There is a complex relationship between the argumentation structure of lin-
guistic theories and linguistic data/evidence.
h. Data are not objective facts but plausible statements whose plausibility may
increase or decrease (or even disappear) dynamically during the argumenta-
tion process.
i. The source of linguistic data is not the objective observation of manifesta-
tions of linguistic behaviour. Data usually obtain their plausibility value from
a complex source which is a combination of the linguist’s linguistic intuition,
his/her analytic skills, and some other source such as corpora, psycholinguis-
tic experiments, judgements of native speakers, etc.
(SP)(b) does not provide final, all-embracing and infallible answers to questions
concerning the nature of linguistic data and evidence. Rather, it should be regarded
as a starting point which might open a new stage in the debate on linguistic data
and evidence by putting forward a novel and systematic metatheoretical model of
linguistic theorising.
Acknowledgements
Work on this chapter was supported by the MTA-DE Research Group for
Theoretical Linguistics, the project OTKA K 77823 as well as the TÁMOP-
4.2.2/B-10/1-2010-0024 project, which is co-financed by the European Union and
the European Social Fund. Csilla Rákosi’s work was also supported by the János
Bolyai Research Scholarship of the Hungarian Academy of Sciences.
References
Anand, Pranav & Nevins, Andrew. 2004. Shifty operators in changing contexts. In Proceedings
of the 14th Conference on Semantics and Linguistic Theory, Kazuha Watanabe & Robert B.
Young (eds), 20–37. Ithaca NY: CLC Publications.
. A problem may, of course, have a clear-cut final solution under well-defined conditions,
within a given theoretical framework. Despite this, we must not forget that theoretical frame-
works may be revised, too, and give way to new, better (elaborated) ones.
The p-model of data and evidence in linguistics
Baltin, Mark R. 1987. Degree Complements. In Discontinuous Constituency [Syntax and seman-
tics 20], Geoffrey J. Huck & Almerindo E. Ojeda, (eds), 11–26. Orlando FL: Academic
Press.
Bibok, Károly. This volume. The plausibility of approaches to syntactic alternation of Hungarian
verbs.
Colon, Germà. 1978a. [1959]. El perfet perifràstic català “va + infinitiu”. (The Catalan periphras-
tic perfect “va + infinitive”). In La llengua catalana en els seus textos (The Catalan language
of texts),Vol.2, Germà Colon (ed.), 119–130. Barcelona: Curial.
Colon, Germà. 1978b[1976]. Sobre el perfet perifràstic ‘vado + infinitiu’, en català, en proven-
çal i en francès (On the periphrastic perfect ‘vado + infinitive’ in Catalan, Provençal and,
French). In La llengua catalana en els seus textos (The Catalan language of texts),Vol. 2,
Germà Colon (ed.), 131–174. Barcelona: Curial.
Detges, Ulrich. 2004. How cognitive is grammaticalization? The history of the Catalan perfet
perifràstic. In Up and Down the Cline – The Nature of Grammaticalization [Typological
Studies in Language 59], Olga Fischer, Muriel Norde & Harry Perridon (eds), 211–227.
Amsterdam: John Benjamins.
Gougenheim, Georges. 1971. [1929]. Étude sur les périphrases verbales de la langue française.
Paris: Nizet.
Juge, Matthew L. 2006. Morphological factors in the grammaticalization of the Catalan “go”
past. Diachronica 23(2): 313–339.
Kager, René. 1999. Optimality Theory. Cambrige: CUP.
Kaplan, David. 1989. Demonstratives. In Themes from Kaplan, Joseph Almog, John Perry &
Howard Wettstein (eds), 481–563. Oxford: OUP.
Kertész, András. 1993. Heuristik der deutschen Phonologie. Eine elementare Einführung in Strat-
egien der Problemlösung. Budapest: Akadémiai Kiadó.
Kertész, András. 2004a. Philosophie der Linguistik. Studien zur naturalisierten Wissenschafts-
theorie. Tübingen: Narr.
Kertész, András. 2004b. Cognitive Semantics and Scientific Knowledge. Case studies in the Cogni-
tive Science of Science [Converging Evidence in Languag and Communication Research 4].
Amsterdam: John Benjamins.
Kertész, András & Rákosi, Csilla. 2006. Inconsistency and plausible reasoning in an analysis
of German affricates. A case study in the philosophy of linguistics. Language Sciences 28:
386–423.
Kertész, András & Rákosi, Csilla. 2009. Cyclic vs. circular argumentation in the conceptual
metaphor theory. Cognitive Linguistics 20(4): 703–732.
Kertész, András & Rákosi, Csilla. 2012. Data and Evidence in Linguistics. A Plausible Argumenta-
tion Model. Cambridge: CUP.
Kertész, András & Rákosi, Csilla. 2013. Paraconsistency and plausible argumentation in genera-
tive grammar: A case study. Journal of Logic, Language and Information 22(2): 195–230.
Kertész, András & Rákosi, Csilla. This volume. Thought experiments and real experiments as
converging data sources in pragmatics.
Keszler, Borbála (ed.). 2000. Magyar grammatika (Hungarian grammar). Budapest: Nemzeti
Tankönyvkiadó.
Komlósy, András. 1994. Complements and adjuncts. In Syntax and Semantics 27, Ferenc
Kiefer & Katalin É. Kiss (eds), 91–178. New York NY: Academic Press.
Nagy C., Katalin. This volume. Methods and argumentation in historical linguistics: A case study.
Németh T., Enikő. This volume. Hungarian verbs of natural phenomena with explicit and
implicit subject arguments: their use and occurrence in the light of data.
András Kertész & Csilla Rákosi
Pinker, Steven. 1989. Learnability and Cognition: The Acquisition of Argument Structure.
Cambridge MA: The MIT Press.
Polya, George. 1948. How to Solve It. Princeton NJ: Princeton University Press.
Polya, George. 1954. Patterns of Plausible Inference. London: OUP.
Rákosi, Csilla. This volume. Data and the resolution of inconsistency in Optimality Theory.
Rescher, Nicholas. 1973. The Coherence Theory of Truth. Oxford: Clarendon Press.
Rescher, Nicholas. 1976. Plausible Reasoning. Assen: Van Gorcum.
Rescher, Nicholas. 1977. Methodological Pragmatism. Oxford: Blackwell.
Rescher, Nicholas. 1979. Cognitive Systematization. Oxford: Blackwell.
Rescher, Nicholas. 1987. How serious a fallacy is inconsistency? Argumentation 1: 303–316.
Rescher, Nicholas & Brandom, Robert. 1980. The Logic of Inconsistency. Oxford: Blackwell.
Sárik, Pál. 1998. A határozói igenevek néhány problémája (Some problems of adverbial parti-
ciples). Magyar Nyelv 94: 423–435.
Schlenker, Philippe. 2003. A plea for monsters. Linguistics and Philosophy 26: 29–120.
von Stechow, Arnim. 2003. Feature deletion under semantic binding: Tense, person, and mood
under verbal quantifiers. In NELS 33: Proceedings of the Thirty-Third Annual Meeting of
the North East Linguistic Society, GLSA, Makoto Kadowaki & Shigeto Kawahara (eds),
379–404. Amherst MA: University of Massachusetts.
Tóth, Ildikó. 2001. Impersonal constructions and null expletives. In Argument structure in
Hungarian, Istvan Kenesei (ed.), 51–78. Budapest: Akadémiai Kiadó.
Vecsey, Zoltán. This volume. A plausibility-based model of shifted indexicals.
part ii
Object-theoretical applications
chapter 3
Károly Bibok
University of Szeged, Dept. of Russian Philology, MTA-DE Research Group
for Theoretical Linguistics
1. Introduction
1.1 Background
By syntactic alternation it is generally meant that a verb occurring in a type of syn-
tactic argument structure can be used in another one as well. As to the meanings
of these syntactic structures, they are different, but in a predictable way. Consider
the following Hungarian examples and verbs, which show syntactic alternations of
some kind, also known in several other languages.
(A) Locative alternation, i.e. change of place alternates with change of state. E.g.:
Károly Bibok
In a broader sense of syntactic alternation, the words occurring with different gov-
ernment patterns are not necessarily of the same form but it is sufficient if they are
connected to each other word-formationally. In (2) one can see various prefixed
verbs derived from ken ‘smear’. They themselves alternate similarly to the base verb.
(2) a. Az anya-∅ rá-ken-i a
the mother-nom onto-smear-prs.def.3sg the
zsír-t a kenyér-re.
fat-acc the bread.sub
‘The mother is smearing fat on the bread.’
b. Az anya-∅ meg-ken-i a kenyer-et zsír-ral.
the mother-nom pfv-smear-prs.def.3sg the bread-acc fat-ins
‘The mother will have smeared the bread with fat.’
The verbs which behave syntactically in the same way as ken ‘smear’ include the
following:
fest ‘paint’, fröcsköl ‘splash’, hint ‘dust; sprinkle’, locsol ‘water; sprinkle’, mázol
(3)
‘paint’, öntöz ‘water; sprinkle’, permetez ‘spray; sprinkle’, spriccel ‘spray’, tölt
‘fill’, töm ‘cram’, etc.
(B) The verbs of manner of motion can take a directional phrase denoting directed
(proceeding) motion in a particular manner. E.g.:
(4) a. A gyerek-∅ úsz-ik.
the child-nom swim-prs.indf.3sg
‘The child is swimming.’
b. A gyerek-∅ a barlang-ba úsz-ik.
the child-nom the cave-ill swim-prs.indf.3sg
‘The child is swimming into the cave.’
. The glosses are not intended to capture all morphological properties of Hungarian words
but indicate the necessary ones for the present purposes. The abbreviations used in the glosses
throughout this chapter are the following: 3sg = third person singular, acc = accusative, def
= definite (conjugation), ill = illative, indf = indefinite (conjugation), ins = instrumental,
nom = nominative, pfv = perfective (prefix), prs = present (tense) and sub = sublative.
The plausibility of approaches to syntactic alternation of Hungarian verbs
. Several verbs of (5), such as dülöngél ‘reel’, forog ‘spin’, fut ‘run’, himbálózik ‘swing’,
imbolyog ‘totter’, lebeg ‘float’, pattog ‘bounce (several times)’, pörög ‘spin’, ring ‘swing’, szökdécsel
‘skip’, szökdel ‘skip’, táncol ‘dance’, ugrál ‘jump (several times)’, ugrik ‘jump’, denote a manner
of motion which is not only displacement but also change of position. The expression (egy)
helyben ‘in (a) place’, added to these verbs, may unequivocally indicate change of position.
. Mutatis mutandis, it also concerns the character of examples marked below with an
asterisk as not well-formed. Consequently, the acceptability of linguistic expressions is not
conceived of as an issue treatable by a dual “yes–no” system in any case.
. Throughout the present chapter, (im)plausibility should be understood with the following
proviso: in an extreme case true/false with certainty.
Károly Bibok
acoustic or visual signs in various types of corpora, but plausible statements which
concern their occurrence and/or characteristics and whose plausibility stems
from direct sources. After all, it is not only a plausible statement about the well-
formedness of linguistic examples but also every hypothesis about their syntac-
tic or semantic structures and about their classification as one linguistic category
which is considered data if the plausibility value is assigned to it immediately on
the basis of the reliability of its source as, e.g., in the case of the existence of alter-
nating verb classes.
It is also worth noticing that if some plausible statements concerning (1)–(5)
became implausible, and even if a statement concerning an entire class of verbs
became implausible on the basis of one type of sources,5 it would not be an implau-
sible conclusion at all that the two above-mentioned syntactically alternating verb
classes exist and, consequently, the linguistic phenomenon of syntactic alternation
exists. Although, in the case of recognition of implausibility the datum concerning
the given class would become less plausible, or one should regard that source as
less reliable (or unreliable in some respects).
. For instance, certain meanings are absent from dictionaries (see below).
The plausibility of approaches to syntactic alternation of Hungarian verbs
found as s eparate meanings in both the multivolume and the concise explana-
tory dictionaries of Hungarian (Bárczi & Országh 1959–1962; Pusztai 2003), the
directed motion sense of the verb úszik ‘swim’ is not listed in them but one can
infer it from an example: partra úszik ‘swim to the seaside/riverside’. Neverthe-
less, recent trends in theoretical linguistics seem to believe that an enumerative
conception of the lexicon which simply fixes various meanings of a word in a
lexical entry is insufficient. They all attempt to elaborate their own version which
exceeds the traditional lexicographic treatment of polysemy induced by syntac-
tic alternation.
(6) If there is a verb with the semantic representation ‘X causes Y to move into/
onto Z’, then it can be converted into a verb with the semantic r epresentation
‘X causes Z to change state by means of moving Y into/onto it’.
One can get (6) as an inductive generalisation on the basis of the syntactic behav-
iour and the semantic structure of the verbs in (3). Since it is an induction, it can-
not be considered a truth. At the same time, (6) seems to be a plausible statement.
It is also a plausible assumption that
ken (X Y-t Z-re) ‘smear (X, Y on Z)’ contains the semantic representation ‘X
(7)
causes Y to move onto Z’.
This conclusion is made by an inference scheme similar to the modus ponens well-
known in logic:
(9) If A, then B.
A
B
However, one has to remember that our premises are only plausible statements
and, therefore, the conclusion (8) can only be a plausible statement as well. Thus,
Károly Bibok
The inference in (10) explains the syntactic alternation of another verb, i.e. ken
‘smear’, not mentioned among the verbs in (3), on the basis of which the lexical
rule in (6) was hypothesised.
In connection with (10) two remarks are in order. First, the rule should not be
fixed in the direction indicated in the first premise of (10) but it should proceed in
the other direction as well: from the with-variant into the locative variant (Pinker
1989: 80). Second, the with-variant in Hungarian can mean that Z is not totally
affected by the activity as in the conclusion of (10) (cf. also (1)(b)) or is totally
affected as in (2)(b).
Plausible inferences such as (10) are put forward and evaluated against a
background which Kertész and Rákosi term the p-context, which includes, among
others, “a set of sources in terms of which the plausibility value of statements can
be judged” (Kertész & Rákosi, this volume: Section 2.4, see also Kertész & Rákosi
2012: 122). Let us suppose that our initial p-context is extended by a new, reliable
source containing the following example:
(11) Az anya-∅ viz-et önt-∅ a virág-ra.
the mother-nom water-acc pour-prs.indf.3sg the flower-sub
‘The mother is pouring water onto the flower.’
Thus, extending the p-context, we gain a new datum, i.e. a plausible statement
about the occurrence of önt ‘pour’ in a source. Furthermore, it is also a plausible
. Instead of the verbal indicator for plausibility of statements it is plausible that…, Kertész
and Rákosi (this volume: Section 2.2, Kertész & Rákosi 2012: 69f.) use the following notation
presented in the case of the first premise of (10):
(i) 0 < |If a verb contains in its semantic representation ‘X causes Y to move into/onto
Z’ (= A), then the rule converts it into a verb with the semantic representation
‘X causes Z to change state by means of moving Y into/onto it’ (= B).|S < 1
The plausibility of approaches to syntactic alternation of Hungarian verbs
hypothesis on the basis of this source that önt (X Y-t Z-re) ‘pour (X, Y onto Z)’ has
the semantic representation ‘X causes Y to move onto Z’. Let us apply our plausible
modus ponens scheme once again:
(12) It is plausible that if a verb contains in its semantic representation ‘X causes
Y to move into/onto Z’ (= A), then the rule converts it into a verb with the
semantic representation ‘X causes Z to change state by means of moving Y
into/onto it’ (= B).
It is plausible that the verb önt (X Y-t Z-re) ‘pour (X, Y onto Z)’ is a verb
with the semantic representation ‘X causes Y to move onto Z’ (= A).
It is plausible that there is a verb önt (X Z-t Y-nal) ‘pour (X, Z with Y)’ with
the corresponding semantic representation ‘X causes Z to change state by
means of moving Y onto it’ (= B).
At the same time, according to a more extended p-context containing (13) it is also
plausible that there is no such verb.7 Cf.:
(13) *Az anya-∅ víz-zel önt-i a virág-ot.
the mother-nom water-ins pour-prs.def.3sg the flower-acc
‘The mother is pouring the flower with water.’
Furthermore, if the rule also derives a verb in the other direction, as mentioned
above, then the verb fed ‘cover’ should alternate. However, the contrary seems to
be the case. Consider (14).
(14) a. *Az anya-∅ csokoládémáz-at
the mother-nom chocolate.coating-acc
fed-∅ a sütemény-re.
cover-prs.indf.3sg the cookie-sub
‘The mother is covering chocolate coating onto the cookie.’
b. Az anya-∅ csokoládémáz-zal
the mother-nom chocolate.coating-ins
fed-i a sütemény-t.
cover-prs.def.3sg the cookie-acc
‘The mother is covering the cookie with chocolate coating.’
. How negative evidence can be detected is an issue which deserves further research. It has
already been proposed that corpus-linguistic methods can provide access to negative evidence
and corpus linguists do not have to have recourse to acceptability judgments (Stefanowitsch
2008).
Károly Bibok
another (Kertész & Rákosi this volume: Section 2.5, 2012: 130).8 This inconsis-
tency calls for the modification of the p-context which “means that one has to
elaborate the p-context that can be regarded as the re-evaluated version of the
starting p-context” (Kertész & Rákosi this volume: Subsection 2.6.1, cf. Kertész &
Rákosi 2012: 138). Since not only a set of sources belongs to a p-context but also
statements together with their relevant characteristics, we can gain a modified
p-context if the plausibility value of the lexical rule in (6) is revised. The new lexi-
cal rule should be formulated with respect to a narrow semantic class to whose
members – and only to those members – the given rule can be applied. In other
words, (6) is the necessary condition of the locative alternation; its sufficient con-
dition, however, seems to be that the verb at stake belongs to a narrow seman-
tic class. Pinker (1989: 126f.) lists six classes of English locative alternation verbs.
They are given in (15), which specially indicates where the locative alternation
cannot be attested in Hungarian.
Smear-class: ‘Simultaneous forceful contact and motion of a mass
(15) a.
against a surface.’
Pile-class: ‘Vertical arrangement on a horizontal surface.’ In Hungarian
b.
there is no alternation, unlike English: téglákat halmozott a székre –
*téglákkal halmozta a széket vs. He heaped bricks on the stool – He
heaped the stool with bricks.
Splash-class: ‘Force is imparted to a mass, causing ballistic motion in a
c.
specified spatial distribution along a trajectory.’
Scatter-class: ‘Mass is caused to move in a widespread or non-directed
d.
distribution.’ In Hungarian there is no alternation, unlike English:
magot szórt a földbe – *maggal szórta a földet vs. He scattered seeds onto
the field – He scattered the field with seeds.
Cram-class: ‘A mass is forced into a container against the limits of its
e.
capacity.’
Load-class: ‘A mass of a size, shape, or a type defined by the intended
f.
use of a container is put into the container, enabling it to accomplish its
function.’
Three remarks concerning (15) are in order. First, whereas in Hungarian there is
no alternation like (15)(d) a morphologically derived verb beszór ‘scatter’, contain-
ing the prefix be ‘in’, alternates syntactically in a broad sense, cf.: beszórja maggal a
földet ‘scatter the field with seeds’. Second, Pinker himself (1989: 126) admits that
. In addition to önt ‘pour’ and fed ‘cover’, there are other verbs that may be characterised
with the semantic representation at issue but do not alternate. See (15)(b) and (15)(d) below.
The plausibility of approaches to syntactic alternation of Hungarian verbs
the with-variant of scatter in (15)(d) could be marginal for some English speak-
ers. Goldberg (1995: 239, Footnote 15) belongs to the latter group because, relying
on a certain source (presumably on her intuition) she doubts the existence of the
alternating scatter-class. However, according to The Collins Cobuild English Dic-
tionary for Advanced Learners (32001), the verbs sow and strew occur with both
syntactic patterns under consideration. Third, the full list of English locative alter-
nation verbs also includes two one-member sets: string (the Hungarian equivalent
does not alternate) and wrap (Pinker 1989: 127).
To explain that the verb ken ‘smear’ alternates syntactically as in (1) one needs
now to supplement the first premise of the plausible inference in (10) with the
indication of the semantic structure characteristic of the smear-class ken ‘smear’
belongs to. Furthermore, an account of the non-alternating behaviour of önt ‘pour’
and fed ‘cover’ simply means that their meanings do not correspond to any of the
semantic classes in (15). Rather these Hungarian verbs occur in non-alternating
verb classes similar to those established by Pinker (1989: 126f.) for English verbs,
namely, in the dribble-class (‘a mass is enabled to move via the force of gravity’)
and in the inundate-class (‘a layer completely covers a surface’), respectively. At
the same time, the p-context modified by statements about alternating and non-
alternating narrow semantic classes seems to be inconsistent with its further
extension.9 Consider (16).
(16) a. Az apa-∅ (véletlenül/szándékosan) kávé-t
the father-nom (accidentally/intentionally) coffee-acc
löttyent-∅ az asztalterítő-re.
spill-prs.indf.3sg the tablecloth-sub
‘The father (accidentally/intentionally) spills coffee on the tablecloth.’
b. *Az apa-∅ (véletlenül/szándékosan) kávé-val
the father-nom (accidentally/intentionally) coffee-ins
löttyent-i az asztalterítő-t.
spill-prs.def.3sg the tablecloth-acc
‘The father (accidentally/intentionally) spills the tablecloth with coffee.’
As (16) seems to indicate, löttyent ‘spill’ does not alternate. A plausible account
can consist in a statement that the meaning of this verb – like that of önt ‘pour’ – is
fairly similar to the semantic structure of the dribble-class (‘a mass is enabled to
move via the force of gravity’). However, one can realise that löttyent ‘spill’ means
. Pinker (1989: 395f., Footnote 12) assumes that the verb sprinkle behaves as an alternating
verb on the basis of its phonological similarity to others. As to ditransitivisable verbs, Pinker
(1989: 118ff.) emphasises the role of morphological constraints.
Károly Bibok
more than motion caused by gravity since a different force causes ballistic motion
of a mass. Thus, löttyent ‘spill’ could enter the alternating splash-class in (15)(c), cf.
‘force is imparted to a mass, causing ballistic motion in a specified spatial distribu-
tion along a trajectory’. It could be an objection that motion does not come about
in a sufficiently specified manner. This objection is eliminated by a (plausibly)
well-formed example containing the verb löttyent ‘spill’ with the prefix le ‘down’,
which does not effect how the mass moves.
(17) Az apa-∅ (véletlenül/szándékosan)
the father-nom (accidentally/intentionally)
le-löttyent-i kávé-val az asztalterítő-t.
down-spill-prs.def.3sg coffee-ins the tablecloth-acc
lit. ‘The father (accidentally/intentionally) spills down the tablecloth
with coffee.’
After all, putting a verb in a narrow semantic class does not correlate with the
alternating or non-alternating syntactic behaviour of this verb alone and with a
prefix not affecting the character of the movement.
To recapitulate where we have arrived with the help of lexical rules applied
to narrow semantic classes of verbs, we should say that even lexical idiosyncrasy
plays some role, in addition to lexical rules made more precise. Such a double char-
acter is acknowledged, at least implicitly, by the theory itself (see narrow seman-
tic classes vs. one-word classes and the factor of phonological similarity) and is
highlighted by our present investigation. It is also clear that in spite of extending
the p-context by new sources and statements, the methodological stance of the
treatment of syntactic alternations has not been changed. Overall, the meaning
brought about by a meaning shift is grasped as a lexical phenomenon but not as a
meaning occurring in a particular syntactic structure. Using a different methodol-
ogy in the starting, or initial, p-context, a constructional conception approaches
meaning shifts and syntactic alternations from the latter point of view.
The three participant roles of slather are compatible with the argument roles of
both the caused-motion construction and the causative-plus-with-adjunct con-
struction. The first construction has three argument roles: a cause, a theme and a
goal (directional). The two kinds of role sets can be fused with each other because
the slatherer is semantically construable as a cause, thick-mass as a theme – since
it undergoes a change of location –, and the target as a directional. Cf.:
(19) Sam slathered shaving cream onto his face.
In the case of the second construction, fusing the slatherer with the first argument
is the same as above. The target can be construed not only as a directional, but also
as a patient, in that the entity on which the substance is slathered is affected. Since
the third participant role of slather requires that it be expressed, a with-phrase
emerges even though in the framework of Construction Grammar it is an adjunct
of the corresponding construction. Cf.:
(20) Sam slathered his face with shaving cream.
. It should be worth noting that in a wider p-context one cannot avoid the question con-
cerning the motivation of the occurrence of a directional in the intransitive motion construc-
tion. Consider (i).
While in (22)(a) with úszik ‘swim’ there is a motion component which can motivate the
directional role in (22)(b), in (i)(a) one cannot refer to any motion which would make the
directional role plausible in (i)(b).
The plausibility of approaches to syntactic alternation of Hungarian verbs
. For a substantially more detailed analysis of the lexical and constructional approaches to
syntactic alternations, see Bibok (2010).
The plausibility of approaches to syntactic alternation of Hungarian verbs
Starting from the lexicon, containing the verb ken ‘smear’ with the underspecified
representation in (26), the constructional meanings of the given verb correspond
to the two possible interpretations of (26) alternating with each other. When a
mass is focused, or profiled, the constructional meaning is equal to the part of
(26) which is before the conjunction and, i.e. ‘with smoothing movements of the
hand, X causes a mass Y to move onto a surface Z’. This constructional meaning is
expressed in (1)(a), repeated here as (27).
(27) Az anya-∅ zsír-t ken-∅ a kenyér-re.
the mother-nom fat-acc smear-prs.indf.3sg the bread-sub
‘The mother is smearing fat on the bread.’
. My lexical-constructional approach has some common features with Iwata’s (2002, 2005)
but also differs from it in the details of the characteristics of both lexical and constructional
meanings. A comparison between them would deserve a separate paper.
. An attentive reader who may have noticed that the component of moving is missing here
from (8) is referred to Bibok (2008), where he/she finds a motivation to assume that such a
type of meaning be represented without a motion element.
Károly Bibok
In the opposite case, when a surface comes into prominence, the constructional
meaning is ‘X causes a surface Z to be covered partially or totally with a mass Y’, i.e.
the part of (26) figuring after the conjunction and (if one leaves aside the manner
specification ‘with smoothing movements of the hand’). It is (1)(b), repeated here
as (28), that has this second interpretation.
(28) Az anya-∅ zsír-ral ken-i a kenyer-et.
the mother-nom fat-ins smear-prs.def.3sg the bread-acc
‘The mother is smearing the bread with fat.’
Consequently, the verb ken ‘smear’ can alternate syntactically because its under-
specified meaning in (26) provides access to two constructional meanings
expressed by the corresponding syntactic structures.14
Contrary, the verbs önt ‘pour’ in (11) and (13), i.e. Az anya vizet önt a virágra
‘The mother is pouring water onto the flower’ and *Az anya vízzel önti a virágot
‘The mother is pouring the flower with water’, as well as fed ‘cover’ in (14), i.e. *Az
anya csokoládémázat fed a süteményre ‘The mother is covering chocolate coating
onto the cookie’ − Az anya csokoládémázzal fedi a süteményt ‘The mother is cover-
ing the cookie with chocolate coating’, do not occur in syntactic structures of two
alternating types because their meaning is not underspecified in a way that would
allow two different interpretations. The verb önt ‘pour’ has a more specific mean-
ing representation which only contains (29) and does not contain (30):
(29) ‘in a given manner, X causes a mass Y to move onto a surface Z/into a
container Z’;
(30) ‘in a given manner, X causes a surface Z to be covered/a container Z to be
filled (in) partially or totally with a mass Y’.
Also, the meaning representation of fed ‘cover’ is more specific but in the other
way. This verb simply means that
(31) ‘in a given manner, X causes a surface Z to be covered partially or totally
with a mass Y’
Now, we can make the following plausible hypotheses concerning the locative alter-
nation in general. First, in accordance with principles of the lexical-constructional
approach, the locative alternation is relevant only for those verbs whose meaning
. It is precisely because of the lexically unfixed character of (26) that one may speak of
underspecification. For other forms of underspecification in meaning representations, see
Bibok (2004).
The plausibility of approaches to syntactic alternation of Hungarian verbs
representations are underspecified in a similar way as (26) is. In other words, both
constructional meanings are generally available if all locatively alternating verbs
are characterised with a meaning scheme (template) underspecified with regard
to their parts before or after the conjunction and. Thereby each of them can be
interpreted in two ways and expressed syntactically in two ways. Second, the verbs
which do not own such underspecified representations cannot occur in syntactic
structures alternating the locative variant with the with-variant. This entails that
the verbs not occurring in locative alternation do not have to be considered excep-
tions. The meaning of a verb itself determines – like narrow semantic classes but
more precisely – whether it may participate in the syntactic alternation at stake.
At this point of the present chapter we can explicitly show which data are
regarded as evidence for our lexical-constructional account and against its rivals,
i.e. lexical and constructional theories. Kertész and Rákosi (this volume: Sec-
tion 3.2, 2012: Section 13.3) generally view evidence as a datum whose function is
to contribute to the judgment and comparison of the plausibility of rival hypoth-
eses. They distinguish between three types of evidence: weak, relative and strong
evidence. Recall the examples in (16), given again here as (33).
(33) a. Az apa-∅ (véletlenül/szándékosan) kávé-t
the father-nom (accidentally/intentionally) coffee-acc
löttyent-∅ az asztalterítő-re.
spill-prs.indf.3sg the tablecloth-sub
‘The father (accidentally/intentionally) spills coffee on the tablecloth.’
b. *Az apa-∅ (véletlenül/szándékosan) kávé-val
the father-nom (accidentally/intentionally) coffee-ins
löttyent-i az asztalterítő-t.
spill-prs.def.3sg the tablecloth-acc
‘The father (accidentally/intentionally) spills the tablecloth with coffee.’
The plausible statement about the non-existence of the verb löttyent ‘spill’ with a
with-variant and the corresponding semantic structure, which was made in the
above discussion (cf. also (17)), should be considered weak evidence against the
hypothesis about the existence of a narrow semantic class, namely, the splash-
class in (15)(c) and, naturally, weak evidence for one of the rivals of that hypoth-
esis, i.e. for non-existence of that semantic class in the sense of Kertész & Rákosi
(this volume: Section 3.2, 2012: 178), provided – as said above – that the semantic
class includes verbs − and only those verbs − which undergo the locative alter-
nation. In the terms of Kertész & Rákosi (this volume: Section 3.2, 2012: 181)
such a plausible statement is strong evidence against a hypothesis, in the pres-
ent case: against the hypothesis regarding the existence of (15)(c). Moreover,
it is easy to see that the plausible statement at stake is relative evidence against
the hypothesis about the classification of relevant verbs into n arrow semantic
Károly Bibok
classes (cf. Kertész & Rákosi this volume: Section 3.2, 2012: 180). This is the case
because the plausible inferences connecting the premises and the negation of
such a hypothesis provide a higher plausibility value than the plausibility value
of the original hypothesis. Namely, the lexical-constructional account without
narrow semantic classes makes a more precise prediction concerning the set
of Hungarian verbs taking part in locative alternation while it does not have to
allow lexical idiosyncrasies which result in exceptions either (see our general
hypotheses concerning the locative alternation after (32)). At the same time,
the plausible statement about the non-existence of the verb löttyent ‘spill’ with
a with-variant and the corresponding semantic structure does not exclude that
the rivals of the lexical-constructional theory give an explanation with the help
of exceptions, i.e. the classification into narrow semantic classes can proceed
with some residue (maybe, in various languages with different lexical residue).
However, such a hypothesis loses against the lexical-constructional approach
from the point of view of plausibility.
Since the lexical-constructional approach rests on an idea which is not widely
accepted in regard to the form of the lexical-semantic representation − namely
that of underspecification −, it is important to gain some evidence to support it.
With this aim in mind, take into consideration the above plausible statements
about (11), (13) and (14), given here as (34)(a)–(b) and (35).
What is crucial from the point of view of evidence for underspecificity is the double
direction of the non-occurrence of locative alternation (cf. (34)(b) vs. (35)(a)). The
possible lack of either one of both variants in alternation indicates that both scenes
The plausibility of approaches to syntactic alternation of Hungarian verbs
underlying the two constructional meanings seem to be necessary for a verb to alter-
nate syntactically. These scenes together can be captured at an underspecified level of
meaning representation (see (26) above). Nevertheless, once again we can only speak
about relative evidence because there are other ways to account for locative alterna-
tion even if they have to allow exceptions but thereby they have less plausibility.
Acknowledgements
Work on the present chapter was supported by the MTA-DE Research Group for
Theoretical Linguistics and the TÁMOP-4.2.2/B-10/1-2010-0024 project, which is
co-financed by the European Union and the European Social Fund.
Károly Bibok
References
Bárczi, Géza & Országh, László (eds). 1959–1962. A magyar nyelv értelmező szótára (A Defining
Dictionary of the Hungarian Language) I–VII. Budapest: Akadémiai Kiadó.
Bibok, Károly. 2004. Word meaning and lexical pragmatics. Acta Linguistica Hungarica 51:
265–308.
Bibok, Károly. 2008. Az igék szemantikája és a szintaktikai alternáció (The semantics of verbs
and syntactic alternation). In Strukturális magyar nyelvtan 4: A szótár szerkezete (A Struc-
tural Grammar of Hungarian, vol. 4: The Structure of the Lexicon), Ferenc Kiefer (ed.),
23–70. Budapest: Akadémiai Kiadó.
Bibok, Károly. 2010. From syntactic alternations to lexical pragmatics. In The Role of Data at the
Semantics–Pragmatics Interface, Enikő Németh T. & Károly Bibok (eds), 261–304. Berlin:
Mouton de Gruyter.
Bierwisch, Manfred. 1997. Lexical information from a minimalist point of view. In The Role
of Economy Principles in Linguistic Theory [Studia Grammatica 40], Chris Wilder, Hans-
Martin Gärtner & Manfred Bierwisch (eds), 227–266. Berlin: Akademie Verlag.
The Collins Cobuild English Dictionary for Advanced Learners, 3rd edn. 2001. Glasgow: Harper
Collins.
Goldberg, Adele E. 1995. Constructions: A Construction Grammar Approach to Argument Struc-
ture. Chicago IL: University Press of Chicago.
Goldberg, Adele E. 2006. Constructions at Work: The Nature of Generalization in Language.
Oxford: OUP.
Iwata, Seizi. 2002. Does MANNER count or not? Manner-of-motion verbs revisited. Linguistics
40: 61–110.
Iwata, Seizi. 2005. Locative alternation and two levels of meaning. Cognitive Linguistics 16:
355–407.
Jaszczolt, Katarzyna M. 2012. Semantics and pragmatics: The boundary issue. In Semantics: An
International Handbook of Natural Language Meaning, Vol. 3, Claudia Maienborn, Klaus
von Heusinger & Paul Portner (eds), 2333–2360. Berlin: Mouton de Gruyter.
Kertész, András & Rákosi, Csilla. 2012. Data and Evidence in Linguistics. A Plausible Argumenta-
tion Model. Cambridge: CUP.
Kertész, András & Rákosi, Csilla. This volume. The p-model of data and evidence in linguistics.
Komlósy, András. 2000. A műveltetés (Causativisation). In Strukturális magyar nyelvtan 3:
Morfológia (A Structural Grammar of Hungarian, vol. 3: Morphology), Ferenc Kiefer (ed.),
215–292. Budapest: Akadémiai Kiadó.
Pinker, Steven. 1989. Learnability and Cognition: The Acquisition of Argument Structure.
Cambridge MA: The MIT Press.
Pusztai, Ferenc (ed.). 2003. Magyar értelmező kéziszótár (A Concise Dictionary of Definitions of
Hungarian), 2nd rev. edn. Budapest: Akadémiai Kiadó.
Stefanowitsch, Anatol. 2008. Negative entrenchment: A usage-based approach to negative evi-
dence. Cognitive Linguistics 19: 513–531.
chapter 4
Katalin Nagy C.
MTA-DE Research Group for Theoretical Linguistics
1. Introduction
and include the above mentioned pieces of information in their so-called p -context
(Kertész & Rákosi, this volume b: Section 2.4, 2012: 121ff., for phases of scien-
tific inquiry see Kertész & Rákosi, this volume b: Section 2.6.1, 2012: 136ff.).
Researchers, when interpreting former literature on a certain problem, unavoid-
ably face rival claims, which they have to compare and to evaluate their reliability.
In other words, the p-context can contain statements and their negations, which
belong to competing approaches and are plausible simultaneously. In such cases
the researchers have to decide how to resolve this p-inconsistency (cf. Kertész &
Rákosi this volume b: Sections 2.5–6, 2012: Sections 10.2, 10.6). According to
Kertész & Rákosi (this volume b: Section 3.1, 2012: 169ff.), data enter the argu-
mentation process with an initial plausibility value, which is determined by the
reliability of their source. But what does the reliability of different data sources
depend on? How can researchers decide between competing hypotheses?
What counts as a reliable source depends on the linguistic field we are work-
ing in. Sources which can support our statements include, for example, the “use
of special tools, methods, procedures […] certain methodological principles”
(Kertész & Rákosi, this volume b: Section 2.2, 2012: Sections 9.1, 9.3). Linguists
usually apply the methods and argumentation techniques accepted in the lin-
guistics area in which they work automatically, without conscious reflection. It
also occurs that researchers accept previous claims made in the relevant literature
without asking about their reliability. The reputation of famous researchers also
plays a role in attributing plausibility values to their claims. In this way, the claims
of highly recognised authors are given high plausibility values when used as data
in other research. Statements often enter the argumentation process with a high
plausibility value, while it might be next to impossible to see where they come
from and whether they actually deserve such a high plausibility value.
The aim of the present chapter is to show the usefulness of taking into account
metalinguistic considerations in order to decide between competing hypotheses
in linguistic research and judge the reliability of certain claims. Using the metalin-
guistic framework presented in Kertész and Rákosi’s introductory chapter to the
present volume, some characteristics of historical linguistics argumentation are
examined through a concrete case study concerning a subquestion of the gram-
maticalisation of the Catalan construction “anar ‘go’ + infinitive” as a perfective
past. The present chapter does not aim to provide a full overview of argumentation
in historical linguistics but to illustrate the usefulness of metalinguistic consider-
ations in linguistic research.
The organisation of the chapter is as follows. After the present introduction, in
Section 2 two basic methods and argumentation techniques in historical linguis-
tics are discussed, namely, argumentation based on analogy and frequency count-
ing. In Section 3 their application is examined through a historical linguistic case
Methods and argumentation in historical linguistics
2.2 Frequency
Frequency is a central notion in historical linguistics in two respects: as a phe-
nomenon and as a method. On the one hand, several historical phenomena are
associated with the high frequency of the linguistic units concerned, and on the
other, frequency counting is a central method for investigating various historical
phenomena. Let us consider both concepts of frequency.
The definition of grammaticalisation as “the process of automatisation of
frequently occurring sequences of linguistic elements” (Bybee 2007: 969) shows
the importance of frequency also in grammaticalisation, especially in relation
to the generalisation of meaning. Some linguists consider the generalisation of
meaning as a precondition for more frequent use. Others think the opposite: it
is frequent use which facilitates the generalisation of meaning (Dér 2005: 48, cf.
Ladányi 1998: 420ff.), that is, according to this concept of frequency, it has a cogni-
tive impact. Actually, both claims seem to be maintainable. As Bybee (2005 [2003]:
602) points out, “frequency is not just a result of grammaticisation, it is also a
primary contributor to the process”. She argues that one of the main character-
istics of grammatical units is that, compared to lexical morphemes, they occur
extremely frequently. This frequency is not simply a result of grammaticalisation
but its main motive power. Examining the effects of repetition on constructions,
Bybee (2005: 10f.) finds that while low levels of repetition lead to conventionalisa-
tion only, extremely high frequency leads to the grammaticalisation of the new
construction, and that “the changes that take place in grammaticization are condi-
tioned at least in part by high frequency of use” (Bybee 2005: 11). These changes are
the following: phonological reduction, the gaining of autonomy of the new con-
struction, loss of the original meaning, and addition of a new, inferential meaning
from the context. Habituation processes cause semantic bleaching. Frequency also
has an impact on the strength of the exemplars and on category formation.
Let us consider the other concept of frequency. Frequency counting is a pow-
erful device of historical linguistics methodology, which provides quantitative
data and is applied in order to investigate the above mentioned cognitive pro-
cesses. Quantitative data should also be regarded as plausible statements of the
Methods and argumentation in historical linguistics
Table 1. Infinitives with can in Middle English (Bybee 2005 [2003]: 611f.)
1. verbs of intellectual states or activities: 52 tokens, 18 types
High token frequency: see 12, deem 6, understand 6, espy (discover) 5
Type frequency: 18 distinct verbs
2. verbs of communication: 102 tokens, 31 types
High token frequency: tell 30, say 29, devyce ‘describe’ 8
Type frequency: 31 distinct verbs
3. verbs of skills (‘know how to’): 26 tokens, 18 types
Katalin Nagy C.
Bybee compares these data with data from Old English. The comparison
shows that although these infinitives could also be used with can in Old English,
in Middle English more infinitives are documented, and also the token frequency
of some combinations in the first two groups of infinitive types increases. At the
same time, new infinitives appear with can.
To sum up, frequency analysis as a corpus method is a basic research tech-
nique in historical linguistics, since change in frequency is used to explain sev-
eral historical phenomena. However, it is worth noting that the research question
we are interested in and the theoretical framework we work in determine what
we count the frequency of. In this sense, the quantitative data obtained from fre-
quency counting is theory-dependent.
2.3 Analogy
Numerous historical changes discussed in the literature are attributed to analogi-
cal processes (Hock 1991: 167). In historical linguistics, the term ‘analogy’ tradi-
tionally covers processes whereby on the basis of a preexisting structural pattern
in a language, corresponding new patterns are introduced, or old patterns are
replaced (Jeffers & Lehiste 1979: 60, cf. also Bynon 1983). Analogical processes are
at work at every linguistic level. For instance, as is widely maintained in historical
linguistics, some changes in phonetic structure are influenced by factors which
are not phonetic in nature. One of them is analogy, whose main function is to
make phonetically and morphologically similar the form of lexical units which are
similar from a morphological, syntactic or semantic point of view. In morphology,
when a grammatical distinction is lost in some morphological classes as a result of
phonetic changes, the pattern of identity is usually extended by analogy (Jeffers &
Lehiste 1979: 71).
Hock (2005 [2003]) presents a wide range of phenomena which have been
considered analogical, namely, four-part analogy, leveling, morphophonemic
extension, blending, contamination, recomposition and folk etymology. He con-
cludes that
the entire range of analogical changes, as well as at least some aspects of semantic
change (mainly metaphorical changes) constitute points in a continuum of
changes which may be considered analogical in the larger sense, in that they
extend linguistic patterns. The differences in behaviour between these changes,
in terms of (potential) regularity or systematicity, are a consequence of the
differences in domain in which the changes can apply. (Hock 2005 [2003]: 455)
Let us consider two examples of how analogy works in language history. The cre-
ation of the medieval Spanish negation particle njn can be considered a simple
illustrative case of analogy. On the basis of direct sources it can be claimed that
Methods and argumentation in historical linguistics
the negation particles no (< non) and non (< non) ‘no’ alternate in medieval
Spanish and that the forms ni (< nec) and njn ‘neither’ show a similar alternation.
However, the final n of the form njn cannot be derived etymologically, that is, there
is no direct source which makes plausible that the form njn would be etymologi-
cal. Relying on the semantic similarity of the pairs no/non and ni/njn, it can be
supposed that the form njn was created by analogy of non, as shown in (1):
(1) ‘no’ no: non
‘neither’ ni: X = njn [nin]
The existence of the form njn in the medieval language state of Spanish has an
analogical explanation in the sense that it refers to the process of analogy at work
in its creation. Also, in the historical evolution of verbal systems a wide range of
analogical developments can be traced. Let us consider an example from Alvar
and Pottier (1987 [1983]: 270). In medieval Spanish, several variants of the per-
fective past of the verb haber ‘have’ coexisted. In addition to etymological forms,
presumably analogically created forms were also used. Table 2 shows the perfec-
tive past forms of “habere > haber”, a verb of possession which also serves as
an auxiliary. Alvar and Pottier also enumerate some forms signalled as “strange
forms” which cannot be considered etymological, but cannot have an analogi-
cal explanation either. The forms in italics have survived and are used in current
Spanish. With one exception, they can all be regarded as analogical forms, that is,
their existence can be explained with reference to analogical processes at work at
an earlier language state.
The phonetic changes which yielded the current perfective past forms of haber
can be explained by analogy. In this case it is difficult to imagine exactly how this
analogical process worked, since only one of the etymological forms contains the
vocal u (the 3Pl hubieron). In spite of this, it is the vowel of this verb stem which
extends to the other forms. The explanation may be that the speakers interpret
the vocal u as a marker of the perfective past tense, since it also marks perfective
Katalin Nagy C.
In the perfective past forms of the verbs poder ‘can’ and poner ‘put’, pude ‘I could’
and puse ‘I put’, respectively, a verb stem appears which contains the vowel u
as a marker of the perfective past tense. Relying on an associative relationship
based on meaning, the vowel u can be supposed to extend analogically to other
verbs, including saber ‘know’ and haber ‘have’, yielding the perfective past forms
supe ‘I knew’ and hube ‘I had’, respectively, and other analogical forms of the
paradigm.
2.4 Summary
The notions of frequency and analogy play a central role in historical linguistics
argumentation. Frequency-based argumentation makes use of plausible state-
ments which capture characteristics of some linguistic structures in quantita-
tive terms, while analogy-based argumentation contains references to analogical
processes which are supposed to work at a given linguistic level. Since they are
widely accepted methods and explanations in this field, statements from an argu-
mentation which relies on them can be assigned a high plausibility value. In the
following section let us examine how these notions appear in the research on a
concrete historical phenomenon and how the above considerations can help to
decide between competing hypotheses.
3.1 Th
e starting p-context: Three accounts of the morphological
development of the Catalan periphrastic perfective past
In this section let us examine some characteristics of historical linguistics argu-
mentation through a case study concerning the grammaticalisation of the periph-
rasis “anar ‘go’ + infinitive” in Catalan. Since this is a highly complicated process,
I discuss only a subquestion of this historical development.
The following hypotheses concerning the Catalan construction “anar +
infinitive” are unanimously accepted in the relevant literature. The c onstruction
Methods and argumentation in historical linguistics
The verb anar also continues to be used as a full lexical verb with the meaning ‘go’
in modern Catalan. Although the paradigms of the full lexical verb anar and of the
auxiliary anar almost coincide, they are divergent from each other in some forms
(see Table 3 in Subsection 3.1.3). The var- forms were created by analogy, modeled
on the regular preterit forms of a first conjugation verb.
It is generally acknowledged that the source of the current Catalan preterit,
the medieval “anar ‘go’ + infinitive” construction was originally a purposive con-
struction which had the sense of ‘be in motion with the aim of doing something’.
The development of the Catalan preterit “anar + infinitive” raises a wide range
of problems, of which in the present chapter I discuss only one, which is the fol-
lowing. Although the auxiliary forms of the perfective past “anar + infinitive” in
current Catalan formally seem to be in the present tense, medieval data attest an
alternation between present and preterit auxiliary forms. In other words, the ques-
tion discussed in the present chapter concerns the morphological development of
the “anar ‘go’ + infinitive” construction: the mix of present and preterit auxiliary
use in early documents and the later extension of the use of present forms and the
loss of the preterit auxiliary versions. The following Examples (4) and (5) show the
medieval alternation.
‘they met the troops of the Saracens and attacked (literally: went [in order]
to attack) them’
. The following abbreviations are used in the glosses throughout this chapter: 3 – third
person, inf – infinitive, pl – plural, prs – present tense, pret – preterit tense, and sg – singular.
. The sources of the examples are abbreviated at the end of each example, with page and/or
line numbers (full details of sources can be found at the end of the chapter).
Katalin Nagy C.
Examples like (4) and (5) show the present–past alternation in the auxiliary forms
of the construction “anar + infinitive” in medieval Catalan. Two versions of the
periphrasis under consideration are documented: first, “anar ‘go/auxiliary’ conju-
gated in perfective past + infinitive” (Example 4) and, second, “anar ‘go/auxiliary’
conjugated in present + infinitive” (Example 5). The question examined here is
which of these two medieval periphrases should be considered the antecedent of
the perfective past of modern Catalan. At the beginning of the investigation of
this question, we will summarise previous results, which belong to our starting
p-context. The three approaches I examine below provide statements which are
made plausible by some source and their negations by another, that is, they yield
a p-inconsistent p-context. In the following sections I discuss how we extend the
starting p-context with new information, re-evaluate the plausibility values of
contradicting statements comparing the data use and argumentation in the three
different accounts, and finally, modify the p-context in order to eliminate the
p-inconsistency. Let us start with a summary of three accounts of the morphologi-
cal development of the “anar ‘go’ + infinitive” construction, namely, those of Colon
(1978a, b), Detges (2004) and Juge (2006).
understandable that its auxiliary was conjugated in the historical present instead
of a preterit, because its role is the same and both “unite their virtues”.
In order to illustrate the claim that the construction “anar + infinitive” serves
to animate the description, Colon (1978a: 127f.) mentions an interesting example
which shows the auxiliary in present tense. This example of “anar + infinitive”
appears in a general description in the present tense in a Catalan text, which has
a parallel text in Provençal. In the corresponding Provençal text a simple present
appears when Catalan uses the GO-periphrasis with a present tense auxiliary. Colon
attributes this use to the function of animating the narration. This occurrence does
not correspond to the formally similar occurrences which show the periphrasis with
a present tense auxiliary in a preterit context, because it appears in a general descrip-
tion in the present tense, that is, the context is different. This datum from a direct
source certainly makes plausible Colon’s hypothesis about the early function of the
GO-construction (animating the narration). At the same time, however, it makes
plausible the following claim too: in the earliest examples the auxiliary should
appear in a tense required by the context (cf. Colon 1978a: 128; Colon 1979b: 159).
Consequently, it can be supposed that in the earliest examples the auxiliary of the
periphrasis should be conjugated in the preterit in a preterit context.
Concerning the later morphological development of the construction, Colon
(1978b: 169) only refers to the appearance of the var- forms. The var- auxiliaries
(cf. (3)) were created by analogy, modeled on the regular preterit forms of first
conjugation verbs. Colon suggests that the formal similarity of var- auxiliaries to
regular perfective past forms attests to the temporal identity of the construction.
That is, he supposes that the construction is already grammaticalised as a perfec-
tive past tense at that time (about the 14th century).
Let us give a summary of the development of “anar + infinitive” in Colon’s
(1978a, b) account step by step:
The statements presented above are explicit claims in Colon (1978a, b). However,
because the plausibility of a conclusion does not only depend on the plausibility
values of the premises, but is also influenced by the plausibility of the latent back-
ground assumptions, we have to reconstruct them before considering the plausibil-
ity of Colon’s hypotheses. Latent background assumptions are statements which are
also needed to secure the plausibility of the conclusion in certain cases (cf. Kertész &
Rákosi, this volume b: Section 2.3). The following latent background assumptions
can be reconstructed at the crucial 2nd and 3rd points of Colon’s argumentation.
First, he supposes that two things which have the same function can fulfill this
function better together. However, he does not support this claim by independent
evidence. Second, he supposes that the present auxiliary forms in medieval texts
are representatives of the historical present. Third, he supposes that the historical
present has the function of animating narration (cf. Colon 1978a: 129, 1978b: 161).
These latter two latent background assumptions show that Colon’s argumentation
concerning the morphological development of the “anar + infinitive” periphrasis
is based on an assumed historical present use. This view is, however, not properly
established: Colon does not express explicitly what he means by “historical pres-
ent” and does not support these assumptions by other data. Because none of them
is supported properly, low plausibility values can be assigned to these latent back-
ground assumptions, which also decreases the plausibility of the hypothesis.
4. The periphrasis – with the GO-verb in present – acquires a past tense value.
As to the shift between the 3rd and 4th points of the development, according to
Colon (1978a, 1978b: 156), we cannot examine the beginnings of the history of
the Catalan “anar + infinitive” construction, because they already appear with a
past tense value in the earliest texts. In contrast with Colon’s claim, on the basis
of corpus data it can be claimed that in the earliest text the range of contexts in
which the periphrasis can appear is very limited. The grammaticalisation of the
construction has not yet concluded and if there is a past tense value, it is not yet a
grammatical meaning.
This claim can be assigned a high plausibility value on the basis of data from sev-
eral different language systems (see below, in 3.1.3).
Colon’s analysis is based on a wide range of data, not only from Old Catalan
but also from other Romance languages. However, his account also contains an
inconsistency between two of his statements, namely, (i) that the periphrasis has a
past tense value, and (ii) that it is about a historical present use. This inconsistency
results from the circumstance that both claims are based on the same sources of
Methods and argumentation in historical linguistics
data, that is, the same historical texts, which represent the same language state. It is
obvious that the two statements, inasmuch they concern the same language state,
cannot be true simultaneously. If the first statement is maintainable, the GO-verb
should already function as a grammatical marker or at least be grammaticalised
to some degree. However, this would contradict the statement that we are dealing
with a historical present usage, because in this case the GO-verb should refer to
real motion. Consequently, the simultaneous presence of both statements in the
argumentation would yield a p-inconsistent p-context. However, there is another
possibility: we can suppose that Colon intends the two statements to refer to two
different language states. In this sense, the second one concerns an earlier period
of the history of the Catalan, from which, as Colon also points out, we do not have
documents. For lack of direct sources from this language period, it is not possible
to support this claim properly. Consequently, it receives a low plausibility value,
which decreases the plausibility of Colon’s account.
The problem with this claim is that, although it may be maintainable, it is hard
to see whether it is relevant enough to include in the argumentation. Although it
is possible that the formation of the past value of this Catalan construction took
place in the context of such an incoherent verbal tense use, Detges does not pro-
vide examples taken from these types of historical documents. Moreover, this
claim does not hold for the medieval Catalan chronicles, which contain the earliest
documented occurrences of the periphrasis under study. Detges himself presents
examples4 which show a coherent use of preterit tenses in narrative, only inter-
rupted by occurrences of the GO-periphrasis with an auxiliary in the present, but
. These examples are provided at another point of his argumentation and with a different
aim from that of the present discussion.
Katalin Nagy C.
with no other present tense forms. Example (11) provided in Detges (2004: 217,
taken from Colon 1978b: 135f.) is a passage from Desclot’s Chronicle. It contains
14 verbal forms, of which 13 are in the past tense, and the only one in the present is
the auxiliary of an occurrence of the GO-periphrasis. In his other example (again
taken from Colon 1978b: 169) he presents a passage from Muntaner’s Chronicle.
This contains 23 verbal forms. Two of them appear in a direct quotation and
another two in the expression addressing the public Què us diré? ‘What shall I tell
you?’ The rest of them belong to the real narrative part of the text, and 5 of them
are representatives of the “anar + infinitive” periphrasis. Of these five occurrences
only one has the auxiliary in the preterit, the other four are occurrences with a
present tense auxiliary. However, these present auxiliary forms are the only pres-
ent forms with a past tense reference in the text. That is, there are no other present
forms which could be representatives of the historical present: the coherent verbal
tense use in this narrative is only interrupted by the present auxiliary variants of
“anar + infinitive”. The illustrative examples do not show the unsystematic alterna-
tion of verbal tenses assumed by Detges. Consequently, this is not a felicitous argu-
ment, or at least it should be mentioned that the periphrasis also appears in texts
whose verbal tense use is systematic and coherent. The present tense forms in this
type of texts should be explained, too. That is, the p-context of Detges’s investiga-
tion is p-inconsistent in the sense that the statement concerning the “unsystem-
atic present tense – past tense alternation” is claimed to be plausible, while direct
sources make its negation plausible.
2. The historical present is dispensable for the “effect” of the periphrasis; the tense of
the auxiliary is optional.
Detges does not explain why, if this is the case, historical present would appear at
all, and why only in the case of this periphrasis. At this point a latent background
assumption can be reconstructed, that is, that the present auxiliary occurrences of
“anar + infinitive” are representatives of a historical present usage. However, since
he operates with an intuitive notion of the historical present, this statement is not
properly supported. Consequently, it has a low plausibility value, which weakens
the plausibility of his hypothesis as well.
3. The version which has the auxiliary in the present is the “stronger alternative”.
The present-tense realisation combines two effects, namely, the “dynamification
effect and, due to the historical present, “the actuality-effect” (Detges 2004: 218).
Detges bases this claim, again, on the latent background assumption that the pres-
ent auxiliary versions are representatives of a historical present usage. He argues
that they are stronger versions, since they combine two effects. But how can we
Methods and argumentation in historical linguistics
know what “effect” the periphrasis had in the Middle Ages? We should support
such a claim with evidence accepted in historical linguistics. If in the medieval
language there actually existed a stronger and a weaker version of the periphrasis,
such a difference should be reflected in the documented uses. However, contexts
in historical documents do not reflect a “weaker version ‒ stronger version” oppo-
sition: in early texts the present and the past tense auxiliary versions appear in
the same types of contexts and with the same infinitives. Consequently, we do
not have supporting evidence for the claim that the present auxiliary variant was
stronger. This statement is only based on the author’s subjective judgment and our
sources (i.e. historical documents) make it implausible by supporting its nega-
tion. Once again, the extension of the p-context of Detges’s analysis with new data
results in a p-inconsistency.
4. The present-tense variant was used more frequently at all times (Detges 2004: 218).
Detges does not support the higher frequency of the present auxiliary version with
(quantitative) data. Thus, the reader can misinterpret this statement, believing that
statistics have been used in the investigation, which is not the case. Moreover,
statistical data suggest the opposite hypothesis, according to which the preterit
auxiliary variant of “anar + infinitive” was more frequent at the beginning and was
displaced by the present auxiliary forms only later. In other words, the extension
of the p-context of his investigation with new information originating from these
statistics yields a p-inconsistency. We have to decide, then, which one of the two
simultaneously plausible statements to accept: the statement in 4. or its negation.
Detges attributes the loss of the preterit forms to the higher frequency of present
auxiliary variants. Although frequency can have this effect, Detges does not sup-
port this claim with a frequency analysis of historical occurrences, neither does he
take into account results presented in previous literature, which show the oppo-
site: at the beginning the preterit auxiliary occurrences were more frequent. Con-
sequently, on the basis of information from relevant literature we have to judge
this statement to be implausible.
To sum up, Detges’s argumentation does not rely sufficiently on data whose
plausibility value originates from direct sources and in his argumentation there
appear statements which are not presented as results of usually accepted histori-
cal linguistics argumentation methods. At the same time, when he uses accepted
argumentation methods (e.g. based on frequency), his statements are either not
Katalin Nagy C.
supported by data whose plausibility value originates from some direct source, or
further sources render them implausible.
In addition, a textual contradiction appears, confusing the reader. Detges
(2004: 213) translates the Old French form (il) va dire in the following way: “not
simply ‘he starts to say’, but more specifically ‘all of a sudden, he says’ (Gougenheim
1971 [1929]: 96)”. And elsewhere he says (Detges 2004: 217): “il va dire ‘suddenly,
he said’ (Gougenheim 1971 [1929]: 96–97)” with a translation in the preterit. We
do not know whether this contradiction between two different translations with
present and past tenses comes from Gougenheim or it is Detges’s own mistake. In
any case, it is very confusing, as the question of tense is a crucial one in the devel-
opment of this construction.
In the two accounts presented above, the morphological analysis of the devel-
opment of the Catalan GO-past amounts only to the “historical present explana-
tion”. However, it is in itself not properly established. Consequently, the p-context
of the research should be extended with new information about historical pres-
ent usage. In addition, these two analyses need to be complemented with more
detailed morphological and pragmatic analyses, because the plausibility of a state-
ment which is in accordance with analyses of other linguistic levels is higher. As
Kertész and Rákosi (this volume: Section 2.2, see also 2012: 76f.) point out, “if
several sources support a statement, then its plausibility value is higher on the
basis of all sources together than its plausibility value on the basis of any of the
sources alone”.
present tense auxiliary. As a consequence, the use of the present extended to the
rest of the paradigm of the auxiliary. Let us observe his methods and argumenta-
tion in more detail.
This claim can be (and is) supported by frequency analysis, consequently, a high
plausibility value can be assigned to it.
Juge (2006: 319) presents corpus data in support of this claim: in his corpus, 32 of
the 157 (20.4%) instances of anà ‘he went’ “are plausibly cases of the periphrastic
preterit” and, similarly, with anaren ‘they went’ there are 164 instances in all, with
40 (24.4%) of these “being good candidates for the periphrasis”.
The third point of the development described in Juge (2006) is crucial. At this
point, language users presumably reinterpret the periphrasis as a construction
with a present tense auxiliary. In this reinterpretation, Juge (2006: 320) attri-
butes a crucial role to the ambiguity between present and preterit found in the
first and second plural indicative (cf. Table 3). Because present and preterit are
formally identical in the first and second plural indicative, language users rein-
terpret the construction with a preterit auxiliary as one with a present auxiliary.
Katalin Nagy C.
Juge (2006: 320) claims that “the influence is not directly on the other forms in the
paradigm; rather it is on the interpretation of the structure of the construction as
a whole”, that is, at this point Juge assumes an analogy with other constructions
which have the auxiliary in the present tense.
The analogy-based and frequency-based arguments are connected here. On
the one hand, according to Juge (2006: 321), “one possible reason for the reanalysis
of forms such as anam as present in this construction is the tendency for finite
verbs participating in periphrases to be morphologically present tense”. Juge’s
argumentation, relying on analogy with formally similar constructions, takes into
account the whole verbal system. His other argument (Juge 2006: 322) is based on
universal tendencies:
there is a strong crosslinguistic tendency for auxiliary marking to be in the
present, although this is certainly not an absolute universal […]. In various
Romance languages, for example, the perfect TAMCATS [Tense-Aspect-Mood-
CATegories] allow a wide range of auxiliary forms, and the progressive and GO-
future do so to a lesser extent. Even in these cases, the non-present marking on
auxiliaries strongly tends to be imperfective, not perfective.
Table 4. “Frequency of first person plural and other forms in Old Catalan”
Juge (2006: 321)
Preterit Present indicative
Form # Form #
1S vag 1
2S vas 1
3S anà 41 va 13
1Pl anam 209 anam (209)
3Pl anaren 71 van 10
209 of 346 occurrences represent the 1Pl form anam ‘we go/we are going/we
went’, which is a relatively high frequency. The frequency-based argumentation at
this point consists in claiming that the reinterpretation of “anar + infinitive” as a
periphrasis with a present tense auxiliary is realised due to the high frequency of
Methods and argumentation in historical linguistics
the 1Pl form anam. This form is ambiguous between past and present tense and
allows an association to the present tense paradigm of the verb anar.
Two problems should be mentioned, which can decrease the plausibility of
this statement. First, in other cases of analogical change the basicness of third per-
sons is emphasised. Second, it is questionable whether the frequency will remain
the same, when investigating it in a larger corpus. As to the first problem, the
extremely high frequency of this no-third person form can explain its central role
at this particular occasion, because it can be considered a basic form. Juge’s argu-
mentation at this point is in accordance with Kuryłowicz’s second “law” of anal-
ogy: “analogical developments follow the direction ‘basic form’ → ‘derived form’,
where the relationship between basic and derived forms is a consequence of their
spheres of usage” (Hock 1991: 212). Also: “of any given set of forms or morpho-
logical classes, the one which has a greater sphere of usage is more ‘basic’ than the
others” (Hock 1991: 214). That is, the 1Pl form anam, due to its high frequency,
can be considered a basic form. As to the second problem, at the moment, it is a
plausibility weakening factor, and needs more investigation.
4. Later, new forms are created by analogy of regular preterit forms. Juge (2006: 316)
argues that other forms are analogically created from the root va-, since the form
va ‘(he) goes’ is both monosyllabic and third person singular.
This claim can be supported by crosslinguistic data, since the basicness of the third
person can be found in many languages and often has diachronic consequences (cf.
Hock 1991: 220). Third persons tend to be more basic in analogical change than
other forms of the verb (Hock 2005 [2003]: 446), cf. also Kuryłowicz’s “sphere of
usage” provision (Hock 1991: 220ff.). Some forms created on the root va- resemble
regular preterit forms like cantares ‘you sang’. These are also given an explanation
in Juge (2006: 316). The marking of the periphrasis with a preterit meaning by a
preterit marker also has a parallel in another language. As Juge (2006: 316) explains
Russian presents a parallel for such a ‘double’ marking where the tense of the
auxiliary is the same as that of the construction as a whole. […] There may then
be a crosslinguistic tendency to make the auxiliary match the overall value of the
construction, thus leading to the forms of va- that look like preterits.
Juge (2006: 319) supports this claim by another one: all the present tense auxiliary
occurrences found in his corpus are representatives of the periphrasis (that is, they
Katalin Nagy C.
do not retain the lexical meaning ‘go’ of anar). As he emphasises: “there is NOT A
SINGLE EXAMPLE of va or van used non-periphrastically with a past value” and
the present tense variants appear “with no other present tense forms nearby” (Juge
2006: 319; emphasis in the original). Similarly to the other two authors, Juge does
not explain exactly what the historical present is. Although his claims concerning
this issue are intuitively acceptable and seem to provide a correct description of
the phenomenon, with an extension of the p-context with new, relevant informa-
tion, the plausibility of his statements can also be modified.
To sum up, Juge’s (2006) argumentation takes into consideration the whole
verbal system and crosslinguistic tendencies, too. He often makes typological
comparisons and relies on examples and parallels from other languages. Juge bases
his claims on data originating from a historical corpus and provides an exhaustive
morphological analysis of the occurrences. He explains the morphological devel-
opment of the construction from the beginnings until the modern paradigm with
argumentation methods accepted in historical linguistics. References to analogical
processes and frequency are widely used in his argumentation, yielding a convinc-
ing analysis.
representatives of the historical present, because they appear “with no other pres-
ent tense forms nearby”. This suggests that historical present forms usually do not
appear in isolation. In any case, it is undeniable that if the historical present tense
forms had been grammaticalised in the Catalan GO-past, there should have been
also forms of anar in the historical present combined with an infinitive, in which
the verb anar retained its lexical meaning ‘go’. However, as Colon also emphasises,
the formally present forms all seem to have a past tense value and do not retain the
lexical meaning ‘go’ of anar.
Now I will attempt to summarise what is worth knowing about the historical
present for the purposes of the present chapter. The following hypotheses con-
cerning the historical present originate from the relevant literature, where they
are considered to deserve a high plausibility value. The historical present is tra-
ditionally defined as the present tense used in the narration of events set in the
past, and it has been suggested that it makes stories more vivid by bringing past
actions into the present (see relevant literature given in Wolfson 1979: 169 and in
Schiffrin 1981: 46). It occurs when relating the most crucial events in a narration.
Fludernik (1992) points out that the historical present use reflected in early texts
is related to an oral pattern of story telling. Later, however, as the written standard
is developed, historical present usage suffers some changes:
In the process of narrative development from the written codification of oral
storytelling via the written composition of texts on an oral model towards a
purely written conception of narrative structure, the shape and function of the
historical present tense necessarily undergo equivalent changes.
(Fludernik 1992: 1, cf. also Fludernik 1992: 21)
We can suppose that the use of the historical present may also be close to the oral
pattern in early Catalan texts.
Beyond the simple characterisation presented above, some interesting new
findings on the historical present have been put forward. The first is Wolfson’s
(1979) claim that the conversational historical present has no significance in itself.
To put it differently, it is not simply the historical present use that matters, but
the switching between present and past tenses. This present–past alternation is a
discourse phenomenon which organises and segments the narrative (cf. Wolfson
1979: 178): it separates narrative events from each other, and, at the same time,
highlights the turning points of a story. Wolfson (1979: 172) also emphasises that
“many stories are so organized that what seems to be the most important event is
given in the past tense”. That is, she does not accept the claim that the historical
present would have the function of dramatising certain acts. Rather, as Schiffrin
(1981: 59) explains, the shift to the historical present seems to be an “internal eval-
uation device: it allows the narrator to present events as if they were occurring at
Katalin Nagy C.
that moment, so that the audience can hear for itself what happened, and can inter-
pret for itself the significance of those events for the experience”. Silva-Corvalán’s
findings (1983: 774f.) offer Spanish support for Schiffrin’s claim for English that
historical present usage is an internal evaluation device.
The second finding is that historical present forms tend to occur more fre-
quently in specified points in the narrative but are missing in others. Schiffrin
(1981) analyzes how the use of the historical present depends on the organisation
of narratives. Fludernik (1992: 23) too, emphasises that the dynamics of the histor-
ical present should be discussed in relation to narrative episodes in their totality.
The present auxiliary versions of “anar + infinitive” actually can be found at
certain points of the narrative, namely, in the description of events which can be
considered “turning points” of the story. However, we should not conclude from
this that it is about a historical present usage. As Colon points out, both the periph-
rasis and the historical present share the same function, and this can be the reason
why they appear at the same points of the narrative: in the description of events
which the speaker wants to attract the hearers’ attention to. We cannot conclude
that it is about a historical present use because there are at least two other options
to explain the presence of the construction at these points of the narration. On the
one hand, it can be due to the value of the periphrasis in itself, if we regard it as a
pragmatically marked construction at that language state. On the other hand, the
present tense of the auxiliary can be also a simple morphological characteristic as
described in Juge (2006), that is, the tense of the auxiliary of an already grammati-
calised construction.
It is also worth noting Schiffrin’s (1981: 47) finding, according to which in
narrative clauses about 30% of the verbs are in the historical present. Moreover,
she notes that “there is a tendency for verbs in the same tense to cluster together.
[…] Thus sequences with rapid alternation between the HP [historical present]
and the P [past] are not typical” (Schiffrin 1981: 51) (for the clustering of his-
torical present forms cf. also Pinkster 1990: 73, 75). Investigating the historical
present/preterit alternation in oral Spanish narrative, Silva-Corvalán (1983: 767)
finds a very similar percentage for Spanish: “of 476 narrative clauses, 156 (32.7%)
verbs are in the historical present”. She claims that “this cross-linguistic similar-
ity may respond to a universal pragmatic function” (Silva-Corvalán 1983: 767,
Footnote 13). These two characteristics, again, do not fit the present auxiliary
use in the medieval Catalan chronicles. In these texts the proportion of the pres-
ent variants of the construction does not reach 30%, and the occurrences do not
cluster together. Although in some passages of Muntaner’s Chronicle some occur-
rences appear close to each other, it is not a general characteristic of the use of the
periphrasis in all chronicles, and it can also be attributed to the higher degree of
grammaticalisation of the construction. Moreover, what is typical for the use of
Methods and argumentation in historical linguistics
“anar + infinitive” in these medieval texts is precisely the rapid alternation of the
present auxiliary versions of the construction with verbal forms in the preterit,
which is not typical for a historical present usage. The main evidence, however, is
that “there is NOT A SINGLE EXAMPLE of va or van used non-periphrastically
with a past value” and the present tense variants appear “with no other present
tense forms nearby” (Juge 2006: 319). This finding argues contra the historical
present hypothesis, because a historical present use which affects only the occur-
rences of the “anar + infinitive” construction, but no other verbal forms, would be
very strange, and moreover, counterintuitive.
To sum up, the extension of the p-context with new information concerning
the historical present helps us to eliminate one of the p-inconsistencies present in
the starting p-context, namely, that which exists between the statement that there
is a historical present use in the historical documents under study (considered as
plausible in Colon’s and Detges’s accounts) and its negation (made plausible by
Juge’s approach). On the basis of the new information presented in the present
section we should modify the plausibility values of these statements: the statement
that it is about a historical present use should be evaluated implausible, while the
plausibility of its negation should be strengthened.
“anar (conjugated in the preterit) + infinitive” → the loss of preterit forms of anar in
the construction, displaced by occurrences of anar in the historical present → “anar
(in the historical present) + infinitive” → semantic change → “anar (auxiliary, con-
jugated in the present) + infinitive”
change, and in the second account it is the periphrasis with preterit auxiliary that
does. Similarly, the spread of the present auxiliary forms is accounted for differ-
ently in the two analyses. In the first, it is the result of an extension of the historical
present usage (which, very strangely, only affects the forms of this periphrasis). In
the second analysis, however, they are the result of a reinterpretation, triggered by
syncretism in one of the most frequent forms of the paradigm. According to Juge’s
theory, it was the change in the meaning which triggered formal change. This hap-
pens frequently in analogical changes: the users of a language give similar forms to
semantically similar linguistic units (cf. Hock 1991: 167).
The higher frequency of preterit forms of anar in the construction in early
texts in itself does not contradict the first analysis. The problem lies in the circum-
stance that the occurrences of the construction with the GO-verb in the present
show a higher degree of grammaticalisation in the sense that in medieval Cata-
lan chronicles we do not find any occurrence with a present form of anar which
would retain the lexical meaning ‘go’. In contrast, the preterit auxiliary versions
show a greater variety of meanings: some of them retain the old, lexical meaning
of motion, while others offer a new, more plausible interpretation. Table 5 shows
the distribution of supposed meanings of the occurrences of “anar + infinitive” in
two medieval Catalan chronicles.
In Example (6) the subject (‘news’) of the construction suggests that van venir
‘arrived’ should be a grammaticalised form, conveying a past tense meaning. This
claim is also supported by the tense of the verbal form havia pres ‘had been taken’
in the second clause. This past tense expresses an action, state or event that took
place before the start of another past action, that is, the form van venir should refer
to a past event. The meaning of the infinitive itself ‘come’ also suggests that the
form van does not retain its lexical meaning ‘go’, since it would yield the strange
and nonsense reading ‘the news come/are coming (in order) to go’. Several similar
occurrences documented in the medieval Catalan chronicles show that the require-
ment of an agentive, human subject and that of intentionality are disappearing.
The considerations presented above support the hypothesis according to
which present auxiliary occurrences of “anar + infinitive” in medieval texts are not
representatives of a historical present usage but rather forms which, in comparison
with the preterit auxiliary variants, show a more advanced stage of grammaticali-
sation. On the basis of the above considerations, we can attribute a high plausibil-
ity value to the “morphological account”, while the “historical present account”
seems to be unacceptable because it fails to explain adequately the data.
At this point of the investigation, we have to compare the rival solutions, decide
which one to accept and, with the modification of the p-context, eliminate the
p-inconsistency.
Information concerning universal tendencies in linguistic change often helps us
to decide between competing hypotheses. Let us discuss how considerations about
universal tendencies in analogy and universal tendencies in semantic change in
grammaticalisation appear in historical linguistic argumentation. Various u
niversal
Katalin Nagy C.
tendencies have been detected concerning the way in which meanings change in the
languages of the world (cf. Bybee et al. 1994, Heine & Kuteva 2002, among others),
but also in the case of analogy we find some tendencies concerning the way in which
analogical change usually occurs. Argumentations in linguistics which make use of
universal tendencies also rely on plausible inferences, because linguistic universals
cannot be regarded as certainly true statements, they can be only plausible. Raising
hypotheses about universal characteristics of languages starts from the investigation
of a limited number of languages when the presence of the characteristics at issue is
checked, and in a second step, an inductive inference is applied, where
the set of the premise-candidates has to be supplemented by the background
assumption that the cases not examined also possess the characteristics that
could be found in the investigated ones; the conclusion states the presence of
these characteristics as a general rule. (Kertész & Rákosi 2012: 90)
We can use these general claims about the semantic development of lexemes
meaning ‘go’ in assigning plausibility values to the following two statements:
1. The Catalan GO-past comes from the construction “GO-verb conjugated in the
present tense + infinitive”.
2. The Catalan GO-past does not come from the construction “GO-verb conjugated
in the present tense + infinitive”.
In the case study presented above our starting p-context contained three hypoth-
eses concerning the morphological development of the Catalan “anar + infini-
tive” construction. Two of them made statement (1) plausible, while the third
made statement (2) plausible. Consequently, the p-context was p-inconsistent. We
extended the p-context with new information and re-evaluated the plausibility of
the three hypotheses. Now, we have to modify the p-context and decide between
the rival hypotheses. I want to show how taking into account universal tendencies
helps us to do that. A high plausibility value can be assigned to the statements
in (a) and (b), because they are supported by several case studies in historical
linguistics. Relying on (a) and (b), we can assign a high plausibility value to (2),
because it is in accordance with the general tendencies presented in (a) and (b),
while statement (1) can be considered implausible on the basis of (a) and (b). In
the present form, however, (2) is too general. As presented above in Section 3.1.3,
Juge (2006) claims that (3) the Catalan GO-past comes from the construction
“GO-verb conjugated in the preterit + infinitive”. This statement can be regarded
as a special case of (2). Moreover, this latter hypothesis can be supported by cross-
linguistic analogy with the Tucano language which shows a similar development.
When Bybee et al. (1994) examine possible paths for the formation of perfectives
and past tenses, they mention the completive aspect as one of the possible sources.
They define the meaning of completives as “to do something thoroughly and com-
pletely”, but they point out further semantic nuances and other uses completives
may have, among them that “the action is reported with some emphasis or sur-
prise value”. They report the ‘go’ completive in the Tucano language as one which
resembles “the English ‘went and did it’ construction: He went and told her the
whole story, […] which is usually used in the past to emphasise the deliberateness
and finality of an action” (Bybee et al. 1994: 57). The case of the Catalan “anar +
infinitive” is very similar: it also seems to refer to a completed action, and, at the
Katalin Nagy C.
same time, it expresses emphasis. Thus, the development of this construction can
be supposed to follow the path described in Bybee et al. (1994: 104) as “that of
resultative or completive leading to anterior and then to perfective or simple past”.
That is, completives are a possible source for perfective aspect and then past mark-
ers, and the history of the Catalan “anar + infinitive” seems to instantiate this
process. Such an account is also in accordance with a historical pragmatic analysis
(see Nagy C. 2010).
Our initial question in the present case study was as follows: from the two
documented variants of the “anar + infinitive” construction, namely, (i) “anar ‘go/
auxiliar’ conjugated in the present + infinitive” and, (ii) “anar ‘go/auxiliar’ con-
jugated in the perfective past + infinitive”, which one should be considered the
ancestor of the perfective past of modern Catalan? Now, we can answer that it is
plausible that it is the second one. This decision favors Juge’s (2006) account and
makes Colon’s (1978a, b) and Detges’s (2004) claims concerning the morphologi-
cal aspect of the development of “anar ‘go’ + infinitive’ implausible. Juge’s (2006)
hypotheses deserve a high plausibility value primarily due to its complexity: his
argumentation relies widely on frequency, analogy and crosslinguistic similarities,
and the development he describes fits general tendencies of semantic and analogi-
cal change.
5. Conclusions
As regards the subquestion discussed in the present case study, the main prob-
lem with the historical present accounts has been exactly what Fischer (2007: 18)
describes in the following way: “There is a natural tendency to interpret an older
construction very much from the point of view of the modern system. This hap-
pens especially when the form of the construction has remained more or less the
same.” It seems that the formal similarity between the forms of the current para-
digm of “anar + infinitive” (which have the auxiliary in the present tense) and
the medieval present auxiliary forms of “anar + infinitive” resulted in a superfi-
cial analysis in this respect and distracted attention from preterit auxiliary occur-
rences of the medieval “anar + infinitive”. This has yielded the omission of a more
thorough morphological analysis, which appears only later, in Juge (2006). The
development of the Catalan “anar ‘go’ + infinitive” construction has found a more
satisfactory explanation under this latter proposal.
Relying on the considerations presented in this chapter, we can conclude the
following. In historical linguistics we should assign a higher plausibility value to
a statement if:
1. sources traditionally accepted in historical linguistics (e.g. historical corpora,
a properly reconstructed language system, etc.) are used in the research,
2. research methods and argumentation techniques traditionally accepted in
historical linguistics (e.g. analogy-based argumentation, frequency counting,
etc.) are used in the research,
3. the analysis is supported by, or at least does not contradict, accounts of other
linguistic levels,
4. it is supported by or at least does not contradict universal historical linguistics
tendencies detected in previous, reliable research.
It is important to note that the emphasis frequency counting has been given in the
present chapter does not mean that the corpus method and quantitative data would
be the only ones used in historical linguistics. The methods discussed in the pres-
ent chapter do not cover all the methodological tools used in historical linguistics.
In their introductory study to a volume of selected articles on historical pragmatics
methodology, Fitzmaurice and Taavitsainen (2007: 17) point out that “it is not clear
that corpus methodology is applicable to purely pragmatic research questions” and
that “corpus-based methods can be barriers to the investigation of pragmatic phe-
nomena like conversational implicatures, which are neither routinely nor conven-
tionally realised in lexical grammatical expressions” (Fitzmaurice & Taavitsainen
2007: 18). To put it differently, in historical pragmatics studies qualitative methods
also have a great significance (cf. Navarro 2008). However, discussion of the limits
which the corpus method sets for the research of some, primarily historical, prag-
matics phenomena is beyond the scope of the present chapter.
Katalin Nagy C.
Acknowledgements
Work on the present chapter was supported by the MTA-DE Research Group for
Theoretical Linguistics and the TÁMOP-4.2.2/B-10/1-2010-0024 project, which
is co-financed by the European Union and the European Social Fund. I am grate-
ful to Enikő Németh T. for valuable discussions about several topics which are
touched upon in this chapter, and to András Kertész and Csilla Rákosi for their
useful comments on earlier versions of the paper. I wish to thank Anna Fenyvesi
for improving my English. I alone am responsible for any remaining errors.
Historical sources
Desc: Desclot, Bernat: Crònica (Chronicle). 1283–88. Barcelona: Edicions 62. 1990.
Munt: Muntaner, Ramón: Crònica (Chronicle). Ca. 1325. Barcelona: Edicions 62. 1991. For the
English translation of the examples taken from the Chronicle of Muntaner I have mainly
relied on the following work: Goodenough, Lady, The Chronicle of Muntaner. 〈http://www.
yorku.ca/inpar/muntaner_goodenough.pdf〉
References
Alvar, Manuel & Pottier, Bernard. 1987[1983]. Morfología histórica del español (Historical mor-
phology of Spanish). Madrid: Gredos.
Badia i Margarit, Antoni M. 1981. Gramàtica històrica catalana (Catalan historical grammar).
València: Tres i Quatre.
Bybee, Joan. 2005[2003]. Mechanisms of change in grammaticalization. The role of frequency. In
The Handbook of Historical Linguistics, Brian D. Joseph & Richard D. Janda (eds), 602–623.
Oxford: Blackwell.
Bybee, Joan. 2005. From Usage to Grammar: The Mind’s Response to Repetition. LSA Presidential
Address, January 8, 2005. 〈http://www.unm.edu/~¡bybee/Bybee%20plenary.pdf〉
Bybee, Joan. 2007. Diachronic Linguistics. In The Oxford Handbook of Cognitive Linguistics,
Dirk Geeraerts & Hubert Cuyckens (eds), 945–987. Oxford: OUP.
Bybee, Joan, Perkins, Revere & Pagliuca, William. 1994. The Evolution of Grammar: Tense, Aspect,
and Modality in the Languages of the World. Chicago IL: The University of Chicago Press.
Bynon, Theodora. 1983. Historical Linguistics. Cambridge: CUP.
Colon, Germà. 1978a[1959]. El perfet perifràstic català “va + infinitiu”. (The Catalan periphras-
tic perfect “va + infinitive”). In La llengua catalana en els seus textos (The Catalan language
of texts), Vol. 2, Germà Colon (ed.), 119–130. Barcelona: Curial.
Colon, Germà. 1978b[1976]. Sobre el perfet perifràstic ‘vado + infinitiu’, en català, en proven-
çal i en francès (On the periphrastic perfect ‘vado + infinitive’ in Catalan, Provençal and,
French). In La llengua catalana en els seus textos (The Catalan language of texts), Vol. 2,
Germà Colon (ed.), 131–174. Barcelona: Curial.
Methods and argumentation in historical linguistics
Detges, Ulrich. 2004. How cognitive is grammaticalization? The history of the Catalan perfet
perifràstic. In Up and Down the Cline – The Nature of Grammaticalization [Typological
Studies in Language 59], Olga Fischer, Muriel Norde & Harry Perridon (eds), 211–227.
Amsterdam: John Benjamins.
Dér, Csilla Ilona. 2005. Grammatikalizációs folyamatok a magyar nyelvben – Elméleti kérdések
és esettanulmányok (Processes of grammaticalisation in Hungarian: Theoretical issues and
case studies). Ph.D. dissertation, Eötvös Loránd University, Budapest.
Fischer, Olga. 2004. What counts as evidence in historical linguistics? Studies in Language 28(3):
710–740.
Fischer, Olga. 2007. Morphosyntactic Change: Functional and Formal Perspectives. Oxford: OUP.
Fitzmaurice, Susan & Taavitsainen, Irma. 2007. Historical pragmatics: What it is and how to
do it. In Methods in Historical Pragmatics [Topics in English Linguistics 52], Susan M.
Fitzmaurice & Irma Taavitsainen (eds), 11–36. Berlin: Mouton de Gruyter.
Fludernik, Monika. 1992. The historical present tense in English literature. An oral pattern and
its literary adaptation. 〈http://freidok.uni-freiburg.de/volltexte/5343/pdf/Fludernik_The_
historical_present_tense.pdf〉
Gougenheim, Georges. 1971[1929]. Étude sur les périphrases verbales de la langue française
(A study on the verbal periphrases of French). Paris: Nizet.
Heine, Bernd. 2005[2003]. Grammaticalization. In The Handbook of Historical Linguistics, Brian
D. Joseph & Richard D. Janda (eds), 575–601. Oxford: Blackwell.
Heine, Bernd & Kuteva, Tania. 2002. World Lexicon of Grammaticalization. Cambridge: CUP.
Hock, Hans Henrich. 1991. Principles of Historical Linguistics. Berlin: Mouton de Gruyter.
Hock, Hans Henrich. 2005[2003]. Analogical change. In The Handbook of Historical Linguistics,
Brian D. Joseph & Richard D. Janda (eds), 441–460. Oxford: Blackwell.
Jeffers, Robert J. & Lehiste, Ilse. 1979. Principles and Methods for Historical Linguistics. Cambridge
MA: The MIT Press.
Juge, Matthew L. 2006. Morphological factors in the grammaticalization of the Catalan “go”
past. Diachronica 23(2): 313–339.
Kertész, András & Rákosi, Csilla. 2012. Data and Evidence in Linguistics. A Plausible Argumenta-
tion Model. Cambridge: CUP.
Kertész, András & Rákosi, Csilla. This volume a. Introduction: The state of the art and the struc-
ture of the book.
Kertész, András & Rákosi, Csilla. This volume b. The p-model of data and evidence in linguistics.
Ladányi, Mária. 1998. Jelentésváltozás és grammatikalizáció – kognitív és szerves nyelvelméleti
keretben (Meaning change and grammaticalization in a cognitive and organic theoretical
linguistic context). Magyar Nyelv 94: 407–423.
Nagy C., Katalin. 2010. The pragmatics of grammaticalization: The role of implicatures in
semantic change. Journal of Historical Pragmatics 11(1): 67–95.
Navarro, Federico. 2008. Análisis histórico del discurso. Hacia un enfoque histórico-discursivo
en el estudio diacrónico de la lengua (Historical discourse analysis. Toward a historical
discursive approach in the diachronic study of the language). In El valor de la diversidad
[meta]lingüística. Actas del VIII Congreso de Lingüística General (The value of the [meta]
linguistic diversity. Proceedings of the 7th Congress on General Linguistics), Antonio
Moreno Sandoval (ed.). Madrid: Universidad Autónoma de Madrid. 〈http://www.lllf.uam.
es/clg8/actas/pdf/paperCLG85.pdf〉
Katalin Nagy C.
Pinkster, Harm. 1990. Is the Latin present tense the unmarked, neutral tense in the system? In
Latin in Use [Amsterdam Studies in the Pragmatics of Latin], Rodie Risselada (ed.), 63–83.
Amsterdam: Gieben.
Schiffrin, Deborah. 1981. Tense variation in narrative. Language 57(1): 45–62.
Silva-Corvalán, Carmen. 1983. Tense and aspect in oral Spanish narrative: context and mean-
ing. Language 59(4): 760–780.
Wolfson, Nessa. 1979. The conversational historical present alternation. Language 55(1):
168–182.
chapter 5
Enikő Németh T.
University of Szeged, Dept. of General Linguistics, MTA-DE Research
Group for Theoretical Linguistics
The present chapter has two main aims. First, it attempts to provide a novel
account for the occurrence of Hungarian verbs of natural phenomena with
subject arguments. The problem which the present chapter wants to resolve
is that there is an inconsistency between the previous explanations of the
behaviour of verbs of natural phenomena with and without an explicit subject
and the data. The previous approaches have considered the Hungarian verbs of
natural phenomena subjectless, although data from various direct sources (such
as written corpora, intuition, introspection, and spoken discourses) testify to
occurrences of verbs of natural phenomena with explicit subjects. To eliminate
this inconsistency, the chapter argues for a new, unified approach, according to
which Hungarian verbs of natural phenomena have subject arguments in their
lexical-semantic representations which can (or even should) be lexically unrealised
in some contexts, while in others they can be explicitly expressed and contextual,
grammatical as well as lexical-semantic factors together license the implicit or
explicit occurrence of subject arguments. Second, applying Kertész & Rákosi’s
(2012, this volume) p-model, the chapter aims to reflect on a metalinguistic level
on how the proposed explanation has been reached in the course of the research
as well as how the new account can resolve the above mentioned problem.
In the present chapter I have two main aims. First, taking into account a wide
spectrum of data as well as relying on plausible inferences (cf. Kertész & Rákosi
2012: 56–128, this volume: Section 2.3) built on these data, I attempt to provide a
Enikő Németh T.
in the sense that it involves statements that are plausible according to some
source but cannot be true simultaneously (Kertész & Rákosi 2012: 130ff., this
volume: Section 2.5). The overdetermination of the starting p-context results
from incompatibility between, on the one hand, the proposals suggesting
the subjectlessness of verbs of natural phenomena, i.e. Magyar grammatika
[Hungarian Grammar] (Keszler 2000) as well as a lexical-functional grammar
(Komlósy 1994, 2001), and, on the other hand, a generative syntactic analy-
sis (Tóth 2001), which considers the verbs at stake to have a quasi-argumental
subject. Consequently, the starting p-context is p-inconsistent. In Section 3,
I will present a range of data capturing the characteristics of verbs of natu-
ral phenomena in Hungarian which provide evidence for the hypothesis con-
cerning the occurrence of verbs of natural phenomena with explicit subjects,
i.e. I will extend the starting p-context with data which were neglected earlier
and confront these data with the approaches in the starting p-context. By this
coordination of the extended p-context (cf. Kertész & Rákosi 2012: 136–143,
this volume: Subsection 2.6.1), I will show that the uses of verbs of natural
phenomena with explicit subjects cannot be accounted for by the approaches
in the extended p-context either. Consequently, the extended and coordinated
p-context is more p-inconsistent: there is p-inconsistency inherited from the
starting p-context between the rivalling approaches and there is a new incon-
sistency in the extended p-context between the previous approaches and the
data newly introduced into the p-context. In Section 4, I will continue to extend
the p-context with the results of my previous research into implicit arguments
in Hungarian. In Section 5, I will modify the previous p-context (Kertész &
Rákosi 2012: Section 10.4, this volume: Subsection 2.6.1). Starting out from my
results regarding the verbs’ occurrence with implicit arguments in Hungarian
as initial hypotheses to explain the subjectless use of verbs of natural phenom-
ena, I will propose a new kind of lexical-semantic representation for the verbs
of natural phenomena, on the basis of which the occurrence of the verbs of
natural phenomena both without a subject and with a subject can be accounted
for in a unified way. By applying this kind of lexical-semantic representation as
well as taking into consideration contextual factors, the p-inconsistency can be
eliminated in the modified context at the end of the argumentation cycle. And,
finally, in Section 6, I will summarise my results and formulate my conclusions,
indicating why the suggested explanation is more plausible than the previous
approaches in the starting p-context, or, in other words, why my account for
the occurrence of Hungarian verbs of natural phenomena can be regarded as
the resolution of the starting p-problem (cf. Kertész & Rákosi 2012: 133f., this
volume: Section 2.5).
Enikő Németh T.
2. Th
e rivalling approaches in the starting p-context: On the
subjectlessness of verbs of natural phenomena in Hungarian
2.1 M
agyar értelmező kéziszótár (Concise Explanatory Dictionary
of Hungarian) (Pusztai 2003)
The majority of verbs of natural phenomena, e.g. esteledik ‘[for evening to] close
in’, alkonyodik ‘[for dusk to] set in’, sötétedik ‘[for dark to] grow’, hajnalodik ‘[for
day to] break, dawn’, virrad ‘dawn’, tavaszodik ‘[for spring to] come’, fagy ‘freeze’,
olvad ‘thaw’, villámlik ‘[for lightning to] strike’, havazik ‘snow’ are evaluated as
subjectless in the Magyar értelmező kéziszótár (Concise Explanatory Dictionary
of Hungarian) (Pusztai 2003). The feature of subjectlessness is assigned not only
to the above mentioned verbs but also to those which can be used with explicit
subjects quite easily, e.g. dereng ‘dawn’ – dereng az ég/a hajnal ‘the sky/the dawn
gets lighter’, zuhog ‘pour’ – zuhog az eső ‘the rain pours’, szakad ‘pour’ – szakad
az eső ‘the rain pours’. Furthermore, while items like the verbs borul ‘cloud over’,
dörög ‘thunder’, sötétedik ‘[for dark to] grow’ and be-sötétedik ‘pvb1-[for dark to]
grow’ contain the evaluation “subjectless”, verbs such as be-felhősödik ‘pvb-cloud
over’ and tisztul ‘clear up’ are not considered subjectless in the concise explanatory
dictionary although both groups of verbs can occur with the same explicit subject,
az ég ‘the sky’. The verbs ki-tisztul ‘pvb-clear up’ and pirkad ‘dawn’ are evaluated as
“also subjectless” in the dictionary. In the case of verbs with meanings e xpressing
the way the rain falls, the lexical items of szemerkél ‘drizzle’, csepereg ‘sprinkle’,
csorog ‘trickle’, ömlik ‘pour’ do not include the evaluation “subjectless’, while the
verbs zuhog ‘pour’ and szakad ‘pour’ are considered subjectless, although all of
these verbs can occur with an explicit subject, az eső ‘the rain’. And, finally, among
verbs with meanings concerning the wind such as fúj ‘blow’, fújdogál ‘blow gently’,
fuvall ‘blow gently’, lengedezik ‘blow gently’, süvít ‘bluster’, fütyül ‘whistle’, only the
verb fúj ‘blow’ is evaluated as “also subjectless” although all of the named verbs can
be used without an explicit subject.
In the lexical entries of the above mentioned verbs of natural phenomena
there is no explicit clarification of the notion of subjectlessness and because of the
confusing evaluations regarding the subjectlessness of verbs of natural phenom-
ena there is no possibility to reconstruct this notion on the basis of the diction-
ary’s evaluations. Consequently, on the basis of the Magyar értelmező kéziszótár
(Concise Explanatory Dictionary of Hungarian) one cannot make a plausible
statement about subjectlessness of Hungarian verbs of natural phenomena, i.e. it
cannot be considered a suitable source (Kertész & Rákosi 2012: 64, this volume:
Section 2.2).
The second group involves verbs with an optional explicit subject, e.g. esik az eső
‘the rain falls’, szemerkél az eső ‘the rain drizzles’, dörög az ég ‘the sky thunders’,
be-borul az ég ‘the sky [pvb-]clouds over’, ki-derül az ég ‘the sky [pvb-]clears up’,
sötétedik az ég ‘the sky darkens’, ki-világosodik az ég ‘the sky [pvb-]clears up’. The
structure of data concerning verbs in this group can be reconstructed, for example,
as follows:
The verb esik ‘[for precipitation to] fall’ is a verb of natural phenomena and is not
subjectless.
According to Keszler (2000), the filling of the subject argument position is not
permitted in the first group of verbs. Keszler (2000) argues that meanings of verbs
in the first group concern elements of the general natural environment; therefore,
there is no need or interest to name them explicitly. Furthermore, in some cases
the subject is missing because it is incorporated into the stem of the verb of natural
phenomena. However, it is not clear how to interpret the constraint of not filling
the subject argument position: either the verbs in the first group do not have a
subject argument position at all or they do have such a position but its filling is
prohibited. In the first case, the structure of data concerning verbs of natural phe-
nomena in the first group can also be reconstructed, for example, as follows:
The verb hajnalodik ‘[for day to] break’ is a verb of natural phenomena and it does
not have a subject argument position.
It must be noted that Keszler (2000) emphasises that verbs of natural phenomena
considered subjectless can and even must occur with explicit subjects in their
metaphorical uses, cf. e.g. havazik az élet ‘life is snowing’, az egész világ havazik
‘the whole world is snowing’, csend havazik ‘silence is snowing’, a szeme villám-
lik ‘his/her eyes flash’, villámlik a fekete nappal ‘the black daytime flashes [like
lightening]’.2 If verbs of natural phenomena are subjectless, i.e. one selects the
data structure first proposed in the previous paragraph, it must be explained how
a subject argument position arises in the metaphorical use. If one assumes that
these verbs always have a subject argument position, i.e. their data structure can
be reconstructed along the lines of the second proposal in the previous para-
graph, which must not be filled in non-metaphorical uses, then what constraints
predict the subjectless uses must be accounted for. It is obvious that these two
answers are inconsistent, and this inconsistency cannot be eliminated in Keszler
(2000).3 As for the verbs in the second group, Keszler (2000) allows them to be
used with explicit subjects, i.e. they certainly have a subject argument position.
But this position can be filled optionally, i.e. the verbs in second group can have
an explicit subject argument, and they can also be used without it. According to
this, the structure of data concerning the verbs in this group can also be recon-
structed, for example, as follows:
The verb esik ‘[for precipitation to] fall’ is a verb of natural phenomena and it has
a subject argument position which can be filled optionally.
The p-model argues that data must not be reduced to their information content,
instead, their structure consists of two components: their information content
and their plausibility value. The plausibility value of data stems from their direct
sources (Kertész & Rákosi 2012: 67, this volume: Section 2.2). Now, one can ask
. These examples of metaphorical use can be found in the Magyar Nemzeti Szövegtár
( Hungarian National Corpus) 〈http://corpus.nytud.hu/mnsz〉.
. Keszler (2000) argues that it is also characteristic of verbs in the first group that they
cannot have other complements either. The only exception mentioned by this grammar is the
verb esteledik ‘[for evening to] close in’, which can have an optional adverbial complement
ránk ‘upon us’. However, on the one hand, this optional adverbial complement can appear with
other verbs which have a meaning expressing the passing of days and seasons as well, e.g. ránk
alkonyodik ‘evening is closing in on us’, ránk hajnalodik ‘day is breaking on us’, ránk sötétedik
‘it is getting dark on us’, ránk virrad ‘dawn is breaking on us’, and ránk tavaszodik ‘spring is
breaking on us’. And, on the other hand, the adverbial complement in question can occur with
this group of verbs of natural phenomena in other forms as well, cf. Jól rátok e steledett ‘Evening
has really closed in on you’, Addig dolgoztak, míg rájuk virradt ‘They worked until dawn broke
on them’. I want to thank Anna Fenyvesi for drawing my attention to this p henomenon and
also these two examples.
Hungarian verbs of natural phenomena with explicit and implicit subject arguments
what plausibility value can be assigned to the above reconstructed data? If one can
accept Magyar grammatika (Hungarian Grammar) as a relatively reliable source
on an authoritative basis, then the plausibility value of the above reconstructed
data is relatively high. But if one decomposes and analyses the information con-
tent of the reconstructed data, as has been done in the present article, one can find
inconsistencies. Because of these inconsistencies Magyar grammatika (Hungarian
Grammar) cannot be considered a reliable or even a usable source.
. The term ‘weather verb’ also occurs in the literature, but I prefer using the term ‘verb of
natural phenomena’ because of its wider scope. Komlósy (1994, 2001) and Tóth (2001) (cf.
Section 2.4) use the term ‘weather verb’.
. The abbreviations used in the glosses throughout this chapter are the following: 1 – first
person, 2 – second person, 3 – third person, sg – singular, pl – plural, nom – nominative,
acc – accusative, gen − genitive, indf – indefinite (conjugation), ine – inessive, def – definite
Enikő Németh T.
b. Villám-l-ott.
lightning-sfx-pst.indf.3sg
‘there was lightning’ → ‘appear (lightning)’
c. Sötét-ed-ik.
dark-sfx-prs.indf.3sg
‘it is getting dark’ → ‘arise (darkness)’
(conjugation), sfx – suffix, sup – superessive, prs – present tense, pst – past tense, and pvb –
preverb.
. However, the morphological complexity is not considered a definitive property of the
Hungarian weather verbs by Komlósy (1994).
. Komlósy (1994: 160) argues that adjectival stems might be conceived as nouns in the
roots of weather verbs based on the morphologically unmarked adjective→noun derivation
in Hungarian.
Hungarian verbs of natural phenomena with explicit and implicit subject arguments
Villámlik az ég ‘the sky is lightning’ and sötétedik az ég ‘the sky is getting dark’ are
well-formed expressions in which villámlik as well as sötétedik are weather verbs
and ég is an explicit subject.
If these weather verbs are subjectless, i.e. they do not have a subject slot in their
lexical-semantic representation, it should not be possible to use them with overt,
syntactically realised subjects in metaphorical (cf. a szeme villámlik ‘his/her eyes
are flashing’, villámlik a fekete nappal ‘the black daytime flashes [like lightning]’)
and non-metaphorical senses (cf. villámlik az ég ‘the sky is lightning’ sötétedik az
ég ‘the sky is getting dark’ in Section 2.1 and 2.2). If we take into consideration
these latter data as well and extend and coordinate the p-context of the lexical-
functional approach to Hungarian weather verbs, i.e. confront these data with
Komlósy’s (1994, 2001) proposals, Komlósy’s lexical-functional approach to
Hungarian weather verbs becomes p-inconsistent.
The utterance in (3) makes it possible to form the following plausible statement:
The va-participle *hajnalodva formed from the weather verb hajnalodik ‘dawn’ is
ill-formed, but the ván-participle form of it, hajnalodván is well-formed.
. For the verbs’ behaviour in participial adjunct clauses cf. also Sárik (1998).
Hungarian verbs of natural phenomena with explicit and implicit subject arguments
. For the problems of classification of direct data sources cf. Németh T. (2010).
Enikő Németh T.
(4) a. este-l-ed-ik/alkony-od-ik
evening-sfx-sfx-prs.indef.3sg/dusk-sfx-prs.indef.3sg
a nap-Ø
the day-nom.sg
‘the day is coming to the evening / the day is coming to dusk’
b. sötét-ed-ik a látóhatár-Ø/ az ég-Ø
dark-sfx-prs.indef.3sg the horizon-nom.sg/ the sky-nom.sg
‘The horizon/sky is getting dark.’
c. hajnal-od-ik az idő-Ø
dawn-sfx-prs.indf.3sg the time-nom.sg
‘The day is breaking. [lit. The time is coming to dawn.]’
d. virrad a nap-Ø/ az éjszaka-Ø/
dawn.prs.indef.3sg the day-nom.sg/ the night-nom.sg/
a fény-Ø
the light-nom.sg
‘The day/night/light is dawning.’
e. villám-l-ik az ég-Ø
lightning-sfx-prs.indef.3sg the sky-nom.sg
‘The sky is lightning.’
f. zuhog az eső-Ø
pour.prs.indef.3sg the rain-nom.sg
‘Rain is pouring down.’
Hungarian verbs of natural phenomena with explicit and implicit subject arguments
g. szakad az eső-Ø
pour.prs.indef.3sg the rain-nom.sg
‘Rain is pouring down.’
h. szakad a hó-Ø
pour.prs.indef.3sg the snow-nom.sg
‘It is snowing heavily.’
Of course, one can say that the Hungarian National Corpus is not a reliable source
of data, or the occurrence in a corpus cannot be evaluated as sufficient positive
evidence (Bibok 2009). However, my own intuition, as well as the testing of data by
other native speakers, have made the following statement plausible:
Hungarian verbs of natural phenomena can occur with explicit subjects.
If we also take into account metaphorical uses, then all verbs of natural phenom-
ena can occur with explicit subjects. Let us consider only two verbs – havazik
‘snow’, villámlik ‘[for lightening to] strike’– in their metaphorical uses: havazik
az élet ‘life is snowing’, az egész világ havazik ‘the whole world is snowing’, csend
havazik ‘silence is snowing’, a szeme villámlik ‘his/her eyes are flashing [like light-
ning]’, villámlik a fekete nappal ‘the black daytime is lightning’.
In the light of the data newly introduced into the starting p-context, it can be
seen that verbs of natural phenomena considered subjectless by Keszler (2000) can
occur with explicit subjects. Thus, on the basis of these data one can conclude the
following: (i) verbs of natural phenomena considered subjectless by the H ungarian
grammatical tradition are not subjectless, instead, they can occur with explicit
subjects, and, (ii) Hungarian verbs of natural phenomena cannot be divided into
two groups regarding whether they can occur with explicit subjects or not.
Enikő Németh T.
4. E
xtending the p-context with results of previous research into implicit
arguments in Hungarian
s emantic), discourse, and/or pragmatic evidence (cf. e.g. Németh T. 2000, 2001,
2008, 2010; Bibok & Németh T. 2001; Németh T. & Bibok 2010). Such evidence
testifies to the implicit presence of an argument and attests that the particular
utterance does not have a performance error. In Hungarian there are various
types of implicit arguments and their occurrence in utterances is licensed by
various factors. The separate mention of these various factors in the above men-
tioned definition of implicit arguments indicates the diverse character of the
different types of implicit arguments. However, although the possibilities of the
occurrence of implicit arguments in Hungarian can be explained by various fac-
tors, their intensive interaction should be emphasised. An argument can be left
implicit in Hungarian in the following three cases (Németh T. 2000, 2001, 2008,
2010; Bibok & Németh T. 2001; Németh T. & Bibok 2010):
. It is worth mentioning that my definition of implicit arguments contains, but termino-
logically does not differentiate between, the notions of implicit arguments and some kinds
of unarticulated constituents recently discussed in the semantics and pragmatics literature
(cf. e.g. Taylor 2001; Martí 2006; Recanati 2007). The confrontation of the approaches in the
debate concerning unarticulated constituents with my proposal will be the task of a further
argumentation cycle (cf. Kertész & Rákosi 2012: Section 10.4, this volume: Subsection 2.6.1).
Hungarian verbs of natural phenomena with explicit and implicit subject arguments
5. M
odification of the p-context: The occurrence of verbs of natural
phenomena with implicit subject arguments in Hungarian
On the basis of the theoretical conclusions in the previous sections and also relying
on various kinds of data from different direct sources we can assume that verbs of
natural phenomena considered subjectless in Hungarian grammars are not sub-
jectless. They can all occur with (or without) explicit subjects but differ in what
manner and in what context. The default use of the verbs esteledik ‘[for evening to]
close in’, be-esteledik ‘pvb-[for evening to] close in’, alkonyodik ‘[for dusk to] set in’,
be-alkonyodik ‘pvb-[for dusk to] set in’, hajnalodik ‘[for day to] break’, tavaszodik
‘[for spring to] come’, ki-tavaszodik ‘pvb-[for spring to] come’, fagy ‘freeze’, olvad
‘thaw’, villámlik ‘[for lightning to] strike’, havazik ‘snow’, be-havazik ‘pvb snow in’,
virrad ‘dawn’, pirkad ‘dawn’ etc. is the occurrence without an explicit, lexicalised
subject. However, this does not necessarily mean that these verbs are zero argu-
ment verbs, as is widely assumed in the literature. On the contrary, these verbs
have an argument position in their lexical-semantic representation which can be
filled in with an explicit, lexicalised syntactic subject, but what lexemes can appear
in this subject position is strictly constrained. In the literature concerning implicit
object arguments in various languages a hypothesis has been formulated accord-
ing to which if a verb can have only one specific object or a very limited number
of specific objects, then the object(s) can be predicted on the basis of the particu-
lar verb meaning, and, therefore, be easily omitted (Rice 1988; García Velasco &
Portero Muñoz 2002). For example, in English the verbs of bodily emission such
as sneeze, blow, cry, spit, ejaculate, piss, and vomit, and verbs of contribution such
as contribute, donate, and give (‘contribute’) can occur with implicit direct object
arguments in this manner. In construction grammar Goldberg (2005) argues that
these verbs can be fused with the implicit theme construction since their second
argument position can be filled in by a unique object. The omitted syntactic object
can be identified with a particular kind of bodily emission or contribution which is
indicated in the lexical-semantic representation of these verbs as selection restric-
tions. Pragmatic factors, especially politeness considerations, also motivate l eaving
these arguments lexically unrealised. Lexical-functional grammar also applies a
semantically determined, unique selection restriction to account for characteris-
ing idioms such as to crane one’s neck (Komlósy 2001: 121).
There is no theoretical argument against the extension of this hypothesis to syn-
tactic subject arguments. By means of selection restrictions, the lexical-semantic
representation of verbs of natural phenomena enumerated above unambiguously
and uniquely determines those kinds of natural phenomena (e.g. the Hungarian
equivalents of sky, dawn, day and horizon) that can occupy the s yntactic subject
position in non-metaphorical uses. To put it the other way around, it directly
Enikő Németh T.
f ollows from the selection restriction what kinds of natural phenomena can serve
as syntactic subjects with verbs of natural phenomena; therefore, there is no need
to express them explicitly. Thus, if the verbs in question occur without an overt,
lexically realised subject, then this implicit subject can be unambiguously and
exclusively identified with the information about the particular kind of natural
phenomenon involved in the selection restriction. Consider (5)(a)–(b).
(5) a. Nagy a vihar-Ø. Villám-l-ik.
big the storm-nom.sg lightning-sfx-prs.indef.3sg
‘The storm is big. It is lightning.’
b. Nagy a vihar-Ø. Az egész
big the storm-nom.sg the whole
ég-Ø villám-l-ik.
sky-nom.sg lightning-sfx-prs.indef.3sg
‘The storm is big. It is lightning [in the entire sky].’
In (5)(a) the verb villámlik ‘[for lightning to] strike’ occurs with an implicit subject
argument, while in (5)(b) with a lexically realised one. The occurrence of the verb
villámlik ‘[for lightning to] strike’ with an implicit subject argument is licensed by
the unique selection restriction on argument x in its lexical-semantic representa-
tion: argument x must be az ég ‘the sky’ among the natural phenomena (cf. (6)).
Similarly, in the semantic representation of the verbs esteledik ‘[for evening to]
close in’ and hajnalodik ‘[for day to] break, dawn’ the argument x which can be
realised as a syntactic subject must be day/time and time, respectively: esteledik a
nap/az idő,11 hajnalodik az idő (cf. (7)–(8)).
(6) villámlik: ‘x: ég erős cikázó fényjelenséget mutat’
‘x: sky is showing very bright flashes of light’
(7) esteledik: ‘x: nap/idő (24 órányi időegység) az esti időszakhoz közeledik’
‘x: day/time (24 hour time unit) is approaching evening time’
The possibility to use the verbs villámlik ‘[for lightning to] strike’, esteledik ‘[for
evening to] close in’, hajnalodik ‘[for day to] break, dawn’ with implicit subject
arguments follows from the double interpretability of their lexical-semantic
representations. If they occur with an explicit subject, then the subject argument
is also in the focus of attention, while if these verbs are used with an implicit
. Esteledik a nap (24 hour time unit) and esteledik az idő are dialectal variants.
Hungarian verbs of natural phenomena with explicit and implicit subject arguments
subject, only the particular natural (weather) event is in the focus of attention,
but the lexically unrealised subject argument can also be accessed by means of the
unique selection restriction as background information. The two kinds of occur-
rence of verbs v illámlik ‘[for lightning to] strike’, esteledik ‘[for evening to] close
in’, hajnalodik ‘[for day to] break, dawn’ with an overt or covert subject argument
can be considered a syntactic alternation similar to the occurrence of transitive
verbs’ with or without an explicit object argument (Bibok 2008, 2010). Because of
the uniqueness of the selection restriction on argument x and also the permanent
accessibility of argument x in the lexical-semantic representation as background
information, the verbs villámlik ‘[for lightning to] strike’, esteledik ‘[for evening
to] close in’, hajnalodik ‘[for day to] break, dawn’ as well as be-esteledik ‘pvb-[for
evening to] close in’, alkonyodik ‘[for dusk to] set in’, be-alkonyodik ‘pvb-[for dusk
to] set in’, tavaszodik ‘[for spring to] come’, ki-tavaszodik ‘pvb-[for spring to] come’,
fagy ‘freeze’, olvad ‘thaw’, havazik ‘snow’, be-havazik ‘pvb-snow in’, virrad ‘dawn’,
pirkad ‘dawn’ can be used with implicit subject arguments very economically,
which is why their o ccurrence with a lexically unrealised subject argument can be
considered their default use.
In the semantic literature the meaning of the utterance Villámlik ‘It is light-
ning’ in (5)(a) is accounted for as a thetic statement (Maleczki 2001; Martí 2006;
Recanati 2007). Thetic statements are about a particular situation, they have no
topic constituent. But if a component of a particular situation e.g. a temporal or
locative constituent becomes prominent enough, then it can play the role of the
sentential topic (Maleczki 2001: 169, 179f.), cf. (9)(a)–(b).
(9) a. Tegnap hav-az-ott.
Yesterday snow-sfx-pst.indef3sg
‘Yesterday it snowed.’
b. Galyatető-n hav-az-ik.
Galyatető -sup snow-sfx-prs.indef3sg.
‘At Galyatető it is snowing.’
(10) a. (In the family there is a big row between the father and kids beause of
the kids’ bad behaviour when the mother enters the room.)
Lát-om, nagy a vihar-Ø,
see-prs.def.1sg big the storm-nom.sg
villám-l-ik apa-Ø tekintet-e-Ø.
lightning-sfx-prs.indef.3sg father-gen.sg glance-poss.3sg-nom.sg
‘I can see that there is a big storm, father’s glances are flashing
[like lightning].’
b. ∗Lát-om, nagy a vihar-Ø,
see-prs.def.1sg big the storm-nom.sg
villám-l-ik [apa-Ø tekintet-e-Ø].
lightning-sfx-prs.indef.3sg father-gen.sg glance-poss.3sg-nom.sg
c. (Later, kids among themselves.)
− Lát-t-ad, milyen volt
see-pst-def.2sg what kind be.pst.3sg
apa-Ø tekintet-e-Ø?
father-gen.sg glance-poss.3sg-nom.sg
‘Did you see what father’s glances were like?’
− Igen. Villám-l-ott
yes lightning-sfx-pst.indef.3sg
[apa-Ø tekintet-e-Ø].
father-gen.sg glance-poss.3sg-nom.sg
‘Yes. They [=father’s glances] were flashing [like lightning].’
In (10)(a) az apa tekintete ‘father’s glances’ serves as the explicit subject of the
verb villámlik ‘[for lightning to] strike’ metaphorically used. The metaphorical use
of villámlik with an explicit subject is also supported by the metaphorical use of
vihar ‘storm’ in the utterance context. On the basis of (6) it is obvious that by
means of its selection restriction the verb villámlik requires a unique subject argu-
ment (ég ‘sky’). The lexically realised subject argument in (10)(a) is not identical
with the one required by the selection restriction; consequently, it cannot be left
implicit, which is why (10)(b) is not grammatical. (10)(b) cannot be used with an
implicit subject argument metaphorically either. Keszler (2000: 409) referred to
this phenomenon by formulating a constraint according to which in the metaphor-
ical uses weather verbs must be supplied with explicit, lexically realised subjects.
However, in (10)(c) one can see that the verb villámlik ‘[for lightning] to strike’ can
occur with an implicit subject argument in a metaphorical sense. But this occur-
rence with an implicit subject argument in a metaphorical sense is not licensed
by the verb’s unique selection restriction on argument x and, consequently, by the
Hungarian verbs of natural phenomena with explicit and implicit subject arguments
. The present chapter does not aim to confront these solutions. This can be the task of another
argumentation cycle (Kertész & Rákosi 2012: Section 10.4, this volume: Subsection 2.6.1).
Hungarian verbs of natural phenomena with explicit and implicit subject arguments
Now, let us return to the answers to the question Milyen az idő? ‘What is
the weather like?’ In (11)(b) the verb szállingózik ‘snow softly’ also puts a unique
selection restriction on its subject argument, it must be hó ‘snow’, i.e. x: hó ‘snow’.
This unique selection restriction makes it possible to use the verb szállingózik
without an explicit subject. However, in (11)(c) it is not unambiguous what the
exact reference of the lexically unrealised subject argument is. In the lexical-
semantic representation of the verb szitál ‘mizzle’ the unique selection restriction
concerning argument x is a two-element-set from which one has to select: x: {köd
‘haze’; eső ‘rain’}. The particular occurrence of the verb szitál with an implicit
subject is licensed by the interaction between the verb’s lexical-semantic repre-
sentation, including the selection restriction and information from the observ-
able physical context. If one does not know the physical environment, then one
cannot select from the two possibilities predicted by the selection restriction.
However, this does not mean that (11)(c) becomes a syntactically ill-formed
utterance, but rather that two meanings can be assigned to it. There are Hungarian
native speakers according to whose intuition the acceptability of (11)(b) and (c)
is questionable. Their intuition always requires an explicit subject with the verbs
szállingózik ‘snow softly’ and szitál ‘mizzle’ without a specific context. In this case
the lexical-semantic representation of the verbs szállingózik and szitál do not con-
tain a unique selection restriction requiring the particular kind of precipitation
they can occur with, i.e. this piece of encyclopaedic information has not yet been
built into the lexical-semantic representation of these verbs. It must be noted that
this compulsory-subject-analysis supports the hypothesis that weather verbs can
occur with an explicit subject to a great extent.
The utterance in (11)(d) is not grammatical. Although in the use of the verb
hull ‘[for precipitation to] fall’ as a weather verb it is obvious what kind of pre-
cipitation can fall, i.e. e.g. eső ‘rain’, hó ‘snow’ and zúzmara ‘rime’, it cannot be
used without an explicit subject in a sentence environment. It can be assumed
that there is no unique selection restriction built into the lexical-semantic repre-
sentation of hull ‘[for precipitation to] fall’ which can predict what precipitation
can fall, therefore it cannot occur with an implicit subject argument in the first
(A) manner (cf. Section 4.1), only relying on its lexical-semantic representation.
However, if one answers the question Milyen az idő? ‘What is the weather like?’
with the utterance Nagy pelyhekben hull [a hó] ‘It is snowing in large flakes’, then
the use of hull ‘[for precipitation to] fall’ with implicit subject argument becomes
acceptable. The encyclopaedic information stored under the conceptual address of
hull ‘[for precipitation to] fall’ and the other lexemes in the utterance license the
omission of the subject argument a hó ‘the snow’; i.e. hull ‘[for precipitation to]
fall’ can occur with an implicit subject argument in the second (B) manner. Hull
‘[for p recipitation to] fall’ can be used with an implicit subject argument in the
third (C) manner as well, cf. (12).
Enikő Németh T.
In (12), in the question the verb szállingózik ‘snow softly’ occurs with an implicit
subject argument in the first (A) manner by means of the selection restriction.13
This question and the implicit subject argument [a hó ‘the snow] within it provide
the discourse context for the answer to the question. In the answer the implicit
subject argument [a hó ‘the snow] occurs with the verb hull ‘[for precipitation
to] fall’ in the third (C) manner as a zero anaphor, co-referential with the implicit
subject argument in the question.
Let us continue the analysis examining the utterance in (11)(e). The lexical-
semantic representation of the verb esik ‘[for precipitation to] fall’ does not put a
unique selection restriction on its subject argument, instead the selection restriction
constrains the type of the subject argument: x: csapadék ‘precipitation’, cf. (13).
(13) esik: ‘x: csapadék hull’
‘x: precipitation falls’
In (11)(e) the lexically unrealised subject argument can be identified with the type
csapadék ‘precipitation’. If a particular token of the type csapadék, i.e. a s pecific
kind of precipitation is referred to, then it must be lexicalised, cf. (14).
(14) Es-ik az eső-Ø/ hó-Ø/ jégeső-Ø/
fall-prs.indef.3sg the rain-nom.sg snow-nom.sg hail-nom.sg
hó-dara-Ø/ köd-Ø/ permet-Ø/ harmat-Ø/
snow-sleet-nom.sg haze-nom.sg sprinkle-nom.sg dew-nom.sg
dér-Ø/ zúzmara-Ø.
frost-nom.sg rime-nom.sg
‘Rain/snow/hail/hoar-frost/haze/a sprinkle of rain/dew/frost/rime is falling.’
The particular kinds of the type of precipitation enumerated in (14) show that the
type csapadék ‘precipitation’ has a prototypical structure. This prototypical struc-
ture also influences the occurrence of the verb esik ‘[for precipitation to] fall’ with
implicit subject arguments. The more typical the type of precipitation is, the easier
. Of course, the information from the weather forecast may also influence the use of
szállingózik ‘snow softly’ with implicit subject argument.
Hungarian verbs of natural phenomena with explicit and implicit subject arguments
it is to use the verb esik ‘[for precipitation to] fall’ with an implicit s ubject argument
without a specific physical or discourse context. Since in H ungarian geographical
circumstances the prototypical csapadék ‘precipitation’ is rain, without a specific
physical or discourse context the reference of the implicit subject argument in
(11)(e) can be identified with az eső ‘the rain’.14
Finally, it is worth analysing the verb havazik ‘snow’ in some more detail.
The verb havazik ‘snow’ can occur with an explicit subject argument as well,
cf. (15)(a)–(c).
(15) a. ?A hó-Ø hav-az-ik./
the snow-nom.sg snow-sfx-prs.indef.3sg/
?Hav-az-ik a hó-Ø.
snow-sfx-prs.indef.3sg the snow-nom.sg
‘The snow is snowing.’
b. A rég vár-t hó-Ø meg-áll-ás-Ø nélkül,
the long wait-ptcp snow-nom.sg pvb-stop-sfx-nom.sg without
gyönyörű-en hav-az-ik.
beautiful-sfx snow- sfx-prs.indef.3sg
‘The long awaited snow is snowing beautifully and continuously.’
c. A múl-t század-i hó-Ø még nagy
the last-ptcp century-sfx snow-nom.sg still large
pelyh-ek-ben hav-az-ott.
flake-pl-ine snow-sfx-pst.indef.3sg
‘The snows of the last century were snowing in large flakes.’
This analysis can be extended to the verb havazik ‘snow’ and the noun hó ‘snow’.
The acceptability of (16)(b)–(c) can be accounted for as follows: the verb havazik
is complemented by an adverb, and the noun hó is complemented by an adjective.
The verb havazik ‘snow’ cannot occur with an implicit subject argument other
than a hó ‘the snow’. In its lexical-semantic representation the verb havazik puts a
unique selection restriction on its argument x: hó ‘snow’, cf. (17).
(17) havazik: ‘x: hó esik’
‘x: snow falls’
Since the verb havazik ‘snow’ is morphologically derived from the noun hó ‘snow’,
predicted by the selection restriction to be the unique subject argument, the mor-
phological structure of havazik ‘snow’ is transparent. This transparent morpho-
logical structure, as well as its phonetic form, easily recall the subject argument
hó predicted by the selection restriction. Therefore, the explicit occurrence of hó
‘snow’ is disturbingly redundant with the verb havazik ‘snow’. Of course, in meta-
phorical uses the verb havazik ‘snow’ also occurs with explicit subjects as we have
seen above in Section 3: havazik az élet ‘life is snowing’, az egész világ havazik ‘the
whole world is snowing’, csend havazik ‘silence is snowing’.
Consider also (18).
(18) Hav-az-ik [a hó-Ø]. Nagy-on szép-en
snow-sfx-prs.indef.3sg the snow-nom.sg very-sfx nice-sfx
es-ik [a hó-Ø].
fall-prs.indef.3sg the snow-nom.sg
‘[Snow] is snowing. [The snow] is falling very nicely.
The possibility that in the second utterance in (18) one can refer to a hó ‘the snow’
by a zero anaphor supports the plausibility of the analysis assuming an implicit
subject argument in (18) and weakens the plausibility of another widely accepted
analysis in traditional Hungarian grammars according to which the noun hó
‘snow’ is incorporated into the verb havazik ‘snow’.
6. Th
e resolution of the starting p-problem in the modified p-context:
The advantages of the analysis of verbs of natural phenomena
with implicit and explicit subject arguments
Starting out from the wide spectrum of data neglected by previous approaches, in
this study I have proposed a lexical-semantic representation with a subject argu-
ment position for the verbs of natural phenomena analysed as subjectless verbs
in Hungarian grammatical tradition as well as in lexical-functional grammar.
Hungarian verbs of natural phenomena with explicit and implicit subject arguments
The verbs of natural phenomena constrain their subject argument very strictly
by means of the elements in their lexical-semantic representation such as selec-
tion restrictions, information on the characteristic manner of natural events, and
the prototypical structure of the categories. The lexical-semantic representation of
these verbs can be interpreted in two ways. Either only the natural events denoted
by the verbs are in the focus of attention, or their subject is as well. When only the
natural events are in the focus of attention, the subject arguments can be left lexi-
cally unrealised. Verbs give access to their lexical-semantic representations with
their argument structure. If the subject argument is left implicit in the first (A)
manner, it can be identified with the information provided by the constraints the
selection restriction imposes on the subject argument as background informa-
tion. Since in the case of verbs of natural phenomena what can fill in the subject
position is strictly, or even uniquely, constrained by the selection restrictions, the
subject argument should not be lexically realised. Therefore, the economical and
default occurrence of verbs of natural phenomena is the use without an explicit
subject, i.e. with implicit subjects. At the same time, if the subject argument is also
in the focus of attention or the particular subject argument is not the prototype
of the category required by the selection restriction or the speaker wants to refer
to a specific token of the category, then the subject argument must be explicitly
expressed. Similarly, in metaphorical uses, the verbs of natural phenomena cannot
occur with a lexically unrealised subject argument in the first (A) manner either.
Sometimes the background information in the lexical-semantic representation
of a verb of natural phenomenon by itself cannot license an argument to be left
implicit. In these cases the lexical-semantic representation of the verb interacts
with the context.
The main advantages of the proposed solution are as follows. First, it can
explain the occurrences of verbs of natural phenomena without and with an
explicit subject argument in a unified way. Second, it can account for a wider
spectrum of data than the rivalling approaches in the starting p-context could, i.e.
it cancels the p-inconsistency in the extended p-context. Third, assuming a sub-
ject argument position in the lexical-semantic representation of verbs of natural
phenomena, the metaphorical uses can be handled in the same way as the non-
metaphorical occurrences, i.e. further p-inconsistency and p-incompleteness in
the starting and extended p-context can be eliminated. Fourth, the occurrences of
verbs of natural phenomena with implicit subject arguments can be treated simi-
larly to the uses of verbs with implicit direct and indirect objects. And, fifth, the
occurrences of verbs of natural phenomena with explicit and implicit subjects can
be considered syntactic alternations similarly to the non-objectless and objectless
uses of transitive verbs. This supports the proposition of this kind of a lexical-
semantic representation for verbs of natural phenomena from the syntactic point
Enikő Németh T.
of view. All these advantages increase the explanatory power of the implicit subject
argument analysis of verbs of natural phenomena.
To summarise: confronting the proposed implicit subject analysis with
the rivalling approaches in the starting p-context, it can be established that the
new analysis is more plausible than the previous ones on the basis of the newly
introduced data from various direct sources as well as theoretical statements listed
above as advantages. Therefore, the new solution can be considered as a resolution
of the p-problem in the starting p-context and the newly introduced data from dif-
ferent direct sources serve as strong evidence for it (cf. Kertész & Rákosi 2012: 133,
this volume: Section 2.5).
Acknowledgements
The research reported on in the present chapter was supported by the MTA-
DE Research Group for Theoretical Linguistics and the TÁMOP-4.2.2/B-10
/1-2010-0024 project, which is co-financed by the European Union and the
European Social Fund. I would like to express my gratitude to András Kertész,
Csilla Rákosi and Károly Bibok for their comments and suggestions which
helped me to clarify some points. I wish to thank Anna Fenyvesi for improving
my English.
References
Bartos, Huba. 2000. Az inflexiós jelenségek szintaktikai háttere (The syntactic background
of inflectional phenomena). In Stukturális magyar nyelvtan, 3: Morfológia (Structural
Hungarian Grammar, 3: Morphology), Ferenc Kiefer (ed.), 653–762. Budapest: Akadémiai
Kiadó.
Bibok, Károly. 2008. Az igék szemantikája és a szintaktikai alternáció (The semantics of verbs and
syntactic alternation). In Strukturális magyar nyelvtan, 4: A szótár szerkezete (Structural
Hungarian Grammar, 4: Structure of the Lexicon), Ferenc Kiefer (ed.), 23–70. Budapest:
Akadémiai Kiadó.
Bibok, Károly. 2009. Az evidencia problémája az elméleti nyelvészetben (The problem of evi-
dence in theoretical linguistics). Talk presented at the conference Kutatócsoportjaink (Our
research groups), Szeged.
Bibok, Károly. 2010. From syntactic alternation to lexical pragmatics. In The Role of Data at the
Semantics-Pragmatics Interface, Enikő Németh T. & Károly Bibok (eds), 261–304. Berlin:
Mouton de Gruyter.
Bibok, Károly & Németh T., Enikő. 2001. How the lexicon and context interact in the mean-
ing construction of utterances. In Pragmatics and the Flexibility of Word Meaning, Enikő
Németh T. & Károly Bibok (eds), 289–320. Amsterdam: Elsevier.
Hungarian verbs of natural phenomena with explicit and implicit subject arguments
The main objective of this chapter is to find a category system that is the most
capable of describing Hungarian disagreement based on certain criteria set
beforehand. As none of the rival approaches to disagreement to be introduced
in this chapter seem to satisfy each criterion, a new taxonomy is proposed.
Distinguishing between strategies and pragmatic force modifiers (PFMs), this
new category system is more reliable and more capable of capturing the complex
nature of (Hungarian) disagreements that the rival models. A detailed description
of the metatheoretical research is provided including a step-by-step description
of the analytical procedure(s) employed. By describing each step of the research
along with the decisions and re-evaluations of the information at hand as well as
the p-context, the chapter demonstrates the cyclic, retrospective and prismatic
nature of the plausible argumentation process.
To date, the majority of research carried out on disagreement has focused on the
English language and investigated the linguistic manifestation of disagreement
while ignoring its functional spectrum. The primary aim of the present chapter
is to find a category system that is the most capable of describing Hungarian dis-
agreement. The specific aim of my chapter is twofold: (1) to examine how ver-
bal disagreement is accomplished linguistically in the mixed-sex conversations of
Hungarian undergraduate students and (2) to identify the functional categories
of disagreements occurring in the research corpus. The present study proposes
a framework that is more capable of capturing the complex nature of (Hungar-
ian) disagreements and can account for a wider range of occurrences than other
rival models. In the course of the development of the framework, the steps of the
argumentation process are reconstructed highlighting its cyclic, retrospective, and
prismatic nature (cf. Kertész & Rákosi, this volume: Subsection 2.6.1).
Helga Vanda Koczogh
At this point it is important to explain what I mean by the term verbal dis-
agreement. Verbal disagreement is defined in this chapter as a situated activity
whose function is to express an opinion (or belief) the propositional content or
illocutionary force of which is – or is intended to be – partly or fully inconsistent
with that of a prior (non-verbal) utterance.1
The structure of the chapter is as follows. Section 2, provides a brief over-
view of major taxonomies of disagreement. These rival models serve as the start-
ing p-context (ibid.: Subsection 2.6.1) for my research, i.e. they constitute the
first step of the argumentation process. The starting p-context is problematic,
since it contains too much information about the categorisation of disagree-
ment. Thus, the starting p-context is informationally overdetermined (ibid.:
Section 2.5). S ection 3 gives a step-by-step account of the multi-step and multi-
method strategy that is adopted in order to address the research issues. The
starting p-context is extended by a corpus of spoken discourses and the section
provides a detailed description of a proposed taxonomy of disagreement that is
set up based on data from several direct sources. This extended p-context could
not resolve the problems identified in the starting p-context. Thus, in Section
4, a new argumentation cycle is started and the p-context is extended further
with new research data. This extended p-context is not satisfactory either, as it
inherited some weaknesses of the previous models. I prismatically re-evaluate
the p-context, revisit my taxonomy and analyze disagreements from a different
perspective, which results in a modified version of the framework. I argue for
this new version of the taxonomy in Section 5, which distinguishes between
disagreement functions (strategies) and certain linguistic devices (PFMs). This
framework is more complete and reliable than its previous version or the other
p-context versions described earlier. Consequently, the new taxonomy can be
regarded as a resolution of the starting problem in the starting p-context (ibid.:
Section 2.5).
As already mentioned in the previous section, the global aim of the chapter is to
find the model that is the most capable of describing Hungarian disagreement.
Thus, I raise the following problem:
(P) Which category system allows for describing Hungarian disagreement?
In this chapter, I will look for the category system that can satisfy the above-listed
criteria to the greatest extent; that is, the model that offers the best solution to (P).
In the literature of conflict talk, various types of disagreement have been identified
by different researchers based on different criteria. In what follows, I will introduce
the four most detailed and presumably most influential models of disagreement
from contrastive and evaluative points of view. This means that I will introduce,
analyse and evaluate the four models separately as rival p-context versions within
the starting p-context.
. I am aware that using our intuitions in linguistic analysis is indispensable and unavoid-
able; however, it might undermine the reliability of the analysis as it cannot be considered as
a completely reliable source of data. Hence, using intuition as the sole basis of analysis is not
viable.
Helga Vanda Koczogh
Muntigl and Turnbull also note that these acts can co-occur, with the most fre-
quent combination being contradictions followed by counterclaims. In my view,
irrelevancy claims cannot be considered disagreements, as they do not contradict
(part of) the previous utterance, but rather call into question its relevance to the
topic under discussion. Another weakness of the model is that it consists of a very
limited number of categories, as a result of which it cannot possibly account for all
occurrences of disagreement. For instance, the frequently used partial agreement
(Yes, but…) does not fit any of the categories listed by Muntigl and Turnbull. Thus,
p-context version 1 does not fulfil criterion (a) described in the previous section.
Disagreement not
softened or Contradictory statement
strengthened
(30% of turns) Verbal shadowing
Rhetorical question
Aggravated
Intensifiers
disagreement
Personal, accusatory you
(8% of turns)
Judgemental vocabulary
Rees-Miller (1995, 2000) takes Brown and Levinson’s (1987) model of polite-
ness as the underlying point of departure for her classification of disagreement
The development of a taxonomy of verbal disagreements in the light of the p-model
expressed by a question with the intention of challenging the other person. Rees-
Miller’s model cannot account for an utterance like that, thus, it does not meet cri-
terion (a) listed in Section 2. Moreover, the taxonomy disregards the circumstance
that a given linguistic item can have several functions. For instance, I think can
be a hedge or a booster depending on such factors as context and/or intonation.
Considering all the above, we can see that p-context version 2 does not offer a
solution to (P) either.
Whilst the above categorisation must be credited for being the first attempt to
identify the linguistic strategies used for expressing disagreement in Hungarian,
it cannot be denied that it has several flaws. First, some of the categories (e.g.
strategies (2)–(4)) are used as if they were self-evident without any explanation of
their meaning. Second, the difference between some of the strategies is not clear
and the examples provided by the speaker give little help differentiating among
the categories and clarifying the boundaries. For instance, the utterance Először
orvoshoz kellett volna menned. (~ You should have gone to the doctor first.) can
also be interpreted as criticism which includes a negative evaluation on the past
action/behaviour of the other interactant. Thus, it could be easily categorised as a
judgement/qualification as well. Furthermore, the functional categories seem to
be intuitive. Intonation can have a meaning differentiating role in the examples
listed above; however, Bándli (2009) does not discuss the prosodic features of her
examples. Moreover, the examples she provides for illustrating the identified strat-
egies are out of context, which makes it almost impossible to judge the accuracy
of the categories, since the reader is not familiar with the preceding utterance with
Helga Vanda Koczogh
. It has to be mentioned that mapping the pragmatic competence of Roma and Hungarian
primary school children, Mászlainé Nagy (2010) also touches upon the differences in the ex-
pression of disagreement utilised by the two groups. Based on Brown and Levinson’s (1987)
politeness theory, she identifies three broad categories of disagreements (‘bare disagreement’
strategy, ‘disagreement with recovery’, ‘indirect disagreement’ strategy), but provides no defini-
tion of them.
The development of a taxonomy of verbal disagreements in the light of the p-model
. For a more detailed account of the research design including methodological consider-
ations see Koczogh (2012).
Helga Vanda Koczogh
. The following intonational symbols are adapted from Varga (1985) and Németh T. (1996)
in a simplified form: /● – rising intonation, \● – falling intonation, /\● – rise-fall intonation,
\/● – fall-rise intonation, /\/● – rise-fall-rise intonation, ║ – intonation phase boundary.
. The functions of disagreement reported in previous research were revised and comple-
mented on the basis of the patterns found in the research corpus.
The development of a taxonomy of verbal disagreements in the light of the p-model
3. Joke none
4. Ask questions none
5. Hedge Hát…, / Well,
-hat, -het / can, could
(Hát) szerintem / (Well), I think
valamennyire / to some extent
lehet, hogy / maybe/perhaps
m:, ö: / uhm:, u:h
6. Impersonalise S to H, point- Azt mondják… / It is said that
of-view distancing Egyesek szerint… / According to some people…
7. Interject M:?, Ha!, Á:! / U:hm?, Ha!, A:h!
8. Contradictory statement negation (Ez nem igaz!, Ne!, Nem. / This is not true!,
No!, Not…)
linking word expressing contrast (De/Viszont…, /
Neutral disagreement
But/However,…)
9. State disagreement Én ezzel nem értek egyet. / I don’t agree with this.
Ebben nem értünk egyet. / We don’t agree on this.
10. Clarify S’s meaning Nem erre gondoltam. / I didn’t mean that.
Nem úgy… /Not that way…
Nem arra értem. / I didn’t mean it that way.
Én nem erről beszélek. / I’m not talking about that.
11. Verbal shadowing none
12. Use mockery (irony, sarcasm) emotive language (Ó:, igen! / Yea:h, right., Az igen!
/ Wow!)
13. Intensify disagreement egyáltalán nem / not at all
sokkal / much more
Strengthened disagreement
abszolút / absolutely
teljesen / totally
akkor is / still
mondom / I’m telling you
14. Ask rhetorical questions none
15. Challenge interjections
16. Judge Judgemental vocabulary
(hülyeség/marhaság/baromság, butaság / nonsense,
rubbish, bullshit, silly,
seggfej / asshole)
17. Show disbelief, play down Á: / A:h
previous statement Ugyan már! / Come on!
Helga Vanda Koczogh
Example 19
M03 [Hát a Vőlegény és a Bölcs között vacilál]tam, de a Vőlegény
szerintem az, aki aki igazából az egész történetben semmi semmi
rosszat nem tett.
[Well I was hesitating between the Wiseman and the Fiancé], but
I think the Fiancé is the one who who actually didn’t didn’t do
anything wrong in the whole story.
→ F03 \●Jó de a \●Bölcs se║. \●Sőt ő ő \●jó tanácsot adott neki║.
Fine, but neither did the Wiseman. Actually, he he gave him good
advice.
. Linguistic devices with the function of softening the force of face threat to the addressee.
. Speakers in the research corpus were given a code consisting of a capital letter indicating
the sex of the speaker, and a two-digit number that is identical in the case of conversational
partners.
. The transcription conventions (modified from Tannen 1984 and Du Bois et al. 1992) used
in the examples are as follows:. – falling intonation with a conclusion point;, – continuing
intonation; ? – rising or question intonation; ! – strong feelings, high volumes, sentence–final
exclamatory intonation; ??? – unclear or unintelligible words;:vowel/consonant – lengthening
of sound; … – short pause; … … – medium or long pause; @@@ – laughter; TEXT – emphatic
stress; “text” – direct quotations read out loud from the task sheet; [text] – speech overlap;
(text) – transcriber’s descriptive comments on speech quality (i.e. irony, sarcasm, etc.); {text} –
transcriber’s comments added for clarity; → – lines relevant to the analysis at hand.
The development of a taxonomy of verbal disagreements in the light of the p-model
In this example, F03 begins her utterance with an agreeing jó (~ fine) before con-
tradicting M03’s utterance and elaborating on this contradicting statement.
Token agreement is a polite disagreement disguised as an agreement. It occurs
when the speaker appears to agree with the previous speaker in order to save
the addressee’s positive face. It is reportedly one of the most common strategies
(Bándli 2009; Brown & Levinson 1987; Holtgraves 1997; Locher 2004; Rees-Miller
1995, 2000) and often expressed the same way as partial agreement.10
Another softened disagreement strategy identified in my corpus is explana-
tion. This umbrella term refers to an utterance that gives or asks for a reason or
example to indicate that the speaker cannot accept the previous speaker’s proposi-
tion. In Example 2, the female speaker contradicts the previous utterance indi-
rectly, by giving a counter-example:
Example 2
Talking about gender differences in cooking skills
M05 A nők sokkal jobban el tudják találni szerintem az ízeket is.
I think women are much more sensitive to flavours.
F05 \●Hát mondjuk … /●például a halászlét azt \●nálunk általába a
→
\●Fater főzi║.
Well, let’s say … for example fish soup is cooked by my father in
my family.
When asking for a reason, the speaker’s interrogation is frequently preceded by the
contrastive marker de (~ but) and/or accompanied by a critical or doubtful tone.
Although the contrastive marker de expresses opposition in itself, these utterances
would generally be identified as disagreement either because of the intonation or
the divergent propositional content. The following example illustrates this point:
Example 3
M12 Én egyetértek ezzel az állítással.
I agree with this statement.
→ F12 \●De MIÉR║?
But WHY?
In the last line the emphatic MIÉR (~ WHY) alone clearly conveys the message of
disagreement, which is strengthened by the use of de (~ but). The falling intona-
tion on the question also reinforces disagreement.
. Since token agreements and partial agreements are closely-related concepts with only a
thin borderline between them, I grouped them under the same category.
Helga Vanda Koczogh
In this extract speaker M05 uses humour to buffer the effect of his disagreement
and utters his opinion in a non-serious, joking manner accompanied by laughter.
This utterance creates a light-hearted atmosphere and results in the laughter of the
listener, which indicates that an atmosphere of camaraderie has been established
through the use of joking. There is no identifiable linguistic item that generally co-
occurs with the strategy of joking, but the analyst can rely on other features such
as laughter and tone.
Expressing objection in the form of a question rather than a direct statement
is a softened disagreement strategy noted by both Rees-Miller (1995, 2000) and
Locher (2004). As explained by the latter (2004: 133), the indirect strategy of using
an interrogative form for implying disagreement can “express relational work and
hence face considerations (for S and H) of various degrees if it lacks the quality of
The development of a taxonomy of verbal disagreements in the light of the p-model
In Rees-Miller’s (1995) study, the use of interrogative form is the second most
common mitigating device. It occurs in two forms: yes/no questions and seem-
ingly information-seeking wh-questions used to express disagreement. The sec-
ond type, as well as negative questions, is reported to have been used for presenting
contradictory evidence or competing alternatives (pp. 100–105). I also consider
the use of questions (other than requests for reasons or examples and rhetorical
questions) a softened disagreement strategy, however, I categorise negative ques-
tions as challenges since they express a higher degree of adversarialness and they
question the other interlocutor’s stance. Similarly to joking, there is no identifiable
linguistic marker of the strategy of using a question, only the opposing proposi-
tional content and sometimes the tone of the question yield some clues.
Hedges can also be used to mitigate disagreement. The classification of miti-
gating devices proposed by Rees-Miller (1995) outlines the following four groups:
personal pronoun I followed by a cognitive verb such as think, question forms,
downtoners (e.g. sort of, a little, sometimes), and verbs of uncertainty (e.g. may,
seem). Locher (2004) reports hedging to be the most common softened disagree-
ment strategy in her research and provides the following exhaustive list of hedges
as an extension of Aijmer’s (1986) list:
actually, anyway, as it were, basically, a bit, certainly, honestly, I mean, I think, in
a way, in fact, just, kind of, let me, little, maybe, more or less, of course, perhaps,
probably, say, see, so-called, somehow, sort of, stuff, suppose, type of, uh, uhm, well,
whatever, what you call; or and and in final position. (p. 115)
It is important to note here, however, that although these linguistic items do tend
to lessen the impact of opinion and disagreement on the listener, there is no direct
correspondence between form and function. Some of these devices (e.g. discourse
markers) are multifunctional, so they can fulfil several functions based on the con-
text. Hence, it is crucial to interpret the meaning and function of the given item in
Helga Vanda Koczogh
its context in each case. To illustrate this point, there are linguistic devices which
can act both as a hedge and as a booster, a point also noted by Holmes (1995: 78).
Hedges and boosters can combine with each other to soften or strengthen the
force of the utterance more effectively, a phenomenon reported by other research-
ers (e.g. Holmes 1995; Locher 2004; Rees-Miller 1995, 2000).
The last but one softened disagreement category includes the strategies of
impersonalise speaker and hearer and point-of view distancing. These strategies are
similar in the sense that they both aim at shifting responsibility, that is, making an
utterance appear as coming from a different source. This can be achieved by using
passive structures (see Example 7) or general subjects (see Example 8) instead of
the first person pronoun I. These negative politeness strategies mitigate disagree-
ment as well as protecting the face of the speaker. Locher (2004: 130) calls this
strategy ‘shifting responsibility’ and argues that its advantage in a conversation is
that “the content of what a speaker just said might be debatable, but the person as
such is not as exposed to criticism as when he or she had reported the content as
his or her own point of view”.
Example 7
M05 Szerintem a férfiak SOKKAL intelligensebbek a nőknél.
I think men are MUCH MORE intelligent than women.
→ F05 De ö: /●azt mondják║ hogy azér egy nő /●tud több dologra figyelni
egyszerre║ … \●jobban használja agyának a ké /●mindkét felét║…
But u:h it is said that women are able to pay attention to more things
at the same time … they use both of both sides of their brain…
Example 8
F01 Szerintem mindenkinek be kellene oltatnia magát H1N1 ellen.
I think everyone should get themselves vaccinated against H1N1.
M01 \●Egyesek szerint /●csak azt oltsák be aki aki aki /●veszélynek van
kitéve║ tehát a /●terheseket║ az /●időseket║ és aki \●veszélyeztetett
területen dolgozik║.
According to some people only those should be vaccinated who
who who are at risk, that is, pregnant women, the elderly, and those
who work in a risky area.
Example 9
F03 És nézd meg a világ legjobb … ö:… legjobb szakácsait, [szinte
mind férfi].
And look at the world’s best … u:h … best chefs, [almost all of
them are male].
M03 [Na jól van]
[Ok fine]
F03 És ott van XY milyen jól főz!
And there’s XY, what a great cook he is!
→ M03 /●M:? (szkeptikus)
U:hm? (sceptical)
Here the sceptical indirect answer m: (~ u:hm)11 uttered with a rising intonation is
considered to be more polite than simply stating Nem. (~ No, he isn’t).
. Hums in Hungarian serve a broad range of discourse functions such as signalling agree-
ment, disagreement, surprise, hesitation, doubt, scepticism, urge, etc. depending on their pro-
sodic features and the context of talk. For instance, while affirmative hums are associated with
a descending-rising intonation, interrogative hums feature rising intonation (Markó 2007: 30).
For a functional analysis of humming in Hungarian see Markó (2005, 2007).
Helga Vanda Koczogh
In the above example, speaker M09 expresses his disagreement with the contra-
dictory marker de (~ but) in utterance-initial position followed by a statement
whose propositional content expresses disagreement. Contrastive markers, by
themselves, express opposition (ebbe nem lehetsz biztos szerintem ~ you can’t be
sure of that, I think). In this case de (~ but) signals contrast, but without its use the
utterance would still be recognised as disagreement.
The third subtype of contradictory statement is when the utterance, by itself,
“does not bear any markers of disagreement. However, in the context in which
it occurs, it contradicts a previous utterance” (Rees-Miller 1995: 116). I would
add to this definition that, naturally, this type of disagreement does not neces-
sarily have to contradict the whole of the previous proposition; the contrasting
information can refer to only a part of it. In this type of disagreement, contradic-
tion is not stated but implied and so understanding the illocutionary force often
involves more mental processing on the part of the listener. Consider the following
example:
Example 11
M15 Főzési tudományom a tojásrántottában ki is merül.
My knowledge of cooking ends with scrambled eggs.
F15 De abba teszel jó sok mindent, igaz?
But you put a lot of things in it, right?
→ M15 \●Tojást║.
Eggs.
In the last line of this interchange the disagreement is expressed with a single word
(Tojást. ~ Eggs.) without any linguistic markers of opposition. It is an indirect way
The development of a taxonomy of verbal disagreements in the light of the p-model
of saying No, I put only eggs in it, and is understood by virtue of the Gricean maxim
of relevance.12
Another neutral disagreement strategy is what I term stating disagreement.
Stating disagreement, as it is used here, means an utterance that explicitly states that
the speaker disagrees with the previous proposition, as illustrated by Example 12.
Example 12
F02 “Az új járványok ellen mindenkinek feltétlenül be kellene oltatnia
magát.”
“Everyone should definitely get themselves vaccinated against new
diseases.”
→ M02 Hm /●nem értek vele egyet \●egyáltalán║.
Uhm I don’t agree with this at all.
This type of disagreement always contains the expression nem ért egyet, which
contains the performatives agree or disagree in its English translation, thus con-
veying the message with maximum efficiency. Interestingly enough, this category
does not appear in either Rees-Miller’s (1995, 2000) or Locher’s (2004) schema at
all. This might be due to the relative infrequency of the use of this strategy in the
context of verbal interactions in English.13 However, what is more remarkable is
that Bándli (2009) and Mászlainé Nagy (2010) do not make mention of this dis-
agreement strategy in Hungarian either.
Clarifying speaker’s meaning is another recently discovered disagreement
strategy emerging from the research corpus that does not appear in previous liter-
ature. It has the function of clarifying the misunderstood meaning of the speaker’s
previous utterance, which therefore contradicts or corrects the other interlocu-
tor’s previous proposition. This strategy has a constructive effect on conversation,
as it helps to avoid communication breakdowns and facilitates the smooth flow
of interaction. In the following example taken from a dispute on euthanasia, the
female speaker (F04) disagrees with her boyfriend (M04) twice because she wants
to explain what she meant.
. Speaker M15’s brief but informative utterance also observes the maxims of quantity and
manner.
. I investigated the occurrence of the expressions I don’t agree, I do not agree and I disagree
in the British National Corpus (BNC) Spoken and in the corpus of US TV Talk (with the help
of Corpus Concordance English (CCE) program) with a volume of 100 million and 2 million
words respectively. My queries resulted in zero hits in CCE for each of the searches, while I
found no tokens for the first, a single token for the second, and only 20 tokens for the third
expression in BNC Spoken. The contexts of hits were mainly council meetings, teacher’s con-
ferences, court hearings and public debate programs.
Helga Vanda Koczogh
Example 13
M04 De én azt mondom, hogy ehhez egy nagy adag pesszimista
világnézet is kell. Le kell tojni önmagunkat néha ahhoz, hogy ö:
… hogy hogy ilyen ilyen estekben hogy úgymond ‘meg akarok
halni, mert izé ronda vagyok, meg itt a púp a hátamon’. (irónikus)
But I say that for this you need a great deal of pessimistic
worldview. Sometimes we need to get over ourselves so that u:h …
that that in such such cases like ‘I want to die because I’m ugly and
there is a hunch on my back’. (ironical)
→ F04 De /\●nem úgy Maci║ én \●nem erről beszélek║.
But it’s not that Honey Bear, I’m not talking about that.
M04 Tudom, ilyen rokkant esetekbe is, hogy … például…mi miről is
akar …
I know in the cases of disabilities that … for example… wha what
did I want to
→ F04 \●Nem erre gondoltam \●Macika║! Ö: \●mondjuk egy /\●fél even
belül /\●rákban meghalsz║. […]
I didn’t mean that, Honey Bear. U:h let’s say you are going to die of
cancer in half a year. […]
Similarly to the previous strategy, this one is not mentioned in other taxonomies of
disagreement strategies either. In the cases of Rees-Miller (1995, 2000) and Locher
(2004), this might be explained by the circumstance that their studies are con-
cerned with how disagreement is (linguistically and non-linguistically) expressed
rather than with the functions of particular utterances.
The last neutral disagreement strategy identified in the research corpus is
labelled verbal shadowing by Rees-Miller (1995, 2000). By this term I mean the
act of repeating the previous speaker’s words, phrases or entire utterance (some-
times with slight alteration) to question the content of the previous utterance.
I take verbal shadowing in a narrower sense than Rees-Miller (1995), whose
interpretation allows for significant alterations in single words in the opposing
utterance. It is noteworthy that in some cases only the intonation contour is
changed to express disagreement and often criticism, as exemplified in the fol-
lowing interchange from the dispute about gender differences in people’s sense
of direction:
Example 14
M05 Menjünk tovább! „A férfiak intelligensebbek a nőknél.”
Let’s move on. “Men are more intelligent than women.”
The development of a taxonomy of verbal disagreements in the light of the p-model
In this heated debate speaker F05 expresses her disagreement several times, but
in the marked line she does this by simply picking up and repeating a single word
from the previous utterance (Né:ha? ~ Someti:mes?) and adding a critical tone to it
with the vowel lengthening and falling intonation.
. For a comprehensive overview of approaches to irony and other related phenomena (e.g.
sarcasm, cynycism, mocking) see, for example, Komlósi (2007, 2010, 2013) or Svindt (2007,
2008).
The development of a taxonomy of verbal disagreements in the light of the p-model
In this extract about two characters of a story, speaker M05’s non-serious proposal
of the two characters being friends is rejected by speaker F05’s ironic utterance
(Ó: igen! ~ Yea:h, right.).
Intensify disagreement is another strengthened disagreement strategy,
one which makes use of certain words or expressions (e.g. egyáltalán ~ at all,
abszolút ~ absolutely, teljesen ~ totally, sokkal ~ much more, mondom ~ I’m tell-
ing you, etc.) to emphasise and strengthen the effect of disagreement, a strategy
also identified by Rees-Miller (1995, 2000). I refer to these linguistic items as
intensifiers as they increase the degree of intensity of disagreement. Expres-
sions such as szerintem (~ I think), csak (~ just), hát (~ well) also belong here
when they are used as boosters, since they intensify or emphasise the force
of the utterance and express certainty on the part of the speaker. Intensified
disagreements are frequently accompanied by paralinguistic features such as
volume amplification of the intensifier and quick tempo applied to the open-
ing phrase (e.g. hát ~ well, szerintem ~ I think). In some instances, intensifiers
are used in clusters. The second line (Hát SZERINTEM ez EGYÁLTALÁN nem
igaz. ~ Well I DON’T THINK this is true AT ALL.) of Example 14 is a case in
point. In this utterance three tokens of intensifiers (hát ~ well, szerintem nem
~ I don’t think, egyáltalán ~ at all) are adopted with emphatic stress on two of
them to reinforce the speaker’s oppositional intent and make her disagreement
prominent.
Another form of strengthened disagreement is the use of a rhetorical question,
a question asked not to evoke an actual reply but to imply that there is no answer to
the question or that the answer is too obvious. In some cases it is used for the sake
of persuasive effect and the speaker provides an answer. “What makes a rhetorical
question an aggravated form of disagreement is that its basic purpose is to score
points against an opponent, not to further understanding or build consensus”
(Rees-Miller 1995: 137f.). Let us consider Example 16, taken from a discussion
about abortion and adoption.
Helga Vanda Koczogh
Example 16
F02 Akkor örökbe lehet adni vagy akármi.
Then it can be given up for adoption or whatever.
M02 De az is szívtelenség eléggé.
But that’s pretty heartless too.
→ F02 \●Örökbe adni║? \●Bolond vagy║? /●Legalább lesz valaki║ aki
\●szereti║!
To give it up for adoption? Are you crazy? At least there’ll be
somebody to love him/her!
In the interchange two rhetorical questions are used one after the other with a fall-
ing intonation, followed by an explanation (Örökbe adni? Bolond vagy? Legalább
lesz valaki, aki szereti. ~ To give it up for adoption? Are you crazy? At least there’ll
be somebody to love him/her.). The adoption of rhetorical questions is intended to
send the other interlocutor the unmistakable signal that his previous statement
about giving up a baby for adoption is heartless is unequivocally rejected. Hence,
the deployment of this strategy clearly displays overt disagreement.
Challenge is a disagreement strategy which displays strong disagreement with
the prior proposition and typically has the syntactic forms of either imperative or
interrogative (both rhetorical and response-seeking questions). Rather than mak-
ing a specific claim, this type of disagreement questions the addressee’s stance and
implies that (s)he cannot provide evidence for his/her claim. This strategy is often
accompanied by phonetic stress, rapid tempo and an ironic tone, all of which high-
light the speaker’s oppositional stance. The following example, taken from a dispute
on gender differences in driving skills, illustrates how hostile challenges can be:
Example 17
M05 „A nők biztonságosabban vezetnek, mint a férfiak.” Az igen!
(irónikus)
“Women drive more safely than men do.” Wow! (ironical)
→ F05 \●Miért║? Szerinted /\●nem igaz║? (irónikus)
Why? Don’t you think it’s true? (ironical)
M05 Nem igaz! … A nők nem tudnak odafigyelni a vezetésre!
It’s not true! … Women can’t focus on driving!
→ F05 \●HA║…! És /●elmondanád hogy \●MIÉRT║?
HA…! And could you tell me WHY?
true?). Then, he makes a blunt contradictory statement (Nem igaz! ~ It’s not true!)
followed by a categorical proposition (A nők nem tudnak odafigyelni a vezetésre! ~
Women can’t concentrate on driving!), which generates an even more hostile atmo-
sphere and elicits an antagonistic challenge (És elmondanád, hogy MIÉRT? ~ And
could you tell me WHY?) preceded by an interjection (HA!) expressing resentment.
The challenge in the last line of the conversation questions the stance of the other
and implies that he cannot back up his claim, thus, directly attacks his compe-
tence. Hence it can be stated that challenges direct a high degree of aggravation
to the other’s rationality, which also explains why this disagreement strategy is
claimed to be the second most face aggravating disagreement type in Muntigl &
Turnbull’s (1998) categorisation.
Judging is a strengthened disagreement strategy in which the speaker expresses
a negative evaluation of the previous proposition, often to the point of condem-
nation.15 In this strategy, interlocutors often use offensive language (‘judgmental
vocabulary’ in Rees-Miller 1995, 2000) such as hülyeség (~ rubbish), marhaság
(~ bullshit), seggfej (~ asshole), etc. to show their strong disagreement. The aggra-
vating adjectives are frequently utilised with word stress and/or falling or rise-fall
intonation to exert an emphatic effect on disagreement. Consider the following
example on higher education tuition fee:
Example 18
F06 „A felsőoktatási intézményekben be kellene vezetni a kötelező
tandíjat, de költségét a hallgató tanulmányi átlagától kellene
függővé tenni.” Szerintem ez teljesen korrekt és helyénvaló.
“Compulsory tuition fees should be introduced in higher
education, but the amount for a semester should depend on the
student’s grades”. I think this is absolutely fair and right.
→ M06 De hát ez /\●BAROMSÁG║.
But that’s RUBBISH.
In the above interchange speaker F06 expresses her agreement with the statement
she has just read out. The other interlocutor’s response in the subsequent turn
clearly conveys the message that he strongly disagrees with both the proposal and
the previous speaker and directly confronts her. The use of prefatory de (~but), the
word choice of baromság (~ rubbish) and the emphatic stress and rise-fall intona-
tion on the latter all magnify the opposition and make the disagreement explicit.
The last strengthened disagreement strategy identified in the corpus is show
disbelief, play down previous statement, a strategy in which the speaker expresses
. Bándli (2009) also identifies this disagreement strategy and calls it ‘Judgement/Qualification’.
Helga Vanda Koczogh
that (s)he does not believe or doubts the previous proposition and therefore can-
not accept it.16 Disbelief is often accompanied and/or expressed by interjections
expressing scepticism (e.g. Á: ~ A:h) and emotive phrases such as kizárt (~ no way)
or Az ki van zárva (~ That’s out of the question), as exemplified in the following
extract, which is the continuation of Example 6.
Example 19
F08 [De] hát most ő mit tehet arról? {hogy balfasz}
[But] what could he do about it? {that he is a screw-up}
M08 M: olvasni kellett volna a sorok között.
U: hm he should have read between the lines.
→ F08 Jaj \●UGYAN [\●MÁ:R║!]
Oh, COME [O:N!]
Here, the functions of the last utterance are clearly to make the previous state-
ment appear absurd and to urge speaker M08 to abandon his supposedly foolish
proposition. The imperative force of the idiom ugyan már (~ come on), the use of
interjection jaj (~ oh), and the emphasis on the last two words with the prolonged
vowel articulation and the falling intonation contour on the phrase highlight the
speaker’s rejection of the previous statement and overtly express her opposing
stance. Similarly to challenges, this strategy often questions the other interlocu-
tor’s rationality, thus aggravating his/her face.
3.2.4 Odd-one-outs
It must be mentioned that I encountered some tokens of disagreement in my cor-
pus that could not be categorised based on my category system either. Consider
the following example taken from a discussion about two characters of a story:
Example 20
F03 […] Nekem ő {a Szépfiú} volt a legellen[szenvesebb.]
[…] For me he {the Pretty Boy} was the most dis[tasteful].
M03 [Nekem pedig] a Szépleány. Azért mert mert szerintem ő volt
a leggonoszabb azért, mert mert [megcsalta a … megcsalta a
vőlegényét.]…
[For me it was] the Pretty Girl. Because because I think she was the
meanest because because [she cheated cheated on her fiancé.] …
. Bándli’s (2009) categorisation of disagreement strategies also includes a category called
‘doubts/uncertainties’, however, it accounts for utterances that express uncertainty and attempt
to make the other speaker unsure of his/her stance.
The development of a taxonomy of verbal disagreements in the light of the p-model
that even the newly proposed framework cannot account for. Thus, the extended
p-context is still informationally underdetermined. This finding triggers further
re-evaluation of the extended p-context and a new argumentation cycle.
In Subsection 3.2.2, the function of speaker M02’s response to the previous pro-
posal was categorised as stating disagreement; however, egyáltalán (~ at all) magni-
fies the opposition, so it deploys the intensify disagreement strategy as well. Along
similar lines, apart from the softened disagreement strategy of interject, the dis-
agreement (M:? ~ U:hm?) expressed in Example 9, also makes use of the strength-
ened strategy of showing disbelief, playing down previous statement. In a similar
vein, besides the discussed function, the disagreements uttered in Examples 518
(Figyelj már, szerinted egy egy hatvan eves tata szerinted versenyzik? Örül, hogyha
beül a kocsiba. ~ Listen, do you think a 60-year-old fogey races? He is happy if he
can get in the car.) and 1419 (Né:ha? ~ So:metimes?) can equally be interpreted
The informants were presented with 23 excerpts, which included at least one
example of each function as well as two odd-one-outs (items 8 and 19); the latter
made use of strategies (impersonalise S to H and rhetorical question) that were not
listed among the categories. The aim was to discover whether the participants can
identify the functions of these tokens of disagreement, and, if not, what categories
they assign to them from the list.
Although I do not question the value of the answers with low confidence level
values, ignoring this information ensures the elimination of answers that might
have resulted from pure guesswork. Hence, only the answers with confidence level
values of 3 and 4 were included in the analysis presented below.
The research results are presented in two parts. First, the findings are dis-
cussed with reference to the answers that were given with a high level of confi-
dence indicating the individual items’ perceived level of difficulty. In the second
part, the research participants’ categorisation is compared with my intuition to
find out whether they match with each other. Some of the differences and the
problematic cases are discussed in the second half of this section.
As Table 3 illustrates, items 2, 22, 1, 18, 20, and 16 (in decreasing order)
received the highest number of answers (over 70) with a high confidence level
value. Naturally, this simply means that in the case of these tokens of disagree-
ment, participants were more confident about the function(s) they assigned to the
token, but it does not necessarily mean that their interpretations were correct. The
The development of a taxonomy of verbal disagreements in the light of the p-model
answers given to these items are relatively homogenous in terms of the assigned
functions. These disagreements are either expressed explicitly (e.g. extract 2) or
deploy the strategies of mockery (e.g. extract 18) and disbelief (e.g. extract 20).
The tokens of disagreement that were answered with a higher degree of per-
ceived difficulty are the following (presented in increasing order): items 5, 8, 13,
19. These tokens received fewer than 50 answers with a high level of confidence
and, with the exception of number 8, are more diverse with regard to the assigned
categories. Each of these items will be further investigated in the second half of
this section.
Table 3. Research results (%) with confidence level values of 3 and 4
Extract Categories*
no.
P/TA EXPL HUM HED CST STD CS’M ECHO ID META ELSE
Moving on to item 13, illustrated by Example 22, we can see that although the
function assigned by most participants (59.6%) corresponded to my categorisa-
tion (interjection), a considerable number of them (34%) interpreted the interjec-
tion M:?(~ U:hm?) uttered by speaker F04 as a way of expressing doubt or disbelief.
I must admit that in its broader sense disbelief can include such indirect utterances
as the one under discussion, especially when uttered with rising intonation.
Example 22
On death penalty
M04 Ez ebben egyet értek, hogy nem kéne alkalmazni. Ö: igaz, ö… van
egy fajta megfélemlítő ereje a halálbüntetésnek a bűnözők számára.
That this I agree with that it shouldn’t be used. U:h it’s true that
uh… death penalty has a kind of deterrent power for criminals.
→ F04 /●M:║?
U:hm?
Extract 19 (see Example 23) illustrates a heated debate on the character of the
Pretty Boy and contains a token of disagreement that was in the focus of my
interest in terms of its categorisation. According to my analysis,23 the function
of speaker M12’s utterance (MI? Az az, hogy kihasználta egy éjszakára? Átment,
[visszajött] ~ WHAT? That he had her for a one-night stand? He went over, [came
back]) in the last line of the excerpt is that of a rhetorical question; however, this
strategy was not listed among the categories on the task sheet.
Example 23
M12 Az nem lett volna gonoszság? Átmegyek aztán visszajövök
Wouldn’t it have been a mean thing? I go over then I come back
F12 De nem [ne:m]
But no [no:]
M12 [és otthagyja]. ME:RT?
[and he leaves her there.] WHY:?
F12 Hát mert akkor Ő tesz áldozatott a Szép[lányért]
Well because then HE makes sacrifices for the Pretty [Girl]
M12 \●MI║? Az az hogy /\●kihasználta egy éjszakára║? /●Átment║
→
[\●visszajött║]
WHAT? That he had her for a one-night stand? He went over,
[came back]
. It should be stressed that my analysis is primarily based on the functions of disagree-
ment that were identified in previous literature and it is not based on intuition alone.
Helga Vanda Koczogh
Only three subjects assigned the function other to this instance of disagreement,
but two of them interpreted it as a rhetorical question. Most of the participants
(47.9%) placed it into the meta category, identifying its function as a challenge.
Those subjects who assigned the function echo to it (33.3%) justified it with the
mocking tone. In my view, the tone adds a hostile attitude to the utterance, which
makes it a challenge rather than mockery. Also the interrogatives question the
stance of the other speaker in an antagonistic way, which is a characteristic feature
of challenges.
As the vast majority of the categories comprising my new system for coding
disagreements have already been introduced, what follows is only a short defini-
tion of each for convenience and some prototypical examples of reactions to the
utterance Men can cook better than women.24
Disagreement strategies:
. Except for the last one (clarification of speaker’s meaning), where an additional turn
needed to be inserted the disagreeing utterance reacts to.
. Rees-Miller (1995: 116).
The development of a taxonomy of verbal disagreements in the light of the p-model
–– stating disagreement: an utterance that explicitly states that the speaker dis-
agrees with the previous proposition (e.g. Ezzel nem értek egyet. ~ I don’t agree
with this.).
–– challenge: an utterance that displays strong disagreement with the prior propo-
sition by questioning the addressee’s position and implying that (s)he cannot
provide evidence for his/her claim26 (e.g. Miért főznének jobban? ~ Why would
they cook better?; És elmondanád, hogy miért? ~ And could you tell me why?;
Akkor te miért főzöl olyan pocsékul? ~ Why do you cook so badly, then?; Akkor
mondj már egy olyan férfit, akire ez igaz! ~ Then tell me a man of whom it’s true.).
–– disbelief: an utterance that indicates that the speaker does not believe or
doubts the previous proposition and thus cannot accept it (e.g. Á:! ~A:h!;
Ugyan már! ~ Come on!; Ki van zárva! ~ That’s out of the question!).
–– evaluation: an utterance that expresses a negative evaluation of the previous
speaker’s proposition, indicating strong disapproval (Hülyeség! ~ Nonsense!;
Ez marhaság! ~ That’s bullshit!).
–– clarification of speaker’s meaning: an utterance that clarifies the usually mis-
understood meaning of the speaker’s previous utterance (e.g. Szóval te jobban
főzöl, mint én? ~ So you can cook better than me?; Magyarul én béna vagyok a
főzéshez. ~ So I am lame at cooking.), which, therefore, contradicts or corrects
the other interlocutor’s previous proposition (e.g. Én nem erről beszélek. ~ I’m
not talking about that.; Nem arra értem. ~ I didn’t mean that.; Nem úgy. ~ Not
that way.).
As is clear from the above, no new categories have been added to the disagreement
strategies. The only modification made to this new version of the coding scheme is
the division of contradictory statement and implied contradiction into two different
categories. This is due to these categories being different from each other in terms
of their directness.
The disagreement strategies listed above exhibit varying degrees of in/direct-
ness. Direct disagreement strategies are utterances in which there is a match
between the propositional content and the illocutionary force of the acts, so the
disagreement is uttered in a way that clearly communicates to the hearer that a dis-
agreement is being produced. They express disagreement explicitly,27 and therefore
require no or little interpretative work on the addressee’s part. Obviously, the most
direct disagreement strategy is stating disagreement, and other direct strategies
–– humour: a mitigating device that softens the force of the disagreement usually
engendering laughter on the part of the speaker, listener, or both. The utter-
ance is made in a non-serious, joking manner (cf. Example 5).
–– hedge: mitigating devices used to (1) soften the face-threatening force of dis-
agreement by indicating uncertainty on the part of the speaker (e.g. szerintem
~ I think, lehet ~ maybe, esetleg ~ perhaps, hat, -het ~ can, could), (2) lower the
effect on the meaning of another element (e.g. valamennyire ~ to some extent,
egy kicsit ~ a little) or (3) fill in the time used for thinking (e.g. ö ~ uh, m: ~
u:hm, hát ~ well).
–– impersonalisation: the use of impersonal/passive structures to shift responsi-
bility. It is a mitigating device as it makes the utterance appear as coming from
a different source (e.g. Egyesek szerint ~ Some say, Azt mondják, hogy… ~ It is
said that…).
–– tag question: a short question added to a statement which requests assurance
or affirmation regarding what is expressed in the main clause. It serves to seek
confirmation either explicitly or rhetorically and to involve the hearer in the
discourse (e.g. ugye? ~ right?).
–– conditional: a mitigating linguistic device expressing a hypothetical situation,
thus softening the pragmatic force of the utterance.
–– term of endearment: a word or a phrase used to address a person in a way that
conveys solidarity and affection and thus softens the propositional content of
the utterance (e.g. Maci, Macika ~ Honey Bear).30
As is apparent from the list above, the last three categories are new additions to the
taxonomy. Tag questions have several functions (e.g. epistemic, facilitative, chal-
lenging) and there is no one-to-one correspondence between individual occur-
rences and functions. As a result, tag questions need to be interpreted with regard
to the given context. Here only those tag questions are considered as mitigators
that soften the force of the utterance by showing positive face considerations and
facilitate conversation.
The category of conditional refers to l’art pour l’art conditional, which softens
the force of disagreement by expressing a hypothetical situation. Other polite uses
of conditional are included among hedges.
The last category of mitigators includes the use of terms of endearment as in
the research corpus they were uttered in a soft tone without special emphasis and
thus they soften the force of disagreement by shifting the focus from the disagree-
ment per se to the camaraderie and ties existing between interlocutors. This is
illustrated in Example 13, where the last two utterances of speaker M04 are miti-
gated by the use of Maci and Macika (~ Honey Bear) emphasising the special bond
between the speaker and the hearer.
The other group of PFMs termed aggravators include those linguistic items
and devices that aggravate the pragmatic force of disagreement. As can be seen
from the list below, this group contains only one new category when compared to
the previous versions of the taxonomy. It is the category of tag question with the
function of challenging the other speaker.
. Naturally, I do not intend to imply that terms of endearment always reflect immediacy.
These terms can have either positive or negative polarity which is determined by intonation
and context. In the research corpus, both occurrences were uttered with a soft tone and falling
intonation, which conveys solidarity rather than distance or annoyance.
Helga Vanda Koczogh
Aggravators:
It goes without saying that neither the list of disagreement functions nor that of
PFMs is exhaustive by any means. They include the categories that were identified
in my research corpus but other disagreement strategies and even PFMs might be
discovered, for example, when examining disagreement in other contexts.
As for the construction of disagreements, I identified the following internal
structure in the course of indexing illustrated by Figure 2.
As the figure shows, disagreement can be expressed via a single disagreement
strategy or optionally (indicated by brackets) with a combination of strategies, and
can be preceded by a pre-sequence. Optionally, the pragmatic force of disagree-
ments can be altered by the use of PFMs.
The development of a taxonomy of verbal disagreements in the light of the p-model
(PFMs)**
** Naturally, it is possible to assign more than one PFM to a single disagreement strategy.
In order to interpret the last utterance as a disagreement one needs more than
mere lexical knowledge. On the surface level Ó: igen! can be understood as an
agreement and it is the ironical tone (rise-fall intonation) that helps us arrive at
the right interpretation. As Kotthoff (1993) states, in the context of a dispute inter-
locutors use short assent as a means of being ironic and expressing disagreement.
As a result of the new taxonomy for analysing disagreements, the disagreement
expressed by speaker F05 in Example 7 below can be categorised clearly without
having to assign several functions to it, unlike in the previous version of the model.
Helga Vanda Koczogh
Example 7
M05 Szerintem a férfiak SOKKAL intelligensebbek a nőknél.
I think men are MUCH MORE intelligent than women.
→ F05 De ö: /●azt mondják║ hogy azér egy nő /●tud több dologra figyelni
egyszerre║ … \●jobban használja agyának a ké /●mindkét felét║…
But u:h it is said that women are able to pay attention to more
things at the same time … they use both of both sides of their
brain…
In speaker F05’s utterance the initial de (~ but) indicates that she would like to
reject the previous proposition (a férfiak sokkal intelligensebbek a nőknél ~ men are
much more intelligent than women) and she chooses to express disagreement in a
mitigated way. The contrastive marker de (~ but) is followed by a hesitation marker
(ö: ~ u:h) and a counter-claim that provides an explanation and supports why the
speaker is right. Giving a reason or explanation attenuates disagreement and so
does the use of the generic statement azt mondják, hogy (~ it is said that) serving
as a means of distancing the speaker from the truth value of the proposition. Thus,
in the new framework the utterance under scrutiny utilises the strategy of explana-
tion and it is accompanied by two mitigators (hedge and impersonalisation).
The following example illustrates how disagreement strategies can co-occur in
order to express disagreement in a more effective way.
Example 24
F09 … Hát szerintem nem kellene kötelező tandíjat bevezetni, mert
most én azért fizessek, mert tanulni akarok? Nekem ez így nem
… kapcsolódik össze.
… Well I don’t think compulsory tuition fees should be introduced
because should I pay because I want to study? For me this doesn’t
… make any sense.
→ M09 Hát /●jó║ de azt azért \/●figyelembe kell venni║, hogy a \●háttérben
azért van egy apparátus║ amit \●biztosítanak számodra║ /\●azért
hogy tanulhass║.
Well fine, but it needs to be taken into consideration that there is
some apparatus provided (for you) in the background so that you
can study.
Example 24, taken from a discussion about compulsory tuition fees in higher edu-
cation, illustrates the most frequently used combination of disagreement strategies
in my research corpus. The disagreement starts with a hedging device (hát ~ well),
followed by a partial agreement (jó, de ~ fine, but) and an explanation (azt azért
figyelembe kell venni, hogy… ~ it needs to be taken into consideration that…) of how
The development of a taxonomy of verbal disagreements in the light of the p-model
tuition fees are used for the benefit of students. As Kotthoff (1993: 208) points out,
“partial agreement can help to develop the main point of disagreement” and “is
often a prestep to further disagreement”.
As can be seen from the examples discussed in the previous section, the new tax-
onomy distinguishing between disagreement functions (strategies) and linguis-
tic devices that either mitigate or aggravate the force of the opposing utterance
(PFMs) seems to serve as a suitable analytical tool for providing us with a com-
prehensive analysis of disagreements. In addition, this framework allows for an
analysis of utterances that contain a combined use of disagreement strategies, thus
the new taxonomy is more complete and sophisticated than its previous version
or the models described in the starting p-context. Also, the functional catego-
ries are less subjective than those of the other p-context versions, since they are
assigned typically co-occurring PFMs and, when applicable, acoustic features as
well, which make the categories more distinguishable and the model more robust.
Therefore, this model leads to statements with a higher plausibility value than the
other p-context versions. Moreover, the model also resolves the informational
underdetermination of the previous p-context versions, since it can account for
all instances of disagreement found in the research corpus. One can conclude that,
based on the information available and the set of criteria laid down in Section 2,
this modified category system seems to be a much better solution to (P) than the
rival models (cf., 2.1–2.4); thus, it can be regarded as a resolution to (P) in the
information state at our disposal. Consequently, on the basis of this case study it
can be stated that the resolution to (P) is as follows:
(SP) The model distinguishing between strategies and PFMs allows for
describing Hungarian disagreement the most.
However, it must be noted that (SP) is a fallible solution which, in light of new
data, can be reconsidered in the course of a new argumentation cycle.
6. Conclusion
changes made in the instrument as the research proceeded. I explained the multi-
step method used for analysis including coding schemes and extensive illustration,
where applicable. A step-by-step description of the analytical procedure(s) was
aimed at guiding the reader through the stages of the research whose findings
constantly shaped the original framework until it reached its ultimate form. By
describing each step of the research along with the decisions and re-evaluations
of the information at hand as well as the p-context, I demonstrated the cyclic and
prismatic nature of the plausible argumentation process.
As a primary goal, the study attempted to find the model that is the most
capable of describing Hungarian disagreement. It succeeded in mapping the
functional spectrum of disagreements employed by Hungarian undergraduate
students by identifying specific disagreement strategies. Additionally, PFMs,
which either strengthen or soften the illocutionary force of the utterance, were
also identified. This study, thus, managed to investigate disagreement not only in
type of strategy but also in terms of modification devices. Furthermore, it pro-
poses a framework of verbal disagreement that is more reliable and sophisticated
than other rival models. It seems to meet all the criteria laid down previously and
thus can be regarded as a resolution to (P), which might be re-evaluated based
on new data.
Acknowledgements
Work on the present chapter was supported by the MTA-DE Research Group for
Theoretical Linguistics and the TÁMOP-4.2.2/B-10/1-2010-0024 project, which is
co-financed by the European Union and the European Social Fund. I would like to
I express my gratitude to András Kertész and Csilla Rákosi for their helpful com-
ments and insightful views on earlier drafts of this paper.
References
Aijmer, Karin. 1986. Discourse variation and hedging. In Corpus Linguistics, II, New Studies in
the Analysis and Exploitation of Computer Corpora, Jan Aarts & Willem Meijs (eds), 1–18.
Amsterdam: Rodopi.
Bándli, Judit. 2009. Az egyet nem értés pragmatikája. PhD dissertation, University of Pécs, Pécs,
Hungary.
Blum-Kulka, Shoshana, House, Juliane, & Kasper, Gabriele. 1989. The CCSARP coding man-
ual. In Cross-cultural Pragmatics: Requests and Apologies, Shoshana Blum-Kulka, Juliane
House, & Gabriele Kasper (eds), 273–294. Norwood NJ: Ablex.
Brown, Penelope, & Levinson, Stephen. 1987. Politeness: Some Universals in Language Usage.
Cambridge: CUP.
The development of a taxonomy of verbal disagreements in the light of the p-model
Bousfield, Derek. 2008. Impoliteness in Interaction [Pragmatics & Beyond New Series 167].
Amsterdam: John Benjamins.
Culpeper, Jonathan. 1996. Towards an anatomy of impoliteness. Journal of Pragmatics 25: 349–367.
Culpeper, Jonathan. 2005. Impoliteness and entertainment in the television quiz show: The
Weakest Link. Journal of Politeness Research: Language, Behaviour, Culture 1: 35–72.
Culpeper, Jonathan. 2011. Impoliteness: Using Language to Cause Offence. Cambridge: CUP.
Culpeper, Jonathan, Bousfield, Derek, & Wichmann, Anne. 2003. Impoliteness revisited: With
special reference to dynamic and prosodic aspects. Journal of Pragmatics 35: 1545–1579.
Du Bois, John W., Cumming, Susanna, Schuetze-Coburn, Stephan, & Padino, Danae (eds).
1992. Discourse transcription. In Santa Barbara Papers in Linguistics 4. Santa Barbara CA:
University of California.
Fónagy, Iván. 1971. Synthèse de l’ironie. Basel: Karger.
Fónagy, Iván, & Magdics, Klára. 1967. A magyar beszéd dallama. Budapest: Akadémiai Kiadó.
Grice, Herbert Paul. 1975. Logic and conversation. In Syntax and Semantics, 3: Speech Acts, Peter
Cole & Jerry Morgan (eds), 41–58. New York NY: Academic Press.
Holtgraves, Tom. 1997. Yes, but…: Positive politeness in conversation arguments. Journal of
Language and Social Psychology 16: 222–239.
Holmes, Janet. 1995. Women, Men and Politeness. London: Longman.
Kertész, András, & Rákosi, Csilla. This volume. Evidence and the argumentation structure of
linguistic theories.
Kertész, András, & Rákosi, Csilla. 2012. Data and Evidence in Linguistics. A Plausible Argumen-
tation Model. Cambridge: CUP.
Koczogh, Helga Vanda. 2012. The Effects of Gender and Social Distance on the Expression of
Verbal Disagreement Employed by Hungarian Undergraduate Students. Ph.D. disserta-
tion, University of Debrecen, Debrecen, Hungary.
Komlósi, Boglárka. 2007. Az iróniakutatás elméleti dilemmái a pragmatikában. Nyelvtudomány
3–4: 95–128.
Komlósi, Boglárka. 2010. Irony in the semantics-pragmatics interface: A reconstructional
model. 〈http://conference.clancorpus.net/?p=1422〉
Komlósi, Boglárka. 2013. A polarizációs tendencia szerepe az irónia nyilvánvalótételében. A
hiperbola és lilotész mint nyomok az irónikus interpretációhoz. Magyar Nyelv 4: 409–420.
Kotthoff, Helga. 1993. Disagreement and concession in disputes: On the context sensitivity of
preference structures. Language in Society 22: 193–216.
Leech, Geoffrey. 1983. Principles of Pragmatics. London: Longman.
Locher, Miriam. 2004. Power and Politeness in Action: Disagreements in Oral Communication.
Berlin: Mouton de Gruyter.
Markó, Alexandra. 2005. A spontán beszéd néhány szupraszegmentális jellegzetessége: Mono-
logikus és dialogikus szövegek összevetése, valamint a hümmögés vizsgálata. Ph.D. disser-
tation, ELTE, Budapest, Hungary.
Markó, Alexandra. 2007. A Special Conversational Device: Humming in Hungarian. The
Phonetician 95: 28–35.
Mászlainé Nagy, Judit. 2010. A jászapáti Magyar és roma kisiskolások nyelvhasználati stratégiái-
nak kultúraközi elemzése. PhD dissertation, University of Debrecen, Debrecen, Hungary.
Muntigl, Peter & Turnbull, William. 1998. Conversational structure and facework in arguing.
Journal of Pragmatics 29(3): 225–256.
Németh T., Enikő. 1996. A szóbeli diskurzusok megnyilatkozáspéldányokra tagolása [Nyelvtudo-
mányi értekezések 142]. Budapest: Akadémiai Kiadó.
Helga Vanda Koczogh
Nauke, Astrid, & Braun, Angelika. 2011. The production and perception of irony in short
context-free utterances. ICPhS 7: 1450–1453.
Rees-Miller, Janie. 1995. Linguistic Features of Disagreement in Face-to-Face Encounters in
University Settings. Ph.D. dissertation, Stony Brook, NY: State University of New York.
Rees-Miller, Janie. 2000. Power, severity, and context in disagreement. Journal of Pragmatics 32:
1087–1111.
Sperber, Dan, & Wilson, Deirdre. 2006. Relevance Theory. In The Handbook of Pragmatic,
Laurence R. Horn & Gregory Ward (eds), 607–632. Oxford: Blackwell.
Svindt, Veronika. 2007. Iróniaértelmezés elméletben és gyakorlatban. Első Század, 1: 171–208.
〈http://www.epa.hu/01600/01639/00003/pdf/elso szazad_2007_1_171-208.pdf〉
Svindt, Veronika. 2008. Az irónia szupraszegmentális ismertetőjegyei. Félúton 4: 1–15.
Tannen, Deborah. 1984. Conversational Style: Analyzing Talk Among Friends. Norwood NJ:
Ablex.
Toplak Maggie & Katz, Albert N. 2000. On the uses of sarcastic irony. Journal of Pragmatics
32(10): 1467–1488.
Varga, László. 1985. Intonation in the Hungarian sentence. In Approaches to Hungarian, István
Kenesei (ed.), 205–224. JATE: Szeged, Hungary.
chapter 7
A case of disagreement
On plural reduplicating particles in Hungarian
György Rákosi
University of Debrecen, Institute of English and American Studies, MTA-DE
Research Group for Theoretical Linguistics
1. Introduction
Hungarian verbal particles fall into several, grammatically distinct classes. One type
doubles the case suffix that appears on the oblique associate of the particle verb. The
resulting reduplicating particle verb construction (RPVC for short) is illustrated in (1):1
. Diverging from Hungarian orthographical conventions, I write the verb and the particle
that immediately precedes it as two different words, and I do this consistently throughout this
article. It is partly for expository purposes, and partly for the reason that these conventions
are not fully consistent and they would dictate the two-word spelling in some of the cases to
be discussed here anyway. Native speakers are also often uncertain about whether a particular
György Rákosi
In this example, the oblique associate of the particle verb bears sublative case, and
the case suffix is reduplicated as the verbal particle rá.
The reduplicating particle verb construction has been studied quite exten-
sively in the literature on Hungarian, including, but not restricted to, Ackerman
(1987, 1990, 2003), Ackerman & Webelhuth (1993), Ackerman & LeSourd (1997),
Hegedűs (to appear), É. Kiss (1998, 2002), Kálmán & Trón (1999), Rákosi & L aczkó
(2011, to appear), Surányi (2009a, b) and Ürögdi (2003). The construction has
many intriguing properties and my aim in this article is to scrutinise one of these:
the behaviour of the particle when the oblique associate is plural. In the standard,
the particle takes what looks like a default 3sg form even in the presence of a plu-
ral associate.2 There is general consensus in the literature on the existence, i.e. the
grammaticality, of the resulting non-agreeing RPVC. Consider (2a) for illustration.
(2) non-agreeing RPVC
a. Peti rá nézett a lány-ok-ra.
Peti onto.3sg looked the girl-pl-onto
‘Pete looked at the girls.’
agreeing RPVC
b. Peti rá-juk nézett a lány-ok-ra.
Peti onto-3pl looked the girl-pl-onto
‘Pete looked at the girls.’
Much less obvious is the answer to the question whether the hypothetical alterna-
tive with a plural form of the particle is grammatical or not (2)(b). Most of the
literature listed above mentions the agreeing RPVC, but they are in disagreement
as to whether it is to be considered grammatical or ungrammatical. Some explicitly
reject it, and mark constructional equivalents of (2)(b) with the asterisk (É. Kiss
1998, Hegedűs to appear, Kálmán & Trón 1999; Kenesei 2005). But for others, the
agreeing RPVC is grammatical and they generally take it to be so without e xplicitly
entertaining the possibility that it need not be (É. Kiss 2002; Surányi 2009a, b). This
basic inconsistency and its possible resolution is the focus of this chapter.
particle-verb combination should be spelt as one or two words – an uncertainty that is well-
reflected in the variation in written forms found in corpora.
. Since the purposes of this chapter do not require a decision, I remain agnostic about
whether the particle in (2)(a) is underspecified for number or whether it is a morphologically
unmarked 3sg form. I gloss it as 3sg for expository reasons.
A case of disagreement
. The notation |p|S introduces the plausibility value of a statement p relative to a (possibly
plural) individual S (see Kertész & Rákosi 2012, this volume). This notation is described in
more detail in 3.1.
. It is beyond the scope of this chapter to discuss the Hungarian reduplicating particle verb
construction from a comprehensive cross-linguistic perspective. For recent discussions of
P-doubling constructions in German, see Noonan (2010) and Abraham (2010).
György Rákosi
Interestingly, all the pertinent literature tacitly assume that the answer to (P2) is
yes, irrespective of whether they take the agreeing RPVC to be grammatical or not.
This assumption is tacit in the sense that no-one has claimed explicitly that the
two respective RPVC varieties are distinct constructions. I attempt to show here
that the answer to (P2) turns out to be negative under closer inspection, and this
recognition will help us find a solution to (P1) that is hopefully more satisfactory
than previous accounts – that is, it can be regarded to be the resolution of (P1) on
the basis of the information at our disposal.
The structure of the chapter is as follows. In Section 2, I set up the minimal
descriptive background as a prerequisite for the construal of the initial p-context
of argumentation. This p-context is essentially the reconstruction of the existing
accounts that altogether give an inconsistent answer to (P1). I describe the key
features of this p-context in Section 3, discussing arguments that this literature
offers in support or in rejection of the grammaticality of the agreeing reduplicating
particle verb construction. In Section 4, I extend this p-context by the introduction
of new pieces of evidence favouring a positive answer to (P1) as well as coordinate
it via synthesising the old and the new information that is relevant for the argu-
mentation. In Section 5, I provide a summary by presenting the resulting final
p-context that crucially relies on a joint resolution to (P1) and (P2).
If these locative case markers take a pronominal complement, than the case suf-
fix assumes the relevant agreement morphology (4)(a), which is zero in 3sg (see
A case of disagreement
Footnote 2). The pronoun itself is normally not present in compliance with the
pro-drop nature of Hungarian (4)(b), and it is there only if it receives a discourse
function. In (4)(a), it is the focus of the clause.
(4) a. János ő-rá nézett fel.
John (s)he-onto.3sg looked up
‘It is her that John looked up to.’
b. János fel nézett rá.
John up look onto.3sg
‘John looked up to her.’
For ease of exposition, I simply refer to both inflected case suffixes in (4)(a) and
(4)(b) as pronominal forms, without intending to imply any theoretical commit-
ment with this term.
It is this pronominal form that apparently occurs in the reduplicating con-
struction, but only in its ‘pro-dropped’ version. I repeat (1) as (5)(a), and compare
it with (5)(b):
(5) a. Peti rá nézett a lány-ra.
Peti onto.3sg looked the girl-onto
‘Pete looked at the girl.’
b. *Peti ő-rá nézett a lány-ra.
Peti (s)he-onto.3sg looked the girl-onto
‘Peter looked at the girl.’
Since there is general consensus about the ungrammaticality of (5)(b), and since
my own intuition is in agreement with this consensus, I treat the u
ngrammaticality
of (5)(b) as certain. The contrast between (5)(b) and (5)(a), and the potential
analogy of the particle in (5)(a) with the pronominal in (4)(b) indicate the essen-
tial challenge that these constructions pose for a linguist: in what sense can the
particle in (5)(a) be treated as a pronominal of some sort? As I argue in Section 4,
this important issue constitutes a crucial element in the elaboration of (P2).
Before embarking on the establishment of the initial p-context for our case,
I give an exhaustive list of the inflected suffixal pronominal forms that can serve
as reduplicating particles.
(6) a. bele ‘into (it)’
b. benne ‘in (it)’
c. érte ‘for (it)’
d. hozzá ‘to (it)’
e. neki ‘to/against (it)’ (dative case)
f. rá ‘onto (it)’
g. rajta ‘on (it)’
h. vele ‘with (it)’
György Rákosi
Some of these are more productive in this use than others, and they share the
property that they are either directional or (stative) locative in their semantics.5
These 8 elements represent the class of particles that constitutes the empirical
coverage of the forthcoming investigation.
. See É. Kiss (1998, 2002) and Surányi (2009a, b) on the linguistic relevance of this.
. Kenesei (2005: 422) also denies the grammaticality of the construction, but since he
does this in a review discussion of É. Kiss’s (2002) examples, I come back to his remarks in
Section 3.3 below.
A case of disagreement
These authors all seem to treat these examples implicitly as representatives of what
I call here the agreeing reduplicating particle construction, and they describe it as
one that does not license (plural) number agreement between the particle and the
associate.
In short, the following plausible inferences present themselves as possible
reconstructions of their argumentation:
|The respective sentences (7)(b), (8)(b), (9) and (10) are
ungrammatical.|S = 1.
NAY
[0 < |The respective sentences are representative of the agreeing
RPVC.|S < 1]
NAY
[0 < |All other instances of this construction type are ungrammatical,
too.|S < 1]
NAY
0 < |The agreeing RPVC is ungrammatical as a construction.|I <1
NAY
Here and below I use the notation that Kertész & Rákosi (2012, this volume:
Section 2.2–3) introduce in their p-model with respect to plausible inferences.
The first line is an instance of the notation that denotes the plausibility value
of the relevant statement, where 1 is certainty on the basis of a given source S,
and 0 is neutral plausibility on the basis of a given source S. The second and third
line above introduce background assumptions with a plausibility value between 0
and 1.7 I is the plausible inference itself. I use NAY as a subindex on S as a short-
hand for the group of naysayers, i.e. those authors who consider the agreeing
RPVC ungrammatical.
The above background assumptions are taken to have a relatively high
plausibility value, since the description that the authors provide of the data
strongly implies that they share them. But since it is not an explicitly presented
commitment – that is, none of the authors claim that their examples generally
. As Kertész and Rákosi (this volume: Section 2.3) point out, “With inductive inferences,
the set of the premises has to be supplemented by the background assumption that the cases
not examined also possess the characteristics that could be found in the investigated ones …”.
This is a general assumption in linguistic practice.
György Rákosi
r epresent the construction type itself –, the plausibility value is not 1. It follows
that the conclusion also has a plausibility value that is lower than 1.
Surányi (2009a: 72)
–
(11) Az orvos hozzá / hozzá-juk érintett egy műszer-t
the doctor to.3sg / to-3pl touched an instrument-acc
a gyerek-ek-hez.
the child-pl-to
‘The doctor had an instrument touch the children.’
Surányi (2009b: 231)
–
(12) János nek-i / nek-ik ment a járókelő-k-nek.
John dat-3sg / dat-3pl went the passer.by-pl-dat
‘John bumped into the passers-by.’
There is no indication in the papers that his intuitions dictate any grammaticality
difference between the agreeing and the non-agreeing RPVCs, therefore I take it
to mean that Surányi finds both equally grammatical.
É. Kiss (2002: 197) mentions the following two minimal pairs:
– É. Kiss (2002: 197)
(13) a. János nek-ik ment a járókelő-k-nek.
John dat-3pl went the passer.by-pl-dat
‘John bumped into the passers-by.’
b. János nek-i ment a járókelő-k-nek.
John dat-3sg went the passer.by-pl-dat
‘John bumped into the passers-by.’
There are two noteworthy aspects of these examples. First, while É. Kiss (1998)
finds (7)(b) and (8)(b) ungrammatical, (13)(a) is presented in É. Kiss (2002) as
grammatical. Admittedly, the three respective sentences are non-identical, but
both texts suggest that the author describes properties of the construction and not
of particular examples only.8
The second interesting aspect of these examples is the contrast between
(13)(a) and (14)(a). The difference between the two examples is that the oblique
associate in (13)(a) has a human referent but the one in (14)(a) does not. This
latter is marked by É. Kiss as “??”, indicating that she does not find it either fully
grammatical or ungrammatical. She does not, however, make explicit note of this
contrast beyond presenting the data. The oblique associates in Surányi’s examples
each have human referents, and it is not clear whether the lack of examples with
non-human referents is an intentional feature of the presentation or not.
I interpret this as a sign that each author assumes latently that the agreeing
RPVC is only grammatical with human associates. Since this is my own intuition,
too, at this stage of the argumentation process I will set aside any examples where
the oblique associate has non-human reference. This also helps reconstruct the
initial p-context in such a way that “noisy” data are filtered out. In other words, I
background certain types of data that are not promising to be useful for the subse-
quent plausible argumentation process, and this will help concentrate on and the
core issue that constitutes the essence of (P1). In Section 4, I return to this refer-
ential restriction only to show that this strategy of backgrounding is justifiable at
this stage of the argumentation.
The yea-sayers’ approach is characterised by the plausible inference pattern
below.
|The respective sentences (11), (12) and (13)(a) are grammatical.|S = 1.
YEA
[0 < |These sentences are representative of the agreeing RPVC that includes
an oblique with +human referents.|S < 1]
YEA
[0 < |All other instances of this construction type are grammatical,
too.|S < 1]
YEA
0 < |If the oblique associate has +human referent(s), then the agreeing
RPVC is grammatical.|I <1
YEA
É. Kiss (1998: 142): “A preverbal element of the hozzá type does not agree in number
with the postverbal argument bearing the same case ending.”
É. Kiss (2002: 197): “[…] when coindexed with a plural lexical noun phrase, the 3rd
person PP [i.e. the particle] does not necessarily bear a plural marker.”
György Rákosi
Since the background assumptions (the second and the third lines) are, of course,
not explicit in É. Kiss (2002) and Surányi (2009a, b), the plausibility of the conclu-
sion itself is also lower than 1.
This is the basic p-inconsistency that underlies (P1) and which is in need of
resolution.
Before continuing, I would like to point out that this problem has been explic-
itly noted by Kenesei (2005) and Hegedűs (in prep.) in their discussion of the
relevant data. Kenesei (2005) is in fact a review of É. Kiss (2002), and I quote an
illustrative passage discussing É. Kiss’s data that appear in (13) and (14) above.
Correction of, or comment on, the Hungarian examples themselves is of course
not the remit of a copy-editor. One would expect this from a reviewer, but although
Katalin É. Kiss thanks an anonymous reviewer, it can only be supposed that this
person was not a native Hungarian linguist, since otherwise one would have
expected comment on some of the more controversial examples. To illustrate,
compare the following pair on p. 197, rejected by all my students [reference to
examples (13)(a) and (14)(a) above] […].
Both of these sentences are totally unacceptable: the grammatical versions …
must have singular nek-i [dat-3sg, cf. (13)(b) and (14)(b)], […].
(Kenesei 2005: 423)
Kenesei (2005) thus provides a solution that, primarily on the basis of his and his
students’ intuitions, gives a negative answer to (P1), supporting (∼G). In what fol-
lows I try to argue that the total elimination of (G), which Kenesei seems to sug-
gest, is much less plausible as a solution than its maintenance.
4. E
laborating the p-context: The differential analysis of agreeing
and non-agreeing RPVCs
As stated in the Introduction to this chapter, (P2) is not raised explicitly in the
literature. The common latent background assumption is that we are dealing with
instances of one and the same construction.
The existing analyses fall into two different types along a parameter that is
relevant for us. É. Kiss (2002), as we have seen in the previous section, argues that
the particle in the RPVC is a full pronoun. As such, it is the head of an appositive
construction to which the oblique associate is an adjunct. No other account makes
this assumption in the literature. Others take this particle to be a verbal prefix
(É. Kiss 1998), a reduced copy of the associate (Ürögdi 2003; Surányi 2009a, b),
or an incorporated agreement marker of a special sort (Rákosi & Laczkó 2011, to
appear; and see also the series of papers by Ackerman and co-authors for a similar
approach). Cutting this opposition to its essentials, the difference is between treat-
ing the particle either as a full pronoun or as something else.
The claim that I intend to substantiate here is that both sides are right, albeit
for different domains of data. The two hypotheses that I want to defend below are
as follows:
(H1) and (H2) together entail a negative answer to (P2), and they elaborate the
initial p-context in a way that facilitates the establishment of a resolution to
A case of disagreement
(P1).9 This is a paraconsistent solution in the sense of Kertész & Rákosi (2012)
and K ertész (2012) inasmuch as two contradictory statements are shown to
be co-existing at a higher level of analysis by restricting their application to
distinct empirical domains. Below I discuss three sets of data in support of this
claim. The source of these data is my own intuitions and the intuitions of the
speakers that I have asked for judgements.
In (17)(a), the antecedent of the oblique reciprocal anaphor is the subject noun
phrase, and the presence of the non-agreeing particle does not influence the gram-
maticality of the construction. (17)(a) is grammatical with or without the particle. In
(17)(b), the presence of the agreeing particle renders the sentence ungrammatical.10
It is a plausible assumption that (17)(b) is ungrammatical for the same reason
as (18):
(18) *A gyerek-eki bemutatták őketi egymás-naki.
the child-pl introduced them each.other-dat
‘*The childreni introduced themi to each otheri.’
That is, we can treat (18) and (17)(b) as instances of the same phenomena: a
pronoun intervenes between the antecedent and the anaphor, and its presence cre-
ates ungrammaticality. Crucially, this explanation works if we treat the agreeing
particle as a pronoun of some sort (H2), but we do not consider the non-agreeing
particle as a pronominal (H1).
. (H2) can be equally plausible both in the context of (G) and (~G). It is because the
agreeing construction can in principle be either grammatical or ungrammatical because of
(H2). I argue here that the former is the case.
. In terms of the traditional Binding Theory of Generative Grammar, the pronominal par-
ticle creates a Principle B violation.
György Rákosi
The semantic properties of the associate do not seem to matter in the non-agreeing
construction ((21)(b) & (22)(b)). Once we switch to the agreeing construction
((21)(a) & (22)(a)), judgements get markedly worse if the associate (and therefore
the particle itself) has -human reference. What may lie behind this?
I have been suggesting that non-agreeing particles are not pronouns. As such,
they are not expected to show any special selectional restrictions on their obliques,
which is the case by (21)(b) and (22)(b). Interestingly, true pronouns do not have
such referential restrictions either, consider (23):
(23) János nek-ik vágta a labdá-t.
John dat-3pl went the ball-acc
‘John threw the ball against them.’
The pronoun nekik ‘to them’ can refer to humans and non-humans, alike.11 That
seems to create a problem if we, just like in the previous two cases, want to treat
the a-examples above and (23) analogously, and analyse the agreeing particle in
(21)(a) and (22)(a) as a pronoun.
One possible solution to this particular problem can be found along the
strong pronoun – weak pronoun distinction of Cardinaletti & Starke (1999).
Their proposal, which is aimed at cross-linguistic coverage, draws a systematic
. The editors of this volume do not find the -human reading of this pronoun grammatical.
Arguably, this is indeed not the default reading, but many native speakers, including myself,
do find it a possible reading in a facilitating context. For example, I find (23) a perfect answer
to the question What happened to the windowpanes? Nevertheless, it could be the case that
there is a split between native speakers in accepting or rejecting the -human reading of such
pronouns, which is yet another instance of data inconsistency in this p-context. I put this issue
aside because it does not influence the main line of argumentation.
György Rákosi
distinction between strong pronouns, weak pronouns and clitics. It is the first
two categories that are relevant for us. Weak pronouns are deficient in several
ways, and they are underspecified for the +/–human feature by default. Strong
pronouns, on the other hand, are not deficient structurally and they may not refer
to non-human entities.
I take the pronoun in (23) to be a weak pronoun, which is a natural assump-
tion given that the ‘real’ pronominal complement of the case marker is absent (or
is pro-dropped). This explains why it shows no referential restrictions. If the plural
agreeing particle in (21)(a) and (22)(a) is a pronoun and it forms an appositive
construction with the associate as É. Kiss (2002) suggests, than it has to be a strong
pronoun, since appositives require a strong pronoun head. The +human restriction
follows from the strong pronoun status if we accept the account of Cardinaletti &
Starke (1999). Therefore the data in (21) and (22) make the pronominal analysis of
the agreeing particle even more plausible rather than implausible.12
. One could argue that the non-agreeing particles are also weak pronouns, and that is why
they are not subject to a referential constraint. However, even weak pronouns should show
Principle B affects, which non-agreeing particles do not, as we have seen above in Section 4.2.
A case of disagreement
is in the negative. In the final section, I conclude this chapter by pointing out
how this result helps to create the final p-context for the purposes of the current
argumentation.
The negative answer to (P2) has followed from a plausible argumentation process
that has rendered (H1) and (H2) statements of high plausibility.
Thus the final solution to (P1), I argue, rests on the acceptance of (H2) as a
very plausible statement. (H2) requires an account of the agreeing reduplicating
particle verb construction in which the relation between the particle and the asso-
ciate is appositive in nature. This is the account that É. Kiss (2002) proposes for
both RPVCs, but I have argued that it has to be restricted to the agreeing type. The
plural particle, as a referential pronoun, can only be in an appositive relation with
György Rákosi
the likewise referential associate. This construction is marked in the usual sense of
the term: because of the appositive structure, it is more complex and ‘heavier’ than
its unmarked counterpart, the non-agreeing RPVC; and it is also less frequent in
corpora or in speech.
Thus the issue, I believe, is not simply or only whether the agreeing RPVC is
grammatical or ungrammatical. The issue is, as I have tried to show, is that it is a
marked construction and native speakers may often show uncertain judgements
towards it and they may often try to avoid it altogether. This is not to deny that
there may be idiolectal variation. But if there is one, then it should also follow
from the structural difference between the agreeing and the non-agreeing RPVC
that I have established here. That is, some speakers may strongly disprefer or reject
the pronominal use of the reduplicating particle, and I suggest it is exactly these
speakers that disprefer or reject the agreeing RPVC.
This line of reasoning represents an instance of what Kertész & Rákosi (2012,
this volume: Subsection 2.6.2) call the Combinative Strategy, applied to the current
case in an attempt to tolerate the inconsistency that constitutes (P1) and to arrive at a
paraconsistent solution. We have found that it is much more motivated to treat state-
ments (i) and (ii) of (P1) as co-existing alternatives that can be maintained simulta-
neously. This requires the systematic separation of the two inconsistent p-contexts as
follows. First, it needs to be recognised that (i) and (ii) of (P1) characterise distinct
groups of native speakers. Second, the assumption that the agreeing and the non-
agreeing RPVC are instances of essentially the same construction type must be given
up: the particle is pronoun in the agreeing RPVC, but it is not in the non-agreeing
RPVC. Third, the pronominal use of the particle is a marked phenomenon and that
explains why even favourable native speakers may find it somewhat unusual, and
why many of them vacillate between dispreferring or outright rejecting it.
Needless to say, this account raises many further questions, the issue of
what it means to be data and evidence in linguistics being a prominent one. But
my intentions have been more modest in this chapter. I have tried to show that
conscious reliance on the p-model of Kertész and Rákosi (2012, this volume) as
a metalinguistic framework of analysis helps us find a potential resolution to
an inconsistency that characterises the literature on the agreeing reduplicating
particle construction of Hungarian.
Acknowledgments
References
Surányi, Balázs. 2009b. Adpositional preverbs, chain reduction, and phases. In Approaches to
Hungarian 11, Marcel den Dikken & Robert M. Vago (eds), 217–250. Amsterdam: John
Benjamins.
Ürögdi, Barbara. 2003. Feature doubling, aspectual structure, and expletives. In Proceedings of
the North East Linguistic Society 33, Makato Kadowaki & Shigeto Kawahara (eds), 425–444.
Amherst MA: University of Massachusetts.
chapter 8
A plausibility-based model
of shifted indexicals
Zoltán Vecsey
MTA-DE Research Group for Theoretical Linguistics
Influenced by David Kaplan’s seminal work (1989a, b), the dominant view on the
semantic interpretation of indexicals is based on the following familiar picture.1
Model-theoretic structures for indexical languages consist of two separable inter-
pretive domains: contexts of occurrence (c) and circumstances of evaluation (i).
Contexts are conceived of as collections of different parameters which are needed
to determine the denotations of indexical expressions. Such collections could be
minimally represented by quintuples of the form 〈s, t, l, w, g〉, where s is an agent,
t is a point of time, l is a location, w is a possible state of the world, and g is a
(possibly partial) variable assignment function defined on the salient entities of w
at t and l. Circumstances of evaluation are symbolised traditionally with centered
. In this chapter I will concentrate solely on the behaviour of pure indexicals. For simplicity’s
sake, I will ignore cases where pure indexicals are interpreted metaphorically or analogically.
Zoltán Vecsey
world-time tuples 〈w, t〉, but if the complexity of the interpreted language fragment
requires this, they may also include a few further parameters. The number and
kind of additional parameters do not influence substantially the model-theoretic
role of circumstances. That role consists in assigning truth values to the contents of
sentences which contain one or more indexical expressions. Within the Kaplanian
framework, sentential contents are treated as structured propositions. If we are
willing to accept this approach, we can say that circumstances settle truth values
for propositions.
The model-theoretic interpretation of indexical languages is then basically a
two-step procedure. In the first step of the interpretation process indexicals are
evaluated with respect to a particular collection of relevant contextual parameters.
The evaluated objects are conceived of traditionally as conventionalised mean-
ings or lexical entries. Kaplan (1989a: 505) calls such objects ‘characters’. As an
example, consider the contextual occurrence of three paradigmatic indexicals in
an intended model M:2
(1) a. [[Ig]]c = AGENTg(c)
b. [[hereg]]c = LOCATIONg(c)
c. [[nowg]]c = TIMEg(c)
In (1)(a), (1)(b), and (1)(c) characters are evaluated with respect to their context
of occurrence. It is commonly assumed that the context of occurrence c and the
assignment function g of the model M together determine the content (denota-
tion) of paradigmatic indexicals. So given an appropriate assignment function g,
the content of the personal pronoun ‘I’ is the agent of c, the content of the locative
adverb ‘here’ is the most salient location of the occurrence of ‘here’ in c, and the
content of the temporal adverb ‘now’ is the most salient moment of time selected
by the occurrence of ‘now’ in c. Other indexicals such as ‘today’ or ‘actual’ are sup-
posed to behave in the same way.
The second step in the interpretation process identifies an actual or counter-
factual circumstance with respect to which it is possible to ask for the truth values
{1, 0} of structured propositions. In intended models, circumstances coincide with
the actual state of the world and the time of this world 〈w@, t@〉, that is to say, with
our actual world. Circumstances may be identified in other admissible models
with arbitrary possible worlds and time points.
Ultimately, then, indexical sentences are evaluated with respect to two differ-
ent collections of parameters. First, contexts determine the content of characters,
. By ‘an intended model’ I mean simply a model whose contextual domain consists of some
set of objects of our actual world w@.
A plausibility-based model of shifted indexicals
Indexicals may denote different objects in different contexts, but it seems that every
particular context determines the denoted objects completely and definitely. The
personal pronoun ‘I’, for example, necessarily denotes in each of its occurrences
the agent of the context. If you are the only agent in c, then, of course, ‘I’ cannot
denote me truly in that very context.
This rather simple observation is the basis of the Kaplanian Reference Prin-
ciple, according to which indexicals are directly referential expressions (Kaplan
1989a: 492, 1989b: 569). The principle of direct reference claims that indexical
denotations are directly determined with respect to the relevant parameters of
the actual context of their occurrence. The mechanism of the determination is
called ‘direct’, since the denoted objects become part of the propositions without
the mediation of other propositional constituents. It is irrelevant how characters
find their denoted objects. Although their referential behaviour may be governed
by descriptive rules, these rules are not part of the constituent structure of propo-
sitions. The character of ‘I’ may be identified roughly with the descriptive rule
‘search for the agent who is speaking or writing’. At the ground level of propositions,
however, only the agent denoted by ‘I’ may have any model-theoretic significance.
If the Reference Principle is correct, then it is quite obvious that the content
of indexicals cannot be influenced by counterfactual circumstances. Moreover, it
seems that when an indexical expression denotes a particular object in a given
context, then it must denote the same object in all possible contexts (of its model
M). Therefore, taking into consideration the characteristics of double indexing
displayed above, [[αg]]c,i = DENg(c) may be regarded as a generally valid rule for
determining indexical denotations.
This rule does not mean, in and of itself, that circumstances cannot have an
autonomous function in the process of interpretation. As stated above, circum-
stances might be relevant to assigning truth values to propositions. Consider the
sentence in (2)(d). According to this example, the proposition expressed by ‘I am
Zoltán Vecsey
here now’ should come out true in such contexts, where the agent of the sentence
is at the location of the utterance at the time of the utterance in the world of the
utterance. It is not hard to see that in intended models every context meets these
requirements. Uttered sentences are bound with necessity to their actual agent,
location and time parameters. So the question arises: could we have any model M*
in which the sentence ‘I am here now’ may be interpreted as false? On Kaplan’s
view, such models do not exist. The reason for this is that a counterfactual circum-
stance in which the agent is not at the location and time of the utterance would
contain an impossible world parameter. If one tried to detach the agent parameter
from the location and time parameters, the resulting picture would be obviously
incoherent. Viewed from this perspective, ‘[[I am here nowg]]c,i = 1’ counts as a
logical truth in Kaplan’s theory.
But now consider the apparent difference between the following two
interpretations:
(3) a. [[I am here nowg]]c,i = 1, via the Reference Principle and the rule ‘[[αg]]
c,i = DEN (c)’
g
b. [[Necessarily, I am here nowg]]c,i = 0, via Necessitation and the
Reference Principle with the rule ‘[[α ]]c,i = DEN (c)’3
g g
. In the present setting, ‘Necessitation’ should be interpreted in the same way as in standard
modal logics.
A plausibility-based model of shifted indexicals
to act on contexts, because when operators begin their work the content of the
characters in the embedded sentences is already secured. In fact, this is Kaplan’s
official view. On the other hand, one might assume that context-shifting is gov-
erned by the same scopal mechanisms as circumstance-shifting. It is sensible to
expect ‒ solely on syntactic grounds ‒ that within their scope operators can manip-
ulate contexts in the same way as circumstances.
As for the first view, Kaplan illustrates the immunity of the content of char-
acters to varying contexts with the locational operator ‘in some contexts it is true
that’ (Kaplan 1989a: 510). According to his interpretation, there is a sharp contrast
between the acceptability conditions of (4)(a) and (4)(b):
(4) a. [[I am tired nowg]]c,i = 1, iff AGENTg(c) is tired at LOCATIONg(c) and
at TIMEg(c) in wi
# [[IN SOME CONTEXTS IT IS TRUE THAT (I am tired now )]]c,i = 1,
b. g
iff AGENT(c) is tired at LOCATIONg(c*) and at TIMEg(c*) in wi
(4)(b) seems to assume that there is at least one arbitrary context c* significantly
different from the actual context c, and in this context the agent of the sentence is
tired at the time of the utterance. At first blush this assumption is harmless. The
property of being tired may be truly self-attributed by agents in different contexts.
Accordingly, the proposition expressed by ‘In some contexts it is true that I am
tired now’ seems to be true if and only if the agent denoted by ‘I’ is at the appropri-
ate location and time and she is tired. So far, so good.
But on closer examination, (4)(b) is hard to accept as a correct truth condi-
tional structure, because the indexical expressions ‘I’ and ‘now’ in the embedded
sentence cannot denote arbitrary objects. If I am tired now, I am tired in my actual
context c. I could not be at the location of c* at this very moment. Contrary to
this fact, (4)(b) assumes that it may be true that I am now in c* and I am tired
right there. The problem is caused by the operator ‘in some contexts it is true’. The
example displayed in (4)(b) is unacceptable, because the operator in question can-
not interact with the content of ‘I’ and ‘now’.
The upshot of the Kaplanian argumentation is that operators which attempt to
modify the content of indexicals do not exist in natural languages such as English.
The main point of the Kaplanian view concerning the impossibility of context-
shift is aptly summarised by the Fixity Thesis:4
(FT) The denotation or propositional contribution of an indexical expression a is
fixed solely by the relevant parameters of its actual context of occurrence c,
and cannot be influenced by any sentential operators.
. See Schlenker (2003a, 2003b) and Cable (2005) for other variants of the Fixity Thesis.
Zoltán Vecsey
As we have seen, Kaplan derives (FT) from two key model-theoretic insights. The
first insight is that indexicals are context sensitive expressions that select their
denotations from their proprietary contextual domain. These domains must be
complete in the sense that they must contain all the parameters which are needed
to determine the content of characters. The second insight teaches us how to think
about the propositional contribution of the contextual occurrences of characters.
According to this insight, (i) characters contribute to propositions only the objects
they denote, and (ii) the denoted objects become part of propositions without
any descriptive material. In fact, (i) and (ii) are the general claims on which the
Reference Principle is based.
Having outlined these fundamental model-theoretic ideas, the natural ques-
tion one might ask now is whether (FT) is indeed a valid thesis. Remember that
Kaplan explicitly denies the possibility of context-shifting operators only with
respect to English: “I am not saying we could not construct a language with such
operators, just that English is not one” (Kaplan 1989a: 510).
As can be easily observed, there is some ambiguity in Kaplan’s remark just
quoted above. One might emphasise that concerning the fundamentals and work-
ing of its indexical system, English is not an exceptional language. For example, it
would be an astonishing discovery about German or Hungarian, if it turned out
that indexical expressions in these languages do not obey the Reference Principle.
Of course, the German first person pronoun ‘ich’ and the Hungarian temporal
adverb ‘most’ (translated: ‘now’) obey the Reference Principle, and this is hardly
an accidental coincidence. The pivotal role of every indexical system is to help
speakers to locate themselves in physical and mental space and time. Obviously,
the realisation of this role may yield some subsidiary divergences between certain
natural languages that belong to different families. But in any case, a Kaplan-style
theory of indexicals seems called upon to give an account of those divergences that
arise from specific semantic or syntactic features of individual languages.
So I tend to think that the right question to ask is whether (FT) is a crosslin-
guistically valid thesis. For purposes of clarification, it is useful to distinguish two
aspects of the issue we are considering here. First, one might investigate the Kap-
lanian double indexing model with respect to its theoretical structure. Investiga-
tions of this kind concentrate only on the internal properties of the model: what is
analyzed is whether the constituents of the model are appropriately related to each
other, or whether double indexing provides a structurally complete framework
etc. Second, one might investigate how effectively and efficiently the model can be
applied. In this kind of investigation it might be questioned whether the Kaplanian
model is applicable to English and other natural languages. There is a sensible dif-
ference between the level of theoretical structure and the level of its application.
The latter aspect is what interests us here. So the question we should really ask is
A plausibility-based model of shifted indexicals
The research situation described by Rescher is very similar to ours. We have a ques-
tion pertaining to the crosslinguistic applicability of (FT), but we do not know, at
the moment, how to answer this question conclusively. The method of plausible
argumentation seems to offer a way out of this difficulty. In their contribution to
this volume, Kertész and Rákosi put forward in essence the same idea:
In most cases our hypotheses are not statements the truth of which is guaranteed
by experience, supporting evidence, a proof or whatever. Rather, they are
Zoltán Vecsey
plausible: we are ready to accept them to some extent on the basis of suitable
sources that support them.
(Kertész & Rákosi this volume: Section 2.2; emphasis in the original)
As we can learn from the investigations of Kertész and Rákosi (this volume:
Section 2.2, 2006, 2012: 69ff.), plausibility values come in degrees and are rela-
tive to a given source. We can differentiate between two types of these sources.
In some cases, plausibility values are assigned to statements on the ground of the
. Curiously enough, Kaplan mentions a potential counterexample to his thesis, but in fact
he says nothing about it. See Kaplan (1989a: 510, Footnote 34).
A plausibility-based model of shifted indexicals
r eliability of their sources. The members of this type are called direct sources. In
other cases, the plausibility value of a given statement depends on the plausibility
of other statements. In such cases we speak about indirect sources.6 Interestingly,
most counterexamples to the crosslinguistic applicability of (FT) are built on state-
ments the plausibility of which is obtained from direct sources. Let us analyse the
supporting sources of these statements in some details.
The Amharic indexical first person pronoun is perhaps the best documented
counterexample to Kaplan’s thesis. Schlenker (2003b: 31) proposes to analyse the
following speech situation:
(5) Situation to be reported: John says: “I am a hero”.
a. English: John1 says that he1 is a hero.
b. English: # John1 says that I1 am a hero.
c. Amharic: ȷ̌on ȷ̌ǝgna nǝ-ññ yl-all
John hero be.pfv-1sobj 3m.say-aux.3m
‘John1 says that he1 is a hero.’ (lit.: John1 says that I1
am a hero.)
. For further details see Kertész & Rákosi (this volume: Section 2.2, 2012: 67).
. In accordance with the usage of Kertész and Rákosi (this volume: Section 3.1, 2012:
Section 13.1), by ‘linguistic data’ I mean statements about some aspect of a language or lan-
guages whose initial plausibility value stems from a direct source. Schlenker’s data cannot be
identified with the Amharic sentence tokens themselves, since in order to be theoretically
useful these sentence tokens must already be evaluated from Schlenker’s own linguistic point
of view. In this sense, linguistic data are strongly theory-dependent. In the remainder of the
text, I will follow this approach to linguistic data.
Zoltán Vecsey
In English, (6)(a) reports correctly the de se attitude of the subject of the matrix
sentence (Hesen). (6)(b), however, represents a misleading report, because the
A plausibility-based model of shifted indexicals
English indexical ‘I’ denotes the agent of the reporting context, not the agent
(Hesen) of the original speech situation. Interestingly, the evaluation rules of
Zazaki indexicals allow both forms of reporting. In (6)(c) the embedded clause
mirrors an ambiguity concerning the effects of context-shifting. In the first case,
context-shifting is blocked, and the attitude holder (Hesen) is denoted by the third
person pronoun ‘he’, as is usual in English. In the second case, where shifting is
allowed, the content of the character of the first person pronoun ‘I’ is modified,
and it also denotes the attitude holder. Anand and Nevins (2004: 24) maintain that
the source of this ambiguity lies in the grammar of Zazaki. According to their own
account, there are empirical findings to support this claim. As in the Amharic
example, the crucial point is, again, that shifted indexicals in Zazaki cannot be
quotations. The best evidence for this is that A-extraction is prohibited in direct
quotations, but it is possible out of complements of attitude verbs with shifted
indexicals. Consider the following illustration:
(7) Situation to be reported: Hesen said: “The girl that I kissed is pretty”.
a. English and Zazaki: # The girl that Hesen said, “I kissed t” is pretty.
b. Zazaki: čεnεkε [kε Hεseni va mɨ t paci kεrda] rindεka
girl that Hesen said I t kiss did pretty.be-prs
‘The girl that Hesen said {Hesen, I} kissed is pretty.’8
This example supports the interpretation according to which the shifting effect in
(6)(c) depends on the grammatical features of the attitude verb ‘vano’ rather than
on the mechanism of quoting. If this is so, then, on the basis of its grammar, the
Zazaki counterpart to English ‘say’ may function as a context-shifting operator.
Roughly the same could be said about the Slave attitude verb ‘hadi’ (translated:
‘say’). There are, however, two minor differences between Zazaki and Slave. In Slave
only the first person pronoun can be shifted in the scope of ‘hadi’, and the shift is
always obligatory. On the other hand, the Slave counterpart to E nglish ‘want’ can
optionally modify the content of the character of the first person p
ronoun, and the
counterpart to ‘tell’ can influence optionally all embedded pronouns. At the end
of their investigation, Anand and Nevins (2004: 36) draw the tentative conclusion
that the analysed cases underpin the existence of Zazaki and Slave context-shifting
operators. It would follow from this that (FT) must be abandoned as an applicable
thesis in these languages.
Let us summarise where we are. On the ground of data whose plausibil-
ity originates from reliable direct sources both Schlenker and Anand & Nevins
The first premise of this inference must be assigned the plausibility value 1 on
conceptual grounds. The plausibility value of the second premise depends on the
degree of the reliability of the arguments provided by Schlenker and Anand and
Nevins. If the reliability of these arguments increases, then the plausibility value of
the consequence increases as well.
One should take into consideration, however, that the plausibility value of the
second premise may be high on the basis of the arguments offered by Schlenker
and Anand and Nevins, but it may become implausible on the ground of other
arguments to be considered. The emergence of such a situation cannot be excluded
in principle, since the plausibility value of a given statement may be evaluated by
different sources differently.
The present case is peculiar in the sense that the plausibility value of the
second premise may be easily increased by involving further introspective data
obtained from other languages, provided that these data indicate the existence
of context-shift in these languages.9 But this premise could only be regarded as
implausible with respect to all available sources, if we were able to show that the
plausibility value of the arguments made by Schlenker and Anand and Nevins
arises from unreliable sources.
Stechow (2003) has recently worked out a syntactic theory which may be use-
ful in this respect. Stechow (2003: 380) rejects the view suggested by Schlenker and
Anand and Nevins, according to which sentential operators are capable of shifting
the content of indexicals. In Stechow’s conception, the alleged facts of shifting may
be explained better through the mechanism of feature deletion under semantic
binding. This explanation treats the symptoms of shifting as an effect of feature
deletion at the logical forms of sentences.
Stechow does not deny that attitude verbs are operators. But according to
Stechow, attitude verbs such as TELL or SAY can never act on the content of
. The initial plausibility value of the second premise may be increased, for example, by the
context-shifting phenomena found recently in Navajo. See Speas (2000). On the other hand,
if only such languages were found in which there is no context-shift, then the implausibility
of the statement ‘There are context-shifting languages’ would increase – on the ground of the
investigation of these languages as sources, of course.
A plausibility-based model of shifted indexicals
The binding rule requires that in (8)(b) the attitude holder of the main clause be
co-indexed with the embedded first person pronoun. However, this would involve
an agreement failure both in English and Amharic.10 In order to avoid this failure
the AGENT feature of ‘I’ must be deleted roughly in the following way:11
(8) b′. [[JOHN SAYS THAT (I am a herog)]]c,i = 1, iff (John says that)
XAGENTg(c) is a hero in wi
. As we have seen, the Amharic first person pronoun does not produce such failures in
other syntactic positions. For further details on semantic binding see Büring (2005) and Ball
(2006).
. (8)(b’) shows only the semantic results of feature deletion. The syntactic derivations can
be found in Stechow (2003: 397f.) and Anand (2006: 94–98).
Zoltán Vecsey
In comparing the conclusions of these inferences, we can easily notice that both
‘(FT) is not crosslinguistically applicable’ and ‘(FT) is crosslinguistically applicable’
are plausible to a certain extent. This indicates that the current theoretical situa-
tion concerning the applicability of (FT) is inconsistent. The ground of the incon-
sistency is that both the statement ‘(FT) is crosslinguistically applicable’ and its
negation have been made plausible by some reliable source. This is a special case of
informational overdetermination. If we were to stop here, our analysis of Kaplan’s
thesis would prove to be ineffective, if not useless, for our intended purpose.
. See Kertész & Rákosi (this volume: Section 2.5–6, 2012: Section 10).
. Adherents of the extended plausibility theory may apply more than one problem-solving
strategy in this situation. On the constraints of problem solving see Rescher (1976) and
Magnani (2009).
A plausibility-based model of shifted indexicals
the roots of shifting lie at the features of the lexical entries, then it is not clear why
there occurs any difference between ‘tell’ and ‘want’.14 If an indexical expression
shifts in the scope of ‘tell’, then it should have to shift in the scope of ‘want’ as well.
But, as we have learned, ‘want’ is able to shift only the content of the first person
pronoun. This is a contradiction.
Second, a closer examination would reveal that Stechow’s theory also has an
implicit problem. At first sight, it appears that the mechanism of feature deletion
offers a comprehensive explanation for the phenomenon of context-shift. What
we have seen in (8)(b’) is, however, only the first half of an explanation. We have
seen that ‘I’ has lost its AGENT feature under the binder SAY. But Stechow’s theory
does not shed light on the relationship between the residual features of ‘I’ and
‘John’. Specifically, it is not made clear how the pronoun in question may denote
the attitude holder:
(9) ?[[IAGENTg]]c,i = ATTITUDE HOLDERg(c) in wi
As long as the underlying mechanisms of (9) are not fully understood, Stechow’s
theory cannot be regarded as satisfactory.
In the remainder of this section, I will outline a theory – or, rather, a the-
ory-element – which has the following advantages: (i) it is able to eliminate the
inconsistency revealed by the re-evaluation of the theoretical situation above, and
(ii) it is able to assign plausibility values to the statements about context-shifting
languages and (FT).
In my view, Stechow’s theory is basically on the right track. But something is
missing from his view. As (9) shows, quantified first person pronouns in Amharic
and Slave require a passage from their contextually determined denotation to
another one. The passage has the following form:
(10) AGENTg(c) in wi → ATTITUDE HOLDERg(c) in wi
If we try to generalise the phenomenon of passage for all shiftable indexicals, then
we get the following form:
(11) DENg(c) in wi → DEN*g(c) in wi
In (11), the second denotation DEN*g(c) functions as a proxy of the first one.
The role of the proxy is to make it possible to evaluate the shifted content. It is
important to see that if DENg(c) is given, then DEN*g(c) cannot be arbitrarily
determined. Not any candidate object can function as a proxy for DENg(c). There
. Compare Anand (2006: 91). It must be added that the view advocated in Anand & Nevins
(2004) is rejected by Anand (2006). This latter work rejects the view according to which
indexical shifting is driven by context-shifting operators.
Zoltán Vecsey
. In (PC), the symbol OP represents an appropriate operator that acts on the content of a.
. Moreover, if we delete the OP component from (PC), we can also explain the cases of
free context-shift. Free context-shift may occur, for example, in free indirect discourses or in
historical narratives even in English. On this topic see Sidelle (1991) and Sharvit (2004).
A plausibility-based model of shifted indexicals
5. Conclusion
Remember, we have departed from Kaplan’s original theory about the impossibil-
ity of operator controlled context-shift. As a result of metatheoretical reflection,
we have extended the Kaplanian theory by adding new data sources to it. There
then emerged a conflict because the plausibility of a statement was judged differ-
ently by different sources: we have realised that the current theoretical situation
concerning the crosslinguistic applicability of (FT) is inconsistent. In the following
step, we have partly re-evaluated the extended theory. This has brought to light
that the re-evaluated theory still suffers from some deficiencies. In particular, it
has turned out that the view advocated by Schlenker and Anand and Nevins leads
to contradictory consequences, and Stechow’s account has proved to be essentially
incomplete. In the final step, we have tried to remove these deficiencies by intro-
ducing a new theory-element into the debate.
According to the position we have arrived at, context-shift is a binding
phenomenon that is governed language-externally by a pragmatic constraint.
From this it can be seen that our final position acknowledges the existence of
context-shifting languages, but denies the possibility of operator controlled con-
text-shift. In the end, what we have done may be seen as an attempt to provide a
consistent solution to the problem raised by the inconsistent approaches to the
crosslinguistic applicability of (FT). It should be noted, however, that (PC) is
no more than a provisional resolution to the current problem. It is by no means
a definitive solution. For it rests on a set of theoretical statements that are not
maximally reliable, but only to a certain extent plausible. For example, the state-
ment according to which a pragmatic component is needed to explain the pos-
sibility of context-shift cannot be regarded as true with certainty. Statements
like this are at best plausible in the light of the certain sources. It should also
be noted that (PC) might lead to further problems and, in turn, the analysis of
these problems might lead to new insights that modify the plausibility status of
(PC). That is, (PC) should be seen as a fallible attempt to formulate a pragmatic
constraint on shifting.
In spite of its provisionality and fallibility, the theoretical position advanced
in this chapter has a notable advantage. If we compare the applied methodolo-
gies of the rival positions in the debate revolving around Kaplan’s theory, then
we can see that (PC) represents the most consistent and wide-ranging approach
to the problem of shifted indexicals. In sum, it should be stressed once more that
without supplementing the methodology of Kaplan’s double index theory with
plausibility considerations, we would not have been able to develop our alterna-
tive account.
Zoltán Vecsey
Acknowledgements
Work on the present chapter was supported by the MTA-DE Research Group for
Theoretical Linguistics and the TÁMOP-4.2.2/B-10/1-2010-0024 project, which is
co-financed by the European Union and the European Social Fund.
References
Sharvit, Yael. 2004. Free indirect discourse and de re pronouns. In Proceedings of Semantics and
Linguistic Theory (SALT) 14, Robert B. Young (ed.), 305–322. Ithaca NY: CLC Publications.
Sidelle, Alan. 1991. The answering machine paradox. Canadian Journal of Philosophy 81: 525–539.
Speas, Margaret. 2000. Person and point of view in Navajo. In WCCFL Papers in Honor of Ken
Hale, Eloise Jelinek (ed.), 19–38. Cambridge MA: The MIT Press.
von Stechow, Arnim. 2003. Feature deletion under semantic binding: Tense, person, and mood
under verbal quantifiers. In NELS 33. Proceedings of the Thirty-Third Annual Meeting of
the North East Linguistic Society, Makoto Kadowaki & Shigeto Kawahara (eds), 379–404.
Amherst MA: GLSA.
part iii
Metatheoretical applications
chapter 9
The aim of the present chapter is to reconstruct the relationship between thought
experiments and real experiments in pragmatics. In the first part, Searle’s (1969)
thought experiment on Kennst du das Land, wo die Zitronen blühn? is analysed
with the help of the p-model. The second part reconstructs the argumentation
structure of the real experiments reported on in Holtgraves & Ashley (2001),
and shows that the latter and Searle’s thought experiment closely interact.
Basically, their relationship is characterised by the dynamic process of the cyclic,
prismatic and retrospective re-evalutation of information in the sense of the
p-model. However, they have different strengths and weaknesses. Accordingly,
the generalised finding is that the conscious integration of real and thought
experiments as data sources within some research may considerably enhance the
reliability of pragmatic hypotheses.
1. Introduction
One of the most important insights obtained in the course of the current discus-
sion on linguistic data and evidence is that in many cases it is not sufficient to
restrict one’s data to just one data type and data source. Rather, the present state
of the discussion seems to suggest the integration of several data sources and data
types. However, this desideratum is highly problematic. In particular, it is any-
thing but clear which data sources and data types can be integrated, how integra-
tion can be achieved and under what conditions. With respect to these questions,
the relationship between thought experiments and real experiments in pragmatics
is especially challenging.
Austin’s, Searle’s, Grice’s and others’ thought experiments shaped the clas-
sical topics of pragmatics; moreover, without further thought experiments later
approaches could not have been elaborated. Psycholinguistic experiments were
András Kertész & Csilla Rákosi
sparse and did not gain ground for decades (see Noveck & Sperber 2007: 2). By
now they have, however, become increasingly popular. Basically, this development
could proceed in two directions.
The first possible direction is the replacement of thought experiments by real
ones. The rapid development of experimental pragmatics has triggered decisive
changes in the state of the art. The relevance of the current focus on real experi-
ments in pragmatics is beyond doubt: the rise of experimental pragmatics has been
accompanied by the introduction of new methods, the detection of new research
topics, the publication of new findings and the restructuring of the scientific com-
munities involved. Furthermore, there has also been a revision of earlier hypothe-
ses and assumptions, solutions to a series of unsolved problems were proposed and
attempts were made at substantial modifications of methodological rules applied
in pragmatics research. For example, in Noveck and Sperber’s characterisation,
traditional pragmatics relied on pragmatic intuitions which must be regarded as
results of thought experiments and cannot be interpreted as “facts” in the same
sense as data which are normally regarded as objects of linguistic theorising:
by thought experiments. If so, then it follows that neither pragmatic intuitions nor
results gained from other kinds of thought experiments can be used for testing
hypotheses, they can only motivate the raising of hypotheses. This is in accord
with the requirements of the standard view of linguistic data and evidence that
thought experiments (and particularly, pragmatic intuitions) cannot play the role
of evidence for or against hypotheses.1 From this Noveck and Sperber conclude
that they have to be replaced by experimental data:
Besides helping compensate for the inherent limits of pragmatic intuitions, an
experimental approach can provide crucial evidence when deciding between
alternative theories that may agree on the content of the interpretations
of utterances, but that have different implications regarding the cognitive
mechanisms through which these interpretations are arrived at.
(Noveck & Sperber 2007: 2f.; emphasis added)
At present, one cannot decide which of these alternative directions future develop-
ment will take, and whether there are further possibilities as well. Therefore, one of
the most relevant, but still unclarified, methodological aspects of current pragmat-
ics is the relationship between thought experiments and real experiments. Thus, the
present chapter centres on the problem (P) below:
(P) What kind of relationship is there between thought experiments and real
experiments as data sources in pragmatics?
. For the notion of the standard view of linguistic data and evidence see Kertész & Rákosi
(this volume a, 2012: Section 3.3).
. In a considerably milder fashion, Noveck and Reboul (2008: 5f.) evaluate the appear-
ance of experiments in pragmatics as the result of a linear development, too, where thought
experiments are interpreted only as (historical) starting points. Nevertheless, they interpret
the r elationship between “traditional” pragmatics and experimental pragmatics not as a con-
flict or opposition but as an example of “how the cognitive sciences can work together in
harmony” (Noveck & Reboul 2008: 5).
András Kertész & Csilla Rákosi
In order to get an insight into the nature of this relationship, we will apply Kertész
and Rákosi’s (2012) p-model as summarised in Kertész & Rákosi (this volume b).
In Section, 2 we will provide a rough overview of the philosophical literature
on thought experiments. We will highlight two points. First, according to sev-
eral philosophers of science, there is a close relationship between real and thought
experiments. Second, thought experiments are interpreted in a part of the current
literature as a special kind of reasoning, and experimental narratives are supposed
to be constitutive elements of both real and thought experiments. Therefore, the
application of an argumentation theoretical model to real and thought experi-
ments seems to be well-motivated. We have already proposed a model of psycho-
linguistic experiments based on the p-model (cf. Kertész & Rákosi 2012; Rákosi
2011a, b, 2012). Since the p-model characterises real experiments in such a way
that it assumes that they are organised by a plausible argumentation process, we
will try to characterise thought experiments along similar lines (see also Kertész
2010; Kertész & Kiefer 2013).
In order to illustrate how the p-model works and in order to exemplify how
it yields a possible solution to (P), Section 3 will be devoted to a case study. We
will reconstruct the argumentation structure of one of Searle’s (1969) thought
experiments.
Section 4 will focus on another case study: the analysis of real experiments as
discussed in Holtgraves & Ashley (2001) which are related to the thought experi-
ment discussed in Section 3.
On the basis of our analyses in Sections 3 and 4, in Section 5 we will reveal dif-
ferent aspects of the relationship between Searle’s thought experiment and Holt-
graves and Ashley’s real experiments. After generalising our results, we will obtain
the solution to (P).
Finally, in Section 6 we will summarise our findings and relate them to the
contemporary discussion on data and evidence in linguistics.
Schrödinger’s cat. The 17th century saw some of its most brilliant practitioners
in Galileo, Descartes, Newton, and Leibniz. And in our own time, the creation
of quantum mechanics and relativity are almost unthinkable without the crucial
role played by thought experiments. (Brown & Fehige 2011)
In contrast, in the philosophy of science – under the influence of the harsh criti-
cism of Pierre Duhem and the standard view of the analytical philosophy of
science – for a long time they were attributed only a psychological relevance as
being solely a means of discovery but not of the justification of theories.3 Partly on
the basis of the rediscovery of Ernst Mach’s views on the subject, over the last two
decades or so thought experiments have become an attractive topic in the philoso-
phy of science and there has been a radical turn in the way they are evaluated. For
most authors the focus no longer seems to be on deciding whether or not they are
legitimate tools in scientific theorising but rather on metascientific reflection con-
cerning their common features, classification, structure, functioning, limits, etc.
Although there is no widely accepted definition of the concept ‘thought exper-
iment’, most authors agree that argumentation belongs to the constitutive elements
of thought experiments. Nevertheless, there are several markedly different views
about the nature and role of these argumentative aspects.
As we have already mentioned, many authors agree that thought experiments
consider the consequences of a hypothesised state of affairs. However, this assump-
tion gives rise to different lines of reasoning. For example, John Norton assumes
that thought experiments are just “ordinary argumentation that is disguised in a
vivid pictorial or narrative form. […] They can do nothing more epistemically
than can argumentation” (Norton 2002: 1).
Norton’s position did not find general acceptance and became the object of
severe criticism. For example, Nersessian (1993) argues that although argumenta-
tion is one of their constitutive elements, thought experiments are basically of a
non-argumentative character:
While I agree with Norton (1991) that thought experiments can often be
reconstructed as arguments, the modeling function cannot be supplanted by an
argument. As Norton acknowledges, the argument can be constructed only after
the fact. That is, on my account, the argument is not evident until after the thought
experiment has been constructed and executed. […] real tworld experimental
outcomes can be recast in argument form as well, but no one would argue that the
experiment can be replaced by the argument. (Nersessian 1993: 297)
. For more on the history of the philosophy of thought experiments, see e.g. Nersessian
(1993), Kühne (2005), Moue et al. (2006).
András Kertész & Csilla Rákosi
Gooding remarks that his and Nersessian’s standpoints are close to each other.
They share a common general view according to which thought experiments and
real experiments are cognates. Nevertheless, they take different perspectives: while
Gooding’s point of reference seems to be real experiments in physics, Nersessian’s
position can be interpreted as that of a cognitive psychologist.4
. Similarly to real experiments, thought experiments are not directly accessible to the
reader but they can be partially reconstructed from the paper describing and summarising
them. Therefore, the primary object of an argumentation theoretical reconstruction can be the
paper presenting the results, that is, the ‘(thought) experimental report’.
. In Goethe’s original poem, one finds blüh’n instead of blühen. In quoting Searle, we will
preserve his way of quoting Goethe.
András Kertész & Csilla Rákosi
The starting p-context includes Grice’s (1989a [1957]) explication of the notion
of ‘non-natural meaning’ which is one of the central theses of his model of
communication:7
(2) “[t]o say that a speaker S meant something by X is to say that S intended
the utterance of X to produce some effect in a hearer H by means of the
recognition of this intention.” (Searle 1969: 43)
(2) receives a positive plausibility value on the basis of Grice (1989a [1957]) as a
direct source. This source is part of the p-context of Searle’s argumentation, but
its reliability is not sufficient to assign to it a higher plausibility value. Therefore,
Searle seems to be ready to acknowledge Grice’s paper as a reliable source only to
a limited extent and only provisionally. The reason for this decision is the circum-
stance that Searle does not find Grice’s argumentation in support of (2) convinc-
ing, because, according to him,
(3) (2) “fails to account for the extent to which meaning can be a matter of rules
or conventions”. (Searle 1969: 43)
Should one accept (3), then (2) would be implausible; nevertheless, (3) is at this
point not reliable enough to legitimise the rejection of (2). Thus, the starting
p-context is p-inconsistent in the sense of Subsection 2.5 of Kertész & Rákosi (this
volume b). Accordingly, Searle’s aim can be characterised in such a way that he
tries to increase the plausibility value of (3) and to decrease that of (2).
. Following Searle, we will speak simply of ‘meaning’, because we are not concerned with
such uses of ‘mean’ as e.g. Clouds mean rain which Grice (1989a [1957]) distinguishes from
‘non-natural meaning’.
Thought experiments and real experiments as converging data sources in pragmatics
a new source – namely, his thought experiment. For convenience we quote Searle’s
thought experimental report at length:8
Suppose that I am an American soldier in the Second World War and that
(4) a.
I am captured by Italian Troops. And suppose also that I wish to get these
troops to believe that I am a German soldier in order to get them to release
me. What I would like to do is to tell them in German or Italian that I am a
German soldier. But let us suppose I don’t know enough German or Italian
to do that. So I, as it were, attempt to put on a show of telling them that I
am a German soldier by reciting those few bits of German I know, trusting
that they don’t know enough German to see through my plan. Let us
suppose I know only one line of German which I remember from a poem
I had to memorise in a high school German course. Therefore, I, a captured
American, address my Italian captors with the following sentence: Kennst
du das Land, wo die Zitronen blühen?
b. Now, let us describe the situation in Gricean terms. I intend to produce a
certain effect in them, namely, the effect of believing that I am a German
soldier, and I intend to produce this effect by means of their recognition of
my intention. I intend that they should think that what I am trying to tell
them is that I am a German soldier. But does it follow from this account
that when I say, Kennst du das Land…etc. what I mean is ‘I am a German
soldier’?
c. Not only does it not follow, but in this case I find myself disinclined to say
that when I utter the German sentence what I mean is ‘I am a German
soldier’, or even ‘Ich bin ein deutscher Soldat’, because what the words mean
and what I remember that they mean is ‘Knowest thou the land where
the lemon trees bloom?’ Of course, I want my captors to be deceived into
thinking that what I mean is: “I am a German soldier”, but part of what is
involved in that is getting them to think that that is what the words I utter
mean in German.
d. In the Philosophical Investigations, Wittgenstein (discussing a different
problem) writes “Say ‘It’s cold here’ and mean ‘it’s warm here’”.
e. The reason we are unable to do this without further stage setting is that
what we can mean is at least sometimes a function of what we are saying.
f. Meaning is more than a matter of intention, it is also at least sometimes a
matter of convention.
. We will divide the quotations into smaller units in order to make it easier to refer to parts
of the quotation to be analysed. The units do not necessarily correspond to paragraphs in the
original text.
András Kertész & Csilla Rákosi
g. One might say that on Grice’s account it would seem that any sentence can
be uttered with any meaning whatever, given that the circumstances make
possible the appropriate intentions. […]
h. Grice’s account can be amended to deal with counter-examples of this kind.
We have here a case where I intend to produce a certain effect by means of
getting the hearer’s recognition of my intention to produce that effect, but
the device I use to produce this effect is one which is conventionally, by the
rules governing the use of that device, used as a means of producing quite
different illocutionary effects, and the stage setting or conditions which
would permit us to say one thing and mean something totally unrelated are
not present.
i. We must […] reformulate the Gricean account of meaning in such a way as
to make it clear that one’s meaning something when one utters a sentence
is more than just randomly related to what the sentence means in the
language one is speaking. In our analysis of illocutionary acts, we must
capture both the intentional and the conventional aspects and especially the
relationship between them.”
(Searle 1969: 44f.; italics as in the original, bold emphasis added)
(4)(a) delineates an imaginary situation. In (4)(b)–(c), plausible inferences are
presented which provide an interpretation of some facets of the given situation
with the help of the terminology of speech act theory. These inferences rely on
the following data introduced into the p-context through the imaginary situation
in (4)(a):
(5) a. 0 < |In the situation depicted in (4)(a), John Searle says, Kennst du das
Land… etc. because he wishes to get Italian soldiers to believe that he is a
German soldier.|S < 1
b. 0 < |In the situation depicted in (4)(a), Italian soldiers do not know what
the utterance Kennst du das Land… etc. means.|S < 1
c. 0 < |In the situation depicted in (4)(a), Italian soldiers can realise that the
utterance Kennst du das Land… etc. is a German sentence.|S < 1
The direct source of the plausibility of the statements in (5) is a compound of sub-
sources, abbreviated as S, convering, among other things, Searle’s everyday (linguis-
tic) intuition. Nevertheless, we have no direct access to it but we can reconstruct it
from the text of Searle (1969) by making use of our own (linguistic) intuition. The
statements (5)(a)–(c) do not contain theoretical terms but everyday concepts.
Based on these data, in (4)(b) the characterisation of Searle’s behaviour as
described in (4)(a) is reformulated, that is, retrospectively re-evaluated in accor-
dance with (2) in Gricean terms. Thereby, ‘wish to get X to believe that Y’ as used
in (5)(a) is explicated as ‘intending that X thinks that what was told him is Y’ in
the sense of (2):
Thought experiments and real experiments as converging data sources in pragmatics
(6) 0 < |In the situation depicted in (4)(a), John Searle says, Kennst du das
Land… etc. because he wishes to get Italian soldiers to believe that he is a
German soldier.|S < 1 (=(5)(a))
[0 < |If in the situation depicted in (4)(a), John Searle says, Kennst du das
Land… etc. because he wishes to get Italian soldiers to believe that he is a
German soldier, then he intends that the German soldiers should think that
what he is trying to tell them is ‘I am a German soldier’ when saying Kennst
du das Land… etc.|S < 1]
0 < |In the situation depicted in (4)(a), Searle intends that the German
soldiers should think that what he is trying to tell them is ‘I am a German
soldier’ when saying Kennst du das Land… etc.|I(6) < 19
Since the conclusion of (6) will play an important role in Searle’s argumentation,
we will represent it as hypothesis (7):
(7) 0 < |In the situation depicted in (4)(a), Searle intends that the German
soldiers should think that what he is trying to tell them is ‘I am a German
soldier’ when saying Kennst du das Land… etc.|I(6) < 1
The question in the last sentence of (4)(b) rests on the assumption that the appli-
cation of (2) to the situation described in (4)(a) would lead to the conclusion that
the meaning of Searle’s words has to be identified with the content of his intention:
(8) 0 < |In the situation depicted in (4)(a), Searle intends that the German
soldiers should think that what he is trying to tell them is ‘I am a German
soldier’ when saying Kennst du das Land… etc.|I(6) < 1 (= (7))
0 < |The meaning of an utterance has to be identified with the speaker’s
intention related to this utterance.|G < 1 (= (2))10
0 < |In the situation depicted in (4)(a), Searle means with his words ‘I am a
German soldier’ when saying Kennst du das Land… etc.|I(8) < 1
. In the conclusion of (6), I(6) stands for the inference in (6) as an indirect source of the
p| indicates the plausibility value of statement p; see Kertész & Rákosi (this
conclusion. | _
volume b: Section 2.2, Example 3; Section 2.3).
. G indicates Grice (1989a [1957]) as a direct source.
András Kertész & Csilla Rákosi
Thus, the analysis of the imagined situation in (4)(a) based on Grice’s theory of
meaning, and especially, on hypothesis (2) leads to the conclusion that the mean-
ing of Searle’s words in the situation in (4)(a) has to be identified with the speaker’s
intention, that is, with the information that he wants to communicate. The current
state of the p-context is as in Figure 1.11
(9)
Figure 1. The state of the p-context after its extension with (4)(a–b)
(9) is, however, questioned immediately in the first sentence of (4)(c). The starting
point of Searle’s argumentation against (2) is the following datum:
(10) 0 < |According to the rules of German as well as what Searle remembers,
the words Kennst du das Land… etc. mean that ‘Knowest thou the land
where the lemon trees bloom?’.|S < 1
The extension of the p-context with (10) makes it possible to draw the following
plausible inference:
(11) 0 < |According to the rules of German as well as what Searle remembers,
the words Kennst du das Land… etc. mean ‘Knowest thou the land where
the lemon trees bloom?’.|S < 1 (= (10))
[0 < |There is no difference between the meaning of these words according
to the rules and conventions of the German language as Searle remembers
and what Searle means with them in the given situation.|S < 1]
0 < |In the situation depicted in (4)(a), Searle means with his words
‘Knowest thou the land where the lemon trees bloom?’.|I(11) < 1
In (11), the latent background assumption identifies what Searle means by his
utterance in this situation with the conventional meaning of his words. Thus, the
argumentation process has led to the following hypothesis:
. In Figures 1–10, hypotheses are in italics, and inferences are in normal script. C
onclusions
of inferences are connected with their premises and the given inference by simple arrows.
Double dotted arrows indicate p-inconsistency.
Thought experiments and real experiments as converging data sources in pragmatics
(12) 0 < |In the situation depicted in (4)(a), Searle means with his words
‘Knowest thou the land where the lemon trees bloom?’.|I(11) < 1
(8) and (11) lead to contradicting conclusions – clearly due to the circumstance
that they use ‘mean’ in two different senses: in (8) in accordance with (2), that is,
by identifying meaning with the speaker’s intention, and in (11) by identifying it
with the conventional meaning of the uttered words. Figure 2 shows the current
state of the p-context.
(9) (12)
Figure 2. The state of the p-context after its extension with the first part of (4)(c)
Searle deems (12) more plausible than (9) and solves this contradiction in
such a way that he gives up (9) and retains (12). Thus, the following indirect source
presents itself:
(13) 0 < |In the situation depicted in (4)(a), Searle means with his words
‘Knowest thou the land where the lemon trees bloom?’.|I(11) < 1 (= (12))
0 < |In the situation depicted in (4)(a), Searle intends that the German
soldiers should think that what he is trying to tell them is ‘I am a German
soldier’ when saying Kennst du das Land… etc.|I(6) < 1 (= (7))
[0 < |If in the situation depicted in (4)(a), Searle intends that the German
soldiers should think that what he is trying to tell them is ‘I am a G erman
soldier’ when saying Kennst du das Land… etc. but he means with his words
‘Knowest thou the land where the lemon trees bloom?’, then the meaning of
this utterance cannot to be identified with the speaker’s intention related to
this utterance.|S < 1]
0 < |In the situation depicted in (4)(a), the meaning of Searle’ utterance
cannot be identified with the speaker’s intention related to this
utterance.|I(13) < 1
Thus, the interpretation of the situation in (4)(a) comes to the conclusion that
(14) 0 < |In the situation depicted in (4)(a), the meaning of Searle’s utterance cannot
be identified with the speaker’s intention related to this utterance.|I(13) < 1
András Kertész & Csilla Rákosi
In the last sentence of (4)(c), Searle provides a possible analysis of the mechanism
of the deceit applied in the story. There he seems to exploit the given situation with
the help of the data in (5)(b) and (c):
(15) 0 < |In the situation depicted in (4)(a), the meaning of Searle’s utterance
cannot be identified with the speaker’s intention related to this
utterance.|I(13) < 1 (=(14))
[0 < |If in the situation depicted in (4)(a), the meaning of Searle’s utterance
cannot be identified with the speaker’s intention related to this utterance,
Italian soldiers do not know what the utterance Kennst du das Land… etc.
means, and they can realise that the utterance Kennst du das Land… etc.
is a German sentence, then they will erroneously think that the sentence
Kennst du das Land… etc. means ‘I am a German soldier’ according to the
conventions of the German language.|S < 1]
0 < |If in the situation depicted in (4)(a), Italian soldiers erroneously think
that the sentence Kennst du das Land… etc. means ‘I am a German soldier’
according to the conventions of German, then they should erroneously
think that with the utterance of the sentence Kennst du das Land… etc.
Searle means ‘I am a German soldier’.|S < 1
0 < |In the situation depicted in (4)(a), Italian soldiers do not know what
the utterance Kennst du das Land… etc. means.|S < 1 (= (5)(b))
0 < |In the situation depicted in (4)(a), Italian soldiers can realise that
the utterance Kennst du das Land… etc. is a German sentence.|S < 1
(= (5)(c))
0 < |In the situation depicted in (4)(a), Italian soldiers may erroneously
think that with the utterance of the sentence Kennst du das Land… etc.
Searle means ‘I am a German soldier’.|I(15) < 1
As a result, a hypothesis was obtained that would be p-inconsistent with (9) that
has already been rejected, but is p-consistent with (12):
(16) 0 < |In the situation depicted in (4)(a), Italian soldiers may erroneously
think that with the utterance of the sentence Kennst du das Land… etc.
Searle means ‘I am a German soldier’.|I(15) < 1
(12) (13)
(2) (14)
(16)
Figure 3. The state of the p-context after its extension with the second part of (4)(c)
. According to Walton et al. (2008: 189), “[o]ne problem with enthymemes is that reason-
able people can have differences of opinion on what the implicit assumptions are supposed
to be. Filling in the missing parts of an enthymeme may depend on interpreting the natural
language text in which the argument was put forward […].”
. The whole passage is this: “510. Make the following experiment: say ‘It’s cold here’ and
mean ‘It’s warm here’. Can you do it? – And what are you doing as you do it? And is there only
one way of doing it?” (Wittgenstein 1958 [1953]: 140; italics as in the original, bold emphasis
added).
András Kertész & Csilla Rákosi
At this point, Searle does not clarify what “further stage setting” means; only
later does the reader receive explanation, in (4)(h). Here again, the reliability of
this thought experiment depends crucially on the question of whether the reader
accepts this datum by judging it with the help of his/her own (linguistic) intuition.
Based on (17), the following plausible inference presents itself:
(18) 0 < |Without further stage setting, when one says, It’s cold here, one cannot
mean ‘It is warm here’.|S < 1 (= (17))
0 < |If without further stage setting, when one says, It’s cold here, one cannot
mean ‘It is warm here’, then there are cases where, what one can mean is a
function of what one is saying.|S < 1
0 < |There are cases in which, what one can mean is a function of what one
is saying.|I(18) < 1
(19) is clearly p-consistent with (14), which states that the meaning of Searle’s
utterance in the situation depicted in (4)(a) cannot be identified with the speaker’s
intention related to this utterance.
(4)(f) continues the analysis of the Wittgensteinean situation and puts for-
ward a plausible inference based on (19) in order to elucidate and generalise it.
Searle’s argument can be reconstructed in the following way:
(20) 0 < |There are cases in which what one can mean is a function of what one is
saying.|I(18) < 1 (= (19))
[0 < |If there are cases in which what one can mean is a function of what
one is saying, then meaning is more than a matter of intention, it is also at
least sometimes a matter of convention.|S <1]
0 < |Meaning is more than a matter of intention, it is also at least sometimes
a matter of convention.|I(20) < 1
(21) 0 < |Meaning is more than a matter of intention, it is also at least sometimes
a matter of convention.|I(20) < 1
At this point, Searle’s argumentation turns back to the point where it had been
suspended before the subcycle was started, and the new information obtained is
added to the p-context. The state of the p-context as depicted in Figure 3 has been
retrospectively re-evaluated. Data originating from Wittgenstein’s thought experi-
ment contributed to the increase of the plausibility value of ~(2), since besides (14)
and (16), (21) is also p-inconsistent with (2). Thereby, (21) is a reformulated and
re-evaluated version of ~(2). Since (19) covers (14) as a special case, it would be
possible to construct a plausible inference with (14) as one of its premises and (21)
as its conclusion. After its coordination, the p-context is as in Figure 4.
(12) (13)
(21)
(19) (20)
(17) (18)
Figure 4. The state of the p-context after its extension with (4)(d–f)
3.2.2 Th
e continuation of the argumentation cycle: Further elaboration
of a rival hypothesis
After the coordination of the p-context, Searle turns to the application of the Con-
trastive Strategy in order to arrive at a decision between (2) and its rival, (21). In
(4)(g), he generalises the conclusion which emerges from the application of the
Gricean hypothesis in (2) to the imaginary situations delineated in (4)(a) and (4)(d):
(22) 0 < |If the meaning of an utterance is identified with the speaker’s intention
related to this utterance, then any sentence can be uttered with any meaning
whatever, given that the circumstances make possible the appropriate
intentions.|S < 1
András Kertész & Csilla Rákosi
0 < |In the situation depicted in (4)(a), the meaning of Searle’s utterance
cannot be identified with the speaker’s intention related to this
utterance.|I(13) < 1 (= (14))
0 < |In the situation depicted in (4)(d), without further stage setting, when
one says, It’s cold here, one cannot mean that ‘It is warm here’.|S < 1 (= (17))
0 < |The meaning of an utterance cannot be identified with the speaker’s
intention related to this utterance.|I(22) < 1 (= ~(2))
That is, the plausible inference presented in (22) makes (2) implausible, because it
shows that (2) overgeneralises. The p-context is as in Figure 5.
Figure 5. The p-context resulting from the application of (2) to the imaginary situations
delineated in (4)(a) and (4)(d)
While the data stemming from the two thought experiments provide weak
evidence for (21), they also provide weak evidence against (2) in the sense of Sub-
section 3.2 in Kertész & Rákosi (this volume b). Therefore, since the p-context
version of (2) is burdened with a high amount of p-inconsistency, Searle gives it
up and preserves only its rival and elaborates further on it. He turns back to the
Exclusive Strategy in order to find out whether the p-context version of (21) pro-
vides a satisfactory solution of the starting p-problem.
(17), (19), and (21) do not exclude the possibility that sometimes a situation can
be established in which what one can mean is independent of what one is saying. As
we have remarked with respect to (17) in Subsection 3.2.1, Searle does not provide
any explanation of what “further case setting” means. Thus, it remains unclarified
how cases in which utterance meaning can be identical with a completely different
speaker’s intention can be distinguished from cases in which this is not possible. (4)
(h) deals with this question and identifies the factors which are supposed to consti-
tute the difference between the two types of cases. Since it is not clear from Searle’s
formulation which of the two thought experiments – i.e. Searle’s or Wittgenstein’s – is
referred to in this passage, our reconstruction will pertain to both of them:
Thought experiments and real experiments as converging data sources in pragmatics
(23) 0 < |In the situation depicted in (4)(a), Searle intends that the hearer should
think that what he is trying to tell him is ‘I am a German soldier’ when
saying, Kennst du das Land… etc.|(I)(6) < 1 (= (7))
0 < |According to the rules of German as well as what Searle remembers,
the words Kennst du das Land… etc. mean ‘Knowest thou the land where
the lemon trees bloom?’.|S < 1 (= (10))
[0 < |If according to the rules of German, the words Kennst du das Land…
etc. mean ‘Knowest thou the land where the lemon trees bloom?’, then these
words are a device which is conventionally used as a means of producing
quite different illocutionary effects than ‘I am a German soldier’.|S < 1]
0 < |In the situation depicted in (4)(a), Searle intends to produce a certain
effect by means of getting the hearer’s recognition of his intention to
produce that effect, but the device he uses to produce this effect is one
which is conventionally used as a means of producing quite different
illocutionary effects.|I(23) < 1
(24) 0 < |In the situation depicted in (4)(d), one intends that the hearer should
think that what one is trying to tell him is ‘It is warm here’ when saying,
It’s cold here.|S < 1
[0 < |According to the rules of English, the words It’s cold here mean ‘It’s
cold here’.|S < 1]
[0 < |If according to the rules of English, the words It’s cold here mean ‘It’s cold
here’, then these words are a device which is conventionally used as a means of
producing quite different illocutionary effects than ‘It is warm here’.|S < 1]
0 < |In the situation depicted in (4)(d), one intends to produce a certain
effect by means of getting the hearer’s recognition of his intention to
produce that effect, but the device he uses to produce this effect is one
which is conventionally used as a means of producing quite different
illocutionary effects.|I(24) < 1
As a next step, in (4)(h), relying, among other things, on the conclusion of (23) and
(24), Searle generalises the explanation which could be applied to these two cases
with the help of the following inductive inference:
(25) 0 < |In the situation depicted in (4)(a), Searle intends to produce a certain
effect by means of getting the hearer’s recognition of his intention to
produce that effect, but the device he uses to produce this effect is one
which is conventionally used as a means of producing quite different
illocutionary effects.|I(23) < 1
0 < |In the situation depicted in (4)(d), one intends to produce a certain
effect by means of getting the hearer’s recognition of his intention to
produce that effect, but the device he uses to produce this effect is one
which is conventionally used as a means of producing quite different
illocutionary effects.|I(24) < 1
András Kertész & Csilla Rákosi
0 < |In the situation depicted in (4)(a) and (4)(d), the stage setting or
conditions which would permit one to say one thing and mean something
totally unrelated are not present.|S < 1
0 < |In the situation depicted in (4)(a) and (4)(d), the sentences mentioned
cannot be uttered with a meaning identical to the speaker’s
intention.|(14) & (17) < 1
[0 < |In every other case, too, if the circumstances do not make possible the
appropriate intentions or (i) the sentence cannot be conventionally, by the
rules governing the use of this sentence, used as a means of producing this
illocutionary effect, and (ii) the stage setting or conditions which would
permit one to say one thing and mean something totally unrelated are not
present, then this sentence cannot be uttered with a meaning identical to the
speaker’s intention.|S < 1]
0 < |A sentence can be uttered with a meaning only if the circumstances
make possible the appropriate intentions and (i) the sentence can be
conventionally, by the rules governing the use of this sentence, used as
a means of producing this illocutionary effect, or (ii) the stage setting or
conditions which would permit one to say one thing and mean something
totally unrelated are present.|I(25) < 1
From this it seems that (4)(a) and (4)(d) refer to one’s linguistic behaviour under
normal circumstances, that is, to a situation in which the speaker and the hearer
know the language used well enough so that he/she cannot be mistaken or deceived
about the conventional meaning of the words used, and both the speaker and the
hearer are supposed to use language in a normal way, without any special modifi-
cation of the meaning of words.
The conclusion of (25) is a retrospectively re-evaluated version of (21), that is,
a further refinement of ~(2):
That is, the results of the two thought experiments motivate the following meth-
odological principle:
(28) The analysis of illocutionary acts must capture both intentional and
conventional aspects of meaning and the relationship between them.
(12) (13)
(14) (16)
(28)
(19) (20)
(23) (24) (25)
(17) (18)
This solution to (1) is provisional and does not count as a resolution in the
sense of Subsection 2.5 in Kertész & Rákosi (this volume b). It is exposed to fur-
ther revision for at least three reasons.
First, the argumentation process continues with Searle’s (1969: 43) consider-
ation of the problem of “what is it for something to have a meaning in analogy
to the process we have just exemplified”. This problem is discussed on pp. 49f. by
contrasting Grice’s solution with Searle’s. The final p-context of this continuation
includes the retrospective re-evaluation of (21)/(26) in the light of Searle’s solu-
tion to the above problem. We will not reconstruct this process here, because it is
analogous to the one we have just analysed.
Second, as is well known, Searle’s account of meaning supported by the
thought experiment in (4) turned out to be problematic in several respects again.
Grice (1989b), Meggle & Ulkan (1979) and other contributions argued against
(21)/(26); i.e. these authors re-evaluated the results of the plausible argumentation
which we have reconstructed above.
Third, the counter-arguments against (21)/(26) motivated, among others, real
experiments. In the next section we will analyse an example which illustrates how
(21)/(26) can be retrospectively re-evaluated by real experiments.
We choose Holtgraves & Ashley (2001) as the subject matter of the case study
that scrutinises a problem with the help of a real experiment which is directly con-
nected to the thought experiment presented by Searle and analysed in the previous
section. For lack of space, we will dispense with the reconstruction of Holtgraves
and Ashley’s (2001) whole argumentation. We will focus on those aspects that are
instructive with respect to our problem (P).
4. Case study 2: Real experimental reports in Holtgraves & Ashley (2001)
The starting p-context is problematic in the sense of Subsection 2.5 in Kertész &
Rákosi (this volume b):
Thought experiments and real experiments as converging data sources in pragmatics
or to assign a plausibility value to its rival, namely, its negation, – i.e. whether (32)
is plausible or implausible.
4.2.1 Experiment 1
Regarding the experimental design,14 participants were presented scenarios fol-
lowed by a dialogue. For example:
(34) “Jenny and Emily had been close friends since grade school. Now they were
rooming together at college.
Emily was very forgetful.
Today, Jenny was sure Emily didn’t remember (had forgotten) her dentist
appointment.
Don’t forget (I’ll bet you forgot) to go to your dentist appointment
Jenny:
today.15
Probe: remind.” (Holtgraves & Ashley 2001: 85)
The task of the participants was to decide whether the probe word – in (34) the
speech act verb remind – had been literally mentioned. Holtgraves and Ashley put
forward the following conjectures about the possible outcome of the experimental
procedure:
(35) a. “If comprehension of a remark involves activation of its illocutionary
force, then, when the probe represents the speech act just performed,
participants’ ability to verify that the probe had not been literally
present should be slowed; activation of the speech act term should
. We do not reconstruct the experimental designs completely in either case. For further
details of these, see the relevant passages in Holtgraves & Ashley (2001).
. “The speech act version contained the italicized material; the control version was created
by replacing the italicized material with the material in parentheses” (op. cit.).
Thought experiments and real experiments as converging data sources in pragmatics
(36) a. 0 < |Mean probe reaction time (and error rate) in speech act scenarios
was 1,034 ms (6.36%) in Experiment 1.|Exp1 < 1
b. 0 < |Mean probe reaction time (and error rate) in control scenarios was
960 ms (1.52 %) in Experiment 1.|Exp1 < 1
c. 0 < |“Participants were significantly slower at verifying that the probe
had not been literally present when it represented the speech act
performed with the final remark than when it did not.”|Exp1 < 1
András Kertész & Csilla Rákosi
It is very important to realise that these data are not true with certainty. The experi-
mental procedure, the interpretation and the authentication of the perceptual data
may be faulty and be in need of modification. They cannot be totally controlled;
it is always possible that relevant factors which influence the outcome of the
experimental process to some extent remain unrevealed. From this it follows that
experimental data can be only plausible.
As a next step, Holtgraves and Ashley confronted these data with one of the
rival hypotheses, namely, with (32). With this aim in view, first they referred to the
conjecture formulated in (35):
(37) 0 < |“If comprehension of a remark involves activation of its illocutionary
force, then, when the probe represents the speech act just performed,
participants’ ability to verify that the probe had not been literally present
should be slowed.” |HA < 1 (= (35))
0 < |When the probe represented the speech act just performed,
participants’ ability to verify that the probe had not been literally present
was slowed.|Exp1 < 1 (= (36))
0 < |Comprehension of a remark involves activation of its illocutionary
force.|I(37) < 1
Although this inference is not deductively valid, it is capable of making its conclu-
sion plausible, since it can be shown that it belongs to the third group of plausible
inferences (see Kertész & Rákosi, this volume b: Section 2.3):
(38) 0 < |Comprehension of a remark involves activation of its illocutionary
force.|I(37) < 1
As for the coordination and modification of the p-context, experimental data origi-
nating from Experiment 1 as a data source provide support for (32). This means
that the hypothesis that comprehension involves recognition of the action that
is performed with an utterance has become plausible. The current state of the
p-context is as in Figure 7.
Thought experiments and real experiments as converging data sources in pragmatics
(38) (39)
(32)
(40) “A potential problem with this experiment is that participants took
longer to read the final remark when it performed the relevant speech
act […] than when it did not […]. Hence, slowed judgement speeds
following the relevant speech act utterances could have been due to the
greater comprehension difficulty for these remarks, relative to the control
remarks.” (Holtgraves & Ashley 2001: 86)
In (37), the authors supposed that slowed reaction times indicate the presence of
illocutionary force activation. After extending the p-context by (40), however, (37)
had to be transformed into another inference which takes into consideration the
presence of a second possible explanation of the experimental data as well. Thus,
the following inference presents itself:
(41) 0 < |If the probe represents the speech act just performed, and participants’
ability to verify that the probe had not been literally present slows, while
there is no difference in the comprehension difficulty for the final sentences
of the speech act scenarios and the control scenarios, respectively, then
comprehension of a remark involves activation of its illocutionary
force.|HA < 1 (= (40))
0 < |When the probe represented the speech act just performed,
participants’ ability to verify that the probe had not been literally present
was slowed.|Exp1 < 1 (= (36))
[|There was no difference in the comprehension difficulty for the
final sentences of the speech act scenarios and the control scenarios,
respectively.| = ?]
|Comprehension of a remark involves activation of its illocutionary
force.|I(41) = ?
András Kertész & Csilla Rákosi
The latent background assumption of (41), which was put between ‘[’ and ‘]’
above, has been retrospectively and prismatically re-evaluated in the sense of Sub-
section 2.6.1 of Kertész & Rákosi (this volume b). It had neutral plausibility during
Experiment 1. Subsequent analysis of the experimental data, as (40) shows, how-
ever, questioned the assumption and showed that it was implausible. As a conse-
quence, the premises of (41) are no longer capable of supporting the conclusion
in the given state of the p-context, even though their plausibility did not change.
Therefore, (38) loses its earlier plausibility value. This does not mean, of course,
that the conclusion became implausible.
In order to find out whether the data originating from Experiment 1 can
be used to support (32), a control experiment was proposed and a new subcycle
initiated.
(43) 0 < |If the probe represents the speech act just performed, and participants’
ability to verify that the probe had not been literally present slows, while
there is no difference in the comprehension difficulty for the final sentences
of the speech act scenarios and the control scenarios, respectively, further
participants make more errors when the probe represented a performed
speech act than when it did not, then comprehension of a remark involves
activation of its illocutionary force.|HA < 1
0 <|When the probe represented the speech act just performed, participants’
ability to verify that the probe had not been literally present was
slowed.|Exp2 < 1 (= (42))
0 < |The participants made more errors when the probe represented a
performed speech act than when it did not.|Exp2 < 1 (= (42))
0 < |There was no difference in the comprehension difficulty for the
final sentences of the speech act scenarios and the control scenarios,
respectively.|Exp2 < 1
0 < |Comprehension involves recognition of the action that is performed
with an utterance. |I(43) < 1 (= (32))
Moreover, (42) indicates that the concern in (40) is unfounded. Thus, (41) can be
rejected and (37) can be added to the p-context again. Figure 8 shows the current
shape of the p-context.
(38) (39)
(32)
To filter out such factors, Experiment 3 was carried out. Although the stimulus
material remained unchanged, the participants had to solve another task. Instead
of indicating whether or not the probe word literally occurred in the conversation
they were presented, they had to check whether a certain letter string was a word.
Holtgraves and Ashley put forward the following conjecture about the possible
outcome of the experimental procedure:
(44) 0 < |“If comprehension of the utterance involved speech act activation,
the participants should be significantly faster at this task when the target
represented the performed speech act than when it did not.” |HA < 1
The interpretation and authentication of the perceptual data led to the following
set of experimental data:
(45) a. 0 < |Mean lexical decision reaction time in speech act scenarios was
1,002 ms in Experiment 3.|Exp3 < 1
b. 0 < |Mean lexical decision reaction time in control scenarios was
1,095 ms in Experiment 3.|Exp3 < 1
c. 0 < |In error-free trials, the overall error rate was 9.6%, and this rate did
not vary across conditions in Experiment 3.|Exp3 < 1
d. 0 < |“[…] the participants were significantly faster at the lexicon
decision task when the target represented the prior speech act than
when it did not […]” in Experiment 3.|Exp3 < 1
It this case, the results were significant both over items and participants. Thus,
the following plausible inference indicates the confrontation of these experimental
data with the previous conjectures:
(46) 0 < |“If comprehension of the utterance involved speech act activation,
the participants should be significantly faster at this task when the target
represented the performed speech act than when it did not.” |HA < 1 (= (44))
Thought experiments and real experiments as converging data sources in pragmatics
0 < |“[…] the participants were significantly faster at this task when
the target represented the performed speech act than when it
did not”.|Exp3 < 1 (= (45))
0 < |Comprehension of the utterance involved speech act activation.|I(46) < 1
The conclusion of (46) provides support for (32) again. Changes in the p-context
after extending it with the results of Experiment 3 are as in Figure 9.
(32)
After adding the data obtained in Experiment 4, the p-context is as in Figure 10.
(32)
(48) (49)
value to (32): “[…] we have converging evidence from four separate experiments
and three different methods that illocutionary force is activated” (Holtgraves &
Ashley 2001: 87f.).
The authors explicitly compare their result – i.e. (32) along with its increased
plausibility value gained from the four real experiments – to two rival hypotheses
and, accordingly, apply the Contrastive Strategy. The first is Cohen and Levesque’s
(1990) claim according to which
(50) “[s]peech act recognition is not required, although it may be entailed in the
recognition process.” (Holtgraves & Ashley 2001: 87f.)
The second hypothesis is the stance adopted by conversation analysis as put for-
ward for example in Schegloff (1991) which says that
(51) “[…] an interactant’s understanding of what has been accomplished with a
remark is revealed in that person’s subsequent turn at talk; there is no need
to consider a dimension of illocutionary force.” (ibid.)
Experimental data from Experiments 1–4 are not only weak evidence for (32)
but also against its rivals. Thus, (36), (42), (44) and (48) provide strong evidence
for (32) and against its rivals, (50) and (51), as well (see Kertész & Rákosi, this
volume b: Section 3.2).
its acceptance leads to the least p-problematic p-context – even if this is merely
provisional and can be retrospectively re-evaluated in later cycles of the argumen-
tation process.
However, with respect to the relationship between thought experiments and real
experiments in pragmatics, Holtgraves and Ashley’s experiments work as a prism
through which the outcome of Searle’s thought experiment can be re-evaluated
retrospectively:
(52) 0 < |Comprehension involves recognition of the action that is performed
with an utterance.|HA & Experiments 1–4 < 1 (= (32))
0 < |If comprehension involves recognition of the action that is performed
with an utterance, then “[…] utterances have both a propositional
content and an illocutionary force (the speech act performed with the
utterance).”|HA < 1
[0 < |If “[…] utterances have both a propositional content and an
illocutionary force (the speech act performed with the utterance)”, then
the analysis of illocutionary acts must capture both the intentional and
conventional aspects of meaning and especially the relationship between
them.|HA < 1]
0 < |The analysis of illocutionary acts must capture both the intentional and
conventional aspects of meaning and especially the relationship between
them.|I(52) < 1 (= (28))
This also means that experimental data originating from the four experiments
conducted by Holtgraves and Ashley considerably increased the plausibility value
of (21) as well.
Step 2: On the basis of our analyses in Sections 3 and 4, we can realise that there
is a striking similarity between the argumentation structures of Searle’s thought
experimental report and Holtgraves and Ashley’s experimental report. It seems to
be instructive to examine possible analogies between the argumentation processes
presented in thought experimental reports and in reports on real experiments.
Step 3: Analogies between the argumentation structure of thought experimental
reports and real experimental reports might indicate a deeper similarity between
real and thought experiments themselves. That is, it is possible that there is a basic
similarity between the functioning of both experimental processes, too. Of course,
the question is whether and to what extent the experimental processes can be
reconstructed from the experimental reports.
. The decisions were checked in a pretest. Despite this, the primary source of the plau-
sibility of several hypotheses on which the experimental design was based, is speech act
theory – the pretest only increased these values.
András Kertész & Csilla Rákosi
the results of thought experiments also contributed to the plausibility of the result of
Experiments 1–4 in an indirect way.
Second, Searle’s thought experiment has taken the existence of the
illocutionary aspects of meaning for granted and focused solely on the pres-
ence of the conventional aspects.17 Conversely, Holtgraves and Ashley tried to
find experimental evidence for the presence of the illocutionary force of utter-
ances but did not investigate their conventional aspect. Since all four experi-
ments they conducted provided evidence for the existence and recognition
of speaker intentions, they considerably increased the plausibility of that part
of (28) which is related to the intentional aspects of utterances. In this way,
the results of Experiments 1–4 also contribute to the plausibility of the result of
Searle’s thought experiment.
To sum up, Holtgraves and Ashley’s experiments work as a prism through
which the outcome of Searle’s thought experiment can be re-evaluated retrospec-
tively. At the same time, however, Experiments 1–4 relied on hypotheses which
were partially and indirectly supported by thought experiments. It is important
to emphasise that this process, as a result of which the finding of Holtgraves and
Ashley’s real experiments feeds back to that of Searle’s thought experiment and
increases its plausibility, is not circular. Thus, the generalisation is that the result of
real experiments may increase the plausibility of the result of thought experiments
in the course of the cyclic, retrospective and prismatic re-evaluation of informa-
tion: new pieces of information were made use of, and the thought experiment
and the real experiment were conducive to the creation of a more comprehensive
picture by unifying their virtues.
There is also a third kind of relationship between Holtgraves and Ashley’s
experiments and thought experiments. Namely, the experimental designs of Experi-
ments 1–4 (and in general, every experimental design) can be interpreted as a special
kind of thought experiment. This interpretation is based on the circumstance that
experimenters go through the experimental process in their mind in order to test
whether the procedure proposed will provide relevant and reliable data. Of course,
the results of such thought experiments are considerably vaguer and less reliable
than those of the real experimental procedure. Despite this, the experimental
design is an indispensable part of real experiments. It delineates and motivates the
details of the experimental procedure to be carried out, and it also provides sev-
eral points of reference against which the results and all facets of the experimental
process can be interpreted and checked in order to gain reliable data.
. We have already proposed a model of psycholinguistic experiments based on the p-model
(cf. Kertész & Rákosi 2012; Rákosi 2011a,b, 2012), by relying on insights in the current litera-
ture on scientific experiments. Therefore, we will apply this model to the reconstruction of the
structure of Experiments 1–4.
. In accordance with our definition of the notion of ‘datum’ (Kertész & Rákosi, this
volume b: Section 3.1), by ‘perceptual data’ we mean plausible statements the plausibility
value of which stems from perception as a direct source.
András Kertész & Csilla Rákosi
this process, experimental data20 are obtained which are then confronted with the
given theory or with some rival theories. Nevertheless, it may happen that the
interpretation and authentication of the perceptual data indicates shortcomings
in the experimental procedure, in the experimental design, or in the theoretical
model of the phenomena or of the apparatus. If things do not run smoothly, one
turns back to some earlier stage of the experimentation process and modifies a
component until there is mutual support among the constituents. All facets of the
experiment have to be re-examined.
The experimental report does not contain the set of perceptual data, as is the
case with most psycholinguistic experiments on cognitive metaphors. Thus, their
interpretation and authentication cannot be properly judged by the reader. This
circumstance – together with the absence of the stimulus material – decreases the
plausibility of the experimental data. On the other hand, Holtgraves and Ashley
revealed and checked several possible systematic errors with the help of control
experiments. This makes their results more plausible, because in this way the reli-
ability of their experiments increased. Nevertheless, the authentication can never
be complete. In this case, for example, the following problematic points can be
mentioned:
. Likewise, by ‘experimental data’ we mean plausible statements the plausibility value of
which stems from an experiment as a direct source.
Thought experiments and real experiments as converging data sources in pragmatics
From these considerations it is clear that the experimental data gained are
not true with certainty but only plausible on the basis of these experiments as
sources. Their confrontation with the hypotheses of the theory, with previous
conjectures, involves factors which indicate that experimental results are not
decisive. They cannot prove or falsify hypotheses but are capable of increasing
their plausibility or making them implausible as weak or strong evidence for/
against them. For example, the reconstruction of the argumentation structure
of Holtgraves and Ashley’s experimental report contains the following plausible
inference (cf. Section 4.2):
(37) 0 < |“If comprehension of a remark involves activation of its illocutionary
force, then, when the probe represents the speech act just performed,
participants’ ability to verify that the probe had not been literally present
should be slowed.” |HA < 1 (= (35))
0 < |When the probe represented the speech act just performed,
participants’ ability to verify that the probe had not been literally present
was slowed.|Exp1 < 1 (= (36))
0 < |Comprehension of a remark involves activation of its illocutionary
force.|I(37) < 1
. In Figure 11 and 12, simple arrows indicate successive stages of the experimental process;
dotted arrows signify the non-public argumentation process which organises the experi-
mental process.
András Kertész & Csilla Rákosi
Experimental
design;
Theoretical
model of the Experimental
Perceptual data
phenomena; procedure
Theoretical
model of the
apparatus
Plausible
Revision
argumentation
Confrontation
Authentication
of the
and
experimental Experimental
interpretation
data with the data
of perceptual
theory/rival
data
theories
Transformation
Experimental
report
. Thus, for example, theoretical physicists’ and experimental physicists’ knowledge of the
working of an experimental apparatus may differ in depth.
. Of course, the whole of the situation need not be typical – situations depicted in thought
experiments are often fantastic or unrealistic, but contain realistic and well-known elements.
Thought experiments and real experiments as converging data sources in pragmatics
s ituation have to be identified. That is, the experiential data have to be described with
the help of the conceptual apparatus of the theory. As a result, thought experimental
data24 are obtained. With Searle’s thought experiment, hypothesis (7) is, for instance,
a consequence of the situation analysed in (4)(a)-(c) which has been obtained on
the basis of the experiential datum (5)(a). Of course, our reconstruction of Searle’s
argumentation process could have interpreted (7) as a thought experimental datum
as well; i.e. if we analyse the relationship between (5)(a) and (7) not as a connection
between a premise and a conclusion of the plausible inference in (6), but deem (7)
to be the re-evaluation of (5)(a) with the help of the terminology of Gricean theory
of meaning, then (7) will originate from a direct source and could be regarded as a
datum. Similarly, (12) could also be interpreted as a thought experimental datum
which is the re-evaluation of the experiential datum (10).
Following this, in both real and thought experiments, experimental data are
confronted with the hypotheses of a theory or rival theories. If the results seem to
be reliable, then the real/thought experimental process is transformed into a real/
thought experimental report. Akin to real experimental reports, thought experi-
mental reports are informationally impoverished in comparison to the thought
experimental process conducted by the experimenter:
By the time the thought experiment is communicated it is in a polished form.
That is, we are not shown the tinkering that went into setting it up and refining it.
[…] When conveyed, a thought experiment is as packaged and polished as a real-
world experiment is when it is published. (Nersessian 1993: 296)
. In analogy to the notion’experimental data’, ‘thought experimental data’ should be un-
derstood as plausible statements the plausibility value of which stems from a thought experti-
ment as a direct source.
András Kertész & Csilla Rákosi
through the prism of its informationally reduced and edited version. Reproduc-
ibility is in both cases a very important means of judging the reliability of the out-
come of the experimental process: repetitions may reveal suppressed but relevant
factors which indicate that the results are less reliable or even unreliable.
In summary, the p-model interprets thought experiments by analogy with
real experiments, that is, as cyclic processes organised and conducted by a plausible
argumentation process.25 Both real and thought experiments are characterised by
a double argumentative structure: they are organised and guided by a non-public
argumentation process and they are transformed into a public argumentation pro-
cess (thought/real experimental report). See Figure 12.
Delineation of a
situation;
Theoretical
model of the
involved Imagination of Experiential
phenomena; the situation data
Construction of
a mental model
of the situation
Plausible
Revision
argumentation
Determination
Confrontation
of the theoretical
of the Thought consequences
experimental experimental of the events
data with the data occurring in the
theory/rival
depicted
theories
situation
Transformation
Thought experimental
report
. It is worth remarking that in Holtgraves and Ashley’s experimental report the sources
were used cumulatively, because the experiments presented a revised version of the previous
experiment. This need not always be so; in other experimental reports the sources may be
independent of each other and may independently increase or decrease the plausibility of a
certain hypothesis (see Kertész & Rákosi 2012).
Thought experiments and real experiments as converging data sources in pragmatics
The present chapter started with the remark that, while on the one hand, in con-
temporary linguistics there is a need to integrate different data sources, on the
other hand, the methodology which should govern such an integration is missing.
By obtaining (SP) with the help of the p-model, we have made an attempt to fill
this gap with respect to the relationship between thought experiments and real
experiments in pragmatics. Nevertheless, to reveal additional aspects of this rela-
tionship, numerous further case studies should be carried out.
Acknowledgements
Work on this chapter was supported by the MTA-DE Research Group for
Theoretical Linguistics, the project OTKA K 77823 as well as the TÁMOP-
4.2.2/B-10/1-2010-0024 project, which is co-financed by the European Union and
András Kertész & Csilla Rákosi
the European Social Fund. Csilla Rákosi’s work was also supported by the János
Bolyai Research Scholarship of the Hungarian Academy of Sciences.
References
Moue, Aspasia S., Masavetas, Kyriakos A. & Karayianni, Haido. 2006. Tracing the develop-
ments of thought experiments in the philosophy of natural sciences. Journal for the General
Philosophy of Science 37: 61–75.
Nersessian, Nancy J. 1993. In the theoretician’s laboratory: Thought experimenting as mental
modelling. In Proceedings of the 1992. Biennial Meeting of the Philosophy of Science Associa-
tion, Vol. 2, 291–301. East Landing MI: Philosophy of Science Association.
Nersessian, Nancy J. 1991. The cognitive sciences and the history of science. In Proceedings, joint
HSS-SHOT conference on Critical Problems and Research Frontiers in History of Science and
Technology, October 1991, 92–116. Madison WI.
Norton, John D. 1991. Thought experiments in Einstein’s work. In Thought Experiments in
Science and Philosophy, Tamara Horowitz & Gerard J. Massey (eds), 129–48. Savage MD:
Rowman and Littlefield.
Norton, John D. 2002. On thought experiments: Is there more to the argument? 〈http://philsci-
archive.pitt.edu/960/1/Norton.doc〉
Noveck, Ira A. & Sperber, Dan. 2007. The why and how of experimental pragmatics. In Pragmat-
ics, Noel Burton-Roberts (ed.), 184–212. Basingstoke: Palgrave Macmillan.
Noveck, Ira A. & Reboul, Anne. 2008. Experimental Pragmatics: a Gricean turn in the study of
language. Trends in Cognitive Sciences 12(11): 425–431.
Rákosi, Csilla. 2011a. Metatheoretical reconstruction of psycholinguistic experiments. Part 1.
Sprachtheorie und germanistische Linguistik 21(1): 55–93. (Csatár, Péter (ed.), Metapher,
Metonymie und Argumentation/Metaphor, Metonymy and Argumentation 1. Münster:
Nodus).
Rákosi, Csilla. 2011b. Metatheoretical reconstruction of psycholinguistic experiments. Part 2.
Sprachtheorie und germanistische Linguistik, 21.2 [= Csatár, Péter (ed), Metapher, Metony-
mie und Argumentation/Metaphor, Metonymy and Argumentation 2. Münster: Nodus],
159–187.
Rákosi, Csilla. 2012. The fabulous engine: Strengths and flaws of psycholinguistic experiments.
Language Sciences 34(6): 682–702. (= Kertész, András, Schwarz-Friesel, Monika & Consten,
Manfred (eds), Converging Data Types in Cognitive Linguistics. Amsterdam: Elsevier).
Schegloff, Emanuel A. 1991. Conversational analysis and socially shared cognition. In Per-
spectives on Socially Shared Cognition, Lauren B. Resnick, John M. Levine & Stephanie D.
Teasley (eds), 150–171. Washington DC: American Psychological Association.
Searle, John R. 1969. Speech Acts. An Essay in the Philosophy of Language. Cambridge: CUP.
Searle, John R. & Vanderveken, Daniel. 1985. Foundations of Illocutionary Logic. Cambridge:
CUP.
Walton, Douglas, Reed, Chris & Macagno, Fabrizio. 2008. Argumentation Schemes. Cambridge:
CUP.
Wittgenstein, Ludwig. 1958[1953]. Philosophical Investigations. Oxford: Blackwell.
chapter 10
Csilla Rákosi
MTA-DE Research Group for Theoretical Linguistics
This chapter provides a survey about the most frequent methods of inconsistency
resolution in Optimality Theory. With the help of the p-model, inconsistencies
in OT are divided into two main groups. The first group includes conflicts
that are deemed to be fatal and are solved usually by the modification of the
theory: namely, clashes between “linguistic data” (acceptability judgements) and
applications of the model (results of the evaluation procedure). The second group
consists of conflicts that are, in contrast, tolerable in the view of OT theorists:
inconsistencies between constraints and the best output candidates. The author’s
analyses result in the conclusion that there is no radical break between OT and
“standard” Generative Grammars with respect to the treatment of inconsistencies.
1. Introduction
Diana Archangeli characterises the development which the role and treatment of
inconsistencies has undergone in the history of generative linguistics as follows:
[…] the original idea of a generative grammar (as in Chomsky 1957) […] accounts
directly for (i.e. generates) all and only all the expressions of a given language
with no auxiliary devices to remove ungrammatical expressions. Because of the
enormous complexity of the grammar which results from trying to put that idea
into practice, many linguists chose to drop the only all proviso for the generative
mechanism (the technical description of this state of affairs is that the grammar
overgenerates), and to add devices, called filters, to eliminate the ungrammatical
expressions that the generative mechanism allows […]. This situation, in turn,
has been viewed as unsatisfactory: why have two components of the grammar
responsible for separating out the expressions which are ungrammatical in a
particular language? […] Optimality Theory was introduced in response to
this situation. Optimality Theory opts for the ‘ideal separator’: a very simple
Csilla Rákosi
The quotation suggests that Optimality Theory has brought a radical change in the
treatment of inconsistencies. The scope of this chapter does not allow for a system-
atic comprehension of the strategies applied in different stages of the development
of generative linguistics and OT. Instead, we will focus on the role and treatment
of inconsistencies in OT. Thus, we will raise the following problem:
(P) (a) Which are the basic types of inconsistency in Optimality Theory?
(b) Which strategies of inconsistency resolution are applied in OT?
Section 2 provides a brief overview of the basic ideas of OT. In Section 3, I will try
to provide a partial answer (P)(a) with the help of an instructive case study. Some
passages of Rene Kager’s argumentation in connection with the introduction of
two correspondence constraints will be presented which are quite simple but seem
to be especially illuminating concerning the emergence and resolution of incon-
sistencies in OT. Section 4 will focus on (P)(b) and analyse the strategies applied
by Kager in order to resolve the conflicts with the help of the p-model. Section 5
will summarise our findings.
In OT the relation between the input and output forms is established by two for-
mal mechanisms, GEN (Generator) and EVAL (Evaluator). GEN produces all pos-
sible output-candidates from the input (a compound of elements of the linguistic
vocabulary). EVAL has the task of choosing the best candidate with the help of the
constraint hierarchy of the given language. The constraints are supposed to belong
to the Universal Grammar but their ranking is language-specific. The constraints
are violable; all output candidates infringe at least one constraint. It is the weight
of the conflicts with the constraints that is different and decisive. This weight is
basically determined not by the number of constraint violations but the ranking
Data and the resolution of inconsistency in Optimality Theory
3. O
n (P)(a): Emergence of inconsistencies in Kager (1999)’s
argumentation
section aims at revealing the most frequent and basic types of inconsistency in OT,
whose treatment will be analysed in Section 4 with the help of the p-model.
After some general typological considerations, Kager (1999: 92ff.) character-
ises Axininca Campa as one of the languages which do not allow onset-less sylla-
bles. As the data in (2) show, consonant epenthesis is applied when two vowels are
adjacent in the input; in contrast, there is no epenthesis when the input contains
an onset in every syllable:
(2) a. In Axininca Campa, it is the output-form [noŋkomati] that belongs to
the input /no-N-koma-i/.
b. In Axininca Campa, it is the output-form [noɲčhiki] that belongs to the
input /no-N-čhik-i/ and not the form *[noɲčhikti].
From typological insights and data such as (2), Kager (1999: 93) concludes that the
following structural well-formedness constraint has to be supposed to be undomi-
nated by other constraints in Axininca Campa:
(3) Onset: Syllables must have an onset.
Several data are in conflict with Dep-IO in Axininca Campa, because they show
that in order to satisfy Onset, this language employs epenthesis. This means that
the output forms have a consonant inserted between two vowels and this conso-
nant does not have any input correspondence (cf. Kager 1999: 101):
(5) a. In Axininca Campa, it is the output-form [noŋkomati] that belongs to
the input /no-N-koma-i/.
b. In Axininca Campa, it is the output-form [noŋkomataati] that belongs
to the input /no-N-koma-aa-i/.
c. In Axininca Campa, it is the output-form [noŋkomatakoti] that be-
longs to the input /no-N-koma-ako-i/.
d. In Axininca Campa, it is the output-form [noŋkomatakotaatiro] that
belongs to the input /no-N-koma-ako-aa-i-ro/.
e. In Axininca Campa, it is the output-form [noɲčhiki] that belongs to the
input /no-N-čhik-i/.
Data and the resolution of inconsistency in Optimality Theory
Since the output-candidate [noŋkomati] violates only the lower ranked con-
straint Dep-IO, while [noŋkomai] infringes the higher ranked Onset, [noŋkomati]
counts as more optimal according to (1). This means that the conflict between Dep-
IO and the optimal output-form [noŋ.ko.ma.ti] is tolerated, while the inconsistency
between Onset and the output-candidate [noŋ.ko.ma.i] is deemed to be fatal.
Max-IO is supposed to be ranked above Dep-IO but having the same rank as
Onset:
(8) Axininca Campa is characterised by the constraint hierarchy
Onset, Max-IO >> Dep-IO.
Csilla Rákosi
While the infringement of the two higher ranked constraints Onset and
Max-IO is treated as fatal failure, the violation of Dep-IO by the optimal output-
form is t olerated. The reason for this decision is the circumstance that this tolera-
tion makes it possible to avoid the violation of higher ranked constraints.
He argues, however, that (10) would be problematic from two points of view. First,
in some languages all syllables must have onsets, that is, word-initial epenthesis
is not required. Thus, (10) cannot function as a linguistic universal. Second, the
left edge of words is “neglected” by a series of phonological changes. From this
Kager (1999: 110f.) concludes that raising the following alignment constraint is
well-motivated:
(11)
Align-L: The left edge of the Grammatical Word coincides with the left
edge of the Prosodic Word.
In order to capture the data in (9), Kager (1999: 111) suggests giving up (8) and
replacing it with an extended version of it:
Data and the resolution of inconsistency in Optimality Theory
Table 3. Application of the ranking Align-L, Max-IO >> Onset >> Dep-IO
/osampi/ Align-L Max-IO Onset Dep-IO
o.sam.pi *
to.sam.pi *! *
sam.pi *! *
OT tries to capture this process with the help of the following constraint hierarchy
(cf. Kager 1999: 377):
(18) Isthmus Nahuat is characterised by the constraint hierarchy
Final-C >> Max-IO,
where the dominating constraint is a well-formedness constraint:
(19) Final-C: Stem ends in a consonant.
The interaction of the two processes takes place as follows:
(20) In Isthmus Nahuat, it is the output form [ʃikɑkíl] that belongs to the
underlying form /ʃikɑkíli/.
The problem is, however, that by taking into consideration the rankings in (14) and
(18), the four relevant constraints cannot be ranked in such a way that they pro-
vide the correct result by the evaluation of the possible output-forms of /ʃikɑkíli/.
For example, the hierarchy
(21) Isthmus Nahuat is characterised by the constraint hierarchy
Final-C >> Max-IO, *Voiced-Coda >> Ident-IO (voice).
evaluates an ungrammatical form as optimal. See Table 4.
Table 4. Incorrect prediction produced by the constraint hierarchy Final-C >> Max-IO,
*Voiced-Coda >> Ident-IO (voice)
/ʃikɑkíli/ Final-C Max-IO *Voiced-Coda Ident-IO (voice)
ʃikɑkíli *!
ʃikɑkíl * *!
ʃikɑkíl̥ * *
In order to obtain the correct form as optimal output form, the ranking of
*Voiced-Coda and Ident-IO should be reversed:
(22) Isthmus Nahuat is characterised by the constraint hierarchy
Final-C >> Max-IO >> Ident-IO (voice) >> *Voiced-Coda.
This constraint ranking leads to the evaluation as shown in Table 5.
Table 5. Correct prediction produced by the constraint hierarchy Final-C >> Max-IO
>> Ident-IO (voice) >> *Voiced-Coda
/ʃikɑkíli/ Final-C Max-IO Ident-IO (voice) *Voiced-Coda
ʃikɑkíli *!
ʃikɑkíl * *
ʃikɑkíl̥ * *!
Data and the resolution of inconsistency in Optimality Theory
(22) would lead to the correct outcome but it is inconsistent with (14) which is
motivated by data such as (13). This situation is labelled by Kager (1999: 377) as
a “ranking paradox”. Kager mentions some attempts to grasp this problem and
others in connection with opacity. He is of the opinion that they cannot be solved
simply by extending the constraint set with some new elements because these
phenomena
[…] seem to motivate strongly abstract intermediate levels of representation
between input and output. OT recognizes no such intermediate levels, in line with
standard assumptions […]. In standard OT, the mapping from input (‘Lexical
Representation’) to output (‘Phonetic Representation’) is direct. Moreover, well-
formedness constraints state requirements on output forms only, ignoring the
input. This assumption is another cornerstone of OT, essential to explanation of
segment inventories, conspiracies, and the Duplication Problem […].
(Kager 1999: 376; emphasis as in the original)
4. O
n (P)(b): Analysis and evaluation of the strategies applied
by Kager (1999)
In this Section, I will reconstruct and evaluate the strategies of inconsistency reso-
lution which were applied by Kager (1999) with the help of the p-model.
4.1 T
reatment of Inconsistency No. 1: Joint application of the Contrastive
and the Combinative Strategies
In order to evaluate the treatment of inconsistencies by Kager, one has to recon-
struct the p-contexts in which the inconsistencies emerged. As we have seen in
Csilla Rákosi
Section 2.4 of Kertész & Rákosi (this volume),1 the p-context involves, among
other things, the statements which can be judged to be plausible on the basis of
some source, as well as these sources. First of all, we have to reveal and examine
the sources of the plausibility of the conflicting statements. In Section 3.1 we have
shown that Inconsistency No. 1 results from a conflict between the satisfaction of
the constraints Onset and Dep-IO.
Let us first examine the source of the plausibility of the hypothesis regard-
ing Onset and hypotheses related to it. Statements in (2) can be interpreted as
“linguistic data”, that is, as plausible statements which capture some characteristic
of Axininca Campa and whose plausibility value originates from a direct source
in the sense of Section 2.2 of Kertész & Rákosi (this volume). (23) shows the data
structure of the statements in (2):2
(23) a. |In Axininca Campa, it is the output-form [noŋkomati] that belongs to
the input /no-N-koma-i/.|K = 0.93
b. |In Axininca Campa, it is the output-form [noɲčhiki] that belongs to
the input /no-N-čhik-i/ not the form *[noɲčhikti].|K = 0.9
These two statements seem to be assigned a maximal or at least very high plau-
sibility value by Kager, since he never indicates that statements of this type could
turn out to be false and be in need of revision. The evaluation of the reliability of
these hypotheses is partly based on grammaticality judgments of native speakers.
Nevertheless, we should not forget that both statements contain linguistic analy-
ses, too. Therefore, their plausibility does not stem solely from the linguistic intu-
ition of native speakers but is influenced by the reliability of the linguistic analyses
conducted by Kager, too. Moreover, acceptability judgments cannot be regarded
as totally reliable sources (cf. Kertész & Rákosi 2012: 20f., 68, 170ff.). Therefore,
instead of maximal plausibility (truth with certainty), a very high plausibility value
seems to be more appropriate.
Onset, as introduced in (3), belongs to the structural well-formedness
constraints:
(3) Onset: Syllables must have an onset.
Its plausibility value originates from two sources (cf. Kager 1999: 93f.): it is
supported by typological considerations (onsets are allowed in every language,
and there are languages in which they are obligatory) and by phonetic reasons (the
articulation and perception of vowels is easier after a preceding consonant).
The “linguistic datum” in (23)(a) as well as the typological insight that in most
languages syllables usually start with a consonant serve as premises of the follow-
ing plausible inference:
(24) |In Axininca Campa, it is the output-form [noŋkomati] that belongs to the
input /no-N-koma-i/.|K = 0.9 (= (23)(a))
|In most languages syllables usually start with a consonant.|K = 0.9
|If it is the output-form [noŋkomati] that belongs to the input
/no-N-koma-i/ in Axininca Campa, and in most languages syllables usually
start with a consonant, then output-forms in which all syllables have an
onset are optimal while output-forms containing syllables without an onset
are not optimal in Axininca Campa.|K = 0.3
|If output-forms in which all syllables have an onset are optimal while
output-forms containing syllables without onset are not optimal in
Axininca Campa, then the constraint Onset is undominated in this
language.|K = 1
|The constraint Onset is undominated in Axininca Campa.|I(24) = 0.74
This inference belongs to the first type of plausible inferences (Kertész & Rákosi,
this volume: Section 2.3, 2012: 86ff.), because there is a logical consequence rela-
tion between its premises and conclusion but some premises are only plausible
instead of being true with certainty. The indices K and I(24) refer to the sources
Kager (1999) and this inference, respectively. The premises of the inference in (24)
make the hypothesis that Onset is undominated in this language highly plausible
but not totally certain. This means that its later revision cannot be ruled out but
requires strong counter-arguments.
As a second step, we turn to the other originator of Inconsistency No. 1, the
constraint Dep-IO:
. For the assignment of plausibility values to the conclusion of inferences, see Kertész &
Rákosi (2012: Section 9.5).
Csilla Rákosi
This means that Dep-IO has a high plausibility value in OT. It is, however, incom-
patible with data which show the application of epenthesis. The linguistic data in
(5)(a)–(h) belong to this group and are regarded as having a very high plausibility
value by Kager. From the joint presence of Dep-IO and (5)(a)–(h) in the p-context,
a series of p-inconsistencies5 emerge. For instance, (5)(a) is weak evidence against
Dep-IO, because it is one of the premises of a plausible inference the conclusion of
which is a special case of the negation of Dep-IO:
(25) |In Axininca Campa, it is the output-form [noŋkomati] that belongs to the
input /no-N-koma-i/.|K = 0.9 (= (23)(a))
|If in Axininca Campa, it is the output-form [noŋkomati] that belongs to
the input /no-N-koma-i/, then in the input form /no-N-koma-i/ an onset ‘t’
is supplied.|K = 0.9
|If in the input form /no-N-koma-i/ an onset ‘t’ is supplied, then the out-
put form [noŋkomati] contains a segment which does not have an input
correspondent.|K = 1
|The output form [noŋkomati] contains a segment which does not have an
input correspondent.|I(25) = 0.9
On the basis of similar plausible inferences relying on the data (5)(b)-(h), the fol-
lowing inductive generalisation can be achieved:
(26) There are output segments in Axininca Campa which do not have input
correspondents.
. P-inconsistency means that both members of an inconsistent pair of statements are plau-
sible on the basis of some (but not the same) source; see Kertész & Rákosi (this volume:
Section 2.5, 2012: 130).
Data and the resolution of inconsistency in Optimality Theory
Thought Experiment 1
Step 1: Simulation of functioning of GEN
The plausibility values of hypotheses about output-candidates are ignored and they
receive the same low initial plausibility value which conjectures usually obtain.6
This means in this case that output-candidates employing t-epenthesis as well as
output-candidates mirroring the input are examined as equally plausible possible
output-forms:
|In Axininca Campa, it is the output-form [noŋkomati] that belongs to the
input /no-N-koma-i/.|TE = 0.2
1
|In Axininca Campa, it is the output-form [noŋkomai] that belongs to the
input /no-N-koma-i/.|TE = 0.2
1
. That is, we “forget” the plausibility value of, for example, the statements in (2) and (5).
Csilla Rákosi
c onstraints. The ranking of the constraints is not yet taken into consideration at
this step. Two conflicts can be detected:
i. |onset: Syllables must have an onset.|TE = 0.8
1
vs.
|The last syllable of the output-form [noŋkomai] does not have an
onset.|TE = 0.9
1
ii. |dep-io: Output segments must have input correspondents.|TE = 0.8
1
vs.
|Segment ‘t’ in the last syllable of the output-form [noŋkomati] does not
have an input correspondent.|TE = 0.9
1
Figure 1 shows the current state of the p-context. It boils down to two p-context
versions which contain the two constraints as well as one of the rival hypotheses.
Onset Dep-IO
Figure 1. The p-context after the application of the Contrastive Strategy in
Thought Experiment 1
At his point, an attempt is made to apply the Contrastive Strategy in the sense
of Subsection 2.6.2 of Kertész & Rákosi (this volume)7 insofar as it is checked
whether one of the rival p-context versions can be deemed to be definitely better
than the other. This, however, does not lead to a satisfactory result (resolution of
the p-inconsistency), because both p-context versions are p-inconsistent.
. For a more detailed characterisation of the problem solving strategies, see also Kertész &
Rákosi (2012: Section 10.6).
Data and the resolution of inconsistency in Optimality Theory
(6) receives a moderate plausibility value which is higher than the plausibility of
the hypotheses about the possible output forms but lower than that of the con-
straints. A further important characteristic of the treatment of (6) is that it is inter-
preted in such a way that it states that in Axininca Campa, there are only these two
constraints; that is, the constraint hierarchy in this language is as described in (6).
The reason for this is the circumstance that constraint hierarchies cannot simply
be extended by adding new constraint(s) to them, because the new hierarchy can
evaluate certain output-forms differently from the old hierarchy.8 Thus, the addi-
tion of a new constraint has to be regarded as a rather serious modification of the
original ranking.
During the simulation of the decision procedure, the following plausible
hypotheses are obtained:
(28) a. Since Dep-IO is dominated by Onset according to (6), and the
output-candidate [noŋkomati] does not infringe Onset while its rival,
[noŋkomai] does, the former form violates the constraint hierarchy in
(6) to a lesser extent according to (1)(a).
b. Therefore, the output-candidate [noŋkomati] is more optimal in the
sense of (1)(b) than the output-candidate [noŋkomai].
c. Syllables must generally have an onset in Axininca Campa, but the
optimal output-candidate ([noŋkomati]) as a surface form may contain
onset-less syllables along the lines of (1)(c).9
d. Input segments must generally have output correspondents in
Axininca Campa, but the optimal output-candidate ([noŋkomati]) as
a surface form may contain output segments which do not have input
correspondents along the lines of (1)(c).
e. Since [noŋkomai] is not the optimal output form, therefore the
p-inconsistency between Onset and the hypothesis stating that the
output is [noŋkomai] cannot be tolerated but could be resolved by giv-
ing up the latter.
(28)(e) means that the p-context version around the hypothesis that the output is
[noŋkomai] remains p-inconsistent. As opposed to this, the rival p-context version
resolves the p-inconsistency with the help of the Combinative Strategy, that is, by
keeping both conflicting hypotheses systematically separated from each other (cf.
Kertész & Rákosi, this volume: Subsection 2.6.2, 2012: 158). Namely, (28)(c) and
(d) mean that this p-context version can be divided into two co-existing p-context
versions: into the p-context representing the final state of the evaluation process
conducted by EVAL on the one hand, and the surface representation on the other
hand. The former p-context includes the constraints as well as a hypothesis about
the ranking and optimality of the output-candidates. The latter involves a hypoth-
esis about the output form which appears at the surface but the constraints do
not form a part of it. The reason for this finding is that hypotheses capturing the
constraints cannot be regarded as linguistic rules without exceptions but rather
as tendencies which – at least in the case of highly ranked ones – are satisfied by
most linguistic forms within a given language. In this way it is possible to separate
the constraints and the surface forms in the case of the optimal output-candidate.
This strategy, however, does not function with output-candidates which are not
optimal, because they do not reach the surface representation but remain within a
p-context where the constraints are also present.
From this it follows that on the basis of the information taken into consider-
ation so far it is reasonable to give up the hypothesis that the output is [noŋkomai]
and the p-context version around it and retain its rival. Thus, the p-context
depicted in Figure 1 is transformed into the p-context in Figure 2.
Onset
Dep-IO
Onset >> Dep-IO
(1)
the output is [noŋkomati] the output is the output form is
[noŋkomai] [noŋkomati]
the form [noŋkomati] is
the most optimal form
Final state of the decision Surface
procedure by EVAL representation
the hypothesis about their ranking is given up and altered or new constraints are
proposed etc. In Thought Experiment 1, there is no p-inconsistency between the
hypothesis that it is the output-candidate [noŋkomati] which is the most optimal
form and the datum in (23)(a).
After conducting the thought experiment, the plausibility value of the hypoth-
eses formulating the two constraints in (3) and (4), as well as their ranking in (6)
increased, because inference (29) as an indirect source makes it possible to assing
a positive plausibility value to (6), too:
(29) |If Axininca Campa is characterised by the constraint hierarchy Onset >>
Dep-IO (= (6)), then the output-candidate [noŋkomati] is more optimal
than the output-candidate [noŋkomai].|TE = 1
1
[|If the output-candidate [noŋkomati] is more optimal than the output-
candidate [noŋkomai], then it is the output-candidate [noŋkomati] that is
the most optimal among the output-forms belonging to the input /no-N-
koma-i/.|TE = 0.3]
1
|If it is the output-candidate [noŋkomati] that is the most optimal among
the output-forms belonging to the input /no-N-koma-i/, then it is the
form [noŋkomati] that belongs to the input /no-N-koma-i/ at the surface
representation.|TE = 0.9
1
|It is the form [noŋkomati] that belongs to the input /no-N-koma-i/ at the
surface representation.|K = 0.9 (= (23)(a))
|Axininca Campa is characterised by the constraint hierarchy
Onset >> Dep-IO.|I(29) = 0.6 (= (6))
Thus, the plausibility of (6) on the basis of its direct source and (29) as an indirect
source together is higher than its initial value. To sum up, with the help of the
p-model, Kager’s treatment of the conflict between the optimal output-candidate
[noŋkomati] and the two constraints can be evaluated as follows:
(30) a. Step 2 of the thought experiment makes it possible to reveal
which constraints the single output-candidates violate. In this way,
p-inconsistencies between constraints and output-candidates are
identified.
b. These p-inconsistencies are divided into two groups. Conflicts between
non-optimal output-forms and constraints are not tolerated but they
are regarded as fatal. As a consequence, these output-forms are given
up and they are supposed to not reach the surface representation.
That is, with the help of the Contrastive Strategy, hypotheses about
these output-forms as well as the p-context versions around them are
rejected.
Csilla Rákosi
To sum up, the joint application of the Contrastive and the Combinative Strategies
resulted in a solution of the inconsistency between the hypothesis Dep-IO and lin-
guistic datum in (23)(a). Nevertheless, it would be premature to claim that this is
the resolution of this p-inconsistency (cf. Kertész & Rákosi this volume: Section 2.5,
2012: 133), since, for example, the above thought experiment took into consideration
only two output-candidates associated with an input form. There are, however, fur-
ther possible forms as well. That is, the proposed solution is not satisfactory, since it
does not take into consideration all information available. Therefore, the emergence
of further inconsistencies cannot be ruled out. As we have seen in Section 3.2, the
extension of the p-context by further available data leads in fact to new conflicts.
4.2 T
reatment of Inconsistency No. 2: Modification of the hypothesis about
the constraint hierarchy
As I have already mentioned, the weak point of inference (29) is its first premise
and latent background assumption. On the basis of the comparison of only two
possible output-candidates it is stated that it is [noŋkomati] which is the most
optimal form among the output-forms belonging to the input /no-N-koma-i/.
Thus, the thought experiment made use of only a small subset of data inasmuch
as only two output-candidates were taken into consideration by the simulation of
the evaluation process. It is therefore possible that on the basis of a wider range
of data, this hypothesis loses its plausibility, and, as a consequence, the conclusion
can no longer be regarded as plausible, either.
Data and the resolution of inconsistency in Optimality Theory
Thought Experiment 2
Step 1: Simulation of the functioning of GEN
A third datum is added to those in Thought Experiment 1, which receives the
same low initial plausibility value as the other two:
|In Axininca Campa, it is the output-form [noŋkoma] that belongs to the
input /no-N-koma-i/.|TE = 0.2
2
Onset Dep-IO
p-context version3
central hypothesis:
The output is [noŋkoma]
Figure 3. The p-context after the application of the Contrastive Strategy in
Thought Experiment 2
Onset
Dep-IO the output is
[noŋkomati] the output form is
Onset >> Dep-IO
[noŋkoma]
(1)
[noŋkoma] is the
most optimal form the output is
[noŋkomai] (2)
the output is [noŋkoma]
This means that Thought Experiment 2 as a source makes the hypothesis that
Axininca Campa is characterised by the constraint hierarchy Onset >> Dep-IO
implausible:
(31) |If Axininca Campa is characterised by the constraint hierarchy Onset >>
Dep-IO (= (6)), then the output-candidate [noŋkoma] is more optimal than
the output-candidates [noŋkomati] or [noŋkomai].|TE = 1
2
[|If the output-candidate [noŋkoma] is more optimal than the output-
candidates [noŋkomati] or [noŋkomai], then it is the output-candidate
[noŋkoma] that is the most optimal among the output-forms belonging to
the input /no-N-koma-i/.|TE = 0.4]
2
|If it is the output-candidate [noŋkoma] that is the most optimal among
the output-forms belonging to the input /no-N-koma-i/, then it is the
form [noŋkoma] that belongs to the input /no-N-koma-i/ at the surface
representation.|TE = 0.9
2
|~ It is the output-form [noŋkoma] that belongs to the input /no-N-koma-i/
at the surface representation.|K = 0.9
|~ Axininca Campa is characterised by the constraint hierarchy Onset >>
Dep-IO.|I(31) = 0.8 (= ~ (6))
Data and the resolution of inconsistency in Optimality Theory
In contrast to (29), the first premise and the latent background assumption of
(31) take into consideration three possible output-candidates. The problem is,
however, that the evaluation should not result in our judging [noŋkoma] more
optimal than [noŋkomati]. Therefore, Kager introduces a third constraint that
forbids the deletion of input segments. That is, he modifies the hypothesis about
the supposed constraint hierarchy in Axininca Campa in such a way that it also
contains a constraint which is in conflict with the output-form which has led to
p-inconsistency. Thus, the p-context is extended by (7), further (6) is given up
and replaced by (8):
(7) Max-IO: Input segments must have output correspondents.
(8) Axininca Campa is characterised by the constraint hierarchy
Onset, Max-IO >> Dep-IO
In order to come to a decision between the p-context version built around (6) and
a new solution based on (8), the latter has first to be elaborated. Therefore, the ten-
ability of this constellation is tested again with the help of a thought experiment.
Thought Experiment 3
Step 1: Simulation of the functioning of GEN
The set of “linguistic data” is the same as it was in Step 1 of the previous thought
experiment, because the range of output-candidates has not changed.
Step 2: Simulation of the functioning of EVAL 1: Confronting constraints and
output-candidates
The set of constraints has been extended by Max-IO, which receives the same high
initial plausibility value as Onset and Dep-IO.
Figure 5 shows the result of the confrontation of the extended constraint set
and the output-candidates.
p-context version3
central hypothesis:
The output is [noŋkoma]
Figure 5. The p-context after the application of the Contrastive Strategy in
Thought Experiment 3
Csilla Rákosi
All three rival p-context versions are p-inconsistent because all possible output-
forms violate a constraint.
Step 3: Simulation of the functioning of EVAL 2: The decision procedure
When we also take into consideration (8), that is, the hierarchy of the three
constraints as well, then we obtain that the output-candidate noŋkomati is more
optimal than its two rivals according to (1). Therefore, it is reasonable to give
up the p-context-versions built around the less optimal forms and keep only
p-context version No. 2 by making use of the Contrastive Strategy. Then, the
Exclusive Strategy (cf. Kertész & Rákosi this volume: Subsection 2.6.2, 2012: 158)
has to be applied in order to check whether we obtain a satisfactory solution of
the starting problem. As we have seen, the best p-context-version is p-inconsis-
tent. Therefore, the Combinative Strategy is used in order to prevent a conflict
between the constraints and the surface form as shown in Figure 6.
Onset, Max-IO
Dep-IO the output is
[noŋkoma] the output form is
Onset, Max-IO >> Dep-IO
[noŋkomati]
(1)
[noŋkomati] is the
most optimal form
the output is
the output is [noŋkomati] [noŋkomai]
From these considerations it follows that (8) – that is, the hypothesis stating that
Axininca Campa has three constraints of which Dep-IO has a lower rank than
the other two – should be preferred over (6) on the basis of all the data contained
by the p-context. This means that the p-inconsistency caused by the failure of the
constraint ranking in (6) could be solved with the help of the modification of the
hypothesis about the constraint hierarchy. Nevertheless, the indirect source in (32)
makes (8) (and, of course, (7)) only plausible but not true with certainty. One of its
deficiencies is the circumstance that the first premise and the latent background
assumption relate to only three possible output-forms and to only one input form
of Axininca Campa.
4.3 T
reatment of Inconsistency No. 3: Modification of the hypothesis
regarding the constraint hierarchy
Inference (32) uses the outcome of Thought Experiment 3 as weak evidence for
the current version of the hypothesised constraint hierarchy in Axininca Campa.10
It is an inference belonging to the third type of plausible inferences (cf. Kertész &
Rákosi this volume: Section 2.3, 2012: Subsection 9.5.4): it is not deductively valid
and most of its premises are only plausible but not true with certainty. It concludes
from the plausibility of a necessary condition of (8) – namely, that the evaluation
procedure based on (8) chooses the output-form belonging to a certain input form
correctly – that this hypothesis regarding the constraint hierarchy in Axininca
Campa is plausible. Nevertheless, Kager’s argumentation presented in Section 3.3
. For the concept of weak evidence, see Kertész & Rákosi (this volume: Section 3.2,
2012: 178f.).
Csilla Rákosi
shows that if further input forms are also taken into consideration, p-inconsistency
emerges again.
Thought Experiment 4
Step 1: Simulation of the functioning of GEN
This thought experiment aims at simulating the application of (8) to possible
output-forms related to the input form /osampi/. Therefore, the following three
rival hypotheses are assigned the same low initial plausibility value (cf. (9)(a) in
Section 3.3):
p-context version3
central hypothesis:
The output is [sampi]
Figure 7. The p-context after the application of the Contrastive Strategy in
Thought Experiment 4
the former is the most optimal form among the three rivals examined accord-
ing to (1).
Onset, Max-IO
Dep-IO the output is
[osampi] the output form is
Onset, Max-IO >> Dep-IO
[tosampi]
(1)
[tosampi] is the
most optimal form the output is
the output is [tosampi] [sampi]
(9)(a)
Final state of the decision Surface
procedure by EVAL representation
We are in the same situation as with (23)(a) and the output-form [noŋkoma]
in Section 4.2: the p-context version corresponding to the surface representation
is p-inconsistent.
Therefore, as (32) shows, Thought Experiment 4 can be regarded as weak
evidence11 against the hypothesis that Axininca Campa is characterised by the
constraint hierarchy Onset, Max-IO >> Dep-IO:
. For this concept, see in Kertész & Rákosi (this volume: Section 3.2, 2012: 178f.).
Csilla Rákosi
|If it is the output-candidate [tosampi] that is the most optimal among the
output-forms belonging to the input /osampi/, then it is the form [tosampi]
that belongs to the input /osampi/ at the surface representation.|TE = 0.9
4
|~ It is the output-form [tosampi] that belongs to the input /osampi/ at the
surface representation.|K = 0.9 (cf. (9)(a))
|~ Axininca Campa is characterised by the constraint hierarchy Onset,
Max-IO >> Dep-IO.|I(32) = 0.8 (= ~ (8))
In contrast to this finding, as the inference (31) in Section 4.2 indicates, Thought
Experiment 3 provides weak evidence for (8). As a result, a situation emerges
where one thought experiment provides evidence for the hypothesis that the
constraint hierarchy Onset, Max-IO >> Dep-IO is at work in Axininca Campa,
while another offers evidence against it. This p-inconsistency emerged because
the two thought experiments relied on different “linguistic data”. Since the
constraint hierarchy should be applicable to all input forms of this language, the
supposed constraint hierarchy should be in conflict with neither data set. In order
to overcome this difficulty, Kager proposes giving up (8). That is, the Contras-
tive Strategy is applied again in order to resolve the conflict between a proposed
constraint hierarchy and the broadened set of “linguistic data”. Accordingly, the
p-context is extended by (11), that is, by a hypothesis of a new constraint, and (8)
is replaced by (12):
(11)
Align-L: The left edge of the Grammatical Word coincides with the left
edge of the Prosodic Word.
Thought Experiment 5
p-context version3
central hypothesis:
The output is [sampi]
Figure 9. The p-context after the application of the Contrastive Strategy
in Thought Experiment 5
Align-L, Max-IO
Onset the output is
Dep-IO [tosampi]
Align-L, Max-IO >> the output form is
Onset >> Dep-IO [osampi]
(1)
[osampi] is the
most optimal form the output is
(9)(a)
the output is [osampi] [sampi]
The caveat that Thought Experiment 5 made (12) only more plausible but did not
prove its truth is substantial. This indirect source is not perfectly reliable because,
among others, it relies on a limited data set, too, insofar as it confronts the proposed
constraint ranking solely with output-candidates relating to one input form.
4.4 T
reatment of Inconsistency No. 4: Necessity for the substantial revision
of the theory
The starting p-context contains, among others, the following “linguistic data”:
|In Isthmus Nahuat, it is the output form [tajoːl ]̥ that belongs to the
underlying form /tajoːl/.|K = 0.9 (= (13))
|In Isthmus Nahuat, it is the output form [tam] that belongs to the
underlying form /tami/.|K = 0.9 (= (17))
|In Isthmus Nahuat, it is the output form [ʃikɑkíl] that belongs to the
underlying form /ʃikɑkíli/.|K = 0.9 (= (20))
Datum (13) is weak evidence for hypothesis (14) regarding the ranking of the
constraints *Voiced-Coda and Ident-IO (voice):
(34) |In Isthmus Nahuat, it is the output form [tajoːl ̥] that belongs to the
underlying form /tajoːl/.|K = 0.9 (= (13))
Data and the resolution of inconsistency in Optimality Theory
|If it is the output form [tajoːl ]̥ that belongs to the underlying form /tajoːl/,
then Isthmus Nahuat is characterised by the constraint hierarchy
*Voiced-Coda >> Ident-IO (voice).|K = 0.3
|Isthmus Nahuat is characterised by the constraint hierarchy
*Voiced-Coda >> Ident-IO (voice).|I(34) = 0.6 (= (14))
Similarly, datum (17) is weak evidence for the constraint hierarchy Final-C >>
Max-IO in (18):
(35) |In Isthmus Nahuat, it is the output form [tam] that belongs to the
underlying form /tami/.|K = 0.9 (= (17))
|If it is the output form [tam] that belongs to the underlying form /tami/
in Isthmus Nahuat, then Isthmus Nahuat is characterised by the constraint
hierarchy Final-C >> Max-IO.|K = 0.3
|Isthmus Nahuat is characterised by the constraint hierarchy
Final-C >> Max-IO.|I(35) = 0.6 (= (18))
Nevertheless, both inferences (34) and (35) take into consideration only a small
subset of the available data. It is also clear that neither hypothesis (14) nor (18)
are capable of providing an explanation of the data that can be explained by the
other hypothesis. Therefore, they have to be replaced by a hypothesis which unifies
the two constraint hierarchies. In Section 3.4, Kager (1999) was cited who states
that the four relevant constraints cannot be ranked in such a way that the evalua-
tion procedure relying on them would produce the correct outcome. Kager (1999)
illustrates this claim with two thought experiments, based on the rankings in (21)
and (22), respectively:
Thought Experiment 6
Step 1: Simulation of the functioning of GEN
At the beginning of the thought experiment, the following statements are assigned
a low initial plausibility value:
|In Isthmus Nahuat, it is the output form [tam] that belongs to the
underlying form /tami/.|TE = 0.2
6
|In Isthmus Nahuat, it is the output form [tami] that belongs to the
underlying form /tami/.|TE = 0.2
6
Csilla Rákosi
|In Isthmus Nahuat, it is the output form [tajoːl ] that belongs to the
underlying form /tajoːl/.|TE = 0.2
6
|In Isthmus Nahuat, it is the output form [tajoːl] that belongs to the
underlying form /tajoːl/.|TE = 0.2
6
|In Isthmus Nahuat, it is the output form [ʃikɑkíli] that belongs to the
underlying form /ʃikɑkíli/.|TE = 0.2
6
|In Isthmus Nahuat, it is the output form [ʃikɑkíl] that belongs to the
underlying form /ʃikɑkíli/.|TE = 0.2
6
|In Isthmus Nahuat, it is the output form [ʃikɑkíl ̥] that belongs to the
underlying form /ʃikɑkíli/.|TE = 0.2
6
p-context version3
central hypothesis:
The output is [ʃikkíll]
Figure 11a. The p-context after the application of the Contrastive Strategy in
Thought Experiment 6 I
Figure 11b summarises the conflicts between the constraints and possible
output-forms related to the input form /tami/. Figure 11c displays the conflicts
between the constraints and possible output-forms related to the input form
/tajoːl/.
In all three cases, every p-context version is p-inconsistent because they are in
conflict with at least one constraint.
Data and the resolution of inconsistency in Optimality Theory
p-context version1
central hypothesis:
The output is [tam]
p-context version2
central hypothesis:
The output is [tami]
Figure 11b. The p-context after the application of the Contrastive Strategy in
Thought Experiment 6 II
p-context version1
central hypothesis:
The output is [tajoll]
p-context version2
central hypothesis:
The output is [tajol]
Figure 11c. The p-context after the application of the Contrastive Strategy in
Thought Experiment 6 III
Final-C, Max-IO
*Voiced-Coda
the output is [ʃikkíl]
Ident-IO (voice)
the output is [tami]
Final-C >> Max-IO, the output forms are
*Voiced-Coda >> Ident-IO (v) [ʃikkíll], [tam] and [tajoll]
(1)
[ʃikkíll], [tam] and [tajoll] are
the most optimal forms
the output is [ʃikkíli]
the output are [ʃikkíll], the output is [tajol]
[tam] and [tajoll] (20)
Thought Experiment 7
p-context version3
central hypothesis:
The output is [ʃikkíll]
Figure 13a. The p-context after the application of the Contrastive Strategy in
Thought Experiment 7 I
p-context version1
central hypothesis:
The output is [tam]
p-context version2
central hypothesis:
The output is [tami]
Figure 13b. The p-context after the application of the Contrastive Strategy in
Thought Experiment 7 II
Csilla Rákosi
p-context version1
central hypothesis:
The output is [tajoll]
p-context version2
central hypothesis:
The output is [tajol]
Figure 13c. The p-context after the application of the Contrastive Strategy in
Thought Experiment 7 III
Final-C, Max-IO
*Voiced-Coda
the output is [ʃikkíll] the output forms
Ident-IO (voice)
the output is [tami] at the surface are
Final-C >> Max-IO,
[ʃikkíl], [tam] and [tajol]
*Voiced-Coda >> Ident-IO (v)
(1)
[ʃikkíl], [tam] and [tajol] are
the most optimal forms
the output is [ʃikkíli]
the outputs are [ʃikkíl], (13)
the output is [tajoll]
[tam] and [tajoll]
From this it follows that Thought Experiment 7 yields weak evidence against the
proposed constraint hierarchy:
(37) |If Isthmus Nahuat is characterised by the constraint hierarchy Final-C >>
Max-IO >> Ident-IO (voice) >> *Voiced-Coda (= (22)), then the
output-candidate [tajoːl] is more optimal than the output-candidate
[tajoːl ̥]|TE = 1
7
[|If the output-candidate [tajoːl] is more optimal than the output-candidate
[tajoːl ̥], then it is the output-candidate [tajoːl] that is the most optimal
among the output-forms belonging to the input /tajoːl/.|TE = 0.3]
7
|If it is the output-candidate [tajoːl] that is the most optimal among the
output-forms belonging to the input /tajoːl/, then it is the form [tajoːl] that
belongs to the input /tajoːl/ at the surface representation.|TE = 0.9
7
|~ It is the output-form [tajoːl] that belongs to the input /tajoːl/ at the
surface representation.|K = 0.9 (cf. (13))
|~ Isthmus Nahuat is characterised by the constraint hierarchy Final-C >>
Max-IO >> Ident-IO (voice) >> *Voiced-Coda.|I(37) = 0.7
At this point Kager (1999: 377) comes to the conclusion that there is no resolution
of this problem. There is no order of the four relevant constraints that makes pos-
sible an evaluation which is consistent with the “linguistic data”. The main concern
is that the strategy of modifying the hypothesis regarding the constraint hierarchy
and extending it with new elements – which was applied successfully in the ear-
lier cases – is useless in this situation, because constraints work on output forms,
while the difference between (13) and (20) might require the involvement of fur-
ther representation levels, direct access to the input forms, or other types of major
modifications of the theory.
5. Conclusions
Very important lessons can be drawn from the above solution to (P)(b). First,
there is a connection among the different types of inconsistencies in OT: their
emergence can be traced back to similar causes (overgeneralisation) and they
can in most cases be solved with the help of similar tools, namely, with the joint
Data and the resolution of inconsistency in Optimality Theory
Acknowledgements
Work on the present chapter was supported by the MTA-DE Research Group for
Theoretical Linguistics, by the project OTKA K 77823, the TÁMOP-4.2.2/B-10
/1-2010-0024 project, which is co-financed by the European Union and the
European Social Fund, as well as by the János Bolyai Research Scholarship of the
Hungarian Academy of Sciences.
References
Conclusions
In Chapter 1, Section 2, we mentioned five key features of the p-model that may
be decisive for a workable metascientific solution to the problem of linguistic data
and evidence. Accordingly, the task of this concluding chapter is to highlight some
aspects of the case studies that seem to be especially instructive with respect to
these features, without, however, striving for a comprehensive summary of their
findings. We will put special emphasis on the question of whether, and if so, in
what way and to what extent, the application of the p-model can make linguistic
problem solving more effective.
The acknowledgement of the uncertainty of data played a key role in all case stud-
ies. The conscious reflection on the plausibility of the data meant above all that the
sources of the plausibility value of the data were revealed and in most cases, their
reliability was re-evaluated. A second aspect of this issue was that the authors were
aware of the circumstance that they can build only plausible inferences on the data,
which were plausible but not certainly true, whenever they elaborated on their own
approaches. As a result of this attitude, they were more open to possible counter-
arguments and more critical against the weak points of their own theories.
The significance of the reconstruction of the data structure is best exemplified
in Chapter 5, Section 2 by Enikő Németh T. She has shown that a careful analysis of
the data structure by means of the p-model may provide convincing arguments for
the rejection of theories relying on the data types at issue. First, the re-evaluation
of the reliability of the sources supporting the data of previous theories revealed
that a subset of the data set lost their data status because their sources turned out
András Kertész & Csilla Rákosi
to be unreliable or at least not reliable enough. Second, in certain cases the exami-
nation of the data structure witnessed that the basic concepts of the theory were
obscure and could not be applied to the description of the relevant data. She also
presented an instructive example of how fruitful it is to rely on as many data types
as possible because this strategy considerably reduced the uncertainty and the fal-
libility of the individual data types (such as the linguist’s own linguistic intuition)
and led to more plausible findings.
Similarly to the use of the concept of ‘plausible statement’, the application of the
concept of ‘plausible inference’ resulted in a twofold benefit. First, as a tool of
metascientific analysis of linguistic argumentation, the reliance on plausible infer-
ences made it possible to reconstruct the inferences used in linguistic theories. The
most significant result was in this respect that a considerable amount of inferences
in linguistic theories is deductively invalid but they can be interpreted as plausible
inferences capable of providing a partial support to their conclusions. Accord-
ingly, in several cases the analyses led to the conclusion that the hypotheses of the
investigated theories were neither proven nor falsified by linguistic evidence but
they were made plausible or implausible to some extent. Moreover, the examina-
tion of the structure of the plausible inferences shed light on the latent background
assumptions that often turned out to be implausible on closer examination.
A second gain of the use of this notion appeared at the level of object-scientific
argumentation. That is, the authors of the volume presented and scrutinized the
plausible inferences they made use of when proposing a new approach to the
investigated topic.
In this respect, Zoltán Vecsey’s contribution in Chapter 8 may be especially
instructive. Although, as we mentioned in Chapter 2, plausible inferences are of
informal nature, he has shown the usefulness of the p-model by its application to a
subfield of linguistics in which formal methods are preferred. Namely, he showed
how plausibility considerations enable us to solve a problem in formal semantics
that could not be solved with the help of logic alone.
The idea of the cyclic and prismatic nature of linguistic theorising seems to be
highly effective in linguistic problem solving because it impelled the authors of the
case studies to more thorough, more comprehensive investigations. If one does
Conclusions
not think that linguistic theorising is a linear process but supposes that it is cyclic,
then it is more likely and completely natural that one turns back to the investigated
problems many times from different points of view and considers many possible
solutions.
For example, Katalin Nagy C.’s contribution in Chapter 4 exemplifies the effec-
tiveness of the re-evaluation of the argumentation applied by rival theories. She
re-evaluated several components of the investigated theories: the reliability of the
sources supporting their data, the plausibility of their hypotheses, the acceptability
of the applied methods, the consistency between data and hypotheses. Since she
focused on different components of the p-context, her argumentation was pris-
matic: the revision of the data set or the implementation of methods that are well-
established but were not applied to this research topic shed new light again and
again on the investigated phenomena. The cyclic re-evaluation also comprised the
introduction of further data types and methods into the p-context.
Helga Vanda Koczogh applied in Chapter 6 the techniques of plausible argu-
mentation in the early phase of the linguistic theorising process. Namely, she tried to
solve a problem that is more basic than p-inconsistency or p-incompleteness, which
are related to the plausibility values of hypotheses. She showed how the category
system of a theory can be developed with the help of a cyclic and prismatic argu-
mentation process. She continuously re-evaluated the proposed categorisations and
tried to elaborate a more refined, more clearly-shaped and more comprehensive sys-
tem. To this end, she applied several problem-solving strategies that are similar to
those connected to the resolution of p-inconsistency. For example, the application of
the Diagnostic Perspective may mean in this context that one examines the linguis-
tic phenomenon at issue, seeking for instances whose treatment lead to problems
with the existing category systems. The Contrastive Strategy/Perspective compares
rival category systems in order to come to a decision among them. The Combinative
Strategy/Perspective, in contrast, aims at the integration of several category systems.
This goes beyond putting them simply together. Rather, it requires the selection of
their useable elements and the construction of a novel, more comprehensive model
from them. The evaluative perspective means that the rival category systems are
compared with each other on the basis of a series of criteria.
The p-model provides a differentiated set of tools that facilitate the treatment of
inconsistencies, allowing for the rejection, the temporal or permanent tolerance of
contradictions. The authors of the volume revealed several instantiations of every
strategy and witnessed a high variety of their applications.
András Kertész & Csilla Rákosi
Acknowledgements
Work on the present chapter was supported by the MTA-DE Research Group
for Theoretical Linguistics, the project OTKA K 77823, the TÁMOP-4.2.2/B-10
/1-2010-0024 project, which is co-financed by the European Union and the E
uropean
Social Fund, as well as the János Bolyai Research Scholarship of the Hungarian
Academy of Sciences.
Author index
indirect 20, 22, 25–27, 29, 195, 259–260, 264–265, time parameter see parameter,
41, 90, 231, 233, 287, 280, 310 time
293, 298 strong evidence see evidence, token frequency see frequency,
metalinguistic 74 strong token
typological 74 subject argument type frequency see frequency,
Spanish 76–77, 92 position 107–108, 116, type
medieval 76–77 119, 128–129
standard view subjectless verbs 104–116, U
of linguistic data and 119, 128 universal
evidence 2–5, 44, 223 suitable sources see source, linguistic 88, 96, 276, 306
of the analytical philosophy suitable tendencies 73, 88,
of science 2–3, 16, 225 syntactic alternation see 95–97, 99
statement alternation, syntactic
implausible 17–18, 20, V
22–23, 26–27, 29–30, T verbal disagreement 133–134,
38, 41, 54, 85, 93, 98, thought experiment 221–229, 141, 176
111–112, 114, 228, 238, 235–238, 241–243, verbal particle see particle,
248, 261, 290, 310 254–259, 263–267, verbal
of neutral plausibility 18–19, 283–284, 286–306, 313 *VOICED-CODA 277–278,
22–23, 25, 28, 30, 38, thought experimental data 298–299,
206, 248 see data, thought 302, 305
plausible 17, 21–22, 29, experimental
32, 38, 40–41, 46, thought experimental W
53–56, 67–68, 73–74, report 227–229, 255, weak evidence see evidence,
78, 85, 106, 110, 112, 258, 263, 265, 313 weak