Professional Documents
Culture Documents
Conceptualframework
Conceptualframework
net/publication/249933408
CITATIONS READS
67 1,287
1 author:
Masayoshi Shibatani
Rice University
57 PUBLICATIONS 2,738 CITATIONS
SEE PROFILE
All content following this page was uploaded by Masayoshi Shibatani on 01 August 2017.
Abstract
1. Introduction
avoid raising the issue at all. Various definitions currently o¤ered are of
little use, as they are typically based on an Indo-European active/passive
opposition, and arbitrarily include or exclude a particular phenomenon
from the domain of voice.1
Properly identifying construction types representing a voice sub-
domain is also a serious problem. In Crystal’s (2003) definition (cf. Note
1) reflexives are not recognized as proper voice constructions and their
relationship to the middle voice is not entirely clear. A similar problem
is seen in Kemmer’s (1993) extensive study of middle voice constructions.
There are also severe limitations at the level of explanation. Closer to
the main theme of this volume is the problem of understanding the in-
creases and decreases in valency and accompanying changes in argument
structure observed in voice phenomena. Why do certain phenomena (e.g.
the causative and applicative) show an increase in valency, while others
(e.g. the passive and antipassive) typically have a valency-reducing e¤ect?
What motivates these valency changes in opposite directions?
Functional explanations regarding the distribution of certain voice
constructions go a long way toward an explanatory functional study of
grammatical phenomena (cf. Haiman 1985). Being largely based on
formal properties such as ‘‘linguistic distance’’ and ‘‘full’’ vs. ‘‘reduced
form,’’ these explanations are not functional enough to be able to make
more general predictions.2
The problems outlined above largely stem from two related method-
ological issues. One is the lack of a coherent conceptual framework for
characterizing and analyzing voice phenomena; the other is an over-
reliance on formal properties in both analysis and explanation. Clearly
the latter problem is caused by the former and by the lack of commitment
to the cognition-to-form approach in linguistic analysis.3 The purpose of
this article is thus to lay out a conceptual framework that coherently de-
lineates the domain of voice, which embraces both those phenomena that
are traditionally recognized as falling in the voice domain and those that
have been kept in limbo. The framework required must deal with the fact
that many voice phenomena straddle the semantics-pragmatics boundary,
although the active/middle opposition is basically conceptual or semantic,
and the active/passive opposition is largely pragmatic. We endeavor to
unify these manifestations of voice function by assuming that the prag-
matic relevance of clausal units is semantically determined in the first place.
The conceptual foundations of voice can only be arrived at by inspect-
ing contrasting phenomena across languages. Our initial task is therefore
to learn how a given language, using its own resources, achieves the goal
of expressing a relevant conceptual opposition found in another language.
While the ultimate goal of functional typology is to discover the correla-
Voice phenomena 219
tive patterns between form and function, this article is concerned primarily
with the initial task of postulating conceptual bases of voice phenomena
and identifying constructions across languages that express the relevant
oppositions.
One final introductory remark is due regarding the controversy over
the question of whether the formal relationships between opposing voice
categories should be treated as inflectional or derivational. We consider
this question to be academic in the absence of rigorous definitions for
these processes. In the realm of voice phenomena, some systems, for ex-
ample, the Ancient Greek active/middle system, incorporate voice mor-
phology in their inflectional paradigm. Others like the English active/
passive opposition do not show a simple morphological relationship —
inflectional or derivational — since constructions as a whole enter into
the formal opposition. The regularity or productivity of the pattern is
often taken to be an important criterion distinguishing inflections from
derivations; the former are thought to be regular and obligatory, while
the latter allow exceptions. But regularity in natural language is always
relative, and so are the patterns of voice oppositions. Even among the
known ones, nothing is one hundred percent regular. An alternation that
is well-integrated within the inflectional paradigm may show irregularity.
In Ancient Greek, for example, we find both active forms that do not
have middle counterparts (activa tantum) and middle forms lacking the
corresponding active (media tantum). The active/passive opposition also
shows a high degree of regularity, without ever being one hundred per-
cent (as in the case of English), others place much severer limitations on
the range of permissible passive constructions.
The basic claim of this article is that major voice phenomena have their
conceptual bases rooted in the human cognition of actions. Because such
actions have various e¤ects upon us, we have special interest in the way
that they arise, how they develop, and the manner in which they terminate
— what is referred to as the evolutionary properties or phases of an
action in this article. Through a system of grammatical oppositions, a
language provides a means for expressing conceptual contrasts pertaining
to the evolutionary properties of an action that the speaker finds relevant
for communicative purposes. Among the evolutionary properties, voice is
primarily concerned with the way event participants are involved in ac-
tions, and with the communicative value, or discourse relevance pertain-
ing to the event participants from the nature of this involvement.
220 M. Shibatani
The first opposition to be examined has to do with the nature of the ori-
gin of an action — namely, whether the action in question is brought
about volitionally or nonvolitionally by a human agent.
Volitional/spontaneous opposition:
Is the action brought about volitionally?
Yes ! volitional
No ! spontaneous
Modern Japanese
(2) a. Boku-wa yoku mukasi-no-koto-o
I-TOP often old days-GEN-things-ACC
omo-u. (volitional)
think-PRES
‘I often think about the things of the old days.’
b. Saikin mukasi-no-koto-ga yoku
recently old days-GEN-things-NOM often
omowa-re-ru. (spontaneous)
think-SPON-PRES
‘Recently the things of the old days often come to mind.’
Indonesian
(3) a. Ali memukul anak-nya. (volitional)
Ali AF.hit child-3SG.POSS
‘Ali hit his child.’
b. Ali ter-pukul oleh anak-nya. (spontaneous)
Ali SPON-hit PREP child-3SG.POSS
‘Ali accidentally hit his child.’
(I Wayan Arka pers. comm.)
Russian
(5) a. Kostja poreza-l xleb.
Kostja cut.PERF-PAST.SG.MASC bread
‘Kostja cut the bread.’
b. Kostja poreza-l-sja.
Kostja cut.PERF-PAST.SG.MASC-SPON
‘Kostja has [accidentally] cut himself.’
(Vera Podlesskaya pers. comm.)
Diyari
(6) a. Ðatu yinana danka-na wawa-yi.
1SG.ERG 2SG.O find-PARTC AUX-PRES
‘I found you (after deliberately searching).’
b. Ðani danka-tadi-na wara-yi yiÐka Ðgu.
1SG.ABS find-SPON-PARTC AUX-PREP 2SG.LOC
‘I found you (accidentally).’
(Austin 1981: 154)
Sinhala
(7) a. mam ee wacne kiwwa.
I.NOM that word say.PAST
‘I said that word.’
b. maţ ee wacne kiywuna.
I.DAT that word say.P.PAST
‘I blurted that word out.’
(Gair 1990: 17)
The adaptation of the dative-subject construction for a spontaneous
action is also seen when the ‘‘dative-subject’’ is marked by cases di¤erent
from the dative as in the following Bengali examples, where the nominal
form corresponding to the dative subject is marked with genitive. Here
the volitional/spontaneous contrast takes on interesting nuances:
Bengali
(8) a. Ami toma-ke khub p chondo kor-i.
1SG.NOM 2ORDSG-OBJ very liking do-PRES.1
‘I like you very much.’ (According to my own criteria.)
b. Ama-r toma-ke khub p chondo
1SG-GEN 2ORDSG-OBJ very liking
h y.
become-PRES.3ORD
‘I like you very much.’ (According to some [socially] set
criteria.)
(Onishi 2001: 120)
When the basic meaning of the verb denotes a spontaneous (involun-
tary) action, the volitional voice form can be obtained by using a self-
benefactive construction, as in Marathi and other Indo-Aryan languages:
Marathi
(9) a. sitaa raD-l-i.
Sita.NOM cry-PERF-F
‘Sita cried.’
b. sitaa-ne raD-un ghet-l-a.
Sita-ERG cry-CONJ take-PERF-N
‘Sita cried (so as to relieve herself ).’
(Prashant Pardeshi pers. comm.)
Lhasa Tibetan has a set of auxiliaries expressing di¤erent categories of
perspective. ‘‘Perspective-choice’’ interacts with both person and eviden-
tial categories in a complex way, but the relevant auxiliaries can be di-
vided into a ‘‘self-centered’’ and an ‘‘other-centered’’ group (Denwood
Voice phenomena 227
Eastern Pomo
(13) a. ha: c’e:xelka. (volitional)
1SG.A slip
‘I am sliding.’
b. wı́ c’e:xelka. (spontaneous)
1SG.P slip
‘I am slipping.’
(McLendon 1978: 1–3)
Although the verb forms are the same, when the pronominal form is in-
flected for the patient (13b), the sentence conveys a spontaneous action or
a ‘‘lack of protagonist control’’ (McLendon 1978: 4). A similar contrast
is seen in the Caucasian language Tsova-Tush (Batsbi), where ‘‘[the] ref-
erent of [an ergative] subject is a voluntary, conscious, controlling partic-
ipant in the situation named by the verb’’ (Holisky 1987: 113).
Tsova-Tush (Batsbi)
(14) a. (as) vuiž-n-as.
1SG.ERG fall-AOR-1SG.ESRG
‘I fell down, on purpose.’
b. (so) vož-en-sO.
1SG.NOM fell-AOR-1SG.NOM
‘I fell down, by accident.’
(Holisky 1987: 104)
In addition to these cases of ‘‘fluid-S’’ marking (Dixon 1994), split in-
transitivity may be realized as a lexically-conditioned phenomenon, where
intransitive verbs are classified into an ‘‘agentive’’ class and a ‘‘patientive
class.’’ Agentive and patientive nominals respectively trigger marking
similar to the corresponding arguments of a transitive clause. The Philip-
pine language Cebuano shows this pattern through a focus system which
is characteristic of Formosan and Western Austronesian languages:
Cebuano
(15) Transitive actor-focus construction
Ni-basa ako ug libro.
AF-read I.TOP INDEF book
‘I read a book.’
(16) Transitive patient-focus construction
Gi-basa nako ang libro.
PF-read I TOP book
‘I read the book.’
(17) a. Agentive intransitive
Ni-dagan ako. (actor-focus form)
Voice phenomena 229
AF-run I.TOP
‘I ran.’
b. Patientive intransitive
Gi-kapoy ako. (patient-focus form)
PF-tired I.TOP
‘I got tired/I am tired.’
ative constructions such as John killed Bill and John made Bill walk are
a case in point. They di¤er in form from morphological causatives such
as Quechua wañu-či (die-CAUSE) ‘kill’ and Japanese aruka-se (walk-
CAUSE) ‘make walk’, where the causative meaning is expressed mor-
phologically. Traditionally, grammarians have tended to consider only
morphological causatives as proper cases. However, such a position leads
to the uncomfortable decision of treating the Quechua and Japanese
forms cited above as causative, while treating the semantically parallel
English expressions kill and make walk as noncausative. The form-based
treatment of causatives is tantamount to simply circumscribing morpho-
logical causatives, and does not lead to a comprehensive study of caus-
ative phenomena. Causation is a semantic, not a morphological notion,
and as such the whole range of expression types must be taken into ac-
count in a satisfactory analysis. Indeed, a (functional) typological study
is predicated on the view that a variety of expression types will obtain in
any given conceptual domain. The formal tripartite pattern of lexical,
morphological, and periphrastic causative constructions has now been
widely accepted, and some revealing correlations between form and func-
tion have been identified in the causative domain (see Shibatani and Par-
deshi 2002 on recent developments). We see below that a similar pattern
holds in other voice domains as well.
Having discussed two voice phenomena pertaining to the origin of an
action, we now turn to the next major voice parameter concerning its de-
velopment. We will consider the other voices associated with the nature
of the origin of an action — the passive and the inverse — after dealing
with other conceptually-based voice phenomena.
Sanskrit
(18) a. devadatto yajnadattasya bharyam
Devadatta.NOM Yajnadatta.GEN wife
upayacchati. (active)
have.relations.3SG.ACT
‘Devadatta has relations with Yajnadatta’s wife.’
b. devadatto bharyam upayacchate. (middle)
Devadatta.NOM wife have.relations.3SG.MID
‘Devadatta has relations with his (own) wife.’
(Klaiman 1988: 34)
Voice phenomena 233
Sanxiang Dulong/Rawang
(19) a. aÐ 53 a 31 dffil 31 a 31 be 55 . (active)
3SG mosquito hit
‘S/he is hitting the mosquito.’
b. aÐ 53 a 31 dffil 31 a 31 be 55 -ffl 31 . (middle)
3SG mosquito hit-MID
‘S/he is hitting the mosquito (on her/his body).’
(LaPolla 1996: 1945)
234 M. Shibatani
German, where sich hinsetzen ‘sit down’ has a middle marker, but aufste-
hen ‘stand up’ does not. These irregularities require individual accounts,
based on historical, cognitive, and even cultural data.
The middle voice system has several important implications for our
general understanding of the nature of voice phenomena. Recall that
most of the widely received definitions of voice (such as the one quoted
from Crystal [2003] in Note 1) hold that voice opposition does not entail
a meaning contrast. This is not the case for the active/middle opposition,
as shown by the examples above as well as by the contrast between the
English active form John hit Bill and the middle form John hit himself.
Secondly, these examples show that voice alternations do not necessar-
ily alter argument alignment patterns. There is no change in grammatical
relation in the contrastive pairs in (18) and (19). If the situations depicted
there give the impression of unusual utterances, consider the mundane
situations described by the following Greek examples, where a meaning
contrast is expressed without a realignment of arguments:
Ancient Greek
(20) a. loúô khitôna. (active)
wash.1SG.ACT shirt.ACC
‘I wash a shirt.’
b. loúomai khitôna. (middle)
wash.1SG.MID shirt.ACC
‘I wash my shirt/I wash a shirt for myself.’
Chukchee
(23) a. ltg¼e keyмn penr-nen.
father¼GER bear¼ABS attack¼3SG:3SG/AOR
‘The father attacked the bear.’
b. ltg¼n penr¼tko¼ge
father¼GER attack¼APASS¼3SG.AOR
keyмet. (antipassive)
bear¼DAT
‘The father rushed at the bear.’
(Kozinsky et al. 1988: 652)
Warlpiri
(24) a. nyuntulu-rlu u-npa-ju pantu-rnu ngaju.
you-ERG u-2SG.A-1SG.P spear-PAST I.ABS
‘You speared me.’
b. nyuntulu-rlu u-npa-ju-rla pantu-rnu
you-ERG u-2SG.A-1SG-DAT spear-PAST
ngaju-ku. (antipassive)
I-DAT
‘You speared at me; you tried to spear me.’
(Dixon 1980: 449)
According to Dixon (1980: 449), (24b) above ‘‘indicates that the action
denoted by the verb is not fully carried out, in the sense that it does not
have the intended e¤ect on the entity denoted by the object [read ‘‘pa-
tient’’, MS].’’ Similarly, visual contact is not made when situations in-
volving visual perception are presented in the antipassive voice:
Warrungu
(25) a. nyula nyakaþn wurripaþØ.
3SG.NOM seeþP/P beeþABS
‘He saw bees.’
b. ngaya nyakaþkaliþØ wurripaþwu katyarraþwu.
1SG.NOM see-APASSþP/P beeþDAT possumþDAT
‘I was looking for bees and possums.’
(Tsunoda 1988: 606)
Moreover, for action types a¤ecting a patient, the antipassive voice
presents a situation as not a¤ecting the patient in totality, as in the fol-
lowing examples:
Samoan
(26) a. Sā ‘ai e le teine le i‘a.
PAST eat ERG ART girl ART fish
‘The girl ate the fish.’
Voice phenomena 239
Russian
(29) a. Ivan mojetsja mylom.
Ivan wash.MID soap.INSTR
‘Ivan washed himself with soap.’
b. Babuška rugajetsja.
granny.NOM scold.APASS
‘Granny is scolding.’
(Geniušienė 1987: 9)
In addition, languages may show the well-known connection between the
middle and the passive12 through the use of the same form as the antipas-
sive, thus illustrating a three-way middle-passive-antipassive polysemy:
Russian (cf. the examples immediately above)
(30) Dom stroitsja turezk-oj firm-oj
house.NOM is.being.built.PASS Turkish-INST firm-INST
INKA.
INKA
‘The house is being built by the Turkish company INKA.’
Kuku Yalanji
(31) a. karrkay julurri-ji-y. (middle)
child.ABS wash-MID-NONPAST
‘The child is washing itself.’
b. warru (yaburr-ndu) bayka-ji-ny. (passive)
young man.ABS shark:LOC:pt bite-PASS-PAST
‘The young man was bitten (by a shark).’
c. nyulu dingkar minya-nga nuka-ji-ny. (antipassive)
3SG.NOM man.ABS meat-LOC eat-APASS-PAST
‘The man had a good feast of meat (he wasted nothing).’
(Patz 1982: 244, 248, 255)
No ! active/middle
Yes ! benefactive/malefactive/applicative
While the notion of benefit-giving is a broad one, there is one particular
type with a perceptible change in the beneficiary. This is the case involv-
ing transfer of an object, where the object itself is directly a¤ected by the
act of giving. In a typical giving situation, the object is physically moved
from one owner to a new one. The recipient beneficiary is secondarily af-
fected because it comes into possession of the transferred object. Lan-
guages often have a special benefactive construction that portrays this
type of situation, where the e¤ect on the beneficiary is indicated by its
argument status in syntactic coding. As shown in Shibatani (1996), bene-
factive constructions are typically based on the syntactic schema of the
give-construction even involving the verb form for giving in some lan-
guages, as in the case of Japanese seen below:
(32) a. Taroo-wa Hanako-ni hon-o yat-ta.
Taro-TOP Hanako-DAT book-ACC give-PAST
‘Taro gave Hanako a book.’
b. Taroo-wa Hanako-ni hon-o kat-te
Taro-TOP Hanako-DAT book-ACC buy-CONJ
yat-ta.
BEN-PAST
‘Taro bought Hanako a book.’
In (32b) the buying action is extended beyond the patient (the book),
and a¤ects the beneficiary nominal (Hanako) coded in the dative form.
Compare this construction to the one below, expressing a more general
benefit-giving in which the beneficiary takes on a nonargument form.
(33) Taroo-wa Hanak-no tame-ni hon-o
Taro-TOP Hanako-of sake-for book-ACC
kat-te yat-ta.
buy-CONJ GIVE-PAST
‘Taro bought a book for (the sake of ) Hanako.’
While (33) may express any type of benefit-giving — including one of
buying a book to help Hanako’s book-selling business — (32b) specifi-
cally conveys the meaning that the transfer of the book was intended.
Note also the English translations accompanying these examples, which
show the same contrast.
Benefactive/malefactive events are also realized by so-called external
possession constructions in Indo-European and some other languages
(cf. Payne and Barshi 1999), although the context may determine whether
242 M. Shibatani
mopera.
short
‘He slept with the woman with the short hair.’ (i.e. they had
sex together.)
(Donohue 1999: 231)
The following instrumental applicative from Pulaar also demonstrates
how an applied instrumental nominal can implicate a participant more
thoroughly a¤ected by the agent’s action:
Pulaar
(44) a. mi loot-ii miñ am a
1SG wash-PERF.ACT y.s. 1SG.POSS PREP
saabunnde hee.
soap DET
‘I washed my younger sibling with (some of ) the soap.’
b. mi loot-r-ii miñ am
1.SG wash-INST-PERF.ACT y.s. 1SG.POSS
saabunnde hee.
soap DET
‘I washed my younger sibling with (all of ) the soap.’
(Sebastian Ross-Hagebaum pers. comm.)
The various e¤ects of locative applicatives have also been recognized in
the literature. The Balinese locative expression in (45b) below, for exam-
ple, describes a situation where the action of planting banana trees ex-
tends in such a way as to a¤ect the garden. Here the entire garden ends
up being planted with banana trees, while no such implication is made in
the nonapplicative counterpart (45a).
Balinese
(45) a. Tiang mulan biyu di tegalan tiang-e.
1SG plant banana in garden 1SG-POSS
‘I planted bananas in my garden.’
b. Tiang mulan-in tegalan tiang-e biyu.
1SG plant-APPL garden 1SG-POSS banana
‘I planted my garden with bananas.’
(I. Wayan Arka pers. comm.)
agent and the patient. In other words, the inverse is concerned with the
degree of topicality of the origin of an action, relative to its terminal
point. The direct/inverse system is thus essentially concerned with the
question of where the action originates, and can be characterized in the
following manner:
Direct/inverse opposition:
Does the action originate in an agent higher in discourse relevance than
the patient?
Yes ! direct
No ! inverse
Here we invoke a notion of discourse relevance that is more general
than that of topicality, though the two are intimately connected. By
discourse relevance we refer to two types of relationship which event
participants have with the speaker and hearer and with the discourse.
The first- and second-person event participants are most relevant to the
speaker and hearer, since we have a natural interest in what we do and
what happens to us. Our inclination to talk about ourselves and those
familiar to us leads to the high discourse-topic potential of first- and
second-person referents and other similar entities. The topicality hierar-
chy in (48) and the more elaborated version in (69) below represent the
degrees of discourse relevance in the above sense. Event participants also
vary along di¤erent degrees of relevance commensurate with their infor-
mation value. An entity that is crucial to the information content of a
message has a higher degree of discourse relevance than one supplying
information tangential to the core information conveyed. The two types
of discourse relevance naturally cohere and characterize the most fre-
quently occurring participants in a stretch of discourse.
The inverse system marks constructions denoting situations where the
action originates with an agent low in discourse relevance relative to the
patientive terminal. This is very much like what we see in the passive con-
struction; hence the question naturally arises whether the direct/inverse
and the active/passive systems should be distinguished or not. This has
been a controversial issue in the analysis of some languages. The most
telling di¤erence between the two systems is that while the active/passive
opposition involves changes in the alignment of semantic roles and gram-
matical relations, the direct/inverse opposition does not. In the following
Japanese examples, agents are subjects in both direct and inverse forms:
Japanese
(49) a. Boku-ga Taroo-ni denwa-o si-ta. (direct)
I-NOM Taro-DAT phone-ACC do-PAST
‘I phoned Taro.’
248 M. Shibatani
Nocte
(50) a. nga-ma nang hetho-e.
I-ERG you teach-1PL
‘I will teach you.’
b. nang-ma nga hetho-h-ang.
you-ERG I teach-INV-1
‘You will teach me.’
(Das Gupta 1971, quoted after DeLancey 1981: 641)
Another major distinction between the inverse and the passive is that
the former is a transitive clause requiring the expression of an agent argu-
ment, whereas in the latter the agentive nominal is deranked from argu-
ment status. This well-known feature of the passive can be understood
in light of the extremely low discourse relevance of the agentive nominal
in a passive clause (Jespersen 1965; Shibatani 1985). This prompts us to
the following formulation of the parameter controlling the active/passive
opposition:
Active/passive opposition:
Does the action originate with an agent extremely low in discourse rel-
evance, or at least lower relative to the patient?
Yes ! passive
No ! active
The major di¤erence between the inverse and the passive is that in the
former, the agent has a relatively high discourse relevance compared to
the agent of a passive. That is, while the agents of both inverse and pas-
sive clauses have a low degree of discourse relevance or topicality in com-
parison to the patient, the agent of the inverse has a higher degree of
relevance than that of the passive agent.16 The consequence of this di¤er-
ence is that while the agent of the inverse does not su¤er syntactic derank-
ing, that of the passive does. The null expression of an agent in a passive
clause is a favored syntactic response to its extremely low discourse rele-
vance. This is more widespread in impersonal than in personal passives,
Voice phenomena 249
di¤erences, both the passive and the inverse have a close conceptual a‰n-
ity, since both are concerned with the discourse relevance of the agentive
participant, as shown in Figure 2. We then expect to find situations where
the passive and the inverse share a functional domain, or where the fea-
tures of these two constructions combine. The first case can be observed
in Japanese, where the active/passive and direct/inverse systems divide
the task of indicating the direction of an action with regard to the deictic
center. When simple actions are involved, the active/passive opposition
is utilized. When an action involves the transfer of some entity — for ex-
ample, a letter in letter-writing, or a message in telephoning — the direct/
inverse pattern is invoked.
Japanese
(54) a. Boku-wa Taroo-o nagut-ta. (active)
I-TOP Taroo-ACC hit-PAST
‘I hit Taro.’
b. aTaroo-wa boku-o nagut-ta.18 (active)
Taro-TOP I-ACC hit-PAST
‘Taro hit me.’
c. Boku-wa Taroo-ni nagura-re-ta. (passive)
I-TOP Taroo-by hit-PASS-PAST
‘I was hit by Taro.’
(54 0 ) a. Boku-wa Taroo-ni hon-o okut-ta. (direct)
I-TOP Taro-DAT book-ACC send-PAST
‘I sent a book to Taro.’
b. aTaroo-ga boku-ni hon-o okut-ta. (direct)
Taro-NOM I-DAT book-ACC send-PAST
‘Taro sent me a book.’
c. Taroo-wa boku-ni hon-o okut-te ki-ta. (inverse)
Taro-TOP I-DAT book-ACC send-CONJ INV-PAST
‘Taro sent me a book.’
Southern Tiwa
(55) a. seuanide Ø-liora-mu-ban.
man 3SG.3SG-lady-see-PAST
‘The man saw the lady.’
Voice phenomena 251
about the Southern Tiwa passive is that discourse relevance is fairly rig-
idly determined by the topicality hierarchy — as in the case of an inverse
system — indicating the possibility that the Southern Tiwa passive has
historically developed from an inverse system. Nevertheless, passive con-
structions that are apparently unrelated to the inverse system actually
also show restrictions like those of the Tiwa passive. In Japanese, for ex-
ample, passivization is not as free or optional as normally described.
Indeed, the Japanese system is not unlike the Southern Tiwa system de-
scribed above. Generally, if an action originates with an agent higher
in discourse relevance than the patient, an active sentence is preferred,
whereas when the relevance status is reversed, a passive sentence is
chosen:
(60) a. Boku-wa Taroo-o nagut-ta.
I-TOP Taro-ACC hit-PAST
‘I hit Taro.’
b. aTaroo-wa boku-ni nagura-re-ta.
Taro-TOP I-by hit-PASS-PAST
‘Taro was hit by me.’
(61) a. aTaroo-ga boku-o nagut-ta.
Taro-NOM I-ACC hit-PAST
‘Taro hit me.’
b. Boku-wa Taroo-ni nagura-re-ta.
I-TOP Taro-by hit-PASS-PAST
‘I was hit by Taro.’
a
(63) a. sikan-i na-lul ccoch-ko issyo.
time-NOM I-ACC chase-CONJ be
‘Time is chasing me.’
b. na-nun sikan-e ccoch-ki-ko issyo.
I-TOP time-by chase-PASS-CONJ be
‘I am being chased by the time.’
(64) a. namca-ka kon-ul ccoch-ko issyo.
man-NOM ball-ACC chase-CONJ be
‘A man is chasing a ball.’
b. akon-i namca-eke ccoch-ki-ko issyo.
ball-NOM man-by chase-PASS-CONJ be
‘A ball is being chased by a man.’
(Klaiman 1988: 56–57)
Even in English — where passivization is believed to be highly grammati-
calized — a similar distribution pattern is generally observed, such that
passives like John was hit by me and actives like A dog bit me this morning
are generally avoided. All in all, then, the active/passive system and the
direct/inverse system are controlled by a similar principle based on the
discourse relevance of the origin of an action relative to the patient termi-
nal point.
Next we find a surprising application of the proposed approach to a do-
main that is seemingly very di¤erent from voice, namely split case marking
in transitive clauses. The best known case occurs with the so-called split er-
gativity phenomenon, whereby both accusative and ergative case-marking
patterns coexist within a single language. Here we focus on a so-called
NP-split, where the case-marking pattern is conditioned by the nature of
the nominal element, as illustrated by the Warrgamay examples below:
Warrgamay
(65) a. ngulmburu gaga-ma.
woman.ABS go-FUT
‘The woman will go.’
b. ngulmburu-nggu maal
woman-ERG man.ABS
ngunda-lma. (absolutive-ergative pattern)
see-FUT
‘The woman will see the man.’
(66) a. ngana gaga-ma.
we.NOM go-FUT
‘We will go.’
b. ngana nyurra-nya
we.NOM you-ACC
254 M. Shibatani
tern; that is, the most natural event configuration is expressed by an un-
marked active construction and the least natural by marked passive
forms. To summarize, we obtain the following form-function correlations
regarding the relevant phenomena:
view comes from the distribution of the passive and the antipassive. The
active/passive opposition, which deals with the discourse status of the
agentive nominal, tends to be grammaticalized in accusative languages,
for in this type of language the primary focus is on the agentive partici-
pant. In ergative languages, on the other hand, the status of the patient
is of primary concern, and the ergative/antipassive opposition dealing
with its status is typically grammaticalized here.
The basic voice orientation of accusative- and ergative-type languages
is thus role-based, which motivates a formal mechanism of the marked
voice — the passive and antipassive, respectively. Compared to this, lan-
guages with a Philippine-type focus system do not have a role-based voice
orientation. The choice of primary syntactic argument — known variously
as ‘‘topic,’’ ‘‘subject,’’ or ‘‘pivot’’ — is made broadly without confinement
to the primary focal participants of agent or patient. The choice is moti-
vated by pragmatic factors such as the definiteness of the participants and
aspectual orientation, as well as semantic factors determining the degree
of transitivity. The ‘‘fluid’’ character of Philippine-type voice systems
makes voice conversion facile without special morphosyntactic mecha-
nisms such as passive and antipassive morphology, or the radical re-
arrangement of argument structure seen in the languages with a basic
voice orientation. The best names for the Philippine-type voice patterns
are ‘‘actor-voice,’’ ‘‘patient-voice,’’ ‘‘instrumental-voice,’’ etc.
To summarize the above discussion, the active voice is the relationship
between the transitive situation type, as defined above, and its syntactic
coding in the nominative-accusative fashion. The ergative voice, on the
other hand, is the relationship between the transitive situation type and
its absolutive-ergative coding pattern. Notice that not all coding patterns
of the transitive situation type are active voice. The passive, for example,
codes the transitive situation type, yet it is not an active voice construc-
tion. Likewise, the relationship between the transitive situation type and
the ergative construction cannot be considered as active voice, contrary
to the view expressed by Dixon (1994: 216) in his statement: ‘‘there is
typically an active/antipassive voice contrast in ergative languages.’’ The
relevant contrast he has in mind is between the ergative and the anti-
passive voice.
Both active and ergative constructions are basic construction types
such that the transitive situation type would be coded according to these
basic construction patterns in accusative and ergative languages, respec-
tively, unless pragmatic factors call for the marked voice constructions
of passive and antipassive. In languages with a Philippine-type focus sys-
tem, there is no basic voice orientation such that the transitive situation
type can be coded by either the actor-focus or the patient-focus construc-
Voice phenomena 259
tion (see [15] and [16] above). While the patient-focus constructions are
favored over the actor-focus constructions when a definite patient is in-
volved, the system is not as codified as in the case of ergative languages
that code the transitive situation type in the absolutive-ergative manner
regardless of the definiteness status of the patient nominal.21
We now relate the choice of the central syntactic element of the clause
with the notion of discourse relevance. The primary argument has the
highest discourse relevance in the following senses: 1) it typically has con-
stitutive relevance; 2) it is most salient in the speaker’s mind; 3) it plays
an important role in the propositional act; and 4) it is the entity on which
the hearer’s attention is focused. What distinguishes the three primary
arguments in the three types of languages being examined here — the
nominative argument in accusative languages, the absolutive argument
in ergative languages, and the topic/subject/pivot of the Philippine-type
languages — is their indispensability. That is, except for a few other
minor sentence types, such as existential and exclamatory expressions, all
sentences (N.B.: not clauses) in the respective language types must con-
tain these arguments. We take this fact to be connected to the require-
ment of a proposition to contain an item to be predicated over. In other
words, the primary arguments under consideration all have the referential
function of pointing out what is to be talked about, or predicated over, in
a propositional verbal act. They are what the traditional term ‘‘subject’’
represents in both logic and grammar, and there is no harm in applying
this term to nominative, absolutive, and Philippine-style ‘‘topic’’ nomi-
nals, as long as they are understood in terms of their role in a propositio-
nal act. Just as importantly, they must not be understood in terms of the
agent-based notion of subject, as it applies to the subject in English and
other European languages, or in terms of syntactic roles, that is, their
behavioral and coding properties (Keenan 1976).22 The ranking of dis-
course relevant items at the clausal level may be necessitated by a require-
ment regarding the distribution of attention. The subject of a proposition
is what is most salient in the speaker’s mind, and on which the hearer’s
attention is expected to be focused.23
Situations deviating from prototypical transitive situations are often ex-
pressed by marked voice constructions. Still the structural complexity of a
given voice construction depends on what the normal state of a¤airs is.
A transitive situation involving an agent and a patient is ‘‘normal’’ for
such actions as killing and breaking that entail the transfer of the action
from one party to another. Departures from prototypical transitive situa-
tions for these kinds of activities therefore result in marked situations,
and languages reflect it in marked voice constructions. Thus, a middle
situation in which a transitive action is confined within the agent’s sphere
260 M. Shibatani
6. Conclusion
Notes
* This is a thoroughly revised version of a paper presented in 2002 in Berlin at the work-
shop of the project ‘‘A cognitive-typological study of valency structures: Japanese-
German contrastive perspective’’ supported by the Japan Society for the Promotion of
Science and the Deutsche Forschungsgesellschaft. A fuller account of the overall
framework and individual voice phenomena is presented in my forthcoming book
Voice from Cambridge University Press.
Uncommon abbreviations: AF ¼ actor focus, APASS ¼ antipassive, EV ¼
evidential, INV ¼ inverse, NS ¼ nonsubject, OBV ¼ obviative, P ¼ passive-like
264 M. Shibatani
This definition is problematic at least in the following four respects: 1) there are voice
oppositions that do not involve alterations in grammatical relation; 2) there are voice
phenomena — even passive constructions — which involve meaning contrast; 3) the
active/passive opposition is not the main voice distinction either in a diachronic or a
synchronic sense. Diachronically passive constructions develop secondarily from vari-
ous sources — for example, middles — and synchronically there are numerous lan-
guages that do not have a formal system for an active-passive opposition, even while
showing other types of voice opposition; 4) constructions such as reflexives are voice
constructions par excellence (see text).
2. See Shibatani and Pardeshi (2002).
3. Another general methodological problem, which we have little room to address in
this article, is the importance of diachronic understandings for structural diversities
and for boundary problems seen in various functional domains including the voice
domain.
4. Studies on transitivity subsequent to Hopper and Thompson (1980) — for example,
Lazard (2002), Kittilä (2002), and Næss (2003) — also deal with a set of phenomena
highly similar to that discussed in this study.
5. Cf. the following definitions of voice: ‘‘[voice] is a regular marking in the verb of the
correspondences between units at the syntactic level and units at the semantic level. In
short, voice is a diathesis grammatically marked in the verb’’ (Xolodovič 1970, as
quoted in Geniušienė 1987: 42–53; emphasis added); ‘‘The category of voice is an in-
flectional category such that its grammemes specify such modifications of basic diathe-
sis of a lexical unit that do not a¤ect its propositional meaning’’ (Mel’čuk 1993: 11;
emphasis added).
6. See Malchukov (forthcoming) for a similar view on the iconicity of formal marking of
transitivity features.
7. Cf. Givón (2001: Ch. 13), where the middle voice and some others are characterized as
semantically-contrastive voices and are distinguished from pragmatically-motivated
voices such as the passive. However, Givón does not group causatives and applicatives
with other voice constructions, presumably because they are not detransitiving con-
structions. Yet, they contrast with simple active transitive constructions in meaning,
and applicatives in particular are often pragmatically controlled as the passive and
antipassive are. Moreover, syntactic detransitivization is not even a defining feature of
middle voice constructions, which Givón recognizes as detransitive voice constructions.
Voice phenomena 265
8. See Langacker (1991: Ch. 7) for a fuller discussion of the notion of action chain. See
also Croft (1994), where voice is treated under a similar conception as in this article,
although its scope is much narrower than ours.
9. Not surprisingly, reflexive markers develop from nouns denoting a body or head in
many languages, for example, Amharic ras ‘head’.
10. See Haiman (1985) for a similar approach, which distinguishes two types of verb —
introverted and extroverted.
11. Our position in recognizing intransitive verbs as lexical middles may be challenged on
the basis of the fact that many languages allow marked middle forms for intransitive
verbs: for example, Spanish La pelota cayó de la canasata ‘The ball fell from the bas-
ket’ (as in a basket game) vs. La pelota se cayó de la mesa ‘The ball fell from the table’.
The question is, shouldn’t the former example be considered as an active expressing
the active/middle contrast with the latter? The answer is ‘‘No.’’ The doubling of forms
expressing the same voice domain occurs frequently, as in the case of lexical and mor-
phological causatives (e.g. Japanese ire- ‘put in’ and haira-se- ‘cause to go in’), as well
as that of morphological and periphrastic middle (e.g. Balinese ma-sugi ‘wash.face’ and
nyugiin awak ‘wash.face self ’). These pairs of forms show subtle meaning di¤erences —
as past studies on causatives have shown — but not in terms of voice opposition. See
Maldonado (1992) for a detailed study of the Spanish middle, from which the Spanish
examples above are taken.
A more serious challenge could be raised regarding the verb ending interpreted as
an active marker in Classical Greek, which also occurs in intransitive verbs: politeu-ô
‘I am a citizen/have civic rights’ vs. politeu-omai ‘I act as a citizen/carry out my
civic rights for myself ’ (Klaiman 1988: 32). Our position on this is that the -ô — and
other so-called active endings — should be interpreted as default subject markers oc-
curring in both lexical middles and transitive actives. The middle endings mark
derived middles.
12. The connection between the middle and the passive is believed to be due to the shared
a¤ected meaning of the subject.
13. A ‘‘possessive relation’’ is in turn implicated in constructions other than the benefac-
tive/malefactive, for example, the double subject construction (see Shibatani 1994).
14. According to Ad Foolen (pers. comm.), the Dutch applicative prefix be- shows a simi-
lar e¤ect. While Hij slaapt met een vrouw ‘He sleeps with a woman’ is euphemistically
used to mean ‘He has sex with a woman’, it still has the literal interpretation of simply
sharing a bed with a woman, as in English. The applicative form Hij beslaapt een
vrouw, lit. ‘He sleeps a woman’, on the other hand, specifically means ‘He has sexual
intercourse with a woman’.
15. Here the inverse marker is a grammaticalized form of the verb kuru ‘come’.
16. Givón (2001: 93) characterizes the di¤erence in the degree of discourse relevance of the
agent and patient of the passive and inverse constructions as: Agt f Pat (passive),
Agt < Pat (inverse), where f and < respectively indicate a case where ‘‘the agent is ex-
tremely non-topical . . . , so the patient is the surviving topical argument in the clause,’’
and a case where ‘‘the patient is more topical than the agent but both agent and patient
are topical.’’
17. According to Dryer (1994), Kutenai shows the direct/converse contrast only with com-
binations of a third person agent and a third person patient.
18. A pound sign ðaÞ — as opposed to an asterisk — indicates that a sentence is pragmat-
ically odd, even though it is grammatical. This and other pragmatically odd sentences
are possible in a context that is not deictically anchored to speech time or location, for
example, in a narrative of past events recited from a detached perspective.
266 M. Shibatani
19. A closer examination may indicate a distinction similar to the proximate-obviative con-
trast seen in the inverse system of Kutenai. See Kroskrity’s (1985) discussion of the sta-
tus of third person referents in Arizona Tewa, which has a voice system very similar to
that of Southern Tiwa.
20. See Shaumyan (1986) for a similar view.
21. Basque, for example, uses the ergative construction in encoding both definite and indef-
inite patient participant of a transitive situation, as in:
(i) Jonek ardoa ekarri du.
Jon.ERG wine bring AUX.TR
‘Jon brought (the) wine.’
(Hualde and Ortiz de Urbina 2003: 411)
In Philippine-type languages, the actor-focus construction is used for the indefinite pa-
tient reading and the patient-focus construction for the definite patient reading. The
term ‘‘fluid voice system’’ is intended for this kind of property of the Philippine-type
focus system, which (i) allows alternation between the actor-focus construction and
the patient-focus construction without involving an additional verbal morphology
such as a passive or antipassive marker, and (ii) ‘‘promotes’’ an oblique nominal to
the topic/subject/pivot status without involving an additional/separate applicative
morphology. Austronesian languages with a Philippine-type focus system vary with re-
spect to the second property. Balinese, for example, has developed a separate applica-
tive process, which is required for an oblique to be promoted to the patient status be-
fore it can participate in the actor-focus/patient-focus alternation.
22. The syntactic properties of the subject vary across languages, and in fact the list of
properties such as those in Keenan (1976) do not o¤er universal definitions of the sub-
ject pace his assertion. See Langacker (1991: 305 ¤.) on this point.
23. In this light, the controversial label of ‘‘focus system’’ for the Philippine-type voice phe-
nomenon is not entirely inappropriate. Also see Givón (1976), Langacker (1991: 305
¤.) and the related discussions on the subject and the topic.
24. By ‘‘a systematic formal expression’’ we mean a construction that has the passive func-
tion, approximating the passive prototype as defined, for example, in Shibatani (1985).
According to the surveys by Dik Bakker and Anna Siewierska, 53% of the languages in
their 397-language sample and 57% in their similar 374-language sample do not have a
passive construction. The di¤erence between these two samples is that the former in-
cludes languages with a Philippine-type focus system as languages having a passive
construction, while the latter does not. (I am grateful to Dik Bakker for supplying
this information to me.)
Languages that have not developed a mature passive construction can express pas-
sivity by means of those constructions that share the pragmatic function of the passive
such as indefinite person constructions (e.g. they normally ignore the speed limit on this
freeway) and inverted word order for patient prominence. See Dezső (1988) for the dis-
cussion of how Hungarian copes with the needs for expressing passivity in the absence
of a passive construction.
References
Allen, B.; and Franz, D. (1983). Advancements and verb agreement in Southern Tiwa. In
Studies in Relational Grammar 1, D. Perlmutter (ed.), 303–316. Chicago: University of
Chicago Press.
Voice phenomena 267