Professional Documents
Culture Documents
(Language Acquisition & Language Disorders, v. 20) Kazue Kanno-The Acquisition of Japanese As A Second Language-John Benjamins Pub (1999) PDF
(Language Acquisition & Language Disorders, v. 20) Kazue Kanno-The Acquisition of Japanese As A Second Language-John Benjamins Pub (1999) PDF
(Language Acquisition & Language Disorders, v. 20) Kazue Kanno-The Acquisition of Japanese As A Second Language-John Benjamins Pub (1999) PDF
AS A SECOND LANGUAGE
LANGUAGE ACQUISITION & LANGUAGE DISORDERS
EDITORS
Harald Clahsen Lydia White
University of Essex McGill University
EDITORIAL BOARD
Anne Baker (University of Amsterdam)
Melissa Bowerman (Max Planck Institut für Psycholinguistik, Nijmegen)
Katherine Demuth (Brown University)
Werner Deutsch (Universität Braunschweig)
Kenji Hakuta (UC Santa Cruz)
Nina Hyams (University of California at Los Angeles)
Peter Jordens (Free University, Amsterdam)
Jürgen Meisel (Universität Hamburg)
Kim Plunkett (Oxford University)
Mabel Rice (University of Kansas)
Michael Sharwood Smith (University of Utrecht)
Antonella Sorace (University of Edinburgh)
Karin Stromswold (Rutgers University)
Jürgen Weissenborn (Universität Potsdam)
Helmut Zobl (Carleton University, Ottawa)
Volume 20
Edited by
KAZUE KANNO
University of Hawaii at Manoa
Abbreviations vii
List of Contributors ix
Acknowledgements xi
1. SLA Research and Japanese 1
Kazue Kanno
2. Implicit Negative Feedback 9
Shunji Inagaki and Michael H. Long
3. Tasks and Learners’ Output in Nonnative-Nonnative Interaction 31
Noriko Iwashita
4. SPOT: A Test Measuring “Control” Exercised by Learners of
Japanese 53
Junko Ford-Niwa and Noriko Kobayashi
5. Retesting a Universal: The Empty Category Principle and
Learners of (Pseudo)Japanese 71
Eric Kellerman, John van IJzendoorn and Hide Takashima
6. L2 Acquisition of Japanese Unaccusative Verbs by Speakers of
English and Chinese 89
Makiko Hirakawa
7. Who Knows What and Why? The Acquisition of Multiple
Wh-Questions by Adult Learners of English and Japanese 115
Naoko Yoshinaga
8. Gapping and Coordination in Second Language Acquisition 141
William O’Grady
vi TABLE OF CONTENTS
Makiko Hirakwa
Junko Ford-Niwa
Tokyo International University
Department of Humanities Josai
2509 Matoba Kawagoe, Saitama,
International University
Japan, 350–11
1 Gumyoo Toogane-shi, Chiba, Japan e-mail: hmakiko@tiu.ac.jp
e-mail: junko@jiu.ac.jp
Yoshinaga, Naoko
Noriko Kobayashi
Hirosaki Gakuin University
International Student Center
13–1 Minori-cho Hirosaki-shi,
University of Tsukuba
Aomori-ken, Japan 036
Tennodai, Tsukuba, Ibaraki, Japan 305 naoks@infoaomori.ne.jp
x LIST OF CONTRIBUTORS
During the more than two years that it took to complete this project, I benefited
from help and support from various sources, for which I am very thankful.
I acknowledge with gratitude financial support from the University of Hawaii
Japan Studies Endowment, funded by a grant from the Japanese Government.
I would like to thank the authors of the studies that appear in this volume
for their commitment to the project and for their patience during review process.
I am also indebted to several individuals for their comments on one or more of
the individual chapters that appears in this book, particularly Lydia White, Kevin
Gregg, Craig Chaudron, Graham Crooks, J. D. Brown, Lourdes Ortega, Robert
Bley-Vroman, and William O’Grady.
Special thanks are also due to Keira Ballantyne, who read through the
manuscript in its entirety twice, noting many problems and inconsistencies. I also
thank Todd Fukushima for his technical assistance.
Finally, I would also like to thank my family for their support and patience.
C 1
Kazue Kanno
University of Hawai‘i at Manoa
Almost everyone in the world is called upon at some time in his or her life to
learn a second language. The fact that this endeavor typically does not end in
success raises important practical and theoretical questions having to do with the
effects on the human language faculty of neurological maturation, cognitive
development, and the acquisition of a first language. The study of these questions
lies at the core of research into second language acquisition (SLA).
In principle, the scope of SLA research includes the acquisition of any
language other than one’s native language, and there is widespread agreement
that work in this area should be as broad and varied as possible. In the words of
Larsen-Freeman & Long (1991: 7):
The scope of SLA research must be sufficiently broad to include a variety of
subjects who speak a variety of native languages who are in the process of
acquiring a variety of second languages in a variety of settings for a variety of
reasons.
In fact, however, work to date has focused very heavily on the learning of
English and other European languages, often by native speakers of European lan-
guages. This is clearly an undesirable state of affairs and there is a pressing need
to expand the range of target languages whose acquisition is being investigated.
This book is an attempt to contribute to that objective by examining the
acquisition of Japanese as a second language — a topic that has received little
attention to date outside of Japan. This is unfortunate for at least two reasons.
First, Japanese is more commonly studied as a second language than is any other
language of Asia (with the possible exception of Mandarin). JSL programs are
found not only in Japan, but in many universities in North America, Asia,
Europe, and even South America. There is thus clearly a need for research that
addresses issues related to its acquisition by foreign learners.
2 KAZUE KANNO
Second, Japanese has many linguistic features not found in the European
languages that are more commonly the subject of research on second language
acquisition. This offers researchers unique opportunities to extend the range of
their theories and to test hypotheses that could not otherwise be investigated. In
fact, work along these lines has been the principal focus of my own research
over the past several years (e.g., Kanno 1996, 1998a, b).
This book marks the first-ever collection of papers in English on the
acquisition of Japanese as a second language. In part for this reason, it is
deliberately broad in scope and eclectic in approach, highlighting contributions
from diverse areas in the far-ranging field of second language acquisition
research and seeking to offer a representative survey of studies on the
acquisition of Japanese as a second language.
The contributions to this volume can be grouped into three classes: studies
of input and interaction, research into the evaluation of L2 proficiency, and
investigation of the grammatical system that is the product of second language
acquisition. Brief overviews of each paper follow.
Two papers deal with input- and interaction-related issues in SLA. The first,
by Shunji Inagaki and Michael Long, examines the effects of recasting and
modeling on the acquisition process. Recasting is a type of feedback that occurs
when, in response to a speaker’s utterance, the interlocutor maintains the
previously introduced topic but makes a structural change to one or more of
components of the utterance. It contrasts with modeling, in which the speaker
simply provides an exemplar of the target pattern as part of the input and/or
instruction to which the learner is exposed.
Recent evidence suggests that recasting is superior to modeling in facilitat-
ing grammatical development in L1 acquisition (e.g., Saxton 1997), and Inagaki
& Long set out to investigate the relative effectiveness of the two types of
feedback for the acquisition of Japanese by English speakers. They focus on two
word order phenomena — one involving prenominal adjectives (a color adjective
precedes a size adjective) and the other involving locative-initial patterns (in
which a locative phrase occurs sentence-initially before even the subject).
Twenty-four English speakers who were learning Japanese as a second language
at the university level were divided into two treatment groups — one receiving
recasts for adjective ordering and modeling for the locative-initial construction,
and the other receiving modeling for adjective order and recasts for the locative-
initial. The results indicate that recasting and modeling were equally effective in
facilitating acquisition of the two phenomena.
Noriko Iwashita reports on research that builds on Pica et al.’s (1989) study
of comprehensible output. Two tasks were used — a one-way information gap
SLA RESEARCH AND JAPANESE 3
at S-structure suggest that even native speakers of Japanese do not observe the
unaccusative/unergative contrast, which raises the question of whether Case drop
is an appropriate diagnostic for this phenomenon.
The paper by Naoko Yoshinaga examines the acquisition of multiple
wh-questions such as ‘Who is eating what?’ by English speakers learning
Japanese and Japanese speakers learning English. The two languages differ in
the type of multiple wh-questions they permit: whereas English allows only argu-
ment-argument patterns, Japanese also permit argument-adjunct constructions.
(2) English Japanese
Argument-argument: Who saw what? Dare-ga nani-o mi-ta-no?
who- what- see--
‘Who saw what?’
Argument-adjunct: *Who left why? Dare-ga doosite ittyatta-no?
who- why has.gone-
‘Who left why?’
The results of Yoshinaga’s study revealed that Japanese-speaking ESL learners
rejected all types of English multiple wh-questions, in sharp contrast with the
performance of English native speakers. On the other hand, English-speaking
learners of Japanese resembled Japanese native speakers in accepting all types of
Japanese multiple wh-questions.
The paper by William O’Grady also adopts a comparative perspective,
investigating the understanding of constraints on gapping direction in coordinate
structures by English-speaking learners of Japanese as a second language and
Japanese learners of English as a second language. The phenomenon that
O’Grady examines has its roots in the fact that object-verb languages such as
Japanese uniformly rule out rightward gapping whereas verb-object languages
such as English consistently prohibit leftward gapping.
(3) English
a. Coordinate sentence without a gap
John read Time and Mary read Newsweek.
b. *Leftward gapping
John Ø Time and Mary read Newsweek.
c. Rightward gapping
John read Time and Mary Ø Newsweek.
6 KAZUE KANNO
(4) Japanese
a. Coordinate sentence without a gap
John-wa Time-o yon-de Mary-ga Newsweek-o
John- Time- read- Mary- Newsweek-
yon-da
read-
‘John read Time and Mary read Newsweek.’
b. Leftward gapping
John-wa Time-o Ø Mary-wa Newsweek-o yon-da.
c. Rightward gapping
*John-wa Time-o yon-de Mary-wa Newsweek-o Ø
Working with subjects at various levels of proficiency, O’Grady reports that
whereas ESL learners reject backward gapping in English and show signs of
accepting forward gapping, JSL learners show a strong and long-lasting prefer-
ence for forward gapping in Japanese.
The final paper in the volume, which I authored, complements and extends
O’Grady’s study by investigating the ability of Mandarin-speaking learners to
reject backward (‘leftward’ in O’Grady’s terminology) patterns of gapping in
Japanese. English and Japanese allow verb gapping but they differ with respect
to direction in the way noted above. On the other hand, Mandarin does not
permit verb gapping at all. The primary concern of my paper has to do with what
inferences, if any, English- and Mandarin-speaking learners can make about
gapping direction in Japanese based on exposure to head-final sentences. Of
special interest is the question of whether the difference between English and
Mandarin affects the way native speakers of each language acquire verb gapping
pattern in Japanese.
For the English JSL learners, rejection of the forward (‘rightward’) gapping
pattern seems to involve a long process stretching over several years. By
comparison, the same result appears to be achieved much earlier by the Mandarin
JSL learners, who clearly reject forward gapping patterns in Japanese even in the
early stages of language study while at the same time accepting the backward
construction at a much earlier point than do the English-speaking subjects.
It is my hope that this varied set of studies on the acquisition of Japanese
as a second language will be of use and interest both to SLA researchers and to
teachers of Japanese around the world.
SLA RESEARCH AND JAPANESE 7
References
Fukuda, Minoru. 1993. “Head Government and Case Marker Drop in Japanese.”
Linguistic Inquiry 15: 168–172.
Kageyama, Taro. 1993. Bunpoo to Gokeisei. Tokyo: Hitsuji Shoboo.
Kanno, Kazue. 1996. “The Status of a Non-parametrized Principle in the L2 Initial State.”
Language Acquisition 5 : 317–335.
———. 1998a. “Consistency and Variation in Second Language Acquisition.” Second
Language Research 14: 376–388.
———. 1998b. “The Stability of UG Principles in Second Language Acquisition:
Evidence from Japanese.” Linguistics 36: 1125–1146.
Larsen-Freeman, Diane & Michael Long. 1991. An Introduction to Second Language
Acquisition. London: Longman Group.
Pica, T., L. Holliday, N. Lewis & L. Morgenthaler. 1989. “Comprehensible Output as an
Outcome of Linguistic Demands on the Learner.” Studies in Second Language
Acquisition 11: 63–90.
Saxton, Matthew. 1997. “The Contrast Theory of Negative Evidence.” Journal of Child
Language 24: 139–61.
Swain, Merrill. 1985. “Communicative Competence: Some roles of comprehensible input
and comprehensible output in its development.” In S. Gass & C. Madden (eds.),
Input in Second Language Acquisition. Rowley, MA: Newbury House, 35–253.
C 2
1. Introduction
instructed over naturalistic acquirers (Ellis 1994: 611–663; Long 1983a, 1988),
given that provision of some kind of “error correction” is one of the most
pervasive instructional practices (Krashen & Seliger 1975). The traditional
approach, of course, is to provide explicit feedback on error, with the speakers’
attention overtly directed at problematic code features. With implicit negative
feedback, on the other hand, the message, not the code, remains the interlocutors’
primary attentional focus. This is potentially of particular value in analytic
approaches to language instruction in the sense of Wilkins (1976), exemplified
by some kinds of task-based and content-based language teaching approaches
which utilize non-linguistic units of analysis in their syllabi and attempt to
maintain a predominant focus on meaning, not linguistic forms, in their method-
ology. If implicit negative feedback were shown to be effective, the instructional
options available in classrooms employing analytic syllabi would be significantly
broadened, and necessary negative feedback could at least partly be delivered in
a manner less disruptive for the desired classroom focus on tasks or subject
matter. Put another way, the focus on form which research findings and theoreti-
cal predictions both suggest is needed and facilitative in L2A (for review, see
Long & Robinson 1998) could be achieved in context in lessons whose primary
focus was meaning, or communication, not a series of linguistic forms, as is the
case with synthetic, e.g., structural, syllabi.
In an updated version of the so-called Interaction Hypothesis (Long 1981,
1983b, 1996: 451–454), Long has suggested how implicit negative feedback
operates in L2A. Negotiation for meaning elicits negative feedback, including
recasts. Such feedback draws learners’ attention to mismatches between input and
output. It can induce noticing of the kinds of forms for which a pure diet of
comprehensible input will not suffice, e.g., items that are unlearnable from
positive evidence, or are rare, and/or semantically lightweight, and/or perceptual-
ly non-salient, and/or cause little or no communicative distress:
… it is proposed that environmental contributions to acquisition are mediated
by selective attention and the learners’ developing L2 processing capacity, and
these resources are brought together most usefully, although not exclusively,
during negotiation for meaning. Negative feedback obtained in negotiation work
or elsewhere may be facilitative of SL development, at least for vocabulary,
morphology and language-specific syntax, and essential for learning certain
specifiable L1–L2 contrasts. (Long 1996: 414)
IMPLICIT NEGATIVE FEEDBACK 11
Negative feedback has a long and disputatious history in both L1A (Sokolov &
Snow 1994) and L2A (Schachter 1991). Some linguistic nativists (e.g., Beck &
Eubank 1991; Pinker 1989; Schwartz 1993; Schwartz & Gubala-Ryzak 1992)
claim that the negative evidence supposedly delivered via such feedback is
unavailable and/or irrelevant, or else only useful for cultivating metalinguistic
knowledge that cannot be converted into more widely usable implicit knowledge.
Researchers operating in other traditions (e.g., Bohannon, MacWhinney & Snow
1990; Nelson 1987; Tomasello & Herron 1988) hold very different views.
Supporters of a role for negative feedback note that the rarity of overt
“correction” observed in early child language studies (e.g., Brown & Hanlon
1970) does not mean that implicit negative feedback is unavailable to learners.
While admittedly sometimes ambiguous, such feedback abounds in both caretak-
er/child and native speaker/non-native speaker conversation in a variety of forms,
including unexpected outcomes following learner utterances, communication
breakdowns, puzzled looks from listeners, such “negotiation” moves as confirma-
tion checks and clarification requests, and corrective recasts. Corrective recasts
are responses which reformulate all or part of a learner’s utterance, providing
relevant morphosyntactic information that was obligatory, but either missing or
wrongly supplied, in the learner’s rendition while retaining its central meaning as
in (1):
(1) Child: Doggy eat it!
Adult: The doggy ate it?
Children’s sensitivity to negative feedback is shown by their tendency to imitate
all or part of corrective recasts between two and four times more frequently than
other kinds of parental responses, such as noncorrective recasts, exact repetitions,
topic continuations or topic changes, despite the fact that exact repetitions tend
to be syntactically simpler (Bohannon & Stanowicz 1988; Farrar 1990), suggest-
ing, as Farrar notes (1990: 65), that they are responding to the negative evidence,
not just the recast’s imitative quality. Moreover, controlled studies have found
that, in otherwise natural conversation, recasts are more successful than (1) equal
numbers of noncontingent models of control structures for the same children or
(2) models of the same structures in comparable children (Baker & Nelson 1984;
Farrar 1990; Nelson 1989, 1991; Nelson, Denninger, Bonvillian, Kaplan, &
Baker 1984). In other words, it is not simply because recasts simultaneously
provide additional models of a target form, i.e., positive evidence, that they
appear to work. Children who receive additional positive evidence, either in the
12 SHUNJI INAGAKI & MICHAEL H. LONG
received either six models (on a pre-recorded audiotape) or six recasts (live) of
the structure from the researcher. The control group (n = 9) practiced writing
kanji for an equivalent period of time. All subjects then completed a second
version of the picture-description task as the posttest. Structures (locatives and
adjective order) and treatments (models and recasts) were crossed, each subject
receiving models of one structure and recasts of the other. Structures, treatments,
and pre- and post-test forms were counterbalanced. All sessions were conducted
individually and audio-recorded. Including pretest and posttest, the entire session
for all subjects lasted 30 minutes.
Mito found no learning of either structure in the modeling condition or by
the control group. Conversely, there was a small, but statistically significant
improvement on one or the other of the two structures by six of 18 subjects in
the recast condition.1 Mito’s findings are consistent with the L1 results, suggest-
ing that implicit negative feedback is usable by L2 adult language learners, and
that recasts can be superior to models. The results must be treated with caution,
however, since (1) the target structures proved to be too difficult to provide a
genuine test of the hypothesis for some of Mito’s subjects, and (2) subjects had
an extra output opportunity in the recast condition before they heard the re-
searcher’s reformulation of their utterance, whereas the models delivered on tape
did not require interaction and, hence, learner output.
In a continuation of the line of research represented by Mito’s work, the
present study was conducted at the University of Hawai‘i at Manoa during the
1994–95 academic year. It addressed the same research question: What is the
relative contribution, if any, of models and recasts in foreign language develop-
ment? Given the preliminary L1 and L2 results, there were three hypotheses:
1. Learners who hear models of target L2 structures will show greater ability
to produce those structures, as measured by pretest-posttest gain scores, than
learners not exposed to the structures.
2. Learners who hear recasts of target L2 structures will show greater ability
to produce those structures, as measured by pretest-posttest gain scores, than
learners not exposed to those structures.
3. Learners who hear recasts of target L2 structures will show greater ability
to produce those structures, as measured by pretest-posttest gain scores, than
learners who hear models of those structures.
14 SHUNJI INAGAKI & MICHAEL H. LONG
3. The study
3.1 Method
3.1.1 Subjects
Subjects were 24 young adult learners of Japanese enrolled in a second semester
course, Japanese 102, at the University of Hawai‘i at Manoa. There were 13
males and 11 females, 22 undergraduates and two graduates, ranging in age from
17 to 35 (M = 20.08, SD = 4.24). Twenty reported having studied Japanese in high
school for between one and four years (M = 2.83, SD = 0.82). Half had visited
Japan for brief periods (M = 14.5 days, SD = 5.65); one had lived there for a year.
Subjects were paid five US dollars for their participation in the study.
The second pattern investigated here was a locative construction in which the
locative phrase occurs between verb, a topic and the verb imasu (‘be’), as in (4):
(4) G-wa L-no mae-ni i-masu.
G- L- front- be-.
‘G is in front of L.’
This order seems to be preferred when G is the established discourse topic (thus
marked with wa), or given. When there is no established topic, however, a
second order, shown in (5), is preferred, where the locative phrase occurs
sentence-initially before the subject:
(5) L-no mae-ni G-ga i-masu.
L- front- G- be-.
‘In front of L is G.’
It was this latter construction that was the second target in the present study.
Examination of the course textbook (Jorden & Noda 1988) and conversations
with instructors had established that only the structure in (4), not the structure in
(5), had been taught when the study took place.
3.1.3 Design
The study utilized a pretest-posttest, control group design (see Figure 1).
Subjects were randomly assigned to form five groups. Since the four
treatment groups were to be collapsed into two (model and recast) for the
eventual statistical analysis, (after controlling for any ordering effect), four
subjects were assigned to each treatment group and eight to the control group, so
that there would be equal numbers of subjects (n = 8) in the model, recast and
control conditions. After the pretest, the four subjects in the first treatment group
received recasts of structure A, followed by models of structure B, and then the
posttest. The four subjects in the second treatment group received models of
structure B, followed by recasts of structure A, and so on. Subjects were given
one of two equivalent forms 1 and 2 of the test as the pretest and the other as
the posttest, use of which was also counterbalanced. As in Mito’s study, between
taking the pretest and posttest, subjects in the control group (n = 8) practiced
writing kanji. As shown in Figure 1, structures, treatments, and tests were
crossed and counterbalanced.
native speaker of Japanese, obviously not the male researcher — containing the
“color–size–NP” string with the first adjective in the -kute form. Having been
given instructions (in English) to do so beforehand, they then had 20 seconds to
repeat the utterance so that the researcher, who they could see had not heard it,
could select and hold up the appropriate piece of paper as they, the subjects, did
likewise. The use of headphones was thought beneficial because it created an
information gap between subjects and researcher and gave a communicative
purpose to the former’s repetition of the model, no easy matter in a contrived,
laboratory-type setting. The repetition in turn ensured that both the model and
recast conditions involved comparable input and output opportunities. A typical
trial in the modeling condition is shown in (7):
(7) Tape-recorder: Aka-kute ookii kami.
(only heard by red- large paper
the subject) ‘A large red piece of paper.’
Subject: Aka-kute ookii kami.
[Researcher and subject hold up pieces of paper]
There were six items, so that each subject received six exemplars of adjective
ordering as models. Each item involved one input and one output, as in the
recast condition but in reverse order.
In a variant of this section of Mito’s study, for the locative construction
treatment, researcher and subject had to position four dolls (Tom, Joe, May, and
Ken) on a drawing of a room containing two rows of seats, two seats per row,
matching each other’s configuration while separated visually by a screen. In the
recast condition, with a task very much like that for adjective ordering, subjects
arranged their dolls in a position of their choosing, and then using one sentence,
described the position of two of them in relation to each other so that the
researcher could match the configuration. As soon as 20 seconds had passed,
indicated by the cue hai (‘yes’) from the tape-recorder, the researcher provided
a recast with the locative phrase in sentence-initial position. (8) exemplifies a
typical sequence:
(8) Tape-recorder: Partner A [subject], please choose two dolls and,
using one sentence, describe their relative posi-
tion from your vantage point.
Subject: Joe-wa May-no mae-ni i-masu.
Joe- May- front- be-.
‘Joe is in front of May.’
18 SHUNJI INAGAKI & MICHAEL H. LONG
Tape-recorder: Hai.
‘Yes.’
Researcher: May-no mae-ni Joe-ga i-masu
May- front- Joe- be-.
ne?
‘In front of May is Joe, right?’
Again, there were six items, six recasts, and so six exemplars of the target structure.
In the modeling condition, again paralleling the modeling treatment for
adjective ordering, subjects first heard an utterance containing the target locative
structure through headphones, and then in 20 seconds repeated it so that the
researcher could place dolls as directed while subjects did the same. A typical
trial is shown in (9):
(9) Tape-recorder: May-no mae-ni Joe-ga i-masu.
(only heard by May- front- Joe- be-.
the subject) ‘In front of May is Joe.’
Subject: May-no mae-ni Joe-ga imasu.
There were six models, after which researcher and subject removed the screen to
compare their room plans.
Subjects in the control group practiced writing 12 kanji for a period
equivalent to the treatments. Including pretest and posttest, the entire session for
all subjects lasted 40 minutes.2
The picture designed to elicit use of the target locative construction depicted four
rows of three people in a 12-person choir. Subjects were asked to describe the
position of two individuals relative to each other using imasu (‘be’). One point
was awarded for a sentence-initial locative phrase in the responses, with absence
or misuse of particles, such as ga, wa, ni, and no, ignored. Two native speakers
of Japanese scored the data independently, obtaining acceptably high inter-rater
reliability (r = 0.961). Remaining discrepancies were resolved by discussion.
4. Results
In Table 1, the raw scores on adjective ordering and the locative construction on
the pretest and posttest are presented for individual subjects in different
treatment groups. Table 1 shows that among 24 subjects, seven (subjects 1, 5, 16,
8, 18, 23, and 20) and two (subjects 8 and 10) had some prior knowledge of
adjective ordering and the locative construction, respectively.
This is presumably a function of instruction at high school (see above),
since, as mentioned, the subjects in this study had not been taught either of the
target structures at university. Although it would have been ideal to screen out all
subjects who showed any prior knowledge of either of the target structures on
the pretest and to use the posttest score only for analysis, the small number of
subjects in this study did not allow this; instead, it was decided to calculate gain
scores, excluding those cases where subjects scored three points for adjective
ordering on the pretest (i.e., subjects 5, 16, and 20) and retaining the other cases,
including where subjects showed some, but not full, knowledge, scoring one or
two points for either of the two structures on the pretest.
In Table 2, gain scores on adjective ordering and the locative construction
are presented for individual subjects in the different treatment groups.
Table 2 shows that (1) three of six subjects who received models of
adjective ordering improved on that structure, as did three of eight who received
recasts, and one of seven in the control group, and that (2) two of eight subjects
who received models of the locative structure improved on that structure, as did
two of eight who received recasts, and two of eight in the control group.
Mean gain scores on adjective ordering and the locative construction for all
three groups, model, recast, and control, are shown in Table 3. Results for the
two syntactic structures failed to provide support for Hypotheses 1, 2, and 3.
Gain scores of subjects who had experienced models were not statistically
significantly different from those of control group subjects (t(27) = 1.59, p > .05).
Neither were gain scores of subjects who had experienced recasts (t(29) = 1.14,
20 SHUNJI INAGAKI & MICHAEL H. LONG
p > .05). Nor were gain scores of subjects who had experienced models and
recasts statistically significantly different from one another (with two subjects
excluded due to incomplete data, t(13) = .17, p > .05). Statistical comparisons of
the two treatment groups and of each against the control group separately for
adjective ordering and the locative were also all non-significant.
In Table 4, the raw scores on the -kute form on the pretest and posttest are
presented for individual subjects in the different treatment groups.
IMPLICIT NEGATIVE FEEDBACK 21
Table 4 shows that three subjects (subjects 18, 10, and 24) had some prior
knowledge of the -kute form, again presumably as a function of instruction in
high school. Therefore, following the cases of adjective ordering and the locative
construction, gain scores were again calculated, excluding one subject (i.e.,
subject 10) who scored three points on the pretest and retaining the other two
22 SHUNJI INAGAKI & MICHAEL H. LONG
Table 3. Mean gain scores on adjective ordering and the locative construction
Model Recast Control Total
n=6 n=8 n=7 n = 21
Adj. O. S=6 S=8 S=1 S = 15
(k = 3) M = 1.00 M = 1.00 M = 0.14 M = 0.71
SD = 1.27 SD = 1.69 SD = 0.38 SD = 1.27
n=8 n=8 n=8 n = 24
Locative S=6 S=3 S=3 S = 12
(k = 3) M = 0.75 M = 0.38 M = 0.38 M = 0.50
SD = 1.39 SD = 0.74 SD = 0.74 SD = 0.98
n = 14 n = 16 n = 15 n = 45
Total S = 12 S = 11 S=4 S = 27
(k = 6) M = 0.86 M = 0.69 M = 0.27 M = 0.60
SD = 1.29 SD = 1.30 SD = 0.59 SD = 1.12
5. Discussion
The results for adjective ordering, with six of 14 subjects (42.9%) in the two
treatment groups improving, compared with only two of 18 (11.1%) in Mito’s
study, reflected more learning of that structure in the treatment conditions than
before. This was probably due to simplification of the target construction from
three-adjective to two-adjective strings, the addition of an output opportunity in
the modeling condition to match that in the recast condition, and/or to pre-
IMPLICIT NEGATIVE FEEDBACK 23
message content, of models than would normally be the case due to the fact that
subjects knew they had not only to follow the directions themselves, but also to
repeat them so that their partners could do so. This may have nullified any super-
iority of recasts on this task as noticed input in Schmidt’s terms (Schmidt 1993).
More puzzling is the improvement on locatives after recasts of only two out
of eight subjects (25%) in this study compared with four out of nine (44.4%) in
Mito’s, as well as the improvement on locatives this time by two out of eight
subjects (25%) in the control group. With respect to the apparent lack of an
advantage for either treatment in learning the locative over writing kanji for an
equivalent period of time, examination of posttest transcripts showed that at least
two subjects in each of the treatment groups appeared to have found the test
directions for eliciting the locative ambiguous, which made the task more
demanding than it should have been. The test directions for locatives were,
“Please choose person X and person Y and describe, in one sentence using
imasu, their relative position from your vantage point.”3 There is evidence that
those subjects interpreted “their relative position” as “their position relative to the
others in the group (choir)”, not as “their position relative to each other”, which
was the intended meaning. This seemed to have made the task more demanding
because they were forced to choose some other person(s) in the group as a
reference point and produce longer sentences, such as A to B-no mae-ni X to Y-ga
imasu (‘In front of A and B is X and Y’), which were less like the ones
provided as models or recasts in either treatment session. This may have
obfuscated the effects of both models and recasts on development of the locative
construction. Dormant knowledge of the locative among control group members
may also have been revived by attempts to produce the structure on the pretest.4
The comparable gains across structures after models in the present study
(five of 14 subjects, or 35.7%) and after recasts (five of 16 subjects, or 31.3%)
also contrast with Mito’s findings, where no subjects improved after models, but
six of 18 subjects (33%) receiving recasts did. This suggests the effects of the
modification to the modeling condition and/or of pre-existing latent knowledge
of the target structures.
26 SHUNJI INAGAKI & MICHAEL H. LONG
6. Conclusion
The results of the present study are rather disappointing, but nevertheless provide
some evidence in support of the claim that implicit negative feedback plays a
facilitative role in SLA. While possibly aided by a triggering effect of the
pretest, recasts produced some learning and/or resuscitation of latent prior
knowledge of all three structures. The results are admittedly difficult to interpret
due to the presence among subjects of some prior knowledge of all three learning
targets. The “prior knowledge” problem may have preempted a clear test of the
hypothesized greater effectiveness of recasts than models.5 Also mitigating
against finding a stronger effect for recasts (and hence, producing a tougher test
for Hypothesis 3, as indicated earlier) was a peculiarity of the modeling treat-
ments which arose from the effort to render that condition comparable to the
recast condition, as well as “communicative.” This was the requirement on
subjects to reproduce the models for their partners immediately after hearing
them, something that would be rare in naturalistic acquisition contexts (although
common in traditional classroom L2 instruction, perhaps in a non-communicative
manner), and which probably led the learners to pay greater attention to the form
of the models than would normally be the case, instead of focusing chiefly on
decoding their meaning. Subjects may well have noticed and acquired more
grammatical information from the modeled input as a result, thereby obscuring
the impact of recasts.
The “communicative” nature of the tests, treatments and procedures
developed for this study arguably more closely approximated untutored L2
learning contexts than those utilized in some earlier work on recasts and other
forms of negative feedback, thereby addressing some of the concerns raised over
the external validity of earlier findings. On the other hand, this study failed to
deal adequately with a major methodological challenge in this type of research,
i.e., the need to distinguish between genuine acquisition of new structures and
deployment of existing knowledge. Caution in interpreting findings in this and
future studies of this sort is also needed due to the short-time nature of such
experiments; without one or more delayed posttests with monitoring of input in
the intervening period(s), one is limited to a consideration of short-time effects
of models or recasts. Such short-term (rate) studies may risk (1) overestimating
the effectiveness of either treatment, and (2) biasing results in favor of one type
of input, should each type prove differentially effective over time. Conversely,
(3) the short-time nature and small number of exposures (here, six tokens) to the
learning targets in such studies, and (4) the use of production measures to assess
learning, each risk underestimating the effectiveness of both types of input. As
IMPLICIT NEGATIVE FEEDBACK 27
in so many areas of SLA research, experimental work of the kind reported here
obviously needs to be complimented by detailed longitudinal case studies of the
same issues.
Acknowledgements
The research reported here was partially supported by the National Foreign Language Resource
Center at the University of Hawai‘i at Manoa, funded by the U. S. Department of Education under the
Language Resource Centers program (CFDA 84.229). We thank Nick Ellis for useful discussion of
methodological issues, Rieko Sawyer for her recordings of the instructions for this study, and Scott
Saft for his help in collecting data.
Notes
1. Specifically, four improved on the locative, whereas two improved on adjective order.
2. At the beginning of the session, subjects were given a vocabulary list containing Japanese
words with English translations to be used during the pre- and post-tests and the treatments, and
were asked to familiarize themselves with the vocabulary before starting the pretest. They were
allowed to refer to the list during the entire session. This was an attempt not to let subjects’
lack of vocabulary hinder their performance.
3. The directions were a modification of those in Mito (1993): “From your vantage point, describe,
in one sentence using imasu, person X’s position using person Y as a reference point.” This
modification was an attempt to neutralize the informational status of the referents of person X
and person Y in discourse. Notice that Mito’s directions establish person X as the discourse
topic; thus, one may prefer to answer them using the locative construction as in (4), whereas
our directions establish both person X and person Y as the discourse topic; thus, one may not
be biased to answer them using either the construction as in (4) or the locative-initial construc-
tion as in (5).
4. K. Kanno (personal communication, February 1997) suggests that the context for introducing
the locative construction in the treatment was not fully appropriate, which might in turn have
obfuscated the effects of models and recasts. She points out that the names of the dolls,
introduced at the beginning of the treatment, become anaphoric in the discourse, and thus that
the pattern in (4) would become more appropriate than the target structure in (5). While we
agree with her and think that refinement of the tasks will be called for for future research, it
still leaves unexplained why Mito (1993), who apparently had the same problem, found more
subjects improving on the locative construction in the recast condition.
5. In fact, a study in Spanish as a second language, which used a similar design with a stricter
control for prior knowledge, provided some evidence for the notion that recasts are more
effective than models in achieving short-term gains in learning a previously unknown L2
structure (see Long, Inagaki & Ortega 1998).
28 SHUNJI INAGAKI & MICHAEL H. LONG
References
Baker, Nancy D. and Keith E. Nelson. 1984. “Recasting and Related Conversational
Techniques for Triggering Syntactic Advances by Young Children.” First Language
5: 3–22.
Beck, Maria-Luise and Lynn Eubank. 1991. “Acquisition Theory and Experimental
Design: A critique of Tomasello and Herron.” Studies in Second Language Acquisi-
tion 13: 73–76.
Bohannon, John N., Brian MacWhinney and Catherine Snow. 1990. “No Negative
Evidence Revisited: Beyond learnability or who has to prove what to whom.”
Developmental Psychology 26: 221–226.
Bohannon, John N. and Laura Stanowicz. 1988. “The Issue of Negative Evidence: Adult
responses to children’s language errors.” Developmental Psychology 24: 684–689.
Brown, Roger and Camille Hanlon. 1970. “Derivational Complexity and Order of
Acquisition in Child Speech.” In J. Hayes (ed.), Cognition and the Development of
Language. New York: Wiley, 11–53.
Ellis, Rod. 1994. The Study of Second Language Acquisition. Oxford: Oxford University
Press.
Farrar, Michael J. 1990. “Discourse and the Acquisition of Grammatical Morphemes.”
Journal of Child Language 17: 607–624.
———. 1992. “Negative Evidence and Grammatical Morpheme Acquisition.” Develop-
mental Psychology 28: 90–98.
Herron, Carol and Michael Tomasello. 1988. “Learning Grammatical Structures in a
Foreign Language: Modelling versus feedback.” The French Review 61: 910–923.
Jorden, Eleanor and Mari Noda. 1988. Japanese: The Spoken Language, Part 2. New
Haven: Yale.
Krashen, Stephen and Herbert Seliger. 1975. “The Essential Characteristics of Formal
Instruction.” TESOL Quarterly 9: 173–183.
Long, Michael. 1981. “Input, Interaction, and Second Language Acquisition.” In H.
Winitz (ed.), Native Language and Foreign Language Acquisition. Annals of the New
York Academy of Sciences 379. New York: New York Academy of Sciences,
259–278.
———. 1983a. “Does Second Language Instruction Make a Difference? A Review of
Research.” TESOL Quarterly 17: 359–382.
———. 1983b. “Linguistic and Conversational Adjustments to Non-native Speakers.”
Studies in Second Language Acquisition 5: 177–194.
———. 1988. “Instructed Interlanguage Development.” In L. Beebe (ed.), Issues in
Second Language Acquisition: Multiple perspectives. Rowley, MA: Newbury House,
115–141.
———. 1991. “Focus on Form: A design features in language teaching methodology.” In
K. de Bot, R. Ginsberg and C. Kramsch (eds.), Foreign Language Research in
Cross-cultural Perspective. Amsterdam: John Benjamins, 39–52.
IMPLICIT NEGATIVE FEEDBACK 29
———. 1996. “The Role of the Linguistic Environment in Second Language Acquisi-
tion.” In W. C. Ritchie and T. K. Bhatia (eds.), Handbook of Language Acquisition.
Vol. 2: Second Language Acquisition. New York: Academic Press, 413–468.
Long, Michael, Shunji Inagaki and Lourdes Ortega. 1998. “The Role of Implicit Negative
Feedback in SLA: Models and recasts in Japanese and Spanish.” Modern Language
Journal 82: 357–71.
Long, Michael and Peter Robinson. 1998. “Focus on Form: Theory, research and
practice.” In C. Doughty and J. Williams (eds.), Focus on Form. Cambridge:
Cambridge University Press, 15–41.
Martin, Samuel E. 1975. A Reference Grammar of Japanese. New Haven and London:
Yale University Press.
Mito, Kaoru. 1993. The Effects of Modeling and Recasting on the Acquisition of L2
Grammar Rules. Unpublished Manuscript, University of Hawai‘i at Manoa.
Nelson, Keith E. 1987. “Some Observations from the Perspective of the Rare Event
Cognitive Comparison Theory of Language Acquisition.” In K. E. Nelson and A.
vann Kleeck (eds), Children’s Language. Hillsdale, NJ: Erlbaum, 289–332.
———. 1989. “Strategies for First Language Teaching.” In R. Scheifelbusch and M. Rice
(eds.), The Teachability of Language. Baltimore: Dan Brooks, 263–310.
———. 1991. “On Differential Language-learning Models and Differential
Interventions.” In N. A. Krasnegor, D. M. Rumbaugh, R. L. Scheifelbusch and M.
Studdert-Kennedy (eds.), Biological and Behavioral Determinants of Language
Development. Hillsdale, NJ: Erlbaum, 399–428.
Nelson, Keith E., Marilyn S. Denninger, John D. Bonvillian, Barbara J. Kaplan and
Nancy D. Baker. 1984. “Maternal Input Adjustments as Related to Children’s
Linguistic Advances and to Language Acquisition Theories.” In A. Pellegrini and T.
Yawkey (eds.), The Development of Oral and Written Language in Social Contexts.
Norwood, NJ: Ablex, 31–56.
Oliver, Rhonda. 1995. “Negative Feedback in Child NS/NNS Conversation.” Studies in
Second Language Acquisition 17: 459–481.
Pavesi, Maria. 1986. “Markedness, Discoursal Modes, and Relative Clause Formation in
a Formal and an Informal Context.” Studies in Second Language Acquisition 8(1):
38–55.
Pinker, Steven. 1989. Learnability and Cognition: The acquisition of argument structure.
Cambridge, MA: MIT Press.
Richardson, Mary-Ann. 1993. Negative Evidence and Grammatical Morpheme Acquisition:
Implications for SLA. Unpublished Manuscript, Perth: Graduate School of Education,
University of Western Australia.
———. 1995. The Use of Negative Evidence in Second Language Acquisition of Grammati-
cal Morphemes. M.Ed. in Applied Linguistics Thesis, University of Western
Australia.
Schachter, Jacquelyn. 1991. “Corrective Feedback in Historical Perspective.” Second
Language Research 7: 89–102.
30 SHUNJI INAGAKI & MICHAEL H. LONG
Noriko Iwashita
University of Melbourne
1. Introduction
more impact on the opportunities for modified output than task types (Pica et al.
1989). Similarly Linnell (1996), comparing the amount of learners’ modified
output in response to the same two interactional moves as investigated in Pica et
al.’s 1989 study, claims that clarification requests result in more modification of
learners’ interlanguage than do confirmation checks. Recently Mackey (1995)
and La Pierre (cited in Swain 1995, 1998) have examined further the relationship
between modified output and L2 learning; the findings of these studies show the
positive impact of modified output on L2 learning.
other languages investigated, and Japanese conversation has some special features
(e.g. the backchannel device called ‘aizuchi’).3 Despite the growing population
of learners of Japanese, curriculum development for Japanese language teaching
lags behind that of other languages. In order to investigate how learners acquire
specific features of Japanese and how to improve the quality of current teaching
practice, more studies are needed.
3. The study
3.1 Methodology
Twenty-four subjects (12 males and 12 females) participated in the present study.
They had studied Japanese at the tertiary level for almost 300 hours when the
data were collected and were recruited from two proficiency groups (High and
Low). (The proficiency of subjects as measured by course tests is given in
Appendix I.) Data were collected using closed one-way and two-way tasks. In
the one-way task which was done in pair-work, one subject described a picture
and the partner drew a picture based on that description. Participants then
swapped roles and did the same task using different pictures.
In the two-way task, each participant was given three or four pictures and
had to take turns describing the features of each picture and arranging all
pictures in order to build up a story. Participants were not allowed to look at
their partner’s picture.
All data were coded for C-units as in Pica et al.’s (1989) study. C-units are
defined as utterances (for example, words, phrases, and sentences, grammatical
and ungrammatical) which provide communication value (Rulon & McCreary
1986). The opportunity for modified output was observed through occurrences of
two types of requests (clarification requests and confirmation checks) in the
negotiation model developed by Varonis and Gass (1985).
Confirmation checks (CC) and clarification requests (CR) were categorized
according to the definitions given by Long and Sato (1983: 275). (An example of
each interactional move is also given below.)
(i) Confirmation checks (CC) involve complete or partial repetition of the
previous speaker’s utterance and serve either to elicit confirmation that their user
had heard and/or understood the previous speaker’s previous utterance correctly
or to dispel that belief.
(1) Example:
H8: ‘G’
‘G’
H7: ‘G’ wa doko?
‘Where is ‘G’?’
TASKS AND LEARNERS’ OUTPUT 37
4.1 Results
As shown in Table 1, significant differences were found for task types, but no
significant differences were found for types of interactional moves. The details
of the results for each hypothesis are given in Tables 7 to 10 in Appendix II.
TASKS AND LEARNERS’ OUTPUT 39
The results for hypotheses 1, 2 and 3 show that task types had an influence on
both the opportunity to modify learners’ output through CC/CR and the
subsequent production of modified output. The result for hypothesis 4 shows that
types of interactional moves did not have an impact on the production of
modified output.
4.2 Discussion
Based on the results of the study, task types and types of interactional moves
will be discussed, and further analyses will be provided.
4.2.1 Task types and the optimal opportunity for modified output
Interaction patterns differed across the two tasks, as predicted. In the two-way
task, both participants requested confirmation and clarification, exchanged
information which they held, and attempted modification of their previous
utterances whenever requested. In the one-way task, the learner who drew the
picture (information receiver) concentrated only on asking the interlocutor (the
information giver) for modification. When explaining the picture, information
givers only provided information and tried to modify their previous utterances
whenever requested through CC/CR. Information receivers who drew pictures
might have felt that whether they could draw a picture totally depended on their
understanding of their partner’s description of the picture. A similar tendency
was found in the study by Gass and Varonis (1985). In contrast, completion of
the two-way task did not rely on just one participant. The difference between the
two types of tasks lies in whether both participants are responsible for under-
standing each other’s speech.4,5
Another issue for consideration is the difficulty of the task. If the task is
too easy for subjects, there would not be much need for them to negotiate
meaning; that is, an information giver would not receive confirmation checks,
clarification requests, and any other interactional moves. On the other hand, if
the task is so difficult that learners do not even know the vocabulary which may
be used in the task, they may abandon the task or use their first language.
A further analysis of C-units per turn shows that there is a possibility that
little negotiation occurs if a subject finds the task too easy or too difficult. Table
2 shows that for most dyads in both two-way and one-way tasks the average
number of C-units per turn was similar, but for two dyads, turns in the one-way
task were very long. Two participants (L8 and H8) explained the picture without
being interrupted with clarification requests and confirmation checks from their
partners. In one of these dyads, the interlocutor’s proficiency was high and so
there might not have been much need for negotiation of meaning.
As the analysis in Table 2 shows, there was more speech production and
more interaction (Table 7 in Appendix II) in the one-way task than in the two-
way task. However, information receivers did not seek clarification or confirma-
tion if they understood what their interlocutor had described, so that one-way
tasks might not create an opportunity for modifying output.
In order to make further comparison of the different negotiation patterns in
the tasks, the instances of extended negotiation were counted in each task. The
negotiation model used in the present study was developed by Varonis and Gass
(1985) and is shown in Figure 1.
T → I → R → RR
(Trigger) (Indicator) (Response) (Reaction to Response)
in the production of modified output in each task, but the sample was too small
for further analysis.
The results reported earlier show that learners produced more modified
output in the one-way task than in the two-way task. Again, it may be assumed
that one-way tasks are more useful in encouraging learners to modify their
output. However, further analysis of the modified output found that the two-way
task resulted in more syntactic modification, which Swain (1985) claims to be an
important benefit to the learner and to the acquisition process, particularly in
relation to the kinds of grammatical error found in Canadian immersion students
learning French.
As in Pica et al. (1989), production of modified output was categorized into
lexical and syntactic modification (Table 4). More syntactic modification was
found in the two-way task than in the one-way task. Thus, one may conclude
that though the two-way task did not contribute much to speech production and
negotiation, it led to more syntactic modifications. Expected frequencies,
however, were too small for further analysis. The result should also be interpret-
ed with caution, since only two types of interactional moves (i.e., CC and CR)
were used for the analysis. Some other interactional moves might also have an
impact on modification (i.e., both syntactic and lexical). If other interactional
moves had been included in the analysis, the result might have been different
from what is presented here.
The different emphasis placed on one-way and two-way tasks may have
also resulted in different types of modification. As Pica et al. (1996) explain,
different emphasis would lead task participants to produce different types of
input, feedback and output modification. The two types of tasks used in the
present study each have a different emphasis. The one-way task (describing a
picture of a park/room and people) engaged learners in describing attributes,
states, and conditions in their pictures, which might have led to negotiation
involving names, features and positions of the objects. In contrast, the two-way
task engaged learners in describing a sequence of events and might have led to
negotiation over actions and experiences, with reference to time sequences and
2. A: Dekakemasu.
‘Departs.’
B: Uchi uchi ni dekakemasu? (CC)
‘House, house departing to the house?’
A: Iie, uchi kara shuppatsushimasu, aa pikunikku. (MO)
‘No, from the house they depart, aa picnic.’
b. Type 2. Interlocutor’s modified utterance in response to the
partner’s utterance
A: Etto, onnanoko wa hon ah o yo-yomimasu.
‘Well, a girl will read/reads a book.’
B: Yondeimasu? (CC)
‘Is (she) reading?’
A: Yondeimasu.
‘She is reading.’
c. Type 3. Expanding the interlocutor’s sentence by adding a word
or two
A: Aa, kurma no soto ni.
‘Ah, (they are) outside the car.’
B: Tatteimasu? (CC)
‘Standing?’
A: Hai, kuruma no soto ni tatteimasu.
‘Yes, they are standing outside the car.’
Learner feedback that consisted of a short utterance (e.g. one-word confirmation
check) was also found in Pica et al.’s study (1996). Pica et al. explain that the
principal way learners signaled a need for message comprehensibility was to
repeat an isolated word or phrase from a prior utterance (the authors referred to
this as segmentation). Pica et al.’s findings show that the use of this type of
signal by learners outnumbered their use of other signals, in comparison with
NSs’ greater use of other signals.
In order to investigate the role of these three types of ‘one-word CCs’, all
‘one-word CCs’ were categorized according to the three types (Table 5). Respons-
es to ‘one-word CCs’ were then grouped into six different types (Table 6).
There are several points worth noting with regard to the role of ‘one-word
CCs’ in carrying out tasks. First, one-word CCs provided an opportunity for
learners to modify their initial output. As shown in (7), the modification in
response to the one-word CCs was limited to the modification of the verb (use
of one word). In the whole study, however, 54 modified outputs in total were
produced in response to CCs and CRs in both two-way and one-way tasks. Out
TASKS AND LEARNERS’ OUTPUT 45
5. Conclusion
The major focus of the present study was on whether learners produce modified
output in NNS-NNS interaction in a foreign language situation. The question of
whether task types and types of interactional moves influence these aspects was
also investigated. The principal findings are as follows:
1. Learners were able to give feedback to other learners and also to attend to
the feedback given by other learners and to modify their output accordingly.
2. Task types had more influence on the opportunity for modified output and on
the actual production of modified output than did types of interactional moves.
The present study describes how learner-learner interaction occurs in Japanese, and
the results support the findings of earlier studies of negotiation in NNS-NNS dyads,
including the recent study by Pica et al. (1996). As in NS-NNS interaction where
NSs modify their speech in response to NNSs’ signals of non-understanding,
speakers/learners in NNS-NNS dyads are able to indicate difficulty in under-
TASKS AND LEARNERS’ OUTPUT 47
standing their interlocutors’ speech and to modify the speech that had caused the
communication breakdown. Through negotiation of meaning, learners not only
obtained opportunities to receive comprehensible input, but also to modify their
output. In this NNS-NNS interaction, speakers/learners who modified their output
provided comprehensible input to their interlocutors.
As the results show, one-way tasks have been found to have certain advan-
tages in providing opportunities for modified output and actual production of
modified output. The interaction pattern in one-way tasks revealed that partici-
pants tried hard to find out every detail needed to draw a picture, especially on
the semantic level, and made one-word confirmation checks frequently. Many
responses to these one-word CCs involve simple recognition with ‘yes’. Though
the number is small, learners still did modify their output in response to these
one-word CCs. In addition, through Types 2 and 3 confirmation checks, learners
were able to identify their partners’ ungrammatical utterances and to modify or
expand them.
Pica et al. (1996) suggest that learners’ interaction provides quantitatively
rich data for L2 learning compared with interaction with NS. Learners are quite
effective in providing feedback through segmentation. Recent studies (Mackey
1995 and La Pierre, cited in Swain 1995, 1998) investigating the relationship
between modified output and L2 learning have pointed toward the positive effect
of output on learning, but the results are not yet conclusive. Further research is
required involving other interactional moves and using more than one type of
task. As a further step, it is also necessary to examine whether modification in
learner-learner interaction will lead to long-term improvements and advances in
L2 knowledge and use.
Acknowledgments
This is a revised version of a paper delivered at the Second Language Research forum (McGill
University, October 1994) based on the author’s M. A. thesis (Iwashita 1993) completed at the
University of Melbourne. I wish to thank Prof. Tim McNamara for guidance throughout and also
Joanna Tapper and Neomy Storch for valuable comments and advice.
Notes
1. Foreign language situation means a setting where the target language (e.g. Japanese) is not
spoken in the surrounding environment.
48 NORIKO IWASHITA
2. When the present study was conducted (Iwashita 1993), most of the studies cited above, except
Loschky (1989), had not appeared.
3. See Locastro (1987) and Maynard (1989).
4. This point was made by an anonymous reviewer.
5. In the present study only two interactional moves (CC and CR) were examined. However, as
Parker and Chaudron (1987) note in presenting their extensive list of interactional moves, if a
wider range of moves had been included in the analysis, the results might have been different.
In the two-way task, the occurrence of CR and CC was as frequent as in the one-way tasks, but
if other interactional moves had been included, the result might not have been the same as the
one with analysis with CR and CC. In this regard, the analysis of the data using only CC and CR
has some limitations, and it is hard to claim the superiority of one-way tasks over two-way tasks.
Appendix I
Proficiency of learners
Test Result
Subject Proficiency Japanese Exposure Written (%) Aural (%) Oral (%)
(in Japan)
L1 Low — 62.0 065.0 068.0
L2 Low — 58.0 045.0 066.0
L3 Low — 60.8 045.0 068.0
L4 Low — 61.8 040.0 068.0
L5 Low — 70.0 020.0 060.0
L6 Low — 51.8 020.0 060.0
L7 Low — 58.7 040.0 070.0
L8 Low — 68.0 070.0 070.0
H1 High 6 weeks 87.0 080.0 100.0
H2 High 1 year 92.0 095.0 100.0
H3 High 6 weeks 90.4 100.0 100.0
H4 High — 96.0 085.0 100.0
H5 High 1 year 82.0 095.0 100.0
H6 High 1 year 93.0 085.0 100.0
H7 High 1 year 94.0 100.0 098.0
H8 High 1 year 89.0 090.0 098.0
HL1 Low — 63.0 000.0 060.0
HL2 High — 89.0 075.0 084.0
HL3 Low — 58.0 035.0 050.0
HL4 High 3 months 90.0 100.0 086.0
HL5 Low — 68.9 025.0 066.0
HL6 High 6 weeks 88.0 075.0 087.0
HL7 High 1 year 80.0 090.0 097.0
HL8 Low — 63.0 055.0 063.0
TASKS AND LEARNERS’ OUTPUT 49
Appendix II
Hypothesis 2: The proportion of clarification requests to confirmation requests will be higher in one-
way tasks than in two-way tasks.
Table 8. Proportion of CR to CC in Two-way and One-way tasks
Two-way One-way Total
n % n % n
All (n = 24)
Confirmation check (CC) 126 74.1 143.5 85.7 269.5
Clarification request (CR) 044 25.9 024.0 14.3 068.0
Hypothesis 3: There will be more opportunities in one-way tasks for interlocutors to modify their
interlanguage than in two-way tasks.
Table 9. Production of modified output in one-way and two-way tasks
Two-way One-way Total
n % n % n
All (n = 24)
Modified output 019 11.5 033 20.1 052
Other responses 146 88.5 131 79.9 377
Hypothesis 4: Clarification requests will give more opportunities for interlocutors to modify their
interlanguage than will confirmation checks.
Table 10. Production of modified output in relation to CC and CR
CC CR Total
n % n % n
All (n = 24)
Modified output 69 16.1 19 20.4 88
Other responses 359 83.9 74 79.6 433
50 NORIKO IWASHITA
References
Porter, P. A. 1986. “How Learners Talk to Each Other: Input and interaction in task-
centered discussions.” In R. Day (ed.), Talking to Learn: Conversation in second
language acquisition. Rowley, MA: Newbury House Publishers, 201–222.
Rabie, S. R. 1996a. “Negative Feedback, Modeling, and Vocabulary Acquisition in Task-
based Interaction.” Unpublished Master’s Thesis, University of Hawai‘i at Manoa.
———. 1996b. “A Study of NS/NNS Interaction as Part of a Language Learning
Program.” Unpublished Manuscript, University of Hawai‘i at Manoa.
Richardson, M. A. 1995. “The Use of Negative Evidence in Second Language Acquisition
of Grammatical Morphemes.” Unpublished Master’s Thesis in Education, University
of Western Australia.
Roberts, M. A. 1996. “Awareness and the Efficacy of Error Correction.” In R. Schmidt
(ed.), Attention and Awareness in Foreign Language Learning (Technical Report #9).
Honolulu: University of Hawai‘i, Second Language Teaching & Curriculum Center,
163–182.
Rulon, K. and J. McCreary. 1986. ‘Negotiation of Content: Teacher-fronted and small
group interaction.’ In R. Day (ed.), Talking to Learn: Conversation in Second
language Acquisition. Rowley, Mass: Newbury House.
Shortreed, I. M. 1993. “Variation in Foreigner Talk Input: The effects of task and
proficiency.” In G. Crookes and S. Gass (eds.), Tasks and Language Learning:
Integrating Theory and Practice. Philadelphia: Multilingual Matters, 96–122.
Swain, M. 1985. “Communicative Competence: Some roles of comprehensible input and
comprehensible output in its development.” In S. Gass and C. Madden (eds.), Input
in Second Language Acquisition. Rowly, MA: Newbury House, 235–253.
———. 1995. “Three Functions of Output in Second Language Learning.” In G. Cook
and B. Seidhofer (eds.), For H. G. Widdowson: Principles and practice in the study of
language: A Festschrift on the Occasion of His 60th Birthday. Oxford: Oxford
University Press, 125–144.
———. 1998. “Focus on form through conscious reflection.” In C. Doughty and J.
Williams (eds.), Focus on form in classroom second language acquisition.
Cambridge: Cambridge University Press, 64–81.
Varonis, E. M. and S. Gass. 1985. “Non-native/Non-native Conversations: A model for
negotiation of meaning.” Applied Linguistics 6: 71–90.
Yamaguchi, Y. 1994. “Negative Evidence and Japanese as a Foreign Language Acquisi-
tion.” Unpublished Manuscript, The University of Western Australia.
Yule, G. 1992. “The Variable effects of Some Task-based Learning Procedures on L2
Communicative Effectiveness.” Language Learning 42: 249–277.
Yule, G. and D. McDonald. 1990. “Resolving Referential Conflicts in L2 Interaction: The
effect of proficiency and interactive role.” Language Learning 40: 539–556.
C 4
SPOT
A Test Measuring “Control” Exercised
by Learners of Japanese
1. Introduction
tape is not stopped during the test, the total time required, including time for giving
instructions and collecting the answer sheets, is ten to fifteen minutes. Unlike
other tests, in which a number of answers may be possible for a single question,
SPOT allows only one correct answer for each question, all already given on the
tape. Correcting the test involves merely mechanically checking a single hiragana
character in each sentence. Neither the administration nor the correcting of the
test requires a specialized knowledge of Japanese language education.
SPOT is also simple to match to the estimated level of the incoming
students. There are three main current versions of SPOT, referred to here as
Version 3 (beginners-intermediate), Version 2 (intermediate-advanced) and
Version 4 (a shortened intermediate-advanced edition).3 Each version can be used
for a wide range of students and, as each test requires only ten to fifteen
minutes, two can be used when the suitable version is in doubt. Furthermore, the
level of difficulty of each version can be easily raised by re-recording the audio
tape less clearly.
SPOT is a performance-oriented test. It is not a “performance test” per se,
as it neither comprises a “work sample” nor involves the testee in an “act of
communication” (see McNamara 1996: 6, 26). Nonetheless, as will be discussed
below (Section 3), SPOT measures aspects of what may be termed “control”, i.e.,
“the productive and receptive control possessed by the language user over the
knowledge he or she has of various aspects of the linguistic system” (Sharwood
Smith 1994: 15). As it is this “control” that enables the possessor of linguistic
knowledge to perform in a real-life setting, it is appropriate to consider SPOT
performance oriented.
The accuracy of SPOT as a placement test was determined based upon the test’s
statistical reliability, its ability to discriminate between learners at different
levels, and its correlation with other, more conventional evaluations having
reasonable claims to accuracy. The data for this section are derived from trials
by the present authors at the University of Tsukuba4 and from independent
studies by Y. Hatasa and Y. Tohsaku at the University of Iowa (Hatasa and
Tohsaku 1997)5 and the University of California at San Diego (Hatasa and
Tohsaku 1997),6 respectively, and by R. Spence-Brown at Monash University
(Spence-Brown 1996).7
56 JUNKO FORD-NIWA & NORIKO KOBAYASHI
Table 2. Mean score and standard deviation on SPOT (Version 3) & UT Placement Test at
UI (Hatasa and Tohsaku 1997): 1st year (n = 42) and 2nd year (n = 19) students
SPOT Placement Test Grammar Listening Reading
(no. of questions) 60 82 30 28 24
1st Year Mean 14.19 (23.7%) 19.43 (23.7%) 07.95 09.38 2.97
SD 09.27 05.98 03.36 02.95 2.39
2nd Year Mean 39.47 (65.8%) 28.84 (35.2%) 14.42 11.47 2.95
SD 09.44 06.52 03.70 03.10 1.96
Table 3. Mean score and standard deviation on SPOT (Version 2) & UT Placement Test at
UI (Hatasa and Tohsaku 1997): 3rd year (n = 15) and 4th year (n = 5) students
SPOT Placement Test Grammar Listening Reading
(no. of questions) 65 112 60 28 24
3rd Year Mean 26.07 (40.1%) 45.93 (41.0%) 25.67 12.71 08.40
SD 13.31 16.45 08.19 05.48 04.67
4th Year Mean 44.00 (67.7%) 81.40 (72.7%) 42.20 21.80 17.40
SD 04.94 09.66 07.05 02.86 02.79
SPOT: A TEST MEASURING “CONTROL” 57
successfully placed the vast majority of the students who studied Japanese as a
foreign language at the University of Tsukuba. The aim of SPOT is to provide
an economical means of achieving equivalent results.
Table 5. Rank correlation between SPOT scores and evaluations of speaking ability at
UCSD (Hatasa and Tohsaku 1997)
Course Ranking Correlation9
1st Year (Version 3, n = 42) 0.825* – 1.000*
2nd Year (Version 3, n = 19) 0.887* – 0.953*
3rd Year (Version 2, n = 15) 0.770*
4th Year (Version 2, n = 5) 1.000*
* significant at 0.01 level
SPOT: A TEST MEASURING “CONTROL” 59
deviation was 10.0 (Spence-Brown 1996: 36). The scores were subsequently
compared, using Spearman’s Rank Correlation, with the results of the various
components of the final assessment, including a speaking test (see Table 6).
The correlations are considerably less satisfactory than those for the UT
Placement Test or the UCSD speaking evaluation presented above. Spence-
Brown suggests several possible explanations for the poor correlation, namely,
the effect of the Monash tests being achievement tests, the possibility that SPOT
was above the level of competency of the students, and that SPOT and the
Monash tests measure different things (see Spence-Brown 1996: 37). It may be
pointed out, nonetheless, that the standard deviation of the SPOT scores at MU
was similar to that at UI and, assuming that the Monash students progress at the
same rate as the students at UI, there is reason to believe that SPOT would have
succeeded in discriminating between students at the lower intermediate level and
those at the next level, had the test been administered to both groups.
Table 6. Rank correlation between SPOT scores and components of the formal assessment
for MU lower intermediate level students (Spence-Brown 1996: 37)
Component Correlation
Listening Test 0.55 (p < .001)
Final Written Test 0.51 (p < .001)
Final Grade 0.51 (p < .001)
Speaking Test 0.33 (p < .001)
Visitor Session No correlation
60 JUNKO FORD-NIWA & NORIKO KOBAYASHI
system.” The reality of this dichotomy may be illustrated by the extreme case of
the learner whose speech is fluent, but filled with non-native grammatical
constructions, i.e., control without adequate knowledge, or whose speech is
grammatically correct, but too slow and halting to allow him or her to communi-
cate effectively, i.e., knowledge without adequate control (Bialystok and
Sharwood Smith 1985: 109).
“Control”, i.e., the ability to process linguistic information, may further be
analyzed into two modes, viz., controlled processing, which necessitates the
attention of the language user, and automatic processing, which proceeds without
deliberate awareness (see Shiffrin and Schneider 1977; McLaughlin, et al. 1983;
McLaughlin 1987; Sharwood Smith 1994: 113–116). Automatic processing,
quantified as “automaticity”, is used by the native speaker to process basic
linguistic information, such as grammar and basic vocabulary, quickly and
efficiently. It is the development of automatic processing, in particular, which
enables the language learner to comprehend and use the language within the
bounds of his or her linguistic knowledge at the speed of a native speaker.
As explained in the introduction, SPOT test takers are required to fill in each
blank with a single hiragana (syllabic) character representing all or part of a
grammatical item. With the exception of the missing syllable, the sentences on
the answer sheet are identical to those on the tape. The listening component, i.e.,
the tape, is an essential element of SPOT. Its function is to force the testee to
work in “real-time” in order to answer correctly. In order to examine the effect
of this component on testees at various proficiency levels, we decided to
experiment with a version of SPOT without the tape, hereafter referred to as the
“no-tape version”. This experimental version was then not unlike a conventional
test of grammar. The experiment was conducted at the University of Tsukuba
(UT) in 1993 (cf. Ford-Niwa et al. 1995).
As explained in the introduction, in the normal (taped) version of SPOT only one
correct answer is possible for each blank. In the no-tape version, however, more
than one correct answer was sometimes possible. The possible (i.e.,
grammatically correct) answers to the no-tape version were therefore divided into
three categories: (i) answers that were given in the taped version; (ii) alternative
answers that fit the original context of the sentences; (iii) answers deemed
possible in alternate contextualizations, often involving a usage less likely to be
intentionally envisaged by a learner of Japanese. Three scores were then
calculated for each of the no-tape version exams: (I) the number of correct
answers according to the first category; (II) the number of correct answers
according to either of the first two categories; (III) the number of correct answers
according to any of the three categories.
a speed generally above their “control” ability. When given a sufficient amount
of time, seven times the time of the normal SPOT test (and twice the time
required by the high-scoring group), they were able to score points by using
whatever linguistic knowledge they might have had; however, they did not yet
have much control.
The tape had the opposite effect on the scores of the members of the high-
scoring group. In spite of the fact that the members of this group had, in effect,
64 JUNKO FORD-NIWA & NORIKO KOBAYASHI
only one-third the time to complete the normal taped version and that only one
correct answer was possible for each question, they still fared slightly (although
statistically insignificantly) better with the tape. This would indicate that the tape
did not pose a hindrance to the members of the high scoring group — that is, their
ability to process language in real-time enabled them to exploit the information
contained on the tape at a level closer to that of their grammatical knowledge.
4. Conclusion
Acknowledgments
We would like to take this opportunity to thank Yukiko Abe Hatasa, Yasu-Hiko Tohsaku, Robyn
Spence-Brown, Takako Sakai, Fuzhi Yuan and Hilofumi Yamamoto, without whose collaboration this
paper would not have been possible. We would also like to gratefully acknowledge our debt to the
(anonymous) reviewers for their many thoughtful insights and comments. And finally, we wish to
express our appreciation to Matthew M. Hanley and Michael P. Critchley for improving our English
grammar and style.
This research was supported by an International Scientific Joint Research Grant from Japan’s
Ministry of Education, Science and Culture (Research Group Leader, Noriko Kobayashi, Research
Project #07044003).
Notes
1. A readily available example (for the present authors) is the University of Tsukuba (UT)
Placement Test. (We refer to this test not only because of our own familiarity with it, but
because it, or versions thereof, is currently in use in many other Japanese educational institu-
tions.) The test consists of sections testing listening comprehension, reading comprehension,
grammar, and vocabulary and script, respectively, and lasts approximately two and a half hours.
In spite of the use of a theoretically simple computer formatted answer sheet (except for the
vocabulary-script section), in reality scoring 150 tests is often a long and arduous task.
Proficiency tests are also sometimes used as placement tests. An example is the four-level
Nihongo Nooryoku Shiken, the Japanese-Language Proficiency Test, administered annually by
the Japan Foundation and the Association of International Education and subsequently made
available to other institutions. The exam consists of three sections testing knowledge of
Japanese characters, syllabaries and vocabulary, listening comprehension, and reading and
grammar skills. The time required to take the test varies from one hour and forty minutes at
Level Four (Novice) to three hours at Level One (Superior). Correcting the exam is likewise a
time consuming undertaking. In addition, the testers must decide in advance which of the four
levels of the exam will be the most appropriate for the incoming students; a given level cannot
accurately assess the proficiency of learners who are not at that approximate level: it is either
too difficult for those with low proficiency or too easy for those with high proficiency.
2. To the best of our knowledge, there is at present no other quickly administered Japanese
language placement test. Nor did we find a report of an equivalent English language test in
Reviews of English Language Proficiency Tests (Alderson, et al. (eds.) 1987) or in the latest
volumes of the Mental Measurements Yearbook (Kramer and Conoley 1992, Conoley and
Impara 1995), although the sheer number of English language programs around the world
would certainly not exclude such a possibility. On the distinction between SPOT and cloze tests
or c-tests, see below.
3. The grammatical items in Version 3 (60 questions) are all beginner level. In Version 2 (65
questions) and Version 4 (30 questions) two-thirds of the grammatical items are beginner level
and one-third are intermediate or advanced level. Decisions concerning the level of difficulty
of the grammatical points were based on the syllabus of the course design committee of the
Society for Teaching Japanese as a Foreign Language (1991).
4. Data collected between 1991 and 1995 are as follows: Version 2 (fall 1991); Version 3 (spring
1994); Version 4 (fall 1993 [n = 133]; spring 1994 [n = 127]; fall 1994 [n = 142]; spring 1995
SPOT: A TEST MEASURING “CONTROL” 67
[n = 141]). The prior studies on SPOT by the present authors (see bibliography) are also based
on part or all of this data. Differences in the statistics in the present study are due to
calculation errors which are corrected here or to different methods of analysis.
5. Data collected during the third week of the spring semester 1996 (Hatasa and Tohsaku 1997).
6. Data collected near the end of the academic 1994–1995 year (Hatasa and Tohsaku 1997).
7. Data collected near the end of the academic year [1995] (Spence-Brown 1996: 36).
8. Table i. Reliability estimates (Kuder-Richardson Formula 20) for UT Placement Test at UT
(1991) and UI (1995)
Placement Test Reliability
UT (all levels [n = 137], 128 questions) 0.97
UI (1st & 2nd year students [n = 61], 82 questions) 0.78
UI (3rd & 4th year students [n = 20], 112 questions) 0.93
The low reliability estimate of the placement test for the first and second-year students at UI
can be accounted for by the difficulty of the test, which is reflected in the low scores shown
in Table 2.
9. The range of correlation coefficients for the first and second-year courses is due to the multiple
class sections at these levels.
10. Spence-Brown (1997), developing ideas proposed by Bachman (1990) and Bachman and Palmer
(1996), distinguishes between the “product aspect” of authenticity (i.e., the test tasks them-
selves) and the “process aspect” of authenticity (i.e., the psychological processes, such as
language processing under time constraints) involved in the tasks.
References
Alderson, J. Charles, Karl J. Krahnke and Charles W. Stansfield (eds.). 1987. Reviews of
English Language Proficiency Tests. Washington, DC: Teachers of English to
Speakers of Other Languages.
Alderson, J. Charles and Caroline Clapham. 1992. “Applied Linguistics and Language
Testing: A case study of the ELTS test.” Applied Linguistics 13: 149–167.
Bachman, Lyle F. 1990. Fundamental Considerations in Language Testing. Oxford:
Oxford University Press.
Bachman, Lyle F. and Adrian S. Palmer. 1996. Language Testing in Practice. Oxford:
Oxford University Press.
Bialystok, Ellen and Michael Sharwood Smith. 1985. “Interlanguage is not a State of
Mind: An evaluation of the construct for second-language acquisition.” Applied
Linguistics 6: 101–117.
Canale, Michael and Merrill Swain. 1980. “Theoretical Bases of Communicative Ap-
proaches to Second Language Teaching and Testing.” Applied Linguistics 1: 1–47.
Conoley, Jane C. and James C. Impara (eds.). 1995. The Twelfth Mental Measurements
Yearbook. Lincoln, NE: University of Nebraska Press.
68 JUNKO FORD-NIWA & NORIKO KOBAYASHI
Ellis, Rod. 1994. The Study of Second Language Acquisition. Oxford: Oxford University
Press.
Ford-Niwa, Junko. 1997. “Gengo unyooryoku no sokutee ni mukete: SPOT (Simple
Performance-Oriented Test) no kooseegainen datoosee ni tsuite.” Josai International
University Bulletin 5: 25–38.
Ford, Junko and Noriko Kobayashi. 1993. “Nihongo-gakushuusha ni yoru bunpoo
koomoku no shuutoku ni kansuru ichikoosatsu.” Tsukuba Daigaku Ryuugakusee
Sentaa Nihongo Kyooiku Ronsyuu 8: 185–200.
Ford-Niwa, Junko, Noriko Kobayashi and Hilofumi Yamamoto. 1995. “Nihongo nooryoku
kan’ishiken (SPOT) wa nani o sokuteeshiteiru ka: Onsee teepu yooin no kaiseki.”
Nihongo Kyooiku 86: 93–102.
———. 1996. “Bunpoo koomoku chooshu nooryoku to onsee kankyoo: SPOT (Simple
Performance-Oriented Test) no kuuran ichi ni kansuru kenkyuu.” Tsukuba Daigaku
Ryuugakusee Sentaa Nihongo Kyooiku Ronsyuu 11: 201–212.
Garrett, Nina. 1986. “The Problem with Grammar: What kind can the language learner
use?.” Modern Language Journal 70: 133–148.
Hatasa, Yukiko Abe and Yasu-Hiko Tohsaku. 1997. “SPOT as a Placement Test.” In
H. M. Cook, et al. (eds.), New Trends and Issues in Teaching Japanese language and
Culture. Honolulu: University of Hawaii, Second Language Teaching and Curricu-
lum Center, 77–98.
Hymes, D. H. 1972. “On Communicative Competence.” In J. B. Pride and J. Holmes
(eds.), Sociolinguistics: Selected Readings. Harmondsworth: Penguin, 269–293.
Klein-Braley, Christine. 1985. “A Cloze-up on the C-test: A study in the construct
validation of authentic test.” Language Testing 2: 76–104.
Klein-Braley, Christine and Ulrich Raatz. 1984. “A Survey of Research on the C-test.”
Language Testing 1: 134–146.
Kobayashi, Noriko and Junko Ford. 1992. “Bunpoo koomoku no onsee chooshu ni
kansuru kenkyuu.” Nihongo Kyooiku 78: 167–177.
Kobayashi, Noriko, Junko Ford-Niwa, and Hilofumi Yamamoto. 1995. “Nihongo
nooryoku kan’i shiken (SPOT) no tokuten bunpu keekoo: Chuukyuuu-muke to
shokyuu-muke tesuto.” Tsukuba Daigaku Ryuugakusee Sentaa Nihongo Kyooiku
Ronsyuu 10: 107–120.
———. 1996. “Nihongo nooryoku no atarashii sokuteehoo: SPOT [SPOT: A new testing
method of Japanese language proficiency].” Japanese-Language Education around
the Globe 6: 201–218.
Kramer, Jack J. and Jane C. Conoley (eds.). 1992. The Eleventh Mental Measurements
Yearbook. Lincoln, NE: University of Nebraska Press.
McNamara, Tim. 1996. Measuring Second Language Performance. London and New
York: Longman.
McLaughlin, Barry. 1987. Theories of Second-Language Learning. London: Arnold.
McLaughlin, Barry, Tammi Rossman and Beverly McLeod. 1983. “Second Language
Learning: An information-processing perspective.” Language Learning 33: 135–158.
SPOT: A TEST MEASURING “CONTROL” 69
Retesting a Universal
The Empty Category Principle and
Learners of (Pseudo)Japanese
1. Kanno (1996)
In an admirably clear recent paper, Kanno (1996) addresses the issue of whether
the Japanese of English-speaking adult learners shows evidence of adherence to
the Empty Category Principle (ECP), a non-parametrized component of Universal
Grammar (UG). In so doing, she joins a long line of researchers who have
investigated whether the developing grammars of adult second language learners
are constrained by UG, or whether L1 and L2 acquisition are fundamentally
different (Bley-Vroman 1989). Kanno points out that the tendency has been for
research to focus on parameters rather than invariant principles, since if a
principle appears to be at work in the L2, its presence may be argued away as
merely evidence for the principle as instantiated by the L1 (or transfer). If,
however, learners select settings of parameters in the L2 which are different
from those associated with their native languages, particularly where the settings
are unmarked, then the evidence for the availability of UG to adult learners is
strengthened. However, although the ECP is not parametrized, its application has
radically different outcomes in English and Japanese, so different in effect that
without the necessary theoretical knowledge one would not be able to perceive
any link between them.
The ECP states the conditions for proper government of empty categories. In
English, it is the ECP that determines, amongst other things, what are known as
that-trace effects, accounting for the grammaticality contrasts manifested in (1):
72 E. KELLERMAN, J. VAN IJZENDOORN & H. TAKASHIMA
KP I′
K′ VP I
NP K V′ Pst
Hideyuki ga KP V
K′ yomimashita
NP K
hon Ø
K = Case
KP = Case phrase
(adapted from Kanno 1996)
Since Japanese does not have that-trace phenomena and English nouns are not
overtly marked for case (pronouns notwithstanding), the interesting question
Kanno poses is whether the ECP continues to be available when L1 and L2
display such radically dissimilar behaviors. In an experiment involving native
speakers of Japanese and English-speaking learners of Japanese at the University
RETESTING A UNIVERSAL 73
of Hawai‘i who had just completed their first semester of language study, Kanno
found convincing support for the idea that learners do indeed appear to have
knowledge of the ECP, as evidenced by their awareness of the rules for case drop.
Kanno prepared four types of four simple interrogative sentences containing
one or two arguments and a transitive verb. The task was to judge the acceptabil-
ity of the stimuli on a three-point scale: 3 = ‘natural’; 2 = ‘in-between’; 1 = ‘un-
natural’. Two of the types of sentences (I and II) contained sentences with two
explicit arguments, one type (I) containing sentences with the accusative case
marker missing, the other (II) with the nominative case marker missing. The other
two types (III and IV) contained sentences with one explicit argument.2 Here too,
one set had no accusative case marker (III), and the other no nominative case
marker (IV). By the ECP, sentences with missing nominative case markers (II
and IV) are ungrammatical, while those with missing accusative case markers (I
and III) are not. Kanno illustrates the types with the following examples.3
(3) Type I: Accusative case missing in a sentence with two overt argu-
ments
Suzuki-san-wa dono biiru-Ø nomi-mashita-ka?
Suzuki-Mr(s)- which beer-Ø drink-.-
‘Which beer did Mr(s) Suzuki drink?’
(4) Type II: Nominative case missing/ two overt arguments
*Dono gakusee-Ø biiru-o nomi-mashita-ka?
Which student-Ø beer- drink-.-
‘Which student drank beer?’
(5) Type III: Accusative case missing/one overt argument
Dono biiru-Ø nomi-mashita-ka?
Which beer-Ø drink-.-
‘Which beer did (he/she) drink?’
(6) Type IV: Nominative case missing/one overt argument
*Dono gakusee-Ø nomi-mashita-ka?
Which student-Ø drink-.-
‘Which student drank beer?’
Though there was variation in individual scores (see Kanno’s Figs. 1 and 2, pp.
326–7), and no analysis of performance on individual stimuli was offered, there
seemed to have been a surprising unity of judgment among native speakers and
learners assessing these stimuli. Sentences with missing nominative case-markers
were largely judged as lying on the ‘unnatural’ side of ‘in-between’ by learners,
while native speakers of Japanese stigmatized them more strongly still. In
74 E. KELLERMAN, J. VAN IJZENDOORN & H. TAKASHIMA
contrast, sentences with missing accusative case markers were found to be highly
acceptable by both groups. On the face of it, at least, it appears that learners of
Japanese are able to judge the correctness of Japanese sentences as would be
predicted if a principle of UG, the ECP, was still available to them as adults.
The problem then is to rule out other possible explanations. To do this,
Kanno examines the input the learners in her sample would have received as part
of their coursework. Examination of the coursebook (Japanese: The Spoken
Language, Part 1, Jorden & Noda 1987) shows that it too by and large ‘obeys’
the ECP, with much more frequent omission of accusative case markers on
objects (some 74 out of 107 instances), as against a mere eight cases where
nominative particle ga is dropped.4 Yet there does not seem to be any explicit
statement suggesting that o can be dropped while ga must be retained.5 More-
over, Kanno doubts whether early beginners6 could draw the appropriate
conclusions about case marker deletion from these varying occurrences and non-
occurrences, because the potential for misgeneralisation is also present. Particu-
larly relevant are double-ga structures (Jorden & Noda 1987: 115–116), which
occur with stative transitive verbs like dekimasu ‘can do’, arimasu ‘have’,
irimasu ‘need’, and wakarimasu ‘understand’. Here the object is marked with
nominative particle ga, and this particle (but not the subject-marking ga) may be
omitted, as the following examples (based on Jorden & Noda, and Kanno) show:
(7) Ano gakusee-ga eego-ga wakari-masu
that student- English- understand-.
‘That student understands English.’
(8) Ano gakusee-ga eego-Ø wakari-masu
that student- English understand-.
‘That student understands English.’
(9) Eego-Ø wakari-masu-ka?
English understand-.-
‘Do you understand English?’
Kanno notes that this type of ga-drop was again the rule rather than the excep-
tion in the coursebook, and that particularly in one-argument sentences of the
type illustrated by (9), the opportunity for wrongly assuming that a subject-ga
has been omitted is very strong, ‘thereby undermining the necessary general-
ization’ (Kanno, 1996: 328). The coursebook itself does not seem to state the
rule; in fact its statements on potential omission do not distinguish between ga
and o omission, as Kanno notes (1996: 329). There is nothing here, then, to
encourage the correct generalization either.
RETESTING A UNIVERSAL 75
Kanno’s results are exciting, and the care she takes to rule out extraneous factors
which could have determined or affected her results is an object lesson in itself.
Indeed a perusal of Jorden & Noda, the coursebook used by the informants at the
time of the experiment, does suggest that generalization of the appropriate rule
would be very unlikely. However, there are a number of observations to be made
in this respect. Firstly, throughout the later of the lessons covered by the
informants up to the time of testing, there is a regular sprinkling of examples of
o placed in brackets as an optional element, as in the following (Jorden & Noda,
1987: 125, 127):
(11) Ano gakusee-(o) mi-te kudasai
That student-() look- please
‘Please look at that student.’
(12) Tanaka-san-(o) mi-mashita
Tanaka-Mr(s)–() see-.
‘I saw Mr(s). Tanaka.’
76 E. KELLERMAN, J. VAN IJZENDOORN & H. TAKASHIMA
Despite the uncertainties expressed above about what exactly Kanno’s informants
had or had not learned and how much Japanese they had been exposed to, the
fact remains that they and the native Japanese controls seem to be more or less
of one mind as to the acceptability of ga and o deletion. While the Japanese
native speakers would indeed have acquired the rule through the ECP, the
problem is to rule out the possibility that the American informants did so by
other means. It was for this reason that it was decided to carry out two replica-
tions of Kanno’s experiment in The Netherlands, with rigorous control of input,
in the hope of confirming her results, and thus strengthening the arguments in
favor of a role for UG in adult second language acquisition. Both studies include
a teaching component in which sufficient knowledge of the target language is
taught for the subsequent testing phase to be meaningful.
4.1 Study 1
4.1.1 Informants
Informants were three groups of high school students aged 13–14 (Group 1,
n = 25), 14–15 (Group 2, n = 26) and 15–17 (Group 3, n = 24). None had ever
knowingly been exposed to Japanese before, and none had been to Japan. All
these informants are generally considered to be adults in second language
acquisition terms.
Table 1. Types of stimuli tested in Studies 1 and 2 according to whether they were taught
and/or obey the ECP
Type Taught ECP
S+O+ – –
S+O− –
S−O+
S−O−
S+ – –
S−
O+ – –
O− –
RETESTING A UNIVERSAL 81
4.1.4 Results
Since five informants had marked more than the required six sentences as wrong,
their data were excluded from further analysis. A consideration of the responses
of the three groups revealed no obvious differences between them. Additionally,
there is only very limited evidence for a systematic distinction on the basis of
degree of transitivity,15 so it was decided to combine informants into one group
(N = 70). A repeated analysis of variance (MANOVA) revealed no general
preference for o omission over ga (for one-argument stimuli: F(1, 69) = .24, ns.;
for two-argument stimuli: F(1, 69) = .02, ns.). However, sentences with two
arguments and one missing case marker were significantly more acceptable than
were sentences with one explicit argument and no case marker (F(1, 69) = 7.21,
p < .01). The results are presented in the form of a graph in Figure 1.
57.1
34.3
From Figure 1 it can be seen that those structures that were taught are indeed
almost unanimously accepted. Also noteworthy is the relative unacceptability of
those sentences that were not taught. Of those crucial sentences that were not
taught and whose acceptance would show evidence for the continued availability
of the ECP, S+ O− items are accepted by just over half the informants and O−
by about one-third. Unfortunately, these scores are virtually perfectly matched by
the ‘mirror’ items; S− O+ is accepted by nearly 61% of informants and S− by
35%. S− O− is rejected by nearly 84%.
These figures tell us that missing case markers lead to rejection, but that
82 E. KELLERMAN, J. VAN IJZENDOORN & H. TAKASHIMA
Table 2. Patterns of acceptances and rejections according to the ECP and the ‘One noun’
strategy (+ = accept; = reject)
S+ O− O− S− O+ S− S− O−
ECP + + − − −
‘One-noun’ + − + − −
RETESTING A UNIVERSAL 83
4.2 Study 2
In Study 2, we taught Dutch-based adults the same rules as in the first study, but
this time using Japanese lexical items instead of those of the artificial clone.
4.2.1 Informants
Forty-two first-year students majoring in English at the University of Nijmegen,
The Netherlands, took part in ordinary class time. Thirty-eight were native
speakers of Dutch, two were native speakers of German, one was a native
speaker of Hebrew, and one was a native speaker of Romanian. None knew any
Japanese (beyond hai ‘yes’, sushi, and a few martial arts terms).
presented in the past tense form. Translations in English were provided only for
one-argument stimuli with absent case-marking, to disambiguate the role of the
noun. The full set of stimuli is presented in the Appendix.
In Study 2, informants were told just before beginning the testing phase that
one rule had not been taught, and that that rule concerned case-particles, which
could in certain circumstances be dropped. No further indication as to the nature
of that rule was given. The informants’ job was to guess in which sentences this
was permissible. This format meets a possible objection (from an anonymous
reviewer) to the format of Study 1, where informants were forced to judge only
6 out of 16 stimuli as unacceptable. In this version of the test, they were free to
judge as they wished, and also to rate stimuli according to three levels of
certainty (3 = correct, 2 = uncertain, 1 = incorrect), as in Kanno’s study, rather
than two (correct, incorrect) as in Study 1.
4.2.4 Results
The mean score for two-argument stimuli where o was missing was 1.96, and for
stimuli where ga was missing, 1.85. For one-argument stimuli, the means were
1.55 and 1.56 respectively. No significant differences in terms of differential
treatment of o or ga deletion emerge (for two-argument stimuli, F(1, 41) = .32, ns.;
for one-argument stimuli, F(1, 41) = .93, ns.). In Study 1, case particle deletion is
more acceptable to informants in two-argument stimuli than in one-argument
stimuli, if not quite reaching significance (F(1, 41) = 3.58, p < .07. The mean score
for S− O− stimuli was 1.41, with 29 informants (69%) rejecting both stimuli.
If we examine the patterns of individual behavior, we discover one infor-
mant who scores 2 (‘uncertain’) on all stimuli with o deletion, and 1 (‘unnatural’)
on stimuli with ga deletion. This is the only informant to evince a UG-like series
of responses. Of the remaining 41, 13 (30.95%) show the predominant ‘one-noun
strategy’ pattern of Study 1, namely acceptance of the absence of one case
particle in two-argument stimuli and rejection of any missing case marker in one-
argument stimuli. Seven informants (16.67%) categorically rejected all (or all but
one) of the stimuli with any absent particles, and two judged absent particles in
one-argument stimuli more acceptable than in two-argument stimuli. It is
difficult to detect clear patterns in the remaining 18 informants’ responses.
Again, no effects for type of verb could be found.
RETESTING A UNIVERSAL 85
5. Conclusion
It was hoped that by rigorous control of the input in the learning of Japanese and
a miniature relexified clone of that language, it would be possible to confirm
Kanno’s findings, so laying to rest the question of whether the ECP was truly
available to second language learners. The results of the present studies are
therefore disappointing, since we are left with the awkward problem of consider-
ing why the results in Kanno’s and the present studies are so dissimilar. One
possible suggestion is that the method of elicitation adopted here encourages a
test-taking strategy which has more to do with solving a puzzle than tapping
intuitions of grammaticality; in Study 1, informants were only allowed to reject
a mere six of the ten sentences they presumably would have rejected for lack of
a case particle (or two). Study 2 was designed to address precisely that issue, but
the results do not seem particularly different. As far as it can be said that any
pattern of behavior emerges from these two studies, it is the ‘one-noun strategy’
that predominates (38.57% of informants in Study 1; 30.95% in Study 2).
Another possible explanation for our failure to replicate Kanno’s findings is
that her informants had indeed acquired the case-drop rule through instruction or
input outside the classroom, so that the ECP was not truly guiding their intu-
itions. A further possibility (suggested by Robert Bley-Vroman) is that our
informants had never been confronted with case-drop phenomena before, unlike
Kanno’s learners. This means that they were encountering sentences which
teaching would have led them to believe were ungrammatical. And indeed, some
informants rejected all case-dropping, despite specific instructions designed to
make them countenance the possibility. Their counterparts learning Japanese in
Hawai‘i would certainly have been confronted with instances of case-drop,
however unpredictable this might have seemed to them. Finally, it may be that
the grammar of our newly-taught languages needed time to ‘sink in’ before ECP
effects could come to the fore, our informants being tested immediately after
having been taught. A test after a delay of, say, a few days might have delivered
a different result. Finally, as Kanno herself notes there is the question of the
status of the ECP in current grammatical theory, though that does not free us of
the obligation to investigate an apparent case of acquisition without input.
Obviously continuing research is needed, perhaps with a group of learners of
Japanese in a community where Japanese is a truly exotic language not available
outside the classroom, and where once again input can be tightly controlled.
Given the exciting nature of Kanno’s original results, such a study should
certainly be undertaken as a matter of urgency.
86 E. KELLERMAN, J. VAN IJZENDOORN & H. TAKASHIMA
Acknowledgments
Our thanks go to Kazue Kanno for insightful discussion in all phases of this research, to Keiko
Yoshioka for advice about the Japanese language and teaching Japanese in Hawai‘i, to Hubert
Korzilius, Erik Schils and Monique van der Haagen for statistical expertise, and to Theo Bongaerts
and an anonymous reviewer for detailed and wise commentary on an earlier version of this chapter.
Notes
1. Following Fukuda, Kanno points out that there are cases where Japanese does permit ga-drop,
as in sentences with emphatic final particles such as yo, and in double-ga constructions, a point
to which we will return later.
2. In Japanese, arguments may be omitted if they are fully recoverable from context.
3. All these sentences contain wh-words, which cannot be topicalized. This prevents informants from
interpreting missing case-markers as deleted (and deletable) wa. Whether informants actually
know this rule was not independently established, though see Kanno (1996: 329) for comment.
4. These involve complement NPs.
5. In actual fact, as Kanno notes (1996: 329), Jorden & Noda suggest that both ga and o can be
dropped, with merely an attendant ‘loss of emphasis and focus’ (Jorden & Noda 1989: 91–92).
It should be pointed out that Jorden & Noda is a book designed to teach spoken Japanese.
Indeed, two recent books on Japanese we have consulted (Shibatani, 1990: 367–368 and
Tsujimura 1996: 136) both note that both particles can be dropped in casual speech.
6. Her evaluation of her informants’ proficiency level (1996: 322).
7. Functional strategies (e.g. accepting deletion from an NP immediately preceding a verb, or
accepting deletion from inanimate NPs) are explicitly and convincingly ruled out.
8. Evidence that learners do indeed assume the missing case particle is o in such circumstances is
clearly provided by Dutch learners of Japanese at a comparable level of proficiency (Keiko
Yoshioka, personal communication).
9. By way of comparison, mainland America accounted for 2.3 million visitors, roughly 1% of its
population. Source: Atlas of Hawai‘i, 1983.
10. Atlas of Hawai‘i, 1983.
11. Keiko Yoshioka (personal communication) estimates that roughly 70–80% of students in these
classes are of Japanese origin.
12. It is interesting that Kanno reports that 10 of her informants (38.5%) chose the kana version of
her test questions, even though Jorden uses a romanized script throughout the book.
13. For a discussion of the ECP and Dutch, see Kerstens & Ruys (1994). As in English, the ECP
in Dutch requires the proper government of traces resulting from movement.
14. Called Jumanji.
15. While the data suggest some basis for assuming that three informants were distinguishing at
least some of the time between items containing Jumanji equivalents of ‘hit’ and ‘see’, their
retrospective comments did not support this interpretation. In fact two of them seemed to be
following phonological strategies.
RETESTING A UNIVERSAL 87
Appendix
References
Makiko Hirakawa
Tokyo International University
1. Introduction
role consistently maps to the same syntactic position at D-S. For example, the
UTAH establishes that a Theme will consistently be projected as the verb’s
internal argument position at D-S. Consider the sentences in (3).
(3) a. John broke the watch.
b. The watch broke.
The NP the watch is Theme in both sentences but occurs as object in (3a) and as
subject in (3b). But with the D-S (1a), the watch in (3b) originates as D-S object,
so that both (3a) and (3b) comply with the UTAH.
It has been claimed that the Unaccusative Hypothesis also holds in Japanese
(Kageyama 1993, Miyagawa 1989, Tsujimura 1990, among others). Kageyama
(1993) further claims that the D-S object of an unaccusative stays in situ, and
that nominative Case is assigned in its base position, giving for the representation
in (4).1
(4) Deep & Surface Structures: Unaccusatives
e [VP Tokei-ga koware-ta]
watch- break-
‘The watch broke.’
This unaccusative sentence contrasts with an unergative sentence, whose single
argument is in the canonical subject position at D-S and S-S.
(5) Deep & Surface Structures: Unergatives
John-ga [VP oyoi-da]
John- swim-
‘John swam.’
Unaccusative sentences like (4) and unergative sentences like (5) look alike on
the surface, since the single argument in both types of sentences is marked by
ga, which is the nominative Case marker in Japanese. In addition, Japanese is a
verb-final (i.e. SOV) language, so that the subject and the object are adjacent to
each other, irrespective of NP movement. Thus, it is not clear from the surface
whether the sole argument of unaccusatives has actually moved out of the VP,
or whether it stays within the VP. Kageyama (1993) discusses several pieces of
evidence for the non-movement analysis.
Assuming Kageyama’s non-movement analysis for Japanese unaccusatives, the
present paper investigates whether English and Chinese learners of Japanese observe
the unaccusative/unergative distinction at D-S and S-S. Previous studies report
that L2 learners have problems acquiring unaccusatives in the target language
(Balcom 1997, Hirakawa 1995, Oshita 1997, Sorace 1995, Yip 1995, Yuan 1996,
L2 ACQUISITION OF JAPANESE UNACCUSATIVITY 91
and Zobl 1989). These studies have dealt with English, Italian and Chinese as an
L2, but as far as I am aware, no study has looked at Japanese as an L2.
2. Theoretical background
In this section, I will briefly review the arguments of Kageyama (1993) which
support the Unaccusative Hypothesis for Japanese, focusing on two structures.
Following Levin and Rappaport Hovav (1995), Kageyama distinguishes between
deep and surface unaccusativity. Deep unaccusativity refers to a situation where
the sole argument of an unaccusative is in object position at D-S, whereas
surface unaccusativity refers to a situation where the sole argument remains in
object position at S-S. I will first look at a structure which is relevant to deep
unaccusativity, then turn to the other structure which serves as a diagnostic for
surface unaccusativity in Japanese.
Sentences with the adverb takusan ‘a lot’ constitute one piece of evidence for
deep unaccusativity (Kageyama 1993, 1996). The adverb takusan modifies
almost any NP, irrespective of its animacy or countability. First observe the
sentence in (6), which consists of takusan and a transitive verb.
(6) Takusan yon-da
a lot read-
‘He/she/they etc. read a lot (of things).’
Japanese allows subject as well as object drop, so sentences like (6), where both
the subject and the object are dropped, are grammatical. This raises the question
of what takusan modifies in the sentence; i.e., a subject or an object. The answer
is that it can only modify an object, not a subject, so its reading is unambiguous.
Thus, (6) means that somebody read a lot of things, not that many people read
one thing. In other words, takusan modifies an internal argument, which is inside
the VP, and not an external argument, which is outside the VP.2
92 MAKIKO HIRAKAWA
(7) IP
NP (subject) I
VP I
NP (object) V
When passive sentences with takusan are considered, we find that subject NPs are
successfully modified by the adverb. This suggests that the takusan construction
is a test for a D-S object, since the subject NP of passives originates in the object
position and moves to the canonical subject position (Kuno 1973, Saito 1985).
(8) Takusan yom-are-ta.
a lot read--
‘A lot of things were read.’
Next, consider the two types of intransitive sentences — the unaccusative (9a)
and the unergative (9b).
(9) a. Takusan tui-ta.
a lot arrive-
‘A lot of people arrived.’
b. Takusan nai-ta.
a lot cry-
‘We/they/he/she cried a lot.’
There is a contrast in the readings of the two sentences. The unaccusative (9a)
means that ‘a lot of people arrived’, with takusan modifying what appears to be
the subject in the sentence. In the unergative sentence (9b), on the other hand,
takusan does not modify the subject; thus, the sentence cannot mean that ‘a lot
of people cried’. This suggests that the surface subject of the unaccusative (9a)
originates in the object position, contrasting with that of the unergative (9b)
which is in the logical subject position at D-S. Since unergatives do not have an
internal argument, takusan describes the amount of action denoted by the verb.
Thus, (9b) means that someone did the action of crying a lot; i.e., ‘somebody
cried a lot’.
In sum, the takusan construction groups transitive objects, passive subjects
and unaccusative subjects together on the one hand, and transitive subjects and
L2 ACQUISITION OF JAPANESE UNACCUSATIVITY 93
A piece of evidence for surface unaccusativity comes from the case drop
phenomenon in Japanese. In colloquial Japanese, the accusative Case marker o
is often dropped from the object of a transitive sentence, but the nominative Case
marker ga cannot be omitted from the subject of a transitive sentence (Kageyama
1993, Kuno 1973, Saito 1985, Takezawa 1987, among others). Sentences in (10)
and (11) are from Kageyama (1993).4
(10) a. [Kodomotati-ga hon-(o) yom-u] no mi-ta koto nai.
[children- book-() read- see- fact not
‘I have never seen children read books.’
b. [Kodomotati-*(ga) hon-o yom-u] no mi-ta
[children-() book- read- see-
koto nai.
fact not
‘I have never seen children read books.’
This subject/object asymmetry can be explained if we assume that the verb can
lexically govern the empty Case on a direct object, while no proper governor is
available for the empty Case on a subject (cf. (7)). Kageyama claims case drop
should be considered as a S-S or Phonetic Form (PF) phenomenon.
The subject of an unaccusative sentence behaves like an object of a
transitive sentence; that is, ga can be omitted (11a). This is not the case for an
unergative subject (11b) (Kageyama 1993, Nishigauchi 1993).
(11) a. [Kootuu-ziko-(ga) okor-u] no mi-ta koto nai.
[traffic-accident- happen- see- fact not
‘I have never seen traffic accidents happen.’
b. [Kodomotati-*(ga) asob-u] no mi-ta koto nai.
[children- play- see- fact not
‘I have never seen children play.’
The contrast in (11) in terms of nominative case drop suggests that the unaccus-
ative subject never moves out of the VP; instead it remains in the VP where it
is properly governed by the verb.
The following passive sentence further supports this claim. That is, the
nominative Case marker on a passive subject cannot be omitted. This is predict-
ed, since subjects of passives are moved out of the VP.
94 MAKIKO HIRAKAWA
As briefly discussed above, English exhibits deep unaccusativity but does not in
general manifest surface unaccusativity. The single argument of an unaccusative
moves from D-S object position to S-S subject position, for Case reasons.
In English, surface unaccusativity is manifested only in the there-insertion
construction (13a) and the locative inversion construction (13b) (Levin and
Rappaport Hovav 1995: 19).
(13) a. There appeared a ship on the horizon.
b. Into the room came a man.
Levin and Rappaport Hovav (1995) note that these two constructions are not
found with all unaccusative verbs; rather, they are restricted to verbs of existence
(e.g. exist and remain), verbs of appearance (e.g. appear and arise), and verbs of
inherently directed motion (e.g. come and arrive).
Since English manifests surface unaccusativity only to a limited extent and
with only a subset of unaccusative verbs, it can be claimed that English is quite
different from Japanese, where surface unaccusativity is more widespread. More-
over, it is important to note that English does not have any equivalent structure
to the adverb takusan construction or the case drop phenomenon in Japanese.
Miyata (1992) analyses naturalistic data from young children acquiring Japanese
as an L1 and reports that 2- and 3-year olds as well as 4- and 5-year olds omit
the accusative Case marker o (123 omissions in 161 cases, or 76.4% for 2- and
3- year olds and 102 omissions in 144 cases, or 70.8% for the 4- and 5-year
olds). In contrast, the same children rarely omit nominative Case ga (47 out of
326 cases, or 14.4% for 2- and 3-year olds; 18 omissions in 180 cases, or 10.0%
for 4- and 5-year olds). Miyata further notes that when nominative ga is dropped,
it is mostly with the subject of an unaccusative or a stative predicate. These
findings suggest that children as young as 2 years know when Case markers can
be omitted — namely on transitive objects, unaccusative subjects, and subjects of
stative predicates.8,9
96 MAKIKO HIRAKAWA
4. The experiment
4.1 Hypotheses
4.2 Subjects
Thirteen adult native speakers of English and 16 adult native speakers of Chinese
participated in the experiment. The English speakers were all North Americans
and were enrolled in a 9-month intensive Japanese course offered at the Inter-
University Center for Japanese Studies in Yokohama, Japan. The Chinese
speakers were students at Tokyo International University in Kawagoe, Japan. Ten
subjects were from Taiwan and 6 subjects were from China.
Subjects were given a background questionnaire to determine their prior
experience with Japanese. All of the subjects had learned Japanese in a class-
room setting, and both groups of subjects had started learning Japanese as
adolescents. The English-speaking subjects had generally studied longer (average
5.6 years) than the Chinese-speaking subjects (average 3.2 years), but the
Chinese subjects had generally lived in Japan longer (average 3.1 years) than the
English subjects (average 1.8 years).
All subjects took a Japanese proficiency test consisting of vocabulary and
grammar sections, taken from Japanese standardized proficiency tests. Their
scores ranged from 58% to 90% (equivalent to high intermediate/low advanced).
There were no significant differences in the scores of the two groups.
98 MAKIKO HIRAKAWA
4.3 Tasks
There were two tasks: a picture task and an acceptability judgment task. The
picture task was a truth-value judgment, which included sentences containing
takusan ‘a lot’ designed to examine subjects’ knowledge of deep unaccusativity.
All subjects took the picture task first.
The acceptability judgment task consisted of sentences with case drop,
which were used to examine subjects’ knowledge of surface unaccusativity.10
Since case drop is frequently observed in colloquial, spoken Japanese, but not in
formal, written Japanese, it is more appropriate to think of the task as requiring
‘acceptability judgments’ rather than ‘grammaticality judgments’. Subjects were
encouraged to consider the test sentences as sentences of spoken Japanese and to
give their initial responses to them.
In each task, three types of verbs were used: transitive, unergative, and
unaccusative. Each type was represented by five verbs, as listed in (17).
(17) a. Transitive: kaku ‘write’, arau ‘wash’, yomu ‘read’, taberu ‘eat’,
nuru ‘paint’
b. Unaccusative: otiru ‘fall’, sinu ‘die’, tuku ‘arrive’, wareru
‘break’, yakeru ‘burn’
c. Unergative: naku ‘cry’, utau ‘sing’, asobu ‘play’, oyogu
‘swim’, hasiru ‘run’
4.4 Results
hand and unaccusative subjects like transitive objects on the other. Thus, these
results will be discussed in two parts: in terms of subjects’ performance on
transitive sentences, and then on unergatives and unaccusatives.
Figure 1 shows mean acceptance scores on the transitive sentences. One point
was assigned to each test item if subjects accepted the sentence, and zero if they
rejected the sentence. We expected subjects to accept Tr (T) and reject Tr (F);
thus, the maximal score expected is 5 on Tr (T); in contrast, it is 0 on Tr (F).
Subjects in all three groups correctly accepted Tr (T) and rejected Tr (F). A
two-way repeated measures ANOVA on the acceptance scores shows a signifi-
cant effect for sentence type (F(1, 43) = 231.632, p < 0.0001). However, there is
no significant effect for group (F(2, 43) = 0.891, p < 0.4178), nor is there an
interaction (F(2, 43) = 1.942, p < 0.1558). Sheffé tests show that the difference in
performance on Tr (T) versus Tr (F) was significant for all three groups
(p < 0.05). In other words, all three groups accepted the true picture/sentence
pairings (Tr (T)) significantly more than the false pairings (Tr (F)), suggesting
that they observed the transitive subject/object distinction. The three groups did
not differ on either Tr (T) or Tr (F).
Further analyses were conducted on individual scores in terms of consisten-
cy in their responses. This was to determine whether there were any subjects
0
Tr(T) Tr(F)
Sentence Type
who did not make the transitive subject/object distinction so that they should be
removed from the subsequent analysis of unaccusatives and unergatives. Consis-
tency was defined as accepting at least four of five Tr (T) sentences and
rejecting at least four of five Tr (F) sentences. Six English-speakers and five
Chinese-speakers did not give consistent responses and were removed, leaving
seven English subjects and eight Chinese subjects. There were also three
Japanese controls who failed to give consistent responses and were thus removed,
leaving 17 controls.12
Figure 2 summarizes the mean acceptance scores of these retained subjects
on Unacc (T) (true sentence/picture pairings), Unerg (T) (true sentence/picture
pairings) and Unerg (F) (false sentence/picture pairings).
A two-way repeated measures ANOVA shows that there are significant
effects for sentence type (F(2, 58) = 86.499, p = 0.0001) and an interaction
between group and sentence type (F(4, 58) = 5.832, p = 0.0027), but no effect for
group (F(2, 29) = 0.469 p = 0.63). Sheffé tests (p < 0.05) show the difference
between Unacc (T) and Unerg (F) sentences and the difference between Unerg
(T) and Unerg (F) sentences were significant for all three groups. However, no
significant difference was found in the responses to Unerg (T) and Unacc (T)
sentences for any of the three groups. These results suggest that all three groups
accepted Unacc (T) and Unerg (T) sentences significantly more than Unerg (F)
5
English speakers (n=7)
Chinese speakers (n=8)
4
Mean Acceptance Scores
Sentence Type
Figure 2. Picture task: Mean acceptance scores on Unacc & Unerg sentences
104 MAKIKO HIRAKAWA
sentences. Thus the subjects did distinguish the unaccusative and unergative
sentence types.
For Unacc (T) and Unerg (T) sentences, the two experimental groups and
the control group did not differ from each other. In the case of Unerg (F)
patterns, the Chinese group but not the English group differed from the Japanese
controls, with the two L2 groups not significantly different from each other.
Although the incorrect acceptance of Unerg (F) sentences suggests that the L2
learners thought that takusan ‘a lot’ could modify the subject of an unergative
verb, the important thing to note here is that they allowed the subject of an
unaccusative verb to be modified by takusan to a significantly greater extent than
the subject of an unergative. This suggests that they did distinguish between the
two sentence types.
Overall results show that learners as well as controls did not behave as expected
in a number of cases. Responses from the English and Chinese groups turned out
positive in all four sentence types, suggesting that the L2 learners accepted
sentences with case drop regardless of the subject/object or unaccus-
ative/unergative distinction. In contrast, the native speakers made the transitive
subject/object distinction, rejecting the transitive subjects without case markers
(-1.12) while accepting transitive objects without case markers (0.76). However,
their responses to unaccusative sentences were different from what we had
expected, with a negative score (-0.35) where positive scores were expected.
We will discuss the results in more detail in two parts, focusing first on the
transitive sentences and then on the unaccusative and unergative sentences.
Figure 3 presents mean scores on the two types of transitive sentences. A two-
factor repeated measures ANOVA shows that there are significant effects for
group (F(2, 46) = 7.255, p = 0.0018), sentence type (F(1, 46) = 55.925, p = 0.0001)
and an interaction (F(2, 46) = 19.174, p = 0.0001). Sheffé tests show that the
English and Chinese groups differ significantly (p < 0.05) from the Japanese
controls on subject case drop (Tr S (U)). The L2 groups do not differ from each
other on this sentence type. As for object case drop (Tr O (A)), the L2 groups
and the controls do not differ significantly from each other.
Native speakers showed the subject/object contrast, rejecting case drop on
-0.5
-1
-1.5
Tr S (U) Tr O (A)
Sentence Type
1.5
English speakers (n=1)
1 Chinese speakers (n=1)
Japanese controls (n=9)
0.5
Mean Scores
M =0
(Jap )
0
M =0
(E ng )
-0.5
-1
-1.5
Unacc (A) Unerg (U)
Sentence Type
Figure 4. Acceptability judgement task: Mean scores on Unacc & Unerg sentences
L2 ACQUISITION OF JAPANESE UNACCUSATIVITY 107
As can be seen from Figure 4, neither these learners nor these controls showed
the unaccusative/unergative contrast in terms of sentences with case drop at S-S.
The mean scores of the three groups on the two types of verbs turned out either
zero or very close to zero, contrary to what we had expected.
4.5 Discussion
one of the two types but failed to do so in the other and were thus removed from
further consideration. This still may suggest that the two sentence types are not
the same in their grammar (cf. Grimshaw and Rosen 1990).
5. Conclusion
Acknowledgments
I would like to thank Lydia White and Kevin Gregg for their detailed comments on earlier versions
of this paper. I am also grateful to Lisa Travis, Nigel Duffield, Noriaki Yusa, Taisuke Nishigauchi,
Yahiro Hirakawa and Kazue Kanno for comments and suggestions. Part of this study was presented
at PacSLRF’98, Aoyama Gakuin University, Tokyo, March 1998. I am grateful to audiencesfor
stimulating questions and comments. This research was in part supported by a research grant from
Tokyo International University.
Notes
3. Other pieces of evidence for deep unaccusativity include floated numeral quantifiers, resultative
constructions and verbal compounding. See Miyagawa (1989) and Kageyama (1993) for more
details.
4. It should be noted that the Japanese examples given in section 2.2. consist of two clauses. This
ensures that a Case marker omitted in the embedded clause is not the topic marker wa, which
can be freely omitted, but only the nominative Case marker.
5. Other pieces of evidence for surface unaccusativity in Japanese include generic PRO (Kuroda
1988, Kageyama 1993, Nishigauchi 1992) and causative passive constructions (Kageyama
1993).
6. These sentences are taken from Yuan 1996 (8a,b & c). In the gloss of the example sentences,
= classifier, and = perfective aspect marker.
7. This is also true with English unaccusatives, but Japanese unaccusatives do not have a
definiteness restriction at any level.
8. Examples of stative predicates include: aru ‘exist/have/be’, wakaru ‘understand’, dekiru ‘can’,
sukida ‘like’ and hosii ‘want’. NPs marked with nominative ‘ga’ in stative predicates bear a
theme role, and are thus assumed to be in object position rather than subject position. See
Takezawa (1987), who claims that nominative Case can be easily dropped in such predicates.
9. As far as the transitive subject/object distinction is concerned, experimental studies on young
Japanese children also suggest that they know that Case markers on transitive objects can be
omitted while those on transitive subjects cannot (Otsu 1994, Lakshmanan and Ozeki 1996).
10. The tasks reported here included other types of structures, making the total number of test
sentences larger than the numbers reported here.
11. I follow Kanno (1996) in designing the test sentences in this task. Kanno also uses interrogative
sentences in her experiment on case deletion in L2 Japanese, exploring the issue of access to
non-parametrized principles of UG in L2 acquisition. It should be noted that her subjects were
instructed to correct sentences when they judged the test sentences to be unnatural, which was
not the case with my study. Correction was not required in my study because a pilot study had
shown that many subjects had a tendency to judge the test sentences without case markers as
unacceptable and had provided missing Case markers. Since it was desired to avoid such a
situation, further correction was not asked.
12. Many of the subjects who were removed at this point correctly accepted Tr (T) but failed to
consistently reject Tr (F). This was also the case with three native speakers of Japanese
removed here.
13. In fact, one Chinese and one Japanese subject rejected all sentences checking case drop, and
one Chinese subject accepted them all. Since these subjects did not do so on the other test
sentences (which are not reported here) included in the same task, they were retained in the
analysis.
14. As for those who were removed at this point, we found the same tendency that was observed
in the overall responses. That is, both groups of L2 learners tended to accept Case drop on
transitive objects but failed to reject that on transitive subjects. The Japanese controls in contrast
showed a tendency to reject Case drop on transitive subjects consistently, but did not accept
Case drop on transitive objects.
L2 ACQUISITION OF JAPANESE UNACCUSATIVITY 111
Appendix
Takusan oyogimasita.
a lot swam
Takusan oyogimasita.
a lot swam
112 MAKIKO HIRAKAWA
References
Naoko Yoshinaga
Hirosaki Gakuin University
1. Introduction
A key feature of multiple wh-questions is that there are multiple requests for
information. For example, an appropriate answer to the question who read what?
provides information in response to both who and what.
Languages themselves differ in the range of possible multiple wh-questions.
In English, for instance, multiple wh-questions involving two argument wh-
phrases are acceptable, whereas those involving an adjunct wh-phrase are not
(e.g., Lasnik & Saito 1992).
(1) Both wh-phrases are arguments
a. Who read what?
b. Who went where?
(2) One wh-phrase is an adjunct
a. *Who answered how?
b. *Who died why?
Among adjuncts, where and when behave differently from adjunct how and why
in multiple wh-questions, since multiple wh-questions with a subject wh-phrase
and adjunct where or when are acceptable, as (3) illustrates (Huang 1982; Aoun,
Hornstein, Lightfoot, & Weinberg 1987; Kuno & Takami 1993).
(3) a. Who swam where?
b. Who traveled when?
116 NAOKO YOSHINAGA
4. The Study
4.1 Subjects
subjects began learning English at age 12 (ranging from 9 to 13, mean = 12.07,
SD = .87), and had studied English for 6 to 10 years (mean = 6.95, SD = 1.13),
although none had ever lived in an English-speaking country. None of these
subjects had taken the TOEFL; however, Kyushu University is one of the most
competitive national universities in Japan, and consequently it is not implausible
to assume that their English proficiency is relatively high among Japanese
college students.
The English-speaking learners of Japanese were students enrolled in sections
of a third semester course in Japanese at the University of Hawai‘i. There were
19 males and 14 females in this group, ranging in age from 18 to 31
(mean = 20.63, SD = 2.78). The age at which these subjects started learning
Japanese ranged from 10 to 25 (mean = 15.22, SD = 3.13). They had been
studying Japanese for 1 to 7 years (mean = 3.44, SD = 1.64), although none had
been to Japan. There are no test scores to show their general proficiency levels.
However, they were all enrolled in the third of four semester-long courses that
undergraduates must take in a foreign language.
Native speakers of English and Japanese participated as control groups. The
native speakers of English were students enrolled in sections of first and second
semester courses in Japanese at the University of Hawai‘i. The native speakers of
Japanese were students enrolled in compulsory general English courses primarily
offered to freshmen and sophomores at Kyushu University in Fukuoka, Japan.
4.2 Materials
4.3 Procedures
4.4 Analyses
4.5 Results
Table 1. Descriptive statistics for ratings of the six sentence types by native speakers
Native English speakers (EE) Native Japanese speakers (JJ)
Count Mean S.D. Count Mean S.D.
who-what 30 −2.044 1.325 50 2.444 1.003
who-wherecomp 30 −1.967 1.445 50 2.360 1.104
who-wherenon 30 −1.683 1.734 50 2.319 1.161
who-when 30 −0.842 1.974 50 2.189 1.153
who-how 30 0−.367 1.912 50 1.752 1.278
who-why 30 −1.150 1.873 50 1.575 1.615
124 NAOKO YOSHINAGA
JJ EE
3
-1
-2
-3
who-what who-wherecomp who-wherenon who-when who-how who-why
Table 1 and Figure 1 (JJ) also show that there was a similar tendency in the
acceptability of sentences (in the same descending order) for Japanese, as
reflected in the significant effect of sentence type, F(5, 245) = 13.753*, p = .0001.
However, the slope of the descent for the mean ratings of the six types is much
flatter than for English, and all sentence types are quite high in terms of their
acceptability ratings.
In order to determine which pairs of means were significantly different,
adjacent means were subjected to multiple comparisons. Table 2 summarizes the
results of these multiple comparisons.
As can be seen by examining the last three rows of Table 2, the major
contrasts in English occur between the who-wherenon type and the who-when
type, between the who-when type and the who-how type, and between the
who-how type and the who-why type. This reflects the fact that the mean ratings
WHO KNOWS WHAT AND WHY? 125
for the who-how type and the who-why type are significantly lower than for the
other sentence types, as shown in Table 1. Within the latter set of sentences, the
mean rating for the who-when type is significantly lower than for the who-what,
who-wherecomp, and who-wherenon types (see Table 1).
Turning now to Japanese, there was a major break between the who-when
type and the who-how type, with the mean ratings for the who-how and who-wh
types significantly different from the ratings for the who-what, who-wherecomp,
who-wherenon, and who-when types.
If positive values are taken to indicate that a sentence is acceptable, and
negative values are taken to indicate that it is unacceptable, the results show that
for English, the who-what type and the who-wherecomp type are quite high in
acceptability, that the who-wherenon type is clearly on the acceptable side, and
that although lower on the range, the who-when type is also on the acceptable
side. In contrast, the who-how type and the who-why type are on the unaccept-
able side. For Japanese, every type is clearly on the acceptable side: the who-
what type, the who-wherecomp type, the who-wherenon type, and the who-whn type
are all nearly perfect, while the who-how type and the who-why type receive
quite high ratings.
4.5.2 Native speakers of English vs. Japanese learners of English (EE vs. JE)
The ESL learners rated the six types of multiple wh-questions differently from
native speakers of English, as reflected in the fact that there was a significant
main effect for both language group, F(1, 70) = 14.376*, p = .0003 and sentence
type, F(5, 350) = 53.392*, p = .0001, as well as a significant interaction effect for
language group with sentence type, F(5, 350) = 29.959*, p = .0001. More specifi-
cally, native speakers made rather sharp distinctions among the six types of
multiple wh-questions, whereas the learners rated all types as unacceptable
without making distinctions among them. As Figure 2 shows, the slope of the
126 NAOKO YOSHINAGA
JE EE
3
-1
-2
-3
who-what who-wherecomp who-wherenon who-when who-how who-why
Figure 2. Mean ratings of the six sentence types by Japanese learners of English and native
speakers of English
line connecting the mean ratings by the native speakers of English (EE) is sharp,
whereas the slope for the learners (JE) is very flat and low. As depicted in
Figure 3 below, the mean ratings by native speakers are quite spread out along
the 7-point rating scale, clearly distinguishing among the six types. (The relevant
means are in Table 1.) In contrast, rating by the Japanese learners of English are
clustered around the area below zero to −1, without much variation among the
six types; in other words, basically all types were rejected as unacceptable. (The
relevant means are in Table 3.)
Because the main effect for sentence type was significant, this factor was
examined further. The comparisons reported in Table 4 below indicated that none
of the adjacent pairs of means was statistically significant for the Japanese-
speaking learners of English.
WHO KNOWS WHAT AND WHY? 127
Table 3. Descriptive statistics for ratings of the six sentence types by learners
Japanese learners of English English learners of Japanese
(JE) (EJ)
Count Mean S.D. Count Mean S.D.
who-what 42 −.298 2.132 33 1.689 1.525
who-wherecomp 42 −.565 2.014 33 1.558 1.635
who-wherenon 42 −.702 1.982 33 1.750 1.518
who-when 42 −.851 1.888 33 1.295 1.720
who-how 42 −.851 1.940 33 0.937 1.655
who-why 42 −.935 1.865 33 0.879 1.667
3 who-what
who-wherecomp
who-wherenon
2 who-when
who-how
who-why
1
-1
-2
-3
EE JE
LG
Figure 3. Mean ratings of the six sentence types by native speakers of English and Japanese
learners of English (from another angle)
128 NAOKO YOSHINAGA
who-what vs. who-wherecomp 3.012 0.0842 who-where non vs. who-what 0.095 .7581
who-wherecomp vs. who-wherenon 0.787 0.3761 who-what vs. who-where comp 0.447 .5048
who-wherenon vs. who-when 0.929 0.3361 who-where comp vs. who-when 1.787 .1832
who-when vs. who-how 0.000 1.0000 who-when vs. who-how 3.332 .0698
who-how vs. who-why 0.291 0.5899 who-how vs. who-why 0.087 .7679
EJ JJ
3
-1
-2
-3
who-what who-wherecomp who-wherenon who-when who-how who-why
Figure 4. Mean ratings of the six sentence types by English-speaking learners of Japanese
and native speakers of Japanese
WHO KNOWS WHAT AND WHY? 129
3 who-what
who-where comp
who-where non
2 who-when
who-how
who-why
1
-1
-2
-3
JJ EJ
LG
Figure 5. Mean ratings of the six sentence types by native speakers of Japanese and
English-speaking learners of Japanese (a graph from another angle)
130 NAOKO YOSHINAGA
EE JJ
JE EJ
3
-1
-2
-3
who what wherecomp wherenon when how why
5. Discussion
In this section, I will discuss the results in relation to the four research questions
stated at the outset.
Research question 1: Do native speakers of English distinguish among these six
different types of English multiple wh-questions in their acceptability judgments?
Hypothesis 1: Native speakers of English distinguish multiple wh-questions that
contain how and why, which they treat as unacceptable, from all other types.
Hypothesis 1 was supported. The who-what type and the who-wherecomp type
were rated as clearly acceptable and the who-how type and the who-why type
both fall in the negative range (below the zero line). Although lower on the
scale, the who-wherenon type and the who-when type fall in the positive range
132 NAOKO YOSHINAGA
(see Bley-Vroman & Yoshinaga 1998, whose findings were similar to the
findings in the present study, for suggestions to account for why ratings are
gradual rather than exhibiting clear acceptable/unacceptable distinctions).
Research question 2: Do native speakers of Japanese accept all six types of
multiple wh-questions in Japanese?
Hypothesis 2: Native speakers of Japanese accept all six types of multiple wh-
questions in Japanese.
Although there was a significant effect for sentence type, native speakers of
Japanese rated all six types quite high on the positive side. Thus, the results
support hypothesis 2.
Research question 3: Do Japanese-speaking learners of English exhibit patterns
of acceptability judgments similar to those of native speakers of English?
Hypothesis 3: There is no difference between native speakers of English and
ESL learners in terms of their acceptability judgments (the null hypothesis).
The ratings of the six multiple wh-questions in English by Japanese-speaking
learners of English were significantly different from native speakers of English.
Although native speakers of English made clear distinctions among different
types of multiple wh-questions, Japanese learners of English rated all types as
unacceptable. Thus, hypothesis 3 was not supported.
Research question 4: Do English-speaking learners of Japanese exhibit patterns
of acceptability judgments similar to those of native speakers of Japanese?
Hypothesis 4: There is no difference between native speakers of Japanese and
JSL learners in terms of their acceptability judgments (the null hypothesis).
The results indicated that English-speaking learners of Japanese rated the six
multiple wh-questions in Japanese in more or less the same way that native
speakers of Japanese did. Like native speakers of Japanese, English-speaking
learners of Japanese rated all six types of multiple wh-questions acceptable,
although the learners’ ratings were slightly lower than the native speakers across
the six types. Hence, hypothesis 4 was supported.
Overall, then, the results revealed that Japanese learners of English performed
differently from native speakers of English, whereas English-speaking learners
of Japanese performed like native speakers of Japanese. One might suggest that
WHO KNOWS WHAT AND WHY? 133
the simplest explanation for these results is simply that there are proficiency
differences between the Japanese-speaking learners of English and the English-
speaking learners of Japanese. More precisely, it might perhaps be claimed that
the Japanese-speaking learners of English have low proficiency in the target
language or are at an earlier stage of language acquisition, whereas the English
learners of Japanese have high proficiency in the target language and are at an
advanced stage of language acquisition.
Although this matter can only be settled definitively by comparing the
proficiency levels of the two learners groups with the help of appropriate test
instruments, I believe that this proposal is unlikely to be the correct explanation.
In particular, an informal comparison of the materials used in English classes at
universities in Japan with those used in first and second-year Japanese classes at
the University in Hawai‘i suggests that the ESL learners in the present study
cannot in general be at a lower proficiency level in the target language than JSL
learners. Certainly, it is implausible to think that the difference could be large
enough to explain the contrasts in the acceptability ratings given by the two
learner groups.
Moreover, the ESL learners in the present study cannot plausibly be
considered to be at the very earliest stage of acquisition. As Vainikka and
Young-Scholten (1996: 29) note, with respect to subjects with training similar to
those in this study, students who have been exposed to English for the six years
required of secondary-school students in Japan are certainly not at an initial state
of language acquisition.
Length of study also suggests that the Japanese learners of English should
be more experienced in terms of acquisition stages. The average length of study
for the Japanese learners of English was 6.95 years (with a range from 6 to 10
years), which was more than that for English-speaking learners of Japanese
(mean 3.44 years, with a range from 1 to 7 years).
One possible difference between the two groups of learners is that the instruc-
tion in Japanese language classes in the US may tend to be more communicatively
oriented than English instruction in Japan. Such a difference might have had
some effect on the input to learning. However, it is difficult to see how such an
instructional difference could result in the observed differences in these experi-
mental results, given the general rarity of multiple wh-questions to begin with.
I thus tentatively assume that differences in proficiency level or type of
instruction are unlikely to account for the results reported here.
134 NAOKO YOSHINAGA
I now consider what the results of the present study indicate with respect to the
accessibility of Universal Grammar. The results are examined from two different
perspectives.
If language learning is taking place under conditions where the target language
input is limited, we could expect learners to be very conservative and to reject all
multiple wh-questions. Furthermore, if particular structures are more frequent in
one language, it is possible that the learners of that language could approximate
native-like competence more easily than learners of a language in which the
relevant structures are less frequent in the input.
To my knowledge, there is no research on the question of the relative
frequency of multiple wh-questions in Japanese and English. The very fact that
a wider range of multiple wh-question types are grammatical in Japanese than in
English would lead one to expect that multiple wh-questions would be more
frequent in Japanese. Given the likelihood of this frequency differential, the
learners of Japanese might have a greater chance of encountering the target
structures, which in turn could increase cases of relevant noticing, and thus might
enhance chances of approximating native-like behavior. Admittedly, of course,
this is mere speculation until the frequency of the relevant structures in each
target language is systematically studied and compared.
Another explanation that is related to but slightly different from the idea above
could draw from both structural factors and consideration of exposure to the
target forms. Because of their experience with their native language, speakers of
both Japanese and English know that human languages allow multiple wh-quest-
ions. Second language learners are virtually never exposed to multiple wh-
questions in the target language. Thus, although they may assume that the target
language permits such structures, they do not know what form they should take.
For Japanese speakers learning English, this creates a problem since the only
wh-questions they have been exposed to in significant numbers are simple
wh-questions in which exactly one wh-phrase appears in sentence-initial position.
This blocks formation of multiple wh-questions since on the basis of structures
they have seen so far, they believe that no wh-word can remain in-situ. Thus,
they reject all multiple wh-questions in English since they have never been
exposed to a structure in which one wh-word appears in sentence-initial position
and one wh-word remains in-situ. On the other hand, for English speakers
learning Japanese, there is no such problem since the wh-questions to which they
have been exposed (i.e., simple wh-questions) suffice to reveal a strategy that
works just as well for multiple wh-questions as for simple wh-questions, namely
the in-situ strategy.
It should be noted that on this account both Japanese and English learners
are conservative. The Japanese learners of English would not want to take any
risk: they would not want to risk leaving a wh-word in-situ because they do not
observe any wh-words in-situ in the wh-questions to which they have been
exposed and presumably they would not want to risk moving two words either.
The English speakers learning Japanese are just as conservative, except that their
conservatism does not get in the way: they have evidence for the appropriateness
of an in-situ strategy in wh-questions to which they have been exposed, so there
is no risk in using this strategy for multiple wh-questions.
This account predicts that all learners of English whose native languages
adopt the in-situ strategy would disfavor multiple wh-questions in English. Where
the target language is Japanese, on the other hand, learners should show native-
like behavior in judging multiple wh-questions regardless of their native language
if they follow the in-situ strategy.
WHO KNOWS WHAT AND WHY? 137
6. Conclusion
Acknowledgement
I am grateful for the comments of Kazue Kanno, William O’Grady, and Robert Bley-Vroman from
the earliest phases of this study. I am also grateful to Narahiko Inoue and Satomi Takahasi at Kyushu
University and Hiro Ota and other Japanese instructors at the University of Hawaii for their help with
the data collection.
Notes
1. Huang (1982) and Lasnik & Saito (1984) use Comp/S’ rather than the CP notation.
2. In English, the subject who moves in overt syntax and the other wh-phrases move at LF
whereas in Japanese, both wh-phrases move at LF because Japanese does not have overt
wh-movement (although Watanabe 1991; 1992 claims that a null wh-operator that is associated
with the wh-phrase moves in overt syntax). Lasnik & Saito (1984: 250) argue that in Japanese,
wh-phrases that need to be antecedent-governed move first in the grammatical multiple
wh-questions under consideration here.
3. Tagalog might constitute an example of a language with no multiple wh-questions, because it
allows only “subject” wh-questions (or wh-questions are permitted only when Spec of IP is
empty so that wh-word can stop over on the way to CP Spec, see Nakamura 1993). As
expected, the examples in (a) are ungrammatical, according to Laurie Reid (personal communi-
cation). However, the full range of facts is unclear.
a. *Sino ang bumili ng ano?
who that bought what
‘Who bought what?’
*Ano ang ibinigay ni Tom kanino?
what that gave Tom who
‘What did Tom give to whom?’
Calabrese (1984) reports that multiple wh-questions are not possible in Italian. McCloskey
(1979: 70–71) also notes that Irish does not allow multiple wh-questions since question phrases
are base generated in pre-Comp position. More detailed analysis of such languages is called for.
138 NAOKO YOSHINAGA
4. A 2-way repeated measures ANOVA for each language group was performed to check the
effects of the two different forms. There was no interaction of sentence type with form for any
language group, F(5, 200) = 1.712, p = .1334 n.s. for Japanese learners of English,
F(5, 155) = .963, p = .4427 n.s. for English-speaking learners of Japanese, F(5, 140) = .406,
p = .8442 n.s. for English natives, F(5, 240) = .371, p = .8682 n.s. for Japanese natives. This
indicates that the effect of the sentence types was the same regardless of which form was
provided.
5. A few vocabulary items were unfamiliar to English learners of Japanese, so the meanings in
English were provided in the text. All other words were carefully chosen from vocabulary lists
taught in the class in previous semesters (first and second semester). For Japanese learners of
English, we asked two native Japanese speakers living in Japan to check for any English words
likely to be unfamiliar to college students; they reported that everything should be familiar.
6. Two tokens each of four other types of multiple wh-questions (a total of 8) were also included
in the questionnaire (thus, there were 39 items in all) for the study of a separate issue.
References
Aoun, Joseph, Norbert Hornstein, David Lightfoot, and Amy Weinberg. 1987. “Two
Types of Locality.” Linguistic Inquiry 18: 537–77.
Bley-Vroman, Robert. 1990. “The Logical Problem of Foreign Language Learning.”
Linguistic Analysis 20: 3–49.
———. 1996, June. “Conservative Pattern Accumulation in Foreign Language Learning.”
Paper presented at the European Second Language Association, Nijmegen, the
Netherlands. Available: http://www.lll.hawaii.edu/bley-vroman/ [access October 10,
1996].
Bley-Vroman, Robert and Naoko Yoshinaga. 1998. “The Acquisition of Multiple
wh-questions by High-proficiency Non-native Speakers of English.” University of
Hawai‘i Working Papers in ESL 17.1: 1–25.
Calabrese, Andrea. 1984. “Multiple Questions and Focus in Italian.” In W. de Geest, and
Y. Putseys (eds.), Sentential Complementation, Proceedings of the International
Conference held at UFSAL, Brussels, Foris, 67–74.
Chomsky, Noam. 1986. Knowledge of Language. Its nature, Origin, and Use. Westport,
Connecticut: Praeger.
Cole, Peter, Gabrielle Hermon, and Li-May Sung. 1990. “Principles and Parameters of
Long-distance Reflexives.” Linguistic Inquiry 21: 1–22.
Eubank, Lynn. 1996. “Negation in Early German-English Interlanguage: More valueless
features in the L2 initial states.” Second Language Research 12: 73–106.
Flynn, Suzanne. 1996. “A Parameter-setting Approach to Second Language Acquisition.”
In W. C. Ritchie, and Tej K. Bhatia (eds.). Handbook of Second Language Acquisi-
tion. New York: Academic Press, 121–158.
Huang, C. -T. James. 1982. “Logical Relations in Chinese and the Theory of Grammar.”
Doctoral Dissertation, MIT, Cambridge, Mass.
WHO KNOWS WHAT AND WHY? 139
William O’Grady
University of Hawai‘i at Manoa
1. Introduction
Both English and Japanese have a process known as ‘gapping’ that deletes the
verb in one conjunct of a coordinate clause under conditions of identity with the
verb in another conjunct. However, as the examples below help illustrate, the two
languages differ in terms of which verb is deleted. In English, the verb in the
second conjunct is suppressed.
142 WILLIAM O’GRADY
3. The experiment
3.1 Materials
The experiment took the form of a written questionnaire in which subjects were
asked to judge the acceptability of gapping patterns in the target language by
rating them on the following scale.
1 2 3 4 5 I don’t
bad good know
GAPPING AND COORDINATION IN SLA 145
3.2 Subjects
There were two principal groups of subjects in the experiment — a set of native
English speakers learning Japanese and a set of native Japanese speakers learning
English. The JSL learners were all undergraduates at the University of Hawaii at
146 WILLIAM O’GRADY
4. Results
The experiment yielded quite intriguing results, some of which seem to support
clear-cut conclusions while others point to the need for additional research on a
number of issues. I will begin by reporting the results of the JSL study and then
turn my attention to the ESL study. Some implications of the results will be
considered in section 4.
for the grammatical leftward gapping pattern exemplified in (9a) over the
unacceptable rightward gapping structure in (9b).
(9) a. Leftward gapping pattern (grammatical in Japanese):
[Maki-san-wa buraziru-ni Ø] [Tanaka-san-wa kanada-ni
[Maki-Mr- Brazil-Goal [Tanaka-Mr- Canada-Goal
it-ta] to omo-u.
go- that think-
‘(I) think that Maki to Brazil and Ken went to Canada.’
b. Rightward gapping pattern (ungrammatical in Japanese):
*[Maki-san-wa buraziru-ni it-te] [Tanaka-san-wa
[Miki-Mr- Brazil-Goal go- [Tanaka-Mr-
kanada-ni Ø] to omo-u.
Canada-Goal that think-
‘(I) think that Miki went to Brazil and Ken to Canada.’
The average rating for the leftward patterns among the 10 native speakers who
responded to the questionnaire was 4.52, compared to only 1.26 for the
rightward pattern.
Matters are quite different in the case of the JSL learners, however. Overall,
these subjects exhibited a strong, statistically significant preference for the
ungrammatical rightward gapping pattern in (9b). Moreover, as can be seen by
comparing the scores for the two patterns in Table 1, this preference was
manifested to varying degrees by all three subgroups of L2 learners.
Before proceeding, it is necessary to consider a potentially confounding
factor in the contrast between gapping structures in Japanese and English. As
(10) helps illustrate, the two languages differ not only in gapping direction but
also in terms of the use of a conjunction — English employs a conjunction in its
grammatical gapping patterns but Japanese does not.
(10) a. Gapping in English (rightward pattern):
[John read Time] and [Sue Ø Newsweek].
b. Gapping in Japanese (leftward pattern):
[John-wa Time-o Ø] [Sue-wa Newsweek-o yon-da].
[John- Time- [Sue- Newsweek- read-
‘John Time and Sue read Newsweek.’
This raises the question of whether some English speakers might reject leftward
gapping patterns in Japanese because there is no conjunction, rather than because
of the gapping direction.
In order to answer this question, the final leftward and rightward gapping
148 WILLIAM O’GRADY
Now let us consider the results of the ESL study. Table 2 summarizes the
responses of the native English speakers and the Japanese-speaking subjects to
gapping patterns in English coordinate structures. (Once again, separate repeated
ANOVAs were done for each group and subgroup of subjects.)
Native speakers of English exhibit a strong, statistically significant prefer-
ence for the rightward gapping pattern. This is as expected, although the
relatively low rating (in absolute terms) assigned to the rightward pattern
suggests that this construction may be less than fully natural for many speakers.
In contrast, the ESL subjects seemed to like neither of the gapping patterns
in English — both sentence types received grammaticality ratings toward the low
end of the scale. Nonetheless, it is worth noting that the grammatical rightward
pattern received a higher rating than the ungrammatical leftward pattern. (This
difference was significant for the subjects in the first group and overall, but not
for those in the second group.)
Once again, it is important to be sure that the subjects’ rejection of particu-
lar patterns relates to gapping rather than to the presence of a conjunction.
(Recall that English differs from Japanese in employing a conjunction in gapping
patterns.) As in the case of the JSL study, this issue was addressed by asking the
first-year subjects to correct the final rightward and leftward gapping patterns on
the questionnaire in the event that they judged them to be ungrammatical. Of the
20 students who made changes to the ungrammatical leftward pattern, 16
replaced the gap with a verb and one reversed the order of the overt verb and the
gap; none made any changes to the conjunction. Only 7 students made changes
to the grammatical rightward pattern — always by filling the gap with a verb.
5. Discussion
patterns were quite low, never exceeding 2.5 on the scale of 1 to 5. Moreover,
an examination of individual performance reveals that only 2 of the 34 subjects
assigned either gapping pattern a mean score of 4.0 or greater. (Both subjects
were in the first year English course; one gave a high rating to the rightward
pattern and the other to the leftward pattern.4)
In contrast, the performance of the English-speaking JSL learners is quite
puzzling. As reported in section 4, they apparently fail to respect the gapping
direction constraint, assigning higher mean scores to the ungrammatical rightward
patterns in Japanese than to the grammatical leftward patterns. As noted earlier,
this contrast is statistically significant both overall and for subjects in the second-
and third-semester classes. The ratings assigned to the rightward patterns are high
in absolute terms as well — slightly greater than 3.5 on the scale of 1 to 5 for the
second-semester students and 3.2 for the third-semester students. In contrast, as
just noted, group scores for the ESL students do not exceed 2.5 for either pattern.
In sum, the JSL learners behaved as if there were no correlation between
head–complement order and prohibited gapping direction. Although they clearly
recognize that Japanese is an object–verb language (their own spoken and written
sentences are consistently verb-final), they fail to reject rightward gapping
patterns in that language. Instead, they treat gapping in coordinate structures as
if they were dealing with English — accepting the rightward patterns and
rejecting the leftward ones. (Indeed, the rating of 3.5 assigned to this pattern by
the least advanced group of JSL subjects is very close to the mean score of 3.74
that native speakers of English assign to the rightward gapping patterns in
English.) This obviously constitutes somewhat of a puzzle for the L2 Continuity
Hypothesis.
It is not clear how this puzzle can be resolved. It is always possible that the
phenomenon could be attributed to extraneous factors of some sort, but it is
difficult to see what these factors might be. For example, there is no reason to
think that the L2 learners failed to realize that they were dealing with gapping
patterns: as noted above, the most common response among subjects who tried
152 WILLIAM O’GRADY
Table 4. JSL subjects who gave a mean rating of at least 4 to the rightward gapping pattern
Group # of subjects # with mean ratings ≥ 4 on
rightward patterns
Jpn 102 (2nd semester) 20 9 (45%)
Jpn 201 (3rd semester) 32 7 (21.9%)
Jpn 202 (4th semester) 23 6 (38.3%)
Table 5. Ratings for rightward and leftward gapping by 4th year JSL learners
Sub-group Rightward Leftward Difference Significance
gapping* gapping†
no significant stay in Japan 2.54 2.05 .49 .1142
(n = 15)
lived in Japan for 1–8 years 2.05 2.60 .55 .1970
(n = 11)
*ungrammatical; †grammatical
GAPPING AND COORDINATION IN SLA 153
6. Conclusion
As explained at the outset, the primary purpose of this study has been to explore
an esoteric but highly interesting aspect of second language acquisition —
learners’ knowledge of phenomena that are neither the subject of instruction nor
directly observable in experience.
The particular phenomenon investigated here — the correlation between
head–complement order on the one hand and prohibited gapping direction in
coordinate structures on the other — is a case in point. Japanese (an object–verb
language) exhibits leftward gapping in coordinate structures while English (a
verb–object language) manifests rightward gapping. However, because gapping
structures are rare in actual speech and not a part of the second language curricu-
lum, the key property of this construction cannot be induced from experience.
Rather, if it is to be mastered spontaneously at all, it must be deduced from the
interaction of general principles of sentence architecture with an awareness of the
particular word order employed for verbs and complements in the target
language.
154 WILLIAM O’GRADY
Acknowledgments
Special thanks are due to Kazue Kanno, not only for her comments on this paper, but also for her
assistance with the JSL questionnaire and for arranging to have it given to groups of JSL learners. I
am also grateful to Naoko Yoshinaga for giving the questionnaire to two groups of ESL learners and
for discussion of the results, to Kevin Gregg and Robert Bley-Vroman for their comments and
suggestions, and to the instructors and students in the classes participating in the experiments on
which this study is based.
Notes
1. Bonnie Schwartz, Robert Bley-Vroman and Juegen Meisel inform me that German allows
gapping in either direction in embedded clauses. This suggests that the proposed constraint
should be reformulated to apply only to languages that are uniformly of the verb–object or
object–verb type. Since German is neither, it would escape the constraint.
2. The coordinate structures were placed in embedded clauses in order to ensure that the test
sentences always ended with a verb in Japanese. Otherwise, subjects might have ruled out the
rightward gapping pattern in Japanese simply because the sentence did not end in a verb —
something that they would have come to expect on the basis of both experience and instruction.
3. The data in Tables 3, 6 and 7 is supplemented by data collected by Kazue Kanno using the
same technique and materials employed here. See Kanno (1999).
4. Despite the low ratings for both type of coordinate structure, the first-year students gave
significantly higher scores to the rightward gapping pattern than to the leftward construction. It
is not clear why this should be so, given the unlikelihood that they had been exposed to
rightward patterns with any regularity. It is possible that they were influenced by exposure to
the type of verb deletion pattern illustrated in (i), which does occur with considerable frequency
in English. (I am grateful to Minsun Song for this suggestion.)
(i) a. I left before Mary did Ø.
b. John won’t go but Sue will Ø.
c. I’ll go if I can Ø.
5. It is also predicted that Japanese speakers in the early stages of acquiring English will do less
well then their more advanced counterparts, perhaps even accepting backward gapping in
English. Time and space do not permit further consideration of this prediction here.
Appendix 1
Thank you for agreeing to help me with my research project. Even though you are still learning
English, I’d like to have your opinion about some sentences that I am studying. All you have to do is
read a series of short passages and then indicate whether the underlined sentence is possible in English.
Here is an example.
156 WILLIAM O’GRADY
Larry was looking for the library, but he couldn’t find it. So, he went up to a stranger and said:
‘Can you tell me where the library is?’
For each sentence, there are five choices ranging from ‘bad’ to ‘fully acceptable’ in addition to ‘I
don’t know’.
1 2 3 4 5I don’t
bad good know
You will probably rate the sentence in the example above as ‘good’, since it is clearly a possible
sentence of English.
1 2 3 4 5 I don’t
bad good know
1 2 3 4 5 I don’t
bad good know
You can use the intermediate categories (2, 3 and 4) for cases that are neither completely bad nor
completely good, in your opinion.
There may be some sentences that you are not able to judge. If that happens, you can circle ‘I
don’t know’ on the answer sheet.
References
Ardery, Gail. 1980. “On Coordination in Child Language.” Journal of Child Language 7:
305–20.
Eimas, Peter. 1996. “The Perception and Representation of Speech by Infants.” In J.
Morgan & K. Demuth (eds.), Signal to Syntax: Bootstrapping from speech to
grammar in early acquisition. Mahwah, NJ: Erlbaum, 25–39.
Johannessen, Janne. 1996. “Partial Agreement and Coordination.” Linguistic Inquiry 27:
661–75.
Kanno, Kazue. 1996a. “Access to Universal Grammar in Second Language Acquisition:
Data from the interpretation of null arguments in Japanese.” Linguistics 34:
397–412.
GAPPING AND COORDINATION IN SLA 157
Kazue Kanno
University of Hawai‘i at Manoa
1. Introduction
An important issue in the study of second language acquisition has to do with the
extent to which L2 learning can be facilitated or impeded by properties of the
native language. This paper addresses this question in relation to an intricate
phenomenon that has its roots in the interaction between verb gapping in
coordinate structure and verb–object order (see Ross 1970 and Johannessen
1996), complementing the study by O’Grady (this volume).
Section 2 lays the groundwork for my study by describing the syntax of
verb gapping in three languages — English, Japanese and Mandarin. Section 3
then outlines an experiment that is designed to investigate how differences
among these languages affect the acquisition of this phenomenon in Japanese by
speakers of English and Mandarin. The paper ends with a discussion and
conclusion that seeks to integrate the results of my experiment into our under-
standing of the role of cross-linguistic differences in facilitating and impeding
second language acquisition.
When sentences are conjoined in Japanese, the verb in the first clause appears in
the ‘gerundive’ (tenseless) form while the verb in the second clause is tensed.
160 KAZUE KANNO
clear that if learners have the ability to draw conclusions about gapping direction
from head-complement order in accordance with (5), the information needed to
trigger this inference is abundantly available in the input in the form of sentences
with a verb in final position. (Indeed, teachers generally give beginners explicit
instructions to put the verb at the end of the sentence in Japanese.)
Assuming that the Constraint on Verb Gapping is available to L2 learners,
we predict that learners should be able to reject forward gapping based on
exposure to head-final sentences, but that they need to be exposed to actual
instances of backward gapping in order to learn that this particular pattern is in
fact employed in Japanese. In other words, information about head position alone
does not allow learners to infer that backward gapping is permitted in Japanese,
although it should suffice to rule out the possibility of gapping. (See
O’Grady, this volume, for a more complete discussion of this point.) By
comparing English- and Mandarin-speaking learners, we will be able not only to
test this hypothesis but also to determine the possible effect of the presence or
absence of a (different) gapping pattern in the native language.
3. The experiments
3.1 Material
Five groups of subjects participated in the experiment. The first four groups
consisted of speakers of English who were learning Japanese as a foreign
language at the University of Hawaii; none of these subjects had ever lived in
Japan. The fifth group consisted of 10 adult native speakers of Japanese.
Group I: 20 subjects enrolled in sections of second semester Japanese (Japa-
nese 102)
Group II: 32 subjects enrolled in sections of third semester Japanese (Japanese
201)
Group III: 23 subjects enrolled in sections of fourth semester Japanese (Japa-
nese 202)
Group IV: 17 subjects enrolled in sections of various fourth year Japanese
courses
Group V: 10 adult native speakers of Japanese
164 KAZUE KANNO
3.2.1 Results
I will first report the overall results and then consider the performance of
individual subjects from two perspectives.
3.2.1.1 Overall results. Table 1 presents the overall results for the four English
JSL groups and for the group of Japanese native speakers.
In addition to the mean ratings for forward and backward gapping patterns,
Table 1 also includes the difference between the two scores — which is obtained
by subtracting the mean rating for the forward gapping pattern from that of the
backward gapping pattern.
As can be seen here, the native speaker group has a mean rating of 4.52 for
the backward gapping pattern, compared to a mean rating of 1.26 for the forward
pattern — a difference of 3.26. Thus, as expected, they clearly consider the
former pattern to be natural and the latter to be unnatural.
The overall results for the English JSL learner groups are quite different,
however. The mean rating for the backward gapping patterns is 2.76 for group
I, 2.22 for group II, 2.18 for group III, and 2.29 for group IV. With their mean
scores all beneath 3.00, these subjects evidently do not consider the backward
pattern to be natural. In fact all four groups rate the ungrammatical forward
gapping pattern higher than the backward pattern — the mean rating for the
forward pattern is 3.50 for group I, 3.00 for group II, 2.88 for group III, and
2.34 for group IV.
Although there is no sign of improvement on the grammatical backward
gapping pattern, whose rating is low even in the most advanced group, we can
observe some progress with respect to the ungrammatical forward pattern. As can be
Table 1. Mean ratings for the five groups on the two gapping patterns
Type I Type II differences p values
Backward Forward
Group I: 2.76 3.50 (−0.74) .0479
L2 learner Group II: 2.22 3.00 (−0.78) .0006
groups Group III: 2.18 2.88 (−0.70) .00017
Group IV: 2.29 2.34 (−0.05) .9017
Native speak- Group V: 4.52 1.26 (+3.26) .0001
er
Scale of 1 (ungram.) to 5 (gram).
L2 ACQUISITION OF VERB GAPPING IN JAPANESE 165
seen in table 1, the rating for this structure falls at the more advanced levels,
resulting in a lessening of the difference between the mean ratings for the two types.
In sum, it appears that success in rejecting the forward gapping pattern is
related to the amount of previous exposure to Japanese. Hence, we see a
decrease in the average ratings for this pattern in the more advanced group, with
the lowest ratings coming from the students in Group IV. It is clear, though, that
the rejection of these sentences is the result of a long and laborious process. This
is especially noteworthy in light of the fact that the information that could in
principle rule out this pattern — the head-final word order of Japanese — is
present in abundance throughout the entire period of second language study.
For backward gapping, however, there appears to be no correlation between
amount of experience and rate of acceptance of this pattern. All groups have a
mean rating lower than 3.0 even though there is reason to believe that they are
incidentally exposed to at least some instances of this structure type. (For
example, the textbook (Situational Functional Japanese, volume 1: Drills,
Tsukuba Language Group 1995) used at the University of Hawaii includes no
direct information about gapping, but I uncovered six incidental instances of
backward gapping patterns in the chapters used in the first-semester Japanese
course. (Only one involved a transitive verb like the test sentences used in the
experiment; the other five involve a copula.)
3.2.1.2 Individual scores. Let us now consider the performance of the individual
subjects with a view to determining their success in recognizing the effects of
the Constraint on Gapping in Japanese. I will examine this issue from two per-
spectives, beginning with the number of times individuals rejected the ungrammat-
ical forward gapping pattern, as required by the Constraint on Gapping. I will then
investigate whether individual subjects show a preference for backward gapping.
Evidence for such a preference would indicate not only a sensitivity to the
Constraint on Gapping, but also the realization that Japanese permits backward
gapping. (Recall that the Constraint on Gapping rules out forward gapping but
says nothing about whether backward gapping actually occurs in Japanese.)
Rejection of the forward gapping pattern: If learners have the Constraint on Verb
Gapping, they should not accept forward gapping patterns in Japanese. Table 2
presents the distribution of English JSL learners and Japanese native speakers
with respect to the number of times they gave these patterns a rating of 4 or 5 on
the 5-point scale used on the questionnaire.
Allowing for performance errors, I assume that subjects with knowledge of
the Constraint on Verb Gapping should assign the forward patterns a rating of 4
166 KAZUE KANNO
or 5 no more than once. As expected, all native speakers fall into this category,
uniformly assigning very low scores to the forward gapping pattern. Among the
English JSL learners, knowledge of the Constraint on Verb Gapping appears to
be related to the amount of exposure to Japanese. Hence, the highest proportion
of subjects who uniformly provide a rating of less than 4 (59%) is found in
Group IV.
Table 2. Distribution of English JSL learners based on the number of 4/5 ratings for
forward gapping
No. of times Group I Group II Group III Group IV Native
4 or 5 is selected (n = 20) (n = 32) (n = 23) (n = 17) (n = 10)
0 times or once (6 (12 (11 (10 (10
(30%) (37.5%) (47.5%) (59%) (100%)
2 or 3 times (9 (14 (05 (04 (00
(45%) (44%) (21.5%) (24%)
4 or 5 times (5 (06 (07 (03 (00
(25%) (18%) (30.5) (17%)
Preference for gapping direction: Now let us examine the subjects’ individual
performance in terms of preferred gapping direction, which can be determined by
subtracting the rating for the ungrammatical forward gapping from the rating for
the grammatical backward pattern. Thus, positive figures indicate a preference
for backward gapping, zero signals no preference, and negative figures point to
a preference for forward gapping.
As expected, all native speakers of Japanese indicate a preference for
backward gapping (see table 3 below). Indeed, they all have a difference of 2 or
higher in their rating of the two patterns.
On the other hand, the individual performance of the English JSL learners
paints a totally different picture. As the bold-face percentages indicate, in all
four groups more subjects prefer forward gapping. In group I, for instance, 13
subjects prefer forward gapping and only 7 prefer backward gapping. The ratio
is 22 to 5 in Group II and 17 to 5 in Group III. Even at the fourth instructional
level (group IV), more than half the subjects prefer the unacceptable forward
gapping. These results appear to reflect a transfer effect, with English speakers
assuming that the forward gapping found in their native language is also
permitted in Japanese.
In sum, an examination of the overall results and of individual performance
L2 ACQUISITION OF VERB GAPPING IN JAPANESE 167
3.3.1 Results
3.3.1.1 Overall results. Table 4 presents the overall results for the two groups of
Mandarin JSL learners. For purposes of comparison, the earlier results obtained
168 KAZUE KANNO
Table 4. Mean ratings for the three Mandarin JSL groups on the two gapping patterns
Type I Type II differences p values
Backward Forward
L2 learner Beginning: 3.04 2.01 (+1.03) .0001
groups Intermediate: 2.83 2.24 (+0.59) .0049
Native speak- Group V: 4.52 1.26 (+3.26) .0001
ers
Scale of 1–5: 5 is the highest.
from the Japanese native speakers are included here as well. As before, I focus
on the mean ratings for forward and backward gapping patterns and on the
difference between the two scores — which is obtained by subtracting the mean
rating for the forward gapping pattern from that of the backward gapping pattern.
The mean rating for the forward patterns is 2.01 for the beginning group
and 2.24 for the intermediate group. In other words, subjects in both groups
appear to find forward gapping sentences unnatural, although this response is not
as strong as in the case of the native speakers.
In contrast, the mean rating for the backward gapping patterns is 3.04 for
the beginning group and 2.83 for the intermediate group. These scores appear to
fall into a sort of ‘middle ground’ in that the subjects are neither accepting nor
rejecting these sentences.
There are two noticeable differences between these Mandarin JSL groups
and English JSL groups. First, both Mandarin groups have a preference for the
backward gapping pattern, as the plus figures in the third column of our table
show. By contrast, all English JSL groups exhibit the opposite preference, as
attested by the minus figures in the comparable column in Table 1 above.
Second, the beginning and intermediate groups of Mandarin JSL learners
clearly reject the forward gapping pattern, as their respective mean scores of 2.02
and 2.24 indicate. In contrast, all four groups of English JSL learners assign a
higher mean rating to these patterns.
3.3.1.2 Individual scores. Now let us examine the individual scores of the
Mandarin JSL learners with respect to the number of times each individual
subject gave a rating of 4 or 5 to the forward gapping patterns. As in the case of
the English-speaking JSL learners (see Table 2), I assume that subjects with a
knowledge of the Constraint on Verbal Gapping will give such high ratings no
more than once (see Table 5 below).
By this criterion, 67.5% of the subjects in the beginner group and 74% of
L2 ACQUISITION OF VERB GAPPING IN JAPANESE 169
Table 5. Distribution of Mandarin JSL learners based on No. of times 4/5 ratings for
forward gapping
Beginning Intermediate Native
(n = 40) (n = 35) (n = 10)
0 times or once (27 (26 (10
(67.5%) (74%) (100%)
2 or 3 times (10 (06 (00
(25%) (17%)
4 or 5 times (03 (03 (00
(7.5%) (9%)
the backward pattern (found in Group 1) was a mere 35% (see Table 3).
In sum, the Mandarin-speaking JSL learners differ from their English-
speaking counterparts with respect to overall results and individual performance.
Not only do they clearly reject forward gapping from an early stage of acquisi-
tion, they exhibit a stronger preference for the backward gapping patterns than do
the English JSL learners. Given that both sets of subjects were learning Japanese
in a foreign language context outside of Japan, the relatively poor performance
of the English JSL learners cannot be attributed simply to the limited input
associated with classroom-based language learning.
As stated at the outset, our primary concern has to do with what inferences, if
any, English- and Mandarin-speaking learners of Japanese can make about
gapping direction in that language based on exposure to head-final sentences. Of
special interest is the question of whether the difference between English and
Mandarin affects the way native speakers of each language acquire verb gapping
pattern in Japanese.
For the English JSL learners, rejection of the forward gapping patterns
seems to involve a long and drawn-out process stretching over several years. By
comparison, the same result appears to be achieved much earlier by the Mandarin
JSL learners, who clearly reject forward gapping patterns in Japanese even in the
early stages of language study. Why should this be?
A possible answer to this question lies in the fact that Mandarin differs
from English in not permitting gapping at all. There is thus no possibility of a
transfer effect obscuring the effects of the Constraint on Verb Gapping, and the
Mandarin speakers therefore have no reason to think that forward gapping should
be permitted in Japanese. This conclusion cannot be so easily arrived at by
English speakers, whose familiarity with forward gapping patterns in their native
language apparently supersedes the Constraint on Verb Gapping, at least initially.
Turning now to the backward gapping pattern, we have seen that both
groups of Mandarin JSL learners did much better in accepting this construction
than did any of the English JSL groups. Why should this be? The answer cannot
lie in simple transfer: since Mandarin does not allow gapping in either direction,
Mandarin-speaking learners who relied on their native language in judging the
Japanese sentences would presumably have ruled out the backward gapping
patterns just as strongly as the English speakers did. This is not what happened,
as we have seen.
L2 ACQUISITION OF VERB GAPPING IN JAPANESE 171
One possibility is that the Mandarin JSL learners were simply exposed to more
instances of backward gapping than were their English-speaking counterparts. At
this time, however, there is no reason to believe that this was so, especially since
both groups of learners were exposed to Japanese only in a classroom context
outside Japan. Chie Itami, the instructor at the institute where the Mandarin-
speaking subjects were studying, reports that the textbooks used there contain no
gapping structure and that she did not teach the pattern. Moreover, the Mandarin
JSL learners had roughly the same number of hours of exposure to Japanese
instruction as did the English-speaking students in Japanese 202. Yet, they were
far more willing to accept the backward gapping patterns.
A more interesting possibility is that the absence of gapping in Mandarin
allowed the Mandarin-speaking learners to be more open to the existence of
backward gapping patterns in Japanese. In the absence of any transfer-based
prejudices, their acquisition device was freed up to operate in much the same
way as it would in a first language context. Simple exposure to the backward
patterns was therefore enough to permit the conclusion that they are acceptable.3
In contrast, the English-speaking learners actually began at a disadvantage, since
they believed that forward gapping should be possible, as shown by the high
ratings given by the subjects in group 1. They therefore had to first realize that
forward gapping was impossible. Only at that time were they ready to notice and
recognize the patterns of backward gapping that are found in Japanese.
How precisely do the English-speaking learners come to realize that forward
gapping is not permitted in Japanese? There is reason to believe that the
Constraint on Verb Gapping has an important role to play. The key clue in this
respect is that some of the English-speaking subjects begin to reject the forward
gapping patterns without yet accepting the backward gapping patterns. The fact
that the forward gapping pattern is rejected even in the absence of ‘pressure’
from the backward pattern suggests that it has been forced out by some other
factor — presumably the general word order pattern of the language, which in
turn implies the operation of the Constraint on Verb Gapping.
In sum, we have evidence that learners are sensitive to gapping direction in
a second language and that they are (eventually) able both to rule out an
unacceptable gapping pattern and to learn an acceptable one. Of special interest
is a finding that emerged from our decision to simultaneously study speakers of
a language that permits only forward gapping (English) and speakers of a
language that permits no gapping at all (Mandarin): comparison of the two
groups suggests that mastery of the syntax of gapping is facilitated in cases
where the native language does not contain a gapping pattern different from the
one permitted in the target language. Indeed, if the data from our English-
172 KAZUE KANNO
Acknowledgments
I would like to thank William O’Grady for his comments and discussions. My special thanks go to
Chie Itami for her help with data collection in China and to Song Jiang for his assistance with the
Mandarin questionnaire. I would also like to thank the Japanese native speakers, Japanese-language
instructors, and students who participated in the study.
Notes
References
Johannessen, Janne. 1996. “Partial Agreement and Coordination.” Linguistic Inquiry 27:
661–75.
Kanno, Kazue. 1999. “Acquisition of Verb Gapping by JSL and JFL Learners.” Unpub-
lished Manuscript, University of Hawaii at Manoa.
Kuno, Susumu. 1973. The Structure of the Japanese Language. Cambridge, Massachusetts:
MIT Press.
Neijt, Anneke. 1979. Gapping: A contribution to sentence grammar. Dordrecht, Holland:
Foris Publications.
van Oirsouw, Robert R. 1987. The Syntax of Coordination. New York, NY: Croom Helm.
O’Grady, William. 1999. “Gapping and Coordination in Second Language Acquisition.”
This Volume.
L2 ACQUISITION OF VERB GAPPING IN JAPANESE 173
Ross, John R. 1970. “Gapping and the Order of Constituents.” In M. Bierwisch and K. E.
Heidolph (eds), Progress in Linguistics. The Hague: Mouton, 249–259.
Tsukuba Language Group. 1995. Situational Functional Japanese Volume 1: Drills.
Tokyo: Bonjinsha.
Name Index
A F
Ardery, G. 150 Falodun, J. 33
Ayoun, J. 115 Farrar, M. J. 11, 46
Flynn, S. 119, 134
B Ford-Niwa, J. 53, 60, 61 (see also
Bachman, L. F. 57, 65, 67 Kobayashi et al. (1995, 1996))
Baker, N. 11, 89 Fukuda, M. 4, 72
Balcom, P. 90
Beck, M.-L. 11, 12 G
Bialystok, E. 59, 60 Gass, S. 32, 33, 36, 39, 40, 46
Bley-Vroman, R. 71, 117, 119, 132, Grimshaw, J. 109
134
Bohannon, J. N. 11, 46 H
Bonvillian, J. 11 Hanlon, C. 11
Brown , G. 33 Hatasa, Y. 53, 55, 56, 57, 58
Brown, R. 11 Hermon, G. 118
Burzio, L. 89 Herron, C. 11, 12
Hirakawa, M. 90
C Hopper, P. 78
Chomsky, N. 118 Hornstein, N. 115
Cole, P. 118 Huang, C.-T. J. 115, 116
Conrad, C. A. 54
Crookes, G. 34 I
Inagaki, S. 34, 46
D
Denninger, M. 11 J
Doughty, C. 32 Johannessen, J. 142, 159
Jorden, E. 14, 74, 75
E
Eimas, P. 149 K
Eubank, L. 11, 12 Kageyama, T. 4, 90, 91, 93, 96, 109,
110
176 NAME INDEX
Kanagy, R. 33 Oliver, R. 12
Kanno, K. 3, 71–76, 83, 85, 86, Oller, J. W., Jr. 54
100, 108, 110, 149, 155 Oshita, H. 90
Kaplan, B. 11
Katada, F. 117 P
Klein-Braley, C. 54 Palmer, A. 65, 67
Koizumi, M. 108 Pavasi, M. 9
Krashen, S. 10 Perlmutter, D. 89
Kuno, S. 92, 93, 115, 142, 160 Pica, T. 2, 31, 32, 33, 35, 36, 42, 44,
46, 47
L Pinker, S. 11
Larsen-Freeman, D. 1 Porter, P. A. 32, 33
Lasnik, H. 115, 116, 117
Levin, B. 91, 94 R
Li, A. 94 Raatz, U. 54
Lighfoot, D. 115, 118 Rabie, S. R. 34, 46
Linnell, J. 32, 43 Rappaport Hovav, M. 91, 94
Long, M. 1, 10, 32, 33, 34, 36, Richards, N. 118
46, 149 Richardson, M.-A. 12
Loschky, L. 34 Rosen, S. 109
Ross, J. 142, 159
M Rulon, K. 34
Mackey, A. 32, 47
Martin, S. E. 14 S
McDonald, D. 34 Saito, M. 92, 93, 115, 116, 117
McLaughlin, B. 60 Sato, C. 36
McNamara, T. 55, 59 Saxton, M. 2
MacWhinney, B. 11 Schachter, J. 11
Mito, K. 12, 14, 23, 25, 27, 34, 46 Schmidt, R. 9
Miyagawa, S. 90, 110 Schneider, W. 60
Miyata, H. 95 Schwartz, B. 11, 149
Seliger, H. 10
N Sharwood Smith, M. 55, 59, 60
Nakayama, M. 108 Shiffrin, R. M. 60
Neijt, A. 160 Shortreed, I. M. 33
Nelson, K. E. 11 Snow, C. 11
Nishigauchi, T. 93, 110 Sokolov, J. 11
Noda, M. 14, 74, 75 Sorace, A. 90
Spence-Brown, R. 55, 59, 65
O Sprouse, R. 149
O’Grady, W. 118, 149, 150, 159, 160, Stanowicz, L. 11, 46
161 Sung, L.-M. 118
van Oirsouw, R. 160 Swain, M. 3, 9, 31, 42
NAME INDEX 177
T W
Takami, K. 115 Weinberg, A. 115
Takezawa, K. 93, 110 White, L. 9, 119
Teramura, H. 96 Wilkins, D. 10
Thompson, S. 78
Tohsaku, Y.-H. 53, 55, 56, 57, 58 Y
Tomasello, M. 11, 12 Yip, V. 90
Tsujimura, N. 90 Yoshinaga, N. 117, 132, 134, 154
Young-Sholten, M. 133
U Yuan, B. 90, 94, 110
Uziel, S. 149 Yule, G. 33, 34
V Z
Vainikka, A. 133 Zobl, H. 90
Varonis, E. M. 32, 33, 36, 39, 40, 46
C I
Case drop/deletion/omission 3–4, 72, Interaction Hypothesis 10
74–76, 93–94, 95–96, 98, 100, Interactional moves
104-107, 108, 109, 110 Clarification request (CR) 31–32,
Comprehensible output 2–3, 33 36–37
Comprehensible Output Hypothesis Confirmation check (CC) 31–32,
31 36–37
Computational mechanisms 149, 150
Control 54, 55, 59–60, 63, 64 L
Controlled processing 60 L2 Continuity Hypothesis 149–150, 154
Learner-learner (NNS-NNS) interaction
D 32–33, 46–47
Deep unaccusativity 91, 98, 101–103, Lexical government 72, 93, 116–117
107, 109 see also Proper government and ECP
see also Surface unaccusativity Linguistic knowledge 59, 60, 63
Phonological/grammatical knowledge
E 64
Empty Category Principle (ECP) 4, Locative construction 15
71–72, 93–94, 116–117
M
F Models/modeling 2, 11, 12, 13, 19–20,
Feedback 31, 33 22, 25, 26
Role of negative feedback 9–10, Modeling condition 16–17
11–12 Modified output (MO) 31–32, 37–38,
42
180 SUBJECT INDEX