Professional Documents
Culture Documents
1993 Long
1993 Long
0. INTRODUCTION
There are numerous theories of SLA. They differ in form, type, source, and
scope, and as is shown in part 1 of this paper, many are oppositional. SLA
researchers' social responsibilities and the increased likelihood of SLA's
success as a discipline are argued in part 2 to justify some theory culling. In
part 3, minimum requirements on the form and content of SLA theories are
discussed. From a rationalist position, part 4 describes theory assessment
strategies utilized in other disciplines: namely, 16 assessment strategies
proposed by Darden (1991), and 17 criteria from the general philosophy of
science literature, at least four of which seem valuable for SLA theories. Part 5
addresses the problematic issue of whether it is possible to identify universally
valid criteria for theory evaluation, either of single theories in isolation or of two
or more theories comparatively.
or weaknesses, but each also offers some unique insight. The danger with this
attitude is that it can lead to the kind of eclecticism in SLA that has long afflicted
and Laudan's and Chalmers' views are very useful for people working on those
issues.
most learners, alone will be inadequate, although principles held to govern all
possible human languages (including interlanguages) will be relevant at any
1982, Bloor 1984, Shapere 1986, Hull 1988, Diesing 1991, Martin 1991,
Mulkay 1991). Gould (1981), for instance, recounts grisly episodes in the
Much SLA research is not experimental, not only because of its relatively short
history, but because some of the key variables involved—age of onset, aptitude,
memory, native language background, etc.—are not susceptible to random
assignment (although they are to structural equation modeling) and so are
typically examined in quasi-experimental criterion group designs, and because
some questions in social science are simply not amenable to or best studied via
experimentation. Moreover, Lightbown (1984) and Santos (1989) have drawn
attention to the scarcity of replication studies in SLA and applied linguistics.
Nevertheless, examples exist of the accountability of SLA theory to conflicting
evidence—including cases of falsification—especially accountability to experi-
mental results, often with theorists' explicit recognition that such accountability
is necessary.
First, there is the consistent failure (described in Long and Sato 1984) of
error analysis studies to support early claims about LI transfer. This was a
major factor forcing the abandonment of the strong form of the Contrastive
Analysis Hypothesis in the 1970s (see, for example, Wardaugh 1970, Janicki
1990). Another example is the way in which in the mid-1970s the results of
quasi-experimental studies of the 'natural order' for ESL morphology led
Krashen to recognize two additional constraints (knowledge of the rule and
focus on form) to the one he had initially hypothesized (time). Krashen (1979:
155-6) explicitly recognized this, and in the same paper he articulated his view
of scientific method, lending support to the idea that SLA theorists at least
attempt to adhere to the rationalist conventions described by Newton-Smith
and others:
. . . our generalizations \sic\ need to be able to predict. They are not merely categories
for previously existing data, but must fit data gathered after the hypotheses were
formulated. The way we test our generalizations [sic], then, is to see whether they
predict new data. If they do, we are still in business, but if they do not, we have to change
the hypothesis, alter it. If these alterations cause major changes in the fundamental
assumptions in the original generalizations, make it too ad hoc, too cumbersome, we
may have to abandon the hypothesis. (Krashen 1979: 159)
settled the question concerning the extreme version of the general factor hypothesis
was conducted by Bachman... and Palmer In a very carefully designed experiment,
4. ASSESSMENT STRATEGIES
A potentially useful approach to theory assessment is to be found in the work of
Darden (1991), based on her study of theory change in the history of genetics.
The strategies she outlines may not be appropriate for all areas of, or
approaches to, SLA and applied linguistics, but they are worth consideration by
researchers interested in explanations of learning processes and outcomes
which can be generalized to new educational and other social settings.
One of the appealing qualities of Darden's analysis is her grouping of assess-
ment strategies into five sets distinguished partly according to function. This
may eventually help demystify the wide range of evaluation criteria to which
scientists subscribe. Some of the differences, for example, may be a matter of the
phase in theory development at which the criteria are appropriately applied.
Some differences may also reflect confusion over the dual function many strate-
gies can serve, as constraints on theory construction and/or criteria for theory
evaluation, which Darden sees as interdependent processes (see also Beretta
and Crookes, this issue; Long 1990b). Finally, some differences may be the
consequence of genuine tensions among the outcomes of application of
different criteria.
Darden's first set consisfs of two criteria applicable before or independent of
empirical testing of a theory. Internal consistency refers to the need to avoid
internal contradictions among a theory's components. This would be the case,
for example, in a theory which assumed continuing access to Universal
Grammar to account for the successful acquisition of some complex syntactic
structures by adult acquirers, and use of general problem-solving procedures to
explain the same adults' failure to acquire other structures, when type of
236 ASSESSMENT STRATEGIES FOR SLA THEORIES
structure was not a distinction made in the theory (see, for example, Felix 1985).
Non-tautologousness means, obviously, that a theory should not consist of
valued than a claim that children follow one sequence and adults another.
Darden notes, however, that greater generality does not always mean greater
Again, the Pinker and Bloom (1990) claim about language and natural selection
provides an illustration.
All of Darden's assessment strategies can be used to evaluate single theories
The theory [of instructed second language acquisition) presented here has two primary
purposes. The first is to provide a set of statements or hypotheses about classroom L2
learning which are testable and, therefore, falsifiable. In this sense, the theory aims to be
scientific, as defined by Popper (1976). (Ellis 1990:174, emphasis in the original)
It also has its detractors (see Schumann, this issue). Falsifiability as an assess-
ment strategy, it should not be forgotten, assumes a belief in the existence of
truth and falsity on the part of the person applying it, and so makes it (and
indeed, probably the whole notion of theory evaluation, other than from an
aesthetic standpoint) a non-issue for relativists, who do not accept the notion of
objective truth.
In the light of the problems of falsifiability as a requirement on theories, it
would be better not to try to make it one—especially not an absolute require-
ment in the early stages of theory construction. Rather, a falsifiable theory may
be judged better than an unfalsifiable one in later comparative evaluations. The
notion has additional value. As one form of accountability to data, it is likely to
encourage adherence to related desirable qualities in a theory, such as clarity
and explanatory adequacy. Finally, when evaluating one or more theories,
whether or not they have in fact been falsified is a reasonable, indeed critical
measure to apply, but as part of a more general criterion of empirical
adequacy.11
5. UNIVERSAL STRATEGIES?
Are any assessment strategies universally valid? It would be tempting to believe
that while theories might differ, scientists would at least agree on how to
evaluate them. It is clear, however, that there is no consensus on a universal set
of criteria at present. Indeed, sociologists of science (for example, Mulkay and
Gilbert 1984/1991) have found that scientists vary both as a group in the
criteria they claim to apply in assessing their own and others' theories, and also
as individuals by applying criteria inconsistently.
Independent of the sociological findings or historical record, philosophers of
science have noted the logical impossibility of a current consensus on assess-
ment strategies. The problem, addressed explicitly by Laudan and Laudan
(1989) and Laudan (1990), is that if scientists all adhere to the same scientific
method (lthe scientific method' espoused by positivists, or any other), if they
(supposedly) have access to the same body of 'accepted findings', if they accept
the same set of standards for evaluating theories (and we must recognize we are
implicitly urging this if we advocate a rational narrowing of the range of
theories the field should consider), and if they are rational, how do we explain
the fact that they (ever) disagree? And if standards for theory acceptance are
240 ASSESSMENT STRATEGIES FOR SLA THEORIES
fact that both the theory and the assessment criteria used to evaluate it may turn
out to be false, and the need always to alert practitioners to this.
NOTES
1
This is a revised version of a plenary address to the Applied Linguistics at Michigan
State University conference on Theory Construction and Methodology in Second
Language Acquisition Research, 4-6 October 1991. I thank Alan Beretta, Graham
Crookes, Kevin Gregg, Larry Laudan, Charlie Sato, and two anonymous reviewers for
helpful comments on the earlier version. Remaining errors are my own responsibility.
2
In what follows, only one example of each kind of SLA theory is cited to illustrate
these differences. Numerous examples exist of some categories.
3
While competence is claimed to be the proper domain of inquiry, researchers in the
Chomskyan tradition are as obliged as those in any other to investigate competence by
studying performance, e.g. on grammaticality judgment tasks and via other forms of
introspection.
4
Eckman (1991) and Sharwood-Smith (1991) provide interesting commentaries on
the debate in Applied Linguistics between Gregg and Ellis and Tarone on this issue.
Eckman argues that the relevance of variation data is an empirical matter, not something
to be decided a priori.
5
Responding to an audience question at the 1992 Michigan State University
conference, Schumann announced that he now rejected his earlier (1983) relativist
stance, and declared himself a realist (in the philosophical sense).
6
I once asked a famous applied linguist, a devout Christian, how he reconciled the
rigor of his experimental work with his religious beliefs. After thinking for a moment, he
replied, 'You know, I often ask myself the same thing. Of course, I can test them in one
way. When I die, I'll either go to heaven or to hell.' Then, after a pause, 'The problem is, it
isn't replicable.'
7
Prescriptions for language teaching based on SLA theories which fail to provide
explanations are also misleading if presented as 'theory-based'.
8
One Applied Linguistics reviewer proposed structural equation modeling as a more
useful alternative to experimentation, arguing that 'conceptually, it recognizes that
causation is an unrealistic goal, but that finding a reasonable explanation, or an
explanation that is clearly more consistent with the data than other explanations, is a rea-
sonable goal. Methodologically, it lends itself much more readily to the investigation of
phenomena including multiple variables or factors, which it seems to me, is now a neces-
sary characteristic of SLA research.'
9
For a vivid example in SLA research, see Eubank (1991).
10
A recent example in SLA of 'prediction' of an earlier, but unknown, finding
concerns sensitive periods in language development. Long (1990a) hypothesized the
closure of the first sensitive period, for suprasegmental phonology, to occur as early as
age six, not at puberty, as traditionally claimed in the literature. Chambers (1992: 689)
cited a just translated finding, hitherto available only in Japanese, by Sibata (1958), to the
effect that of 500 children still in Shirakawa in 1949 after being moved from Tokyo and
Yokahama five or six years earlier to escape US bombing, children aged six or seven on
arrival had acquired the pitch-accent features of the Shirakawa dialect almost perfectly by
the time of the interview, while those aged fourteen or over on arrival had not modified
their dialect at all.
244 ASSESSMENT STRATEGIES FOR SLA THEORIES
1
' At least sixteen other theory-assessment criteria are advocated in the history and
philosophy of science literature. Some overlap with those discussed above, but most
differ in emphasis, at least, and repay careful study. They are: fertility as a paradigm for
REFERENCES
Atkinson, M. 1987. 'Mechanisms for language acquisition: learning, parameter-setting
and triggering.' First Language 7: 3-30.
Bachman, L. F. and A. S. Palmer. 1982. 'The construct validation of some components
of communicative proficiency.' TESOL Quarterly 16/4: 449-65.
Barnes, B. and D. Bloor. 1982. 'Relativism, rationalism and the sociology of knowledge'
inM. Hollis and S. Lukes (eds.): Rationality and Relativism. Oxford: Blackwell.
Ben-David, J. and R. Collins. 1966. 'Social factors in the origins of a new science: the case
of psychology.' American Sociological Review 31: 451-65.
Beretta, A. 1991. 'Theory construction in SLA. Complementarity and opposition.'
Studies in Second Language Acquisition 13/3: 493-512.
Bialystok, E. 1991. 'Metalinguistic dimensions of bilingual proficiency' in E. Bialystok
(ed.): Language Processing in Bilingual Children. Cambridge: Cambridge University
Press.
Bley-Vroman, R. 1989. 'The logical problem of foreign language learning.' Linguistic
Analysis 20: 1-2; 3-49.
Bloor, D. 1984. 'The strengths of the strong programme' in J. R. Brown (ed.): Scientific
Rationality: The Sociological Turn. Dordrecht: Reidel.
Broad, W. and N. Wade. 1982. Betrayers of the Truth. Fraud and Deceit in the Halls of
Science. New York: Simon and Schuster.
Brush, S. 1989. 'Prediction and theory evaluation: The case of light bending.' Science
246:1124-9.
Campbell, D. T. 1988. 'A tribal model of the social system vehicle carrying scientific
knowledge' in S. Overman (ed.): Methodology and Epistemology for Social Science:
Selected Papers. Chicago: University of Chicago Press.
Chalmers, A. 1990. Science and its Fabrication. Milton Keynes: Open University Press.
Chambers, J. K. 1992.'Dialect acquisition.' Language 68/4: 673-705.
Clahsen, H. 1987. 'Connecting theories of language processing and (second) language
acquisition' in C. Pfaff (ed.): First and Second Language Acquisition Processes.
Cambridge, MA: Newbury House.
Collins, R. 1989. 'Toward a theory of intellectual change: The social causes of
philosophies: Science, Technology, and Human Values 14/2: 107-40.
MICHAEL H. LONG 245