Professional Documents
Culture Documents
Lietal 2016 MLJ
Lietal 2016 MLJ
Lietal 2016 MLJ
net/publication/299409021
CITATIONS READS
99 1,664
1 author:
Shaofeng Li
Florida State University
80 PUBLICATIONS 4,066 CITATIONS
SEE PROFILE
Some of the authors of this publication are also working on these related projects:
Language aptitude: Advancing theory, testing, research and practice. By: Wen, Skehan, Biedron, Li & Sparks (Routledge, 2019) View project
All content following this page was uploaded by Shaofeng Li on 21 January 2021.
The article reports on a study investigating the comparative effects of immediate and delayed corrective
feedback in learning the English past passive construction, a linguistic structure of which the learners
had little prior knowledge. A total of 120 learners of English as a foreign language (EFL) from 4 intact
classes at a Chinese middle school were randomly assigned to conditions: immediate feedback, delayed
feedback, task-only, and control. The 3 experimental groups attended a 2-hour treatment session where
they performed 2 dictogloss (narrative) tasks in groups, each followed by a reporting phase in which
they took turns telling the narrative to the class. The 2 feedback groups received either immediate or
delayed corrective feedback in the form of a prompt, followed by recasts of utterances containing errors
in their use of the target structure. No effect for the corrective feedback was found on elicited imitation
test scores, but both the immediate and delayed feedback resulted in gains in grammaticality judgment
test scores, with immediate feedback showing some advantage over delayed feedback. We interpret these
results as showing that the feedback only aided the development of declarative/explicit knowledge and
that the advantage found for immediate feedback was due to the learners using the feedback progres-
sively in the production of new past passive sentences, whereas this did not occur in the delayed feedback
condition.
Keywords: feedback timing; immediate feedback; delayed feedback
Note. ∗ In the delayed feedback condition, CF was provided after both tasks were completed.
Delayed feedback was provided after the com- Lyster, 2004; Yang & Lyster, 2010) has treated dif-
pletion of the second task, and the procedure was ferent prompts as equivalent on the grounds that
the same as that for immediate feedback except they all push learners to self-correct. It might also
that in delayed feedback the teacher initiated the be argued that delayed feedback is more explicit
corrective episode by quoting a wrong utterance than immediate feedback so that any difference
a learner had produced when performing the in the effects of the timing is just a reflection of a
tasks. The errors that were corrected were those difference in salience. In fact, efforts were made
recorded by the teacher and another researcher to ensure that the immediate feedback was salient
who sat in the class during the students’ task per- to the learners through prosodic emphasis. While
formance. The delayed feedback was provided as some recasts are implicit in nature, corrective re-
follows: casts are clearly more explicit. Also, any difference
in the degree of explicitness is an inherent fea-
(1) The teacher quoted a learner’s erroneous ture of the timing of the feedback. As we pointed
sentence and asked him/ her to correct it. out in the introduction, investigating the timing
For example, “Tom, you said ‘The driver of feedback is of obvious pedagogical significance.
wanted to run away, but he stopped by a We argue that the timing of feedback needs to be
policeman.’ Can you say it correctly?” The investigated in an ecologically valid way and that
teacher then paused 3–5 seconds for a re- attempts to rigorously control for inherent differ-
sponse. ences that arise from the timing of feedback are
(2) If the student self-corrected the teacher inappropriate.
stopped there. Otherwise the teacher pro- Table 1 provides the information about the in-
ceeded to step 3. structional treatment in the two feedback condi-
(3) The teacher reformulated the sentence, tions including the length and instances of feed-
highlighting the passive part, “he WAS back treatment, the number of errors, and the
STOPPED by a policeman.” The teacher number of learners who committed an error. As
then moved on to the next error. can be seen, the duration of feedback and num-
ber of presenters are very similar between the two
The feedback in both conditions consisted of groups, although there are slightly more instances
an output-prompting move that encourages self- of feedback and more errors in the immediate
repair followed by an input-providing move that feedback condition.
provides positive evidence, thus catering to both
Lyster and Ranta’s (2013) claim that prompts en-
gage learners in deeper cognitive processing of Testing
the linguistic target and to Goo and Mackey’s
(2013) claim that recasts facilitate the learning of Treatment effects were measured via an un-
new structures. timed GJT and an EIT, which were intended to
While every attempt was made to minimize the provide measures of the learners’ explicit and im-
differences between the immediate and delayed plicit knowledge of the target structure, respec-
feedback, the timing of the feedback inevitably tively (Li, 2013, 2014). The GJT asked the learn-
led to some differences. For example, it could ers to judge whether an item was grammatical
be argued that prompts differ in the delayed and or ungrammatical and correct the error if it was
immediate feedback in that the former consti- ungrammatical. The EIT required each learner
tutes an elicitation and the latter a repetition to verbally repeat some sentences (grammatical
of the learner’s erroneous utterance. However, and ungrammatical) presented in an aural mode.
previous research (e.g., Ammar & Spada, 2006; Both tests had three versions—pretest, posttest 1,
284 The Modern Language Journal 100 (2016)
and posttest 2—and the three versions were cre- injure on my way to school today”), of which 30
ated by randomly scrambling the same items. concerned the past passive (the target structure),
Both tests included 30 target items: Twenty con- and 10 were distractors. Half of the target items
tained regular verbs and 10 irregular verbs; 20 (n = 15) were grammatical and half (n = 15) un-
were old items that targeted verbs from the treat- grammatical. Previous research reported no dif-
ment tasks, and 10 were new items, with all the ference between learners’ responses to grammat-
new items targeting regular verbs. The old items ical and ungrammatical items in an EIT (Erlam,
were equally distributed between the two tasks 2006). The statements were comparable in length
(i.e., 10 each), and the two tasks also contributed (each containing around 10 words) and complex-
an equal number of regular and irregular verbs. ity. The stimuli, read at a normal speed by a na-
To minimize the possibility of the learners trans- tive speaker and recorded using a digital recorder,
ferring their answers from one test to the other, were presented via DMDX, a free software pro-
the sentence stimuli in the GJT were different gram used in psycholinguistic studies to present
from those in the EIT, but the contexts for the auditory stimuli and to control or record reac-
obligatory use of the target structure were the tion time. During the test, the learners heard the
same in the two tests. statements one at a time, and after hearing each
Based on the results of a pilot study where 24 statement, they were asked to decide whether it
learners from the same cohort performed the was true, not true, or whether they were not sure,
two treatment tasks, three types of errors were and then repeat the sentence in correct English
included in the ungrammatical items: (a) no regardless of whether it was true of their per-
“be,” as in “Tony badly injured in a fight with a sonal life/experience. One point was given if a
friend,” (b) bare verb, as in “Two men kill in the response contained the correct form of the past
accident,” and (c) no past participle, as in “This passive, and because self-corrections may involve
morning Helen was knock down in the street.” the use of explicit knowledge, only first attempts
To validate the test items, nine native speakers of were scored. A reliability analysis showed that the
English, who were either applied linguists or ex- Cronbach’s alpha for the EIT was .68, .76, and .77
perienced ESL teachers, were invited to respond for the pretest, the immediate posttest, and the
to and comment on a large pool of sentences in delayed posttest, respectively.
terms of grammaticality and wording. The piloted Unlike previous studies (e.g., Ellis, Loewen, &
items were composed by the researchers using Erlam, 2006) where reaction time was not con-
vocabulary from the learners’ textbook and the trolled, thus increasing the chances for learners
treatment tasks. to access their explicit knowledge, in this study a
The GJT was developed and presented via time limit was imposed on each item. The learners
an online computer program. The test included were required to judge the veracity of a sentence
40 items, 30 of which related to the past passive and then repeat it within the time allocated, af-
(the target structure), and 10 of which were dis- ter which the program moved on to the next item
tractors relating to structures the learners had regardless of whether the previous item was re-
been taught, including the third person –s, sim- sponded to. Because the items in the test varied in
ple past, comparative adjectives, and prepositions length, the reaction time allocated for each item
of time. Among the 40 items, 35 were ungrammat- was based on the average time taken to complete
ical, 5 were grammatical, and all 30 target items each item by 26 learners from classes other than
were ungrammatical. The use of ungrammatical those selected for the main study.
items in a GJT as a measure of explicit knowl-
edge was based on Gutiérrez’s (2013) finding that Procedure
learners’ responses to grammatical and ungram-
matical items load on separate factors, with the The study spanned 3 days. On Day 1, the
former tapping implicit knowledge and the latter learners took the grammaticality judgment and
explicit knowledge. In scoring the learners’ an- elicited imitation pretest. One week later, on
swers, one point was given if an ungrammatical Day 2, the experimental groups received the in-
sentence was judged to be ungrammatical and the structional treatment, followed by the immediate
correct form of the passive construction was sup- posttests while the control group just completed
plied. The internal reliability for the grammatical- the tests. On Day 3, 2 weeks later, all learners
ity test, indexed by Cronbach’s alpha, was .91 for took the delayed posttests. To minimize the po-
the pretest and .96 for both posttests. tential modelling effect of a written test on an oral
The EIT consisted of 40 statements relating to test, the EIT always preceded the GJT. Through-
the learners’ personal experience (e.g., “My knee out the study, all four groups of participants
Shaofeng Li, Yan Zhu, and Rod Ellis 285
TABLE 2
Means and Standard Deviations for the Grammaticality Judgment Test
Note. a The number of participants ranges from 28 to 30 across groups and tests after outliers were removed. b The
highest possible (full) score is 30.
continued with their normal instruction, the hoc pairwise comparisons. Because the assump-
learners and their teachers were never informed tion of homogeneity of variances of ANOVA was
of the purpose of the project, and the teach- violated in most cases, the F test was performed
ers did not provide any instruction on the target using the Brown–Forsythe adjustment, and the
structure. post hoc comparisons were conducted with the
Dunnett T3 corrections. In pairwise comparisons,
Analysis in addition to using the p value to evaluate the
significance of a difference in mean scores, the
We elected to investigate both total accuracy
effect size in the form of Cohen’s d was calcu-
scores for past passive and accuracy scores sep-
lated to show the magnitude of the difference. Co-
arately for regular and irregular past participles
hen’s d is primarily based on mean difference and
in past passive constructions. The decision to
thus overcomes the limitation of the sole reliance
examine regular and irregular past participles
on the results of null-hypothesis significance test-
separately was motivated by research that has
ing (i.e., the p value), which is sensitive to sam-
shown that the acquisition of these differs. Ullman
ple size and prone to Type II error. Following
(2001), for example, proposed that irregular past
Cohen’s benchmarks for interpreting effect sizes,
verb forms are lexical and thus stored in declara-
.2, .5, and .8 were considered as small, medium,
tive memory (i.e., they are explicit) whereas regu-
and large effects, respectively.
lar forms can eventually be computed in procedu-
ral memory (i.e., they involve the implicit system).
From this perspective, the clearest test of whether RESULTS
the feedback contributed to the learners’ implicit
Grammaticality Judgment Test Results
knowledge was the effect it had on passive con-
structions involving the regular past participle. Overall. The descriptive statistics appear in
To prevent misleading conclusions resulting Table 2 while the group means are graphically
from the influence of extreme values, outliers in displayed in Figure 1. These results show that
each group in each test were identified and re- (a) the learners’ pretest scores were very low (the
moved prior to analysis. To identify outliers, raw group means ranged 1.29–1.89 out of 30), sug-
scores were transformed into z scores (standard- gesting that they possessed almost no knowledge
ized scores for which the mean is zero and stan- of the English past passive, (b) there was very little
dard deviation is 1), and any score 2.5 standard between-group variation in the learners’ pretest
deviation units above or below the mean was con- scores, and (c) all learners performed better on
sidered an outlier. To explore whether the learn- the posttests than the pretest, and (d) the experi-
ers’ test scores varied as a function of the type of mental groups showed higher scores than the con-
treatment they received and the timing of tests, trol group on both posttests.
a mixed design repeated measure analysis of vari- The repeated measures ANOVA showed a sig-
ance (ANOVA) was conducted, with group (treat- nificant effect for time, F(2,214) = 40.84, p <
ment type) as the between-group variable, and .00, for group, F(3,107) = 3.13, p = .03, and
scores on the pretest and the two posttests as the for time × group interaction, F(6,214) = 57.32,
within-group variable. If the initial mixed design p < .00. These results indicate that the scores
ANOVA detected significant main or interaction of the learners were different across the three
effects, one-way ANOVAs were conducted to lo- time points, that overall there were differences
cate the source of significance, followed by post between the four groups’ test scores regardless
286 The Modern Language Journal 100 (2016)
FIGURE 1
Group Means on the Grammaticality Judgment Test Over Time
10
Group Means
Task + Immediate Feedback
5 Task + Delayed Feedback
Task Only
Control
0
Pretest Posttest 1 Posttest 2
Note. a The number of participants ranged from 28 to 30 across groups and tests after outliers were removed.
b Percentage scores were used because of the unequal numbers of regular (n = 20) and irregular (n = 10) verbs
in the test.
TABLE 5
Post Hoc Comparisons for Regular and Irregular Verbs on the Grammaticality Judgment Test∗
Posttest 1 Posttest 2
Group Contrasts da pb d p d p d p
Immediate Feedback vs. Control 1.13 .00 .75 . 01 .76 .03 .48 .11
Delayed Feedback vs. Control .93 .01 .88 .01 .63 .25 .38 .64
Task-Only vs. Control .45 .83 .57 .23 .59 .43 .58 .27
Immediate Feedback vs. Delayed Feedback .03 .99 −.27 .99 .25 .76 .27 .48
Immediate Feedback vs. Task-Only .77 .01 .31 .72 .27 .65 .06 1.0
Delayed Feedback vs. Task-Only .63 .08 .54 .42 .03 1.0 −.19 .77
Note. ∗ The F tests for the posttest scores were performed using the Brown–Forsythe adjustment and the post hoc
comparisons were conducted with the Dunnett T3 corrections. a Effect size (Cohen’s d). b Results of null hypothesis
significance testing.
immediate feedback was significantly higher than statistics reported in Table 6 showed that overall
that for task-only in the learning of regular verbs. there was little difference in the learners’ perfor-
At the time of posttest 2, the only significant mance on the old and new items.
difference was found between immediate feed- The one-way ANOVA did not detect signifi-
back and control in the learning of regular cant differences between the four groups’ pretest
verbs. On both posttests, the two feedback groups scores for either old or new items. Mixed de-
(in comparison with the control group) showed sign ANOVAs found that for both old and new
larger effect sizes for their performance on regu- items, there were significant main effects for time
lar verbs than irregular verbs. and group and significant interactions between
Old Versus New Items. An analysis was conducted time and group (Appendix B). One-way ANOVAs
to investigate whether the treatment effects in- and post hoc T-tests (Table 7) showed that at
cluded new items or were restricted to old items. the time of posttest 1, both immediate and de-
Recall that the GJT included 30 items, out of layed feedback groups outperformed the control
which 20 concerned verbs that appeared in the group on both old and new items while imme-
treatment tasks and 10 involved new verbs. Among diate feedback worked better than task only on
the 20 old items, 10 targeted regular verbs and 10 old items. At the time of posttest 2, only the im-
irregular verbs; the 10 new verbs were all regular. mediate feedback group performed significantly
Given the difference in the learners’ performance better than the control group in the learning of
for regular and irregular verbs, the analysis was old items, and there were no other significant
only conducted for regular verbs. The descriptive differences.
288 The Modern Language Journal 100 (2016)
TABLE 6
Means and Standard Deviations for the Grammaticality Judgment Test: Old Versus New Regular Verb Items∗
Note. ∗ All new items are regular verbs. a Items that appeared in the treatment tasks. b New items. c The number of
participants ranged from 28 to 30 across groups and tests after outliers were removed. d The highest possible (full)
score is 10.
TABLE 7
Post Hoc Comparisons for Old and New Items on the Grammaticality Judgment Test∗
Posttest 1 Posttest 2
Group Contrasts da pb d p d p d p
Immediate Feedback vs. Control 1.08 .00 .97 .00 .82 .02 .64 .09
Delayed Feedback vs. Control .88 .02 .90 .02 .57 .41 .62 .33
Task-Only vs. Control .20 .95 .57 .81 .67 .06 .59 .71
Immediate Feedback vs. Delayed Feedback .12 .98 −.09 1.0 .40 .39 .13 .39
Immediate Feedback vs. Task-Only .90 .01 .47 .07 .12 .99 .13 .99
Delayed Feedback vs. Task-Only .71 .09 .50 .21 .02 .69 .00 .69
Note. ∗ The F tests for the posttest scores were performed using the Brown-Forsythe adjustment and the post hoc
comparisons were conducted with the Dunnett T3 corrections. a Effect size (Cohen’s d). b Results of null hypothesis
significance testing.
TABLE 8
Means and Standard Deviations for the Elicited Imitation Test
Group na Mb SD n M SD n M SD
Note. a The number of participants ranged from 28 to 30 across groups and tests after outliers were removed. b The
highest possible (full) score is 30.
Elicited Imitation Test Results are plotted on the graph in Figure 2. It can
be seen that the learners’ scores were low on
The EIT aimed to gauge the learners’ implicit the pretest as well as the two posttests, with the
knowledge of the target structure. The means group means ranging from 1.19 to 4.10, and
and standard deviations for each group over that all groups improved from the pretest to the
time are displayed in Table 8, and the means posttests.
Shaofeng Li, Yan Zhu, and Rod Ellis 289
FIGURE 2
Group Means on the Elicited Imitation Test Over Time
Group Means
Task + Immediate Feedback
Task + Delayed Feedback
Task Only
Control
0
Pretest Posttest 1 Posttest 2
A repeated measures ANOVA revealed a sig- EIT posttests. If these tests are taken as measures
nificant effect for time, F(2,200) = 41.42, p < of explicit and implicit knowledge, respectively,
.00, but there was no significant effect for group the conclusion is that the feedback contributed
(treatment), F(3,100) = .92, p = .81, or for time to the learners’ explicit knowledge but not their
× group interaction, F(6,200) = .92, p = .43. One- implicit knowledge. This result differs from some
way ANOVAs found no difference in the groups’ other studies that have investigated the effects of
pretest scores, F(3,109) = 1.57, p = .20, in their recasts (e.g., Mackey & Philp, 1998; Révész, 2012),
scores on the immediate posttest, F(3,114) = .25, which report results showing that recasts lead to
p = .86, or in their scores on the delayed posttest, improved accuracy in the kinds of language use
F(3,116) = 1.20, p = .31. The significant time ef- (e.g., free oral production) likely to tap implicit
fect was due to the increase in the learners’ over- knowledge. Why, then, did the corrective recasts
all performance from the pretest to the posttests. in this study not lead to improved performance in
These results suggest that the improvement in the the EIT?
learners’ posttest scores in comparison with their There are a number of possible answers to this
pretest scores was attributable to practice effects question. One is that the EIT failed to capture
and that there were no effects for the instructional gains in implicit knowledge. The test differed
treatments. The data were subjected to further from the EIT used in other studies (e.g., Zhang,
analysis to explore the impact of other variables 2015) in that it was administered via computer
such as regular versus irregular and old versus rather than face-to-face and that the response
new, but the analyses failed to detect any signifi- time for each sentence was restricted. The aim was
cant treatment effects. to make it difficult for learners to draw on their
explicit knowledge, but it is possible that the EIT
DISCUSSION was simply too demanding on these learners’ pro-
cessing abilities to capture any changes that had
We begin by considering what the results show taken place in their implicit knowledge. A sec-
about the effect of the feedback on the learners’ ond explanation is that these learners were not
implicit and explicit knowledge of the target struc- developmentally ready to acquire the target struc-
ture (past passive). We then provide answers to ture. The past passive is morphologically com-
the research questions by examining the differen- plex (Williams & Evans, 1998) and difficult to
tial effects of the immediate and delayed CF on acquire. The third explanation is that the instruc-
the learners’ explicit knowledge. tion provided in this study was insufficient to ef-
fect development of implicit knowledge. It con-
Effects of the Instruction on Learners’ Explicit and sisted of two tasks completed in the same lesson
Implicit Knowledge lasting only 2 hours. Arguably, more intensive in-
struction is needed to have any effect on learn-
The two tests used in the study—the GJT and ers’ implicit knowledge of a difficult grammatical
the EIT—were intended to provide measures of structure such as the past passive. The final expla-
learners’ explicit and implicit knowledge of the nation is that the learners in this study were accus-
target structure (past passive). The results of these tomed to learning explicitly, so that even though
two tests are clear-cut. The immediate and de- no explicit instruction was provided, they used
layed corrective feedback resulted in statistically the input from the tasks and the feedback pro-
significant gains in the GJT posttests but not in the vided to construct a metalinguistic representation
290 The Modern Language Journal 100 (2016)
of the past passive, and that the effort they put knowledge. In other words, there was no evidence
into achieving this blocked the incidental and im- of the symbiotic relationship between explicit
plicit processes that lead to implicit knowledge. and implicit learning that Long claimed recasts
This last explanation can also account for why the foster.
instruction had a very clear effect on the learners’
acquisition of explicit knowledge, as measured by Is Delayed Feedback Effective?
the GJT. We will now turn to consider the research
questions but only in terms of the effect that the Learners receiving the delayed feedback scored
two kinds of feedback had on the learners’ explicit significantly higher than the learners in the con-
knowledge. trol group on the measure of explicit knowledge
when the effects were tested immediately after the
Is Immediate Feedback Effective? treatment, irrespective of learners’ prior knowl-
edge, verb type, and whether the verbs appeared
The results showed that on posttest 1, the learn- in the treatment. The effects, however, were not
ers receiving immediate feedback outperformed sustained after two weeks. These results lend some
the control group in the GJT, regardless of their support to the pedagogic position outlined in the
previous knowledge, of whether the verbs were introduction, namely that delaying feedback until
regular or irregular, and of whether the test items learners have completed a communicative task is
were old or new. However, the effects of the im- desirable. However, the fact that the effects of the
mediate feedback were less evident in posttest 2. delayed feedback were not sustained suggests that
Much of the previous research involving im- the learning that results from delayed feedback
mediate feedback (e.g., Ellis et al., 2006; Yang & was shallow and that the declarative representa-
Lyster, 2010) has investigated the effect of recasts tions of the past passive that the feedback gener-
on structures of which learners already possessed ated were quickly lost. One anonymous reviewer
some prior knowledge and reported results that pointed out that the larger effects of delayed feed-
show it was effective. Doughty and Varela (1998) back at the time of the immediate posttest may
reported that corrective recasts—the type of re- also be due to the closer time proximity between
cast used in this study—were also effective in im- the treatment and the test.
proving learners’ accurate use of past tense. How-
ever, Long (2015) argued that recasts are needed Is There Any Difference in the Effects of Immediate and
for the acquisition of new linguistic features and Delayed Feedback?
went on to list their advantages: They provide
feedback that (a) is contextualized and motivat- This is the core question. Pairwise comparisons
ing because it is the learner’s message or linguis- of the posttest scores failed to show significant dif-
tic performance that is at stake, (b) is contin- ferences between the effects of the immediate and
gent and caters to the learner’s internal syllabus, delayed feedback. However, a closer look at the re-
(c) displays the error and the correct form in jux- sults suggests that the immediate feedback was su-
taposition so the learner immediately notices the perior: (a) Its effect sizes were larger than those
gap, (d) alleviates the learner’s processing burden of delayed feedback in 9 out of 10 pairwise con-
and increases the chances for a focus on form by trasts when each feedback type was compared with
only modifying the incorrect part while maintain- control and in 8 out of 10 contrasts when the two
ing the meaning, and (e) “capitalizes on a sym- feedback types were compared with each other,
biotic relationship between explicit and implicit and (b) the significant effects of immediate feed-
learning, instruction, and knowledge” (p. 317). back were sustained after 2 weeks, while the ef-
Goo and Mackey (2013) have also emphasized fects of delayed feedback were only present in the
the need to “examine L2 targets to which learn- immediate posttest. Again, though, these advan-
ers have never been exposed” (p. 153). This study tages for immediate feedback were only evident in
constitutes a start in this direction. the GJT; that is, they applied only to the learners’
The pretest showed that the learners had no explicit knowledge of the past passive.
or only very limited knowledge of the past pas- In the introduction we noted that a num-
sive. Immediate feedback in the form of correc- ber of SLA theories lend support to immediate
tive recasts helped them to learn this structure feedback: the Interaction Hypothesis, focus-on-
and to some extent maintain this learning over form, transfer appropriate processing, and skill
time. However, a caveat is in order. As already acquisition theory. We also noted that theories
noted, the immediate feedback only contributed in cognitive psychology (e.g., preparatory atten-
to the development of the learners’ explicit tion and memory theory and reactivation and
Shaofeng Li, Yan Zhu, and Rod Ellis 291
reconsolidation theory) predict that delayed feed- be traced to the methodological differences be-
back will be more effective. By and large the re- tween the two studies: (a) Quinn’s study was con-
sults of this study provide greater support to those ducted in a laboratory setting, while this study
SLA theories that support immediate feedback. was carried out in the classroom, (b) in Quinn’s
However, these theories were formulated to ex- study a recast was provided regardless of whether
plain how learners develop the procedural ability the learner was able to self-correct, while in this
to use those linguistic features they have acquired study a recast was only provided when self-repair
in free communication (i.e., implicit knowledge). failed, (c) Quinn provided pretreatment instruc-
In fact, this study failed to demonstrate that ei- tion on the target structure but this study did not,
ther type of feedback contributed to the develop- (d) the learners already had considerable prior
ment of implicit knowledge. Thus, what is needed knowledge of the target structure in Quinn’s study
is an explanation of why the immediate feedback but not in this study, (e) in Quinn’s study not all
was more effective for the development of explicit feedback in the delayed feedback condition was
knowledge. delayed because feedback was also provided be-
We suggest that the explanation lies in the dif- tween tasks, so the so-called delayed feedback for
ferent processing demands required by the two one task served as pretask instruction for the sub-
feedback conditions. It should be noted that there sequent task, thus confounding the immediate–
was no difference in the time between the errors delayed distinction. We argue that our study con-
and feedback. In both immediate and delayed stitutes a methodologically sounder comparison
feedback the feedback was provided immediately of the effects of the key variable, the timing of the
following the error. The difference lay in the con- feedback.
textual nature of the feedback. In the immedi- We note that the results of our study do mirror
ate feedback condition, learners received feed- those reported in Hattie and Timperley’s (2007)
back on the errors they had just committed as they synthesis of feedback studies in education re-
struggled to reconstruct the narratives. In the de- search. Hattie and Timperley reported an advan-
layed condition, the utterances containing errors tage for immediate feedback directed at inform-
were presented one at a time out of context. In ing students whether they were correct or incor-
the immediate condition, the feedback was linked rect (what they called ‘feedback about task’). We
continuously to the learners’ attempts to tell the argue that the feedback in this study was directed
story and to produce passive sentences. That is, at factual correctness and that, like the feedback
the learners had the opportunity to use the feed- that occurs in subject learning in general, it facili-
back they had received when producing new sen- tated the intentional learning of declarative infor-
tences involving the past passive as they contin- mation.
ued to tell the story. In the delayed condition the
learners were not required to produce their own CONCLUSION
sentences, only correct sentences that the teacher
presented to them. Skill acquisition theory can This study was undertaken to contribute to the
help to explain why the contextualized condition growing body of research on CF by investigat-
was more effective. DeKeyser (1998) talked about ing a variable that has received scant attention to
the importance of learners using their declara- date, the timing of feedback. It also sought to ex-
tive knowledge as a ‘crutch’ to support their at- tend current research by investigating the effect
tempts to communicate. In the present study, this of feedback on a grammatical structure that the
did not appear to aid proceduralization of the L2 learners had no or very little prior knowledge
past passive (i.e., no effects on the imitation test), of. The study compared the effectiveness of im-
perhaps because the ‘practice’ afforded by the mediate and delayed feedback consisting of cor-
two tasks was insufficient, but it helped to em- rective recasts on learners’ acquisition of English
bed the declarative representations of the target past passive. The pretest showed that the learners
structure more deeply in the learners’ memo- had almost no prior knowledge of this structure.
ries, which were therefore better sustained over Both immediate and delayed feedback proved fa-
time. cilitative of learning this new linguistic structure
Finally, we note that the results of this study with some evidence pointing to the superiority of
differ from those of other studies that have in- immediate feedback. However, the effects of the
vestigated immediate and delayed feedback—in feedback were only evident in a GJT indicating
particular Quinn (2014), which reported no dif- that it only contributed to the development of
ference in the effect of the timing of feedback the learners’ explicit knowledge. This is, perhaps,
on learning. The difference in the results can not so surprising. The learners had almost no
292 The Modern Language Journal 100 (2016)
knowledge of the target structure. If the process role that recasts play in the acquisition of implicit
of learning a structure such as regular past verb knowledge of new grammatical features.
forms, which may eventually be processed in pro-
cedural memory, involves an initial explicit repre-
sentation of it, as claimed by Ullman (2001) and ACKNOWLEDGMENTS
N. Ellis (2005), then, in the short term, feedback
of any kind can only be expected to contribute
We would like to thank Professor Dingfang Shu from
to such a representation. We found no evidence
Shanghai International Studies University for his gen-
that the feedback had any effect on the learn- erous support throughout the project. We would also
ers’ implicit knowledge of the regular past verb like to extend our sincere gratitude to the students and
forms. The learners would have needed substan- teachers at Huanan Experimental School in Danyang,
tially more opportunity to use and receive feed- Jiangsu Province, China, for their involvement in vari-
back before they could compute these forms in ous aspects of the study. In particular, we would like to
procedural memory. In other words, our study is acknowledge the following individuals for their support
perhaps best interpreted as showing that the im- and assistance: Suorong Zhang, Xuping Wang, Wen-
mediate rather than the delayed feedback assisted zhong Wang, Yun Wu, Yufang Yin, Bin Ding, Mingdi Li,
Huijun Chen, and Wenjun Zhou.
the initial stage of acquiring both irregular and
regular past participle forms.
The study was motivated by both pedagogical
and theoretical concerns. Teachers are often ad-
vised to delay providing feedback until learners REFERENCES
have completed a communicative task (Scrivener,
2005) on the grounds that providing feedback Aljaafreh, A., & Lantolf, J. (1994). Negative feedback as
during the performance of a task will have a nega- regulation and second language learning in the
tive effect on fluency and on the assumption that zone of proximal development. Modern Language
delayed feedback is effective. We did not investi- Journal, 78, 465–483.
gate whether the immediate feedback interfered Allwright, R. (1975). Problems in the study of the lan-
with fluency but just listening to the recordings guage teacher’s treatment of learner error. In
of the learners’ performance of the tasks in the M. Burt & H. Dulay (Eds.), On TESOL ’75: New di-
immediate feedback and delayed feedback con- rections in language learning, teaching, and bilingual
education (pp. 96–109).Washington, DC: TESOL.
ditions suggests that it did not. Clearly, though,
Ammar, A., & Spada, N. (2006). One size fits all? Re-
there is a need to investigate this possibility more casts, prompts, and L2 learning. Studies in Second
thoroughly. The results of the study have shown Language Acquisition, 28, 543–574.
that delayed feedback is effective—at least where Bitchener, J., & Ferris, D. (2012). Written corrective feed-
explicit knowledge is concerned—and this lends back in second language acquisition and writing. New
some support to the claim of teacher trainers such York: Routledge/Taylor & Francis.
as Scrivener. However, the study also pointed to Brumfit, C. (1984). Communicative methodology in lan-
the superiority of immediate feedback. Perhaps, guage teaching. Cambridge: Cambridge University
then, teachers should not feel so constrained by Press.
the pedagogic advice they receive regarding the Chaudron, C. (1977). A descriptive model of discourse
in the corrective treatment of learners’ errors.
timing of feedback and be prepared to experi-
Language Learning, 27, 29–46.
ment with immediate feedback. DeKeyser, R. M. (1998). Beyond focus on form: Cog-
The study does not provide unequivocal sup- nitive perspectives on learning and practicing
port for the SLA theories that point to the need second language grammar. In C. Doughty &
for immediate feedback. These theories seek to J. Williams (Eds.), Focus on form in classroom second
account for how feedback contributes to the ac- language acquisition (pp. 42–63). New York: Cam-
quisition of implicit knowledge but this study has bridge University Press.
only shown that the immediate feedback (and DeKeyser, R. (2007). Skill acquisition theory. In
delayed feedback) benefited the development of B. VanPatten & J. Williams (Eds.), Theories in sec-
explicit knowledge. We have suggested that may ond language acquisition (pp. 97–113). Mahwah, NJ:
Lawrence Erlbaum.
have been because of the limited nature of the
Doughty, C. (2001). The cognitive underpinnings of
instruction provided or it might also be because focus on form. In P. Robinson (Ed.), Cogni-
the ‘new’ target structure was too developmen- tion and second language instruction (pp. 206–257).
tally advanced for the learners. Further research Cambridge: Cambridge University Press.
is needed to investigate the theoretical claims ad- Doughty, C., & Varela, E. (1998). Communicative focus
vanced by Long (2015) and others regarding the on form. In C. Doughty & J. Williams (Eds.), Focus
Shaofeng Li, Yan Zhu, and Rod Ellis 293
on form in classroom second language acquisition (pp. Long, M. H. (1991). Focus on form: A design feature
114–138). New York: Cambridge University Press. in language teaching methodology. In K. de Bot,
Ellis, N. (2005). At the interface: Dynamic interactions R. Ginsberg, & C. Kramsch (Eds.), Foreign lan-
of explicit and implicit knowledge. Studies in Sec- guage research in cross-cultural perspective (pp. 39–
ond language Acquisition, 27, 305–352. 52). Philadelphia/Amsterdam: John Benjamins.
Ellis, R. (2005). Measuring implicit and explicit knowl- Long, M. H. (1996). The role of the linguistic envi-
edge of a second language: A psychometric study. ronment in second language acquisition. In W.
Studies in Second Language Acquisition, 27, 141–172. Ritchie & T. Bhatia (Eds.), Handbook of second lan-
Ellis, R., Loewen, S., & Erlam, R. (2006). Implicit and guage acquisition (pp. 413–468). San Diego, CA:
explicit corrective feedback and the acquisition of Academic Press.
L2 grammar. Studies in Second Language Acquisition, Long, M. H. (2015). Second language acquisition and
28, 339–368. task-based language teaching. Malden, MA: Wiley–
Ellis, R., & Shintani, N. (2014). Exploring language ped- Blackwell.
agogy through second language acquisition research. Lyster, R. (2004). Different effects of prompts and ef-
London/New York: Routledge. fects in form-focused instruction. Studies in Second
Erlam, R. (2006).Elicited imitation as a measure of L2 Language Acquisition, 26, 399–432.
implicit knowledge: An empirical validation study. Lyster, R., & Ranta, L. (2013). Counterpoint piece: The
Applied Linguistics, 27, 464–491. case for variety in corrective feedback research.
Gass, S., Mackey, A., Fernandez, M., & Alvarez–Torres, Studies in Second Language Acquisition, 35, 167–184.
M. (1999). The effects of task repetition on linguis- Lyster, R., & Saito, K. (2010). Oral feedback in class-
tic output. Language Learning, 49, 549–580. room SLA: A meta-analysis. Studies in Second Lan-
Goo, J., & Mackey, A. (2013). The case against the case guage Acquisition, 32, 265–302.
against recasts. Studies in Second Language Acquisi- Lyster, R., Saito, K., & Sato, M. (2013). Oral correc-
tion, 35, 127–165. tive feedback in second language classrooms. Lan-
Gutiérrez, X. (2013). The construct validity of grammat- guage Teaching, 46, 1–40.
icality judgment tests as measures of implicit and Mackey, A., & Goo, J. (2007). Interaction research in
explicit knowledge. Studies in Second Language Ac- SLA: A meta-analysis and research synthesis. In
quisition, 35, 423–449. A. Mackey (Ed.), Conversational interaction in SLA:
Hattie, J., & Timperley, H. (2007). The power of feed- A collection of empirical studies (pp. 408–452). New
back. Review of Educational Research, 77, 81–112. York: Oxford University Press.
Hedge, T. (2000). Teaching and learning in the language Mackey, A., & Philp, J. (1998). Conversational inter-
classroom. Oxford: Oxford University Press. action and second language development: Re-
Hendrickson, J. (1978). Error correction in foreign lan- casts, responses, and red herrings? Modern Lan-
guage teaching: Recent theory, research, and prac- guage Journal, 82, 338–356.
tice. Modern Language Journal, 62, 387–398. McDaniel, M., Robinson, B., & Einstein, P. (1998).
Ju, M. K. (2000). Overpassivization errors by second lan- Prospective remembering: Perceptually driven or
guage learners. Studies in Second Language Acquisi- conceptually driven processes? Memory & Cogni-
tion, 22, 85–111. tion, 26, 121–134.
Lavolette, E., Polio, C., & Kahng, J. (2015). The accu- Nader, K. (2003). Memory traces unbound. Trends in
racy of computer-assisted feedback and students’ Neurosciences, 26, 65–72.
responses to it. Language Learning & Technology, Norris, J., & Ortega, L. (2000). Effectiveness of L2
19, 50–68. instruction: A research synthesis and quantita-
Li, S. (2010). The effectiveness of corrective feedback in tive meta-analysis. Language Learning, 50, 417–
SLA: A meta-analysis. Language Learning, 60, 309– 528.
365. Qin, J. (2008). The effect of processing instruction
Li, S. (2013). The differential roles of language an- and dictogloss tasks on acquisition of the English
alytic ability and working memory in mediating passive voice. Language Teaching Research, 12, 61–
the effects of two types of feedback on the ac- 82.
quisition of an opaque linguistic structure. In C. Quinn, P. (2014). Delayed versus immediate corrective feed-
Sanz & B. Lado (Eds.), Individual differences, L2 de- back on orally produced passive errors in English.
velopment & language program administration: From (Unpublished doctoral dissertation). University of
theory to application (pp. 32–52). Boston: Cengage Toronto, Toronto.
Learning. Révész, A. (2012). Working memory and the observed
Li, S. (2014). The interface between feedback type, L2 effectiveness of recasts on different L2 outcome
proficiency, and the nature of the linguistic target. measures. Language Learning, 62, 93–132.
Language Teaching Research, 18, 373–396. Rolin–Ianziti, J. (2010). The organization of delayed sec-
Lightbown, P. M. (2008). Transfer appropriate process- ond language correction. Language Teaching Re-
ing as a model for classroom second language ac- search, 14, 183–206.
quisition. In Z. H. Han (Ed.), Understanding second Russell, J., & Spada, N. (2006). The effectiveness of cor-
language process (pp. 27–44). Clevedon, UK: Multi- rective feedback for second language acquisition:
lingual Matters. A meta-analysis of the research. In J. Norris &
294 The Modern Language Journal 100 (2016)
L. Ortega (Eds.), Synthesizing research on language lum innovation in a Chinese secondary school. (Un-
learning and teaching (pp. 131–164). Philadelphia/ published doctoral dissertation). Shanghai Inter-
Amsterdam: John Benjamins. national Studies University, Shanghai.
Schmidt, R. (1994). Deconstructing consciousness in
search of useful definitions for applied linguistics.
AILA Review, 11, 11–26.
Scrivener, J. (2005). Learning teaching: A guidebook for APPENDIX A
English language teachers (2nd ed.). Oxford, UK:
Macmillan.
Seedhouse, P. (2004). The interactional architecture of the Narrative Texts Used in the Treatment Tasks
language classroom: A conversation analysis perspective.
Malden, MA: Blackwell. 1. A Car Accident
Shintani, N., & Aubrey, S. (2016). The effectiveness of
There was a bad car accident yesterday. Three
synchronous and asynchronous written corrective
feedback on grammatical accuracy in a computer- people were killed. Also, one child was injured.
mediated environment. Modern Language Journal, Her leg and arm were broken. Her face was se-
100, 296–319. riously cut. She was driven to the local hospital.
Siyyari, M. (2005). A comparative study of the effect of im- Her injuries were treated there. The relatives of
plicit and delayed, explicit focus on form on Iranian EFL the girl were told about the accident.
learners’ accuracy of oral production. (Unpublished A witness said, “The car was hit by a big truck.
MA thesis). Iran University of Science and Tech- It was badly damaged.” The truck was traveling on
nology, Tehran. the wrong side of the road. The driver of the truck
Ullman, M. (2001). The declarative/procedural model
tried to run away. But he was stopped, and he was
of lexicon and grammar. Journal of Pyscholinguistic
arrested. He was taken to the police station for
Research, 30, 37–69.
Ur, P. (1996). A course in language teaching. Cambridge: questioning. Some bottles of beer were found in
Cambridge University Press. his car. He was charged with drunk driving. He
Varnosfadrani, A. D. (2006). A comparison of the effects was locked in a police cell.
of implicit/explicit and immediate/delayed corrective
feedback on learners’ performance in tailor-made tests.
(Unpublished doctoral dissertation). University of 2. An Earthquake
Auckland, Auckland, NZ.
Kiki was raised in a small house in the country-
Williams, J. (2012). The potential role(s) of writing in
second language development. Journal of Second side. One day he was playing when suddenly there
Language Writing, 21, 321–331. was a big earthquake. He was knocked down by
Williams, J., & Evans, J. (1998). What kind of focus the falling bricks. Then the walls fell down. He
and on which forms? In C. Doughty & J. Williams was trapped in the house. It was very dark. Kiki
(Eds.), Focus on form in classroom second language was badly hurt and could not move. Later Kiki’s
acquisition (pp. 139–155). Cambridge: Cambridge mom came back home. She saw the house was de-
University Press. stroyed. She thought her boy was buried in the
Yang, Y., & Lyster, R. (2010). Effects of form-focused house. She shouted out to him. He could not hear
practice and feedback on Chinese EFL learners’
her because he was covered with bricks.
acquisition of regular and irregular past tense
Some dogs were brought to search for him. Kiki
forms. Studies in Second Language Acquisition, 32,
235–263. was found. The bricks were removed. Kiki was
Zhang, R. (2015). Measuring university-level L2 learn- pulled out of the wreckage of the house. He was
ers’ implicit and explicit knowledge. Studies in Sec- carried to the local hospital. He was put in an
ond Language Acquisition, 37, 457–486. emergency room for treatment. He was given spe-
Zhu, Y. (2015). An ethnographic study on foreign language cial food to help him recover. He was allowed to
teacher cognition and classroom practices within curricu- leave the hospital after one month.
Shaofeng Li, Yan Zhu, and Rod Ellis 295
APPENDIX B
Time × Group
Time Group Interaction
F p F p F p