Professional Documents
Culture Documents
Formulaic Sequences and L2 Oral Proficie
Formulaic Sequences and L2 Oral Proficie
Formulaic Sequences and L2 Oral Proficie
Abstract
This paper investigates the extent to which productive use of formulaic se-
quences by intermediate students of two typologically different languages, i.e.,
English and Spanish, is associated with their oral proficiency in these lan-
guages. Previous research (e.g., Boers et al. 2006) has shown that appropriate
use of formulaic sequences helps learners of English come across as fluent and
idiomatic speakers. The evidence from the present study, which was conducted
with the participation of Dutch-speaking students of English and Spanish, con-
firms that finding, as oral proficiency assessments based on re-tell tasks cor-
related positively with the number of formulaic sequences the students used in
these tasks. The correlations were strongest in the English language samples,
however. It seems that the greater incidence of morphological-inflectional er-
rors in our participants’ spoken Spanish dampens the contribution that using
formulaic sequences tends to make to their oral proficiency (as perceived by
our assessors). The findings are discussed with reference to typological differ-
ences between L1 and L2.
1. Introduction1
Many researchers, including Ellis (1996, 2002), Myles et al. (1998), Schmitt
(2004), Weinert (1995) and Wray (2002), assign a central role to formulaic
sequences in language acquisition and use. Formulaic sequences is used here
as a cover term for a variety of related phenomena also referred to as lexical
phrases or chunks, including classes as diverse as strong collocations (e.g., tell
1. We are grateful to Jean-Pierre van Noppen, Justine Kemlo, Kate McDonald, Carole Fielding,
Isabel Cangas, Rosa Fernández, Ana Gutiérrez, Pilar Fernández, Sofía Gallego and Paola
Acosta for their help with the study. We would also like to thank two anonymous reviewers
for their insightful comments and useful suggestions.
a lie; heavy traffic), idioms (e.g., turn the tide; back to square one), binomials
(e.g., cuts and bruises; research and development), standardised similes (e.g.,
clear as crystal; dry as dust), proverbs and clichés (e.g., When the cat’s away
. . . ; That’s the way the cookie crumbles), discourse organisers (e.g., On the
other hand; Having said that) and social routine formulae (e.g., Nice to meet
you; Have a nice day). Some formulaic sequences are pretty much fixed as far
as their lexical composition is concerned, while others occur in different guises,
showing syntagmatic expansion (e.g., tell a white lie; Have very nice day) or
paradigmatic substitution (e.g., Pleased/Nice to meet you; conduct/carry out
an investigation). Paradigmatic substitution is especially evident in slot-and-
frame patterns, such as ‘take (someone) x time (to do y)’ (e.g., It took us two
hours to get there; It’ll take only five minutes). Additionally, some formulaic
sequences are completely fixed at the morphological level as well (e.g., *On
the other hands; *he was jumping guns), while others vary in accordance with
general ‘rules’ of grammar (e.g., commit/ commits/ committed a crime/ crimes).
Frequently occurring formulaic sequences are believed to play a crucial role
in language acquisition because they provide the material for exemplar-based
learning. This is the position advocated by Ellis (2002), who maintains that,
with regard to L1 acquisition, the usual developmental sequence is from for-
mula, through low-scope slot-and-frame pattern, to creative construction. The
outcome of this development, according to Skehan (1998), is a dual system
for language processing which comprises both a rule-based mode, which al-
lows language users to produce ‘novel’ utterances, and a memory-based mode,
which allows for stringing together holistically stored exemplars at a fast pace.
Which of the two modes takes precedence over the other is believed to de-
pend on the particular circumstances of language use: employing the rule-
based mode requires conditions that allow for some planning time, whereas the
memory-based mode facilitates fluency under ‘real-time’ conditions. Memory-
based language production is characterised by ample use of formulaic se-
quences. As most instances of language use typically involve a mixture of both
rule-based and memory-based processing, formulaic sequences can serve as
stepping stones between ‘novel’ word sequences. They may afford speakers
“processing time while computation proceeds, enabling us to plan ahead for
the content of what we are going to say, as well as the linguistic form” (Ske-
han 1998: 40). How densely distributed the stepping stones are will depend
on the discourse genre and the situational context. The incidence of formulaic
sequences has been shown to be particularly high in situations where language
users have had little opportunity to think about precisely how they will express
their thoughts and where a relatively high pace of delivery is expected (Kuiper
1996).
Like Skehan, Wray (2002) also recognizes two, complementary modes of
L1 language processing: while native speakers can adopt an analytic perspec-
tive on language patterns as the need arises, they will nevertheless tend to pro-
cess word strings in their L1 ‘holistically’, i.e., as sequences that were stored as
units in memory during childhood. The large stock of such ‘prefabricated’ word
strings that native speakers have acquired facilitates fluent language processing
not only in language production, but also in language reception. Experiments
have shown that formulaic sequences can be recognized and interpreted by re-
spondents before completion of the entire word string (Conklin and Schmitt
2008). This is said to free up processing space for interpreting less predictable
parts of incoming discourse. Wray and Perkins put it as follows: “The advan-
tage of the creative system is the freedom to produce or decode the unexpected.
The advantage of the holistic system is economy of effort when dealing with
the expected.” (Wray and Perkins 2000: 11).
There is no definitive figure as to the amount of formulaic language in lan-
guage, but according to some estimates between 50 % and 70 % of adult native
English language consists of formulaic sequences (Altenberg 1990; Erman and
Warren 2000). There are at least three difficulties in calculating the extent to
which discourse is formulaic. Firstly, as mentioned above, the relative density
of formulaic sequences is likely to vary between discourse genres and situ-
ational contexts. Secondly, the notion of formulaic sequence itself is rather
vague. Wray’s often quoted definition is as follows: “a sequence, continuous
or discontinuous, of words or other elements, which is, or appears to be, pre-
fabricated: that is, stored and retrieved whole from memory at the time of use,
rather than being subject to generation or analysis by the language grammar”
(Wray 2002: 9). This definition allows for a fair degree of individual varia-
tion: a given word string may be (more) formulaic for one speaker than for
another. For example, the string (not) statistically significant is more likely to
be experienced as a formulaic sequence by researchers than by lay people. The
string formulaic sequence may have attained ‘chunk status’ in linguistics cir-
cles, but may be experienced as a novelty elsewhere. Indeed, when different
experts are asked to independently identify formulaic sequences in samples of
discourse, agreement among them tends to be far from absolute (Eyckmans et
al. 2007; Foster 2001). A third difficulty in counting the incidence of formulaic
sequences lies in their variability. As mentioned above, many of them occur
with syntagmatic extensions and/or paradigmatic substitutions. Even idioms
show a considerable degree of variation from their dictionary-type form (Her-
rera and White 2010; Moon 1998). As a result, automatic searches for these
types of formulaic sequences in electronic corpora are likely to yield underes-
timations of their frequency of occurrence. We shall return to the possibility of
corpus-based counts of formulaic sequences further below.
From a psycholinguistics perspective, the variability of many formulaic se-
quences raises questions about their holistic representation in the mental lex-
icon (Boers and Lindstromberg 2009: 32-33). Are different variants of ‘the
2. Google searches for the word strings “English is very idiomatic” and “English is a very id-
iomatic language” generated about 2500 hits (on 14 December 2009). Similar searches for
Spanish generated only 9 hits.
This finding, however, does not necessarily imply that the learning of for-
mulaic sequences poses exactly the same challenges in both target languages,
since these languages differ in other ways which might affect the learning
process. In terms of morpho-syntactic typology, English is relatively analytic,
whereas Spanish is more synthetic. An analytic language has comparatively
little grammatical inflection. Syntactic and semantic roles are signalled mostly
by word order and the use of prepositions (or postpositions) rather than by in-
flection. By contrast, a synthetic language tends to use more affixes and mod-
ifications of roots. Word order tends to be more flexible, since syntactic func-
tions and semantic roles are often signalled by the inflected forms (Bauer 2003;
Haspelmath 2002). Synthetic languages are further classified as inflectional or
fusional, if the forms of the words themselves change to indicate how they re-
late to other words in a sentence, or as agglutinating, if words are formed by
the combination of morphemes. It is clear that Spanish is characterised by a
richer inflectional grammar than English. Adjectives agree with nouns for gen-
der as well as number, verbs are systematically marked for person, number and
(indicative or subjunctive) mood, etc.
When it comes to certain types of formulaic sequences, such as verb-noun
collocations, these will almost inevitably display greater morphological vari-
ability in Spanish than in English. While the verb in run a risk, for exam-
ple, has relatively few variants (i.e., run, running, runs and ran), its equiva-
lent in Spanish, correr un riesgo, has many more (e.g., corro, corres, corre,
corremos, corréis and corren, which are but the verb forms of the present
indicative; to these could be added many more belonging to past and future
tenses). The greater variability in verb inflections is also likely to play in the
case of verb-phrase idioms (e.g., echar un cable – ‘lend a hand’, perder la
cabeza – ‘lose one’s mind’), some similes (e.g., venderse como churros – ‘sell
like hotcakes’), and verb-preposition collocations (e.g., soñar con – ‘dream
of’). More variability in Spanish is also to be expected in the case of adverb-
adjective collocations, whose gender marking will be determined by the nature
of the noun they happen to modify (e.g., estrechamente unido / unida – ‘closely
linked’).
Given this wider morpho-syntactic variation in synthetic languages, one may
hypothesize that fluent production of correct formulaic sequences is somewhat
more difficult to achieve in general than in analytic languages, since either
rule-based knowledge will more often need to be employed to mould a canon-
ical or schematic representation of a formulaic sequence into a grammatically
appropriate form, or the grammatically appropriate ‘variant’ of a formulaic se-
quence will need to be selected from a greater set of ready-made forms. In
addition, one may hypothesize that L2 formulaic sequences will start perform-
ing their function as stepping stones less readily in a synthetic target language
if the learner’s L1 happens to lean more towards the analytic pole of the ty-
2. Research questions
Given the fact that Spanish is a relatively more synthetic, inflectional language
than English, we hypothesize that (L1 Dutch learners’) fluent production of for-
mulaic sequences presents a greater challenge in the case of L2 Spanish than
L2 English. This hypothesis is tested in a study which examines whether the
oral proficiency scores obtained by students of Spanish might be less clearly
associated with their use of formulaic sequences than has been reported in con-
nection with English (Boers et al. 2006). More specifically, the study addresses
the following research questions:
(a) Is the productive use of formulaic sequences by (L1 Dutch) students of
English and Spanish as foreign languages positively associated with their
oral proficiency in these languages, evaluated with regard to fluency, range
of expression and accuracy?
(b) If so, is the association equally strong in both languages?
(c) In case the association is weaker in L2 Spanish, could this be due to a
greater incidence of inflectional errors in the learners’ output?
3. Method
3.1. Participants
also subjected to a T-test statistic (see Section 4.2). The participants were not
told that they were taking part in an experimental study until after the study
had been completed.
Students were given a re-tell task and their performance was recorded. This L2
sample served to count the students’ use of formulaic sequences and to gauge
their oral proficiency (see Section 3.3). The input for the task was identical for
both language groups. Students were asked to read a 600-word text in their L1
(Dutch) about a general interest topic. After reading the text, students returned
it to the teacher and they were given three minutes to prepare for the re-tell
task, which was to report the text’s content in their major foreign language.
Planning time was limited to ensure a fair degree of ‘real-time’ language pro-
duction. To help the students recollect the content of the input text, they were
given a list with single-word content cues, corresponding chronologically to
the paragraphs of the text. The reason for using an input text in the students’
mother tongue was to avoid a confounding variable in the Boers et al. (2006)
study, where many participants simply reproduced word strings from an L2
input text that were not necessarily part of their own repertoires of formulaic
sequences.
To identify formulaic sequences, one might resort to corpus data and rely on
statistical scores (so-called MI scores, T-scores or Z-scores) to decide which
words form strong partnerships. Unfortunately, corpus statistics do not always
generate word strings which coincide with people’s intuitions about what con-
stitutes a formulaic sequence (e.g., Ellis, Simpson-Vlach and Maynard 2008;
Nesselhauf 2003; Read and Nation 2004; Schmitt, Grandage and Adolphs
2004; Stengers 2007, 2009). Automatically generated corpus-based colloca-
tions lists may include combinations (e.g., in + the) that for lack of semantic
unity are not recognised as formulaic sequences by respondents. Other combi-
nations, which respondents do identify as formulaic sequences, may be absent
from automatically generated lists. For example, at the time of writing this ar-
ticle, consultation of the Collins Cobuild on-line collocations sampler yielded
no confirmation of the formulaic-sequence status of expressions such as throw
a party. For the present study we decided to resort to several native speak-
ers’ counts of what they considered to be formulaic sequences in the students’
spoken discourse. We realised that only partial agreement could be expected
among the respondents, but hoped to obtain sufficient inter-rater reliability as
calculated through correlation analyses. Three speakers of English and three
speakers of Spanish were asked to individually listen to the recordings and to
list all word strings they considered to be formulaic sequences. They had all
had training in linguistics and were familiar with the literature on phraseology,
including the notion of formulaic sequences.
As we (the authors) first piloted the counting procedure on a small L2 sam-
ple to assess the feasibility of the task we were planning to set for the six vol-
unteer formulaic-sequence counters, we quickly stumbled across word strings
that could perhaps be interpreted as students’ attempts at producing L2 formu-
laic sequences, but which deviated from the L2 targets in one way or another.
Students sometimes produced certain word strings smoothly and confidently –
leaving the impression of ‘holistic’ production – but the strings were actually
not target-like, possibly due to erroneous transfer from L1 (e.g., *if a person
has overweight; *it depends of . . . ). Students sometimes produced word com-
binations that did seem target-like at the level of content-word selection but that
failed to be correct at the level of morpho-syntax and/or function words (e.g.,
*they are desperate seeking for a partner . . . ). Deciding which of these erro-
neous strings might nevertheless be considered close enough approximations
of L2 targets to be included in the tally turned out difficult and the possibil-
ity of including of such approximations seriously reduced the inter-rater agree-
ment among the three authors during the piloting. It therefore became clear that
asking our volunteer formulaic-sequence counters to make such decisions too
would seriously jeopardise the degree of inter-rater agreement we were hop-
ing to obtain. We thus chose to facilitate the task of our formulaic-sequence
counters as much as possible by asking them to list only formulaic sequences
that corresponded fully to L2 targets. This means, of course, that the figures we
will be reporting below are likely to be underestimations of the students’ actual
attempts at producing L2 formulaic sequences.
In order to gauge whether language errors might counteract the positive in-
fluence of the use of formulaic sequences on students’ oral proficiency scores,
we3 made inventories of the students’ language errors independently of the
formulaic-sequence counters. For reliability’s sake we focused on errors at the
level of inflectional grammar (wrong verb conjugations, wrong adjective-noun
agreement, adjectival instead of adverbial form, etc.), since identification of
this type of error is pretty uncontroversial.
3. The first author counted errors in the L2 Spanish recordings; the second author did so in the
L2 English ones.
4. Results
4. We are aware that the CEFR scales for language proficiency are not beyond criticism (see, e.g.,
Hulstijn 2007 on the need for more solid empirical underpinnings). Also, the descriptors are
arguably not precise enough to warrant satisfactory inter-rater reliability. On the other hand,
these scales do now appear to be used quite commonly (in Europe) and so their employment
in the present study is at least ecologically valid.
5. Given the teachers’ previously indicated estimates of their students’ oral proficiency level, it
was felt unrealistic to include the highest CEFR level, i.e., C2, in the guidelines for marking.
A mark of 15 thus corresponded to a convincing performance at the C1 level.
6. All the data were tested for normal distribution by means of the Kolmogorov-Smirnov test.
When they were found to be normally distributed, statistical analyses were performed using
parametric tests. If not, non-parametric alternatives were used.
These results do not mean, however, that the students of Spanish necessar-
ily made fewer attempts at using L2 formulaic sequences than the students of
English. After all, the tallies include only word strings that fully correspond to
L2 targets. As we shall see below, there are reasons to assume that many more
word strings were excluded from the L2 Spanish counts because they contained
morpho-syntactic errors.
The degree to which the language proficiency assessors converged in their scor-
ing, though statistically significant, was not as high as we had hoped, with
Spearman rank correlations ranging from .255 to .716. Nevertheless, for both
language cohorts, the three raters’ scores were combined into one mean score
per student. These mean scores ranged from 7 to 12 (corresponding to CEFR
brackets B1 and B2) in both FL cohorts. Students’ mean group scores in the
re-tell task are summarised in Table 1.
The Pearson correlations between the formulaic-sequence counts and the oral
proficiency scores are summarized in Table 2 below. As mentioned in Sec-
tion 3.4, we calculated mean type counts and mean tokens-per-minute counts.
Given the assumption that using formulaic sequences is a way of avoiding hes-
itations, the correlation coefficient for the Fluency parameter was calculated
using the tokens-per-minute measure. For the Range parameter, the type counts
were used (as a reflection of the diversity or width of students’ repertoires of
formulaic sequences). As the occurrence of errors did not differ between tokens
of the same type, we also used the type counts in connection with Accuracy.
The formulaic-sequence counts and the oral proficiency scores are positively
correlated in both L2 language samples. However, the correlations for L2 En-
glish are clearly stronger than for L2 Spanish. An illustration of the difference
in linearity of the association in both languages for the Range parameter is
given in Figures 1 and 2.
ENG SPA
Fluency .550 (p < .01) .361 (p < .05)
Range .626 (p < .01) .389 (p < .05)
Accuracy .561 (p < .01) .363 (p < .05)
Table 4. Correlations of number of inflectional errors and oral proficiency for L2 Span-
ish
5. Discussion
The data collected reveal positive correlations between the numbers of formu-
laic sequences used by students and their oral proficiency as perceived by blind
judges. The correlations appear to be weaker in L2 Spanish, however, where the
higher incidence of inflectional errors generally seems to dampen the positive
impression which the appropriate use of formulaic sequences helps to make.
Because of the greater importance of inflection in Spanish, it also seems
that formulaic-sequence mastery under real-time conditions is more difficult
to achieve in Spanish than in English. Unlike the students of English, the stu-
dents of Spanish regularly made inflectional errors also in their attempts to
produce word strings which few would contest as being lexically semi-fixed
The findings presented in this article corroborate earlier claims that mastery of
formulaic sequences entails a processing advantage not only for native speakers
but also for L2 learners, although the evidence is more convincing in the L2 En-
glish data than in the L2 Spanish data. While this lends support to pedagogies
aiming to help L2 learners expand and employ their repertoire of formulaic
sequences (e.g., Boers and Lindstromberg 2009; Lewis 1993; Nattinger and
Decarrico 1992), our analysis also suggests that mastery of L2 formulaic se-
quences is more of a challenge in a synthetic language such as Spanish than in
an analytic language such as English, perhaps especially so when the learner’s
L1 leans towards the analytic pole of the typology continuum too. The classes
of formulaic sequences that are not fully fixed tend to be amenable to more
inflectional changes in Spanish than in English. Depending on the model of
automatisation one adheres to, this means that either more variants will need
to be stored holistically or procedural knowledge of grammar will need to be
sufficiently ‘in place’ to assemble formulaic word strings not only fast but also
accurately for real-time language production. Whatever the case may be, this
finding does not undermine the claim that knowledge of phraseology is use-
ful, as we obtained positive correlations in both FL language cohorts. It merely
indicates that it may take longer for learners of inflectional languages to de-
velop this knowledge sufficiently for it to have a measurable impact on their
language proficiency, especially under real-time conditions. Also, in the case of
instructed SLA considered here (i.e., classroom-based learning), it seems that
inflectional languages would benefit from a comparatively greater investment
in focus on form or even focus on forms (Long 2001; Doughty and Williams
1998)7 , and this would then hold true not only for the learning of formulaic
sequences, of course. In short, while our findings suggest that formulaic se-
quences serve the purpose of stepping stones that help learners attain a certain
level of fluency in oral interaction, many of these stepping stones may stay
slippery for longer in the case of synthetic languages.
We need to acknowledge, of course, that the different strengths of associa-
tion between formulaic sequence use and oral proficiency scores between L2
English and L2 Spanish that we have attested here may not be entirely due
to the nature of these target languages per se. Although no statistically sig-
nificant differences were found between the two groups’ proficiency scores,
we cannot rule out the possibility that the assessors of L2 Spanish were more
lenient in their scoring than the assessors of L2 English, due to different ex-
pectations. While the two cohorts were roughly matched as far as the quantity
of classroom-based instruction over time in either language was concerned,
we cannot ignore the possible influence of the broader circumstances in which
our participants were learning either of these languages. As was mentioned
above, students in Flanders tend to get much more out-of-class exposure to
English than to Spanish. As a consequence, learners of English may also get
more opportunities for exemplar-based learning – including opportunities for
the incidental acquisition of formulaic sequences. If the L2 English cohort was
slightly more advanced, after all, then this could add an explanation for the
stronger correlations between formulaic-sequence counts and oral proficiency
scores. The results of previous studies have suggested that it is especially at ad-
7. Focus on form refers to pedagogical approaches in which learners’ attention is drawn to lin-
guistic elements during a communicative activity, whereas focus on forms “entails teaching
discrete linguistic structures in separate lessons in a sequence determined by syllabus writers”
(Laufer and Girsai 2008: 2).
Erasmushogeschool Brussel
"helene.stengers@ehb.be#
Victoria University of Wellington
"frank.boers@vuw.ac.nz#
Vrije Universiteit Brussel
"housen@vub.ac.be#
Erasmushogeschool Brussel
Vrije Universiteit Brussel
"june.eyckmans@vub.ac.be#
Appendix
Examples of morpho-syntactic errors encountered in the re-tell tasks
L2 Spanish
L2 English
References
Accou, Sien. 2009. The repertoire of German figurative idioms. MA dissertation. Brussels: Eras-
mus University College Brussels.
Altenberg, Bengt. 1993. Recurrent verb-complement constructions in the London-Lund Corpus.
In Jan Aarts, Pieter de Haan & Nelleke Oostdijk (eds.), English language corpora: Design,
analysis and exploitation, 227–245. Amsterdam: Rodopi.
Anderson, John R. 1993. Rules of the mind. Hillsdale: Lawrence Erlbaum.
Ellis, Rod & Gary Barkhuizen. 2005. Analysing learner language. Oxford: Oxford University
Press.
Bauer, Laurie. 2003 Introducing linguistic morphology. Washington, D.C.: Georgetown University
Press.
Boers, Frank, June Eyckmans, Jenny Kappel, Hélène Stengers & Murielle Demecheleer. 2006.
Formulaic sequences and perceived oral proficiency: Putting a lexical approach to the test.
Language Teaching Research 10. 245–261.
Boers, Frank & Seth Lindstromberg. 2009. Optimizing a lexical approach to instructed second
language acquisition. Basingstoke: Palgrave Macmillan.
Conklin, Kathy & Norbert Schmitt. 2008. The processing advantage of formulaic Sequences. Ap-
plied Linguistics 29. 72–89.
Dechert, Hans W. 1984. Second language production: Six hypotheses. In Hans W. Dechert, Hans
W., Dorothea Möhle & Manfred Raupach (eds.), Second language productions, 211–230.
Tübingen: Gunter Narr Verlag.
DeKeyser, Robert. 2001. Automaticity and automatization. In Peter Robinson (ed.), Cognition and
second language instruction, 125–151. Cambridge: Cambridge University Press.
Doughty, Catherine & Jessica Williams (eds.) 1998. Focus on form in classroom second language
acquisition. Cambridge: Cambridge University Press.
Ellis, Nick C. 1996. Sequencing in SLA: Phonological memory, chunking, and points of order.
Studies in Second Language Acquisition 18. 91–126.
Ellis, Nick C. 2002. Frequency effects in language acquisition: A review with implications for
theories of implicit and explicit language acquisition. Studies in Second Language Acquisition
2. 143–188.
Ellis, Nick C. & Richard Schmidt. 1997. Morphology and longer distance dependencies: Labora-
tory research illuminating the A in SLA. Studies in Second Language Acquisition 19. 145–171.
Ellis, Nick C., Rita Simpson-Vlach & Carson Maynard. 2008. Formulaic language in native and
second language speakers: Psycholinguistics, corpus linguistics, and TESOL. TESOL Quar-
terly 42. 375–396.
Erman, Britt & Beatrice Warren. 2000. The idiom principle and the open choice principle. Text 20.
87–120.
Eyckmans, June. 2007. Taking SLA research to interpreter training: Does knowledge of phrases
foster fluency? In Frank Boers, Jeroen Darquennes & Rita Temmerman (eds.), Multilingual-
ism and applied comparative linguistics: Pedagogical perspectives, 89–104. Newcastle: Cam-
bridge Scholars Publishing.
Eyckmans, June, Frank Boers & Hélène Stengers. 2007. Identifying chunks: Who can see the wood
for the trees? Language Forum 33. 85–100.
Forsberg, Fanny. 2008. Le langage préfabriqué: Formes, fonctions et fréquences en français parlé
L2 et L1. Bern: Peter Lang Verlag.
Forsberg, Fanny. 2009. Formulaic sequences: A distinctive feature at the advanced/very advanced
levels of second language acquisition. In Emmanuelle Labeau & Florence Myles (eds.), The
advanced learner variety: The case of French, 173–197. Bern: Peter Lang Verlag.
Foster, Pauline. 2001. Rules and routines: A consideration of their role in task-based language
production of native and non-native speakers. In Martin Bygate, Peter Skehan & Merrill Swain
(eds.), Researching pedagogic tasks: Second language learning, teaching, and testing, 75–93.
London: Longman.
Foster, Pauline & Peter Skehan. 1996. The influence of planning on performance in task-based
learning. Studies in Second Language Acquisition 18. 299–324.
Granger Sylviane. 1998. Prefabricated patterns in advanced EFL writing: Collocations and For-
mulae. In Anthony P. Cowie (ed.), Phraseology, theory, analysis and applications, 145–160.
Oxford: Oxford University Press.
Grant, Lynn & Laurie Bauer. 2004. Criteria for re-defining idioms: Are we barking up the wrong
tree? Applied Linguistics 25. 38–61.
Haspelmath, Martin. 2002. Understanding morphology. London: Arnold.
Herrera, Honesto & Michael White. 2010. Canonicity and variation in idiomatic expressions: Ev-
idence from business press headlines. In Sabine De Knop, Frank Boers & Teun De Rycker
(eds.), Fostering language teaching efficiency through cognitive linguistics, 167–187. Berlin:
Mouton de Gruyter.
Housen, Alex & Michel Pierrard. 2005. Investigating Instructed Second Language Acquisition. In
Alex Housen & Michel Pierrard (eds.), Investigations in instructed second language acquis-
tion, 1–27. Berlin: Mouton de Gruyter.
Hulstijn, Jan. 2007. The shaky ground beneath the CEFR: Quantitative and qualitative dimensions
of language proficiency. The Modern Language Journal 91. 663–667.
Kaplan, Robert B. 1966. Cultural thought patterns in intercultural education. Language Learning
16. 1–20.
Kuiper, Koenraad. 1996. Smooth talkers: The linguistic performance of auctioneers and sportscast-
ers. Englewood Cliffs: Lawrence Erlbaum.
Laufer, Batia & Nany Girsai. 2008. Form-focused instruction in second language vocabulary learn-
ing: A case for contrastive analysis and translation. Applied Linguistics 29. 694–716.
Laufer, Batia & Paul Nation. 1995. Vocabulary size and use: Lexical richness in L2 written pro-
duction. Applied Linguistics 16. 307–322.
Lennon, Paul. 1990. Investigating fluency in EFL: A quantitative approach. Language Learning
40. 387–417.
Lewis, Margareta. 2008. The idiom principle in L2 English: Assessing elusive formulaic sequences
as indicators of idiomaticity, fluency, and proficiency. Stockholm: University of Stockholm.
PhD thesis.
Lewis, Michael. 1993. The lexical approach: The state of ELT and a way forward. Hove: Language
Teaching Publications.
Lewis, Michael. 1997. Implementing the lexical approach: Putting theory into practice. Hove:
Language Teaching Publications.
Lewis, Michael (ed.). 2000. Teaching collocation: Further developments in the lexical approach.
Hove: Language Teaching Publications.
Logan, Gordon. 1988. Toward an instance theory of automatization. Psychological Review 95.
492–527.
Long, Michael H. 1991. Focus on form: A design feature in language teaching methodology. In
Kees de Bot, Ralph B. Ginsberg & Claire Kramsch (eds.), Foreign language research in cross-
cultural perspective, 39–52. Amsterdam: John Benjamins.
Luoma, Sari. 2004. Assessing speaking. Cambridge: Cambridge University Press.
MacKay, Donald G. 1982. The problem of flexibility, fluency, and speed-accuracy trade off in
skilled behaviour. Psychological Review 89. 483–506.
Malvern, David D., Brian J. Richards, Ngoni Chipere and Pilar Durán. 2004. Lexical Diversity and
Language Development: Quantification and Assessment. Basingstoke: Palgrave Macmillan.
Moon, Rosamund. 1998. Fixed expressions and idioms in English. Oxford: Clarendon Press.
Myles, Florence, Janet Hooper & Rosamond Mitchell. 1998. Rote or rule: Exploring the role of
formulaic language in classroom foreign language learning. Language Learning 48. 323–364.
Nattinger, James R. & Jeannette S. DeCarrico. 1992. Lexical phrases and language teaching. Ox-
ford: Oxford University Press.
Nesselhauf, Nadja. 2003. The use of collocations by advanced learners of English and some im-
plications for teaching. Applied Linguistics 24. 223–242.
Read, John and Paul Nation 2004. Measurement of formulaic sequences. In Norbert Schmitt (ed.),
Formulaic Sequences: Acquisition, Processing and Use, 23–35. Amsterdam: John Benjamins.
Schmidt, Richard W. 1992. Psychological mechanisms underlying second language fluency. Stud-
ies in Second Language Acquisition 14. 357–385.
Schmitt, Norbert (ed.). 2004. Formulaic sequences: Acquisition, processing and use. Amsterdam:
John Benjamins.
Schmitt, Norbert, Sarah Grandage & Svenja Adolphs. 2004. Are corpus-derived recurrent clus-
ters psycholinguistically valid? In Norbert Schmitt (ed.), Formulaic sequences: Acquisition,
processing and use, 127–147. Amsterdam: John Benjamins.
Segalowitz, Norman S. 2003. Automaticity and second languages. In Catherine Doughty &
Michael H. Long (eds.), The handbook of second language acquisition, 282–308. Oxford:
Blackwell.
Skehan, Peter. 1998. A cognitive approach to language teaching. Oxford: Oxford University Press.
Slobin, Dan I. 2002. Verbalized events: A dynamic approach to linguistic relativity and determin-
ism. In Susanne Niemeier and René Dirven (eds.), Evidence for linguistic relativity, 107–138.
Amsterdam: John Benjamins.
Stauble, Ann-Marie E. 1978. The Process of decreolization: A model for second language devel-
opment. Language Learning 28: 29–54.
Stengers, Hélène. 2007. Is English exceptionally idiomatic? Testing the waters for a lexical ap-
proach to Spanish. In Frank Boers, Jeroen Darquennes & Rita Temmerman (eds.), Multilin-
gualism and applied comparative linguistics: Pedagogical perspectives, 107–125. Newcastle:
Cambridge Scholars Publishing.
Stengers, Hélène. 2009. The idiom principle put to the test: An exercise in applied comparative
linguistics. Brussels: Vrije Universiteit Brussel. PhD thesis.
Talmy, Leonard. 2000. Toward a cognitive semantics, Vol. 1: Concept structuring systems. Michi-
gan: MIT Press.
Towell, Richard, Richard Hawkins & Nives Bazergui. 1996. The development of fluency in ad-
vanced learners of French. Applied Linguistics 17. 84–119.
Vermeer, Anne. 2000. Coming to grips with lexical richness in spontaneous speech data. Language
Testing 17. 65–83.
Weinert, Regina. 1995. The role of formulaic language in second language acquisition: A review.
Applied Linguistics 16. 185–205.
Willis, Dave. 1990. The lexical syllabus: A new approach to language teaching. London: Collins
ELT.
Wray, Alison. 2002. Formulaic language and the lexicon. Cambridge: Cambridge University Press.
Wray, Alison & Michael R. Perkins. 2000. The function of formulaic language: An integrated
model. Language and Communication 20. 1–28.
Yuan, Fangyuan & Rod Ellis. 2003. The effects of pre-task planning and on-line planning on
fluency, complexity and accuracy in L2 oral production. Applied Linguistics 24. 1–27.