Professional Documents
Culture Documents
Coxhead & Boutorwick (2018) Longitudinal Vocabulary Development in An EMI International School Context Learners and
Coxhead & Boutorwick (2018) Longitudinal Vocabulary Development in An EMI International School Context Learners and
Coxhead & Boutorwick (2018) Longitudinal Vocabulary Development in An EMI International School Context Learners and
METHODOLOGY
Participants
A total of 538 students took the VLT. Students who were diagnosed
with suspected learning difficulties (e.g., dyslexia) (N = 67) were
removed from the data set, along with three participants who were
classed as native speakers by the means adopted in this study but who
nevertheless required additional language support. Of the remaining
468 participants, 43% had German nationality, and English-dominant
countries such as the United Kingdom, the United States, Australia,
Canada, and South Africa accounted for 12%, followed by Russia
(5%). Participants came from over 50 different countries.
Participants were grouped into cohorts according to the year they
were enrolled in Grade 6 (see Table 1). Those enrolled in Grade 6 in
2009 belong to the 2009 cohort; students enrolled in Grade 6 in 2010
are in the 2010 cohort. Students who enrolled in Grade 8 in 2011
belong to the 2009 cohort because they would have been in Grade 6
in 2009. Permission was sought from participants for this study. For
ethical reasons, the teacher involved in this study chose to remain
anonymous and not participate as an author of this article.
Between 2009 and 2014, all classes except French, Spanish, and Ger-
man language classes were taught in English. All participants in the
study had English literature classes 5 days a week, for 45 minutes per
class. The least proficient of the nonnative-speaking students
(NNSEAL) were placed into specifically designed classes to improve
their vocabulary knowledge and other language knowledge and skills.
These classes provided extra English as an additional language (EAL)
support (three periods per week) during the modern foreign lan-
guages time-slot. A bilingual English and German model was intro-
duced in the 2014–2015 academic year, starting in Grade 7, which
affected 73 participants in the 2013 cohort, which included 26 first
language (L1) German and eight L1 English speakers, and 39 speakers
of other first languages. An assessment of the impact of this change is
beyond the scope of this article.
In Table 1, the right-most column represents total n-sizes for NS, NNS,
and NNSEAL. The bottom-most row depicts the total number of students
for each cohort. Some students took the VLT each year, but others did
not (because of absences, for example). Table 2 depicts the number of
students in each grade, for the NS, NNS, and NNSEAL cohorts. Many stu-
dents took the tests multiple times over the years of the study.
Note that in this school, Grades 6, 7, and 8 follow an in-house curricu-
lum which draws on the English Key Stage 2 primary curriculum from
TABLE 2
Number of Students in Each Grade for Each of the Three Groups (NS, NNS, NNSEAL)
Learning materials
The learning materials for this study were developed by Grade 6
teachers in the NNSEAL, maths, and science classes and were gathered
from a week each at the beginning, middle, and end of the academic
year. Table 3 shows the number of texts and total running words in
this small corpus. Figure 2 provides examples from the learning mate-
rials from each subject. Note that each example contains some sort of
problem for the students to solve.
1 birth
5 sport
6 victory
FIGURE 2. Samples from the learning materials in English, maths, and science.
TABLE 3
Running Words and Texts of the Learning Materials by Subject
Textbooks
To represent textbooks from Grades 6, 8, 10 and 11 (Table 4), in
English, The Hunger Games (Grade 6) and Pride and Prejudice (Grade
10) were selected from Coxhead (2012; see also Coxhead & White,
2012); and in science, a Grade 8 textbook and a Grade 11 textbook
were selected from Coxhead et al. (2010). In maths, a Grade 8 and a
Grade 11 textbook from a well known series in New Zealand were
used. This corpus of textbooks was carefully checked and cleaned
using Notepad++ to correct any errors, as recommended by Nation
(2016). Because a corpus of suitable textbooks was already available, it
was thought impractical to extend the scope of this project to include
the analysis of the textbooks actually in use by these students. The
teacher confirmed that these textbooks are typical of the ones used in
the international school.
Similar findings to Nation’s (2006) analysis of novels could be
expected for Pride and Prejudice and two chapters of The Hunger Games
in the present study. Having access to the full text for The Hunger
Games would clearly be preferable, but was not possible for this
analysis.
Procedures
Each cohort was first tested on entry to Grade 6, and then tested
each subsequent year. Therefore, the 2009 cohort was first tested in
2009, again in 2010, 2011, and so on until 2014. The 2010 cohort was
first tested in 2010, 2011, and so on until 2015. The 2009, 2010, and
2011 VLT assessments were carried out on paper. Subsequently, the
VLT was administered in class using an online version of the test. The
teacher supervised and marked the tests. Absent students were usually
given another opportunity to complete the assessment. Students were
asked to skip any words they did not know instead of trying to guess
and had 90 minutes to complete the test. The tests were given to par-
ticipants at the beginning of the academic year in 2009–2011; the tim-
ing of the administration changed to the end of the academic year
from 2012.
Data Analysis
The VLT analysis was carried out using R (R Core Team, 2016). Lin-
ear mixed-effects models were computed to compare the groups and
to control variables including individual differences by including ran-
dom effects in the models, so results obtained from the models are
more easily generalizable to a wider population (Baayen, Davidson, &
RESULTS
Grade VLT 2,000 VLT 3,000 VLT 5,000 VLT 10,000 VLT AWL
6 28.8 (1.4) 26.9 (3.3) 23.0 (6.2) 10.2 (7.0) 20.9 (5.2)
7 29.2 (1.1) 28.7 (2.2) 25.3 (4.8) 16.6 (7.4) 25.3 (4.4)
8 29.2 (1.1) 28.6 (2.2) 26.9 (3.5) 18.2 (6.5) 27.0 (2.6)
9 29.8 (0.5) 29.5 (0.8) 28.0 (3.0) 21.7 (6.4) 28.3 (2.5)
10 29.9 (0.2) 29.8 (0.4) 29.6 (1.0) 25.0 (4.4) 28.9 (1.4)
11 29.8 (0.5) 29.9 (0.4) 29.4 (0.9) 25.0 (3.7) 29.6 (0.5)
TABLE 6
Average VLT Scores With Standard Deviations for NNSs
Grade VLT 2,000 VLT 3,000 VLT 5,000 VLT 10,000 VLT AWL
6 26.0 (3.2) 21.2 (5.8) 15.9 (6.3) 6.5 (5.3) 16.1 (6.1)
7 28.0 (2.0) 25.6 (3.7) 20.8 (5.1) 9.7 (5.5) 21.8 (4.6)
8 28.7 (1.3) 27.5 (2.7) 23.3 (4.4) 12.5 (6.0) 24.6 (4.3)
9 29.3 (1.1) 28.1 (2.4) 24.4 (4.5) 14.1 (6.3) 25.7 (3.7)
10 29.4 (1.0) 28.7 (1.9) 26.7 (2.9) 16.2 (6.1) 27.8 (2.2)
11 29.7 (0.8) 29.2 (1.3) 27.5 (2.1) 18.9 (5.6) 28.7 (1.4)
TABLE 7
Average VLT Scores With Standard Deviations for NNSEALs
Grade VLT 2,000 VLT 3,000 VLT 5,000 VLT 10,000 VLT AWL
6 16.0 (7.2) 10.2 (6.2) 6.8 (5.2) 2.1 (2.5) 7.3 (5.7)
7 23.3 (5.1) 17.0 (6.3) 12.5 (5.2) 4.0 (3.4) 12.8 (7.0)
8 25.2 (4.5) 21.4 (6.2) 15.6 (5.0) 6.2 (4.7) 18.0 (7.0)
9 27.4 (2.9) 22.9 (5.7) 19.1 (6.2) 8.7 (5.6) 22.3 (5.3)
10 28.8 (2.2) 25.7 (4.5) 22.9 (3.8) 10.2 (5.2) 25.8 (2.5)
11 29.6 (0.9) 27.9 (1.9) 25.5 (2.6) 14.4 (4.6) 27.2 (2.6)
the start of Grade 11. They did not achieve mastery of the 5,000- and
10,000-word levels.
To determine if there were significant trends in the data, a linear
mixed-effects model was fitted to the data with VLT score as the
dependent variable, and group (NS, NNS, and NNSEAL), VLT level,
and grade as independent variables. Two random variables were also
specified in the model: student and cohort.
The results of the model revealed significant (p < .05) three-way
interactions between group, VLT level, and grade level. To determine
TABLE 8
Levels of the VLT Where the NS/NNS/NNSEAL Groups First Scored Similarly
TABLE 10
Coverage of the BNC/COCA High, Mid, and Supplementary Lists Over English, Maths, and
Science Textbooks (%)
above goes a long way to ameliorating the level of lexical difficulty of this
text.
Table 11 shows that the Grade 8 science textbook has the highest
vocabulary load. It contains low-frequency words such as photosynthesis
and microorganism from the 9,000-word BNC/COCA list and they occur
nearly 40 times each; respiration and micron (10,000-word BNC/COCA
list), fission (11,000-word BNC/COCA list), and chlorophyll and phylum
(12,000-word BNC/COCA list). Technical vocabulary plays a major
role in this text.
Research Question 3: What Coverage Does the AWL Provide Over
the Learning Materials and Textbooks?
The Grade 11 maths textbook contained the highest proportion of
AWL words (7.6%), followed by the Grade 8 science textbook
(6.41%), the Grade 11 Science textbook (6.25%), and the Grade 6
maths textbook at 5.61%. Unsurprisingly, the novels contained many
fewer AWL words, around 1.5%, which is similar to Coxhead’s (2000)
finding of around 1.4% coverage of the word list over a 3.5 million–
word corpus of fiction.
DISCUSSION
This study highlights how EMI classrooms can contain learners with
vastly different levels of vocabulary knowledge in English, including
NSs who demonstrate solid knowledge of high-frequency vocabulary in
the early years of their high school study in Grade 6, but who rapidly
develop mastery in academic and mid-frequency vocabulary as their
studies progress. The NNSs whose VLT scores suggested sufficient
knowledge of English vocabulary to be placed in everyday classes
demonstrated a similar but delayed pattern of vocabulary development
to the NSs. In contrast, the NNSEALs needed at least four years to
score the same as their classmates in high-frequency vocabulary, and
five years for academic vocabulary. As in Webb and Chang (2012),
Limitations
Future Research
A VLT item analysis across years and with complete sets of partici-
pant data would be useful to explore the consistency of results, as
would using other assessment measures to triangulate the VLT results,
such as grade point averages. It would be good to see research on new
versions of a school-based VLT using, for example, Greene and Cox-
head’s (2015) Middle School Vocabulary Lists. Future research could
also look at larger scale analyses of learning materials and textbooks
and technical vocabulary. Finally, more research on how learners
develop their understanding of a subject over time and the vocabulary
of that subject is needed.
CONCLUSION
This study tracked the development of receptive vocabulary knowl-
edge over time using the VLT for native and nonnative speakers of
English in an EMI context. Different levels of the test continued to
show gains even after participants had taken the test multiple times.
The greatest gains were made by the nonnative speaker group who
had low levels of English proficiency in Grade 6. The learners with
higher VLT scores would be best placed to cope with textbooks and
learning materials, but over time NNSEAL students have the opportu-
nity to further develop their vocabulary and, theoretically at least, be
able to close the gap between their lexical knowledge and their class-
room texts. Our analysis also suggested that learners would need siz-
able vocabularies to cope with authentic school textbooks, particularly
in maths and science.
ACKNOWLEDGEMENTS
We wish to acknowledge the teachers and students who took part in this study,
and in particular the teacher who gathered the testing data for this project who
THE AUTHORS
REFERENCES
Anthony, L. (n.d.). Antconc. Retrieved from http://www.laurenceanthony.net/sof
tware/antconc/
Baayen, R., Davidson, D., & Bates, D. (2008). Mixed-effects modeling with crossed
random effects for subjects and items. Journal of Memory and Language, 59, 390–
412. https://doi.org/10.1016/j.jml.2007.12.005
Bates, D., M€achler, M., Bolker, B., & Walker, S. (2015). Fitting linear mixed-effects
models using lme4. Journal of Statistical Software, 67(1), 1–48. https://doi.org/
10.18637/jss.v067.i01
Chung, T., & Nation, I. S. P. (2003). Technical vocabulary in specialised texts.
Reading in a Foreign Language, 15(2), 103–116.
Cobb, T. (n.d.). Compleat Lexical Tutor. Retrieved from http://www.lextutor.ca/
vp/comp/
Coxhead, A. (2000). A new academic word list. TESOL Quarterly, 34, 213–238.
https://doi.org/10.2307/3587951
Coxhead, A. (2012). Researching vocabulary in secondary school English texts: The
Hunger Games and more. English in Aotearoa, 78, 34–41.
Coxhead, A. (2018). Vocabulary and English for specific purposes research: Quantitative
and qualitative perspectives. London, England: Routledge.
Coxhead, A., Nation, P., & Sim, D. (2015). Vocabulary size and native speaker sec-
ondary school students. New Zealand Journal of Educational Studies, 50, 121–135.
https://doi.org/10.1007/s40841-015-0002-3
Coxhead, A., Stevens, L., & Tinkle, J. (2010). Why might secondary science text-
books be difficult to read? New Zealand Studies in Applied Linguistics, 16(2), 35–52.
Coxhead, A., & White, R. (2012). Building a corpus of secondary school texts: First
you have to catch the rabbit. New Zealand Studies in Applied Linguistics, 18(2),
67–73.
Davies, M. (2008). The Corpus of Contemporary American English: 520 million words,
1990–present. Retrieved from http://corpus.byu.edu/coca/