Professional Documents
Culture Documents
Huang-2013-Comparing The Effects of Test Anxiety On Independent and Integrated Speaking Test Performance
Huang-2013-Comparing The Effects of Test Anxiety On Independent and Integrated Speaking Test Performance
REVIEW OF LITERATURE
Definition and prior empirical work. To date, the most popular def-
inition of test anxiety situates itself within the state-trait model
advanced by Spielberger (1966). This model distinguishes between
state anxiety and trait anxiety, with the former denoting “a transitory
emotional state or condition” of tension, apprehension, and autonomic
nervous reactions and the latter concerning the “relatively stable
individual differences in anxiety proneness” to respond with A-state
reactions in an array of stimulus situations (Spielberger, 1972, p. 39).
Defined in reference to this dichotomous distinction, test anxiety has
thus been conceptualized as “a situation-specific personality trait” that
responds with heightened anxiety to evaluative situations (Spielberger,
Anton, & Bedell, 1976, p. 323). Because the research reported here
concerned itself primarily with the anxiety reactions test takers experi-
enced at the time of completing oral test tasks, this psychological
construct was thus operationalized as state anxiety in this study.
In the field of education, test anxiety has been shown to bear a
moderate inverse association with performance in a wide range of test-
ing contexts (Zeidner, 1998). Likewise, in the realm of foreign lan-
guage learning, the relevant studies have presented a similar, though
less conclusive, picture. For instance, Steinberg and Horwitz (1986)
METHOD
Participants
Instruments
Speaking test tasks
Task description. For the independent tasks, the test takers needed
to capitalize on their world knowledge and personal experience to formu-
late oral responses without the benefit of input provided ahead of time.
Counterbalancing the topics. The topics of the test tasks were coun-
terbalanced for the integrated task and the independent task in the
administration phase; that is, half of the participants received one
combination of topics, and the other half were assigned the other
topic combination. This counterbalancing practice arose from the
need to rule out possible topic effects. That is, if the participants all
received the same topics for the independent tasks and the integrated
tasks, measuring the influence of test anxiety on speaking perfor-
mance would be confounded with the topic effects. In other words, it
would be difficult, if not impossible, to discern clearly whether the
change in performance was due solely to test anxiety or resulted from
the interaction of test anxiety and the particular topics used. However,
no counterbalancing occurred for the topics of the practice tasks,
because these tasks merely functioned to get the participants accli-
mated to the test and did not provide data for the official analyses.
Table 1 illustrates the topics for the test tasks and the practice tasks.
Topic
The data collection took place in two computer labs, where each
participant had access to a desktop computer equipped with head-
phones and a microphone. Specifically, the participants were randomly
assigned to one of the two computer labs (lab A and lab B) and took
different combinations of the oral test tasks according to their lab
assignment. That is, although the participants from both labs took the
practice tasks on the same topics, lab A participants took the indepen-
dent task on the topic of biofuels and the integrated task on the air
transportation industry, whereas the lab B participants performed the
two test tasks on reversed topics. The participants first completed the
two independent tasks and then answered an SAI to indicate the level
of anxiety they experienced during each of these tasks. In tandem with
the completion of the SAI, the participants undertook the two inte-
grated tasks and then completed another SAI, the scores of which rep-
resented the anxiety they endured during the taking of this second set
of tasks.
The researchers coded the responses on the SAIs and rated the per-
formance on the oral test tasks. With respect to the SAIs, for the posi-
tively worded items the researchers represented the four response
categories with four ascending numerical values, and for the negatively
worded items they represented the categories with four descending
numerical values. In so doing, they allowed the same numerical value
to reflect a similar amount of anxiety.
In regard to the oral test tasks, the researchers rated the test takers’
performance on the two tasks with reference to the independent
speaking rubrics and integrated speaking rubrics developed by ETS to
evaluate the performance on the speaking component of the TOEFL-
iBT. In accordance with these rubrics, the researchers each evaluated
all speech segments in terms of three criteria: delivery, language use,
and topic development. For each segment, they assigned one score
band on a scale ranging from 0 to 4 for each criterion, totaling three
score bands. The three score bands awarded for each segment by each
researcher were then summed into a composite score. Finally, the two
composite scores generated for each speech segment, one from each
researcher, were then averaged to numerically represent the overall
quality of that particular speech segment. The Cronbach’s alpha coeffi-
cient was found to reach a level as high as .92, suggesting an excel-
lent consistency to be associated with the ratings produced by the
researchers.
Data Analysis
e1
Anxiety_ Independent
Independent Performance
Anxiety_ Integrated
Integrated Performance
e2
Anxiety_ a Independent
Independent Performance
Anxiety_ a Integrated
Integrated Performance
e2
Correlation
Observed variable 1 2 3 4 M SD
1. Anxiety independent 1 57.13 9.61
2. Anxiety integrated .69 1 55.79 9.77
3. Independent performance –.31 –.25 1 7.67 2.31
4. Integrated performance –.27 –.34 .63 1 7.56 2.24
TABLE 3
Correlations, Means, and Standard Deviations for the Observed Variables for Topic Combi-
nation B (n = 169)
Correlation
Observed Variable 1 2 3 4 M SD
1. Anxiety independent 1 51.76 10.88
2. Anxiety integrated .60 1 52.62 9.74
3. Independent performance –.23 –.13 1 7.65 2.79
4. Integrated performance –.17 –.21 .72 1 7.46 2.46
RESULTS
Correlations
The baseline path model for each topic combination contained two
paths and two correlations among the four observed variables. Figure 3
.69 .60
Chi-square =1.405
df = 2
p = .495 e2
CFI = 1.000
RMSEA = .000
SRMR = .0297
.60 .71
Chi-square =.519
df = 2
p = .771 e2
CFI = 1.000
RMSEA = .000
SRMR = .0191
DISCUSSION
Following the path analyses performed on the collected data, three
major findings came to light: absence of topic effects, adverse impact
of test anxiety on speaking performance, and nondifferential
FUTURE RESEARCH
ACKNOWLEDGMENTS
This study was undertaken based on part of the research data collected for the
first author’s doctoral dissertation sponsored by the TOEFL Small Grants for Doc-
toral Research in Second or Foreign Language Assessment Program at ETS. Thus,
we would like to express gratitude to this program for its financial support.
THE AUTHORS
Heng-Tsung Danny Huang obtained his PhD in foreign language education from
the University of Texas at Austin, in the United States, and is currently working as
an assistant professor in the Department of Foreign Languages and Literature at
National Sun Yat-Sen University, in Taiwan. His research interests include lan-
guage testing, computer-assisted language teaching and learning, and quantitative
research methods.
Shao-Ting Alan Hung received his doctoral degree from the Department of Liter-
acy, Culture and Language Education at Indiana University, Bloomington, in the
United States. He is currently an associate professor in the Department of Applied
Foreign Languages at National Taiwan University of Science and Technology, in
Taiwan. His research interests include computer-assisted language teaching and
learning, second language writing and speaking pedagogy, language assessment,
and language teacher education.
REFERENCES
Bachman, L., & Palmer, A. S. (1996). Language testing in practice. Oxford, England:
Oxford University Press.
Bonacci, S., & Reeve, C. L. (2010). The nature and relative importance of
students’ perceptions of the sources of test anxiety. Learning and Individual
Differences, 20, 617–625. doi:10.1016/j.lindif.2010.09.007
Brown, T. A. (2006). Confirmatory factor analysis for applied research. New York, NY:
Guilford Press.
Butler, F. A., Eignor, D., Jones, S., McNamara, T., & Suomi, B. K. (2000). TOEFL
2000 speaking framework: A working paper (TOEFL Monograph No. 20). Prince-
ton, NJ: Educational Testing Service.
Chung, S. C., & Lung, C. F. (1984). 修訂情境與特質量表之研究 [Modifying the
State-Trait Anxiety Inventory]. 測驗年刊 [Psychological Testing], 31, 27–36.
Douglas, D., & Selinker, L. (1992). Analyzing oral proficiency test performance in
general and specific purpose contexts. System, 20, 317–328. doi:10.1016/346-
251X(92)90043-3
Downing, S. (2002). Threats to the validity of locally developed multiple-choice
tests in medical education: Construct-irrelevant variance and construct under-
representation. Advances in Health Sciences Education, 7, 235–241. doi:10.1023/
A:1021112514626