Professional Documents
Culture Documents
Research Principles Methods and Statisti PDF
Research Principles Methods and Statisti PDF
Applied Linguistics
Ebrahim Khodadady (PhD)
Ferdowsi University of Mashhad
September 2013
I
Preface
Research Principles, Methods and Statistics in Applied Linguistics has
been written for university students who major in various fields related
to English language, literature and translation in Iran. Although there are
a number of related textbooks such as Understanding research in second
language learning: A teacher’s guide to statistics and research design
(Brown 1988), Research design and statistics for applied linguistics
(Hatch & Farhady 1982), and The research manual: Design and
statistics for Applied Linguistics (Hatch & Lazaraton 1991) available in
the market, the present textbook enjoys several features which make its
contribution to the field indispensible.
The second new and separate chapter in the present textbook addresses
translation as a research method, i.e., chapter 7. It relates translation
research to other types of research projects and then employs schema
theory to explore translation research projects from macrostrutural and
microstructural perspectives. The chapter resorts to various rendering of
the Quran and Mathnavi Manavi (Rumi, 2001) to provide tangible
examples of qualitative translation research projects.
The third new and separate chapter in the present textbook provides the
literature and rationale to establish a new type of translation research
based on schema theory through which translators choose equivalents on
the basis of their background knowledge with both source and target
languages. In addition to the novelty of the method, it contrasts
quantitative research with its qualitative counterpart in order to help
readers decide which type they prefer to utilize in their own projects.
Pronunciation Symbols (British English)
Consonants Vowels
Symbol Key Word Symbol Key Word
p pen i sit
b bad i: sheep
t ten e bed
d desk æ bad
k key ǎ cut
g get a: father
č chair ǒ pot
j jump o sort
f few u put
v very u: boot
θ thing з father
ð then з: bird
s soon ā make
z zero ō note
š she ī bite
ž pleasure aw now
h hot oy boy
m sum iз here
n sun eз there
ŋ sing uз poor
l led āз player
r red ōз lower
y yet īз tire
w wet awз tower
oyз employer
/ / … slant lines used in pairs to mark the beginning and end of a transcription
' … mark preceding a syllable with primary (strongest) stress as in
discover /dis'kǎvз/
` … mark preceding a syllable with secondary (next-strongest) stress as
in indication /`indi'kāšn/
Table of Contents
Page
Preface I
Pronunciation Symbols IV
Table of Contents V
List of Tables XIII
List of Figures XVII
List of Appendices XXI
References 327
List of Tables
Page
Table 2.1 Codification of gender as a categorical variable 14
for 10 hypothetical participants
Table 2.2 Ordinal scale of writing ability at advanced level 15
Table 2.3 Conversion of ranks into ordinal numbers, 17
cardinal numbers and scores
Table 2.4 Five areas of learning addressed by beliefs 25
explored by the BALLI
Table 3.1 Schema domains, genera, type, token and their 36
ordered percentage in the RMTIAR
Table 4.1 Correlations coefficients of three tests 55
Table 5.1 Classification of test takers based on their IQs. 68
Table 5.2 The first two blocks of the random numbers 72
appearing on page one of Appendix 5.1
Table 5.3 Students enrolled at five strata of school 74
education in Iran in 2004-2005
Table 5.4 Tertiary education students in 2004-2005 75
Table 5.5 The number of undergraduate and graduate 77
students majoring in the specified academic fields
offered in Dr. Ali Shariati Faculty of Literature
and Humanities at FUM in 2008
Table 5.6 Random assignment of 135 participants into three 80
groups
Table 5.7 Standard deviation range in samples having 82
normal population
Table 6.1 IQ means and standard deviations for whites and 89
black
Table 6.2 Mean values of nasal resistance at AAR 97
(Pa/cc/sec), nasal endoscopic scores and nasal
symptoms, before and after treatment
Table 6.3 Comparison and statistical analysis of results 97
List of Figures
Page
Figure 2.1 Interaction of independent and dependent 22
variables
Figure 5.1 Intelligence IQ normal curve 69
Figure 6.1 Black Scores on Four Tests of Cognitive Ability 90
Figure 8.1 Hierarchical relationship of schemata comprising 143
the Quran as a written text
Figure 9.1 Page setup in word 162
Figure 9.2 Original typed text in word 164
Figure 9.3 Find function activated 164
Figure 9.4 Replace function activated 165
Figure 9.5 Successful functioning of Replace All command 165
Figure 9.6 Depunctualized schemata 166
Figure 9.7 Commands involved in converting texts to table 167
Figure 9.8 Last command in converting texts to table 167
Figure 9.9 Schemata column converted to a table 168
Figure 9.10 Forming the column of the first variable called 169
schema
Figure 9.11 Schema and its codes as variables 170
Figure 9.12 Codification of schema tokens 171
Figure 9.13 Coded schema domains, types and tokens 172
Figure 9.14 Naming Sheet 2 for sorting schemata 172
Figure 9.15 Creating variables in the Sorted sheet 173
Figure 9.16 Activating the Ribbon command 173
Figure 9.17 Maximized ribbon 174
Figure 9.18 Activating Sort & Filter dialogue box 174
Figure 9.19 Sorted schemata 175
Figure 9.20 Some sorted schemata and their frequency 176
Figure 9.21 SPSS Data Editor 178
Figure 9.22 SPSS variable types 178
Figure 9.23 Numeric variable types 179
Figure 13.12 Activating the linear regression dialogue box and 279
specifying variables
Figure 13.13 Marking relevant boxes in Statistics dialogue 279
box
Figure 13.14 Specifying functions in Plots 281
Figure 13.15 Specifying functions in Save 281
Figure 13.16 Forming block 1 291
Figure 13.17 Forming block 2 291
List of Appendices
1 Research: Definition
1.1 Introduction
In general, all types of research are done in order to solve problems.
Before embarking on solving any problem one must first decide what
the nature of the problem is. Without a clear definition of the problem,
taking any steps would be a waste of time, effort, energy and of course
money. The necessity of defining the problem before doing anything
sounds quite obvious. There are, however, a large number of people who
usually do something when they encounter a problem and then start to
think about what they have done.
Many parents, for example, face the problem of leisure time in summer.
Since schools close down, their children stay home and get bored. Not
having anything to do or any places to attend may push children to
juvenile delinquency and thus destroy their future lives. Children's free
time and the necessity of filling it with proper activities, therefore, poses
a problem which calls for parents’ immediate solution.
You can guess that parents may adopt various solutions. But which one
is the best and the most useful? As soon as we bring up the topic of
finding the best solution, we need certain principles to fulfill our search
in the best possible manner. These principles will be discussed in the
context of definitions offered for research by a number of scholars.
What established principles are and whether approach and way indicate
the same methods or procedures followed in research designs need to be
explained and discussed by the proposers of single definitions. The
explanation of key words used in single definitions, therefore, render
them complex and difficult to grasp and convey. For this reason, I
believe offering a composite definition captures the nature of research
and sheds light on some research features which are usually overlooked
in single definitions.
While visiting ministry of education, for example, you may come across
some teachers who complain about lack of discipline in their classes. If
you are lucky enough, you may notice an announcement on the board
calling researchers to submit their proposals on a number of issues such
as the effect of media on language use, cultural invasion, and linguistic
imperialism.
Most students usually follow three methods when they face an unknown
word: contextual, authoritative, and referential. They might read the
sentence in which the term simplicity has occurred several times and
focus on its surrounding words and sentences to guess its meaning. The
second method would be to ask an authority, i.e., their English
instructor. And finally, they might consult a reference book such as a
dictionary as the third method.
The personal nature of essays does not, however, entail that their writers
do not employ evidence or logical arguments to support their claims and
adopted positions. The evidence, however, might be idiosyncratic.
Orwell (1958), for example, announced that “from a very early age,
perhaps the age of five or six, I knew that when I grew up I should be a
writer” (Bott 1958, p. 99). Orwell’s statement might mean that great
writers know from their early ages that they will become writers!
Besides, such a claim is too subjective to be verified by a third party
under controlled conditions.
I remember quite well that one of my colleagues was doing his PhD in
soil sciences in the agriculture department of a university in Australia. In
order to study the effect of a certain substance on a sample soil, he had
to use a solution which was mentioned in a study. Since the required
solution was not available in the market, he followed the descriptions
given in the Materials subsection to produce the solution in his own lab.
After his frequent attempts bore no fruit, he contacted the author of the
study who was luckily available. It was then he realized the writer of the
study had forgotten to describe a small step in the process of producing
the solution!
1.4 Summary
Research is an indispensible part of any tertiary or higher education
center. According to Burns and Sandra (2003), old universities were
founded primarily as “research-based institutions,” (p. 24) where
research was conducted for the sake of research. In other words, old
universities addressed only those topics which attracted their own
scholars’ attention.
1
Prose /prōz/ n. the ordinary form of written or spoken language
2
Verse /vз:s/ n. each of the lines of a poem
2 Research Variables
2.1 Introduction
In Chapter One we realized that there would be no research if we do not
face a problem related to language learning and teaching. We also
learned that all research projects must be replicable, i.e., if other
researchers conduct our research, they must get the same or very similar
results. This means that we need to operationalize our research problems
so that there will be no misunderstanding on the part of our research
users. In other words, we must tell our readers what exactly we mean
when we use particular key words in expressing our problems.
It does not matter what type of research you wish to conduct, you must
identify both constants and variables of your project from the very
beginning. After identifying the variables of your research, you should
determine their function so that you can explore their possible
relationship. Variables should therefore be approached from two
perspectives in all types of research: psychometrically and functionally.
Table 2.1
Codification of gender as a categorical variable for 10 hypothetical
participants
As shown in Table 2.1 above, in the fourth and fifth columns, different
signs are used to show the two levels of gender. Similarly, the sixth and
seventh columns show different numbers indicating the same levels.
3
The word project has been used in two senses. It means both a
definitely formulated piece of research and a task engaged in by a
group of students to supplement and apply classroom studies.
Table 2.2
Ordinal scale of writing ability at advanced level
Rank Description
Excellent Error-free, substantial and varied material, resourceful and controlled
in language and expression.
Very good Very good realization of task, ambitious and natural in style.
Good Sufficient assurance and freedom from basic error to maintain theme.
Rank Description
Pass Clear realization of task, reasonably correct and natural.
Weak Near to pass level in general scope, but with either numerous errors or
too elementary or translated in style.
Very poor Basic errors, narrowness of vocabulary
In scales like the one presented in Table 2.2 the intervals between the
ranks are not equal. We cannot, for instance, say that the interval
between Excellent and Very good is the same as the interval between
Very good and Good. We can, however, claim that the ability of an
excellent student in writing essays is higher than the ability of a very
good student.
The labels of ranks such as excellent, very good and good can be
replaced by ordinal numbers 1st, 2nd, 3rd, … nth. These ordinal numbers
are in turn replaced by cardinal numbers 1, 2, 3, … n for purposes of
calculation. When the ordinal numbers are replaced by cardinal
numbers, the assumption underlying the intervals between ranks
changes.
Table 2.3
Conversion of ranks into ordinal numbers, cardinal numbers and scores
4
Suggestopedia, originally developed by Lozanov (1978), is a teaching method
incorporating music, relaxation and suggestion. According to Lozanov, we use only
five to 10 percent of our mental capacity. This percentage can be increased by
desuggesting the feeling that we cannot succeed in learning a modern language.
There are, however, some research projects that are quite complex. One
of these complex projects belongs to Purpura (1997). These projects are
factorial designs, which permit the investigation of additional
independent variables. They also provide researchers with an
opportunity to study the interaction of an independent variable with one
or more other variables called moderator variables.
Figure 2.1
Interaction of independent and dependent variables
Cognitive processing 1 2 3 4
Metacognitive processing 5 6 7 8
5
Polyposis /pǒlз'ōsis/n. a condition in which numerous small growth develop in a
hollow organ such as the nose
6
Idiopathic /idiз'pæθik/ adj. describes a disease or disorder that has no known cause
7
Ethmoid /'eθmoyd/ adj. of, relating to, or being a light spongy bone located between
the orbits, forming part of the walls and septum of the superior nasal cavity, and
containing numerous perforations for the passage of the fibers of the olfactory nerves
a single research project would be too broad to handle. For this reason,
Businco, Businco, Lauriello, and Tirelli (2004) controlled the human
diseases in their study and included those human patients who suffered
only from nasal polyposis.
Businco, Businco, Lauriello, and Tirelli (2004), for example, studied the
effect of nasal polyposis on the anxiety level of 30 male and female
patients whose age ranged between 18 and 77. We may say that the
participants studied in this research might have all been from broken
families where they did not receive emotional support. Patient’s family
background, however, is considered personal and no researcher is
allowed to question it unless they volunteer to disclose it themselves.
The family background might therefore have been an interfering
variable in this research.
Table 2.4 presents the five areas along with the percentage of their
constituting beliefs. As can be seen, belief one and eight other beliefs
have been subsumed under foreign language aptitude (FLA). Based on
this logical categorization Horwitz claims that the FLA is an internal
variable whose existence is manifested by nine observable ordinal
beliefs.
Table 2.4
Five areas of learning addressed by beliefs explored by the BALLI
# Of
# Areas Beliefs Percentage
Beliefs
1 Difficulty of language 3, 4, 6, 23, 27, 34 6 17.6
learning
2 Foreign language aptitude 1, 2, 10, 14, 21, 28, 31, 9 26.5
32, 33
3 The nature of language 5, 8, 11, 15, 19, 24, 25 7 20.6
learning
4 Learning and 7, 9, 12, 13, 16, 17, 18, 8 23.5
communication strategies 20
5 Motivations and 22, 26, 29, 30 4 11.8
expectations
Total 34 100
Ghobadi (2009) and Hong (2006) used a statistical test called Factor
Analysis to explore whether Horwitz’ (1988) identification of five
logical areas has any empirical or factorial validity. They administered
the BALLI to 428 Korean speaking and 423 Persian speaking
undergraduate students, respectively, and asked them to indicate
whether they completely agreed, agreed, had no idea, disagreed or
completely disagreed with the statement. Since the participants’
responses to belief one was collected and analyzed statistically, it was a
surface ordinal variable.
2.5 Summary
Research in applied linguistics is conducted to find sound solutions to
language learning and teaching problems by adopting systematic
approaches. All the problems explored in the field basically deal with
attributes which vary from learners to learners, teachers to teachers and
texts to texts, to name a few. These attributes are technically referred to
as variables. They usually assume three roles in research projects:
psychometric, functional and latent.
3 Research Hypotheses
3.1 Introduction
Identifying a problem will not lead to a verifiable and valid research
project alone if you do not study what other researchers have already
done on related problems and use their findings to formulate your
possible answers to your research problems. This is exactly what great
inventors like Thomas Edison did. According to Microsoft Encarta
(2006), before he started to do any type of experiment, Edison tried to
read all the literature on the subject to avoid repeating experiments that
other people had already conducted. Perhaps the best illustration of
Edison's working method is his own famous statement: "Genius is one
percent inspiration and 99 percent perspiration."
In the three questions raised above, there are three keys terms which
need to be defined before we proceed with the analyses of hypotheses
formulated on these questions. (Similarly, whenever you employ certain
words which play a very important role in your research, you ought to
define them very precisely so that there would be no misunderstanding
on the part of your readers if they use the same words in different
meanings.) They are portfolios, traditional achievement tests, and
critical thinking.
8
A schema, the singular form of schemata, “is any concept realized in a word or
phrase, syntactic or semantic, closed or open, syntagmatic or paradigmatic, which can
stand by itself or combine with other concepts to produce an idiosyncratic image in the
mind of a given person. This image has a direct relationship with the person’s
experiences with the concept gained through its application with other semantically
and syntactically related concepts. Schemata are idiosyncratic because individuals
differ from each other in terms of their experiences” (Khodadady 2001, p. 111).
Item five above, for example, presents test takers with the four choices
load, stuff, pack and fill. Since these choices all have the meaning put
something in, they compete with each other to be chosen as the correct
answer and are therefore referred to as competitives (Khodadady 1997).
However, the utterance "He's alive!" which comes before the competitives
rules out load, pack and fill because they do not indicate animals which are
killed and treated to be filled with special materials. The only competitive
which describes these animals is stuffed.
Test 1: Inference
An inference is a conclusion a person can draw from
certain observed or supposed facts. For example if the lights are on
in a house and music can be heard coming from the house, a
affected her students’ achievement and critical thinking ability. For this
purpose, she raised three questions for which she offered some possible
answers. We will turn to these answers in the next section.
Faravani (2006), for example, wanted to find out whether using reading
portfolios in her classes will help her students gain higher scores not
only on their achievement test but also on Watson-Glaser’s Critical
Thinking Test. For this purpose, she divided her 32 general English
female students into two classes: a control class and an experimental
class.
However, before she started her research project, she formulated three
null hypotheses, two of which are given below. They helped her be
consistent throughout her project.
Table 3.1
Schema domains, genera, type, token and their ordered percentage in the
RMTIAR
3.5 Summary
The development and conduction of a research project depends on
facing a genuine problem and taking principled steps to solve it. The
most fundamental step in this regard is reviewing literature and
employing the findings of other researchers in order to refine the
research problem and formulate hypothesis upon which appropriate
research methods can be chosen. Null hypotheses supply researchers
with the most straightforward answer assuming the lack of any
significant relationship between independent and dependent variables.
Directional hypotheses, however, help researchers focus on a significant
relationship which must rationally exist between two variables. The
specification of variables and the statement of their expected
relationships in hypotheses pave the way to ensure that all the
requirements of an acceptable research project are met. These
requirements or characteristics are discussed in chapter four.
3.6 Application
Read the following hypotheses and indicate whether they are null or
directional.
4 Characteristics of Research
4.1 Introduction
In teaching language, teachers may face many problems requiring
appropriate solutions. The problems may be related to learners, teaching
methods, syllabus, schools, parents and peers, to name a few. In chapter
one, we learned that the first step in conducting a research would be
reading the literature in order to find out what others have done so far.
The review of literature will help us not only save time, energy and
resources but also refine our problem and define it in an operationalised
manner which enables us to collect data and run statistical analyses.
4.2 Validity
Validity denotes the acceptability of an object or an argument as factual
according to certain criteria. For example, the validity of an original
passport depends on its being officially issued for a fixed period of time.
In other words, official approval and having dates of issue and expiry
are among various criteria employed to check the validity of passports.
Another criterion for their validity is bearing a recent photograph of its
holder.
9
Attributes of a person, a piece of text, or an object which varies from person to
person, text to text, object to object, or from time to time (Hatch &Lazaraton, 1991,
p.51)
location, and instruments employed in the projects are among the major
variables that affect their internal validity.
4.2.1.1 Researchers
Some research projects are narrow enough to be conducted by a single
researcher. Others are, however, too broad to be handled by one
researcher, especially when a large amount of data should be collected.
In these research projects bias, implementation, and presence of
researchers and data collectors will have an effect on the nature of the
data they obtain.
4.2.1.1.1 Bias
Bias indicates a researcher’s tendency to consider one person, group, or
idea more favorably than others do. This bias stands in sharp contrast to
that of participants, i.e., halo effect (e.g., Brown 1988, p. 33), and may
influence the researcher’s attitude unfairly in favour of or against the
participants and/or methods he employs in his project.
data-collectors to find and remove their biases and thus standardize all
procedures by their agreement.
4.2.1.1.2 Implementation
Sometimes occasions arise when researchers have to seek the help of
some people to implement the research projects. For example, in
investigating the effect of using two different teaching methods in
Iranian schools, e.g., audio-lingual and silent way, a female researcher
may invite four teachers to join her study. She may ask two teachers to
teach a certain text audiolingually and the other two to teach the same
text in silent way. It may turn out that students performed better in
audio-lingual classes simply because their teachers were better than the
other two. This is known as implementer threat to internal validity.
The implementer threat may also happen when some researchers are
biased towards certain methods. All four teachers mentioned above
might prefer audio-lingual method and the two teachers employing the
silent way in their classes may do so reluctantly. As a result, the audio-
lingual group may perform better on the achievement test.
4.2.1.1.2 Presence
As researchers we should take whatever steps necessary in order to be
present in the field where data is collected. Participants in a research
project love to see that their participation is appreciated. They will take
whatever tests administered to them seriously if they are administered
by the researchers themselves. Since the effect of researchers’ presence
was first discovered in a plant named Hawthorne, it is referred to as
Hawthorne effect in research literature.
4.2.1.2 Participants
Students, parents or other people involved in educational programs such
as teachers and school staff may form the participants of research
projects in applied linguistics. Some researchers employ the term
subjects instead of participants (e.g., Biria 2002, Tajareh & Tahririan
2003). I believe employing participants in the research projects
conducted in social sciences is more descriptive of the people who
voluntarily take part in them. The term participant differentiates humans
from non-human subjects upon whom some experiments are conducted
without their approval in fields such as biology and zoology.
4.2.1.2.1 Attitude
Some research projects require having two groups: experimental and
control. While the experimental group is exposed to research treatment,
the members of the control group receive no treatment at all. If they
realize that they are not treated as the experimental group, they may
become demoralized or resentful and hence perform more poorly than
the treatment group. It may thus appear that the experimental group is
performing better as a result of the treatment when this is not the case.
The following example is given by Fraenkel and Wallen (1993).
4.2.1.2.2 Attrition
When research extends over a long period of time, it may not only affect
the participants’ tolerance but also change the composition of the sample
studied. This change of composition brings about attrition (Seliger &
Shohamy, 1989, p.101). As research takes months to be completed,
some participants may lose their interest and drop out of study. It is also
possible some participants may become sick on certain days and fail to
attend and therefore the data related to their performance on certain
tasks related to the research project will be missing. This will result in
having unequal number of participants on the tasks which may put the
internal validity of the project at risk.
10
a technical term indicating an increase in scores
Attrition not only limits the generalizability of research but also can
introduce bias–if those participants who dropped out would have
answered the questions differently from those who completed the
research. Quite often this is the case because those who drop out or are
absent act this way for a reason.
4.2.1.2.3 Expectancy
Participants are indispensible members of the society in which
researchers conduct their projects. They are therefore well aware of the
norms and expectations of their society. When they come across the
projects which address their norms, they go with the crowd and answer
what they think the researchers expect them to answer. This threat to
internal validity which is known as subject expectancy in the literature
can be removed by adopting appropriate strategies. The first and most
important one requires keeping the identity of participants anonymous.
For this purpose, codes can be specified in advance and the participants
can be asked to employ the codes rather than their names.
4.2.1.2.4 History
In addition to participants’ attitude, attrition and expectancy, their
history also affects the internal validity of research projects. History
refers to whatever happens during research with or without researcher’s
knowledge. If the researcher knows his participants’ history, he might
take necessary steps to control some variables and thus save the internal
validity of his study. If he ignores the history or remains unawares of it,
he jeopardizes the internal validity.
A male English teacher, for example, may wish to explore whether silent
way11 is a more effective method in teaching English to his beginner
11
A method of foreign-language teaching developed by Gattengo which makes use of
gesture, mime, visual aids, wall charts and wooden sticks of different lengths and
colours. In contrast to methods such as Audio-Lingual Method, teachers who employ
Silent Way must stay silent most of the time and use gestures various types of aids in
4.2.1.2.5 Maturation
Maturation indicates growing or developing naturally over time. Many
biological, psychological and emotional changes occur over a long
period of time as a result of maturation. If it is not taken into account,
researchers might take those natural changes as the results of their
research treatment.
4.2.1.2.6 Selection
Participants’ selection relates to their being almost the same in all
characteristics except the variables under investigations. In other words,
certain attributes must stay constant in a research project in order to
ensure that the variables under investigation are treated properly. These
constant attributes might include developmental, educational, political,
social and economic backgrounds.
order to help their learners produce express what they see (see Richards & Rogers,
1986).
12
A method of teaching foreign languages based on five principles: (1) language is
speech, not writing, (2) a language is a set of habits, (3) teach the language, not facts
about the language, (4) a language is what native speakers say, not what someone
thinks they ought to say, and (5) languages are different (Moulton, 1961)
Participants
A total of 30 consecutive patients were enrolled (16 male, 14
female, age range 18-77 years, mean 45.6), all affected by
idiopathic ethomoidal NP, primary or recurrence. Patients
affected by asthma, mental diseases, chronic diseases, in
general, or used any other drugs during the study, were
excluded (p. 327).
4.2.1.2. Sympathy
The sympathy of participants towards researchers is another factor
showing the inappropriateness of calling them subjects. While
participants have the capacity to sympathize with researchers, subjects
such as laboratory animals cannot.
4.2.1.2.7 Tolerance
Tolerance concerns the participants’ capacity to function normally while
a research project is conducted. Although Hatch and Lazaraton (1991,
p.35) discuss features such as getting tired and/or bored under
maturation, differentiating maturation from tolerance seems necessary.
While maturation entails advancing involuntarily in a developmental
direction over a relatively long period of time, tolerance accrues when
participants experience a physical or emotional change in a short span of
time. Maturation and tolerance should therefore be separated from each
other.
4.2.1.3 Location
The particular location where a research project is conducted may affect
the outcome of a research project. The participant who takes a test in the
classroom which is larger, has better lighting and air-conditioning, will
perform better than those placed in small, dark and smelly classrooms.
countries like Iran, few books can be found in classes. Even central
libraries of some Iranian universities lack many newly published books
and journals on English literature and applied linguistics. The number of
specialized books and journals even vary from city to city and university
to university.
4.2.1.4 Instruments
Not only participants’ characteristics and location but also instruments
used in a research project will influence its internal validity. Instruments
relate to whatever tools a researcher employs to quantify his variables.
Some common instruments used in educational research projects include
tests, questionnaires, tapes and films.
No one can dispute the validity of the translation question given above
because it requires translating the sentence from English to Persian. Seif
and Khodadady (2003), however, claimed that schema-based cloze
MCITs, which are designed in only one language, i.e., English, can
measure their takers’ Persian translation ability indirectly.
Table 4.1
Correlations coefficients of three tests
4.2.1.4.4 Directions
Directions relate to all instructions the participants in a research project
must follow to complete the required tasks. Before the directions are
included in the final draft, they must be carefully planned and piloted.
While planning requires specifying exactly and clearly what the
participants must do during the research, piloting indicates administering
the instrument and directions to a representative sample of participants
and analyzing their performance before administering them to the target
sample.
One of the students handed the test booklet to the present author and he
could find directions neither on the questioned page nor on the previous
one. After close scrutiny he could spot the directions in the top corner of
page three where it was stapled to other pages! They were written in
English and were virtually undistinguishable from the test items. The
test was designed on an ESP13 course for Iranian university students
majoring in industrial management. If the students had not sought for
help and if he was not there, they would have answered the questions
differently and thus invalidate the research.
4.2.1.4.5 Subjectivity
Instruments employed in a research project may constitute a threat to the
internal validity if they change according to the person who uses them.
This is usually referred to as subjectivity or “instrument decay”
(Fraenkel & Wallen, 1993, p. 225). It occurs when the instrument
permits different responses as in open-ended questions and essays.
13
English for specific purposes (ESP) courses focus on distinctive features of the
language, especially vocabulary, that are most immediately associated with its
restricted use, e.g., technical terms in agriculture (Munby, 1978, p. 2)
answers must be written by the test designer and answer keys for scoring
the instrument must be prepared before administering the instrument.
For determining whether the students perform better by using the new
text, the researcher has to develop an achievement test on the basis of
content presented in both new and regular texts and administer it to his
students at the beginning of the term. This is technically referred to as
pretest.
At the end of the research or term, the researcher must also administer
the same test, i.e., pretest, to the same students in the same location. This
is called posttest. A significant change in the performance of students on
the pretest and posttest may show which textbook leads to a better or
higher performance on the part of students.
The students may score higher on the posttest because they studied the
new textbook. They may also be test-wise and use the items of the
pretest to determine what will be studied during the research and
accordingly make a greater effort to learn the material. This is known as
test effect.
14
a test designed to “measure the degree of students’ learning from a particular set of
instructional materials” (Farhady, Jafarpoor & Birjandi, 1997, p.24)
otherwise he cannot support his claim that the students scored higher
because the new textbook was taught instead of the regular one.
It seems that Seliger and Shohamy (1989) used internal and external
validities interchangeably. This indiscriminate application results from
their attempt to relate the concept of external validity to research types
in terms of their function, i.e., basic, applied and practical. For example,
in their application of the effect of time on external validity Seliger and
Shohamy wrote:
findings of basic research, i.e., language acquisition, are not meant for
being applied to the classrooms. This is the responsibility or purpose of
applied research. As long as a research project is announced to be basic
in nature, looking for its external validity would be unreasonable.
Though in many cases some researchers try to find out whether the
findings of basic research have any bearing on real life situations.
4.3 Reliability
Reliability of a research instrument such as a test refers to its ability to
provide its users with dependable results. This means that whatever
instruments you employ in designing and conducting your research
project, they must be robust in their content and design in order to yield
consistent results.
For example, if we study the time under which two participants in our
research project read a passage and answer questions related to the
passage correctly and notice that the first participant spends 5 minutes
whereas it takes 10 minutes for the second, we can say that the first
participant reads two times faster than the second.
As the examples of time spent on reading and marking the essays on two
different occasions show the selection of variables in terms of their
psychometric features will affect the reliability of our results. And since
4.4 Feasibility
Before any research project starts, its designers must ask themselves
whether conducting their project would be practical in terms of available
expertise, resources and facilities. For example, if we are interested in
exploring a topic which requires a highly sophisticated software, we
must decide whether we can master the software and work with it within
the projected time.
4.5 Summary
For a research project to be acceptable, it must meet the requirements of
validity, reliability and feasibility. It will be a valid project if its
designers, conductors, participants, instruments and location are studied
carefully and all necessary steps have been taken to foresee and control
extraneous variables which might affect its results.
5.1 Introduction
Applied linguistics deals with identifying the findings of almost all
fields which are related to human languages directly and indirectly and
employing them in teaching language. In addition to applying the
findings of various fields as diverse as sociology and physics, applied
linguists themselves design and conduct various types of research
projects in order to find solutions to the problems not addressed by other
fields.
There are, however, many other sources which deal with certain aspects
of education more specifically and thus provide information in a more
refined manner. These sources can also be accessed electronically. One
Although Persian cats and Iranian students share two different attributes,
both form an indefinite population whose members will vary from place
to place and time to time. They include past, present and future Persian
cats and Iranian students who lived, are living now and might live in
various parts of Iran in particular and of the world in general.
Table 5.1
Classification of test takers based on their IQs.
Figure 5.1
Intelligence IQ normal curve (Source: de la Jara n.d.)
If you look at the normal curve in Figure 5.1 once again and focus on
the IQs 116 and 84, you will find 84.134 and 15.866 cumulative
percentages under them, respectively. (We will deal with cumulative
percentages or percentile in chapter 11.) By subtracting these two IQs
from each other, i.e., 84.134 - 15.866, we will get 68.268. This means
that more than 68% of students have an IQ of 84 to 116 if measured by
Stanford-Binet intelligence test. This knowledge will be very necessary
when we select a sample for our research project in which intelligence
plays an important role.
5.3 Sampling
The ultimate purpose of any research project is to explore the
relationship between two or more variables common in a population.
When a sample is chosen, i.e., participants, it is not the sample which is
explored but the population from which the sample is drawn. For this
very reason, whatever the size of the sample, it must be representative of
the population of interest. We have already seen that the selection of
participants is a vital variable in internal validity (see chapter four). If
the participants of a research project are not sampled properly, the
project will not have internal validity. For obtaining a representative
Since grade one high school students have all passed centrally
administered final examinations when they graduated from guidance
schools, the students of a given grade one class will be homogenous in
their background knowledge. This means that the first principle of
random selection, i.e., homogeneity, is taken into account. If a language
teacher chooses the student sitting in the first front row to answer a
question on a topic like simple present tense, will his selection meet the
second principle of randomness?
You know well that highly able and active students usually sit in the first
front row. If a language teacher chooses one of these students and she
answers the question on simple present tense correctly, it does not
necessarily mean that the whole class knows this particular tense simply
because the selected sample does not represent the whole class. In other
words, the teacher has violated the second principle, i.e., the student was
not chosen from a well mixed list.
While writing, mixing and picking a few slips from a hat might be
practical in a small finite population such as the members of a class, it
becomes cumbersome when researchers start collecting the sample from
a relatively large population. A more practical way of simple random
sampling is using tables of random numbers. A table of random numbers
contains an extremely large list of numbers that has no order or pattern.
consist of five digit numbers such as 39876 and 25879. We read the first
three digit numbers from the left, i.e., 398 and 258. In choosing the
numbers we move down the page reading off 3-digit numbers, skipping
those which do not apply. For choosing our sample of 30 students we
may arbitrarily use the second block of Appendix 5.1.
Table 5.2 presents the first two blocks of Appendix 5.1. If we look at the
second column and the first row of the second block, we will find
15405. We read the first three digits, i.e., 154, and choose the student
having this number because there are only 300 individuals in our
accessible population. Look at the second number: It is 21694. We read
the first three digits, i.e., 216. The third and fourth numbers are 49810
and 32196. Since there are no 498 and 321 in the population, we skip
them and go to the fifth number: It is 13678. We read the first three
digits and choose student number 136. The sixth number in the table of
random numbers is 47609 so it does not get selected. We go to the next
number and so on until we select a total of 30 students.
Table 5.2
The first two blocks of the random numbers appearing on page one of
Appendix 5.1
Table 5.3 presents the five strata of school education in Iran in 2004-
2005. As can be seen, the very stratification of school education enables
researchers to realize that 44% of school population, i.e., pre-primary
and primary school students, was not taught English at the specified
period at all. It also specifies the best strata to conduct research projects
on English language proficiency, i.e., pre-university students. This
specification brings down the population of interest from around 15
million to 400 thousand.
Table 5.3
Students enrolled at five strata of school education in Iran in 2004-2005
15
Cluster /'klǎstз/ n. a group of people, animal or objects that are closely packed
together
Table 5.4
Tertiary education students in 2004-2005
Associate Professional
BA/BS MA/MS PhD
Province Diploma Doctorate
F M F M F M F M F M
Associate Professional
BA/BS MA/MS PhD
Province Diploma Doctorate
F M F M F M F M F M
Table 5.5
The number of undergraduate and graduate students majoring in the
specified academic fields offered in Dr. Ali Shariati Faculty of
Literature and Humanities at FUM in 2008
Day Night
No Fields Degree Total
Female Male Female Male
BA 110 38 111 13 272
Arabic Language
01 MA 11 14 3 4 32
and Literature
PhD 0 8 0 0 8
English Language
02 BA 122 48 179 57 406
and Literature
French Language BA 99 16 92 8 215
03
and Literature MA 11 1 6 0 18
04 French: Literary BA 31 5 9 0 45
05 Geography BA 0 0 1 0 1
Geography and BA 112 53 30 18 213
06
Urban Planning MA 2 8 2 10 22
When the research team met to discuss their procedure and pool their
data together, it was found that almost all the members of the team had
targeted the newsstand near Urmia University because it was the closest!
This means that the researchers were treating their samples as if they
were random. In other words, they thought that the buyers visiting this
particular newsstand represent the newspaper buyers who buy them
from the stands located in other parts of the city.
Table 5.6
Random assignment of 135 participants into three groups
Groups
Participants Schema-based Cloze test Traditional cloze
cloze MCIT MCIT
Non-native speakers 14 14 15
Native speakers 31 31 30
both NSs and NNSs and then assigned the first three highest scorers to
the traditional cloze MCIT, cloze test, and schema-based cloze MCIT.
The order of the groups was changed and then the second three highest
scorers were assigned to schema-based cloze MCIT, cloze test and
traditional cloze MCIT. The same procedure was followed until all
participants were assigned to the specified groups.
Few scholars have specified a definite size for samples because their
selection depends on the type of research projects and statistical tests
applied to the analysis of data. For example, while it is possible to
design a one-page questionnaire in social surveys and administer it to as
many participants as possible, finding volunteer participants to take part
in an experimental research over a long period of time would be really
difficult.
represents the population or not. Table 5.6 presents sample sizes ranging
from 10 to 1000. The standard deviations given in the table belong to
Garrett (1938, p. 243) who used normal distribution to specify sample
size.
Table 5.7
Standard deviation range in samples having normal population
In two of her advanced classes, Faravani (2006) brought up the topic and
asked her own students to take part in her research project voluntarily
and out of class. She even offered to teach them for a complete term free
of charge. Unfortunately, her advanced students seemed to be too busy
and therefore declined to participate in her study. Had they accepted to
take part in her study voluntarily, Farvani could have used simple
random selection to assign her advanced students to experimental and
control groups.
5.6 Summary
In applied linguistics and related fields such as translation and education,
research projects are carried out in order to solve perceived problems.
Researchers select appropriate research methods and follow their
principles in order not to solve an isolated problem but the one which is
common to a given target population.
6 Types of Research
6.1 Introduction
In previous chapters, we learned that in teaching language and mastering
it, not only teachers and students but also parents and language policy
makers may face some problems. If these problems are not properly
addressed and solved, they may entail spending time, energy and public
funds on unproductive programs and activities.
What Dickens and Flynn (2006) did, provides us with an example for
basic research. They conducted a research project to see whether there is
any evidence to support the claim that although blacks have had
1. The Wechsler Intelligence Scale for Children (WISC): The test was
called the WISC-R, WISC-III, and WISC-IV in 1972, 1989, and 2002
respectively (Harcourt, 2005)
2. The Wechsler Adult Intelligence Scale (WAIS) called the WAIS-R
and WAIS-III in 1978 and 1995, respectively (Harcourt, 2005).
3. The Armed Forces Qualification Test called the AFQT in1980 and
1997 (Department of Defense, 2005)
4. The Stanford-Binet-4 and the SB-5 administered in 1985 and 2001
(Thorndike, Hagen, & Sattler, 1986, pp. 34-36; Riverside, 2005).
Table 6.1 contains the summary data from the test publishers and the
Department of Defence. As can be seen, the mean IQ and standard
deviation of the white are greater than the black. If the inheritance
perspective of intelligence is accepted as a fact, the data collected over
years should show the same pattern of difference, i.e., the black will not
gain greater IQs on any of these tests even if their environment becomes
better than before.
Table 6.1
IQ means and standard deviations for whites and black (Source: Dickens
& Flynn, 2006, p. 27)
Standard Number of
Mean IQ
Test deviation observations
White Black White Black White Black
4 103.6 90.0 15.37 13.86 3691 711
Stanford-Binet
5 102.9 92.1 13.93 14.47 2070 384
R 102.3 86.4 14.08 12.63 1870 305
WISC III 103.5 88.6 13.86 12.83 1543 337
IV 103.2 91.7 14.52 15.73 1403 343
R 101.4 86.8 14.65 13.14 1664 192
WAIS
III 102.6 89.1 14.81 13.31 1523 247
R 101.2 87.0 14.28 13.54 519 72
WAIS < 25
III 102.6 90.9 14.59 12.31 413 93
80 100.0 82.0 15.00 13.63 5533 2298
AFQT
97 100.0 85.6 15.00 13.23 2880 1191
Figure 6.1 shows the black scores on four tests of cognitive ability on
the next page. Although all the scores are below the white mean score,
i.e. 100, there is consistent gain over years. Based on the analysis of data
from nine standardization samples for four major tests of cognitive
ability, Dickens and Flynn (2006) declared, “blacks have gained 5 or 6
IQ points on non-Hispanic whites between 1972 and 2002. Gains have
been fairly uniform across the entire range of black cognitive ability” (p.
2).
Figure 6.1
Black Scores on Four Tests of Cognitive Ability (Source: Dickens &
Flynn, 2006, p. 29)
Mehrens and Lehman (1991) stated that among various testing methods
such as open-ended questions and essay writing, MCITs are the most
highly regarded types. Although MCITs are the most popular, reliable,
time and cost effective testing methods, they suffer from one major
shortcoming, i.e., a solid basis in item writing theory. The underlying
The disagreements among scholars arise from the fact that evaluation
research cannot be as controlled as basic research. In contrast to basic
research, evaluation is conducted in natural and real-world settings such
as classrooms. Naturally, the number of variables involved in evaluation
increases because of the nature of settings. It does not, however,
preclude applying research principles to evaluation, i.e., validity,
reliability and practicality. Maibodi (2008), for example, conducted a
research project to find out whether text genre affects students’ reading
comprehension ability.
16
/kз'rikyulзm/ n. designs for carrying out a particular language program. Features
include a primary concern with the specification of linguistic and subject-matter
objectives, sequencing, and materials to meet the needs of a designated group of
learners in a defined context. The term “syllabus” is used more customarily in the
United Kingdom to refer to what is referred to as a curriculum in the United States
(Brown, 1994, pp. 159-160).
After teaching groups A and B for nearly 12-14 weeks, and covering
about ten lessons for each group a TOEFL post-test consisting of 30
reference questions and one short narrative text and another expository
text was administered to control the processing time and proficiency
level of the students chosen for the study. Those scorers who fell within
one standard deviation above and below the mean were taken as final
samples for further study, i.e., 30 students for each group.
17
Kurt Lewin (1890-1947) is a German American psychologist who was born in
Mogilno, Prussia (now in Poland). He received his education from the University of
Berlin. Lewin immigrated to the United States in 1932 and taught at Stanford, Cornell,
and Iowa universities. Finally, he became the director of the Research Center for
Group Dynamics at Massachusetts Institute of Technology in 1944. Lewin explored the
problems of motivation of individuals and groups and conducted research projects on
child development and personality characteristics. His work had a major influence on
modern investigations in psychology. Among his books are A Dynamic Theory of
Personality (trans. 1935), The Conceptual Representation and Measurement of
Psychological Forces (1938), and Resolving Social Conflicts (1947) [Microsoft
Encarta, 2006]
Action researchers are thus no one but the language teachers themselves.
They are an essential part of the process in which the action research is
conducted. This very fact produces a paradox in terms of research
principles. Since the teachers themselves act as researchers, their
research projects can help them respond to their students’ needs and
problems. However, as the findings they obtain in their classes are
limited to their local settings, they cannot generalize their findings and
their researches will therefore lack external validity.
Businco, Businco, Lauriello and Tirelli (2004) asked their patients to fill
out a questionnaire before and at the end of the treatment. It contained
two self-rating psychometric tests: one for anxiety and the other for
depression. Anxiety test comprised state anxiety and trait anxiety, each
consisting of 20 multiple choice items on which a score of > 40 is
considered high (Spielberger, Gorsuch, & Lushene, 1983). The test
allows to distinguish between existing anxiety and anxiety as a relatively
stable personality trait. A medium value of > 40 was used to classify
patients as high anxious and low anxious.
Table 6.2 shows the mean values of AAR and disease symptoms. As can
be seen, all patients revealed a significant reduction of NP at nasal
endoscopy, reduction of symptoms and nasal resistance at AAR.
Table 6.2
Mean values of nasal resistance at AAR18 (Pa19/cc20/sec), nasal
endoscopic scores and nasal symptoms, before and after treatment
Nasal Post-nasal
Time AAR Endoscopy Rhinorrhea Sneezing
obstruction drip
To 2.52 3.2 3.8 3.6 2.6 3.6
7 months 1.22 1.6 1.2 1.5 1.2 1.8
Table 6.3 shows the statistical analysis of results obtained on the anxiety
and depression tests. As can be seen, 19 patients (63.3%) showed high
levels of state anxiety, 21 (70%) showed anxiety as a trait and six (20%)
were positive for depression. However, after the medical treatment,
seven patients (23.3%) showed high levels of state anxiety, 8 (26.6%)
showed anxiety as a state and five (16.6%) were positive for depression.
The percentage of patients with high levels of state anxiety was
significantly higher before treatment than after (63.3% vs. 23.3%; with
X2 = 0.14; p = 0.004) equivalent to the percentage of patients with high
levels of anxiety as a trait (70% vs. 26.6%; X2 = 0.10; p = 0.002); there
was, nonetheless, no significant difference in depression before and after
treatment.
Table 6.3
Comparison and statistical analysis of results
N. of patients N. of patients after % of patients Chi- Pearson
before treatment treatment improved Square p
State anxiety 19 7 63.15 0.14 0.004
Trait anxiety 21 8 61.90 0.10 0.002
Depression 6 5 16.6 1.11 0.090
18
Anterior Active Rhinomanometry /rīnōmз'nǒmзtri/: measurement of the air flow and
pressure within the nose during respiration
19
Pa stands for Pascal which is a unit of pressure equal to one newton per square meter
20
cc: cubic centimeter
21
C-tests are a type of cloze language proficiency tests which are usually developed on
some short passages. The second half of every second word from the second sentence
of each passage is deleted in order to be restored by test takers.
Table 6.4
The raw scores of five participants (Ps) on the TOEFL and C-tests
Tests P1 P2 P3 P4 P5
C-Tests 80 75 53 50 50
TOEFL 105 100 90 74 60
As can be seen in Table 6.4, the C-Tests and TOEFL will correlate with
each other because the scores obtained on both tests by the same test
takers show a positive relationship, i.e., whoever has scored high on the
C-tests; s/he has obtained a high score on the TOEFL, too.
Figure 6.1
6
The linee chart of fivve scores obbtained on tw
wo proficieency tests
6.3.3.1 Longitudinal
Longitudinal research projects are usually conducted over a relatively
long period of time by observing the natural behaviour of a certain
person or a small group of persons. One of the areas which can best be
explored by longitudinal research is the simultaneous acquisition of two
languages from birth, or what is generally referred to as bilingual first
language acquisition (BFLA).
6.3.3.2 Cross-Sectional
Although longitudinal studies provide ideal opportunities for researchers
to observe bilingual first language acquisition (BFLA) as it takes place,
they might not always be feasible. The researchers might not have
bilingual partners to observe their children’s BFLA or have access to
similar children under natural conditions.
Table 6.5
Cross-sectional selection of 16 hypothetical bilingual children
Table 6.6
The application of SEM in research projects on assessment
Table 6.7
A variable by case matrix
Variables Cases
Person 1 Person 2 Person 3 Person 4 Person 5
Sex Male male female male female
Age 36 yrs 19yrs 30yrs 55yrs 42yrs
Political Progressive moderate23 Progressive Traditionalist Traditionalist
22
orientation
class working Lower class Upper class
Upper Middle
middle
From D. A. de Vaus (1985) Surveys in Social research. Sydney: Allen & Unwin, p. 4
22
Progressive is a person who favours social, economic or political reforms
23
Moderate is a person who holds no extreme opinion in politics and favours gradual
reforms
For example, in Table 6.6 there is variation across cases in how they
vote. This is systematically linked to variations in class: the progressives
are working class and the traditionalists are middle class. In other words,
survey research seeks an understanding of what causes some
phenomenon (e.g. vote) by looking at variation in that variable across
cases, and looking for other characteristics which are systematically
linked with it. As such, it aims to draw causal inferences (e.g. class
affects vote) by a careful comparison of the various characteristics of
cases. It does not end there. The next step is to ask why class affects
vote. Survey researchers need to be very careful, however, to avoid
mistaken attribution of causal links (simply to demonstrate that two
things go together does not prove a causal link).
6.4 Summary
All research projects require facing a problem, feeling a need to solve
the problem and taking proper steps to solve it. They are conducted in
order to satisfy an internal desire to know, i.e., basic research, to employ
a theory to address an existing problem, i.e., applied research, to decide
which currently practiced approaches yield the best result, i.e.,
evaluation research, or to address a trouble faced in class or at work, i.e.,
action research.
7 Translation Research
7.1 Introduction
24
According to Hadhrami (2009), Muhammad Marmaduke Pickthall was born in
London in 1875 to an Anglican clergyman. He was contemporary of Winston Churchill
at Harrow private school. He travelled extensively in the Arab world and Turkey and in
1917 reverted to Islam and soon became a leader among the emerging group of British
Muslims.
The Quran is, for example, of great importance not only to English
speaking Muslims but also to those who are in contact with Islamic
societies. Its very importance has encouraged many scholars to translate
the Quran not only from Arabic but also from other languages, e.g.,
Latin and French, to English.
Ross (1649)
'Remember thou what is written of Mary, she retired towards the East,
into a place far remote from her Kindred, and took a Vail to cover her,
we sent her our Spirit in form of a man; she was afraid, and said, God
will preserve me from thee, if thou have his fear before thine eyes.
Sale (1880)
'And remember in the book of the Koran the story of Mary; when she
retired from her family to a place towards the east, and took a veil to
conceal herself from them; and we sent our spirit Gabriel unto her, and
he appeared unto her in the shape of a perfect man. She said, I fly for
refuge unto the merciful God, that he may defend me from thee: if thou
fearest him, thou wilt not approach me.
Rodwell (1909)
'And make mention in the Book, of Mary, when she went apart from her
family, eastward, and took a veil to shroud herself from them: and we
sent our spirit to her, and he took before her the form of a perfect man.
She said: "I fly for refuge from thee to the God of Mercy! If thou fearest
Him, begone from me."
Palmer (1880)
'And mention, in the Book, Mary; when she retired from her family into
an eastern place; and she took a veil to screen herself from them; and we
sent unto her our spirit; and he took for her the semblance of a well-
made man. Said she, "Verily, I take refuge in the Merciful One from
thee, if thou art pious."
Pickthall (1930)
16. And make mention of Mary in the Scripture, when she had
withdrawn from her people to a chamber looking East.
17. And had chosen seclusion from them. Then We sent unto her Our
spirit and it assumed for her the likeness of a perfect man.
18. She said: Lo! I seek refuge in the Beneficent One from thee, if thou
are God-fearing.
Bell (1960)
16. Make mention in the Book of Mary; When she withdrew from her
people to a place, eastward.
17. And took between herself and them a curtain. Then We sent to her
Our spirit, who took for her the form of a human being, shapely.
18. She said: "Lo, I take refuge with the Merciful from thee, if thou art
pious."
And mention in the Book (the story of) Maryam, when she withdrew
from her people to a place eastwards, [I6] then she used a barrier to
hide herself from them. Then, We sent to her Our Spirit and he took
before her the form of a perfect human. [I7] She said, "I seek refuge
with the Rahman (All-Merciful) against you if you are God-fearing.
[I8] (Shafi, n.d., p. 34)
As can be seen in the verses translated above, Usmani and his team have
transliterated the prophets’ names according to their Arabic
pronunciation, and not according to their biblical form. For example, the
biblical Mary has been transliterated as Maryam. Similarly, instead of
biblical Abraham, the Qur'anic Ibrāhīm, and instead of Joseph, the
Qur'anic Yūsuf have been kept in English translation. However, for
proper nouns other than those of prophets, like Pharaoh, their English
forms have been retained.
Hatch and Farhady (1982), for example, discussed five methods, namely
pre-experimental, true experimental, quasi-experimental, ex post facto
and factorial methods. Hatch and Lazaraton (1991) left out factorial
design and elaborated on subdesigns of true experimental and quasi-
experimental studies as independent methods. Other methods such as
historical and survey research projects are discussed in textbooks on
educational research (e.g. Fraenkel & Wallen, 1993; Gay, 1990).
I have translated the first three lines of the Song of the Reed (Rumi
2001, Book 1: Lines 1-34) word by word below.
Nickolson (1926), for example, translated the first three lines described
in 7.3.1.1. As you can see, he has kept the structure of Persian lines as
much as possible and used the most common meaning of words such as
CHON ( )ﭼﻮنas how and JODAIHA ( )ﺟﺪاﻳﻴﻬﺎas separations.
Arberry (1961), for example, translated the Song of the Reed (Rumi
2001, Book 1: Lines 1-34) faithfully by adopting one-to-one English
equivalents for original Persian words. He has, however, tried to make
the English translation as natural as possible.
Redhouse (1881), for example, translated the first three lines of the Song
of the Reed (Rumi 2001, Book 1: Lines 1-34) by following a semantic
method. As can be seen, Redhouse has been quite successful in
preserving the music of the original Persian lines. He has translated
BOBRIDEAND ( )ﺑﺒﺮﻳﺪﻩ اﻧﺪtore. Since it does not rhyme with wept as an
appropriate equivalent of NALIDEAND ()ﻧﺎﻟﻴﺪﻩ اﻧﺪ, he has replaced it with
to weeping eyes sore not only to rhyme with tore but also to stay
semantically faithful.
From reed-flute hear what tale it tells; ﺑﺸﻨﻮ از ﻧﯽ ﭼﻮن ﺷﮑﺎﻳﺖ ﻣﯽ ﮐﻨﺪ
What plaint it makes of absence' ills. از ﺟﺪاﻳﻴﻬﺎ ﺣﮑﺎﻳﺖ ﻣﯽ ﮐﻨﺪ
"From jungle-bed since me they tore, ﮐﺰ ﻧﻴﺴﺘﺎن ﺗﺎ ﻣﺮا ﺑﺒﺮﻳﺪﻩ اﻧﺪ
Men's, women's, eyes have wept right در ﻧﻔﻴﺮم ﻣﺮد و زن ﻧﺎﻟﻴﺪﻩ اﻧﺪ
sore. ﺳﻴﻨﻪ ﺧﻮاهﻢ ﺷﺮﺣﻪ ﺷﺮﺣﻪ در ﻓﺮاق
My breast I tear and rend in twain, Rumi, ) ﺗﺎ ﺑﮕﻮﻳﻢ ﺷﺮح درد اﺷﺘﻴﺎق
To give, through sighs, vent to all my (2001
pain.
Hear, how yon reed in sadly pleasing tales ﺑﺸﻨﻮ از ﻧﯽ ﭼﻮن ﺷﮑﺎﻳﺖ ﻣﯽ ﮐﻨﺪ
Departed bliss and present woe bewails! از ﺟﺪاﻳﻴﻬﺎ ﺣﮑﺎﻳﺖ ﻣﯽ ﮐﻨﺪ
'With me, from native banks untimely torn, ﮐﺰ ﻧﻴﺴﺘﺎن ﺗﺎ ﻣﺮا ﺑﺒﺮﻳﺪﻩ اﻧﺪ
Love-warbling youths and soft-ey'd virgins در ﻧﻔﻴﺮم ﻣﺮد و زن ﻧﺎﻟﻴﺪﻩ اﻧﺪ
mourn. ﺳﻴﻨﻪ ﺧﻮاهﻢ ﺷﺮﺣﻪ ﺷﺮﺣﻪ در
O! Let the heart, by fatal absence rent, ﻓﺮاق
Feel what I sing, and bleed when I lament ﺗﺎ ﺑﮕﻮﻳﻢ ﺷﺮح درد اﺷﺘﻴﺎق
(Arberry, 1954) (Rumi, 2001)
7.3.1.5 Adaptation
Adaptation is a free method of translation employed in dramas and
poetry. While the themes, characters, and plots of the SL text are
preserved, the SL culture is converted to the TL culture and the text is
rewritten by an established dramatist or poet (Newmark, 1988).
three lines of the song appears below. They provide a typical example
for adaptation.
Pay heed to the grievances of the reed ﺑﺸﻨﻮ از ﻧﯽ ﭼﻮن ﺷﮑﺎﻳﺖ ﻣﯽ ﮐﻨﺪ
Of what divisive separations breed از ﺟﺪاﻳﻴﻬﺎ ﺣﮑﺎﻳﺖ ﻣﯽ ﮐﻨﺪ
From the reedbed cut away just like a weed ﮐﺰ ﻧﻴﺴﺘﺎن ﺗﺎ ﻣﺮا ﺑﺒﺮﻳﺪﻩ اﻧﺪ
My music people curse, warn and heed در ﻧﻔﻴﺮم ﻣﺮد و زن ﻧﺎﻟﻴﺪﻩ اﻧﺪ
Sliced to pieces my bosom and heart bleed ﺳﻴﻨﻪ ﺧﻮاهﻢ ﺷﺮﺣﻪ ﺷﺮﺣﻪ در
While I tell this tale of desire and need. ﻓﺮاق
ﺗﺎ ﺑﮕﻮﻳﻢ ﺷﺮح درد اﺷﺘﻴﺎق
(Rumi, 2001)
for a number of reasons. I believe the most tangible reason for free
translation might be found in the difficulty a translator finds himself
when he comes across some unintelligible parts in the source text. By
applying this method they interpret the original and present their own
understanding as the author’s.
ﻖ
َ ﻄ ِﻔ
َ ﻲ َﻓ
ﻋَﻠ ﱠ
َ { ُردﱡوهَﺎ32} ب
ِ ﺤﺠَﺎ
ِ ْﺣﺘﱠﻰ َﺗﻮَا َرتْ ﺑِﺎﻟ
َ ﺨﻴْ ِﺮ ﻋَﻦ ِذآْ ِﺮ َرﺑﱢﻲ
َ ْﺣﺐﱠ اﻟ ُ ﺖ ُ َْﻓﻘَﺎ َل ِإﻧﱢﻲ َأﺣْ َﺒﺒ
{33} ق ِ ق وَاﻟْ َﺄﻋْﻨَﺎ
ِ َﻣﺴْﺤًﺎ ﺑِﺎﻟﺴﱡﻮ
Arberry he said, 'Lo, I have loved the love of good things better
(1964) than the remembrance of my Lord, until the sun was
hidden behind the veil. Return them to me!' And he began
to stroke their shanks and necks.
Khan And he said: "Alas! I did love the good (these horses)
(1970) instead of remembering my Lord (in my 'Asr prayer)" till
the time was over, and (the sun) had hidden in the veil (of
night). Then he said "Bring them (horses) back to me."
Then he began to pass his hand over their legs and their
necks (till the end of the display).
Mohmmad So he said, I love the good things on account of the
Ali (1917) remembrance of my Lord -- until they were hidden
behind the veil. (He said): Bring them back to me. So he
began to stroke (their) legs and necks.
Pickthal And he said: Lo! I have preferred the good things (of the
(1930) world) to the remembrance of my Lord; till they were
taken out of sight behind the curtain. (Then he said):
Bring them back to me, and fell to slashing (with his
sword their) legs and necks.
Sarwar he said, "My love of horses for the cause of God has
(1981) made me continue watching them until sunset, thus
making me miss my prayer". He said, "Bring them back
to me." Then he started to rub their legs and necks.
Shakir Then he said: Surely I preferred the good things to the
(1983) remembrance of my Lord-- until the sun set and time for
Asr prayer was over, (he said): Bring them back to me; so
he began to slash (their) legs and necks.
Sherali He said, `I love the love of good things because they
(1955) remind me of my Lord.' And when they were hidden
behind the veil, He said, `Bring them back to me.' Then
he started stroking their legs and their necks.
Yusufali And he said, "Truly do I love the love of good, with a
(1989) view to the glory of my Lord,"- until (the sun) was hidden
in the veil (of night): "Bring them back to me." then
began he to pass his hand over (their) legs and their
necks.
“the actual words, phrases, clauses, sentences, paragraphs, etc.” (p. 3).
Naturally such an approach is too difficult, if not impossible, to follow
because the term “etc” shows that it can include larger units such as
chapters and an entire book. Lagzian (2013) reviewed the literature on
translation and announced that there is virtually no rationale other than
schema theory to explain the process and provide the translators with an
objective measure to evaluate source and target texts.
And he said, "Truly do I love the love ﺨﻴْ ِﺮ ﻋَﻦ ِذآْ ِﺮ َرﺑﱢﻲ
َ ْﺣﺐﱠ اﻟُ ﺖ
ُ َْﻓﻘَﺎ َل ِإﻧﱢﻲ َأﺣْ َﺒﺒ
of good, with a view to the glory of {32} ب ِ ﺤﺠَﺎ
ِ ْﺣﺘﱠﻰ َﺗﻮَا َرتْ ﺑِﺎﻟ َ
my Lord,"- until (the sun) was hidden
in the veil (of night):
"Bring them back to me." then began ق
ِ ﻖ َﻣﺴْﺤًﺎ ﺑِﺎﻟﺴﱡﻮ
َ ﻄ ِﻔ
َ ﻲ َﻓ
ﻋ َﻠ ﱠ
َ ُردﱡوهَﺎ
he to pass his hand over (their) legs {33} ق ِ وَاﻟْ َﺄﻋْﻨَﺎ
and their necks.
The Arabic schema ﺖ ُ ْ َأﺣْ َﺒﺒin Surah 38, verse 32, for example, consists of
three morphs which have to be translated via three distinct schemata in
English, i.e., the free syntactic sensor I, the bound syntactic finite past
and the free semantic process prefer. The identification and
classification of these three schemata will now help us evaluate the
internal validity of the translations.
Table 7.1 presents the eight translation of the Arabic source schema
ﺖ
ُ َْأﺣْ َﺒﺒ. As can be seen, only Shakir’s translation might be considered
schematic because he has neither inserted any parasyntactic schemata
such as do and did as Yusufali (1989) and Khan (1970) have done, nor
chosen the most frequent sense of the semantic schema ﺣﺐ, i.e. love.
Sarwar’s (1981) seems to be the least valid in the sense that he has
changed the verb schema prefer to the noun love and thus depicted the
prophet as a person controlled by his emotions.
Table 7.1
Validity analysis of eight renderings of the source schema ﺖ
ُ َْأﺣْ َﺒﺒ
The translator of the SL texts can assume the role of either a controlled
or moderator variable. If the text is translated by only one translator, he
will become a controlled variable. In the case of having two translators
render the same SL text, one of them will become an independent
variable and the other a moderator.
situations in the real world” (p. 8). It implies that external validity has
relevance only to applied researches. Since translation is basic in nature,
it does not seem to have external validity.
7.5 Reliability
Reliability literally means the quality of being dependable. In
psychometrics the term reliability is always employed to show
consistency. The reliability of a research project can be approached both
internally and externally. A given translation will be internally reliable if
the same translator translates given parts of a source text independently
on different dates. I will call this translate-retranslate reliability. The
translations can then be scored by an experienced translator and the
correlation coefficients obtained on these translations can be used as an
index of reliability. As Nunnally (1978) emphasized research projects
are reliable to “the extent that they are repeatable” (p. 22).
7.6 Feasibility
Research projects other than translation require relatively longer periods
of time. The two academic semesters in Iranian universities start in
Mehr and Bahman and end in Day and Khordad. Each semester lasts for
almost four and a half months of which the first and last months are
usually spent in registration and examination. This leaves only two and a
half months for both teaching and conducting research projects, which is
barely enough to conduct even field studies.
Even if the problems of time and access are solved in one way or
another, the cost of research remains a major obstacle, especially for
undergraduate students. There is no budget available to cover the
expenses incurred during research such as travelling, typing and copying
tests and questionnaires and buying necessary equipments.
7.7 Summary
Translation is an active, cognitive and creative process requiring the
application of scientific method as other types of research do. Scientific
method is systematic in that it follows an “established principle” (Hatch
& Farhady, 1982, p.4). The three principles followed in research
projects are validity, reliability and feasibility. Translation research
projects are valid if the variables involved in their conduction are
identified, controlled and rendered properly. They are also reliable if
independent translators score a sample translation and provide the
researcher with an acceptable reliability coefficient. And they are the
most feasible in Iran at present in that time, access and cost of their
conduction is fairly manageable.
While the very process of translation is personal and calls for the
qualification and attention of translators, the translated texts can be
compared with their original in order to evaluate the validity of their
content. The very act of contrasting two or more types of texts to find
their similarities and dissimilarities calls for establishing another new
type of research projects. Although there is nothing new about this type
of comparative study, the application of schema theory to its design
renders comparative studies more objective in procedure and sound in
their construct. Chapter 8 addresses this issue and offers text-based
research projects as a new contribution to human investigation.
8.1 Introduction
Chapter 7 provides necessary background to approach translation as a
research method which can claim validity, reliability and feasibility if it
is done by employing a scientific method. None of the eight
macrostructural methods employed traditionally in translation research
projects, i.e., adaptation, communicative translation, faithful translation,
free translation, idiomatic translation, literal translation, semantic
translation, and word-for-word translation can claim objectivity based
on scientific observation. This is because they are by their very nature
qualitative. In other words, there is no procedure available to
operationalize these methods so that their functioning can be observed
and quantified.
typically been reduced to helping create and pose hypotheses which can
then be tested and refined using scientific and/or statistical research
methods and models” (p. 524). In order to appreciate quantitative
translation research projects, we need to discuss the qualitative and
quantitative research projects and detail their differences.
Phonemics, on the other hand, does not address the measurable features
of sounds, but addresses conceptual categories which exist within the
mind of sound producers and receivers. The verification of these
categories seemed to be empirically impossible. For example, patients
who suffer from head damages may produce unusual speech patterns
which might not fit the statistical norms of typical pronunciation.
Phonemically, however, other people could still understand what was
being said because the underlying structure of the language exists in the
minds of both the speaker and the listener. The existence of these
structures, however, is not as observable as normally produced
utterances are.
Pike (1954) generalized phonetics and phonemics into etics and emits
which stand for quantitative and qualitative research projects,
respectively. He admitted that emit research projects led to unverifiable
conclusions. However, Pike argued that they help researchers not only
25
An audiometer is an instrument that produces pure tones of various fixed pitches or
frequencies. Patients put on headphone to listen to these pitches in a soundproof
booth to eliminate external noise. They are provided with a press switch to indicate
whether they heard a given tone or not. Hearing is tested one ear at a time.
understand the culture or language in holistic ways, but also explain the
life, attitudes, motives, interests, responses, conflict, and personality of
specific actors. Pike further argued that the etic or scientific research, in
contrast, hinders the ability to deal with these basic considerations
because such phenomena cannot be empirically validated.
26
Debunk /bi'bâŋk/ v. to show that something is wrong or false; expose; deflate;
demystify; discredit; lay bare; set straight; show up; throw light on
27
Rapport /ræ'po/ n. an emotional bond or friendly relationship between people based
on mutual liking, trust, and a sense that they understand and share each other's
concerns
28
Empathy /'empзθi/ n. the power of mentally identifying oneself with (and so fully
comprehending) a person or object of contemplation: Pity is feeling sorry for
someone; empathy is feeling sorry with someone.
Table 8.1
Differences between quantitative and qualitative research projects
29
Logical positivism is a school in philosophy which holds the idea that personal
experiences are the basis of true knowledge if they are verified scientifically (the
Austrian Ludwig Wittgenstein and the British Bertrand Russell and G. E. Moore
belonged to this school).
30
Constructivism has its origin in developmental psychology founded by Jean Piaget
who argued that children play an active role in their own learning. When they
encounter new experiences, they apply their background knowledge to modify it and
construct the new one (see Ross 2006).
(ST) and supplying their best equivalents on the basis of their textual or
discoursal context.
Figure 8.1
Hierarchical relationship of schemata comprising the Quran as a written
text
Surah schemata1 + … + Section schemata114 → The Quranic schemata
↑
Section schemata1 + Section schemata2 + Section schemata3 + … → Surah schemata
↑
Paragraph schemata1 + Paragaph schemata2 + Paragraph schemata3 + … → Section schemata
↑
Sentential schemata1 + Sentential schemata2 + Sentential schemata3 + … → Paragraph Schemata
↑
Clausal schemata1 + Clausal schemata2 + Clausal schemata3 + … → Sentential schemata
↑
The + all-merciful + has (Syn.) + taught (Sem.) + the (Syn.) + Quran (parasyntactic) + …→
Clausal schemata
↑
The (Syn.) + all (Syn.)- merci (semantic [Sem.] + ful (bound Syn. morph) + …→ Phrasal
schemata
↑
The (free syntactic morph [Syn.]) + …→ Single schema
Table 8.2
The chi-square test of English equivalents provided for the Persian
semantic schema ARZYABEE
Observed Expected
Equivalent Schemata Residual Test
frequency frequency
Assessment/assessing 4 5.5 -1.5 x2 = 11.8
Estimating 5 5.5 -0.5 df = 3
Evaluation/evaluating 12 5.5 6.6 p = 0.008
Studying 1 5.5 -4.5
Total 22
Source: Khodadady (2001, p. 116)
The statistical analysis of source and target texts can be achieved if their
constituting schemata are classified hierarchically to reflect the structure
of human mind. There are three broad domains to which all schemata
can be assigned: Semantic, syntactic and parasyntactic. These domains
are further broken down into genera, species, types and tokens as
discussed below.
For example, the semantic domain schemata consist of four genera, i.e.,
adjectives, adverbs, nouns and verbs. The adjective genus of semantic
domain in turn comprises the species of agentive adjective, agentive
complex adjective, comparative adjective, complex adjective, dative
adjective, complex dative adjective, derivational adjective, derivational
complex adjective, nominal adjective, simple adjective, and superlative
adjective. Similarly, the agentive adjective species consist of a
potentially indefinite types such as interesting, fascinating and
intriguing. If the adjective schema type interesting is used once in a
given text, it will have a token of 1. This hierarchical classification of
schemata will be elaborated and employed in the remaining sections and
chapters in order to make the arguments coherent.
The translated texts of Arberry (1964) and Irving (1985) are giving
below. Based on the stated goals of both translators, it is hypothesized
that if rhyming and being contemporary are reflected in the semantic,
syntactic and parayntactic schemata of the translated texts, they must
differ from each other significantly in terms of schema domains, types
and tokens.
1:1 In the Name of God, the 1:1 In the name of God, the
Merciful, the Compassionate Mercy-giving, the Merciful!
1:2 Praise belongs to God, the Lord 1:2 Praise be to God, Lord of the
of all Being, Universe,
1:3 the All-merciful, the All- 1:3 the Mercygiving, the
compassionate, Merciful!
1:4 the Master of the Day of 1:4 Ruler on the Day for
Doom. Repayment!
1:5 Thee only we serve; to Thee 1:5 You do we worship and You
alone we pray for succour. do we call on for help.
1:6 Guide us in the straight path, 1:6 Guide us along the Straight
1:7 the path of those whom Thou Road,
hast blessed, not of those 1:7 the road of those whom You
against whom Thou art have favored, with whom You
wrathful, nor of those who are are not angry,
astray. (Arberry 1964) nor who are lost! (Irving,
1985).
For example, Arberry (1964) and Irving (1985) used 42 and 41schemata
to translate the whole Surah 1 of the Quran, respectively. Table 8.3
presents the semantic schema types used in their translations. As can be
seen, out of 42 schemata employed by Arberry, 23 (54.8%) belong to
semantic domain, implying that the Surah is semantically more loaded.
Furthermore, none of the semantic schemata of the surah has a
frequency or token of more than 2, indicating that it calls for its readers’
complete attention in terms of what it purports to convey.
Table 8.3
Semantic schema types and tokens comprising the first surah of two
contemporary translations of the Quran
َ rabic
A Arberry Irving Schema Token (f)
No
Schema (1964) (1985) Type Arberry Irving
1 ﱠرﺣِﻴ ِﻢ All-compassionate Merciful Adjective 1 2
Compassionate 1
2 ِ ﱠرﺣْﻤـAll-merciful
ﻦ Mercygiving Adjective 1 2
Merciful 1
3 ﻦ
َ ﺿﱠﺎﻟﱢﻴ Astray Lost Adjective 1 1
َ rabic
A Arberry Irving Schema Token (f)
No
Schema (1964) (1985) Type Arberry Irving
4 ﻣُﺴ َﺘﻘِﻴ َﻢ Straight Straight Adjective 1 1
5 ب
ِ ﻣَﻐﻀُﻮ Wrathful Angry Adjective 1 1
6 ﻦ
َ ﻋَﺎ َﻟﻤِﻴ Being Universe Noun 1 1
7 َﻳﻮْ ِم Day Day Noun 1 1
8 ﻦ
ِ دﱢﻳ Doom Repayment Noun 1 1
9 ﷲ
ِ ا God God Noun 2 2
10 ب
َر ﱢ Lord Lord Noun 1 1
11 ﻚ
ِ ﻣَﺎ ِﻟ Master Ruler Noun 1 1
12 اﺳﻢ Name Name Noun 1 1
13 ط
َ ﺻﺮَا ﱢ Path Road Noun 2 2
14 ﺣﻤْ ُﺪ َ Praise Praise Noun 1 1
15 ﻋْ ُﺒ ُﺪ Serve Worship Noun 1 1
16 ﺳ َﺘﻌِﻴﻦ Succour Help Noun 1 1
17 Are/art (stray) Are Verb 2 2
18 ل Belongs Be to verb 1 1
19 ﺖ
َ أَﻧﻌَﻤ Blessed Favored Verb 1 1
20 اه ِﺪ Guide Guide Verb 1 1
21 ﺳﺌﻞ Pray Call on Verb 1 1
As a verb type of semantic domain, are and be differ from other verb
schemata such as belong. Are and be are simple in their structure
whereas belong is a complex verb in that it is derived from the bound
verb morph be and free adjective morph long. Intuitively, one may argue
that the more complex verb schemata there are in a translated text, the
more difficult it would be in terms of its readability level. In order to
explore whether such an argument holds true, semantic schema types are
further broken into semantic schema tokens.
Table 8.4 presents semantic schema types and tokens. As we can see,
there are four semantic types and 42 semantic tokens in English. The
tabulation and codification of semantic tokens allows us study all
translated texts objectively. Instead of working with the tokens
themselves, we can use their codes as categorical psychometric variables
and apply statistical tests to explore hypotheses.
Table 8.4
Semantic schema species and their example types
Species SC Type
Agentive Adjective 1110 Interesting, fascinating
Agentive Complex Adjective 1111 Flesh-eating, fine-looking
Comparative Adjective 1120 Better; worse, longer
Complex Adjective 1130 antiwar;
Dative Adjective 1140 Interested; devoted,
Complex Dative Adjective 1141 Research-based
Derivational Adjective 1150 Distinctive; gracious, functional,
merciful, compassionate
Derivational Complex Adjective 1151 Nonfunctional, sociopolitical, All-
merciful;
Nominal Adjective 1160 Iraqi, Swedish
Simple Adjective 1170 Good; straight; due
Superlative Adjective 1180 Best, cleverest
Comparative Adverb 1210 Faster; better; more; less
Species SC Type
Derivational Adverb 1220 Quickly, remarkably, across
Simple Adverb 1230 Fast; far; long; well
Superlative Adverb 1240 Fastest; latest
Adjectival Noun 1310 Warmth; ability
Complex Noun 1320 Uprise, greatcoat, background,
greenback, aftereffects, breakthrough
Compound Noun 1330 Notebook; bathrooms
Compound Complex Noun 1331 Slide-and-lantern;
Conversion Noun 1332 "Little and often" is a good
expression; ups; why, how, what
(understanding the how, what and why
of assessment)
Derivational Noun (Simple) 1340 Arrival, student
Gerund Noun 1350 Swimming, Reading,
Gerund Noun (Complex) 1351 Understanding; notemaking
Nominal Noun 1370 Iranian; British,
Simple Noun 1380 Book; heaven;
Complex Verb (Base) 1411 Underlie; undertake
Complex Verb (Third Person) 1412 Underlies; undertakes
Complex Verb (Past participle) 1413 Underlined; undertaken
Complex Verb (Present participle) 1414 Underlying; undertaking
Complex Verb (Simple Past) 1415 Underlined; undertook
Derivational Verb (Base) 1421 Realize; darken; enlarge
Derivational Verb (Third Person) 1422 Realizes; darkens; enlarges
Derivational Verb (Past 1423 Realized; darkened; enlarged
Participle)
Derivational Verb (Present 1424 Realizing; darkening; enlarging
participle)
Derivational Verb (Simple Past) 1425 Realized; darkened; enlarged
Phrasal Verb (Base) 1431 Give up; look down;
Phrasal Verb (Third Person) 1432 Gives up; looked down;
Species SC Type
Phrasal Verb (Past Participle) 1433 Given up; looked down;
Phrasal Verb (Present Participle) 1434 Giving up; looking down;
Phrasal Verb (Simple Past) 1435 Gave up; looked down;
Simple Verb (Base) 1441 Go; see; take, do, walk
Simple Verb (Third Person) 1442 Goes; sees; takes, does, walks
Simple Verb (Past Participle) 1443 Gone; seen; taken, done, walked
Simple Verb (Present participle) 1444 Going; seeing, taking, doing, walking
Simple Verb (Simple Past) 1445 Went; saw; took, did, walked
(Slang) Verb 1446 Kick the bucket
As can be seen in Table 8.4, all species codes (SC) consist of four digits,
i.e., 0000. From the left to the right, the first digit on the left shows a
schema’s domain, the middle digit represents its genera and the two
digits on the right stands for their types. The code 1110, for example
shows that the schema it represents is an agentive adjective in semantic
domain. Similarly, the code 1411 is a complex verb such as underlie
which consists of the prepositional adverb under and the simple verb lie.
Table 8.5
Syntactic schemata comprising the first surah of two contemporary
translations of the Quran
If we compare the results presented in tables 8.3 and 8.5, we realize that
in contrast to 23 semantic schemata (54.8%), Arberry (1964) used only
16 syntactic schemata ((38.1%) in the translation of Surah one.
Although these schemata are fewer in number, their types such as the
determiner the enjoy the highest frequency in the entire translation, i.e., f
Table 8.6
Syntactic schema species and their types
Species SC Type
Conjunction (Phrasal) 2110 as well as; so that, such as
Conjunction (Simple) 2120 But, or, as, while, when, since,
Demonstrative Determiner 2210 This, that, these, those, such; both
Interrogative Determiner 2220 What (season), which (place)
Numeral Determiner 2230 Two, ten
Possessive Determiner 2240 My, your, her, his, its, our, your, their
Ranking Determiner 2260 First, second, twelfth
Specifying Determiner 2270 A, an, the
Complex Preposition 2310 Across; around; toward; between;
beyond
Compound Preposition 2320 Upon, into, within; without;
throughout;
Phrasal Preposition 2330 In spite of, because of, according to
Simple Preposition 2340 Up, on, at; of; than, for
Demonstrative Pronoun 2410 This, that, here, there, both, own, same
Emphatic Pronoun 2420 Myself; yourself
Interrogative Pronoun 2430 Who, where
Object Pronoun 2440 Me, you, him, her, it, us, them
Possessive Pronoun 2441 Mine, yours, theirs
Reflexive Pronoun 2450 Myself; himself
Relative Pronoun 2460 Who, where, when, such that
Subject Pronoun 2470 I, you, he, she, it, we, you, they, there,
it (expletive)
Unspecified Pronoun 2480 One, some, few, many, much, several;
others; something; another, plenty
Specified Pronoun 2481 fourth; half; each other;
Conditional Auxiliary 2510 Had (he known the answer, he would
have come)
Past Auxiliary 2511 Was, were, had, did
Species SC Type
Past Perfect Auxiliary 2512 Had been
Present Auxiliary 2521 Am, are, is, have, do,
Present Perfect Auxiliary 2522 Has been, have been
Present Perfect Continuous Auxiliary 2523 Has been being, have been being
Past Model Auxiliary 2531 Might be, should be, could be, would
be
Past Perfect Model Auxiliary 2532 Might have (been), should have, could
have, would have (been)
Present Model Auxiliary 2541 Will be, can be, may be
Present Perfect Model Auxiliary 2542 Will have, may have, shall have, can
have,
Future Perfect Continuous Auxiliary 2543 Will have been; shall have been
Past Phrasal Auxiliary 2551 Was/were going to, was/were to, had
to, ought to
Past Perfect Phrasal Auxiliary 2552 Ought to have
Present Phrasal Auxiliary 2561 Am/are/is to, has/have to, ought to,
am/are/is going to
Model (Present) 2570 Can, may, shall
Model (Past) 2580 Could, might, should
Syntactic schemata are limited in their types because they depend on and
attach to the semantic schemata comprising the texts in order to
constrain them within the variables of place and time. The auxiliary
schema hast, for example, was used in verse 2, i.e., the path of those
whom Thou hast blessed…, to limit the path only to those blessed by the
Almighty Allah within the constraint of the present perfect tense.
Table 8.7 presents the parasyntactic schemata comprising the first surah
of two contemporary translations of the Quran. As can be seen, few
parasyntactic schemata have been employed by both Arberry (1964) and
Irving (1985). This feature might be an intriguing area of research in text
analysis. It seems that religious texts differ from scientific texts in terms
of their constituting parasyntactic schemata.
Table 8.7
Parasyntactic schemata comprising the first surah of two contemporary
translations of the Quran
Table 8.8 presents parasyntactic schema species and their types. As can
be seen, there are 27 parasyntactic tokens in English among which
adverbs claim for 11 tokens.
Table 8.8
Parayntactic schema species and their example types
Species SC Types
Abbreviations 3110 Adj., Dec., et al, L1; L2; they're; I've;
i.e., e.g.,
Acronyms 3120 NATO, NASA, Scuba, radar
Interjection 3210 Ah, ooh, why! Please,
Name (Full) 3310 Douglas Brown, Ernest Hemingway,
United States
Name (Labeling) 3320 Natural Approach
Name (Organizational) 3330 National Security Council
Name (Single) 3340 Brown, Mary, Iran, America
Name (Titles) 3350 Professor; Sir; Madam
Numeral (Alphabetic ) 3410 A, b, twenty-five, in time,
Numeral (Digital) 3420 0, 1, 20, 3.1,
Numeral (Roman ) 3430 II, xi,
Numeral (Year) 3440 1998,
Para-adverbs (Additive) 3511 Also, too, furthermore
Para-adverbs (Contrasting) 3512 However, nonetheless, nevertheless;
perhaps; regardless; instead
Para-adverbs (Emphatic) 3513 Of course; certainly; on the whole;
indeed, even; all all (in any way)
Para-adverbs (Frequency) 3514 Always, never, ever, again
Para-adverbs (Intensifying) 3515 Very, dead, only, merely; just; most;
more; so; in fact; at least, as…as,
how
Para-adverbs (Interrogative) 3516 How (high is the road?), why, when
Para-adverbs (Manner) 3517 Together; how (to);
Para-adverbs (Negation/Approval) 3518 Not, Yes, No,
Para-adverbs (Prepositional) 3519 Up, upstairs, down,
Para-adverbs (Referential) 3520 Thus, hence, so; first; then; as such;
that is; here; there
Para-adverbs (Time) 3521 Now, ago, tomorrow, yesterday,
since, still; once; already
Para-adverbs (Exemplifying) 3522 For example,
Species SC Type
Particle (Complex) 3610 in order to
Particle (Simple) 3611 to (in the need to)
Symbol (Conventional) 3710 $, &,
Symbol (Scientific) 3720 ×, <,
8.5 Summary
Both original and translated texts provide translation researchers with
necessary data to conduct a quantitative research project provided that
their constituting elements are theoretically defined in advance and a
consistent method developed to operationalize the definitions. Not only
does schema theory define translation as a linguistic and cognitive
process through which translators choose the best target equivalents for
source schemata, i.e., words comprising source phrases, clauses,
paragraphs and texts, but also it explains whether two or more translated
texts differ significantly from each other in terms of their domains, types
and tokens. In this chapter we familiarized ourselves with the domains,
types and tokens involved in the translation of source texts into English.
These are by their very nature categories into which all schemata
comprising translated texts can be schematically assigned. In chapter 9
we will learn how these categories can be treated as categorical variables
to study translated texts as objectively as possible.
9.1 Introduction
Any committed attempt to solve problems poses itself as a research
project which can be conducted qualitatively and quantitatively. While
the former calls for researchers’ personal interpretation of the problems
as their solution, the latter encourages them to review the relevant
literature, formulate hypotheses, adopt an appropriate research method
and stay aloof as much as possible and let the data be collected before
they offer any solutions.
The hypotheses above rest on the fact that Irving was a linguist by
profession and served as a professor of Spanish and Arabic at the
University of Minnesota for some time (Khan, 2000). Similarly, Arberry
was a renowned Orientalist and Professor of Arabic at the Universities
of London and Cambridge (Kidwai, 1987).
In order to treat each and all schemata comprising texts three times as
three different categorical variables, we have no choice but to use
computers. The first step in this process will be typing the text as a word
document if its electronic version is not available. Some students scan
printed pages to bypass the process of typing the texts. Unfortunately,
this does not work because when a given text is scanned, its photo is
taken. For categorizing schemata we need to work with the words
themselves, not their pictures. We will go through the process of
categorizing schemata step by step so that our project will be as
objective and free of error as possible.
16. Do not use automatic numbering. Number the items manually. For
example, the following two items are automatically numbered. They
must have been typed manually as they are done in this file.
1. Classroom instruction was conducted exclusively in the target
language.
2. Only everyday vocabulary and sentences were taught.
As you conduct your own research, you will realize how important it is
to number the items manually. When the entire text is broken into its
constituting schemata, whatever comes after the automatic numbers will
be numbered automatically!
17. Pay close attention to the words whose spellings are very similar.
For example, “It offered innovations at the level of teaching
procedures but lacked a through methodological basis.” Through
should have been typed thorough.
18. Pay attention to morphs forming different tenses. For example,
“Once basic proficiency was establish, …” should have been typed
“Once basic proficiency was established, …”
19. Leave one space between symbols and words that follow them. For
example, “Kelly 1969:53” should be corrected as “Kelly 1969: 53”
(There must be a space between: and 53.)
If we follow the 19 steps described above, we will have the word file
QS001EA.doc whose text appears in Figure 9.2. As can be seen, the
verses consist of sentences and clauses. We need to break the sentences
and clauses of the original typed text into its constituting single and
phrasal schemata. (Note that there are no phrasal schemata such as give
up. If there were any, they must have been put together in one row.) We
do not need to do anything manually because the word itself will break
down the text if we apply the relevant command to the text.
Figure 9.2
Original typed text in word
The command which breaks texts down into their constituting schemata
is called Find and Replace. In order to activate this command, you need
to click on the buttons Ctrl and F together. Upon clicking on these
buttons, the Find and Replace dialogue box will appear on the screen.
We need to click the space bar once in the box appearing in front of
Find what. Figure 9.3 shows the process.
Figure 9.3
Find function activated
After clicking once on the space bar, we have to click on the Replace
button on the top to activate another box called Replace with and type
^p in its front box as shown Figure 9.4.
Figure 9.4
Replace function activated
Upon typing ^p in Replace with box, we must click on Replace All, the
word will change the whole document into a single column of schemata
and a message will appear reading, “Word has completed search of the
document and has made 63 replacements”. If you click on OK, the
message will disappear and you will have the column of schemata in
front of you as shown on the right. Figure 9.5 will appear for our further
processing.
Figure 9.5
Successful functioning of Replace All command
Upon having the column of schemata on our screen, we have to take two
steps. First, if there are any empty spaces between the two rows of
schemata, delete them. Secondly, if there are any punctuation marks
such as single or double quotation marks and comma at the beginning or
end of any schemata, delete them. The deletion of punctuation marks is
very important because they affect the way schemata are automatically
sorted in an Excel file.
We need to take the last step before we start categorizing the schemata
in an Excel file. Click on Ctrl + A to select and highlight the schemata.
Then click on Insert menu in Word 2007. As soon as you click on
Insert a table with appear with a downward arrow. Upon clicking on the
arrow, a dialogue box will appear as shown in Table 9.7.
Figure 9.9
Schemata column converted to a table
You can insert extra sheets or delete them through certain steps to which
we will turn in due course. For the present, you need to open an Excel
file and give it an appropriate name.
If you remember we created two word files and named them QS001EA
and QS001EI to break down Arberry and Irving’s translations. We can
create and give the same names to two Excel files. (We will focus on
working with the first file hoping that the same procedures will be
followed for establishing and working with the second Excel file.
Similarly, we can employ the name TYC for Column C to stand for the
type of schema. If you remember, as a domain, semantic schema
consisted of adjectives, nouns and verbs as its types.
And finally, we use the name TOC for Column D to stand for the
schema tokens. As a semantic type, for example, adjectives were divided
into agentive, comparative, complex, dative, derivational, nominal,
simple and superlative tokens.
Figure 9.11
Schema and its codes as variables
Figure 9.12 presents the codes given to schema tokens as the fourth
variable, i.e., TOC. If you remember, the three digits used in the codes
were systematically chosen in that each digit represents a certain level of
schema classification. The digits on the right, in the middle and on the
left show the domain, type and token, respectively. The schema in, for
example, belongs to syntactic domain, i.e., 2, as a preposition type, 3
which has a simple structure as its token, i.e., 4. The systematic nature
of token codes helps internalize the codes table and employ our memory
instead of the table after a short period of time. The TOC codes can also
be employed to specify the codes required for schema domains (DOC)
and types (TYC).
Figure 9.12
Codification of schema tokens
Figure 9.13
Coded schema domains, types and tokens
sheet will help us find out how many times a given schema type is
repeated. To achieve this, we need to add a variable called frequency.
Figure 9.17
Maximized ribbon
simple verb go. Therefore, if we have a text in which each of these four
forms has been used only once, we will sort them all as one single
schema token and write 4 in the column specifying their frequency.
The schemata act, active, actively and action will, however, all be
considered different schema types because they differ from each other in
being verb, adjective, adverb and noun, respectively. Similarly, if the
schema act appears two times in text, once as a verb and once as a noun,
they will be considered two different schema tokens and coded
differently.
Table 9.1
Comparing the schemata used in the translation of Surah 1 with those of
modern political texts
Figure 9.21 shows SPSS Data Editor file which we have named
QS001EA. As you we can see, there are two icons at the bottom of the
file: Data View and Variable View. As their names show, if we click
on Data View it will activate an SPSS sheet on which we can enter our
data. If we click on Variable View, it will activate the sheet on which
we can specify the names, types, values and features of our variables.
Let us remember filling out these sheets would be the most important
phase in our analysis of the data. The validity of our results will directly
depend on how exactly we fill them out.
As can be seen in Figure 9.21, the Variable View of our SPSS Data
Editor file is activated. The first variable in our analysis is the schemata
used in the text. We, therefore, click in case 1 under Name column and
type schemata. The name of our second variable is DOC (domain code).
So, we click in case 2 in the Name column and type doc. Similarly, we
click in cases 3, 4 and 5 in the Name column and type tyc, i.e., type
code, toc, i.e., token code, and freq, i.e., frequency, respectively.
Figure 9.21
SPSS Data Editor
Figure 9.22 presents SPSS variable types. As we can see, there are eight
types of variable in SPSS. Since our schema variable consist of letters,
we click in the circle called String. As soon as we click in the String
circle, another box called Characters appears. Each characters stands
for one letter. We have chosen 20 because none of the schemata used in
the translation has more than 20 letters. The schema All-compassionate,
for example, consists of 16 characters, including the hyphen.
As we can see in Figure 9.23, for numeric variable types two other icons
called Width and Decimal Places become active. SPSS automatically
chooses 8 for width and 0 for these icons. We do not change them
because our codes are whole numbers and we do not need any decimals.
These two features of variables appear as the third and fourth columns
of the Variable View sheet. If we look at Figure 9.23, we can see 8 in
cases 2, 3, 4, and 5 of the column Width.
The column Label is designed so that we can write the full name of the
variables as we wish them to appear in the table in which we will
present our research results. We can therefore, write the complete names
of the variables in the corresponding cells in the Label column. For
example, for the variables doc, tyc, toc, freq we type Domain, Type,
Token and Frequency as shown in Figure 9.24.
Figure 9.24
Variable Labels
The sixth column of the Variable View sheet deals specifically with
categorical and ordinal variables (see 2.2.1 and 2.2.2 in chapter 2). In
this column called Values, we assign a numeric value to each category
or rank and then specify its label. It is important to remember that string
variables do not have any values (See 2.2.5 in chapter 2). In order to
assign values to our categorical and ordinal variables we need to click in
the relevant cells of the column Values. As soon as we click in one of its
cells, the ellipsis icon (…) on its right becomes active. Upon clicking on
this icon a dialogue box called Value Labels appears.
Figure 9.25 presents the Value Labels dialogue box activated for the
schema domain variable. If you remember, for the variable doc, we
assigned each schema to one of the three domains: semantic, syntactic
and parasyntactic. In the Value box we must type 1 and then go to
Value Label and type Semantic and then click on Add icon. The phrase
1 = “Semantic” will appear in the third big box. We need to go back to
Value box and type 2 and write Syntactic as its label. The phrase 2 =
“Syntactic” will appear in the third box. We follow the same procedure
for the third value and the phrase 3 = Parasyntactic will appear in the big
box. As soon as we have finish specifying the value and label of
categories, we have to click on OK to ensure that the software has saved
them.
Figure 9.25
Activated Value Labels dialogue box
Since there are two other categorical variables in our file, i.e., tyc and
toc. We should follow the same exact procedure described above for
both. Figure 9.26 presents the values and labels given to schema types.
There are sixteen schema types in English: four for semantic, five for
syntactic and seven for parasyntactic. In chapter 8 we gave the following
codes or values to these types: 11 = Adjective, 12 = Adverb, 13 = Noun,
14 = Verb, 21 = Conjunction, 22 = Determiner, 23 = Preposition, 24 =
Pronoun, 25= Verb, 31 = Abbreviation, 32 = Adverb, 33 = Interjection,
34 = Name, 35 = Numeral, 36 = Particle, 37 = Symbol. We must type
and add these values and labels in the Value Labels dialogue box and
click on OK to save them.
Figure 9.26
Specified Value labels for schema type variable
Figure 9.27
Active Data View sheet of the SPSS file
Figure 9.27 above presents the active Data View sheet of the SPSS file
QS001EA and how it looks after we have specified the required features
Figure 9.28
Completed Data View sheet
9.2.5 Copying Data from Different Excel Files and Pasting them in
the Same SPSS File
At the beginning of this chapter we decided to compare Arberry’s
(1964) translation of Surah 1 of the Quran with that of Irving (1985). To
achieve this objective we need to follow the procedures described in
sections 9.2.1, 9.2.2 and 9.2.3 in order to establish an Excel file for the
schemata employed by Irving. We can name the Excel file QS001EI.
another variable to show that the data have been collected from two
translators.
In order to add a new variable to our SPSS file, we need to click and
activate our Variable View sheet of the QS001EA, click in cell 1 under
Name, i.e., schema, go to Data menu and activate its dialogue box as
shown in Figure 9.29.
in Value box and Arberry in Value Label box and then click Add. Then
we need to go back to Value box again and type 2, fill the Value Label
box with Irving and click Add. We should not forget to click on OK
icon otherwise whatever we have typed will be lost.
Table 9.32 above presents the Data View sheet with the new variable.
Since whatever data we have pasted in the file so far belongs to Arberry
whose value is 1, we type 1 in the cells under the variable trans. As we
type 1 and go down the file we will notice that Arberry has used 42
schema token.
We can now copy the data related to our five variables form the Excel
file QS001EI, click in cell 43 under the second column schema and
paste the data. Figure 9.33 presents the pasted data from the file
QS001EI. As we can see, 2 is typed in the first column as a value given
to Irving’s schemata.
Figure 9.33
Adding the data related to the second value of the new variable
Figure 9.36
Activating expected frequency and percentage
In order to tell the software what test to apply to the data, we need to
click on Statistics icon at the bottom of the Crosstabs dialogue box.
Upon clicking on Statistics icon, another dialogue box will appear as
shown in Figure 9.37. First, we will check Chi-Square box so that we
can decide whether there is a significant difference between translators
SPSS automatically produces three tables among which two are very
important and we must copy and paste them in our report. One of them
displays variable by variable crosstabulation as shown in Table 9.2. The
first row of numbers is the observed frequency so we can replace the
word count with observed to help the readers have a clear picture in their
mind.
Table 9.2
Translators by Schema Domain Crosstabulation
Domain
Translators Count Total
Semantic Syntactic Parasyntactic
Arberry Observed 23 16 3 42
Expected 22.3 17.2 2.5 42.0
% within translators 51.2% 43.9% 4.9% 100.0%
Irving Observed 21 18 2 41
Expected 21.7 16.8 2.5 41.0
% within translators 51.2% 43.9% 4.9% 100.0%
Total Observed 44 34 5 83
Expected 44.0 34.0 5.0 83.0
% within translators 53.0% 41.0% 6.0% 100.0%
Figure 9.38
Bar chart of schema domains employed by two translators
Figure 9.38 presents the information given in Table 9.2 as a bar chart.
As we can see, while Arberry (1964) used 23 semantic schemata in his
translation of Surah 1, Irving (1985) employed 21. The fewest number
of schemata employed by both translators were in parasyntactic domain.
Another outstanding feature of the two translations is that the percentage
of syntactic schemata used by Arberry, i.e., 38%, and Irving, i.e., 44%,
is much higher than that of political texts, i.e., 6%. (See Table 9.1)
Table 9.3
Chi-square test of schema domains
Asymp. Sig.
Tests Value df
(2-sided)
Pearson Chi-Square .397(a) 2 .820
Likelihood Ratio .398 2 .820
Linear-by-Linear Association .009 1 .924
N of Valid Cases 83
a 2 cells (33.3%) have expected count less than 5. The minimum expected count is 2.47.
Table 9.4
Chi-square test of schema types
Asymp. Sig.
Tests Value df
(2-sided)
Pearson Chi-Square 3.999(a) 9 .911
Likelihood Ratio 4.110 9 .904
Linear-by-Linear Association .000 1 .983
N of Valid Cases 83
a 11 cells (55.0%) have expected count less than 5. The minimum expected count is .99.
Table 9.5
Chi-square test of schema tokens
Asymp. Sig.
Tests Value df
(2-sided)
Pearson Chi-Square 14.034(a) 22 .900
Likelihood Ratio 18.358 22 .685
Linear-by-Linear Association .000 1 .998
N of Valid Cases 83
a 43 cells (93.5%) have expected count less than 5. The minimum expected count is .49.
9.5 Summary
Research projects sometime require collecting and analyzing categorical
variables such as translators and the schemata they use in their
10.1 Introduction
The creation of man by the Almighty Allah has a direct bearing on
research because He employed an instrument to prove man’s superiority
over angels. According to the Quran (chapter 2: 31-33), God told the
angels that he was going to create and set man as his viceroy31 on the
earth. They objected to his creation and said that why He would create a
creature who would corrupt the earth and shed blood, implying that they
deserved more to be his viceroy than man. In response to their criticism,
God said that he would take a test so that they would know why he had
chosen man.
The test which the Almighty Allah administered to both man and angels
consisted of nouns. In chapter 2 section 2.2.5 we talked about string
variables. We noticed that they are simply a set of characters or letters
such as bus. The string of letters which form nouns or other types of
semantic, syntactic and parasyntactic schemata are by themselves
meaningless unless they are associated with real events, objects, and
people in the mind.
When a child, for example, sees a bus for the first time in her life in an
English speaking environment and looks at her mum to tell her what it is
and she says, “bus,” she immediately forms a concept in her mind by
associating the noun bus with the object of bus as it exists in reality.
Figure 10.1 presents the association. If this concept gets established in
her mind and results in saying bus by seeing the vehicle on the street or
31
Viceroy /'vīsroy/ n. the governor who represents God on the earth
restoring its picture upon hearing the verbal noun bus, it becomes a
schema for that particular child.
Figure 10.1
Association of the noun bus with the real vehicle or its picture in human
mind
1. Attitudes are both cognitive and affective because we not only think
about something in our mind but also attach feelings and emotions to
it.
2. Attitudes are dimensional for they vary in degree. They are not
therefore categorical or bipolar to be either/or
3. Attitudes predispose learners to action in a certain way, their
relationship with actions is not a strong one.
4. Attitudes are not genetic but learned
5. Although attitudes persist, they can be modified by experience or
teaching.
seen, these two students differ from each other as regards the five beliefs
shown in the table. For example, while the graduate student agrees that
some people are born with a special ability which helps them learn
English, the undergraduate disagrees.
Table 10.1
A graduate and undergraduate student’s agreement with five beliefs
Table 10.2 presents the values assigned to degrees of agreement with the
five beliefs expressed by graduate (G) and undergraduate (U) students.
We can now employ three different types of formula developed to study
the relationship between two ordinal variables: Spearman Correlation
Coefficient (ρ), Pearson Correlation Coefficient (r) and Kendall’s Tau
Correlation Coefficient (τ).
Table 10.2
Calculation of Spearman Rank-Order Correlation Coefficient (ρ)
Beliefs G U G-U d d2
1 1 2 1-2 -1 1
2 2 4 2-4 -2 4
3 3 2 3-2 1 1 ρ = 1 – [6 × 11/ 5 (25-1) =
4 2 3 2-3 -1 1 1 – (66/120) =
5 3 5 3-5 -2 4 1 - 0.55 = 0.45
2
Σ d = 11
Calculating the ρ even for five beliefs takes time. It can be much faster if
we enter the data on the SPSS. Since we are exploring the correlation of
beliefs expressed by two participants, we only need to define two
variables, i.e., one for the graduate (Grad) and another one for the
undergraduate (UGra) participant as shown in Figure 10.2.
Figure 10.2
Two participants’ ranking as SPSS variables
For having the SPSS estimate the ρ, we need to go to the Analyze menu
and move down to the list to Correlate. Highlighting this submenu will
activate three command among which we need to click on Bivariate to
activate the Bivariate Correlation dialogue box as shown in Figures 10.3
and 10.4.
beliefs of the two undergraduate and graduate students do not show any
significant relationship, i.e., ρ = 0.35. This coefficient is lower than what
we obtained manually, i.e., ρ = 0.45. According to Howell (2002), this
discrepancy happens because scholars do not agree on an accepted
method to calculate the standard error of ρ for small samples. As a
result, computing confidence limits on spearman rho “is not practical”
(p. 307).
Table 10.3
Spearman's rho obtained on five beliefs
Grad UGra
Correlation Coefficient 1.000 .351
Sig. (2-tailed) . .562
N 5 5
Table 10.4
Calculation of squared individual deviation scores on five beliefs
Where:
G = Graduate student
U = Undergraduate student
∑ .
Variance (VG) = = = 0.7
N
Standard Deviation (SDG) = VG = √0.7 = 0.84
∑ .
Variance (Vu) = = = 1.7
N
Standard Deviation (SDu) = √Vu = √1.7 = 1.30
Table 10.5
Calculation of z scores on five beliefs
We can now use SPSS to save time and labour. If we look at Figure 10.4
again, we will see Pearson appearing as the first and default choice.
Instead of clicking in the box labeled Spearman, we can check
Pearson, click on the OK and see what coefficient we will get from the
SPSS.
Table 10.6
Pearson correlation coefficient obtained on the five beliefs
Grad UGra
Pearson Correlation 1 .413
Sig. (2-tailed) .490
N 5 5
Table 10.7
The ordering of five beliefs according to graduate student’s ranking
and put it in the formula. (No lines should be drawn when the two
participants give the same responses. The two participants whose beliefs
appear in Table 10.7 did not, of course, give the same responses at all.)
For example, the ranking given by the undergraduate student on Belief 1
is comparable to the ranking given by graduate participant on Belief 4
and her own ranking on Belief 3, so we need to draw lines as shown in
Table 10.7. After drawing the lines and specifying the intersections, we
can apply the formula to our data.
2 Number of inversions
Kendall τ 1
N N 1 /2
2 3 6 6
τ 1 1 1 1 .6 0.4
5 5 1 /2 5 4 /2 10
Table 10.8
Kendall's tau_b correlation coefficient obtained by the SPSS on the five
beliefs
Grad UGra
Kendall's tau_b Correlation 1.000 .354
Sig. (2-tailed) . .420
N 5 5
However, what the responses given to each belief on the BALLI do not
show is whether there is a relationship among certain beliefs. The
possible relationships among beliefs which can be explored only through
certain statistical tests such as Factor Analysis, is referred to as latent
variables. According to Tucker and MacCallum (1997), they are more
fundamental than observed variables because “they cannot be directly
measured” (p. 2).
involves the sample size. While Tabachnick and Fidell (2007) suggested
at least 300 cases, Nunnally (1978) suggested the ratio of 10 participants
to one item.
Ghobadi’s (2009) study meets the first criterion because it explored the
beliefs of 423 participants whose ratio to 34 beliefs is well over 10 to
one. The second criterion is the strength of intercorrelations among
items. They can be explored either by focusing on intercorrelation
coefficients or by employing statistical measures. For the former,
Tabachnick and Fidell (2007) stated that if few items showed
coefficients greater than 0.30, factor analysis might not be appropriate.
For the latter, two statistical measures are employed: Bartlett’s test of
sphericity (Bartlett, 1954), and the Kaiser-Myer-Olkin (KMO) measure
of sampling accuracy (Kaiser 1970, 1974). Bartlett’s test of sphericity
should be significant (p <.05) for the factor analysis to be appropriate.
The KMO index ranges from 0 to 1, with .6 suggested as the minimum
value for a good factor analysis.
For this purpose we need to open a Variable View sheet of a SPSS file
and define 34 variables. We might, for example, type B01 under the
column name and specify the type as numeric. Under the six column
called Values, we need to specify the five points presented with each
belief. Figure 10.7 shows the values assigned to B01.
Figure 10.7
Assigning values to an ordinal variable
From the main menu on the top of the SPSS Data View we click on
Analyze to activate another menu upon which we need to click on Data
Reduction. (The newer versions of the SPSS such as the IBM SPSS
Statistics 20 have changed the name to Dimension Reduction.) It offers
three submenus called Factor, Correspondence Analysis and Optimal
Scaling. Among these three submenus we need to choose Factor.
Figure 10.8 shows the steps described so far.
Figure 10.8
Activating Factor Analysis menu on the SPSS
Upon clicking on Factor, its dialogue box appears on the monitor which
requires us to specify the variables to be analyzed as shown in Figure
10.9.
Figure 10.9
Dialogue box requiring the specification of variables to be analysed
Figure 10.10
The Descriptives dialogue box of factor analysis
Figure 10.11
Activating Factor Analysis: Extraction box
Figure 10.12
Activating Factor Analysis: Rotation box
Table 10.9 presents the KMO and Bartlett's Test of the BALLI
administered to 418 graduate and undergraduate students majoring in
Table 10.9
KMO and Bartlett's Test of 418 participants as adequate sample
Table 10.10
The extraction of communalities via Principle Component Analysis
Table 10.11
Total Variance Explained
need to do now is to spot loadings higher than .30 in order to see which
observed variables form the 14 latent variables.
Table 10.12
Component matrix
Components
1 2 3 4 5 6 7 8 9 10 11 12 13 14
B01 .42 -.06 .21 -.12 -.03 .13 -.22 .03 .24 -.36 .09 -.06 .07 -.03
B02 .20 .01 .09 .36 -.02 .08 -.32 -.22 .45 .18 .09 .08 .11 .08
B03 .12 -.04 -.07 .14 -.42 .17 .36 .10 .20 -.17 .42 .02 -.30 .04
B04 -.05 -.46 .32 -.25 -.09 -.12 -.12 -.07 .11 .27 -.11 -.09 .25 .09
B05 .06 .17 .16 -.21 .23 -.08 -.06 .36 .42 .33 -.09 .09 -.09 .26
B06 -.01 .58 -.19 .03 .19 -.06 .12 -.03 -.17 .20 -.17 .06 -.15 .36
B07 .44 .02 -.22 .22 .27 -.09 .06 .16 .08 -.08 -.03 .13 .02 .02
B08 .37 .28 .19 -.05 .12 .26 -.27 .08 .13 .13 -.04 -.01 -.02 -.36
B09 .11 -.12 .15 .32 .51 .08 -.01 -.03 -.03 -.26 .04 .17 .15 .17
B10 .18 .34 -.06 -.04 -.17 -.04 .20 -.06 -.08 .02 .46 -.01 .28 .24
B11 .21 .03 .38 -.19 -.06 .03 -.10 .17 -.11 -.09 .14 .37 -.09 .27
B12 .36 .24 .09 -.21 .03 -.01 -.15 .00 -.35 .13 .25 .25 .04 -.22
B13 .10 .27 .16 -.22 -.41 .39 .16 .09 .17 -.03 -.17 .20 .01 -.11
B14 .21 .56 -.10 .14 -.27 -.12 -.15 .03 .10 .22 -.11 -.11 -.01 .10
B15 .42 -.31 -.42 .00 -.08 .24 .02 .02 -.12 -.03 -.18 .07 -.12 .15
B16 .34 -.01 .02 .09 -.25 .04 -.18 .35 -.19 -.16 -.19 -.01 -.13 .08
B17 .17 .51 -.14 .08 .25 .16 -.16 -.28 -.07 .04 .01 -.21 -.16 -.07
B18 .26 -.16 .04 .08 .36 -.23 .06 .35 .16 .05 .28 -.07 -.23 .03
B19 .41 -.39 -.42 .07 -.02 .11 -.06 -.23 -.04 .07 -.05 -.01 .09 .21
B20 .56 -.07 .13 .03 -.12 -.18 -.24 -.17 -.24 -.08 .05 -.09 -.08 -.08
B21 -.03 .03 -.10 .49 -.16 .01 .13 .24 -.16 .20 -.03 .43 .16 -.27
B22 .44 .14 -.07 .02 .01 -.28 .30 .13 .13 -.10 -.21 -.04 .30 -.03
Components
1 2 3 4 5 6 7 8 9 10 11 12 13 14
B23 .02 .02 .29 .27 .24 .48 .23 .06 -.18 .06 -.27 .04 .11 .05
B24 .11 .13 .51 .09 -.07 .11 -.10 -.36 -.12 -.14 -.07 .09 -.04 .27
B25 .35 -.45 -.14 -.07 .11 .35 -.11 .10 -.04 .30 .14 .02 .14 .03
B26 .52 .06 .06 -.10 -.07 -.27 .19 -.02 .04 .00 -.16 -.12 .33 -.22
B27 .24 -.21 .13 -.11 -.22 .19 .17 .07 -.18 .44 -.04 -.29 -.19 .09
B28 .06 .07 .25 .54 .03 .15 .06 .11 -.04 .10 .24 -.46 .04 -.10
B29 .18 -.10 .37 -.10 .18 -.22 .47 -.23 -.16 .26 .15 .12 -.01 -.03
B30 .39 -.06 .09 -.18 .22 -.16 .08 .08 -.15 -.14 -.15 -.12 -.37 -.14
B31 .48 -.06 -.07 .21 -.21 -.25 .08 -.35 .21 .07 -.07 .16 -.17 .05
B32 .08 -.01 -.03 -.15 .23 .28 .40 -.34 .32 .00 -.07 .11 -.21 -.23
B33 .23 .25 -.15 -.37 .13 .31 .22 .08 .01 -.20 .08 -.23 .30 .18
B34 .00 .11 -.35 -.31 .19 .07 -.23 -.08 -.05 .15 .27 .18 .04 -.16
Extraction Method: Principal Component Analysis
If you remember, on the top right corner of the Factor Analysis box,
there were some buttons. We need to click on the Options button to
activate its box. There are two sections in the box: Missing Values and
Coefficient Display format. In the latter section, there are two boxes as
shown in Figure 10.13. If you click in the box at the beginning of
Suppress absolute values less than: its front box becomes active. You
can now type .30 and click on Continue in order for the SPSS to
provide you with the loadings equal to or higher than .30.
Figure 10.13
Activating Suppress absolute values less than
Table 10.13 presents the loadings equal to or higher than .30. The
suppression of the loadings less than .30 makes the identification of
acceptable loadings much easier. However, the interpretation of what the
loadings mean and what the factors upon which the observed variables
load stand for is not as simple as their identification. In fact, the
researchers need to employ their common sense, reasoning power along
with other researchers’ findings to give sense to the factors they extract
from their data. In other words, factor analysis is a test employed in
inferential statistics where the researchers must interpret what their
findings express.
Table 10.13
Component matrix with the loadings less than .30 suppressed
Components
Beliefs
1 2 3 4 5 6 7 8 9 10 11 12 13 14
Components
Beliefs
1 2 3 4 5 6 7 8 9 10 11 12 13 14
Components
Beliefs
1 2 3 4 5 6 7 8 9 10 11 12 13 14
B27 .44
-
B28 .54
.46
B29 .37 .47
-
B30 .39
.37
B31 .48 -.35
B32 .40 -.34 .32
B33 -.37 .31 .30
B34 -.35 -.31
For example, as can be seen in Table 10.13 above beliefs 1, 7, 8, 12, 15,
16, 19, 20, 22, 25, 26, 30 and 31 all load meaningfully on component
one. If we agree with Tucker and MacCallum (1997), then we should
accept component one as a latent variable consisting of 13 beliefs which
measure a macro belief which cannot be measured directly by any of
these beliefs alone. This means that we need to look for a feature shared
by these 13 beliefs.
Table 10.14
Logical areas of 13 beliefs loading on component one
10.7 Summary
Ordinal variables provide the first quantitative measures through which
human attributes such as beliefs and attitudes can be objectively
measured and then subjected to inferential statistical tests such as factor
analysis and Amos (Arbuckle & Wothke 1999).
11.1 Introduction
If you remember, we talked about Farvani’s (2004) research project in
chapter 3. She conducted her project in order to answer the question,
“Can reading portfolios increase students’ achievement more than
traditional tests?” In order to answer the question, she formulated the
null hypothesis. “there is no significant difference between the means of
pre-tests and post-tests of schema-based cloze multiple choice item test
(achievement) for the experimental and control groups after the
implementation of reading portfolios.”
11, 6, 22, 33, 27, 22, 26, 27, 29, 25, 31, 15, 32
6, 11, 15, 22, 22, 25, 26, 27, 27, 29, 31, 32, 33
By counting the raw scores above, we can say that there are only 13
students in the control group. While two students have scored 22 and the
other two 27, the rest have performed differently on the S-Test. In other
words, with the exception of 22 and 27, nine scores have a frequency of
one. We can then use these frequencies to calculate the percentage and
percentile of raw scores.
Table 11.1 presents the steps involved in calculating the percentage and
percentile of raw scores (X). (The table is given in the next page to save
space.) First, we need to arrange the scores from the highest to the
lowest and put them in column X. The frequency of each score should
then be shown in column F. As we can see in the table, the scores of 33,
32, 31 and 29 have, for example, an F of 1. Scores 5 and 6, however,
have an F of 2, indicating that two students have scored 5 and two others
have scored 6.
Table 11.1
Percentage and percentile of a set of raw scores
Once we have the CF of each score, we can calculate its percentile. Each
percentile shows the relative standing of an individual within a
particular group. For example, the percentile of students 9 and 10 who
have scored 22 in Table 11.1 shows that their position is at the same
rank as 38% of the class.
11.3.1 Mode
In order to find the mode of a set of scores, we ought to sort and arrange
them from the lowest to the highest or vice versa and then look for the
most frequent score(s). The 13 sorted scores obtained by Faravani’s
control group are given below. As we can see, the very act of sorting the
scores, allows us to find the modes, i.e. 22 and 27, easily and
immediately.
6, 11, 15, 22, 22, 25, 26, 27, 27, 29, 31, 32, 33
11.3.2 Median
The median of a set of scores refers to the score which falls in the
middle. In other words, 50% of scores fall below the median and 50%
above. If the set consists of odd number of scores, then one of them will
fall exactly in the middle. Since there are 13 sorted scores in the set
below, 6 will be on the right of the median and six on its left.
Median
6, 11, 15, 22, 22, 25 26 27, 27, 29, 31, 32, 33
middle, add them up and divide them by two in order to find its median.
For example, the sorted scores below were obtained by 10 students in
Faravani’s (2004) experimental group. Since the scores 23 and 24 are
the middlemost scores, the median of the set will therefore be (23 +
24)/2 = 23.5
Median
20, 20, 21, 21, 23 23.5 24, 26, 27, 34, 35
11.3.3 Mean
Although mode and median are the easiest ways of finding the typical
score in an observed set, they suffer from two main shortcomings. First,
they depend on the number of raw scores and usually differ from each
other. For example, while the mode for Faravani’s control group was 22
and 27, its median turned out to be 26. Secondly, they are based on
either the frequency of the scores or the middle scores and thus fail to
take all the raw scores into account.
X
Mean = /
N
The Greek capital letter Σ is conventionally used to show adding up. The
capital letter X refers to individual raw scores. The sloping line solidus32
(/), shows division. And finally the capital letter N indicates the number
of test takers. We can now apply the formula to the scores obtained by
13 participants in the control group.
32
Solidus /'sǒlidзs/ n. a line sloping from right to left (/) and used to show division
The mean value obtained above, i.e., 23.54, is distinctly different from
the median, i.e., 26. (The actual mean estimated by a calculator is
23.538461538461538461538461538462. As we can see there are 30
fractional digits in the mean which might be necessary to be reported in
scientific fields such as physics and chemistry. In social sciences,
however, usually two decimal fractions, i.e., .54 are reported. Note that
we need to round up .53 to .54 because the third decimal fraction, i.e., 8,
is higher than 5)
Since the mean of a set of raw scores is calculated on the basis of all
individual scores, we can be sure that it is the most representative of test
takers’ performance. If a participant who scored 15 on the schema-based
cloze MCIT asked Faravani (2004), for example, how she performed on
the test, she could use the mean and say that her score was lower than
the average student in the class, e.g., 23.5, and therefore she needed to
exert some extra effort to catch up with the rest of the class. (Notice that
we do not employ the total number of items on the test, i.e., 69, as a
criterion to assess her performance.)
11.2 Variation
When we report raw scores, we must specify what the maximum score
on the test would be if a test taker answered all the items. For example, a
student who has scored 23 on a test consisting of 69 items, must be told
that out of 69 pieces of construct measured by the test, s/he has mastered
only 23. Reporting raw scores on a pre-test alone without drawing the
participants’ attention to its mean, however, can be disappointing and
thus affect them adversely. (They may, for example, decide not to
participate in the research project any more.) We should therefore
describe the purpose of the test as thoroughly as possible and draw their
attention to the mean of the class.
11.2.1 Range
The range of a set of scores tells its users how far apart the highest and
lowest ability students are from each other. It is obtained by subtracting
the highest score from the lowest and adding one to show the fact that it
includes scores at both ends of the sorted set. We can apply the formula
to the scores obtained by Faravani’s (2004) control group on the
schema-based cloze MCIT consisting of 69 items as shown below.
6, 11, 15, 22, 22, 25, 26, 27, 27, 29, 31, 32, 33
Range = Highest score – Lowest score + 1 = 33 – 6 + 1 = 28
The range of 28 shows that the highest and lowest ability test takers’
ability differed greatly from each other. As we can see, the value of
range is drastically influenced by the two extreme scores. Just imagine
one of the participants had scored 50 on the test, then the range would
have changed from 28 to 45! Since the range of a set of scores depends
only on two extreme scores, it does not provide test users with an
accurate measure of test takers’ differences. This is achieved by variance
and standard deviation.
11.4.2 Variance
Variance is a measure of test takers’ individual differences which shows
to what extent they fall apart from each other on the basis of their mean
score. After sorting the raw scores (X), and calculating their mean (M),
we need to set a table where we can tabulate the steps involved in
calculating variance.
Table 11.2
Steps involved in calculating variance
Table 11.2 shows the steps involved in calculating variance. They are as
follow.
1. Add up all raw scores (X) and divide the sum () by the number of
test takers (N) to obtain the mean (M), i.e., 306 /13 = 23.54
2. Subtract the mean from each score to get the individual deviation
score (x). See column 5 in Table 11.1
3. Square each individual deviation score (x2) and then add them up (
x2), i.e., 821.21
An SPSS file consists of two sheets: Variable View and Data View. In
order to have the SPSS compute the mean of raw scores given in table
11.2 we need to open, name and save a file, e.g., FData (Faravani Data).
Naming a file is very important because in an actual research project, we
may need to work with several files at the same time. Distinctive file
names will save us a lot of confusion.
Once we have named and saved the file FData in the SPSS, we can
activate its Variable View sheet and define the variable whose central
tendency and variation values we want to have calculated. A typical
SPSS Variable View sheet consists of 10 columns: Name, Type, Width,
Decimals, Label, Values, Missing, Columns, Align, and Measure. Click
under the first column Name and do the following. (We should follow
the same instructions for other variables. We can actually define as
many variables as we wish to study in our project.)
Figure 11.1
Defining an interval variable in the SPSS
Upon defining the variable pretest, we can activate Data View sheet by
clicking on its icon at the bottom of the sheet shown in Figure 11.1.
Once we are in Data View sheet, we can enter the raw scores as shown
in Figure 11.2.
Figure 11.2
Entering Raw Score on the SPSS
Having entered the raw scores on the Data View sheet, we must click on
the top menu, activate Analyze menu, go to Descriptive Statistics and
choose Frequencies as shown in Figure 11.3.
Figure 11.3
Activating Descriptive Statistics on the SPSS
Figure 11.4
SPSS Frequencies dialogue box
We have to use the rightward arrow to transfer the variable to the box on
the right. At the bottom of the Frequencies box, we will find three
icons. If we click on Statistics, another dialogue box will appear in
which we can choose whatever we want the SPSS to do for us (Figure
11.5). Since we have studied measures of central tendency, i.e., mean,
median and mode, as well as dispersion, i.e., standard deviation,
variance and range, we check the corresponding boxes and click on
Continue. It will take us back to the Frequencies box. Click on Ok icon
to find these values calculated within a second.
Figure 11.5
Statistics available in SPSS Frequencies
Table 11.3
Descriptive statistics produced by the SPSS via Frequencies command
N Valid 13
Missing 0
Mean 23.54
Median 26.00
Mode 22(a)
Std. Deviation 8.273
Variance 68.436
Range 27
a Multiple modes exist. The smallest value is shown
Figure 11.6
Normal curve and the percentages falling between its standard
deviations
As we can see in Figure 11.6, normal curve has the following properties:
1. It is symmetrical i.e., half of the scores fall below the mean and the
other half above the mean.
2. Its mean (μ) is always 0. Remember μ is the mean population, not the
sample.
3. Its mode, median and mean (μ) are always the same. Remember these
three measures of central tendency usually differ in real data.
4. Its tails never meet the x axis because no test taker possesses the
complete knowledge or lacks any knowledge of the ability being
measured by the test.
If we remove the curve given in Figure 11.6 and present the data by
employing the terms with which we have studied our samples, we will
have the standard deviations and percentages given in Table 11.4. As we
can see, 68%, 95% and 99% of scores obtained by a sample selected
randomly will always fall between 1, 2 and 3 standard deviations,
respectively.
Table 11.4
Standard deviations and percentages of a normal set of scores
A person who is not interested in statistics at all may ask, "What is the
use of normal curve? What do we need it for?" In response, we should
say that all quantitative research projects depend on the normal curve
because it can be used to convince research users that the results
obtained were the outcome of what the researcher manipulated rather
than chance.
Table 11.5 shows the population scores which could have been obtained
if they had all participated in Faravani's (2004) control group. As we can
see, the observed mean, i.e., 23.5, has been taken as the population
mean. If we add up 23.5 with 16.6 we will get 40.1, indicating that the
student who had scored the highest on the test, i.e., 33, had behaved
normally. In other words, if a given student in the control group had
scored 40.1 and higher, she should have been excluded from the study
because she did not need to study the material presented in the class.
Table 11.5
Comparing the scores obtained by the sample control group with its
population
11.6 Summary
Interval variables are studied in order to find out whether their
manipulation within a research project will result in significant
differences in the sample. For studying these variables, the scores
quantifying these variables must be obtained from the sample and then
their central tendency and variation must be specified by calculating
their mean and standard deviation. In addition to illustrating how the
members of sample cluster and vary from each other, means and
standard deviations obtained on interval variables can be employed to
study tests and groups as discussed in the next chapter.
12.1 Introduction
A psychometric test can be viewed as an interval variable whose items
form its levels or values. As a psychometric test, the schema-based cloze
multiple choice item test (MCIT) employed by Faravani (2006) is, for
example, a 69-level interval variable which measures its takers’
comprehension of certain passages upon which it is developed. Since
test takers differ from each other in terms of their background
knowledge and reading comprehension ability, this difference must be
revealed by the number of levels the test takers achieve on the test.
In chapter 11, we learned that the scores obtained on tests such as the
schema-based cloze MCIT can be utilized to calculate their mean,
variance and standard deviation. (While a mean shows the central
tendency of a set of scores, its variance and standard deviation show
how dispersed the scores are.) We also learned that the mean and
standard deviation of a test could be used to compare the performance of
the sample with other test takers who form the population they
represent. (We will cover the statistical tests related to means in chapter
13.)
In this chapter we will use the mean, variance and standard deviation of
a set of scores in order to explore test reliability and validity. While the
former focuses on the consistency of scores, the latter reveals not only
the degree to which two tests measure a common construct but also the
amount of construct each item shares with the total test score.
KR 21 rk 1
1
Figure 12.1
Specifying the values of categorical variable on the SPSS Data View
sheet
Figure 12.2
Entering the values of categorical variables on the SPSS Data View
sheet
Upon clicking on the Reliability Analysis its dialogue box will appear
as shown in Figure 12.3. In the box on the left, we need to highlight all
the 69 variables and then click on the rightward arrow (►) to transfer
them to item box. As soon as we transfer the items, the OK icon
becomes active.
Upon clicking on the OK icon, the sheet Output 1- SPSS Viewer will
appear on your monitor. We will find the following pieces of
information in the viewer.
R E L I A B I L I T Y A N A L Y S I S - S C A L E
(A L P H A)
Reliability Coefficients
Alpha = .8201
Table 12.1
Steps involved in calculating Pearson product-moment correlation
coefficient
Now we can insert the values obtained in Table 12.1 in the formula
below.
Σxy 658.3 .
r = = .73
NSx Sy 10 16.1 5.6 .
Figure 12.5
Defining two interval variables on the SPSS
Upon defining the scores on the TOEFL and schema based cloze MCIT
as two interval variables, we need to activate Data View sheet and enter
the scores on both tests. The completion of data entry will lead to our
Figure 12.6
Choosing Correlate from Analyze menu
Figure 12.7
Bivariate Correlations dialogue box
Table 12.2
Pearson correlation coefficients and their significance calculated by
SPSS
Table 12.3
Manual calculation of point-biserial correlation coefficient
Test
Item1 Item2 Item3 Item4 Item5 Total Calculations
taker
1 0 1 1 1 0 33 MC=
2 32+31+27+27
1 1 1 1 0 32
+25+15+11+6
3 1 1 0 1 0 31 = 174 ÷ 8 =
4 0 1 1 0 0 29 21.75
5 1 1 1 1 0 27 MI =
33+29+26+22
6 1 1 1 0 1 27 +22 = 132 ÷ 5
7 0 1 0 0 0 26 = 26.4
8 1 1 0 1 0 25 SDT = 8.27
C1 = 8 ÷ 13= 0.62
9 0 1 0 1 0 22 I1 = 5 ÷ 13= 0.38
10 0 1 1 0 1 22
11 1 0 1 1 0 15
12 1 0 1 0 0 11
13 1 0 0 0 0 6
If we apply the equation above to the data given in the table we will
obtain the point-biserial correlation coefficient of -0.27 for item 1.
. . .
× = × √. 62 .38 = × √0.24 =
. .
-0.56 × 0.48 = -0.27
12.4 Summary
Interval variables provide psychometric measures to determine the
reliability and validity of instruments employed in research projects.
Among various methods of estimating the reliability of instruments such
as split-half and parallel tests, Kuder-Richardson Formula 21 provides
13.1 Introduction
Throughout the present textbook we have referred to Faravani’s (2006)
study in order to achieve some sort of unity in the content of topics
explored in a typical research project. In order to conduct her project,
Faravani formed an experimental and control group and taught some
passages of a certain text to both. Since she believed that using reading
portfolios improves learners’ reading comprehension ability better than
traditional approaches, she used portfolios in her experimental group
and employed the conventional methods of testing in her control group.
Based on the content of the passages, she developed a schema-based
cloze MCIT consisting of 69 items and administered it to both
experimental and control groups as a pre-and-post test to achieve two
purposes: testing the homogeneity of both control and experimental
groups and measuring achievement.
Now that we have learned how the mean of a set of scores is calculated,
and we have worked on the scores obtained by Faravani’s (2006)
experimental and control groups, we can explore the type of statistical
tests she used to test her null hypothesis. i.e., “there is no significant
difference between the means of pre-tests and post-tests of schema-
based cloze multiple choice item test (achievement) for the experimental
and control groups after the implementation of reading portfolios.”
13.2 Z Statistic
Z statistic is based on the concept of z scores, which in turn rests on the
concept of normal curve. If you remember, we talked about normal
curve in chapter 11, section 11.5. The normal curve is created on the
assumptions below.
If the scores obtained on the test described above are plotted out on the
basis of their frequency, the result will be a normal curve whose mode,
median and mean will be exactly in the middle of the curve. Five
thousand test takers will fall on one side of the curve and the remaining
5000 on the other, if we collect our data from a sample of 10000.
Table 13.1
Z scores and their percentages in a normal distribution
We can now use Table 13.1 to tell where a specific test taker falls within
the normal population, if we know his raw score, the mean and standard
deviation of the test he has taken. This is achieved by applying z score
formula shown below.
If we look at Table 13.2, we realize that 95.4% of test takers fall below
2 z score. Since 2.19 is higher than the z score of 2, we can say that the
student who has scored 73 on the TOEFL has performed better than 95.4
percent of the population from which he has been selected. (We should
remember that the sample size was very small in that only 10 students
took the test. Raw scores should be converted to z scores when the
sample size is large otherwise the test may overestimate its takers’
ability.)
MT MT
Z statistic
ST S
T
√ nT √nT
We can now apply the z statistic to compare the means obtained on the
69-item schema-based cloze MCIT. Faravani (2006) administered the
test to 15 students in her experimental group. They took the test twice,
once as a pre-test and another time as a posttest. Table 13.2 presents the
scores obtained on the tests.
Table 13.2
Raw scores obtained on the schema-based cloze MCIT administered as a
pre-and-post test
Participants
Tests
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15
Posttest 59 57 57 57 54 53 52 49 49 48 43 38 38 32 29
Pretest 35 27 16 26 21 20 24 13 16 21 19 18 9 20 23
MT MT 47.7 20.5
Z statistic
ST S 9.6 6.2
T
√nT √nT √15 √15
13.3 T Test
Most experimental studies conducted in applied linguistics are based on
intact groups because researchers are not usually allowed to change the
structure of classes due to institutional restrictions. Besides, the number
of students in these intact groups seldom reaches 62. This means that the
researchers have to put up with small groups whose performance would
be inappropriate to be analysed by z statistic.
T test is the only statistic which fits small groups. It has another
advantage over z statistic, i.e., the difference in the members of
participants in each group. While z statistic requires comparing groups
having the same number of participants, t test can be safely applied to
two groups whose number of members vary. For example, the number
of participants in an experimental group may be more than the
participants in a control group.
MT MT
nT nT
1. Subtract the mean of test 1 (MT1) from the mean of test 2 (MT2)
2. Divide the variance of test 1 (VT1) from the number of test takers who
took test 1 (NT1)
3. Divide the variance of test 2 (VT2) from the number of test takers who
took test 2 (NT2)
4. Add up the divided VT1 and VT1
5. Take the square root of the step 4
5. Divide the subtracted means by the result of step 5
If we insert the mean and variance of the posttest and pretest (see Table
13.1) in the T test formula we will get:
Upon estimating the T test index, we need to check its critical value in
the table given in Appendix 13.1. There are nine columns in the
appendix. The first column shows the degree of freedom (df). Since
there were two groups in the study and each group consisted of 15
students, its df will be (15 × 2) – 2= 28. If we check the row having the
df of 28 and move to the ninth column, i.e., p = .0001, we will find
3.674. Since it is lower than the calculated index of 4.53, we can then
say that the result obtained is significant at the highest level possible,
i.e., p <.0001.
Figure 13.1
Defining two variables for T test analysis
Upon defining the two variables called groups and scores as shown in
Figure 13.1, we need to activate Data View sheet in order to type the
data given in Table 13.1 under relevant variables.
Figure 13.2 shows the Data View sheet on which the data related to the
schema-based cloze MCIT have been entered. The test consisted of 69
items and was administered two times: once as a pretest and another
time as a posttest. In order to have the software compare the means, we
have to click on Analyze menu to activate its box containing Compare
Means, which in turn offers five available tests among which we need to
choose One-Sample T Test because the two tests were administered to
the same group of participants.
Figure 13.2
Entering data on the Data View sheet
move this variable to the box, a button called Define Groups becomes
active.
We must click on the button Define Groups and define the two groups
as shown in Figure 13.5. We have to type 1 in Group 1 and 2 in Group 2
in its dialogue box as shown in Figure 13.6. As soon as we click on
Continue button, we will be taken back to the main box with an active
OK button as shown in Figure 13.6.
Table 13.3
Group statistics
The second table produced by the SPSS is petty wide and thus does not
fit the size of this page. It has, therefore, been divided into two parts and
presented in Tables 13.4 and 13.5.
Table 13.4
Levene's Test for Equality of Variances
F Sig.
Table 13.4 presents the Levene’s test for equality of variances. As can
be seen, the F obtained, i.e., 3.780 is not significant, i.e., it is greater
than .05. We can therefore support our claim that the variances of the
scores obtained by the same students on the pretest and posttest are
equal and their mean scores can thus be safely compared.
Table 13.5 shows the T-test for equality of means. As can be seen, the T
test value obtained by the SPSS is almost the same as what we got by
employing the formula, i.e., 9.21. In addition to dispensing with manual
calculation, the SPSS provides the significance level and thus saves the
time required for checking the critical values in a given table.
Table 13.5
T-test for equality of means
Table 13.6 presents the findings of Shiotsu and Weir (2007). As can be
seen, the correlation coefficient obtained, i.e., r, between grammar test
and reading comprehension test (.85) is higher than that of vocabulary
and reading comprehension tests (.79).
Table 13.6
Regression and correlation among three tests (n = 591)
Reading Grammar
× ×
Grammar Vocabulary Vocabulary
Beta .64 .25 -
r .85 .79 .84
% explained 72% 62% 70%
% jointly explained 74% -
Khodadady, Pishghadam and Fakhar (2010) taught the first five units of
True to life: Intermediate class book (Gairns & Redman 1996, pp.4-39),
In the Variable View sheet of the SPSS file, we will create a variable
called Code and assign a single code for each learner. Each code can be
a number. The names of the learners can form the second string
variable. The group to which each learner belongs can be established as
the third variable.
In the sixth column of the Variable View sheet, which is called values,
click on the right side of the cell in front of the third variable named
group to open the Value Labels dialogue box. Type 1 in the box in front
of Value and Control in front of Label and then click on the button Add
as shown in Figures 13.7. Give the values of 2 and 3 to Grammar and
Vocabulary groups.
Figure 13.7
Defining and assigning values to the variable called Group
Figure 13.8
Activating One-Way ANOVA on the Analyze menu
Click on the Analyze button in the top menu as shown in Figure 13.8
above. Move down the menu to reach Compare Means and then go to
One-Way ANOVA. As soon as you click on this icon, its dialogue box
will open as shown in Figure 13.9 below. Click on the variable Schema
and then click on the top rightward arrow to move it to the box called
Dependent list. Then click on the variable Group and use the bottom
rightward arrow to move it to the box called Factor as shown in Figure
13.10 below.
Table 13.7
One-Way ANOVA analysis of scores obtained by control and
experimental groups on the syllabus-and-schema based reading
comprehension pretest
Table 13.8
One-Way ANOVA analysis of scores obtained by control and
experimental groups on the syllabus-and-schema based reading
comprehension posttest
In order to find out which of the three groups’ mean scores on the
posttest differed significantly, we need to run Scheffe Post Hoc Test.
First, we need to specify the members of the three groups by giving
them the values of 1, 2 and 3 as we did for the pretest, then we need to
have each member’s score on the posttest and a continuous variable. We
have to follow the steps described in Figures 13.9 and 13.10 once again
Figure 13.11
Activating Post Hoc Multiple Comparison
Since it is assumed that the members of the three groups will all be of
the same level of reading comprehension ability, in the top part labeled
Equal Variances Assumed click in the box named Scheffe. Then go to
the bottom of the dialogue box and click on Continue. It will take you
back to the One-Way ANOVA dialogue box. If you click on OK
button, the SPSS will provide you with the data related to the Scheffe
Post Hoc Test.
Table 13.9 present the Scheffe Post Hoc Test of the scores obtained on
syllabus-and-schema-based reading comprehension posttest. As we can
Table 13.9
The Scheffe Post Hoc Test of the scores obtained on syllabus-and-
schema-based reading comprehension posttest
Table 13.10 presents the 82 participants’ code (C), group (G) and scores
on the unseen reading comprehension test (UR) designed and
administered by Khodadady, Pishghadam and Fakhar (under review).
Establish an SPSS file and create three variables corresponding to the
codes, groups and scores giving in the table. Then apply One-Way
ANOVA analysis to the variables to find out whether the three Control
(1), Grammar (2) and Vocabulary (3) groups’ performance on the UR
was significantly different. Then apply the Scheffe Post Hoc Test to find
out whether any significant difference appears between two or three
groups. Reach conclusions on the basis of the results you obtain from
the analysis and test.
Table 13.10
The participants’ code (C), group (G) and scores on the unseen reading
comprehension test (UR)
C G UR C G UR C G UR C G UR C G UR
1 1 19 18 1 13 35 3 24 52 3 23 69 2 22
2 1 19 19 1 17 36 3 24 53 3 23 70 2 13
3 1 18 20 1 12 37 3 24 54 3 24 71 2 21
4 1 12 21 1 12 38 3 25 55 3 25 72 2 20
5 1 21 22 1 11 39 3 22 56 2 18 73 2 24
6 1 15 23 1 14 40 3 21 57 2 18 74 2 17
7 1 14 24 1 22 41 3 25 58 2 19 75 2 25
8 1 13 25 1 22 42 3 19 59 2 19 76 2 23
9 1 17 26 3 20 43 3 21 60 2 12 77 2 24
10 1 18 27 3 13 44 3 23 61 2 20 78 2 23
11 1 17 28 3 14 45 3 20 62 2 20 79 2 23
12 1 18 29 3 20 46 3 21 63 2 23 80 2 21
13 1 13 30 3 20 47 3 19 64 2 18 81 2 18
14 1 19 31 3 22 48 3 9 65 2 20 82 2 21
15 1 17 32 3 18 49 3 22 66 2 15
16 1 20 33 3 21 50 3 23 67 2 17
17 1 14 34 3 17 51 3 26 68 2 19
2. Use the second rightward arrow to copy the C-Test, S-Test and LKT
in the second box Independent(s).
3. Let the Method stay Enter as it is.
4. Click on Statistics button on the right top corner of the box to activate
its dialogue box shown in Figure 13.13.
5. Check in (√) Estimates, Confidence Intervals, Model fit,
Descriptives, Part and partial correlations and Collinearity
diagnostics as shown in Figure 13.13 and then click on Continue
button to return to Linear Regression dialogue box.
6. On the right top corner click on the second button called Plots to
activate its dialogue box shown in Figure 13.14 on the next page.
Using the rightward arrow move *ZRESID into the Y box. Click on
*ZPRED and the arrow button to move it to the X box. Under the box
on the left two alternatives are offered for Standardized Residual
Plots, click in the box labeled Normal probability plot. Click on
Continue to return to Linear Regression dialogue box.
7. Click on the Save button on the right top corner to activate its
dialogue box as shown in Figure 13.15. In the section labeled
Residuals, tick Standardized and in the Distances section, tick the
Mahalanobis and Cook’s Boxes. Click on Continue to return to
Linear Regression.
8. Click on the Options button. In the Missing Values section, click on
Exclude cases pairwise. Click on Continue to return to Linear
Regression.
9. Click on OK
The output generated from following the steps specified above are
presented and discussed, albeit briefly, one by one below. The first table
presents the correlation coefficients obtained among the variables. As
we can see, the TOEFL shows significant relationships with the three
independent variables as they do with each other and none of the
relationships is too high, i.e., r=.7 or higher. However, if you find highly
correlated variables in your own data, you may need to delete one of
them or form a composite variable from the scores obtained on the two
highly correlating variables
Correlations
a
Coefficients
Two values need to be consulted in the table above, i.e., Tolerance and
VIF. The first shows how much of the variability of the independent
variable is not explained by the other independent variables in the
model. For each variable it is calculated using the formula 1-R squared.
If the Tolerance value is very small, i.e., less than .10, it shows
multicollinearity among variable. As we can see, the values for the C-
Test, S-Test and LKT, are .692, .578, and .635, respectively. Since they
are all greater than .10, they show that we have not violated the
multicollinearity assumption.
The VIF (variance inflation factor) is the inverse of the tolerance value.
It is obtained by dividing 1 by Tolerance value. The VIF values of above
10 indicate multicollinearity. Since the VIF values of the C-Test, S-Test
and LKT are less than 10, i.e., 1.444, 1.729, and 1.575, respectively,
they provide further evidence that we have not violated the
multicollinearity assumption.
The normality of our data can also be checked by inspecting the outliers
in the Scatterplot of residuals given below. According to Tabachnick
and Fidell (2007), cases more than 3.3 or less than -3.3 are outliers. We
should, however, remember that it is common to find some outlying
residuals in large samples and ours in not an exception. As we can see in
the scatterplot, there are three outliers whose standardized residuals are
less than -3.3.
We can also have the SPSS find the outliers automatically and report
them in table. To do this, we need to go to the Residuals section in the
Statistics menu given as the first button on the top right hand corner of
Linear Regression dialogue box, tick Casewise diagnostics and specify
the magnitude of outliers specified as Standard deviations box. Since it
Casewise Diagnosticsa
As can be seen in the table above, our model could not predict the
TOEFL scores of students 116, 332 and 367 very well. For example, the
model predicted a score of 66.29 for student 116, but s/he obtained 28
on the TOEFL. Similarly, the scores predicted for students 332 and 367
are far higher than what they scored on the TOEFL in reality. We
therefore need to check whether these outliers have had any undue
influence on the results for the model as a whole.
The required results for checking the influence of outliers can be found
in the table labeled Residuals Statistics as presented below. If you
remember we specified statistics about residuals we need by activating
the Save menu and ticking the required boxes (see Figure 13.21).
According to Tabachnick and Fidell (2007), cases with values larger
than 1 are indicative of problems. If we check the table for the
Maximum value for Cook’s Distance, we find .214 which is less than 1
suggesting no potential problem. If the Maximum value we find in the
table is greater than 1, we need to go to our Data View sheet to check
the column COO_1 which is automatically produced by the SPSS itself.
If the Cook’s Distance value given for the outliers are greater than 1 we
will need to remove them and rerun the analysis.
Residuals Statisticsa
Now that we have become certain that all the assumptions of regression
analysis have been met by our data, we can check the Model Summary
table given below. If we check the value given under the R Square,
(.569), we notice that it is almost the same as the value given for
Adjusted R Square (.565). These two values are very much the same
because our sample size is pretty large. With a small sample, it is,
however, better to report the Adjusted R Square. These results show that
57% of variance (.565 × 100) in the TOEFL is explained by the C-Test,
S-Test and LKT.
Model Summaryb
ANOVAa
Upon securing the significance of our study, we can now focus on each
of the independent variables inserted in the model to determine which
one contributes to the prediction of the dependent variable significantly.
This can be achieved by scrutinizing the Coefficients table given below.
(It is broken into two parts to be fitted in the width of the page.) We
need to employ the Standardized Coefficients to discuss our findings.
Coefficientsa
Coefficientsa (Continued)
95.0% Confidence Interval for B Correlations Collinearity Statistics
Lower Bound Upper Bound Zero- Partial Part Tolerance VIF
order
24.355 33.698
.471 .659 .679 .508 .388 .692 1.444
.201 .520 .580 .218 .146 .578 1.729
.214 .424 .569 .287 .197 .635 1.575
As we can see in the first part of Coefficients table, among the three
independent variables, the C-Test has the largest Beta (.466). This
means that C-Test has the strongest unique contribution to explain the
TOEFL when the variance explained by all other variables in the model
is controlled for. Since the value given in Sig. column for the C-Test is
We can also get more results from the table of Coefficients above. In the
correlations columns, we can look for Part correlation coefficients
referred to as “semipartial correlation coefficients” by Tabachnick and
Fidell (2007, p. 145). By squaring the Part coefficients we can
determine how much of the total variance in the TOEFL is explained
uniquely by each independent variable. For example, the Part coefficient
for the C-Test is .388. If we square or multiply it by itself (.388×.388)
we obtain 0.15, indicating that the C-Test explains 15 percent of
variance in the TOEFL. Similarly, 2% (.146×.146=0.021) and 4%
(.197×.197=0.038) of variance in the TOEFL is explained uniquely by
the S-Test and LKT, respectively. If we add up 15 with 2 and 4 percents
we get 21 percent which is much less than total the R square value (.57)
given in the Model Summary table. This is because the total R square
value includes both common (.36) and unique (.21) variances explained
by the three independent variables.
Q: If we control for the possible effect of age and GPA, are the C-Test,
S-Test and LKT still able to predict a significant amount of the variance
in the TOEFL?
Table below provides the model summary. After the variables of age
and GPA are controlled as model 1, they explain 2.9% (.029×100) of the
variance as shown in R Square Change column. The inclusion of the C-
Test, S-Test and LKT in model 2 explain 54.5% (.545×100) of the
variance in the TOEFL when the effects of the age and GPA are
statistically controlled.
Model Summaryc
ANOVAa
To find out how well each of the variances contributes to the final
equation we need to check the Coefficients tables in the model 2 row.
(Due to the large width of the table, it has been scanned so that it can be
presented on the next page.) When we check, the Sig. column, we notice
that four variables contribute to the variance in the model as reflected in
the Standardized Coefficients column. In order of importance, they are
the C-Test (beta=.46), LKT (beta=.25), S-Test (beta=.18), and GPA
(beta=.07).
were statistically significant, with the C-Test having the highest beta
value (beta=.46) followed by LKT (beta=.25), S
Test (beta=.18), and GPA (beta=.07).
13.6 Summary
A large number of human characteristics are identified and
operationalized as variables in applied linguistics so that they can be
measured quantitatively. The quantification of variables endows
researchers with the ability to study their relationships with each other
and determine how they relate to learning languages.
It is hoped that the familiarity and mastery of readers with the principles,
methods and statistics of research in language education will help them
approach language learning and teaching as an exciting and rewarding
process which can ultimately lead to societies populated by ever
conscientious and responsible members. The materials presented in the
chapters so far can be supplemented by reading research papers in which
various methods have been employed. The Iranian sources of these
papers are presented as the last chapter.
14 Finding Research
Papers
14.1 Introduction
Similar to all types of sciences, social sciences have witnessed an ever-
increasing number of journals published in their subfields. The growth
has occurred due to a large number of factors among which economy
and excellence stand out as the two most influential variables governing
organizations offering social sciences.
But how can one decide whether a given educational institute such as a
university has been successful? Among various approaches available,
one has proved to be the most widely used index, i.e., citation. It is
argued that the best universities are those whose academic members
have published the most widely cited research. For this very reason, a
special institute was established in America.
In one study, the ISI decided to the find the “best of the best” as judged
by citations. By employing a computer program it extracted 1381
biomedical papers which were cited at least 300 times and published
between 1990 and 1996. By identifying the “high-impact” papers and
employing 10 of them as the least criterion, the ISI could rank the
institutions in which the papers were produced.
The high-impact papers are not studied by the ISI alone. Other
organizations have followed the suit. Thomosn Reuters (2009), for
example, does the same. The 79 cited journals given in Table 14.1 have
been taken from 1985 journals listed in its 2008 JCR33 Social Science
Edition.
Table 14.1
Selected list of Journals ranked on the basis of their total cites in 2008
Total
No Title of the Journal ISSN
Cites
1 Journal of Memory and Language 0749-596X 4912
2 Brain and Language 0093-934X 4286
Journal of Speech Language and Hearing 1092-4388 2931
3
Research
4 Review of Educational Research 0034-6543 2402
5 Multivariate Behavioral Research 0027-3171 1831
33
Journal Citation Report
Total
No Title of the Journal ISSN
Cites
6 Journal of Communication 0021-9916 1816
7 Language 0097-8507 1807
8 Intelligence 0160-2896 1611
9 Journal of Pragmatics 0378-2166 1364
10 Journal of Child Language 0305-0009 1327
11 Computational Linguistics 0891-2017 1286
12 Linguistics Inquiry 0024-3892 1269
13 Language and Cognitive Processes 0169-0965 1194
14 Journal of Phonetics 0095-4470 1012
15 Journal of Communication Disorders 0021-9924 966
16 Language Learning 0023-8333 954
17 Journal of Educational Research 0022-0671 950
18 Applied Psycholinguistics 0142-7164 924
19 Modern Language Journal 0026-7902 913
20 TESOL Quarterly 0039-8322 912
21 Studies in Second Language Acquisition 0272-2631 835
22 Applied Linguistics 0142-6001 815
American Journal of Speech-Language 1058-0360 763
23
Pathology
24 Journal of Psycholinguistics Research 0090-6905 730
25 Journal of Teacher Education 0022-4871 719
26 Mind & Language 0268-1064 690
27 Journal of Educational Measurement 0022-0655 676
Language Speech and Hearing Services 0161-1461 639
28
in Schools
29 Language in Society 0047-4045 634
30 Language and Speech 0023-8309 589
Total
No Title of the Journal ISSN
Cites
31 Lingua 0024-3841 588
32 Linguistics and Philosophy 0165-0157 557
33 Linguistics 0024-3949 530
34 Bilingualism-Language and Cognition 1366-7289 472
International Journal of Language & 1368-2822 472
35
Communication Disorders
36 Annals of Dyslexia 0736-9387 467
37 Natural Language & Linguistic Theory 0167-806X 452
38 Clinical Linguistics & Phonetics 0269-9206 438
39 American Journal of Education 0195-6744 393
40 Cognitive Linguistics 0936-5907 384
41 Journal of Research in Reading 0141-0423 379
42 Journal of Fluency Disorders 0094-730X 377
43 Dyslexia 1076-9242 371
44 Journal of Neurolinguistics 0911-6044 364
Journal of Language and Social 0261-927X 348
45
Psychology
46 Phonetica 0031-8388 343
47 Syntax and Semantics 0092-4563 342
Research On Language and Social 0835-1813 328
48
Interaction
49 Second Language Research 0267-6583 319
50 Language & Communication 0271-5309 317
51 English for Specific Purposes 0889-4906 316
52 Journal of Second Language Writing 1060-3743 316
53 Journal of Second Language Writing 1060-3743 316
54 American Journal of Evaluation 1098-2140 301
Total
No Title of the Journal ISSN
Cites
55 Journal of Sociolinguistics 1360-6441 292
56 Quarterly Journal of Speech 0033-5630 292
57 Journal of Linguistics 0022-2267 282
58 Language Learning & Technology 1094-3501 270
59 Foreign Language Annals 0015-718X 253
60 Canadian Modern Language Review 0008-4506 224
61 Linguistic Review 0167-6318 216
62 American Speech 0003-1283 201
63 Adult Education Quarterly 0741-7136 187
64 Applied Measurement in Education 0895-7347 182
65 Language Sciences 0388-0001 163
66 Metaphor and Symbol 1092-6488 162
67 International Journal of Lexicography 0950-3846 156
68 Narrative Inquiry 1387-6740 145
69 Language Teaching Research 1362-1688 124
70 Interaction Studies 1572-0373 113
71 Theoretical Linguistics 0301-4428 90
72 Folia Linguistica 0165-4004 88
73 Probus 0921-4771 79
74 Journal of Pidgin and Creole Languages 0920-9034 57
75 Lexikos 1684-4904 40
76 Text & Talk 1860-7330 35
77 Journal of Historical Pragmatics 1566-5852 25
International Journal of Speech Language 1748-8885 22
78
and the Law
79 Interpreter and Translator Trainer 1750-399X 2
Total
No Title of the Journal ISSN
Cites
80 Across Languages and Cultures 1585-1923 -
81 Child Language Teaching & Therapy 0265-6590 -
82 English Teaching-Practice and Critique 1175-8708 -
83 European Journal of Teacher Education 0261-9768 -
84 Functions of Language 0929-998X -
85 Journal of Semantics 0167-5133 -
86 Language and Linguistics 1606-822X -
87 Language and Literature 0963-9470 -
88 Language and Speech 0023-8309 -
89 Language Assessment Quarterly 1543-4303 -
90 Language Awareness 0965-8416 -
91 Language Culture and Curriculum 0790-8318 -
92 Language Matters 1022-8195 -
Language Problems & Language 0272-2690 -
93
Planning
94 Language Testing 0265-5322 -
95 Language Variation and Change 0954-3945 -
96 Phonology 0952-6757 -
97 Review of Research in Education 0091-732X -
98 Teacher and Teacher Education 0742-051X -
99 Translator 1355-6509 -
The fourth column of Table 14.1 shows the total cites. The statistics has
been taken from Thomosn Reuters (2010). As can be seen, among the 99
journals which publish articles in English, Journal of Memory and
Language, Brain and Language, Journal of Speech Language and
Hearing Research have occupied the first, second and third ranks,
respectively. The ranking is based on the total cites. (There was no
statistics for the last 20 journals.)
Table 14.2 presents nine journals selected from among some 150
journals given in the SID site. It includes the name of those journals
which publish papers related to applied linguistics and English language
and literature.
Table 14.2
List of English language related Journals published in Iran
Publication
No Name Place of Publication
Times
1 Ferdowsi Review (not Quarterly Ferdowsi University
listed yet) of Mashhad
2 Iranian Journal of Applied Quarterly Sistan and
Language Studies Baluchestan
University
3 Iranian Journal of Applied Semi-annually Teacher Training
Linguistics (IJAL) University
Publication
No Name Place of Publication
Times
4 Journal of Curriculum Quarterly The Iranian
Studies (J.C.S) Curriculum Studies
Association
(I.C.S.A)
5 Journal of Faculty of Quarterly Tabriz University
Letters and Humanities
(Tabriz)
6 Language and Linguistics Semi-annually Linguistic Society of
Iran
7 Pazhuhesh-e Zabanya-ye Quarterly Faculty of Foreign
Khareji Languages, Tehran
University
8 Research on Foreign Semi-annually Tabriz University
Languages (Journal of
Faculty of Letters and
Humanities) (Tabriz)
9 Teaching English Quarterly English Language
Language and Literature and Literature
Society of Iran (TELLSI) Society of Iran
You can access the journals specified above online. For example, click
on the Iranian Journal of Applied Linguistics (IJAL) to be linked to its
homepage as shown below:
IRANIAN JOURNAL OF
Name:
APPLIED LINGUISTICS (IJAL)
Type: SEMI-ANNUALLY
Editor in MOHAMMAD HOSSEIN
chief : KESHAVARZ
TEACHER TRAINING
Publisher :
UNIVERSITY
MOHAMMAD HOSSEIN
Manager :
KESHAVARZ
NO. 49, SHAHID DR.
MOFATTEH AVE., FOREIGN
LANGUAGES DEPARTMENT,
Address:
OFFICE OF IRANIAN
JOURNAL OF APPLIED
LINGUISTICS (IJAL), 15614
Tel: (021) 88304896
MHKESHAVARZ@YAHOO.CO
E.Mail:
M
According to SID's Policy, all of the articles published in this journal are
indexed in SID web site from the year of 2000 until now.
There is a + mark in front of each year. If you click on that mark, it will
show you the number of issues published in each year. For example, at
the time it was accessed, issue 2006 had two numbers published in
March and September.
Year : 2007
Number 1 MARCH 2007
Year : 2006
Number 2 SEPTEMBER 2006
Number 1 MARCH 2006
If you click on Number 2 in Year 2006, the page below will appear:
Printable Version
If you click on the icon you will access the whole article as a PDF
file. You can then save it on your hard.
Table 14.3
Name and Address of some online journals
14.5 Summary
Papers published in research journals provide the best information
regarding the topics related to given fields. The most widely cited papers
published by top-ranking institutes act like pivots upon which solid
research projects must be built. The ISI and SID provide the most
informative lists for researchers in English language teaching, literature
and translation.
Appendix 3.1
Self-assessment used as reading portfolios (Faravani, 2006, p. 84)
Please put a check mark (√) in the box which best describes your own
reading activities
Almost Almost
Activities Sometimes
Always Never
1. Participates in small group
discussion.
2. Shares responses to the reading.
3. Comprehends questions.
4. Gives quality responses during
small group discussions.
5. Uses structure and background
knowledge (look-in strategy).
6. Uses other sources (dictionary,
thesaurus, content, and Text).
7. Integrates strategies and sources.
8. Uses meaning cues.
9. Uses structural cues.
10. Uses visual cues.
11. Integrates cues (meaning,
structural, visual).
12. Makes predictions and reads to
find out if it was right.
13. Reads the sentences before and
after a word he doesn’t know.
14. Guesses the meanings of the
words he doesn’t know from the
context.
Please put a check mark (√) in the box which best describes your own
reading activities
Almost Almost
Activities Sometimes
Always Never
15. Looks for the main idea.
16. Discusses what he reads with
others.
17. Has the ability to self-correct.
18. Recognizes cause and
relationships between/among
sentences.
19. Draws inferences.
20. Can provide examples from
personal experience and/or prior
knowledge and uses relevant
examples from the text.
21. Recognizes logical order.
22. Recognizes paraphrasing.
Appendix 3.2
Peer-assessment used as reading portfolios (Faravani, 2006, p. 85)
Please put a check mark (√) in the box which best describes your
classmate's reading activities.
Almost Almost
Activities Sometimes
Always Never
1. Participates in small group discussion.
2. Shares responses to the reading.
3. Comprehends questions.
4. Gives quality responses during small
group discussions.
5. Uses structure and background
knowledge (look-in strategy).
6. Uses other sources (dictionary,
thesaurus, content, and Text).
7. Integrates strategies and sources.
8. Uses meaning cues.
9. Uses structural cues.
10. Uses visual cues.
11. Integrates cues (meaning, structural,
visual).
12. Makes predictions and reads to find
out if it was right.
13. Reads the sentences before and after
a word he doesn’t know.
14. Guesses the meanings of the words
he doesn’t know from the context.
Please put a check mark (√) in the box which best describes your
classmate's reading activities.
Almost Almost
Activities Sometimes
Always Never
15. Looks for the main idea.
16. Discusses what he reads with others.
17. Has the ability to self-correct.
18. Recognizes cause and relationships
between/among sentences.
19. Draws inferences.
20. Can provide examples from personal
experience and/or prior knowledge
and uses relevant examples from the
text.
21. Recognizes logical order.
22. Recognizes paraphrasing.
Appendix 3.3
Self-reflections on readings (Faravani, 2006, p. 86)
Name: Date:
Reading Number: Group:
How is the reading related to your everyday life? What does the author
want you to learn?
What problems did you have when you read this passage?( What were
your weaknesses in reading?)
What do you think about your reading now? Do you think you made
progress? What are your strengths in reading now?
Appendix 5.1
Table of random numbers34
34
Based on parts of Table of 105,000 Random Decimal Digits, Interstate Commerce
Commission, Bureau of Transport Economic and Statistics, Washington, D. C.
Appendix 12.1
Critical values for the Pearson Product-moment Correlation Coefficients
Appendix 13.1
Critical values for T test35
35
Mile, J. (2008). Learning Statistics: A blog about learning statistics in psychology,
health and social sciences. Retrieved August 31, 2010 from
http://www.jeremymiles.co.uk/misc/tables/t-test.html
Appendix 13.2
The scores of 30 students obtained on five tests
Code Age GPA TOEFL C-Test S-Test LKT ZRE_1 MAH_1 COO_1
1 19 17.00 85 49 29 33 0.682 0.910 0.001
2 33 18.50 96 55 43 51 0.384 6.051 0.001
3 19 17.00 45 33 19 4 -1.009 3.980 0.003
4 19 16.50 74 52 26 9 0.313 2.220 0.000
5 19 17.50 65 38 25 14 0.095 2.037 0.000
6 24 14.30 81 43 28 26 0.867 1.058 0.001
7 21 16.50 73 61 30 20 -0.718 0.875 0.001
8 28 15.96 74 51 16 14 0.554 2.508 0.001
9 22 16.00 83 47 30 30 0.656 0.846 0.001
10 22 16.00 90 47 31 30 1.277 0.954 0.002
11 20 16.30 76 42 28 17 0.721 1.530 0.001
12 24 16.50 84 58 26 29 0.375 0.482 0.000
13 22 15.50 68 50 22 24 -0.455 0.710 0.000
14 22 15.50 73 50 25 25 -0.118 0.261 0.000
15 22 15.00 79 55 23 13 0.603 1.426 0.001
16 20 17.30 87 61 30 22 0.530 0.593 0.000
17 20 17.30 89 61 30 22 0.717 0.593 0.001
18 21 16.00 79 57 25 15 0.371 1.165 0.000
19 21 16.00 63 57 25 16 -1.154 0.998 0.002
20 24 16.50 76 62 40 12 -0.589 7.076 0.002
21 19 14.00 52 45 42 8 -1.883 11.990 0.031
22 24 17.00 78 56 32 19 -0.025 1.066 0.000
23 19 17.00 100 77 42 39 -0.011 3.381 0.000
24 20 16.50 75 55 22 42 -0.601 5.027 0.001
25 20 18.00 76 65 28 26 -0.761 0.910 0.001
26 23 14.50 81 65 28 42 -0.771 2.987 0.002
27 23 14.50 60 42 6 7 0.266 7.068 0.000
28 21 16.50 104 71 31 22 1.557 2.394 0.005
29 21 16.83 104 71 31 22 1.557 2.394 0.005
30 21 16.83 89 34 37 53 0.981 13.154 0.009
References
Abdelwahab, S. (2002). Portfolio assessment: A qualitative
investigation of portfolio self-assessment in an intermediate EFL
classroom. Unpublished PhD thesis, University of Ohio.
Alderson, J. C. (1984). Reading in a foreign language: A reading
problem or a language problem? In J. C. Alderson & A. H.
Urquhart (Eds.). Reading in a foreign language (pp. 122–135).
New York: Longman.
American Psychological Association (2010). Publication manual (6th
ed.). Washington: American Psychological Association.
Amouzadeh-Mahdiraji, M. (2004). Language and the representation of
reality. Tabriz University Journal of Faculty of Letter &
Humanities, 190, 1-21.
Anastasi, A. (1968). Psychological testing (3rd ed.). London: Macmillan.
Arberry, A. (1964). The Koran interpreted. London: OUP.
Arberry, A. J. (1954). Persian Poems (translated by Sir William Jones,
pp. 118-119). London.
Arberry, A. J. (1961). Tales from the Masnavi. London: George Allen
and Unwin.
Arbuckle, J., & Wothke, W. (1999). AMOS 4.0 user’s guide.
Chicago: Smallwaters.
Armitage, P. (1971). Statistical methods in medical research. Oxford:
Blackwell.
Atkinson, M. (1992). Children’s syntax: An introduction to principles
and parameters theory. Oxford: Blackwell.
Bachman, L.F. & Palmer, A. (1982). The construct validation of some
components of communicative proficiency. TESOL Quarterly 16,
449–65.
Bachman, L.F. & Palmer, A. (1989). The construct validation of self-
ratings of communicative language ability. Language Testing 6,
14–29.
Bachman, L.F. & Palmer, A. (1981). The construct validation of the FSI
oral interview. Language Learning 31, 67–86.