Professional Documents
Culture Documents
Jawapan Peperiksaan Aeu
Jawapan Peperiksaan Aeu
behaviour.
c)Incompleteness refers to the students inability to demonstrate the entire
repertoire of the
construct being measured. As a test is constrained by time and physical
setting, a student will
never be able to show all of what he or she is able to do. Because only a few
questions can be
asked in a test due to time constraints, these questions may not be able to
elicit the students true
or complete ability. Similarly, the constraints placed by the physical setting
of the test may also
restrain the student from demonstrating specific kinds of abilities. As such,
we should take note
that even when a student scores zero points in a test, this does not mean
that he or she is
completely ignorant of the subject or ability being tested. It is just that the
test has not elicited
the knowledge or abilities that the student is able to convey or perform.
d)While we are aware of the importance of having direct tests, it is unlikely
that a test will be
completely free of being an indirect measure of ability. This limitation is
inherent in the testing
situation itself. Many of us have gone through test anxiety. Once the word
test or assessment is
mentioned, the entire situation changes. While some students will be able to
speak well in
situations outside the classroom, they lose this ability once they become
aware that they are being
tested. In addition to this, every test situation has elements that are not
related to the construct
being tested. This is referred to as construct irrelevant variance by Messick
(1989) and examples
may include the test rubrics or instructions, time constraints, and other
rules and regulations of
the test. All these are not present in the actual real-world situation and
must be considered as
aspect of indirectness. As such, we can only conclude that the test situation
is indirect because it
is inauthentic. And by being indirect, it fails to capture the true ability of the
students if they were
to perform in the real world.
Chapter 6
4 types of test
(a) Achievement test.
(b) Aptitude test.
(c) Proficiency test.
(d) Diagnostic test
Closely related to the distinction made between the direct and indirect tests
are authentic tests.
Chapter 7
1 THE CLOZE TEST
The cloze test is a test that is often associated with language proficiency
testing. It is more than
simply filling in blanks in a passage as it has a theoretical basis. The term
cloze comes from the
word closure and reflects a psychoanalytical human tendency to close any
incomplete object. As
such, the cloze test is thought to elicit a respondents language competency
by requiring the
respondent to complete a passage which has been mutilated with blanks.
Although it was
initially intended to be a measure of reading ability, the cloze test has often
been considered as a
measure of overall general language proficiency.
There are many different types of cloze tests, two of the more common are
determined by how
the words in the passage are deleted in order to form blanks in the passage.
The fixed deletion
cloze is a cloze passage where every nth word in the passage is deleted. For
example, a cloze test
where n = 5 means that every fifth word after the first sentence is deleted.
This method is said to
help assess overall language proficiency as the types of words deleted are
thought to be
representative of language in general, given the fact that they have been
deleted on a more or less
random basis.
If the test maker intentionally deletes a certain kind of word, then the cloze
test is referred to as a
rational deletion cloze test. A rational deletion cloze test could involve the
deletion of only verbs,
for example. The number of words between every blank in a rational
deletion cloze test may not
consistently be the same. However, you may also find some cloze tests in
which the passage has
been altered so that only certain types of words are deleted at consistent
intervals. These cloze
passages, even if they consist of blanks that are spaced out equally, are still
rational cloze
passages as the deleted words were selected by the test maker.
7.1.1 THE STRUCTURE OF THE CLOZE TEST
The cloze test consists of a passage with blanks. The first sentence is left
intact without any
blanks. This is to ensure that the test takers have some context to work
with. It also provides
other information such as to indicate the tense of the passage. Normally the
cloze passage is long
enough to allow for about 20 blank spaces as a longer text would make it
extremely difficult.
difficulty level of the cloze procedure include the following:
(a) Length of the text: The longer the text, the more difficult the cloze
passage.
(b) Familiarity of vocabulary and structures: This includes the word
that is neededto fill
in the blank. For example, in a sentence such as The situation was _____
with danger, it is
highly unlikely that non native speakers would be able to provide the
correct word
fraught to fill in the blank.
(c) Length and complexity of the sentences: The longer and more
complex the sentence,
the more difficult it becomes for the student to complete the cloze.
(d) Familiarity with chapter and discourse genre: Familiarity with each
of these would
make the cloze easier.
(e) Frequency with which blanks are spaced: In this case, when the
blanks are closer
together, the more difficult the cloze passage becomes. Normally, the
number of words
between blanks or the N in a cloze passage is between 5 to 7 and seldom
less than 5.
Grading Clozen test
Dictation of test
The dictation is a common form of assessment that many of us have
experienced. The dictation
is seen to have some commonalities with the cloze test, especially in that
both are considered to
be able to predict overall language ability. The dictation is also thought to
provide results that are
similar to those obtained in cloze tests but with the added ability of
assessing listening as well
(Hughes, 1972). In a standard dictation test, the teacher begins by selecting
an appropriate
passage. This passage is usually a short passage no longer than one
paragraph. This stage of the
dictation is an important one as the paragraph that has been selected must
be appropriate to the
students language ability as well as cultural background. After having
selected the passage, the
teacher can proceed with the dictation. 7.2.1 THE STRUCTURE OF THE
DICTATION
The dictation passage is usually read out three times. The first time it is
read out, it is done so at
a normal rate of reading. Students are expected to listen and get the gist of
the passage. The
second reading is a little slower and the students are expected to take down
what is read. During
the second reading, the teacher usually pauses to break the passage into
meaningful chunks
referred to as bursts. Finally, the passage is read a third time and students
are expected to check
their work, editing it for errors.
The dictation passage is usually read out three times. The first time it is
read out, it is done so at
a normal rate of reading. Students are expected to listen and get the gist of
the passage. The
second reading is a little slower and the students are expected to take down
what is read. During
the second reading, the teacher usually pauses to break the passage into
meaningful chunks
referred to as bursts. Finally, the passage is read a third time and students
are expected to check
their work, editing it for errors.
Partial Dictation
The partial dictation is essentially like a listening cloze activity. Students are
provided the passage
with some words or phrases deleted. They are expected to listen to a
passage and fill in words or
phrases. It is commonplace to have partial dictations in which single words
or even short phrases
are deleted.
Dictocomp
Finally, in the dictocomp, the students are expected to use the information
they hear to construct
a coherent piece of composition instead of taking down the passage exactly
as it was dictated.The teacher will determine the key elements of the
original passage which the student is expected
to include in the composition. Therefore, the dictocomp can be said to test
listening
comprehension in a very specific way in that the student has to decide what
pieces of
information are important and should be included. This is reminiscent of
summaries.
Additionally, the dictocomp also tests writing ability as well because the
students are expected to
write a cohesive piece based on the passage that was dictated to them.
Chapter 99.
similarly a test may be more integrative than another. Perhaps the more
important aspect is to be
aware of the discrete point or integrative nature of a test as we must be
careful of what we
believe the test measures.
This brings us to the question of how discrete point is a multiple choice
question type item?
While it is definitely more discrete point than an essay, it may still require
more than just one skill
or ability in order to complete. Lets say you are interested in testing a
students knowledge of the
relative pronoun and decide to do so by using a multiple choice test item. If
he fails to answer
this test item correctly, would you conclude that the student has problems
with the relative
pronoun? The answer may not be as straight forward as it seems. The test is
presented in textual
form and therefore requires the student to read. As such, even the multiple
choice test item
involves some integration of language skills as this example shows, where in
addition to the
grammatical knowledge of relative pronouns, the student must also be able
to read and
understand the question.
Perhaps a clearer way of viewing the distinction between the discrete point
and the integrative
test is to examine the perspective each takes toward language. In the
discrete point test, language
is seen to be made up of smaller units and it may be possible to test
language by testing each unit
at a time. Testing knowledge of the relative pronoun, for example, is
certainly assessing the
students on a particular unit of language and not on the language as a
whole. In an integrative
test, on the other hand, the perspective of language is that of an integrated
whole which cannot
be broken up into smaller units or elements. Hence, the testing of language
should maintain the
integrity or wholeness of the language.
Multiple choice
The multiple choice format is perhaps the most common test format to many
of us. It is also
commonly referred to as an objective test as there is seen to be objectivity
in grading the test.
In this section, we will examine the multiple choice format with respect to
its structure, use, and
construction.
There are a number of situations in which a multiple choice format test may
be useful and
appropriate. Ory outlines some of these situations as follows:
Chapter 8
Essay
Unlike the directed writing task, the continuous writing test item provides
little structure other than the question itself. Students are expected to draw
upon their experience and past knowledge as well as knowledge of writing
conventions and organisation in order to complete the task.
The essay test format provides several advantages compared to the multiple
choice test format.
Some of these advantages as mentioned by Kubiszyn and Borich (2000:18)
are:
(a) It can assess higher order skills. Unlike the multiple choice test
format which
is often limited to assessing low order skills, the essay places a premium on
the
ability to analyse, synthesise and evaluate through topics that require
students to
scoring essay
As we have seen earlier, scoring an essay is not easy as graders can be
easily swayed by many
factors. Scoring remains one of the major issues in grading essays. There
are generally three
major approaches to scoring essays which are the holistic scoring method,
the analytical scoring
method, and the objective scoring method.
Holistic Scoring
In holistic scoring, the reader reacts to the students compositions as a
whole and a single score
is awarded to the writing. Normally this score is on a scale of 1 to 4, or 1 to
6, or even 1 to 10.
(Bailey, 1998 : 187). Each score on the scale will be accompanied with
general descriptors of
ability. The following is an example of a holistic scoring scheme based on a
6 point scale.
The 6 point scale above includes broad descriptors of what a students essay
reflects for each
band. It is quite apparent that graders using this scale are expected to pay
attention to vocabulary,
meaning, organisation, topic development and communication. Mechanics
such as punctuation
are secondary to communication.
Analytical Scoring
Analytical scoring is a familiar approach to many teachers. In analytical
scoring, raters assess
students performance on a variety of categories which are hypothesised to
make up the skill of
writing. Content, for example, is often seen as an important aspect of
writing i.e. is there
substance to what is written? Is the essay meaningful? Similarly, we may
also want to consider
the organisation of the essay. Does the writer begin the essay with an
appropriate topic sentence?
Are there good transitions between paragraphs? Other categories that we
may want to also
consider include vocabulary, language use and mechanics. The following are
some possible
components used in assessing writing ability using an analytical scoring
approach and the
suggested weightage assigned to each:
Objective Scoring
A third type of scoring approach is the objective scoring approach. This
scoring approach relies
on quantified methods of evaluating students writing. A sample of how
objective scoring is
conducted is given by Bailey (1999) as follows:
Establish standardization by limiting the length of the assessment: Count
the first 250 words of
the essay.
Identify the elements to be assessed: Go through the essay up to the 250th
word underlining
every mistake from spelling and mechanics through verb tenses,
morphology, vocabulary, etc.
Include every error that a literate reader might note.
Operationalise the assessment: Assign a weight score to each error, from 3
to 1. A score of 3 is a
severe distortion of readability or flow of ideas; 2 is a moderate distortion;
and 1 is a minor error
that does not affect readability in any significant way.
Quantify the assessment: Calculate the essay Correctness Score by using
250 words as the
numerator of a fraction, and the sum of error scores as the denominator:
The denominator is the
sum of all the error scores:
The steps described above help to provide a clear and systematic method
for assessing essays.
Objective scoring does not necessarily need to use the same values as in
this example. The most
important element in this approach is the objective scoring which is
determined through the
unbiased and fixed values provided according to some concrete aspect of
the essay such as the
number of mistakes made.
of the grading process should therefore be given due consideration and not
ignored. There are
enough incidents of graders jumping the gun and assessing essays
without first becoming
familiar with the scoring criteria. This may only result in having to grade
the paper again.
The purpose of identifying benchmark papers or anchor papers is to provide
a clear and
representative example of students work according to the grading criteria.
Bands can only give a
general description of what is expected. Anchor or benchmark papers
provide concrete examples
and help ensure fairness in grading.
When it comes to the actual grading, some recommend that we first quickly
scan through all the
essays and place them in stacks according to the bands on the scale. All
papers which we
consider A papers will be stacked together, the B papers will be together
and so on. We can then
read each paper more closely in order to confirm our initial impression. If
we need to assign
more precise numerical scores, we can do so at this time. Another pointer in
grading essays,
especially when there are several essays, is to grade all the students on one
essay first before
moving on to the next essay. This is expected to help ensure more
consistent grading.