Professional Documents
Culture Documents
07 Hmef5053 T3
07 Hmef5053 T3
07 Hmef5053 T3
3 Objective
Tests
LEARNING OUTCOMES
By the end of this topic, you should be able to:
1. Define objective tests;
2. Differentiate between different types of objective tests;
3. Discuss the advantages of using different objective tests;
4. Identify the weaknesses of using the different objective tests;
5. Explain the techniques of making different objective test items; and
6. Plan a table of specifications for deciding what to assess and how
to assess.
INTRODUCTION
In Topic 2, we discussed the need to assess students holistically based on
cognitive, affective and psychomotor learning outcomes. In the next four topics,
we will discuss how one can assess these three learning outcomes. In this topic,
however, we will focus on using objective tests in assessing various kinds of
behaviour in the classroom. Three types of objective tests are examined and the
guidelines for the construction of each type of test are discussed. The advantages
and limitations of each of these types of objective tests are explained, followed by
an examination of the table of specifications and its role in determining the
distribution of test items in the assessment of various types of learning outcomes.
When objective tests were first used in 1845 by George Fisher in the United
States, they were not well received by society. However, they were widely used
later, and today objective tests are perhaps the most popular format of assessing
various types of human abilities, competencies and socio-emotional attributes.
Objective tests are extensively used in schools, industry, business, professional
organisations, universities and colleges. Can you guess why?
Objective tests vary depending on how the questions are presented. The three
common types of questions used in most objective tests are multiple-choice
questions, matching questions and true-false questions (see Figure 3.2).
Multiple-choice questions are the most difficult to prepare. These questions have
two parts: a stem that contains the question and four or five options with one
containing the correct answer called the keyed response. The other incorrect
options are called distractors. The stem may be presented as a question, direction
or a statement while the options can be a word, phrase, numbers, symbols and so
forth. The role of the distractor is to attract the attention of respondents who are
not sure of the correct answer.
SELF-CHECK 3.1
McKenna and Bull (1999) offered some guidelines for constructing stems for
multiple-choice questions. All the options in multiple-choice items need to be
plausible, but they also need to separate students of different ability levels. Let us
take a look at these guidelines.
Weak Question:
World War II was:
B. Horrible
Improved Question:
In which of these time periods was World War II fought?
A. 19141917
B. 19291934
C. 19391945
D. 19511955
Note: In the weak question, there is no clue from the stem to what the
question is asking. The improved version identifies more clearly the
question and offers the student a set of homogeneous choices.
Weak Question:
For almost a century, the Rhine river has been used by Europeans for a
variety of purposes. However, in recent years, the increased river traffic
has resulted in increased levels of diesel pollution in the waterway.
Which of the following would be the most dramatic result if, because of
the pollution, the Council of Ministers of the European Community
decided to close the Rhine to all shipping?
A. Increased prices for Ruhr products
B. Shortage of water for Italian industries
C. Reduced competitiveness of the French Aerospace Industry
D. Closure of the busy river Rhine ports of Rotterdam, Marseilles and
Genoa
Improved Question:
Which of the following would be the most dramatic result if, because of
diesel pollution from ships, the river Rhine was closed to all shipping?
A. Increased prices for Ruhr products
B. Shortage of water for Italian industries
C. Reduced competitiveness of the French Aerospace Industry
D. Closure of the busy river Rhine ports of Rotterdam, Marseilles
and Genoa
Note: The weak question is too wordy and contains unnecessary material.
(c) When writing stems, use clear, straightforward language. Questions that
are constructed using complex wording may become a test of reading
comprehension rather than an assessment of students performance with
regard to a specific learning outcome.
Weak Question:
As the level of fertility approaches its nadir, what is the most likely
ramification for the citizenry of a developing nation?
A. A decrease in the labour force participation rate of women
B. A downward trend in the youth dependency ratio
C. A broader base in the population pyramid
D. An increased infant mortality rate
Improved Question:
A major decline in fertility in a developing nation is likely to produce
(d) When writing stems, use negatives sparingly. If negatives must be used,
capitalise, underscore or bold.
Weak Question:
Which of the following is not a symptom of osteoporosis?
Improved Question:
Which of the following is a symptom of osteoporosis?
C. Painful joints
D. Hair loss
Note: The improved question is stated in the positive so as to avoid the use
of the negative not.
(e) When writing stems, put as much of the question in the stem as possible,
rather than duplicating material in each of the options.
Weak Question:
Theorists of pluralism have asserted which of the following?
Improved Question:
Theorists of pluralism have asserted that the maintenance of democracy
requires
(f) When writing stems, avoid giving away the answer because of grammatical
cues.
Weak Question:
A fertile area in the desert in which the water table reaches the ground
surface is called an
A. Mirage
B. Oasis
C. Lake
D. Polder
Improved Question:
A fertile area in the desert in which the water table reaches the ground
surface is called a/an
A. Lake
B. Mirage
C. Oasis
D. Polder
Note: The weak question uses the article an which identifies choice B as
the correct response. Ending the stem with a/an improves the question.
Weak Question:
Which of the following men contributed the most towards the defeat of
Hitlers Germany in World War II?
A. Winston Churchill
B. Josef Stalin
C. Franklin D Roosevelt
D. George Patton
(h) Avoid using ALWAYS and NEVER in the stem as test-wise students are
likely to rule such universal statements out of consideration.
(i) When writing distractors for single response MCQs, make sure that there is
only one correct response.
Weak Question:
What is the main source of pollution of Malaysian rivers?
A. Land clearing
B. Open burning
D. Coastal erosion
Improved Question:
What is the main source of pollution of Malaysian rivers?
A. Open burning
B. Coastal erosion
(j) When writing distractors, use only plausible and attractive alternatives.
Weak Question:
Who was the third Prime Minister of Malaysia?
A. Hussein Onn
B. Ghafar Baba
C. Mahathir Mohamad
D. Musa Hitam
Improved Question:
Who was the third Prime Minister of Malaysia?
A. Hussein Onn
B. Abdullah Badawi
C. Mahathir Mohamad
(k) When writing DISTRACTORS, if possible, avoid the choices All of the
above and None of the above. If you do include them, make sure that
they appear as correct answers some of the time.
(m) Do not create distractors that are so close to the correct answer that they
may confuse students who really know the answer to the question.
Distractors should differ from the key in a substantial way, not just in some
minor nuance of phrasing or emphasis.
You will probably choose to use three, four or five alternatives in a multiple-
choice question. Until recently, it was thought that three or four distractors were
necessary for the item to be suitably difficult. However, a study by Owen and
Freeman suggests that three choices are sufficient (Brown, 1987). Clearly, the
higher the number of distractors, the less likely it is for the correct answer to be
chosen through guessing provided that all alternatives are of equal difficulty.
ACTIVITY 3.1
1. Do you agree that you should not use negatives in the stems of
MCQs? Why?
2. Do you agree that you should not use ALL OF THE ABOVE and
NONE OF THE ABOVE as distractors in MCQs? Why?
(f) Scores are more reliable than subjectively scored items (e.g. essays);
(h) Item analysis can reveal how difficult each item was and how well it
discriminated between the strong and weaker students in the class;
(i) Performance can be compared from class to class and year to year;
(j) Can cover a lot of material very efficiently (about one item per minute of
testing time); and
(k) Items can be written so that students must discriminate among options that
vary in degree of correctness.
(c) MCQs are not as effective for measuring some types of problem-solving
ability and ability to organise and express ideas;
(f) Students can sometimes read more into the question than was intended;
(g) It often focuses on testing factual information and fails to test higher levels
of cognitive thinking;
(i) They place a high degree of independence on the students reading ability
and the constructors writing ability;
Test for important or significant Avoid giving clues through the use of
information faulty grammatical construction
Avoid trick items Avoid the use of humour when
Keep the vocabulary consistent with developing options
the students level of understanding Present practical or real-world
Avoid overly specific knowledge situations to students
when constructing items Use pictorial materials that require
Avoid textbook and verbatim students to apply principles and
phrasing when developing items concepts
SELF-CHECK 3.2
alternatives. These questions can be used when the test designer wishes to
measure a students ability to identify whether statements of fact are accurate or
not. The true-false questions can be used for testing knowledge and judgement in
many subjects. When grouped together, a series of true-false questions on a
specific topic or scenario can test a more complex understanding of an issue.
They can be structured to lead a student through a logical pathway (Brown, 1997)
and can reveal part of the thinking process employed by the student in order to
solve a given problem.
Example:
True False
(a) Guessing A student has a one in two chance of guessing the correct
answer of a question. Scores on true-false items tend to be high because of
the ease of guessing correct answers when the answers are not known. With
only two choices (true or false), the student can expect to guess correctly on
half of the items for which correct answers are not known. Thus, if a
student knows the correct answers to 10 questions out of 20 and guesses on
the other 10, the student can expect a score of 15. The teacher can anticipate
scores ranging from approximately 50 percent for a student who did
nothing but guess on all items to 100 percent for a student who knew the
material;
(b) Because these items are in the form of statements, there is sometimes a
tendency to take quotations from the text, expecting the student to
recognise a correct quotation or note a change (sometimes minor) in
wording. There may also be a tendency to include trivial or inconsequential
material from the text. Both of these practices are discouraged;
(d) The format does not discriminate among students of different abilities as
well as other question types.
(c) Try using true-false questions in combination with other materials, such as
graphs, maps and written material. This combination allows for the testing
of more advanced learning;
(d) Avoid lifting statements directly from assigned readings, notes or other
course materials so that recall alone will not lead to a correct answer;
(e) In general, avoid the use of words which would signal the correct response
to the test-wise student. Absolutes such as none, never, always, all
and impossible tend to be false, while qualifiers such as usually,
generally, sometimes and often are likely to be true;
(f) A similar situation occurs with the use of can in a true-false statement. If
the student knows of a single case in which something can be done, it
would be true;
(g) Ambiguous or vague statements and terms, such as largely, long time,
regularly, some and usually are best avoided in the interest of
clarity. Some terms have more than one meaning and may be interpreted
differently by individuals;
(h) True statements should be about the same length as false statements. There
is a tendency to add details in true statements to make them more precise;
(i) Word the statement so precisely that it can be judged unmistakably as true
or false;
(m) Avoid verbal clues (specific determiners) that indicate the answer;
Example:
Directions: Column A contains statements describing selected Asian cities. For
each description, find the appropriate city in Column B. Each city in Column B
can be used only once.
Column A Column B
1. Ancient capital of Thailand A. Ayutthaya
2. Largest city in Sumatera B. Ho Chi Minh City
3. Capital of Myanmar C. Karachi
4. Formerly known as Saigon D. Medan
5. Former capital of Pakistan E. Yangon
F. Hanoi
G. Surabaya
ACTIVITY 3.2
1. Select five true-false questions in your subject area and analyse
each item using the guidelines above.
3. Suggest how you would improve the weak items for each type of
question.
Definitions terms,
Achievements people,
Descriptions principles.
(b) They can also assess a students ability to apply knowledge by requiring a
test-taker to match the following:
Examples terms,
Functions parts,
Classifications structures,
Problems principles.
(d) Matching questions are generally easy to write and score when the content
tested and objectives are suitable for matching questions.
(a) Matching questions are limited to material that can be listed in two columns
and there may not be much material that lends itself to such a format;
(b) If there are four items in a matching question and the student knows the
answer for three of them, the fourth item is a giveaway through
elimination;
(a) Provide clear directions. They should explain how many times responses
can be used;
(c) Include more responses than premises or allow the responses to be used
more than once;
(e) Correct answers should not be obvious to those who do not know the
content being taught;
(f) There should not be keywords appearing in both the premise and response,
providing clues to the correct answer; and
(g) All of the responses and premises for a matching item should appear on the
same page.
SELF-CHECK 3.3
1. What are some advantages of matching questions?
You can create a matrix by listing the content areas or topics as rows, and levels
of thinking or process skills (such as recall, comprehend, apply, analyse,
synthesise or evaluate) as columns (see Table 3.2). The table of specifications
identifies the objectives and skills that are to be tested and relative weight on the
test given to each. The table of specifications can help you ensure that you are
obtaining the desired coverage of topics and levels of learning outcomes. Once
you have created your test blueprint, you can begin writing your items. When
you start to write or select items (from other sources), you should follow your
plan and the resulting sample of items will be representative of your course
content and emphasis.
Developing a test plan also helps you determine the types of items you want to
include in your test. Your choice between essay items and objective items
depends on the types of objectives you have developed and the level of thinking
you wish to test. You may want to combine several types of items in your test.
In writing test items, you must consider the length of the test or examination as
well as the reading level of your students. You do not want students to feel rushed
and frustrated because they were not able to demonstrate their knowledge of the
material in the allotted time. Some general guidelines regarding time requirements
for secondary school student test takers are shown in Table 3.3. If you are
combining multiple-choice and essay items, these estimates may help you decide
how many of each type of items to include. One mistake often made by many
educators is having too many questions for the time allowed.
Once your questions are developed, make sure that you include clear directions
to the students. For the objective items, specify that they should select one
answer for each item and indicate the point value of each question, especially if
you are weighting sections of the test differently. For essay items, indicate the
point value and suggested time to be spent on the item (we will discuss essay
questions in more detail in Topic 4). If you are teaching a large class with close
seating arrangements and are giving an objective test, you may want to consider
administering several versions of your test to decrease the opportunities for
cheating. You can create versions of your test with different arrangements of the
items.
SELF-CHECK 3.4
3. To what extent do you agree with the allotment of time for each
type of question shown in Table 3.3? Explain.
Objective tests vary depending on how the questions are presented. The three
common types of questions used in most objective tests are multiple-choice
questions, matching questions and true-false questions.
Multiple-choice questions have two parts: a stem that contains the question
and four or five options with one of them containing the correct answer. The
correct option is called the keyed response and incorrect options are called
distractors.
Multiple-choice questions are widely used because they can measure learning
outcomes from simple to complex. They are highly structured, and clear tasks
are provided to test a broad sample of what has been learned.
True-false questions can be written quickly and are easy to score. Because
they can be objectively scored, the scores are more reliable than for items that
are at least partially dependent on the teachers judgement.
In writing test items, you must consider the length of the test or examination
as well as the reading level of your students.
McKenna, C., & Bull, J. (1999). Designing effective objective test questions: an
introductory workshop. In Workshop, Computer Assisted Assessment
Centre, Loughborough University, Leicestershire, UK, June (Vol. 17).