Professional Documents
Culture Documents
Module 3 Content
Module 3 Content
Chapter 3
Designing and Developing Assessment Tools
Time Allotment: 12 hours (Week 6-9)
(Week 10- devoted to Midterm Examination)
Minor Characteristics
A. Administrability – the test should be easy to administer such that the directions
should clearly indicate how a student should respond to the test/task items and
how much time should he/she spend for each test item or for the whole test.
B. Scoreability – the test should be easy to score such that directions for scoring are
clear, point/s for each correct answer(s) is/are specified.
C. Interpretability – test scores can easily be interpreted and described in terms of
the specific tasks that a student can perform or his/her relative position in a clearly
defined group.
D. Economy – the test should be given in the cheapest way in terms of time and effort
spent for administration of the test and answer sheets must be provided so the test
can be given from time to time.
1. Unclear directions – directions that do not clearly indicate to the students how
to respond to the tasks and how to record the responses tend to reduce validity.
2. Reading vocabulary and sentence structure too difficult- vocabulary and
sentence structure that are too complicated for the students result in the
assessment of reading comprehension thus altering the meaning of assessment
result.
3. Ambiguity – ambiguous statements in assessment tasks contribute to
misinterpretations and confusion. Ambiguity sometimes confuses the better
students more that it does the poor students.
4. Inadequate time limits – time limits that do not provide students with enough
time to consider the tasks and provide thoughtful responses can reduce the validity
of interpretations of results. Rather than measuring what a student knows about a
topic or is able to do given adequate time, the assessment may become a measure of
the speed with which the student can respond. For some content (e.g. a typing test),
speed may be important. However, most assessments of achievement should
minimize the effects of speed on student performance.
5. Overemphasis of easy-to assess aspects of domain at the expense of
important, but hard-to assess aspects (construct under representation)
-it is easy to develop test questions that assess factual recall and generally harder to
develop ones that tap conceptual understanding or higher-order thinking processes
2
1. Test length – in general, a longer test is more reliable than a shorter one because
longer tests sample the instructional objectives more adequately.
2. Spread of scores – the type of students taking the test can influence reliability. A
group of students with heterogeneous ability will produce a larger spread of test
scores than a group with homogenous ability.
3. Item difficulty – in general, tests composed of items of moderate or average
difficulty (0.30 to 0.70) will have more influence on reliability than those composed
primarily of easy or very difficult items.
4. Item discrimination – in general, tests composed of more discriminating items
will have greater reliability than those composed of less discriminating items.
5. Time limits- adding a time factor may improve reliability for lower-level cognitive
test items. Since all students do not function at the same pace, a time factor adds
another criterion to the test that causes discrimination, thus improving reliability.
Teachers should not, however, arbitrarily impose a time limit. For higher-level
cognitive test items, the imposition of a time limit may defeat the intended purpose
of the items.
Test
Purposes/Uses of Tests
alternative in each item is merely called answer and the rest of the
alternatives are called distracters or decoys or foils.
ii. True-False or Alternative Response –consists of declarative
statements that one has to respond or mark true or false, right or
wrong, correct or incorrect, yes or no, fact or opinion, agree or disagree
and the like. It is a test made up of items which allow dichotomous
responses.
iii. Matching Type –consists of two parallel columns with each word,
number, or symbol in one column being matched to a word sentence,
or phrase in the other column. The items in Column I or A for which a
match is sought are called premises, and the items in Column II or B
from which the selection is made are called responses.
b. Free Response Type or Supply Test- requires the student to supply
or give the correct answer.
i. Short answer – uses a direct question that can be answered by a
word, phrase, number or symbol.
ii. Completion Test – consists of an incomplete statement that can
also be answered by a word, phrase, number, or symbol
2. Essay Type – essay questions provide freedom of response that is needed
to adequately assess students’ ability to formulate, organize, integrate and
evaluate ideas and information or apply knowledge and skills.
a. Restricted Essay - limits both the content and the response.
Content is usually restricted by the scope of the topic to be discussed.
b. Extended Essay - allows the students to select any factual
information that they think is pertinent to organize their answers in
accordance with their best judgment and to integrate an evaluate
ideas which they think appropriate.
It provides an assurance that the test questions are representative samples of the
lessons covered.
It will result to a balanced test.
Helps teachers determine the content mastery of the learners.
Steps in Making TOS
1. List down the learning outcomes, topics or competencies that you want to measure.
2. Determine the number of class sessions or the no. of hours spent per learning
outcome.
3. Decide on the number of items to be prepared.
4. Determine the number of items to be prepared per outcome.
Divide the no. of hours spent by the total number of class sessions times the
total number of items.
The result tells us of the no. of item per outcome.
5. Distribute each of the items according to the level of thinking skills being
measured.
6. Determine the type of items to be prepared.
One-way TOS
1.
2.
3.
4.
5.
Total
8
Two-way TOS
R U A A E C
1.
2.
3.
4.
5.
Total
Legend: R-remembering
U-understanding
A-applying
A-Analyzing
E-Evaluating
C-Creating
9
R U A A E
I. Introduction 3 3 5 6 1 15 MC
(measurement, assessment, evaluation,
testing)
V. Types of tests: 3 5 4 7 16 MC
( written, oral, performance,
objective, subjective, standardized, non-
standardized, norm-referenced, criterion-
referenced , power & speed; verbal & non-
verbal)
Total 13.5 70
TABLE OF SPECIFICATIONS
Midterm Examination in Educ 106A- Assessment of Learning 1
1st Semester, SY 2021-2022
Step 1: List down the learning outcomes, topics or competencies that you want to
measure.
Learning Outcomes Class No. of Item
Sessions Items Type
(in
hours)
TOTAL
10
Step 2: Determine the number of class sessions or the no. of hours spent per
learning outcome.
Learning Outcomes Class No. of Item
Sessions Items Type
(in hours)
TOTAL 20
TOTAL 20 70
11
TOTAL 20 70
Step 5: Distribute each of the items according to the level of thinking skills being
measured. ( for two-way TOS)
Step 6: Determine the type of items to be prepared.
TOTAL 20 70
12
A. RECALL TYPES
1. Completion type/Supply type of test
a. Only important words or phrases should be omitted to avoid confusion.
Ask question on more significant item not on trivial matter.
EX. Jose Rizal was born on June ___, 1861.
b. Blanks should be of equal lengths. The length of the blanks must not suggest
the answer. So better to make the blanks uniform in size.
c. The blank should be at the end or near the end of the sentence. The question
must first be asked before an answer is expected.
d. Articles a, an, and the should not be provided before the omitted word or
phrase to avoid clues for answers.
e. Do not take statements directly from textbooks
f. If the item is to be expressed in numerical units, indicate the type of answer
wanted.
g. When the completion items are to be used, do not include too many blanks.
Ex.
The ____produced by the ______ is used by the green _____ to
change the ____ and ____ into _____. This process is called ____.
h. Avoid open-ended item. There should be only one acceptable answer. This
item is open-ended hence, not good test item.
Panuto: Tukuyon ang mga sumusunod. Isulat ang tamang sagot sa patlang. (5 puntos)
2. Enumeration type
a. The exact number of expected answers should be stated.
b. Score is the number of correct answers.
Subject: Science
Directions: Enumerate the following. Write your answer in the space provided. (1 point
each)
1-3. Main parts of a plant
4-7. Uses of plants
8-10. Ways of taking care of plants
Subject: TLE
Directions: Read the following statements carefully and identify what farm tools,
implements and equipment are being described. Write your answer in the
blank provided before the number. ( 1 pt. each)
_______1. It is a tool used for digging canals, breaking hard topsoil and digging up
stones and tree stumps.
_______2. It is an implement mounted to a tractor use for tilling and pulverizing the soil.
_______3. It is an equipment used to pull disc plow and disc harrow in preparing much
bigger area of metal.
14
_______4. An implement made of metal mounted to a tractor which is used for tilling
and pulverizing the soil.
_______5. A tool used for cutting branches of planting materials and unnecessary
branches of plants.
B. RECOGNITION TYPES
Statements that use the word “always are almost always false. A test-
wise student can easily guess his way through a test like these and get high
scores even if he does not know anything about the test.
i. Avoid multiple facts or including two ideas in one statement, unless cause-
effect relationship is being measured.
j. If opinion is used, attribute it to some source unless the ability to identify
opinion is being specifically measured.
Ex:
Ang kabataan ang pag-asa ng bayan. (It might be true or false)
Ayon kay Dr.Jose Rizal, ang kabataan ang pag-asa ng bayan. ( This is really
true)
k. True statements and false statements should be approximately equal in
length.
l. Do not give a hint in the body of the question.
Example: The Philippines gained its independence in 1898 and therefore
celebrated its centennial year in 2000.
Directions: Write the word True if the statement is correct and False if otherwise. Write
your answer in the space provided before each number. (1 pt. each)
_______1. Genetics is a branch of Biology that deals with the study of heredity and
variation.
16
_______2. The law of segregation states that different genes are not affected by each
other or separate independently from each other during gamete formation.
_______3. Sex chromosomes determine the sex of an individual.
_______4. A Punnett square is used to predict the results of genetic crosses.
_______5. Gregor Mendel is the father of Biology.
Subject: Science 9
Topic: Light Gives Life
Directions: Write the word True if the statement is correct and it is false, underline the
word/s that make/s the statement incorrect, then write the correct answer in
the blank provided to make the statement correct. (1 point each)
_______1. Photosynthesis is a multistep process whereby light energy is trapped by
chlorophyll in plants and converted into chemical energy.
_______2. Organism use cellular respiration to break down glucose and harvest energy.
_______3. A chloroplast has two membranes surrounding the liquid in its interior called
the granum.
_______4.Oxygen and water are produced during the process of cell respiration.
_______5. Plants are called autotrophs because they are self-feeders.
2. Multiple-response type
a. There should be three to five choices. The number of choices used in the first
item should be the same number of choices in all the items of this type of
test.
b. The choices should be numbered or lettered so that only the number or letter
can be encircled or written on the blank provided.
c. If the choices are figures, they should be arranged in ascending order.
Ex: How many factors does 86 have?
a. 3 b. 4 c. 5 d. 6
d. Avoid the use of “a” or “an” as the last word prior to the listing of the
responses.
e. The correct answer should appear approximately equal number of times but
in random order.
Ex:
1. b 6. c 11. d
2. a 7. c 12. a
3. a 8. b 13. b
4. c 9. d 14. c
5. d 10.b 15. d
f. The choices should be related in some way or should belong to the same
class.
17
g. Use a negatively stated stem only when significant learning outcomes require
it and stress/highlight the negative words for emphasis.
Ex:
The following are properties of solid except
a.
b.
c.
d.
h. An item should only contain one correct or clearly best answer.
i. Better still use “none of the above” and “all of the above” sparingly. But best
not to use them at all.
j. Use the “None of the above “option only when the keyed answer is totally
correct. When choice of the “best” response is intended, “none of the above”
is not appropriate, since the implication has already been made that the
correct response may be partially inaccurate.
k. Note that use of “all of the above” may allow credit for partial knowledge.
In a multiple option item, (allowing only one option choice) if a student only
knew that two (2) options were correct, he could then deduce the
correctness of “all of the above”. This assumes you are allowed only one
correct choice.
l. Do not use unfamiliar words, terms, and phrases. The ability of the
item to discriminate or its level of difficulty should stem from the subject
matter rather than from the wording of the question.
Example: What would be the system reliability of a computer system
whose slave and peripherals are connected in parallel
circuits and each one has a known time to failure
probability of 0.05?
m. Do not use modifiers that are vague and whose meanings can differ from
one person to the next such as: much, often, usually. etc.
Example:
Much of the process of photosynthesis takes place in the:
a. bark
b. leaf
c. stem
n. Do not use negatives or double negatives as such statements tend to be
confusing. It is best to use simpler sentences rather than sentences that
would require expertise in grammatical construction.
Example:
(Poor) Which of the following will not cause inflation in the Philippine
economy?
(Better) Which of the following will cause inflation in the Philippine
economy?
Poor: What does the statement “Development patterns acquired
during the formative years are NOT Unchangeable” imply?
Better: What does the statement “Development patterns acquired
during the formative years are changeable” imply?
o. Each item should be s short as possible; otherwise you risk testing more
for reading and comprehension skills.
18
1. Who will most strongly disagree with the progressivist who claims that
the child should be taught only that which interests him and if he is not
interested, wait till the child gets interested?
A. Essentialist C. Progressivist
B. Empiricist D. Rationalist
2. Which group will most strongly focus its teaching on the interest of the
child?
A. Progressivist C. Perrenialist
B. Essentialist D. Reconstructionist
s. Avoid use of unnecessary words or phrases, which are not relevant to the
problem at hand (unless such discrimination ability is the primary intent of
the evaluation). The item’s value is particularly damaged if the unnecessary
material is designed to distract or mislead. Such items test the student’s
reading comprehension rather than knowledge of the subject matter.
Example:
The side opposite the thirty degree angle in a right triangle is equal to half
the length of the hypotenuse. If the sine of a 30-degree is 0.5 and its hypotenuse
is 5, what is the length of the side opposite the 30-degree angle?
a. 2.5
b. 3.5
c. 5.5
d. 1.5
t. Pack the question in the stem. Here is an example of a question which has
no question. Avoid it by all means.
Example:
The Roman Empire _______.
a. had no central government.
b. had no definite territory
19
c. had no heroes
d. had no common religion
u. Always have the stem and alternatives on the same page.
v. Score is the number of correct answers.
Directions: Choose the best answer. Write the letter of your choice in the space provided
before each number. (1 point each)
_____1. In a positively skewed distribution, the following statements are true except
a. 5 b. 9 c. 10 d. 11
______5. Bert obtained a 97 percentile rank in an aptitude test. This means
3. Matching type
a. There should be two columns. Under “A” are the stimuli which should be
longer and more descriptive than the responses under column “B”. The
response may be a word, a phrase, a number or a formula.
b. The stimuli under column “A” should be numbered and the responses under
column “B” should be lettered. Answers will be indicated by letters only on
lines provided in column “A”.
c. Matching sets should neither be too long nor too short.
d. All items should be on the same page to avoid turning of pages in the process
of matching pairs.
20
Ex.: The test items are all about the Filipino heroes, nothing more
A B
___1. First President of the Republic a. Magellan
___2. National Hero b. Mabini
___3. Discovered the Philippines c. Rizal
___4. Brain of Katipunan d. Lapu-Lapu
___5. The great painter e. Aguinaldo
___6. Defended Limasawa island f. Juan Luna
g. Antonio Luna
f. Include an unequal number of responses and premises and instruct the pupil
that responses may be used once, more than once, or not at all. This is to
avoid guessing.
g. Arrange the list of responses in logical order.
h. Limit a matching exercise to not more than 10 to 15 items.
i. Like any other test, the direction of the test must be given. The examinees
must know exactly what to do.
j. Score is the number of correct answers.
a. Restrict the use of essay questions to those learning outcomes that cannot be
satisfactorily measured by objective items.
b. Construct questions that will call forth the skills specified in the learning
standards.
c. Avoid the use of optional questions
d. Indicate the approximate time limit or the number of points for each
question.
e. Prepare an outline of the expected answer in advance or scoring rubric.
Discuss the different measures of reliability. Justify the use of each measure in the
context of measuring reliability. ( 5 points)
ITEM ANALYSIS
Item analysis is a statistical technique which is used for selecting and rejecting the
items of the test on the basis of their difficulty value and discriminated power.
Note: Items with difficulty index within 0.26 to 0.75 and with discrimination index
from 0.20 and above are to be retained. Items with difficulty index within 0.25 to
0.75 but with discrimination index of 0.19 and below or with discrimination index
of 0.20 and above but with difficulty index not within 0.26 to 0.75 should be
revised. Items with difficulty index not within 0.26 to 0.75 and with
discrimination index of 0.19 and below should be rejected/discarded.
Illustrative Example:
The teacher gave a summative examination in Science consisting of 40 items
among 48 students. Analyze each item of the test to determine the difficulty and
discrimination indices of each item, and decide whether a given item is to be retained,
revised or discarded/rejected.
4. Count the number of right answer in upper group and count the number of right
answer in lower group and compute for the proportion of each group.
- determine the proportion of the students in the upper group and the lower
group by getting the number of students who got the correct answer per
item, then divide it by the total number of students in each group.
Say, there are 13 students in the upper group and 13 students in the lower
group. There are 10 students who got the correct answer in the upper group
and 5 students got the correct answer in the lower group.
Proportion of the upper group: 10/13 = 0.77
Proportion of the lower group: 5/13 = 0.38
5. Solve for the difficulty and discrimination indices.
40 UG =13
UG =13
Note: Item No. 1 is a good item because it is moderately difficult and a discriminating
item. Retain this item.
Item No. 2 is an easy item because most of the students in the upper group and
lower group got the correct answer. It does not discriminate the lower group and the
upper group. Therefore, it should be discarded. Construct another item to replace this
item.
Item No. 3 is an difficult item because almost all of the students in both groups
did not get the correct answer. It does not discriminate the lower group and the
upper group. Therefore, it should be discarded/rejected. Construct another item to
replace this item.
Item number 4 needs to be revised. Although the item is moderately difficult but
it is not discriminating. You can restate or improve the question.
Item number 5 needs to be revised although the item is a discriminating item but
it is an easy item. You can restate or improve the question.
References
Navarro, Rosita L., Santos, Rosita G. and Corpuz, Brenda B. 2017. Assessment of Learning 1.
LORIMAR Publishing Inc.
Professional Education(A Reviewer for the Licensure Examinations for Teachers). Philippine
Normal University. Manila.
Disclaimer
This module is prepared for instructional purposes only based on our course syllabus. The teacher
who prepared this does not claim ownership of this module but patterned the ideas from different
authors.