Professional Documents
Culture Documents
Assessment of Learning Handout
Assessment of Learning Handout
Assessment of Learning Handout
1. Assessment FOR Learning – this Includes three types PRINCIPLE 1: Clarity and Appropriateness of Learning
of assessment done before and during instruction. Targets
These are placement, formative and diagnostic,
- Learning targets should be clearly stated, specific,
a) Placement
and center on what is truly important.
- Done prior to instruction
- Its purpose is to assess the needs of the Learning Targets
learners to have basis in planning for a (Me Milan, 2007; Stiggins, 2007)
relevant instruction. Student mastery of
- Teachers use this assessment to know what Knowledge substantive subject
their students are bringing into the learning matter
situation and use this as a starting point, for Student ability to use
instruction. Reasoning knowledge to reason and
- The results of this assessment place students solve problems
in specific learning groups to facilitate Student ability to
Skills demonstrate
teaching and learning.
achievement-related skills
b) Formative
Student ability to create
- done during instruction Products achievement-related
- This assessment is where teachers products
continuously monitor the students' level of Student attainment of
attainment of the learning objectives. affective states such as
- The results of this assessment are Affective/Disposition attitudes,
communicated clearly and promptly to the values, interests and self-
students for them to know their strengths efficacy.
and weaknesses and the progress of their
learning.
PRINCIPLE 2: Appropriate Methods Assessment
c) Diagnostic
Methods
- done during instruction
- This is used to determine students’ recurring - Learning targets are measured by appropriate
or persistent difficulties. assessment methods.
ASSESSMENT METHOD
TARGETS Performance Oral
Objective Essay Observation Self-Report
Based Questioning
Knowledge 5 4 3 4 3 2
Reasoning 2 5 4 4 2 2
Skills 1 3 5 2 5 3
Products 1 1 5 4 4 4
Affective 1 2 4 4 4 5
MODES OF ASSESSMENT
MODE DESCRIPTION EXAMPLES ADVANTAGES DISADVANTAGES
- Scoring is
objective - Preparation of
- Standardized
The objective paper-and-pen - Administration instrument is
Tests
Traditional test which usually assesses is easy because time consuming
- Teacher-made
low-level thinking skills. students can - Prone to
Tests
take the test at cheating
the same time
- Preparation of
the instrument - Scoring tends to
A mode of assessment that
- Practical Test is relatively be subjective
requires actual
- Oral and Aural easy. without rubrics
Performance demonstration of skills or
Tests - Measures - Administration
creation of products of
- Projects behaviors that is time
learning
cannot be consuming
deceived.
A process of gathering
- Working - Measures - Development is
multiple indicators of
Portfolios student’s time consuming
student progress to support
Portfolio - Show Portfolios growth and - Rating tends to
course goals in dynamic,
- Documentary development be subjective
ongoing and collaborative
Portfolios - Intelligence-fair without rubrics
process.
PRINCIPLE 3: Balance
- A balanced assessment sets targets in all domains of learning (cognitive, affective, and psychomotor) or domains of
Intelligence (verbal-linguistic, logical-mathematical, bodily-kinesthetic, visual-spatial, musical-rhythmic,
Intrapersonal-social, intrapersonal-introspection, physical world-natural, existential-spiritual).
- A balanced assessment makes use of both traditional and alternative assessment.
Reliability – it refers to the consistency of scores obtained by the same person when retested using the same or equivalent
Instrument.
TYPES OF RELIABILITY
METHOD PROCEDURE STATISTICAL MEASURE
MEAUSURE
Give a test twice to the
same group with any time
Test-Retest Measure of Stability interval between tests from Pearson r
several minutes to several
years.
Give parallel form of test
Equivalent Form Measure of Equivalence with close time interval Pearson r
between forms.
Give a test once. Score is
Measure of Internal equivalent halves of the Pearson r and Spearman
Split-Half Form
Consistency test (e.g.) odd-and-even Brown Formula
numbered items.
Give the test once then
correlate the
Measure of Internal Kuder-Richardson Formula
Kuder-Richardson proportion/percentage of
Consistency 20 and 21
the students passing and
not passing a given item.
Types of Rubrics
TYPE DESCRIPTION ADVANTAGES DISADVANTAGES
- It does not clearly
- It allows fast
describe the degree of
- it describes the overall assessment
the criterion satisfied
quality of a - It provides one score
or not by the
performance or to describe the overall
performance or
product performance or
Holistic Rubric product.
- In this rubric, there is quality of work.
- It does not permit
only one rating given - It can indicate the
differential weighting
to the entire work or general strengths and
of the qualities of a
performance weaknesses of the
product or a
work performance.
performance.
- It clearly describes the
- It describes the quality degree of the criterion
of a performance or satisfied or not by the
product in terms of performance or
the identifies product.
- It is more time
dimensions and/or - It permits differential
consuming to use
Analytic Rubric criteria for which are weighting of the
- It is more difficult to
rated independently qualities of a product
construct.
to give a better or performance.
picture of the quality - It helps rater
of work or pinpoints specific
performance. areas of strengths and
weaknesses.
- It is more complex to
- It combines the key - It allows assessment
construct, and many
Ana-Holistic Rubric features of holistic of multiple task using
requires more sheets
and analytic rubric. appropriate format.
and time for scoring.
ITEM ANALYSIS
1. DIFFICULTY INDEX
- refers to the process of examining the student’s
- The number of people who answer a test item
response to each item in the test
correctly.
- There are two characteristics of an item: Desirable The number of students who answer the item correctly
and Undesirable. DI =
𝑇𝑜𝑡𝑎𝑙 𝑛𝑢𝑚𝑏𝑒𝑟 𝑜𝑓 𝑠𝑡𝑢𝑑𝑒𝑛𝑡𝑠 𝑤ℎ𝑜 𝑎𝑛𝑠𝑤𝑒𝑟 𝑡ℎ𝑒 𝑝𝑎𝑟𝑡𝑖𝑐𝑢𝑙𝑎𝑟 𝑖𝑡𝑒𝑚
Desirable – retained for subsequent use
Undesirable – revise of reject Index Range Difficulty Level
0.0 – 0.20 Very Difficult
Criteria in Determining the Desirability and
0.21 – 0.40 Difficult
Undesirability of an Item
0.41 – 0.60 Moderately Difficult
✓ Difficulty Index 0.61 – 0.80 Easy
✓ Discrimination Index 0.81 – 1.00 Very Easy
PROPERTIES OF MEAN
2. DISCRIMINATION INDEX
- It is the difference between the proportion of high - Easy to compute
performing students (UG) who got the item right - It may be an actual observation in the data set
and the proportion of low performing students (LG) - It can be subjected to numerous mathematical
who got an item right. computation
- Is the degree to which the item discriminates - Most widely used
between high performing and low performing - It is easily affected by the extreme values
group. - Applied to interval level data
3(𝑀𝑒𝑎𝑛 − 𝑀𝑒𝑑𝑖𝑎𝑛)
𝑆𝐾 =
𝑆𝐷
Types of Test According to FORMAT 5. there is more PRESSURE FOR SPEEDY
REPORTING OF SCORES than for speedy test
1. Selective Type – provides choices for the answer
preparation.
a. Multiple Choice-consists of a stem which
describes the problem and 3 or more MULTIPLE CHOICE TEST
alternatives which give the suggested
solution. The incorrect alternatives are Multiple Choice Items – it consists of:
the distractors.
b. True False or Alternative Response – 1. Stem – which identifies the question or
consists of declarative statement that problem.
2. Response alternative or OPTIONS
one has to mark true or false, right or
3. Correct Answer
wrong, correct or incorrect, yes or no,
fact or opinion, and the like. Advantages of Using Multiple Choice Items
c. Matching Type – consists of two parallel
columns: Column A, the column of Multiple choice items can provide:
premises from which a match is sought,
Column B, the column of responses from 1. Versatility in measuring all levels of cognitive
which the selection is made. ability
2. Supply Test 2. Highly reliable test scores
a. Short Answer – uses a direct question 3. Scoring efficiency and accuracy
that can be answered by a word, phrase, 4. Objective measurement of student
a number or a symbol. achievement or ability
5. A wide sampling of content or objectives
b. Completion Test – it consists of an
6. A reduced guessing factor when compared to
incomplete statement.
true-false items
3. Essay Test
7. Different response alternatives which can
a. Restricted Response – limits the content provide diagnostic feedback
of the response by restricting the scope
of the topic Limitations of Multiple-Choice Items
b. Extended Response – allows the
students to select any factual 1. Difficult and time consuming to construct
information that they think is pertinent burning.
to organize their answer with their best 2. Lead a teacher to favor simple recall of facts
judgement. 3. Place a high degree of dependence on student's
reading ability and teacher's writing ability.
Guidelines for Constructing Test Items
Suggestion for Writing Multiple Choice Test Items
Essay are appropriate when:
1. the group to be tested is SMALL and the test is 1. When possible, state the stem as a direct
NOT BE USED again; question rather than as an incomplete
2. you wish to encourage and reward the statement.
2. Present a definite, explicit singular question or
development of student’s SKILL IN WRITING;
problem in the stem.
3. you are more interested in exploring the
3. Eliminate excessive verbiage or irrelevant
ATTITUDES than in measuring his/her academic information from the stem.
achievement; 4. Include the stem any word(s) that might
4. you are more confident of your ability as a critical otherwise be repeated in each alternative.
and fair reader than as an imaginative writer of 5. Use negative stated question sparingly. When
good objective test items. used, underline and/or capitalize the negative
word.
Objective test items are appropriate when: 6. Make all alternatives plausible and attractive to
1. the group to be tested is LARGE and the test may the less knowledgeable or skillful student.
be REUSED; 7. Make the alternative grammatically parallel
with each other and consistent with the stem.
2. HIGHLY RELIABLE TEST SCORE must be obtained
8. Make the alternative mutually exclusive.
as efficiently as possible;
9. When possible, present alternatives in some
3. IMPARTIALITY of evaluation, ABSOLUTE logical order (chronological, most to least,
AIRNESS, and FREEDOM from possible test alphabetical).
SCORING INFLUENCES – fatigue, lack of 10. Be sure there is only one correct answer or best
anonymity is essential; response to the item.
4. You are more confident of your ability to express 11. Make alternative approximately equal in length.
objective test items clearly than your ability to
judge essay test answer correctly;
12. Avoid irrelevant clues, such as grammatical 3. Express single idea in each test item.
structure, well-known verbal associations or 4. Include enough background information and
connections between stem and answer. qualification so that the ability to respond
13. Use at least four alternatives for each item to correctly to the item do not depend on some
lower the probability of getting the item special, uncommon knowledge.
correctly by guessing. 5. Avoid lifting statements directly from the text,
14. Randomly distribute the correct response lecture or other materials so that memory alone
among the alternative position throughout the will not permit a correct answer.
test having approximately the same proportion 6. Avoid using negatively stated item statements.
of the alternatives a, b, c, d, and e as correct 7. Avoid the use of unfamiliar vocabulary.
answer. 8. Avoid the use of specific determiners which
15. Use the NONE OF THE ABOVE and ALL OF THE should permit a test wise but unprepared
ABOVE sparingly. When used, such alternatives examinee to respond correctly. Specific
should be occasionally be used as the correct determiners refer to sweeping terms like always,
response. all, none, never, impossible, inevitable.
Statements including such terms are likely to be
TRUE-FALSE TEST ITEMS false. On the other hand, statements using
qualifying determiners such as usually,
True-false test items are typically used to measure the sometimes, often, are likely to be true. On the
ability to identify whether or not the statements of facts other hand, statements using qualifying
are correct. The basic format is simply a declarative determiners such as usually, sometimes, often
statement that the student must judge as true or false. are likely to be true. When statements require
No modification of this basic form in which the student specific determiners, make sure they appear in
must respond "yes" or "no", "agree" or "disagree." both true and false items.
9. False items tend to discriminate more highly
Three Forms: than true items. Therefore, use more false items
than true items (but no more than 15%
1. Simple - consists of only two choices additional false items).
2. Complex - consists of more than two choices
3. Compound - two choices plus a conditional MATCHING TEST ITEMS
completion response
In general matching items consists of a column
Advantages of True-False Items of stimuli presented on the left side of the exam page and
a column of responses placed on the right side of the
1. The widest sampling of content or objectives per page Students are required to match the response
unit of testing time; associated with a given stimulus.
2. Scoring efficiency and accuracy;
3. Versatility in measuring all levels of cognitive Advantages of Using Matching Test items
ability;
4. Highly reliable test scores; and 1. Require short period of reading and response
5. An objective measurement of student time, allowing the teacher to cover more
achievement or ability. content.
2. Provide objective measurement of student
Limitations of True-False Items achievement or ability.
3. Provide highly reliable test scores.
1. Incorporate an extremely high guessing factor. 4. Provide scoring efficiency and accuracy
2. It can often lead the teacher to write ambiguous
statements due to the difficulty of writing Disadvantages of Using Matching Test items
statements which are unequivocally true or
false. 1. Have difficulty measuring learning objectives
3. Do not discriminate between students varying requiring more than simple recall or information
ability as well as other item types. – LOTS
4. It can often include more irrelevant clues than do 2. Are difficult to construct due to the problem of
other item types. selecting a common set of stimuli and responses.
5. It can often lead a teacher to favor testing of
trivial knowledge. Suggestions for Writing Matching Test Items
Suggestions for Writing True-False Items 1. Include directions which clearly state the basis
for matching the stimuli with the responses
1. Base true-false items upon statements that are Explain whether or not the response can be used
absolutely true or false, without qualification or more than once and indicate where to write the
exceptions. answer.
2. Express the item statement as simply as clearly 2. Use only homogenous materials in matching
as possible. test.
3. Arrange the list of responses in some systematic Classifications of Essay Test:
order if possible – chronological, alphabetical.
4. Avoid grammatical or other clues to the correct 1. Extended-response essay item
response. 2. Limited Response or Short-answer essay Mem