Professional Documents
Culture Documents
REVIEWER
REVIEWER
REVIEWER
Assessment - refers to the wide variety of methods or tools that educators use to evaluate, measure,
and document the academic readiness, learning progress, skill acquisition, or educational needs of
students.
3 TYPES OF ASSESSMENT
Assessment of Learning
Assessment for Learning
Assessment as Learning
Assessment for Learning (Formative Assessment) - allows teachers to monitor students on a day-to-day
basis and modify their teaching based on what the students need to be successful.
Assessment as Learning - It develops and supports students' metacognitive skills / Students develop a
sense of ownership and efficacy when they use teacher, peer and self-assessment.
Educational Measurement
Measurement can be defined as the process of assigning numbers to events based on an established set
of rules.
Edward Lee Thorndike - born August 31, 1874, died August 9, 1949. The laws of learning that he studied
we will be able to distinguish the real setting in the learning process.
Law of Primacy
Law of Intensity
Law of recency
Law of Readiness- states that learning can only take place when a student is READY TO LEARN. When
students feel ready, they learn more effectively and with greater satisfaction than when not ready.
Law of Exercise- The more a person practices something, the better he or she is able to RETAIN THAT
KNOWLEDGE. Remember back in grade school when the teacher would have you write spelling words
three times each and then use them in a sentence? Your teacher was practicing this law.
Law of Primacy- Think back to the time when IT WAS PROVEN that the Earth was round and not flat.
This is because once a PERSON LEARNS SOMETHING, IT IS ALMOST IMPOSSIBLE to tell them that it is
actually different.
Law of Recency- This gives your students THE CHANCE TO RETURN TO EARLIER MATERIAL that may have
gotten pushed aside BY INFORMATION NEAR the end of the unit.
Law of Intensity- The more excitement your eLearning course creates, the more likely it will be
remembered. Creating a hands-on experience, or one that CAUSES THE STUDENT TO FEEL STRONG
EMOTION WILL MAKE THE LESSON MORE EASILY REMEMBERED.
FUNCTION OF MEASUREMENT
The functions of measurement and evaluation are as follows:
RELIABILITY The meaning of reliability is accuracy and consistency. Also, it refers to the extent to which
a test is consistent, stable and dependent. In other words, the test approves what it represents. In
addition, the result in it is consistent and taking a test, again and again, will not change the result.
Furthermore, it gives the same result every time.
USABILITY It refers to the extent to which the test can be used without much expenditure of money
effort and time. Most noteworthy, it means practicability. Besides, there are certain factors that
determine the usability that we have mentioned below.
ADMINISTRABILITY It means that a test can be administered with clarity, ease, and uniformity. Also, the
direction is simple, concise, and clear. Besides, it specifies a time limit, sample questions, and oral
instructions. The provision for the test material is definite.
SCOREABILITY It concerns the score of the test. Also, a good test is easy to score. Scoring direction,
scoring key is simple, and an answer is available. Most noteworthy the test score is useful for evaluation
of students.
Evaluation of Learning Programs
KIRKPATRICK’S FOUR LEVELS OF EVALUATION MODE
LEVEL 1 – REACTION Reaction evaluation is the measurement of how the delegates felt about the
training or learning experience.
Was it a good use of their time? This level can be measured via feedback forms, surveys,
questionnaires, verbal reaction etc.
LEVEL 2 – LEARNING gaLearning evaluation is the measurement of the increase in knowledge, skills and
abilities of the learners.
Did they experience what they were supposed to experience? This level can be measured
through tests and assessments before and after training. Interviews or observation can also be
used.
LEVEL 3 – BEHAVIOUR Behavior evaluation is the measurement of the capability of the learners to
change their behavior and apply new skills and knowledge.
Did their performance noticeably change? This level can be measured through testing,
observations and interviews over time
LEVEL 4 – RESULTS Result evaluation is the measurement of the impact of the training on the business.
Has the training impacted KPIs? (Staff turnover, number of complaints, noncompliance, sales,
wastage, growth, retention…)
CHAPTER 2
12 PRINCIPLES OF HIGH QUALITY ASSESSMENT
1. Clarity & Appropriateness of Learning Targets
Assessment should be clearly stated and specified and centered on what is truly important.
LEARNING TARGETS
Knowledge - Students’ mastery of the content
Assessment should utilize assessment methods suitable for a particular learning target.
Essay – Reasoning
Self-report – Affects
3. Balance
Assessment methods should be able to assess all domains of learning and hierarchy of objectives.
DOMAINS OF LEARNING
COGNITIVE
AFFECTIVE
PSYCHOMOTOR
4. Validity
Assessment should be valid. There are several types of validity that are to be established.
TYPES OF VALIDITY
CONTENT VALIDITY
FACE VALIDITY
CONCURRENT VALIDITY
PREDICTIVE VALIDITY
DISCRIMINANT VALIDITY
CONSTRUCT VALIDITY
5. Reliability
Assessment should show consistent and stable results. There are methods which can be used to
measure and establish reliability
6. Fairness
Assessment should give equal opportunities for every student. There should be no discrimination of any
kind (racial, age, gender, etc.)
7. Authenticity
Assessment should touch real life situations and should emphasize practicability.
Assessment should not be used to derogate the students. One example of this is the right to
confidentiality.
Assessment's results should be communicated to the learners and the people involved. Communication
should also be established between the teacher and the learners by way of pre- and post-test reviews.
Assessment should have a positive effect. It should motivate students to learn and do more and should
give way to improve the teacher's instruction.
Written-Response Instruments
This includes objective tests (multiple-choice, true or false, matching type or short answer test), essays,
examinations, and checklists.
Examples:
Objective test – appropriate for the various levels of the hierarchy of educational objectives.
Essay – when properly planned, can test the students’ grasp of high-level cognitive skills particularly in
areas of application, analysis, synthesis, and evaluation.
Product-Rating Scale
These scales measure products that are frequently rated in education such as book reports, maps,
charts, diagram, notebook, essay and creative endeavor of all sorts.
Example:
Classic “Handwriting” Scale–is used in the California Achievement Test, Form W. Prototype handwriting
specimens of pupils are moved along the scale until the quality of handwriting sample is most similar to
the prototype handwriting.
Performance Test
One of these is the performance checklist which consists of the list of behaviors that makes up a certain
type of performance. It is used to determine whether or not an individual behaves in a certain way when
asked to complete a particular task.
Oral Questioning
These are useful supplementary assessment methods when used in conjunction with oral questioning
and oral test.
1. This principle states that assessment should be clearly said and precise on what
truly is important
- CLARITY AND APPROPRIATES OF LEARNING TARGETS
2. Mrs. Jean assigned her TLE students to take care a 45 days chken for their final
project. Which principle does she represents?
- AUTHENTICITY
3. This principle states that assessment methods should be able to assess all
domains of learning
- BALANCE
4. Mr. Jaunito asked his students to prepare a one half piece of paper. As he call
them one by one he writes their grades on the paper. Which principle did he
represent?
- FAIRNESS
5. This principle states that there should motivated the leaners to lean and to do
more.
- POSITIVITY AND CONSEQUENCE
6. Reliability can be measured by giving the same examination within the day
- TEST- RETAKE OR RETEST METHOD
7. This is one of the assessment forms that is done before instruction to assess the
needs of the leaners to determine their capacities and capabilities
- DIAGNOSTIC ASSESSMENT
8. The includes objective tests,essays, examinations and checklist
- WRITTEN-RESPONSE INSTRUMENTS
9. These are useful supplementary assessment methods when used in conjuction
with oral questioning and oral test.
- OBSERVATION AND SELF-REPORTS
10. A generic skill set that is separate from and secondary to the learning of
- COMPLEMENTAY
CHAPTER 3:
Types of Objective Tests
(a) true-false items,
(b) multiple-choice type items,
(c) matching items,
(d) enumeration and filling of blanks and
(e) essays.
An objective test, if it is to be comprehensive, must cover the various levels of Bloom’s taxonomy. Each
objective consists of a statement of what is to be achieved and, preferably, by how many percent of the
students.
Knowledge. The students must be able to identify the subject and the verb in a given sentence.
Comprehension. The students must be able to determine the appropriate form of a verb to be
used given the subject of a sentence.
Application. The students must be able to write sentences observing rules on subject-verb
agreement.
Analysis. The students must be able to break down a given sentence into its subject and
predicate.
Synthesis/Evaluation. The students must be able to formulate rules to be followed regarding
the subject-verb agreement.
The test objectives guide the kind of objective tests that will be designed and constructed by the
teacher.
A table of specifications or TOS is a test map that guides the teacher in constructing a test. The TOS
ensures that there is balance between items that test lower level thinking skills and those which test
higher order thinking skills.
Constructing the test items
it is advised that the actual number of items to be constructed in the draft should be double the desired
number of items.
The test draft is tried out to a group of pupils or students. The purpose of this try out is to determine
the: (a.) item characteristics through item analysis, and (b) characteristics of the test itself-validity,
reliability, and practicality.
Rule 2: Avoid using the words “always”, “never” “often” and other adverbs that tend to be either always
true or always false.
Rule 3: Avoid long sentences as these tend to be “true”. Keep sentences short.
Rule 4. Avoid trick statements with some minor misleading word or spelling anomaly, misplaced
phrases, etc. A wise student who does not know the subject matter may detect this strategy and thus
get the answer correctly
Rule 5: Avoid quoting verbatim from reference materials or textbooks. This practice sends the wrong
signal to the students that it is necessary to memorize the textbook word for word and thus, acquisition
of higher level thinking skills is not given due importance.
Rule 6. Avoid specific determiners or give-away qualifiers. Students quickly learn that strongly worded
statements are more likely to be false than true, for example, statements with “never” “no” “all” or
“always.” Moderately worded statements are more likely to be true than false. Statements with “many”
“often” “sometimes” “generally” ‘frequently” or “some” should be avoided.
Rule 7. With true or false questions, avoid a grossly disproportionate number of either true or false
statements or even patterns in the occurrence of true and false statements.
Rule 2: Do not use modifiers that are vague and whose meanings can differ from one person to the next
such as: much, often, usually, etc.
Rule 3: Avoid complex or awkward word arrangements. Also, avoid use of negatives in the stem as this
may add unnecessary comprehension difficulties.
Rule 4: Do not use negatives or double negatives as such statements tend to be confusing. It is best to
use simpler sentences rather than sentences that would require expertise in grammatical construction.
Rule 5: Each item stem should be as short as possible; otherwise you risk testing more for reading and
comprehension skills. Rule 6: Distracters should be equally plausible and attractive.
Rule 7: All multiple choice options should be grammatically consistent with the stem.
Rule 8: The length, explicitness, or degree of technicality of alternatives should not be the determinants
of the correctness of the answer. The following is an example of this rule
Rule 10: Avoid alternatives that are synonymous with others or those that, include or overlap others.
Rule 11: Avoid presenting sequenced items in the same order as in the text.
Rule 12: Avoid use of assumed qualifiers that many examinees may not be aware of.
Rule 13: Avoid use of unnecessary words or phrases, which are not relevant to the problem at hand
(unless such discriminating ability is the primary intent of the evaluation). The items value is particularly
damaged if the unnecessary material is designed to distract or mislead. Such items test the student’s
reading comprehension rather than knowledge of the subject matter.
Rule 14: Avoid use of non-relevant sources of difficulty such as requiring a complex calculation when
only knowledge of a principle is being tested.
Rule 16: Include as much of the item as possible in the stem. This allows for less repetition and shorter
choice options.
Rule 17: Use the “None of the above” option only when the keyed answer is totally correct. When
choice of the “best” response is intended, “none of the above” is not appropriate, since the implication
has already been made that the correct response may be partially inaccurate.
Rule 18: Note that the use of “all of the above” may allow credit for partial knowledge. In a multiple
option item, (allowing only one option choice) if a student only knew that two (2) options were correct,
he could then deduce the correctness of “all of the above”. This assumes you are allowed only one
correct choice.
Rule 19: Having compound response choices may purposefully increase difficulty of an item.
Rule 20: The difficulty of a multiple choice item may be controlled by varying the homogeneity or degree
of similarity of responses. The more homogeneous, the more difficult the item
Essay questions can be used to measure the attainment of a variety of objectives. Stecklein (1955) has
listed 14 types of abilities that can be measured by essay items:
Note that all these involve the higher-level skills mentioned in Bloom’s Taxonomy.
Rule 1: Phrase the direction in such a way that students are guided on the key concepts to be included.
Rule 2: Inform the students on the criteria to be used for grading their essays. This rule allows the
students to focus on relevant and substantive materials rather than on peripheral and unnecessary facts
and bits of information.
Rule 4: Decide on your essay grading system prior to getting the essays of your students.
Rule 5: Evaluate all of the students’ answers to one question before proceeding to the next question
Rule 6: Evaluate answers to essay questions without knowing the identity of the writer
Rule 7: Whenever possible, have two or more persons grade each answer. The best way to check on the
reliability of the scoring of essay answers is to obtain two or more independent judgments.