Professional Documents
Culture Documents
Topic 8F Validity Reliability and Sources of Error
Topic 8F Validity Reliability and Sources of Error
Topic 8F Validity Reliability and Sources of Error
Sources of Error
Finals
Validity
Validity means the degree to which a test or measuring
instrument measures what it intends to measure. The response
should have a veracity or truthfulness of results or responses.
As an example, the test item is “Who is the goddess of beauty?” Of the
100 students in Methodology, 90 or 90 percent answered“Venus” as
the goddess of beauty. Hence, 90% of the responses is valid because
the correct answer is“Venus”. Another example, the test item is “How
many grams are there in 5 kilograms?” Of the 50 students whotook the
test in mathematics, 50 or 100% answered “5,000 grams,” hence, the
test is valid.
Types of Validity
• Content Validity
• refers to the extent to which the content or topic of the test is truly
representative of the content of the course. It involves, essentially the
systematic examination of the test content to determine whether it covers a
representative sample of the behavior domain. The content validity
depends on the relevance of the individual’s responses to the behavior
area under consideration rather on the apparent relevance of thecontent.
• This type of validity is commonly used in evaluating achievements test
Example
A researcher wishes to validate a questionnaire in Science. He requests experts in
Science to judge ifthe items measure the knowledge, skills, and values supposed to
be measured.
Directions: Below are teaching strategies used in the teaching of Science. Indicate
the extent to which eachstrategy is used in teaching Science in your school by
encircling one of the options on the right column.
The options 4,3,2, and 1 represent the extent of use, thus:4 – very often 2 –
sometimes3 – often 1 – never
• The researcher requires a selected group of experts to validate the
content of the questionnaire on thebasis of the foregoing questions. If
the weight mean is 2.5 and above, the item is retained; 1.5 to 2.4,
needsimprovement; and 1.4 and below, delete.
• For example, there are five experts to validate the above
questionnaire. In Item 1, “Curriculumenrichment,” 4 experts rated
3 and 1, rated 2. Weighted mean is used in this example. The
computation is asfollows: retain
• Thus, the Item 1 “Curriculum enrichment” should be retained
• Concurrent validity
• refer to the degree to which the test agrees or correlates with a criterion set
up as an acceptablemeasure. The criterion is always available at the time of
testing and it is applicable to test employed for thediagnosis of existing status
rather than for prediction of future outcome.
• Example: A researcher wishes to validate a Mathematics achievement test he
has constructed. He administeredthis test to a group of Mathematics
students. The result of this test is correlated with an acceptable
Mathematics test which has been previously proven as valid. If the
correlation is” high,” the Mathematics testhe has constructed is valid
• Predictive validity
• According to Aquino et al. 1974), this validity is determined by showing how
well predictions made fromthe test are confirmed by evidence gathered at
some subsequent time. The criterion measured against thistype of validation
is so important for the reason that the outcome of the subjects is predicted.
• Example: Suppose the researcher wants to estimate how well a high school
student may be able to do in collegecourse on the basis of how well he has
done on tests he took in high school subjects. The criterion measuredagainst
which the test scores are validated and obtained are available and obtained
are available after a longperiod of interval.
• Construct validity
• The construct validity of a test is the extent to which the test measures a
theoretical construct or trait.This involves such test as understanding,
appreciation, and interpretation of data.
• Example: Suppose a researcher wishes to establish the validity of an IQ
( Intelligence Quotient) using SCRIT(Safran Cultures-Reduced Intelligence
Test). He hypothesized that pupils with high IQ also have
highachievement and low IQ, low achievement. He, therefore, administers
both SCRIT and achievement test to two groups of pupils with high and low
IQ, respectively. If the results show that those with high IQ, have high
scoresin the achievement tests, the test is valid.
Reliability
• The word reliability means the extent to which a research instrument
is dependable, consistent, andstable (Meriam, 1975). It means that
the test agrees with itself. If a person takes the same test twice, the
testyields the same results. However, a reliable test may not always
be valid.
• Testing Error
• The testing error is related to how clear and complete the directions are, how
rigidly the instructions are followed, and whether supplementary directions
or motivation is applied.
• Scoring Error
• Errors in scoring relates to the competence, experience, and dedication of the
scorers and to the nature of the scoring itself. The extent to which the scorer
is familiar with the behavior being tested and the test items can greatly affect
scoring accuracy. Carelessness and in attention to detail produce
• Measurement Error
• Measurement error is a source of error because of instrumentation which
includes such obvious causes as inaccuracy and lack of calibration of
mechanical and electronic equipment. It is also refers to the inadequacy of a
test to discriminate between abilities and to the difficulty of scoring some
tests.
End of Slide.