Professional Documents
Culture Documents
9 Reliability
9 Reliability
RELIABILITY
Reliability – refers to the consistency of scores obtained by the same
person when re-examined with the same test on different occasions, or
with different sets of equivalent items, or under other variable
examining condition.
This mainly refers to the attribute of consistency in measurement.
Remember…
Measurement error is common in all fields of science.
Tests that are relatively free of measurement error are considered to be
reliable while tests that contain relatively large measurement error are
considered to be unreliable.
E = X - T
(error) (observed score) - (true score)
Standard error of measurement – the standard deviation of the
distribution of errors for each repeated application of the same test on
an individual.
Error (E) can either be positive or negative. If E positive, the Obtained
Score (X) will be higher than the True Score (T); if E is negative, then X
will be lower than T.
Although it is impossible to eliminate all measurement error, test
developers do strive to minimize psychometric nuisance through careful
attention to the sources of measurement error.
It is important to stress that true score is never known.
Factors that contribute to consistency:
D. Split-Half Reliability
It is obtained by splitting the items on a questionnaire or test in half,
computing a separate score for each half, and then calculating the degree of
consistency between the two scores for a group of participants.
The test can be divided according to the odd and even numbers of the items
(odd-even system).
This model of reliability measures the internal consistency of the test which
is the degree to which each test item measures the same construct. It is simply
the intercorrelations among the items.
If all items on a test measure the same construct, then it has a good internal
consistency.
Spearman-Brown, Kuder-Richardson, and Cronbach’s alpha are the formula
used to measure the internal consistency of a test.
Spearman –Brown Formula
A statistics which allows a test developer to estimate what correlations
between the two halves would have been if each half had been the length of
the whole test and have equal variances.
Alternative 2 1 Item
– Forms sampling
(Immediate)
Alternative 2 2 Item
– Forms sampling
changes over
(Delayed)
time
Split – Half 1 1 Item
sampling
(Spearman-
Brown) Nature of
split
Coefficent 1 1 Item
Alpha & sampling
Kuder -
Test
Richardson
Heterogeneity