Professional Documents
Culture Documents
5.concepts of Reliability
5.concepts of Reliability
5.concepts of Reliability
University of Balamand
PSYC 243
Presentation Plan
I. Introduction
II. Theory and Assumptions
A. Classical Test Theory
B. Item Response Theory
III. Sources of measurement error
IV. Measurement error and reliability
V. Standard error of measurement
VI. Types of Reliability
I- Introduction
• Context
– What type of reliability?
– Reliable for what applications?
• Continuum
– Range of reliabilities
– Highly consistent ← → highly inconsistent
II- Theory and Assumptions
Charles Spearman
• Psychometrician
• Developed factor analysis
• Developed classical test theory
A - Classical Test Theory
X=T+e
• X is the obtained score
• T is the true score
• e represents errors of measurements.
Classical Test Theory
With meas
X=T+E
True Scores
T1 T2 T3
1. Item Selection
2. Test Administration
3. Test Scoring
4. Systematic Measurement Error
1 - Item Selection
• Essentially the
proportion of X that is T
The Reliability Coefficient
• Types of reliability
1) Test-retest
2) Alternate-Forms
1) Test-Retest Reliability
• Sources of error
– random fluctuations in performance
– uncontrolled testing conditions
• extreme changes in weather
• sudden noises / chronic noise
• other distractions
– internal factors
• illness, fatigue, emotional strain, worry
• recent experiences
Test-Retest Reliability
• Practice effects
– Type of carryover effect
– Some skills improve with practice
• Manual dexterity, ingenuity or creativity
– Practice effects may not benefit everybody in the same way.
• Carryover & Practice effects more of a problem with
short inter-test intervals (ITI).
• But, longer ITI’s have other problems
– developmental change, maturation, exposure to historical
events
2) Alternate-Forms Reliability
1) Split-Half Reliability
2) The Spearman-Brown Formula
3) Coefficient Alpha
4) Interscorer Reliability
1) Split-Half Reliability
• How to:
– Compute scores for two halves of single test,
calculate r.
• In addition of calculating a Pearson r between
scores on the two equivalent halves of the
test, the computation of a coefficient of split-
half reliability entails an additional step:
adjusting the half-test reliability using the
Spearman-Brown formula.
2) The Spearman Brown Formula
• Where r* is the
estimated reliability, r is
the correlation between
the halves, j is the new
length proportional to
the old length
The Spearman Brown Formula
CTT IRT
A single reliability (precision of Precision of measurement can vary,
measurement) is typically calculated for depending on where an individual falls in
the entire test. the trait range.
A single set of items is typically A different collection of test items might
administered to all examinees. be used for each examinee to obtain a
predetermined precision of
measurement.
3 – Invariance in IRT