Professional Documents
Culture Documents
Chapter 5 Reliability
Chapter 5 Reliability
Chapter 5 Reliability
5-3 5-4
• An instrument is said to be reliable if it accurately Measurement Error- Other than the variable being
reflects the true score, and thus minimizes the measured, all of the factors associated with the
error component. process of measuring some variable.
– If you see a reliability coefficient of .85, this means • A test written in English will turn out to be measurement
that 85% of the variability in observed scores is error if there are students from different countries
presumed to represent true individual differences and
15% of the variability is due to measurement error.
5-5 5-6
1
10/26/2022
• Test Construction
• Test Administration
Random Error- Systematic Error-
a source of error in a source of error in measuring
• Test Scoring and Interpretation
measuring a targeted variable a variable that is typically • Other Sources
caused by unpredictable constant or proportionate to
inconsistencies of other what is presumed to be the
variables in the measurement true value of the variable being
process (e.g. noise) measured. (e.g. a weight scale
measure half a kilogram less)
5-7 5-8
5-9 5-10
5-11 5-12
2
10/26/2022
5-13 5-14
5-15 5-16
Reliability Estimates
Reliability Estimates
Parallel-Forms or Alternate-Forms
Estimate of test-retest reliability might be low
– In a math test if tutorial is taken before the second
administration.
• Involves using differently worded questions to
– In a personality test if the test-taker suffered from
measure the same construct.
emotional trauma or received counseling before the – Questions or items are reworded to produce two items
second administration. that are similar but not identical.
– During the times of great developmental change – Items must focus on the same exact aspect of behavior
even short time interval between testings might result in with the same vocabulary level and same level of
low test-retest realibility. difficulty.
5-17 5-18
3
10/26/2022
• Reliability is the correlation between the responses to the • Test A: All college students graduate sooner or later.
pairs of questions.
• Alternate forms reliability is said to avoid the practice • Test B: Everyone who works for a good company gets a
effects that can inflate test-retest reliability (i.e., promotion sooner or later.
respondent can recall how they answered on the identical
item on the first test administration).
5-19 5-20
Parallel-Forms and Alternate-Forms are similar • It is a measure of how well related the items.
in two ways:
– Are different items all measure the same thing?
•Reliability is checked by administering two forms
of a test to the same group. • It is applied to groups of items thought to measure
different aspects of the same concept.
•Scores may be affected by error related to the state
of testtakers (e.g. practice, fatigue, etc.) or item
sampling.
5-21 5-22
5-23 5-24
4
10/26/2022
5-25 5-26
Reliability Estimates
Reliability Estimates
Other Methods of Estimating Internal Consistency
The Spearman-Brown Formula
–Usually, but not always, reliability increases as test Kuder-Richardson formula 20- the inter-item
length increases. consistency of dichotomous items
5-27 5-28
Reliability Estimates
Reliability Estimates
Other Methods of Estimating Internal Consistency
• Measures of reliability are estimates and
Cronbach’s Alpha is mean of all possible split-half estimates are subject to error
correlations. • The reliability coefficient varies with the sample
: developed by Cronbach (1951). of testtakers.
– is a function of the number of items in the scale and the
degree of their intercorrelations.
– corrected by the Spearman-Brown formula.
– the most popular approach for internal consistency.
– Values range from 0 to 1.
– High when more than 25 items
5-29 5-30
5
10/26/2022
5-31 5-32
5-33 5-34
5-35 5-36
6
10/26/2022
•An estimate of the amount of error inherent in an –The higher the reliability of the test, the lower the
observed score or measurement. standard error.
5-37 5-38
5-39 5-40
SEM= SD 5 1 1-reliabilit
.84 y
–SEM=2
–Confidence Interval= SEM x 1.96= 2 x 1.96=3.92
•We can be 95% sure that Lawrence's "true" attention span
score falls between 60 and 68.
5-41