Professional Documents
Culture Documents
Reliability and Validity: DR James Betts
Reliability and Validity: DR James Betts
Dr James Betts
Lecture Outline:
•Definition of Terms
•Types of Validity
•Threats to Validity
•Types of Reliability
•Threats to Reliability
•Introduction to Measurement Error.
Commonly used terms…
…in science…
“The conclusion of the study was not valid”
(Nelson 1997)
Some definitions…
• Objectivity
• External
Logical Statistical
Construct
e.g.
If you want to assess reaction
time, measuring how long it
takes an individual to react to
a given stimulus would have Externally
face validity Valid?
Logical Validity
• Face Validity
– Infers that a test is valid by definition
– It is clear that the test measures what it is supposed to
i.e.
Would assessing 15 m sprint
time be a valid means of
assessing reaction time?
e.g.
VO2
max
e.g.
Can performance
during test A be
used to predict
future performance
in test B?
A B
http://www.youtube.com/watch?v=vdPQ3QxDZ1s
Overall:
Sportsmanship.
Logical/Statistical Validity
• Construct Validity
– Infers not only that the test is measuring what it is
supposed to, but also that it is capable of detecting
what should exist, theoretically
– Therefore relates to hypothetical or intangible
constructs
– This makes assessment difficult,
i.e. if what should exist cannot be detected, this could mean:
O 1
T O 2
Threats to Internal Validity
• Maturation (possible solution)
Time series
O 1 O 2 O 3 T O 4 O 5 O 6
Threats to Internal Validity
• Maturation (possible solution)
Pre-test Post-test Randomised Group Comparison
O 1 T O 2
n.b.
R RCT
O 3 P O 4
Threats to Internal Validity
• Maturation (possible solution)
Repeated measures designs can occasionally be an inappropriate
solution, even when randomised and counterbalanced
e.g.
Muscle Damage (repeated bout effect)
Vitamin Supplementation (wash-out period)
O 1
T O 2
e.g. exercise?
Assessing muscle
mass here could make
R them train harder in
…but then respond better
to the T than the P…
O 3
both trials…
P O 4
Threats to Internal/External Validity
• Pre-testing (possible solution)
T
O 1
O 2
P O
O 4
R
3
T
O 5
Solomon Four-
Group Design P O 6
Threats to Internal Validity
Sophomore
Slump & SI
• Statistical Regression ‘Cover Jinx’
– AKA regression to the mean
Oa
Threats to Internal Validity
• Selection Bias (possible solution)
Either: T O 1
-Randomise group
assignment,
-Pre-test and post-
test difference, P
-Repeated Measures Oa
Design.
Threats to Internal/External Validity
• Experimental Mortality
– Missing Data due to subject drop-out
– Reduced n = reduced statistical Power
– Not only challenges quality of data gathered
(Internal Validity) but
also our ability to
generalise
(External Validity).
Therefore, solution =
recruit sufficient (young?)
participants
Threats to External Validity
• Inadequate description
– 5th characteristic of research…
…should be replicable
Therefore, solution =
control the lab environment.
Threats to External Validity
• Demand Characteristics
– Participants detect the purpose of the study and
behave accordingly
e.g.
Sports Science students already know that the
carbohydrate drink is supposedly superior
Therefore, solution =
CHO double or single H2 O
blinding.
Threats to External Validity
• Operationalisation
– AKA Ecological Validity
– The DV must have some relevance in the
‘real world’
e.g.
TTE has no
Olympic
equivalent
• Relative
• Absolute
• Rater reliability (Objectivity)
– Intrarater reliability
– Interrater reliability.
Relative Reliability
• Precision of Measurements
– i.e. if we are happy to measure VO2 max to the nearest
10 ml.kg-1.min-1, then it could probably be reliably
predicted from your training volume and age.
Measurement Errors
• Ultimately, reliability is dependent on the
degree of measurement error in a given study