Professional Documents
Culture Documents
PK 1 Primer Quality
PK 1 Primer Quality
Quality
Introduction
Quantification
Definition
Measurement
Reliability
Measurement Error
Undependability
Objectivity
Validity
Types of Validity
Content-Related Validity
Logical (Face) Validity
Criterion-Related Validity
Concurrent Validity
Predictive Validity
Predictive Index
Construct-Related Validity
Summary of Validity
Relationships: Quantification, Reliability, Objectivity, Validity
The Meaning of Error
Measurement Error (Error of Observation)
Discrete vs. Continuous Variables
Discrete
Continuous
Random vs. Systematic Error
Random (Experimental) Error
Systematic Error
Observed Values
Imprecision and Bias
Precision
Bias
Accuracy
Target Example
Sources of Error
Conceptual Errors
Illegitimate Errors
Blunders
Unfavorable Ambient Conditions
Errors of Coding
Errors of Computation
Computer Program Errors
Chaotic Errors
Errors in Assembling and Reporting Data
Rounding Procedures
Median Values
Correlations Ignore Systematic Errors
Test-Re-test Reliability Coefficients
T-Tests and F-Tests May Ignore Random Errors
The Technical Error of Measurement
Example Calculation (TEM)
The Percent Technical Error of Measurement
Example Calculation (%TEM)
Spreadsheet Calculation of TEM for Individuals
Standards of Excellence
Overview
Examples in Comprehensive Protocols
A Mathematical Simulation of Scale Increments & TEM
Combining Errors
Theory of Combining Errors
Replication and Percent Reduction in Error
Median as Best Estimate of True Value
Practical Decisions Using TEM
Data Management
AMPlan
Troika Training
Introduction
Assessing individual growth status or monitoring change depends upon
"theory and process," as advocated by W Edwards Deming, the
American who taught Japanese business about quality and who helped
bring about a revolution in manufacturing and business. Deming
believes, after all is said and done, that quality is a function of human
commitment (Peters & Austin (1983)).
Quantification
Quantification is the process of ascribing numbers to variables. This
involves two distinct processes:
1. Definition;
2. Measurement.
Reliability
Reliability is defined as the degree to which repeated measurements of
a variable provide a consistent score. Reliability is enhanced by
reducing two factors:
measurement error;
undependability.
Tests Reliability
Physical Endurance .
muscular endurance .85 - .90
Accuracy Tests .
Validity
The validity of a measurement is the extent to which it measures what
it is supposed to measure. This requires external criteria that are also
objective. The American Psychological Association (1985) outlines
different types of validity, as is shown in figure 1 below:
PI = 100 X (1 - (1 - r2)0.5)
Correlation (r) %
0.00 0
0.10 1
0.20 2
0.30 5
0.40 8
0.50 13
0.60 20
0.70 29
0.80 40
0.90 56
0.95 69
0.98 80
0.99 86
0.999 96
It should be noted that the above definitions may not provide exact
boundaries among the different classifications of validity, and that
some examples may fit into more than one category. The definitions
are intended merely as an aid to the assessment of validity in
interpreting anthropometric data.
Continuous variables may take any value within a defined range, and
between any two values an indefinitely large number of in-between
values can occur. Continuous variables can thus be measured with
increasingly smaller fractions or greater number of decimal places,
depending upon the amount of accuracy required and the precision of
the measurement process. For instance, the height of the back of a
chair is a continuous variable. The measurement may be 'rounded off'
to the nearest whole meter, which is likely not to be the exact height
of the chair. In other words, some degree of error is introduced by the
rounding off process. While taking the measurement to a greater
number of decimal places (e.g. to the nearest centimeter, or to the
nearest millimeter) can reduce this error, in theory there is always
some error present. In fact, no matter how small the scale of
measurement is, the 'true' height of the chair may be somewhere
between two units on that scale. The error that results from this
rounding off process is called random error, because the rounded
value is just as likely to be above as below the true value.
improvements in instrumentation;
The other type of error is called systematic error, and it may or may
not be present in a given instance, irrespective of the type of variable
being considered. It is the effect of regular definable departures from
theoretical expectancies. An example of systematic error with a
discrete variable involves two individuals taking someone’s exercise
pulse rate over the same fifteen-second period. The first individual
may use the proper technique, which involves clicking the stopwatch
simultaneously with the counting of the pulse beats from zero. The
other may begin at the same time by starting the pulse count from
one. Thus on every occasion the second individual would have a
systematic error which would overestimate the true value of the pulse
rate by one.
Where:
Target Example
ACCURATE
INACCURATE
INACCURATE
INACCURATE
Sources of Error
Conceptual Errors (These reduce validity!)
Blunders:
Examples include incorrect reading of values, poor verbalization or
other faulty transmission of these values, recording error, and
computer entry error. Automated instrumentation may help but
cannot substitute for vigilance.
Errors of coding:
These most commonly involve ascribing data to the wrong subject,
or mis-classifying a subject.
Errors of computation:
These are common when protocol calls for two or three trials which
are averaged, rather than taking three trials and using the median.
They may also include decimal point errors and reversals of digits.
Chaotic errors:
These are large errors of the kind that force the criterion
anthropometrist to call a "halt and reassessment." For example,
assigning data to the wrong subject, or the failure of an electronic
device; e.g. a scale or instrument dependent on batteries.
Rounding Procedures
. A B C Mean Median
True Values 5 5 5 5 5
Measured 5 5 11 9 5
Values
A B
5 6
4 5
3 4
2 3
1 2
T-tests and F-tests are often inadequate for describing random error.
Suppose the repeated measures were as shown below in Table 5.
5 1
4 2
3 3
2 4
1 5
. . . . .
2.75 0.50 3.75
. . . . . . .
S D2 7.00
. . . . . . .
TEM 0.48
= (7.00/(2 X 15))0.5
= 0.230.5
= 0.48
With the median of the first series above being 10, the %TEM is:
= 100 (0.48/10.0)
= 4.8%
Spreadsheet Calculation of TEM's for Individuals
Table 7 is a copy of a Microsoft Excel spreadsheet technical error and
percent technical error calculator for individual subjects.
. A B C D E F
Standards of Excellence
New instruments for lengths and breadths are now available, and they
appear to reduce the technical error. However, controlled experiments
are difficult to conduct since "experience" with one instrument may
bias the data obtained by another. The arrow in figure 5, below, calls
attention to the relatively high %TEM for hand breadth. A grip breadth
technique is proposed in this treatise since hands are malleable,
especially in children. Skinfold technical errors are relatively small in
absolute terms but are magnified when expressed as percentage
values.
Combining Errors
Equation 6: Total error is the square root of the sum of the error
variances.
it is a real value;
Data Management
( A to B I-D-E-A)
General display and report for scientific and professional interpretation for groups
and individuals - display & report
Troika Training
Cited References
Carr RV, Rempel RD and Ross WD (1989) Sitting height: an analysis of five
measurement techniques. Am J Phys Anthropol 79: 339-344.
Carr RV, Blade L, Rempel R and Ross WD (1993) Technical note: on the
measurement of direct vs. projected anthropometric lengths. Am J Phys
Anthropol 90: 515-517.
Carter JEL (1975) The Heath-Carter Somatotype Method (2nd ed) San Diego:
San Diego State University.
Johnston FE, Hamill PVV and Lemeshow S (1972) Skinfold thickness of children
6-11 years. Unit, Unied States DHEW Pub no (HSM) 73, 1602. Washington, DC:
US Government Printing Office.
Martin AD, Drinkwater DT, Clarys JP and Ross WD (1986) The inconsistency of
the fat free mass: a reappraisal with implications for densitometry. In: T Reilly, J
Watson and J Borms (eds): Kinanthropometry III, London: E & FN Spon, 92-97.
Ross WD and Ward R (1984a) Proportionality of Olympic athletes. In: JEL Carter
(ed) Physical Structure of Olympic Athletes. Med Sport Sci, Basel: Karger (18):
110-143.
Ross WD and Eiben OG (1991) The sum of skinfolds and the O-Scale System
for physique assessment rating of adiposity. Anthrop Közl, 33: 299-303.
Ross WD, Kerr DA, Carter JEL, Ackland TR and Bach TM (1994) Appendix B -
Anthropometric Techniques: Precision and Accuracy. In: JEL Carter & TR
Ackland (eds) Kinanthropometry in Aquatic Sports (5). Champaign, IL: Human
Kinetics, 158-169.
Ross WD, Carr RV and Carter JEL (1999) Anthropometry Illustrated. (An
interactive CD-ROM text and learning system) Surrey BC: Turnpike Electronic
Publications Inc.
Schmidt PK and Carter JEL (1990) Static and dynamic differences among five
types of skinfold calipers. Hum Biol 62 (3) (June): 369-388.
Schutz RW (1998) An informal survey of reliabilities for various tests taken from
the disciplines involved in kinesiology / human kinetics. Personal correspondence
RVC.
Sokal RR and Rohlf FJ (1981) Biometry: The Principles and Practice of Statistics
in Biological Research (2nd ed). New York: WH Freeman, 571-572.
Ward Richard (1989) The O-Scale System for Human Physique Assessment.
Doctoral dissertation, Simon Fraser University, Burnaby, BC, Canada.