Professional Documents
Culture Documents
Dtatistical Measures
Dtatistical Measures
Dtatistical Measures
Major Points
1. Measures of Central Tendency
1. Mean
2. Median
3. Mode
2. Measures of Position
1. Quartiles
2. Deciles
3. Percentiles
3. Measures of Dispersion
1. Range
2. Variance
3. Standard Deviation
4. Coefficient of variation
5. Interquartile Range
4. Other descriptive measures
1. Geometric mean
2. Weighted mean
Measures of Central Tendency
x
X
n
X/n = (9 + 10 + 12 + 13 + 15 + 15 + 15 + 16 + 18 +19)
10
= 142/10
= 14.2
x
X
n
n is used to refer to the number of scores in the data
.set (termed sample size)
Population Mean
• The algebraic definition of the population mean is as
follows:
X
N
N is used to refer to the number of scores in the data
set (termed population size).
Properties of the Mean
NOTE!
• When determining the median, you must arrange the
scores in ascending or descending order first!
Median
• If there are an ODD number of scores, the median is the
middle score:
1, 3, 6, 7, 8, 13, 15, 17, 18, 21,
23
Median = value of the (n + 1)/2 observation, (11+1)/2=6.
Look at the value of the 6th observation,
Median = 13
Q1 Q2 Q3
Step 2
Q1
= (170 + 175)/2
= 172.5 Q2
= (190 + 210)/2
= 200.0 Q3
= (235 + 240)/2
= 237.5
Quartiles Example
• Similarly the values which divide the data into ten equal
parts are called deciles and are denoted by D1, D2,.....,
D9, while the values dividing the data into one hundred
parts are called percentiles and are denoted by P1,
P2,....., P99.
• E.g.: 90th percentile, is the value such that 90% of
the observations are less or equal to.
Percentile
The value below / above which a particular percentage of values fall
(median is the 50th percentile)
e.g 5th percentile - 5% of values fall below it, 95% of values fall
above it.
A series of percentiles (1st, 5th, 25th, 50th, 75th, 95, 99th) gives a
good general idea of the scatter and shape of the data
Range
39 45 36 40 35 38 47 BC
27 52 18 33 70 PC
Examples
1. What factors account for the variance (or difference) in IQ
among individuals?
2. What factors account for the variance in treatment
compliance among different groups of patients?
1- Range
• The range tells us the span over which the data are
distributed, and is only a very rough measure of
variability
• Range: The difference between the maximum and
minimum scores (X max-X min)
– Example: The most amount of tips made in a night is 270 and
the least is 150. Therefore, the range of tips made that night is
270 – 150 = $120
• Range is the simplest measure of dispersion.
• It is not the best measure of dispersion as it depends
entirely on the extreme scores and tells us nothing
about the middle values. Also, it does not take in
consideration all values in a series of scores
Variation
XX
X
5 0.00 This is an example of data
5 0.00 with NO variability
5 0.00
5 0.00
X 5
= 25
0.00
n=5
X
=5
Variation
X
XX
6 +1.00 This is an example of data
X = 25 n=5 X =5
Variation
X
XX
8 +3.00 This is an example of data
1 -4.00 with higher variability
9 +4.00
5 0.00
2 -3.00
X = 25 n=5 X =5
2- Mean deviation
X X
n
BUT: We have a problem.
(X X ) will always add up to zero
Mean Deviation
The
e.g.
This average
Blood urea
deviation
indicates levelon
that, is(mg/dl)
the average
average,forthe
5
of
the
individuals:
absolute
values deviations
of x (blood urea (i.e. regardless
level) deviate
the
11.2sign) of the
mg/dl fromindividual
the meanobservations
of the
from their mean.
distribution.
Deviations from the mean
• In any group of scores, the sum of the deviations from the
mean equals zero:
X X- µ n=6
3 3 - 5.50 = -2.50 µ = Σ X/n
5 5 - 5.50 = -0.50 µ = 33/6
9 9 - 5.50 = +3.50 µ = 5.50
2 2 - 5.50 = -3.50
8 8 - 5.50 = +2.50
6 6 - 5.50 = +0.50
ΣX = 33 Σ(X- µ) = 0.00
Variance & Standard Deviation
s 2 X x 2
n 1
Disadvantages of Varience
X x
2
Standard Deviation SD
n 1
Steps to calculate standard deviation
X
8 +3.00 9.00
1 -4.00 16.00
9 +4.00 16.00
5 0.00 0.00
2 -3.00 9.00
X = 25
X X = 0.00
X X
2
= 50.00
X X
2
Note: The is called the Sum of Squares
Why use Standard Deviation and not
Variance!??!
• Normally, you will only calculate variance in order to calculate
standard deviation, as standard deviation is what we typically
want.
Higher variability
for body weight
CV = 30 / 85 x 100 = 35.29