Professional Documents
Culture Documents
Measures of Variability
Measures of Variability
1
Variability measures
• In addition to locating the center of the observed
values of the variable in the data, another important
aspect of a descriptive study of the variable is
numerically measuring the extent of variation around
the center. Two data sets of the same variable may
exhibit similar positions of center but may be
remarkably different with respect to variability.
• The variability measures should have the following
characteristics:
- be minimum if all the value of the distribution are the
same
-increase as increase the difference among the values
of the distribution
2
Shop Revenues Costs employe place Director Shop R.O
s e gender On-line
1 350 205 5 city male yes 145
3
Variability
Possible distribution
revenu revenu revenu revenu
e e e e (C) All the 3 possible
(A) (B) distribution have the
same mean of the
350 325 300 140
observed one
200 325 350 270 x 325
600 325 400 830
500 325 200 605
270 325 300 120
180 325 325 200
205 325 300 190 BUT the distribution are
very different!!!
340 325 400 200
280 325 350 370
Observed distribution
4
Some measures of variability
Range range xmax xmin
Interquartile range dQ Q3 Q1
5
Example
Revenue Revenue Revenue Revenue
(A) (B) (C)
350 325 300 140 A
No Variability
200 325 350 270
All values are
600 325 400 830 the same
500 325 200 605
6
Deviation from the mean
The variance σ2 is function of the differences
among each value xi and the mean x
1 n
xi x
2 2
n 11
2 0
i1
7
The standard is the squared root of the
variance
1 n
i x x 2
n i1
n
25 625
350
i
x x 2
Dev(X) 163200
200 -125 15625 i1
3 2 19,34
4 1 4,45
6 3 0,04
7 1 0,79
10 2 30,26
x 6,11
1 K 54,88
x j x n j
2
2
6,10
n j1 9
2,47
6,10 2,47 CV 100 40,43%
6,11
10
Standardised values
If a quantitative variable X as mean x
and standard deviation σ, it is possible to
obtain its standardised values
yi xi x / i 1...n
12
Comparison among the performance of two
founds (different mean)
F1 F2
F1 has a mean and a variance
2003 9,7 1,4
higher than F2.
2004 7,1 1,9
Can we say that F1 is an higher risk
2005 0,9 2,2
2006 9,9 2,1 found than F2?
2007 7,5 4,9
media 7,0 2,5
var 10,6 1,5
CV 46,5 49,3
13