Professional Documents
Culture Documents
06.measures of Dispersion With Notes-D
06.measures of Dispersion With Notes-D
Dispersion
Measures of Dispersion
• The degree to which numerical data tend to spread about
an average value is called the dispersion, or variation of the
data.
3
Measures of Dispersion
Measures of Absolute Dispersion
• carries the unit of measure of the observations
• can be used to compare data sets with the same
means and the same units of measurement
6
Mean Deviation
Mean Deviation
Class Interval Class Midpoint Frequency I(x-x̄)I f I(x-x̄)I
53-59 56 3 19.78 59.34
60-66 63 14 12.78 178.92
67-73 70 15 5.78 86.7
74-80 77 26 1.22 31.72
81-87 84 10 8.22 82.2
88-94 91 8 15.22 121.76
95-101 98 4 22.22 88.88
80 75.78 6062.4
Sum 6711.92
Mean = 75.78
MAD = 83.90
Semi-Interquartile Range
• The semi-interquartile range or quartile deviation, of a set
of data is denoted by Q and is defined by
where Q1 and Q3 are the first and third quartiles for the
data.
• The interquartile range Q3–Q1 is sometimes used, but the
semi-interquartile range is more common as a measure of
dispersion.
Semi-Interquartile Range
Example:
Find the semi-interquartile range for the height distribution:
Lower quartile, Q1 = 65.64
Upper quartile, Q3 = 69.61
Semi-interquartile range, Q = (69.61 – 65.64)/2 = 1.98
Note that 50% of the cases lie between Q1 and Q3, thus 50
students have heights between 65.64 and 69.91. We can
consider ½ (Q1 + Q3) = 67.63 to be a measure of central
tendency. It follows that 50% of the heights lie in the range
67.63 ± 1.98 in.
The 10 – 90 Percentile Range
/ Interdecile Range
The 10 – 90 percentile range of a set of data is defined by
where P10 and P90 are the 10th and 90th percentiles for the
data. The semi- 10 – 90 percentile range, ½ (P90 – P10) can
also be used but is not commonly employed.
The 10 – 90 Percentile Range
/ Interdecile Range
Example:
Find the 10 – 90 percentile range of the heights of the
students
10th percentile, P10 = 63.33 in.
90th percentile, P90 = 71.27 in.
10 – 90 percentile range = 71.27 – 63.33 = 7.94.
Since ½ (P90 + P10) = 67. 30in and ½ (P90 – P10) = 3.97 in, we can
conclude that 80% of the students in the range 67.30 ± 3.97 in.
Variance
▪ The variance of a set of data is the square of the standard
deviation and is given by “s2”. To distinguish the standard
deviation of a population to the standard deviation of a
sample drawn, the symbol “s” is used for the latter and “σ”
for the former.
▪ It describes how far the observations are from the mean. It
comes in square of the unit of measure of the
observations.
Variance
14
Variance for Grouped Data
Sheppard’s Correction for
Variance
The computation of the standard deviation is somewhat in error
as a result of grouping the data into classes (grouping error). To
adjust for grouping error:
x = 10 + 12 + 14 + 15 + 17 + 18 + 18 + 24 = 128 = 16
8 8
18
Standard Deviation
Example:
Consider the pre-test scores of eight sampled
participants in descriptive statistics training course:
10, 12, 14, 15, 17, 18, 18, 24
10 − 16 2 + 12 − 16 2 + + 24 − 16 2
S=
( ) ( ) ( ) = 4.3095
8-1
19
Standard Deviation
Example:
Site A: Heights (inches) of five trees
Data B 11 12 13 14 15 16 17 18 19 20 21
Mean = 15.5
s = 0.9258
Data C
Mean = 15.5
11 12 13 14 15 16 17 18 19 20 21 s = 4.57
Mean = 75.78
With the frequency distribution above, find the following:
a. Semi-interquartile Range
b. Interdecile Range
c. Variance (with and without Sheppard’s Correction)
d. Standard Deviation
Empirical Relations Between
Measures of Dispersion
For moderately skewed distributions, we have the empirical
formulas
29
Coefficient of Variation
Example:
Suppose you have two options in buying a stock. Stock 1 is
currently priced at P2000 per share and stock 2 is priced
P550 per share. In buying stocks, risk is reduced by choosing
a stock with stable price. However, once could take a chance
on a stock that shows greater variation in price, hoping the
prices go up rather than down. A sample of prices of Stock 1
and Stock 2 were collected at the close of trading for the
past months.
30
Coefficient of Variation
Example:
33
Standard Score
Z = x− μ Z= x −x
σ s
Population z-score Sample z-score
σ = population standard s = sample standard
deviation deviation
μ = population mean x = sample mean
34
Standard Score
Example:
The mean score of participants in Exercise 1 of the training
course is 70% with standard deviation of 10%; while in
Exercise 2, the mean score is 80% with a standard
deviation of 10%.
exer1 exer2
36
Standard Score
Remark on the Standard Score
• It can be used in identifying possible outliers in the
data set. By rule of thumb, if the absolute value of
the standard score is
at least 3 then that observation is marked
as a possible outlier.
38