Professional Documents
Culture Documents
MODULE 5 - Measures of Variability
MODULE 5 - Measures of Variability
MEASURES OF VARIABILITY
Introduction
Group 1 45 50 76 77 80 81 90 95
Group 2 25 50 76 77 80 81 90 95
Example
56 57 63 75 78 79 80 82 87 89 90 92
Q1 = (n+1)/4 , is the position of the first quartile (Q1), where n is the number of
scores
= (12+1)/4
= 3.25
Similarly,
쳌䁓 쳌쳌
Therefore: i i i 䁓
For Grouped Scores
Here is a set of score from a class of 50 student. The scores have already been
summarized into a frequency distribution.
X f cf
95-99 3 50
90-94 4 47
85-89 5 43
80-84 8 38
75-79 6 30 Q3
70-74 10 24
65-69 4 14
60-64 4 10 Q1
55-59 2 6
50-54 0 4
45-49 1 4
40-44 3 3
㠶㐠
i
㐠
Where : Q1= the first quartile (25th percentile)
LQ1 = lower limit of the first quartile (25th percentile class)
N = total number of frequencies in the distribution
cf = cumulative frequency of the first quartile
fQ1 = frequency of the first quartile (25th percentile)
i= size of the interval of the first quartile
Note: Just like in finding the median of a set of grouped scores, the first step is to get
the cumulative frequency. In the formula, N is divided by 4 (N/4), because we are
looking for the score which is at middle of the lower half of the distribution. Just like the
median, the same procedure is applied in looking for the value of cf, f and the lowest
limit.
㠶㐠
So, i
㐠
Similarly, we use the same formula in finding Q3. The only difference is to get ¾
of N, because we are looking for the score that lies at the middle of the top half of the
distribution.
㠶㐠
i
㐠
Where : Q3= the third quartile (75th percentile)
LQ3 = lower limit of the third quartile (75th percentile class)
N = total number of frequencies in the distribution
cf= cumulative frequency of the 3rd quartile class
fQ3 = frequency of the third quartile
i= size of the interval of the 3rd quartile class
㠶㐠
So, i
㐠
The score that divides the distribution into ten equal parts is called the
decile. Just like Q1 and Q3, we can compute the Decile by determining the
number of cases required. For example, we are looking for the 1st Decile (D1),
then we divide N by 10, then 2N/10 for D2, 3N/10 for D3 , and so on.
Example . Find the 3nd Decile (D3) and 8th Decile (D8)
56 57 63 75 78 79 80 82 87 89 90 92 95 96
97
If the answer is not a whole number, round-up to the nearest whole number, and
that is the position of the decile. In this example 4.5 = 5 Thus, the 2nd decile (D2)
is the 5th score from the lowest = 78.
If the answer is a whole number, get the average of that corresponding value in
your data set and the value that directly follows it. In this example, the 8th decile
is the average of the 12th and the 13th score. That is (92+95)/2 = 93.5
For Grouped Scores
Just like in finding the quartiles of a set of grouped scores, the first step is to get the
cumulative frequency. The same procedure is applied in looking for the value of cf, f
and the lowest limit.
The same procedure may be used when we find the score below which
any percentage of the group falls. These values are called percentiles. The
median is the 50th percentile, i.e., the score below which 50 percent of individuals
fall. If we want to find the 40th percentile, we must find the score below which 40
percent of the cases fall. Any other percentiles can be found in the same way.
Percentiles have many uses, especially in connection with test norms and
interpretation of scores
Example . Find the 45th Percentile (P45) and 60th Percentile (P60)
56 57 63 75 78 79 80 82 87 89 90 92 95 96 97
Just like the deciles, if the answer is a whole number, get the average of that
corresponding value in your data set and the value that directly follows it. In this
example, the 60th percentile is the average of the 9th and the 10th score. That is
(87 + 89)/2 = 88.
For Grouped Scores
Just like in finding the quartiles and deciles of a set of grouped scores, the first
step is to get the cumulative frequency. The same procedure is applied in looking for
the value of cf, f and the lowest limit.
Note: The values of P20 and D2 are the same because in either measure, we are
looking for the score in which 20 percent of the cases falls below it.
The Variance (Ungrouped Data)
s2 i ; i = 39.11
쳌
Example 2. Compute for variance of the following Geometry scores of the ten
students: 92 95 75 63 45 87 99 90 98 86
Score d d2
92 9 81
95 12 144
75 -8 64
63 -20 400
45 -38 1444
87 4 16
99 16 256
90 7 49
98 15 225
86 3 9
N = 10 ∑d2= 2,688
Mean = 83
ǡ쳌
Sample variance (s2) i = 298.67
쳌
Take note that the two examples have the same value for the Mean but with different
values for the variance. The variance for the second example (Geometry scores) is
larger because the scores are more dispersed/scattered from the mean, thus higher
variability.
The Standard Deviation (Ungrouped Data)
The standard deviation gives a better idea of how the data entries differ from the
mean. It is computed by extracting the square root of the variance. The formula for the
So how do we interpret the standard deviation of 6.25 and 17.28. For the
Algebra scores, it means that on the average, the scores are 6.25 away from the mean.
For the Geometry scores, it means that on the average, the distance of the scores from
the mean is 17.28. Theoretically, standard deviation and variance describe how
scattered the scores are from a central point (the mean). In layman’s term, the higher
the value of the standard deviation or variance, the more the scores scatter from the
mean. Thus, the distances of the scores are larger. Based on the two given examples,
the average scores are the same (Mean =83), and the number of scores is also the
same (n=10). But the scores for Geometry are farther away from each other and from
the mean, compared to the scores in Algebra.
An alternative way of computing for the standard deviation is to use the sum of
all the scores and the sum of all its squares. The formula is:
t
i ; where: Xi = the ith observed value for the given variable X
n = sample size
Using Example 1
Xi (scores) Xi2
92 8,464
75 5,625
85 7,225
83 6,889
90 8,100
73 5,329
79 6,241
80 6,400
88 7,744
85 7,225
∑Xi= 830 ∑Xi2= 69,242
Steps:
1. Get the sum of scores
2. Square all the scores and get the sum .
3. Substitute it with the formula
ິ t l l ິ ິl
i l
;i ິ
; i ິ
= 쳌䁓 i 6.25
t l l ິl
i l
;i ິ
; i ິ
= 쳌 䁓쳌 i 䁓
The standard deviation and variance of grouped data are calculated using
the class marks of each step interval, or using the deviations
1. Finding SD using the Class Marks, the formula is:
㐠t香䁘 t 㐠香䁘
i t
X F Class f x CM f x (CM)2
Mark (CM)
Not
= l䁓l
= 7.75 The variance (s2) in this data set is (7.75)2 = 60.08.
where: = interval
= number of samples
㐠 = summation of frequency deviation
㐠 = summation of frequency x squared deviation
Steps:
1. Choose any step interval for the assumed mean as the arbitrary starting point
or “origin”. In the example given, the interval 60-62 has been chosen. Call this
interval zero deviation, and the next higher interval +1, the lower interval -1, etc.
These are shown in the column labeled d. (Note: Any interval can be chosen,
and the final result will be the same)
2. Multiply frequency (f) by the number of deviations (d) and the resulting
product is shown in column labeled fd. Get the sum of fd by taking into account
the plus and minus signs.
3. To get fd2, multiply d by the fd. Then get the sum of fd2
Illustration:
X F d fd fd2
75-77 3 5 15 75
72-74 4 4 16 64
69-71 6 3 18 54
66-68 5 2 10 20
63-65 8 1 8 +67 8
60-62 9 0 0 0
57-59 5 -1 -5 5
54-56 8 -2 -16 32
51-53 3 -3 -9 27
48-50 2 -4 -8 32
45-47 2 -5 -10 - 48 50
N=55 ∑fd = +19 ∑fd2 = 367
㐠 㐠
i
쳌 쳌
i 㷟
㷟
i 쳌䁓 䁓 㷟
i 쳌䁓쳌
i 䁓
= 7.75
The Coefficient of Variation
쳌䁐 䁐䁠 E L䁐쳌LL
CV = E䁐
x 100%
Using Example 1 (Algebra scores), the Standard deviation (s) = 6.25 and Mean = 83
쳌䁓
CV = x 100% = 7.53 %
Using Example 2 (Geometry scores), the Standard deviation (s) = 17.28 and Mean = 83
䁓
CV = x 100% = 20.82 %