Professional Documents
Culture Documents
Stat Module 4
Stat Module 4
With the knowledge of these statistical tools, the student in research will be able to present and
describe the result of the research output in a meaningful manner.
DESCRIPTIVE STATISTICS
Lesson 1 Measures Of Central Tendency
Lesson 2 Measures of Central Location
Lesson 3 Measures of Variation & Disperson
Weighted Mean equation is a statistical method which calculates the average by multiplying the weights with its
respective mean and taking its sum. It is a type of average in which weights are assigned to individual values
in order to determine the relative importance of each observation.
Weighted mean is calculated by multiplying the weight with the quantitative outcome associated with it and
then adding all the products together. If all the weights are equal, then the weighted mean and arithmetic
mean will be the same.
Where
x is the value
Step 1: List the numbers and weights in tabular form. Presentation in tabular form is not compulsory but
makes the calculations easy.
Step 2: Multiply each number and relevant weight assigned to that number (w1 by x1, w2 by x2 and so on)
Step 5: Divide the total of the values obtained in Step 3 by the sum of the weights obtained in Step 4
(∑x1wi/∑wi)
∑( xi)(wi)
Xw =
∑ wi
= 1950
23
Xw = 84.78
Example 2:
The following data presents the results of a survey made by a team of researchers on the extent by
which Enhanced Community Quarantine due to COVID 19 contributed to changes in the way of life of the
residents of Tabuk City.
The research made use of four point scale in categorizing the data collected from a sample of 150
Tabukeños.
SCALE LIMITS DECRIPTION SYMBOL
4 3.25 - 4.00 Very Much Contribute VMC
3 2.50 - 3.24 Much Contribute MC
2 1.25 - 2.49 Moderately Contribute MoC
1 1.00 - 1.24 Less Contribute LC
VM Mo
C MC C LC
The Enhanced Community Quarantine . . . (4) (3) (2) (1)
1. Provided an opportunity for families to have closer relationship among the
members. 53 42 25 30
2. Increased awareness on the importance of life 55 40 22 33
3. Realization on the importance of sensitivity to the needs of others 58 41 31 20
4. Increased awareness in going back to basics. 49 33 28 40
5. It brought about a new normal way of life. 44 52 20 34
Required: Compute (1) the weighted mean for each indicator, and (2) the total average weighted mean for
the survey. Describe the data using the description equivalent no. 1 & 2.
Answers:
The Enhanced Community Quarantine . . . VMC MC MoC LC
16 Prepared by: Leah Rosete-Garming, MPA
(4) (3) (2) (1) Xw VMC
1. Provided an opportunity for families to have closer 53 42 25 30
relationship among the members. (212) (126) (50) (30) 2.79 MC
55 40 22 33
2. Increased awareness on the importance of life (220) (120) (44) (33) 2.78 MC
3. Realization on the importance of sensitivity to the needs 58 41 31 20
of others (232) (123) (62) (20) 2.91 MC
49 33 28 40
4. Increased awareness in going back to basics. (196) (99) (56) (40) 2.61 MC
44 52 20 34
5. It brought about a new normal way of life. (176) (156) (60) (34) 2.84 MC
TOTAL AVERAGE WEIGHTED MEAN 2.79 MC
Solution: Process:
1. Multiply the frequency by the weight of the indicator
Indicator 1 1st Column (53 X 4) = 212
nd
2 Column (42 X 3) = 126
3rd Column (25 X 2) = 50
4th Column (30 X 1) = 30
Total 418
2. Divide the total with the total no. of respondents (n) 150
∑( xi)(wi) 418
Xw = = = 2.786 Much Contribute (MC)
∑ wi 150
Description of the Data: The total average weighted mean of 2.79 indicates that the Tabukeños
believed that the
ECQ implemented because of COVID 19 much contributed to the changes in the
way of life in Tabuk City. This can also be observed in the computed weighted
mean in all indicators. It can be noted that “Realization on the importance of
sensitivity to the needs of others” was rated as the highest with a weighted mean
of 2.91 though it has the same description of “much contribute”.
Specific Objectives: At the end of this lesson the student will be able to:
1. develop the ability to follow the step by step procedure in the computation of the measures of central
location.
2. gain knowledge and understanding on the use and or application of the measures of central location.
3. To appreciate and be able to describe the data using the measures of central location.
When data is arranged in ascending or descending order, it can be divided into various parts by different
values such as quartiles, deciles and percentiles. These values are collectively called quantiles and are the
extension of median formula which divides data into two equal parts.
Since the basic purpose of these partition values is to divide data into different parts therefore a relationship
exists between them. This relationship is given below and is elaborated with the help of simple problems.
Q1 = n /4
Q2 = 2n/4
Q3 = 3n/4
91 62 48 94 89 91 85 98
61 93 83 52 100 73 43 69
46 90 59 51 92 68 78 77
62 99 96 59 70 49 50 42
54 76 66 62 59 54 45 46
Solution: Arrange the scores from the lowest to the highest as indicated below
18 Prepared by: Leah Rosete-Garming, MPA
42 48 54 6 68 77 90 94
1
43 49 54 6 69 78 91 96
2
45 50 59 6 70 83 91 98
2
46 51 59 6 73 85 92 99
2
46 Q1 10th 52 59 Q2 20th 6 76 Q3 30th 89 93 100
6
Q1 = n /4 Q2 = 2n/4 Q3 = 3n/4
= 10 = 80/4 = 120/4
= 20 = 30
The following are the formulas for Q1 Q2 and Q3 under grouped data:
1n
−F
Q1 = L + ( 4 )i
f
2n
−F
Q2 = L +( 4 )i
f
3n
−F
Q3 = L +( 4 )i
f
Thus,
kn
−F
Qk = L +( 4 )i
f
Where:
Scores f F
First Quartile
1n
−F 1n
Q1 = L +( 4 )I Solve for −¿= 50/4
4
f
= 12.5
12.5−9
= 24.5 + ( )5
12
3.5
= 24.5 + ( )5
12
17.5
=24.5 +
12
=24.5 + 1.46
= 25.96
Second Quartile
2n
−F 2n
Q2 = L +( 4 )I Solve for −¿= 100/4
4
f
= 25
25−21
= 29.5 + ( )5
10
4
= 29.5 + ( )5
10
20
=29.5 +
10
=29.5 + 2
= 31.5
3n
−F 3n
Q3 = L +( 4 )I Solve for −¿= 3 (50)/4
4
f
= 150/4 = 37.5
37.5−31
= 34.5 + ( )5
11
6.5
= 34.5 + ( )5
11
32.5
=34.5 +
11
=34.5 + 2.95
= 37.45
Students with the lowest scores of 15 to 25 will be categorized under letter D; from 26-31, group C, from 32-36,
group B from 37 to the highest score of 49, group A.
Dk = kn/10
Where:
D= the decile
k= from 1, 2, 9
n =the sample size
Example 1 Using the same data from the previous example on quartiles (arranged in array from lowest to
highest). Solver for D3,D5, and D8.ution: Arrange the scores from the lowest to the highest as indicated
below
42 48 54 61 68 77 90 94
43 49 12th 54 D3 62 69 78 32nd 91 D8 96
45 50 59 62 70 83 91 98
46 51 59 62 73 85 92 99
= 12 = 20 = 32
Deciles are used to divide a distribution into ten equal parts. These are denoted by D1, D2, D5 . . .D9
The following are the formulas for D1 D2, D5 and D9 under grouped data:
1n
−F
D1 = L + ( 10 )i
f
2n
−F
D2 = L +( 10 )i
f
5n
−F
D5 = L +( 10 )i
f
9n
−F
D9 = L +( 10 )i
f
Thus,
kn
−F
Dk = L +( 10 )i
f
Where:
Example 2 Find the values of D1, D5 and D9 from the given frequency distribution of the scores in a
History class of fifty students.
First Decile
1n
−F 1n
D1 = L +( 10 )I Solve for −¿= 1(50)/10
10
f
= 50/10 =5
5−4
= 19.5 + ( )5
5
1
= 19.5 + ( )5
5
5
=19.5 +
5
=19.5 + 1
= 20.5
Fifth Decile
5n
−F 5n
D5 = L +( 10 )I Solve for −¿= 5(50)/10
10
f
= 250/10 = 25
25−21
= 29.5 + ( )5
10
4
= 29.5 + ( )5
10
20
=29.5 +
10
=29.5 + 2
Ninth Decile
9n
−F 9n
D5 = L +( 10 )I Solve for −¿= 9(50)/10
10
f
= 450/10 = 45
45−42
= 39.5 + ( )5
6
3
= 39.5 + ( )5
6
15
=39.5 +
6
=39.5 + 2.5
= 42
One of the uses of the deciles is to divide the distribution into 10 sub-groups. It can be used to divide the
distribution into 5 sub-groups by computing the values D2, D4, D6, and D8
Example:
Scores f F
45-49 2 50
40-44 6 48
D8 L 34.5 35-39 11 42
D6 L 29.5 30-34 10 31
D4 D2 L 24.5 25-29 12 21
20-24 5 9
15-19 4 4
N = 50
Second Decile
2n
−F 2n
D2 = L +( 10 )I Solve for −¿= 2(50)/10
10
f
= 100/10 = 10
10−9
= 24.5 + ( )5
12
5
=24.5 +
12
=24.5 + .42
= 24.92
Fourth Decile
4n
−F 4n
D4 = L +( 10 )I Solve for −¿= 4(50)/10
10
f
= 200/10 = 20
20−9
= 24.5 + ( )5
12
11
= 24.5 + ( )5
12
55
=24.5 +
12
=24.5 + .4.58
= 29.08
Sixth Decile
6n
−F 6n
D6 = L +( 10 )I Solve for −¿= 6(50)/10
10
f
= 300/10 = 30
30−21
= 29.5 + ( )5
10
9
= 29.5 + ( )5
10
45
=29.5 +
10
=29.5 + .4.5
= 34
8n
−F 8n
D8 = L +( 10 )I Solve for −¿= 8(50)/10
10
f
= 400/10 = 40
40−31
= 34.5 + ( )5
11
9
= 34.5 + ( )5
11
45
=34.5 +
11
=34.5 + .4.09
= 38.59
The above table indicates that scores from 39-49 will be considered group A; scores from 34-38, B;
from 29-33, group C; from 25-28, D and 15-24 group E.
Where:
P= the percentile
k= from 1, 2, 3 99
n= the sample size
Example 1 Using the same data from the previous example on quartiles (arranged in array from lowest to
highest). Solve for P50, P66, and P98.
ution: Arrange the scores from the lowest to the highest as indicated below
42 48 54 61 68 26th 77 P66 90 94
43 49 54 62 69 78 91 96
45 50 59 62 70 83 91 98
th
46 51 59 62 73 85 92 39 99 P98
46 52 59 20th 66 P50 76 89 93 10
0
= 20 = 26.4 = 39.2
It can be noted that the position of the score of 66 in the above distribution are the same when the following
measures are computed such as the Md = Q2 = D5 = P50
Md = Q2 = D5 = P50
Md = 20th = 66
Q2 = 20th = 66
D5 = 20th = 66
P50 = 20th = 66
Percentiles are the ninety-nine score points which divide a distribution into one hundred equal parts. For
example, the 2nd percentile (P2) separates the lowest 2% from 98%. The formula for percentile is shown below.
kn
−F
Pk = L +( 100 )i
f
Where:
The following are the formulas for D1 D2, D5 and D9 under grouped data:
33 n
−F
P33 = L + ( 100 )i
f
66 n
−F
P66 = L +( 100 )i
f
75 n
−F
P75 = L +( 100 )i
f
Example 1. Solve for P20, P50, and P80 from the frequency distribution of the scores of 50 students in
History.
Scores f F
27 Prepared by: Leah Rosete-Garming, MPA
45-49 2 50
40-44 6 48
P80 L 34.5 35-39 11 42
P50 L 29.5 30-34 10 31
P20 L 24.5 25-29 12 21
20-24 5 9
15-19 4 4
n = 50
20th Percentile
20 n
−F 20 n
P20 = L +( 100 )I Solve for −¿=20(50)/100
100
f
= 1000/100 = 10
10−9
= 24.5 + ( )5
12
1
= 24.5 + ( )5
12
5
=24.5 +
12
=24.5 + ..41
= 24.91
50th Percentile
50 n
−F 50 n
P50 = L +( 100 )I Solve for −¿=50(50)/100
100
f
= 2500/100 = 25
25−21
= 29.5 + ( )5
10
4
= 29.5 + ( )5
10
20
=29.5 +
10
=29.5 + ..2
= 31.5
80 n
−F 80 n
P80 = L +( 100 )I Solve for −¿=80(50)/100
100
f
= 4000/100 = 40
40−31
= 34.5 + ( )5
11
9
= 34.5 + ( )5
11
45
=34.5 +
11
=34.5 + ..4.09
= 38.59
In order to describe fully a distribution, it is necessary to present the mean or many of the measure of central
tendency. To give more meaning to these measure, there is nother measure of spread, more popularly known
as measure of dispersion or measures of variation composed of the range, average deviation, standard
deviation, quartile deviation and variance.
Scores f
45-49 49.5 2
40-44 6
35-39 11
30-34 10
25-29 12
20-24 5
14.5 15-19 4
n = 50
The range is the least reliable of the measures and is used only when one is in a hurry to get a measure of
variability. It may be used with ordinal, interval or ratio data.
∑f (m−x)
AD¿
n
Where:
AD = average deviation
∑ f /m−x /¿ = sum of the products of the frequency times the difference between the midpoint
and the mean
n = sample size
Scores f m fm / m- x /
45-49 2 47 94 15.5
40-44 6 42 252 10.5
35-39 11 37 407 5.5
30-34 10 32 320 .5
25-29 12 27 324 - 4.5
20-24 5 22 110 - 9.5
14.5 4 17 68 -14.5
n = 50 ∑fm = 1575 ∑f/m- x /=319.0
AD¿∑f/m- x / X = ∑ fm
n n
= 319 = 1575
50 50
= 6.38 = 31.50
Formula:
2
(∑ fd)
SD =
i
√ ∑ f d 2−
n−1
n
Example:
Scores f d fd fd2
45-49 2 +3 +6 +18
40-44 6 +2 +12 +29 +24
35-39 11 +1 +11 +11
30-34 10 0 0 0
25-29 12 -1 -12 +12
20-24 5 -2 -10 -34 +20
14.5 4 -3 -12 +36
n = 50 ∑fd2 = -5 ∑fd2 =121
Solution:
(∑ fd)2
SD =
i
√ ∑ f d 2−
n−1
n
(−5)2
SD =
5
√ 121−
50
50−1
121−.5
SD = 5
√ 49
120.5
SD = 5
√ 49
SD = 5 √ 2.46
SD = 5 ¿
SD = 7.85
The standard deviation is used whenever a distribution approximates a normal distribution. It is the most
reliable measure of variability which is used with interval and ratio data.
ACTIVITY:1
The following data is a result of the survey on the students’ level of awareness of the serious symptoms
of COVID 19. The survey was administered to the senior students of public and private secondary
schools in Tabuk who were selected at random. Total number of samples considered was 120.
M
VMA A MoA LA
Serious Symptoms of COVID 19 (4) (3) (2) (1)
1. Trouble breathing 47 20 25 28
2. Constant pain or pressure in your
chest 44 26 30 20
3. Bluish lips or face 22 27 31 40
4. Sudden confusion 25 33 18 44
5. Sore throat 62 35 15 8
6. Headache 52 40 19 9
7. Chills, sometime with shaking 20 29 22 49
8. Loss of smell or taste 23 32 36 29
9. Nausea or vomiting 23 28 30 39
10. Diarrhea 43 17 25 35
REQUIRED:
ACTIVITY:2
The following frequency distribution presents the training experiences in years of 50 faculty members of
St. Louis College of Bulanao, Inc.
Symbol
Mean X
Median Md
Quartile 1 Q1
Quartile 3 Q3
IV. Assessment
1. In one paragraph give reactions/comments on whether the objectives of the lessons presented
were attained. Explain.
References:
Arao, Rosalia R. , et. al. Statistics (Based on CMO 03 Series of 2007) Rex Book Store. Reprinted
2011.
Broto, Antonio S. Statistics Made Simple, University of Eastern Philippines, Catarman, Northern
Samar, 2003.
Calmorin, Laurentina Paler, Research and Thesis Writing with Statistics Computer Application, Revised
Edition, Rex Book Store, 2016.
Downie N.M. and Heath Robert W. Basic Statistical Methods, Basic Statistical Methods, National Book
Store 1984.
Fraenkel, Jack R and Wallen Norman E., How to Design and Evluate Research in Education, McGraw-
Hill International Edition, 2006
Yazon Alberto D, Callo Eden C and Buenvinida, Lerma P. Learning Guide in Methods of Research.
Wiseman’s Book Trading, Inc. 2019
Data on COVID19 retrieve from https//endcov.ph/dashboard