Professional Documents
Culture Documents
Mathematics7 - Q4 - Mod30 - Analyzing and Interpreting Statistical Data - v5
Mathematics7 - Q4 - Mod30 - Analyzing and Interpreting Statistical Data - v5
Mathematics7 - Q4 - Mod30 - Analyzing and Interpreting Statistical Data - v5
7
Mathematics
Fourth Quarter – Module 30
Analyzing and Interpreting
Statistical Data
Mathematics – Grade 7
Alternative Delivery Mode
Fourth Quarter – Module 30: Analyzing and Interpreting Statistical Data
First Edition, 2020
Republic Act 8293, Section 176 states that: No copyright shall subsist in any work
of the Government of the Philippines. However, prior approval of the government
agency or office wherein the work is created shall be necessary for exploitation of
such work for profit. Such agency or office may, among other things, impose as a
condition the payment of royalties.
Borrowed materials (i.e., songs, stories, poems, pictures, photos, brand
names, trademarks, etc.) included in this module are owned by their respective
copyright holders. Every effort has been exerted to locate and seek permission to
use these materials from their respective copyright owners. The publisher and
authors do not represent nor claim ownership over them.
Published by the Department of Education: Region 10
Regional Director: Dr. Arturo B. Bayocot, CESO III
Assistant Regional Director: Dr. Victor G. De Gracia Jr., CESO V
Development Team of the Module
Author: Maribel Shiela M. Tatoy, T II
Reviewers: Darelyn L. Cajeles, T I Ronato L. Taban-ud, T III
Pelmar M. Acosta, T II Ramonita D. Mercado, MT I
Ana T. Cuevas, HT III Jessieca C. Hornejas, HT I
Nora T. Bongcawel, MTII Errol O. Taguran, HT III
Illustrator and Layout Artist: Vernie P. Bacayo, T II
Errol O. Taguran, HT III
Conformance Reviewers: Alma C. Salomon, HT III Lourgen V. Maalam, MT II
Ana T. Cuevas, HT III Errol O. Taguran, HT III
Vernie P. Bacayo, T II Stephanie Mae R. Lanzaderas, T
III
Darelyn L. Cajeles, T I Rica O. Boquecosa, SST I
Ann J. Robiato, T III Mary Joy A. Cartagena, T I
Rhina T. Cuajotor, T II
Management Team
Chairperson: Dr. Arturo B. Bayocot, CESO III
Regional Director
Co-Chairpersons: Dr. Victor G. De Gracia Jr. CESO V
Asst. Regional Director
Edwin R. Maribojoc, EdD, CESO VI
Schools Division Superintendent
Eugene I. Macahis, Jr.
OIC - Assistant Schools Division Superintendent
Mala Epra B. Magnaong, Chief ES, CLMD
Members: Marie Emerald A. Cabigas, PhD, EPS II - LRMS
Bienvenido U. Tagolimot, Jr., EPS - ADM
Samuel C. Silacan, EdD, CID Chief
Ernie J. Caguindangan, EPS - Mathematics
Rone Ray M. Portacion, EdD, EPS - LRMS
Leah G. Xenos, PSDS
Tita Pita L. Bago, EdD, Principal III/District In - Charge
Agnes P. Gonzales, PDO II
Vilma M. Inso, Librarian II
The hand is one of the most symbolized parts of the human body. It is often
used to depict skill, action and purpose. Through our hands we may learn, create and
accomplish. Hence, the hand in this learning resource signifies that you as a learner
is capable and empowered to successfully achieve the relevant competencies and
skills at your own pace and time. Your academic success lies in your own hands!
This module was designed to provide you with fun and meaningful
opportunities for guided and independent learning at your own pace and time. You
will be enabled to process the contents of the learning resource while being an active
learner.
What I Need to Know This will give you an idea of the skills or
competencies you are expected to learn in
the module.
1. Use the module with care. Do not put unnecessary mark/s on any part of the
module. Use a separate sheet of paper in answering the exercises.
2. Don’t forget to answer What I Know before moving on to the other activities
included in the module.
3. Read the instruction carefully before doing each task.
4. Observe honesty and integrity in doing the tasks and checking your answers.
5. Finish the task at hand before proceeding to the next.
6. Return this module to your teacher/facilitator once you are through with it.
If you encounter any difficulty in answering the tasks in this module, do not
hesitate to consult your teacher or facilitator. Always bear in mind that you are not
alone.
We hope that through this material, you will experience meaningful learning
and gain deep understanding of the relevant competencies. You can do it!
Table of Contents
Lesson 2 ---------------------- 11
This module is designed and written with you in mind. It is here to help
you develop your skill in analyzing and interpreting data using the statistical
measures you have learned from the previous modules. The scope of this
module permits it to be used in many different ways. The language recognizes
the diverse vocabulary level of students. The lessons are arranged to follow
the standard sequence of the course. But the order in which you read them
can be changed to correspond with the textbook you are now using.
Based on the competency, this module will help you use appropriate
statistical measures in analyzing and interpreting statistical data (M7SP-IVj-1).
Specifically, after going through this module, you are expected to:
Directions: Read the questions carefully and write the letter of the answers
on
your answer sheet.
1. Which measure of central tendency is best to use when there are extreme
scores in the data?
A. mean B. median C. mode D. variance
2. Which statistical measure is used in finding out the brand of the most
saleable canned sardines in a grocery store?
A. mean B. mode C. range D. standard deviation
3. What is the most reliable measure of variability?
A. average deviation C. standard deviation
B. range D. variance
4. Which measure of variability considers only the highest and the lowest
scores in the data set?
A. average deviation C. standard deviation
B. range D. variance
5. For the set of data consisting 5, 5, 6, 7, 7, which statement is correct?
A. mean = mode C. median = mode
B. mean = median D. median > mean
6. Which measure/s of central tendency is best to use in finding out how the
students of a certain city get to school:
I. Mean II. Median III. Mode?
A. I only B. III only C. I and II D. II and III
7. Given is the set of values 4, 5, 4, 13, 6. Dan said that the number that best
represents the data is 6.4. Rica, on the other hand, insisted that the best
representation of the data is 5. Who is correct? Explain.
A. Dan. The best measure to represent the data is always the mean.
B. Rica. The best measure to represent the given data is the median
because of the presence of extreme value 13.
C. Both Dan and Rica. Any measure of central tendency can be used in
representing a given set of data.
D. Both Dan and Rica. Their computations for the mean and median
respectively are both correct.
For numbers 8-10, refer to the situation given below.
Julius grows two different types of tomato plant in his garden.
For one week, he keeps a record of the number of tomatoes he picks from
each type of tomato plant.
Day Mon Tue Wed Thu Fr Sat Sun
i
Type 5 5 4 1 0 1 5
A
Type 3 4 3 3 7 9 6
B
8. Find the mean, median, and mode for the Type A tomato plants.
A. mean = 2; median = 3; mode = 4
B. mean = 2; median = 4; mode = 4
C. mean = 3; median = 4; mode = 5
D. mean = 3; median = 5; mode = 6
9. Find the mean, median, and mode for the Type B tomato plants.
A. mean = 2; median = 3; mode = 4
B. mean = 3; median = 4; mode = 5
C. mean = 5; median= 3; mode = 2
D. mean = 5; median= 4; mode = 3
10. Which measure would you use to justify that there is no difference
between the two types?
A. mean C. mode
B. median D. both A and B
What’s In
Let’s Do It This Way: Answer the given problem on your answer sheet.
Find the:
What’s New
What Is It
mean
median
mode
If you take a look at the plot below for the values in Set B, you would
notice that the scores are positioned closely to each other except for one
score, that is, 98.
This score is located
extremely far from
other scores.
The score 98 which is far different from the other scores is referred
to as the extreme value in a data set.
Set A Set B
Without Extreme Value With Extreme Value
From the data Set B, there are four scores (83, 85, 85, 87) lower than
the mean 87.67. On the other hand, there are only two scores (88,
98) higher than the mean. It implies that the mean is not the center of
the data.
From the data Set B, there are three scores (83, 85, 85) lower than
the median 86. On the other hand, there are also three scores (87,
88, 98) higher than the median. It implies that the median is the
center of the data.
Hence, for Set B, the most appropriate measure to describe the data
is the median 86.
Consider the question again: “Is it appropriate to always use the
mean in describing all kinds of data set?” The answer is: No, especially
when there is extreme value in the data set.
Since there are two values in the middle, then the average of such
two values has to be taken to get the median. So, you have
55+60
Median=
2
115
Median=
2
Median=57.5
Thus, a Grade 7 student exercises at an average time of 57.5
minutes.
2. Below is the recorded average lowest temperature, in ºC, in each of the
six cities in Northern Mindanao today. What is the average lowest
temperature the whole region has experienced today?
City Cagayan Iligan Malaybalay Oroquieta Ozamiz Tangub
de Oro
Temperature
21 22 19 18 20 17
(ºC)
Answer:
Looking at the data in the table, the recorded set of temperature of
the cities in the region does not have extreme value. So, the mean of
these data measures is appropriate to use in finding out the average
lowest temperature of the whole region.
Solving for the mean, you have
x=
∑x
n
21+22+19+18+20+17
x=
6
117
x=
6
x=19.5
Thus, the average lowest temperature Northern Mindanao has
experienced today is 19.5ºC.
3. A sari-sari store was able to sell 12 cans of sardines in one day. Which of
the brands of the canned sardines is the most saleable?
Answer:
Since the set of data is not numerical, talking only about the brand
of sardines, then the mode is to be used. In a day, the store sold 1 can of
Brand W, 3 cans of Brand X, 3 cans of Brand Y, and 5 cans of Brand Z.
Therefore, the most saleable brand of canned sardines is Brand Z.
What’s More
Answer: ______________________
median
Begin here...
Answer: ______________________
Answer: ______________________
Answer: ______________________
5. What is the most popular mobile unit among the Mobile Legend players?
Answer: ______________________
What’s New
What Is It
This suggests that Ryan and Diane had different working conditions. If
it is only the mean working time that is considered, such difference would not
be noticed. Hence, it will lead to a wrong interpretation on the data.
Illustrating the situation on the number line below, it can be seen that
Ryan’s working hours are all pretty close at the mean 9 while Diane’s working
hours are spread out. This being spread out is called variability.
Diane’s data set range, which is 9, is greater than Ryan’s data set
range which is 2. This means that there is more variability in Diane’s working
hours than in Ryan’s. Furthermore, it tells that Diane’s working hours are
more spread out than Ryan’s.
The range tells how spread out or scattered the values in the data
set are. A larger range indicates that the data are more scattered. A smaller
range indicates that the data are less scattered.
A . D .=
∑ |x i−x|,
n
you have the results below:
As you can see, the average
deviation in Diane’s set of
working hours is greater the
average deviation in Ryan’s
= =
= = working hours. This implies that
∑ |x i−x| A . D=
∑ |x i−x| Diane’s working hours are
A . D=
n n
scattered from the mean and
3 16
A . D= A . D= Ryan’s are closer to the mean.
5 5
Similar to the range being a measure of variability, the average
deviation also tells how spread out or scattered the values in the data set
are. Specifically, it describes how the data values are scattered or how
close they are from the mean.
A larger average deviation indicates that the data are more scattered
from the mean. A smaller average deviation indicates that the data are less
However,
scattered from thesince
mean.the absolute value is involved in calculating the
average deviation, it gives a little inconvenience because case-by-case
analysis and a little of higher mathematics concepts might be needed at some
point. That is why another measure of variability is much more common. It is
the variance.
Let’s Talk About the Variance
Remember the formula in solving for the variance:
2
s=
∑ ( x−x )2
n−1
Computing the variance of each data set of working hours, you have
the following results: Ryan Diane
The variance tells how close the scores of the distribution are with
respect to the mean.
A larger variance means a greater variability, that is, the data are
more spread out. A smaller variance indicates a lesser variability, that is, the
data are less spread out.
variance. In symbols, s=
√ ∑ ( x−x )2 .
n−1
To compute for the standard deviation of each data set of working
hours of Ryan and Diane, simply take the square root of the variance
obtained. So, we have
The standard deviation is just the square root of the variance. It is
also used to find out the variability of the data set.
So,Standard deviation
the different is useful
measures as the value
of variability is in the
have been same
applied in unit as the
analyzing
anddata from which
interpreting bothit sets
is computed.
of working time of Ryan and Diane. The bottom line
The smaller
here is Diane’s the standard
set of working hours deviation,
has more the less varied
variability are the This
than Ryan’s. data
values.
implies that the working hours of Diane are more spread out than the working
hours of Ryan. The conclusion of these results shall be discussed further in
the next module.
Solution:
You have already learned that the most reliable measure of variability is
the standard deviation.
Recall how the standard deviation is solved as presented in the previous
module. A table will help you organize the values obtained from the
computations.
Store A Store B
x x−x ( x−x )2 x x−x ( x−x )2
7 −1 1 7 −1.4 1.96
13 5 25 14 5.6 31.36
6 −2 4 7 −1.4 1.96
7 −1 1 8 −0.4 0.16
7 −1 1 6 −2.4 5.76
∑ x =40 ∑ ( x −x ) =32
2
∑ x =42 ∑ ( x −x ) =41.20
2
x=8 x=8.4
s=
√ ∑ ( x−x )2
n−1
s=
√ ∑ ( x−x )2
n−1
s=
√ 32
5−1
s=
√
41.20
5−1
s=
√
32
s= √ 8
4
s=
√
41.20
4
s= √ 10.3
s=2.83 s=3.21
Interpretation:
The prices in Store A are less variable than in Store B
because it has smaller standard deviation, which is 2.83,
compared to 3.21 of Store B. This implies that the mean of the
prices in Store A is more reliable than the mean of the prices
in Store B.
Example 2:
A group of Grade 7 students were asked how much they spent weekly
on a cellphone load. The following table shows the data. Analyze and interpret
the data.
Weekly Cellphone Load (in Number of Students
pesos)
50-59 4
40-49 11
30-39 21
20-29 16
10-19 8
Solution:
Recall how the standard deviation of a grouped data is computed, as
presented in the previous module. Still, a table would help you in organizing
the data obtained from the computation.
x f Class fx m xm
2 2
fx m
mark
xm
50-59 4 54.5 218.0 2970.25 11881.00
40-49 11 44.5 489.5 1980.25 21782.75
30-39 21 34.5 724.5 1190.25 24995.25
20-29 16 24.5 392.0 600.25 9604.00
10-19 8 14.5 116.0 210.25 1682.00
∑ f =n ∑ f x m=1940 ∑ f x 2m=69945
¿ 60
√
2
n ∑ f x m− ( ∑ f x m )
2
s=
n ( n−1 )
√
2
60 ( 69945 )− (1940 )
s=
60 ( 60−1 )
s=
√
4196700−3763600
60 ( 59 )
s=
√ 433100
3540
s= √ 122.34
s=11.06
Interpretation:
Based on the computation, the standard deviation is 11.06, which
means that the data set of cellphone load consumption has a large variability
and the mean is not reliable.
What’s More
26.4 x=¿
x=¿ 3.4
s=
√ ∑ ( x−x )2
n−1 s=
√ ∑ ( x−x )2
n−1
s=
√ 26.4
9
s= √ 2.93
s=1.71
Interpretation:
The set of scores of Bryan in the ten math quizzes has a standard
deviation of ________. On the other hand, the set of scores of Brenda has a
standard deviation of __________. Bryan’s scores has __________ variability
than Brenda’s scores. The mean of the scores of Bryan is ________ reliable
than the mean of the scores of Brenda.
What I Can Do
452
x=
5
x=90.4
Mean
Media
n
Mode
Assessment
Directions: Read the questions carefully and write the letter of the answers
on
your answer sheet.
1. Which measure of central tendency is best to use when there are no
extreme scores in the data?
A. mean B. median C. mode D. variance
2. Which measure of variability takes the difference between the highest
value in the data and the lowest value in the data?
A. average deviation C. variance
B. range D. standard deviation
3. Which statistical measure is used in finding out the highest number of
votes of an election for the President of the Supreme Student
Government?
A. mean B. median C. mode D. standard deviation
4. Among the measures of variability, which is the most reliable?
A. average deviation C. standard deviation
B. range D. variance
5. For the set of data consisting 6, 6, 7, 8, 8, which statement is correct?
A. mean = mode C. median = mode
B. mean= median D. median > mean
6. Which measure of central tendency is best to use to find out the highest
selling item in a gift shop: I. Mean II. Median III. Mode?
A. I only B. II only C. III only D. I and II
7. Given is the record of the number of buses that were late to arrive in a
bus terminal.
0, 7, 0, 0, 2, 5, 0, 0, 6, 0, 1, 25, 0, 10, 1, 1, 22
Rey said that the number that best represents the data is 5. Tina
argued and said that 1 is the best representation of the data. Vince
disagreed and insisted that 0 is the most appropriate number to
represent the data. Who is correct? Explain.
A. Rey. He is using the mean 5 and the mean is always the best
measure to represent any set of data.
B. Tina. The best measure to represent the given data is the median 1
because there are extreme values in the data set.
C. Vince. 0 is the mode and it is the most appropriate measure to use
in this kind of data.
D. All of them. Any measure of central tendency is appropriate to use in
describing the data.
For numbers 8-10, refer to the situation given below.
Cathy sells two different types of
apples. For one week, she keeps a
record of the number of apples sold
from each type.
8. Find the mean, median, and mode for the Type A apple.
A. mean = 2; median = 3; mode = 4
B. mean = 2; median = 4; mode = 4
C. mean = 3; median= 4; mode = 5
D. mean = 4; median= 5; mode = 6
9. Find the mean, median, and mode for the Type B apple.
A. mean = 2; median = 3; mode = 4
B. mean = 3; median = 4; mode = 5
C. mean = 4; median= 3; mode = 2
D. mean = 5; median= 4; mode = 3
10. Which measure would you use to argue that there is no difference
between the two types?
A. mean C. mode
B. median D. both A and B
Additional Activities
x f Class fx m 2
xm
2
fx m
mark
xm
46-50 1 48
41-45 10 43
36-40 10 38
31-35 16 33
26-30 9 28
21-25 4 23
∑ f =n ∑ f x m=¿¿ ¿ ∑ f x 2m=¿ ¿ ¿
¿50
Formula:
√
2
n ∑ f x m− ( ∑ f x m )
2
s=
n ( n−1 )
Answer Key
Lesson 1 Lesson 1
What’s More (Activity 3) What’s New (Activity 2)
A. 1. median 1-2.
2. mean Set Set B
3. median A
4. mode Mean 85.6 87.7
5. mode Median 85 86
Mode 85 85
What’s In
1. a. 10.63 3. a. mean
b. 11.50 b. median
s=1.75
The set of scores of Bryan
in the ten math quizzes has a
standard deviation of 1.71. On the
other hand, the set of scores of
Brenda has a standard deviation of
1.75. Bryan’s scores has lesser
Lesson 2
variability than Brenda’s scores.
The mean of the scores of Bryan is What’s New (Activity 5)
more reliable than the mean of the
a. 9
scores of Brenda.
b. 9
c. The two means
are equal.
d. It is Ryan’s
working time that
are closer to the
mean.
References
BOOKS:
Calaca, Uy, et.al. Statistics and Probability. Vibal. Chapter 2. pp. 59-65.
LINKS:
https://www.youtube.com/watch?v=hQ6BnQkD43g
https://www.youtube.com/watch?v=fvgDqVda9L8
https://www.youtube.com/watch?v=bpyU0Q7L3jo
https://www.youtube.com/watch?v=IArn-sKC8Mc
https://www.cimt.org.uk/projects/mepres/book7/bk7i18/bk7_18i2.htm
For inquiries or feedback, please write or call: