Professional Documents
Culture Documents
An Analysis of The Probability Distribution of The Heights of Grade 11 Students by CUDIO Et Al, 2020
An Analysis of The Probability Distribution of The Heights of Grade 11 Students by CUDIO Et Al, 2020
An Inferential Study
Presented to
by Group 1
March 2020
PALAWAN STATE UNIVERSITY
LABORATORY SENIOR HIGH SCHOOL
SCIENCE, TECHNOLOGY, ENGINEERING, AND MATHEMATICS
TABLE OF CONTENTS
List of Figures iv
I. INTRODUCTION
B. Plan of Investigation 2
II. INFORMATION/MEASUREMENTS
B. Data Instruments 3
C. Data Organization 4
D. Analysis of Data 4
ii
PALAWAN STATE UNIVERSITY
LABORATORY SENIOR HIGH SCHOOL
SCIENCE, TECHNOLOGY, ENGINEERING, AND MATHEMATICS
IV. VALIDITY 22
Bibliography 30
iii
PALAWAN STATE UNIVERSITY
LABORATORY SENIOR HIGH SCHOOL
SCIENCE, TECHNOLOGY, ENGINEERING, AND MATHEMATICS
LIST OF TABLES
TABLE IID.1 Frequency Distribution Table of the Heights of Sampled Female Students 4
TABLE IID.2 Frequency Distribution Table of the Heights of Sampled Male Students 5
TABLE IIIAa.1 Detailed Frequency Distribution Table of Data from Female Respondents 7
TABLE IIIAb.1 Values of Specific Left-Tailed Probabilities for Female Students’ Heights 8
TABLE IIIAb.2 Probability Distribution per Class Interval for Female Students’ Heights 9
TABLE IIIAc.1 Values of Correlational Variables for the Female Heights and Ages 10
TABLE IIIBa.1 Detailed Frequency Distribution Table of Data from Male Respondents 11
TABLE IIIBb.1 Values of Specific Left-Tailed Probabilities for Male Students’ Heights 13
TABLE IIIBb.2 Probability Distribution per Class Interval for Male Students’ Heights 13
TABLE IIIBc.1 Values of Correlational Variables for the Male Heights and Ages 15
TABLE IIICa.1 Detailed Frequency Distribution Table of Data from Sampled Students 16
TABLE IIICb.2 Probability Distribution per Class Interval for Respondents’ Heights 18
iv
PALAWAN STATE UNIVERSITY
LABORATORY SENIOR HIGH SCHOOL
SCIENCE, TECHNOLOGY, ENGINEERING, AND MATHEMATICS
LIST OF FIGURES
FIGURE IIIAb.2 Comparison between the Two P(X) Value Sets for the Female Data Set 9
FIGURE IIIAc.1 Scatter Plot Graph comparing Female Heights and Ages 10
FIGURE IIIBb.2 Comparison between the Two P(X) Value Sets for the Male Data Set 14
FIGURE IIIBc.1 Scatter Plot Graph comparing Male Heights and Ages 15
FIGURE IIICb.2 Comparison between the Two P(X) Value Sets for the Sample Data Set 19
FIGURE IIICc.1 Scatter Plot Graph comparing the Respondents’ Heights and Ages 20
v
PALAWAN STATE UNIVERSITY
LABORATORY SENIOR HIGH SCHOOL
SCIENCE, TECHNOLOGY, ENGINEERING, AND MATHEMATICS
I. INTRODUCTION
The researchers were tasked to measure the heights of 30 male and 30 female students randomly
selected from Palawan State University - Laboratory Senior High School Department and discover
whether the heights of the students follow a standard normal distribution.
When the researchers have measured the heights of 30 male and 30 female students randomly
sampled from Palawan State University – Laboratory Senior High School, the researchers
organized the data and created a histogram that describes the data’s probability distribution. Then,
the researchers differentiated the histogram to a standard normal distribution relative to the data
gathered. In order to accomplish this, the mean and standard deviation was calculated by the
researchers. For better analyses, the researchers also accounted for the median and the mode in an
attempt to verify if the heights follow a standard normal distribution or not.
The researchers created three inferential analyses from the data. The first, being whether the
heights of Male G11 Students follow a standard normal distribution; The second is whether the
heights of Female G11 Students follow a standard normal distribution; Lastly, the third is if
combined as a whole, the G11 Students’ heights follow a standard normal distribution.
In order for the study to be more holistic, the researchers then decided to determine whether
age is also a factor which leads to outlier variables. This was done by simply creating a 2-variable
correlational analysis on the data obtained by the researchers, namely the student’s heights and
their respective ages. This process was done mainly because of the idea of ‘puberty’ playing a role
on the study. Through this, the researchers will then determine whether age should be the basis for
the demographic of the participants.
1
PALAWAN STATE UNIVERSITY
LABORATORY SENIOR HIGH SCHOOL
SCIENCE, TECHNOLOGY, ENGINEERING, AND MATHEMATICS
B. Plan of Investigation
The researchers are tasked to measure the heights of the students. However, due to constraints
brought about by the COVID-19 Pandemic and the notice on “Social Distancing” implemented by
the City Government, the researchers simply sampled 30 male and female students and asked for
their height and age via an Online Survey Questionnaire.
Regarding the sampling, the researchers only took a sample from the 5 sections of G11
Students. 3 of which are from STEM, namely the sections of Commitment, Courage, and Courtesy,
and ABM, composed of the sections of Creativity and Innovativeness. From a list of names, the
researchers used Simple Random Sampling with regard to the specific demographic choosing set
by the researchers.
The researchers decided to take a sample only from the G11 Students of Palawan State
University - Laboratory Senior High School because data that could be gathered from Grade 12
would greatly serve as outlier variables and would therefore result to a definite conclusion that the
heights of the students would not follow a standard normal distribution.
Therefore, the expected demographical distribution of the participants should follow the table
below:
2
PALAWAN STATE UNIVERSITY
LABORATORY SENIOR HIGH SCHOOL
SCIENCE, TECHNOLOGY, ENGINEERING, AND MATHEMATICS
II. INFORMATION/MEASUREMENTS
The researchers utilized an Online Survey Questionnaire via “Google Forms” as a medium for
data gathering. After randomly sampling 6 male and 6 female students for every G11 Section in
Palawan State University – Laboratory Senior High School, the researchers sent a link to the chosen
respondents. The Online Survey Questionnaire asked for the respondents’ Section, Gender, Age,
and Height, in inches.
Afterwards, the data was summarized in a Microsoft Excel Sheet for easier analyses and data
treatment. Then, the researchers calculated for the Measures of Central Tendencies when the data
was treated as Grouped Data, while the Measures of Absolute Location were calculated when the
Data was treated as Ungrouped Data.
B. Data Instruments
The researchers used Tables, such as Frequency Distribution Tables, and Graphs, specifically
Normal Distribution Graphs, Histograms, and Scatter Plot Graphs, to discover more about the
heights of students from Palawan State University – Laboratory Senior High School and the trends
that they follow. These figures helped the researches derive a generalization towards a conclusion
on whether the heights of the students follow a normal distribution and whether the outlier variables
are affected by age.
The Mathematical Calculations that the data underwent are the following:
• The raw data was treated in such a way that it was grouped by the researchers into
Frequency Distribution Tables when the Mean, Median and Mode was accounted for, since
the data are continuous.
• When the data mean and standard deviation were accounted for in the creation of a
standard normal distribution reference, the data were treated as ungrouped.
3
PALAWAN STATE UNIVERSITY
LABORATORY SENIOR HIGH SCHOOL
SCIENCE, TECHNOLOGY, ENGINEERING, AND MATHEMATICS
C. Data Organization
Since the data is continuous in nature, the researchers decided to group the data and create
Frequency Distribution Tables of the data they were able to gather. The data was organized and the
researchers created three Frequency Distribution Tables, specifically, two by gender and one to
summarize the whole data set.
The Tables were made according to the following guidelines considered to be the “Standard
Procedure” in making Frequency Distribution Tables:
• The class size is determined as the value of 𝑅𝑎𝑛𝑔𝑒⁄𝑘 where 𝑘 is the least integer
value such that 2𝑘 > 𝑛; and 𝑛 = number of data values in the set;
• When a data value 𝑥 is set to be at the boundary of two classes, 𝑥 belongs to the class
with lesser value; and
• In the event that a Median Class is to be determined, the value 𝑛⁄2 shall determine
where the Median Class is.
D. Analysis of Data
The tables that were made by the researchers are exhibited below.
TABLE IID.1 Frequency Distribution Table of the Heights of Sampled Female Students
4
PALAWAN STATE UNIVERSITY
LABORATORY SENIOR HIGH SCHOOL
SCIENCE, TECHNOLOGY, ENGINEERING, AND MATHEMATICS
TABLE IID.2 Frequency Distribution Table of the Heights of Sampled Male Students
These are simply organizations of the data gathered by the researchers. The Frequency
Distribution Tables were made via the “Standard Procedure” of constructing Frequency
Distribution Tables for ungrouped data.
5
PALAWAN STATE UNIVERSITY
LABORATORY SENIOR HIGH SCHOOL
SCIENCE, TECHNOLOGY, ENGINEERING, AND MATHEMATICS
The researchers decided to treat the data variously. The data set was grouped in three various
data sets: 1) data from the male respondents; 2) data from the female respondents; and 3) data of
all the respondents altogether.
Afterwards, the analyses and conclusions were made via three methods/processes. First, the
researchers obtained the Measures of Central Tendencies, specifically, the Mean, Median, and
Mode. The researchers used these values to check the respective data sets on whether the Mean,
Median and Mode are far apart or nearly coincide.
The second method is the comparison between the Probability Distribution of the Data Set and
a Normal Distribution based on the Mean and Standard Deviation of the said Data Set. The two
Probability Densities are compared to determine whether the probabilities are far apart or nearly
coincide.
The third method is one of correlational analysis – the age and height data of the respondents
were observed to determine whether a strong and significant correlation exists between the age and
height of the respondents which leads to possible outlier variables to determine whether the student
should be purposively chosen with respect to age in order to assure that the heights of the sampled
students follow normal probability distribution.
From Table IID.1, the researchers created a more elaborate Frequency Distribution Table where
the class mark is added and the product between the class mark and the frequency of said class is
also included.
6
PALAWAN STATE UNIVERSITY
LABORATORY SENIOR HIGH SCHOOL
SCIENCE, TECHNOLOGY, ENGINEERING, AND MATHEMATICS
TABLE IIIAa.1 Detailed Frequency Distribution Table of Data from Female Respondents
TOTAL: 30 1820.16
∑(𝑓 ∙ 𝑚)
𝑀𝑒𝑎𝑛 =
∑𝑓
1820.16
𝑆𝑢𝑏𝑠𝑡𝑖𝑡𝑢𝑡𝑖𝑛𝑔 𝑣𝑎𝑙𝑢𝑒𝑠 𝑓𝑟𝑜𝑚 𝑡ℎ𝑒 𝑡𝑎𝑏𝑙𝑒, 𝑀𝑒𝑎𝑛 = = 𝟔𝟎. 𝟔𝟕𝟐
30
∑𝑓
−𝐵
𝑀𝑒𝑑𝑖𝑎𝑛 = 𝐿 + 2 ∙𝐶
𝐺
𝑤ℎ𝑒𝑟𝑒 𝐿 = 𝐿𝑜𝑤𝑒𝑟 𝐶𝑙𝑎𝑠𝑠 𝐵𝑜𝑢𝑛𝑑𝑎𝑟𝑦 𝑜𝑓 𝑡ℎ𝑒 𝑀𝑒𝑑𝑖𝑎𝑛 𝐺𝑟𝑜𝑢𝑝;
𝐵 = 𝑐𝑓 < 𝑏𝑒𝑓𝑜𝑟𝑒 𝑡ℎ𝑒 𝑀𝑒𝑑𝑖𝑎𝑛 𝐺𝑟𝑜𝑢𝑝; 𝐺 = 𝑓 𝑜𝑓 𝑀𝑒𝑑𝑖𝑎𝑛 𝐺𝑟𝑜𝑢𝑝; 𝑎𝑛𝑑 𝐶 = 𝐶𝑙𝑎𝑠𝑠 𝑆𝑖𝑧𝑒
𝑓𝑚 − 𝑓𝑚−1
𝑀𝑜𝑑𝑒 = 𝐿 + ∙𝐶
(𝑓𝑚 − 𝑓𝑚−1 ) + (𝑓𝑚 − 𝑓𝑚+1 )
𝑤ℎ𝑒𝑟𝑒 𝐿 = 𝐿𝑜𝑤𝑒𝑟 𝐶𝑙𝑎𝑠𝑠 𝐵𝑜𝑢𝑛𝑑𝑎𝑟𝑦 𝑜𝑓 𝑡ℎ𝑒 𝑀𝑜𝑑𝑎𝑙 𝐺𝑟𝑜𝑢𝑝; 𝑓𝑚 = 𝑓 𝑜𝑓 𝑀𝑜𝑑𝑎𝑙 𝐺𝑟𝑜𝑢𝑝;
𝑓𝑚−1 = 𝑓 𝑏𝑒𝑓𝑜𝑟𝑒 𝑀𝑜𝑑𝑎𝑙 𝐺𝑟𝑜𝑢𝑝; 𝑓𝑚+1 = 𝑓 𝑎𝑓𝑡𝑒𝑟 𝑀𝑜𝑑𝑎𝑙 𝐺𝑟𝑜𝑢𝑝; 𝑎𝑛𝑑 𝐶 = 𝐶𝑙𝑎𝑠𝑠 𝑆𝑖𝑧𝑒
𝑆𝑢𝑏𝑠𝑡𝑖𝑡𝑢𝑡𝑖𝑛𝑔 𝑣𝑎𝑙𝑢𝑒𝑠 𝐿 = 59.52 − 0.5 = 59.02; 𝑓𝑚 = 15; 𝑓𝑚−1 = 3; 𝑓𝑚+1 = 8; 𝑎𝑛𝑑 𝐶 = 3.84,
15 − 3
𝑀𝑜𝑑𝑒 = 59.02 + ∙ 3.84 ≈ 𝟔𝟏. 𝟒𝟒𝟓
(15 − 3) + (15 − 8)
60.762 + 61.068 + 61.445
𝑇ℎ𝑒 𝑀𝑒𝑎𝑛 𝑜𝑓 𝑡ℎ𝑒 𝑀𝑒𝑎𝑠𝑢𝑟𝑒𝑠 𝑜𝑓 𝐶𝑒𝑛𝑡𝑟𝑎𝑙 𝑇𝑒𝑛𝑑𝑒𝑛𝑐𝑦 𝑖𝑠: ≈ 𝟔𝟏. 𝟎𝟗𝟐
3
7
PALAWAN STATE UNIVERSITY
LABORATORY SENIOR HIGH SCHOOL
SCIENCE, TECHNOLOGY, ENGINEERING, AND MATHEMATICS
From the calculations on the previous page, the Measures of Central Tendency are near each
other. Therefore, the density of the data set in the median section of the graph is proved to follow
a normal distribution graph. However, it cannot be determined whether the data set features a
skewed graph.
From the data set under the demographic category of Female SHS Students, the following
measures have been found: 𝜇 ≈ 60.899; 𝑎𝑛𝑑 𝜎 ≈ 3.295.
TABLE IIIAb.1 Values of Specific Left-Tailed Probabilities for Female Students’ Heights
8
PALAWAN STATE UNIVERSITY
LABORATORY SENIOR HIGH SCHOOL
SCIENCE, TECHNOLOGY, ENGINEERING, AND MATHEMATICS
TABLE IIIAb.2 Probability Distribution per Class Interval for Female Students’ Heights
48 - 51.84 3 0.1
TOTAL: 30 1.0000
FIGURE IIIAb.2 Comparison between the Two P(X) Value Sets for the Female Data Set
0.3
0.2
0.1
0
48-51.84 51.84-55.68 55.68-59.52 59.52-63.36 63.36-67.2
Class Interval
9
PALAWAN STATE UNIVERSITY
LABORATORY SENIOR HIGH SCHOOL
SCIENCE, TECHNOLOGY, ENGINEERING, AND MATHEMATICS
From the observations on the Figure IIIAb.2, the researchers were able to conclude that the
probability distribution of the data set does not follow a standard normal distribution on the left
tail, but slightly similar towards the right tail. However from the graph, the researchers were able
to fortify their analysis that the densest portion of the graph is within the group of 59.52-63.36.
The values of specific Correlational Analysis values, specifically r and P-Value, were
determined by the researches to determine whether the Ages and Heights of the female respondents
are correlated significantly.
FIGURE IIIAc.1 Scatter Plot Graph comparing Female Heights and Ages
60
50
40
30
20
10
0
14.5 15 15.5 16 16.5 17 17.5 18 18.5
Age (in years)
TABLE IIIAc.1 Values of Correlational Variables for the Female Heights and Ages
10
PALAWAN STATE UNIVERSITY
LABORATORY SENIOR HIGH SCHOOL
SCIENCE, TECHNOLOGY, ENGINEERING, AND MATHEMATICS
From the values on Table IIIAc.1, the researchers were able to derive an analysis that there is
very weak to almost no positive correlation between the Ages and Heights of the female
respondents as the r is near zero. Also, this analysis should be held of great regard as it is considered
to be significant due to its relatively low P-value.
From Table IID.2, the researchers created a more elaborate Frequency Distribution Table where
the class mark is added and the product between the class mark and the frequency of said class is
also included.
TABLE IIIBa.1 Detailed Frequency Distribution Table of Data from Male Respondents
62 - 64 7 63 441
64 - 66 8 65 520
66 - 68 9 67 603
68 - 70 3 69 207
70 - 72 3 71 213
TOTAL: 30 1,984
11
PALAWAN STATE UNIVERSITY
LABORATORY SENIOR HIGH SCHOOL
SCIENCE, TECHNOLOGY, ENGINEERING, AND MATHEMATICS
∑(𝑓 ∙ 𝑚)
𝑀𝑒𝑎𝑛 =
∑𝑓
1984
𝑆𝑢𝑏𝑠𝑡𝑖𝑡𝑢𝑡𝑖𝑛𝑔 𝑣𝑎𝑙𝑢𝑒𝑠 𝑓𝑟𝑜𝑚 𝑡ℎ𝑒 𝑡𝑎𝑏𝑙𝑒, 𝑀𝑒𝑎𝑛 = ≈ 𝟔𝟔. 𝟏𝟑𝟑
30
∑𝑓
−𝐵
𝑀𝑒𝑑𝑖𝑎𝑛 = 𝐿 + 2 ∙𝐶
𝐺
𝑤ℎ𝑒𝑟𝑒 𝐿 = 𝐿𝑜𝑤𝑒𝑟 𝐶𝑙𝑎𝑠𝑠 𝐵𝑜𝑢𝑛𝑑𝑎𝑟𝑦 𝑜𝑓 𝑡ℎ𝑒 𝑀𝑒𝑑𝑖𝑎𝑛 𝐺𝑟𝑜𝑢𝑝;
𝐵 = 𝑐𝑓 < 𝑏𝑒𝑓𝑜𝑟𝑒 𝑡ℎ𝑒 𝑀𝑒𝑑𝑖𝑎𝑛 𝐺𝑟𝑜𝑢𝑝; 𝐺 = 𝑓 𝑜𝑓 𝑀𝑒𝑑𝑖𝑎𝑛 𝐺𝑟𝑜𝑢𝑝; 𝑎𝑛𝑑 𝐶 = 𝐶𝑙𝑎𝑠𝑠 𝑆𝑖𝑧𝑒
𝑓𝑚 − 𝑓𝑚−1
𝑀𝑜𝑑𝑒 = 𝐿 + ∙𝐶
(𝑓𝑚 − 𝑓𝑚−1 ) + (𝑓𝑚 − 𝑓𝑚+1 )
𝑤ℎ𝑒𝑟𝑒 𝐿 = 𝐿𝑜𝑤𝑒𝑟 𝐶𝑙𝑎𝑠𝑠 𝐵𝑜𝑢𝑛𝑑𝑎𝑟𝑦 𝑜𝑓 𝑡ℎ𝑒 𝑀𝑜𝑑𝑎𝑙 𝐺𝑟𝑜𝑢𝑝; 𝑓𝑚 = 𝑓 𝑜𝑓 𝑀𝑜𝑑𝑎𝑙 𝐺𝑟𝑜𝑢𝑝;
𝑓𝑚−1 = 𝑓 𝑏𝑒𝑓𝑜𝑟𝑒 𝑀𝑜𝑑𝑎𝑙 𝐺𝑟𝑜𝑢𝑝; 𝑓𝑚+1 = 𝑓 𝑎𝑓𝑡𝑒𝑟 𝑀𝑜𝑑𝑎𝑙 𝐺𝑟𝑜𝑢𝑝; 𝑎𝑛𝑑 𝐶 = 𝐶𝑙𝑎𝑠𝑠 𝑆𝑖𝑧𝑒
𝑆𝑢𝑏𝑠𝑡𝑖𝑡𝑢𝑡𝑖𝑛𝑔 𝑣𝑎𝑙𝑢𝑒𝑠 𝐿 = 66 − 0.5 = 65.5; 𝑓𝑚 = 9; 𝑓𝑚−1 = 8; 𝑓𝑚+1 = 3; 𝑎𝑛𝑑 𝐶 = 2,
9−8
𝑀𝑜𝑑𝑒 = 65.5 + ∙ 2 ≈ 𝟔𝟓. 𝟕𝟖𝟔
(9 − 8) + (9 − 3)
From the calculations above, the one of the Measures of Central Tendency, specifically the
Median is greatly deviated from the two other values. Therefore, the density of the data set is not
assured in the median section of the graph and it is not proven that the data set follows a normal
distribution graph. In addition to this, it cannot be determined whether the data set features a skewed
graph.
From the data set under the demographic category of Male SHS Students, the following
measures have been found: 𝜇 ≈ 66.391; 𝑎𝑛𝑑 𝜎 ≈ 1.945.
12
PALAWAN STATE UNIVERSITY
LABORATORY SENIOR HIGH SCHOOL
SCIENCE, TECHNOLOGY, ENGINEERING, AND MATHEMATICS
TABLE IIIBb.1 Values of Specific Left-Tailed Probabilities for Male Students’ Heights
TABLE IIIBb.2 Probability Distribution per Class Interval for Male Students’ Heights
13
PALAWAN STATE UNIVERSITY
LABORATORY SENIOR HIGH SCHOOL
SCIENCE, TECHNOLOGY, ENGINEERING, AND MATHEMATICS
62 - 64 7 0.2333
64 - 66 8 0.2667
66 - 68 9 0.3
68 - 70 3 0.1
70 - 72 3 0.1
TOTAL: 30 1.0000
FIGURE IIIBb.2 Comparison between the Two P(X) Value Sets for the Male Data Set
0.25
0.2
0.15
0.1
0.05
0
48-52 52-56 56-60 60-64 64-68
Class Interval
Raw Data Probability Normal Probability Values
From the observations on the Figure IIIBb.2, the researchers were able to conclude that the
probability distribution of the data set slightly follows a standard normal distribution on the left and
right tail. Also from the graph, the researchers were able to grapple an analysis that the densest
portion of the graph is within the group of 66-68, which is described by the mean and mode, but
not the median from Section IIIBa.
14
PALAWAN STATE UNIVERSITY
LABORATORY SENIOR HIGH SCHOOL
SCIENCE, TECHNOLOGY, ENGINEERING, AND MATHEMATICS
The values of specific Correlational Analysis values, specifically r and P-Value, were
determined by the researches to determine whether the Ages and Heights of the male respondents
are correlated significantly.
FIGURE IIIBc.1 Scatter Plot Graph comparing Male Heights and Ages
70
68
66
64
62
60
15.5 16 16.5 17 17.5 18 18.5
Age (in years)
TABLE IIIBc.1 Values of Correlational Variables for the Male Heights and Ages
P-value 0.730707107
From the values on Table IIIBc.1, the researchers were able to derive an analysis that there is
a strong positive correlation between the Ages and Heights of the male respondents as the r is near
the value of +1. However, this analysis should be held of low regard as it is considered to be only
relatively significant due to its relatively high P-value.
15
PALAWAN STATE UNIVERSITY
LABORATORY SENIOR HIGH SCHOOL
SCIENCE, TECHNOLOGY, ENGINEERING, AND MATHEMATICS
From Table IID.3, the researchers created a more elaborate Frequency Distribution Table where
the class mark is added and the product between the class mark and the frequency of said class is
also included.
TABLE IIICa.1 Detailed Frequency Distribution Table of Data from Sampled Students
52 - 56 1 54 54
56 - 60 5 58 290
60 - 64 23 62 1,426
64 - 68 22 66 1,452
68 - 72 6 70 420
TOTAL: 60 3,792
16
PALAWAN STATE UNIVERSITY
LABORATORY SENIOR HIGH SCHOOL
SCIENCE, TECHNOLOGY, ENGINEERING, AND MATHEMATICS
∑(𝑓 ∙ 𝑚)
𝑀𝑒𝑎𝑛 =
∑𝑓
3792
𝑆𝑢𝑏𝑠𝑡𝑖𝑡𝑢𝑡𝑖𝑛𝑔 𝑣𝑎𝑙𝑢𝑒𝑠 𝑓𝑟𝑜𝑚 𝑡ℎ𝑒 𝑡𝑎𝑏𝑙𝑒, 𝑀𝑒𝑎𝑛 = = 𝟔𝟑. 𝟐
60
∑𝑓
−𝐵
𝑀𝑒𝑑𝑖𝑎𝑛 = 𝐿 + 2 ∙𝐶
𝐺
𝑤ℎ𝑒𝑟𝑒 𝐿 = 𝐿𝑜𝑤𝑒𝑟 𝐶𝑙𝑎𝑠𝑠 𝐵𝑜𝑢𝑛𝑑𝑎𝑟𝑦 𝑜𝑓 𝑡ℎ𝑒 𝑀𝑒𝑑𝑖𝑎𝑛 𝐺𝑟𝑜𝑢𝑝;
𝐵 = 𝑐𝑓 < 𝑏𝑒𝑓𝑜𝑟𝑒 𝑡ℎ𝑒 𝑀𝑒𝑑𝑖𝑎𝑛 𝐺𝑟𝑜𝑢𝑝; 𝐺 = 𝑓 𝑜𝑓 𝑀𝑒𝑑𝑖𝑎𝑛 𝐺𝑟𝑜𝑢𝑝; 𝑎𝑛𝑑 𝐶 = 𝐶𝑙𝑎𝑠𝑠 𝑆𝑖𝑧𝑒
𝑓𝑚 − 𝑓𝑚−1
𝑀𝑜𝑑𝑒 = 𝐿 + ∙𝐶
(𝑓𝑚 − 𝑓𝑚−1 ) + (𝑓𝑚 − 𝑓𝑚+1 )
𝑤ℎ𝑒𝑟𝑒 𝐿 = 𝐿𝑜𝑤𝑒𝑟 𝐶𝑙𝑎𝑠𝑠 𝐵𝑜𝑢𝑛𝑑𝑎𝑟𝑦 𝑜𝑓 𝑡ℎ𝑒 𝑀𝑜𝑑𝑎𝑙 𝐺𝑟𝑜𝑢𝑝; 𝑓𝑚 = 𝑓 𝑜𝑓 𝑀𝑜𝑑𝑎𝑙 𝐺𝑟𝑜𝑢𝑝;
𝑓𝑚−1 = 𝑓 𝑏𝑒𝑓𝑜𝑟𝑒 𝑀𝑜𝑑𝑎𝑙 𝐺𝑟𝑜𝑢𝑝; 𝑓𝑚+1 = 𝑓 𝑎𝑓𝑡𝑒𝑟 𝑀𝑜𝑑𝑎𝑙 𝐺𝑟𝑜𝑢𝑝; 𝑎𝑛𝑑 𝐶 = 𝐶𝑙𝑎𝑠𝑠 𝑆𝑖𝑧𝑒
𝑆𝑢𝑏𝑠𝑡𝑖𝑡𝑢𝑡𝑖𝑛𝑔 𝑣𝑎𝑙𝑢𝑒𝑠 𝐿 = 60 − 0.5 = 59.5; 𝑓𝑚 = 23; 𝑓𝑚−1 = 5; 𝑓𝑚+1 = 22; 𝑎𝑛𝑑 𝐶 = 4,
23 − 5
𝑀𝑜𝑑𝑒 = 59.5 + ∙ 4 ≈ 𝟔𝟑. 𝟐𝟖𝟗
(23 − 5) + (23 − 22)
63.2 + 63.152 + 63.289
𝑇ℎ𝑒 𝑀𝑒𝑎𝑛 𝑜𝑓 𝑡ℎ𝑒 𝑀𝑒𝑎𝑠𝑢𝑟𝑒𝑠 𝑜𝑓 𝐶𝑒𝑛𝑡𝑟𝑎𝑙 𝑇𝑒𝑛𝑑𝑒𝑛𝑐𝑦 𝑖𝑠: ≈ 𝟔𝟑. 𝟐𝟏𝟒
3
From the calculations above, the Measures of Central Tendency are very near to the Mean of
the said measures. Therefore, it can be expected that the density of the data set is assured in the
median section of the graph and it is proven that the data set follows a normal distribution graph
from the values in the calculation above. In addition to this, it cannot be determined whether the
data set features a skewed graph.
From the data set under the demographic category of SHS Students, the following measures
have been found: 𝜇 ≈ 66.391; 𝑎𝑛𝑑 𝜎 ≈ 1.945.
17
PALAWAN STATE UNIVERSITY
LABORATORY SENIOR HIGH SCHOOL
SCIENCE, TECHNOLOGY, ENGINEERING, AND MATHEMATICS
TABLE IIICb.2 Probability Distribution per Class Interval for Respondents’ Heights
18
PALAWAN STATE UNIVERSITY
LABORATORY SENIOR HIGH SCHOOL
SCIENCE, TECHNOLOGY, ENGINEERING, AND MATHEMATICS
52 - 56 1 0.0167
56 - 60 5 0.0833
60 - 64 23 0.3833
64 - 68 22 0.3667
68 - 72 6 0.1
TOTAL: 60 1.0000
FIGURE IIICb.2 Comparison between the Two P(X) Value Sets for the Sample Data Set
0.25
0.2
0.15
0.1
0.05
0
48-52 52-56 56-60 60-64 64-68 68-72
Class Interval
Raw Data Probability Normal Probability Values
From the observations on the Figure IIICb.2, the researchers were able to conclude that the
probability distribution of the data set follows a standard normal distribution on the right tail,
however it slightly differs from the left tail. Also from the graph, the researchers were able to realize
that the densest portion of the graph is within the group of 60-64, which only fortifies the conclusion
set on Section IIICa.
19
PALAWAN STATE UNIVERSITY
LABORATORY SENIOR HIGH SCHOOL
SCIENCE, TECHNOLOGY, ENGINEERING, AND MATHEMATICS
The values of specific Correlational Analysis values, specifically r and P-Value, were
determined by the researches to determine whether the Ages and Heights of the sampled
respondents are correlated significantly.
FIGURE IIICc.1 Scatter Plot Graph comparing the Respondents’ Heights and Ages
50
40
30
20
10
0
14.5 15 15.5 16 16.5 17 17.5 18 18.5
Age (in years)
P-value 0.0000739339
From the values on Table IIICc.1, the researchers were able to derive an analysis that there is
very weak to almost no positive correlation between the Ages and Heights of the Sampled
Respondents as the r is near zero. Also, this analysis should be held of great regard as it is
considered to be significant due to its extremely low P-value.
20
PALAWAN STATE UNIVERSITY
LABORATORY SENIOR HIGH SCHOOL
SCIENCE, TECHNOLOGY, ENGINEERING, AND MATHEMATICS
From Section IIIA, the researchers were able to derive that there is weak correlation between
the values of the heights and ages of the Sampled Female Respondents. So, the outlier variables are
not well justified. In addition to this, the data set obtained from the Female Respondents follows
the density of the Normal Distribution in the middle section. However, it is slightly deviated within
the left tail.
From Section IIIB, the researchers were able to find out that there is a strong, yet slightly,
significant correlation between the values of the heights and ages of the Sampled Male
Respondents. So, the outlier variables can be slightly justified. Also, the researchers discovered
that the data set obtained from the Male respondents follow the probability density of certain
sections, however not all.
From Section IIIC, the researchers were able to derive that there is weak correlation between
the values of the heights and ages of all the Sampled Respondents. So, the outlier variables are not
well justified. To summarize the whole data set as well, the whole data set follows the density of
the Normal Distribution in the middle section. However, it is slightly deviated within the left tail.
These interpretations led to the discovery of the researchers that regardless of gender, the
students’ heights tend to follow a normal probability distribution. However, the correlational
variables state that the age range of Male Senior High School Students are a great factor on why
the heights of students vary greatly. Therefore, it is only fit that the researchers sampled from one
Year Level only so as the age range would be as small as possible and there would be less outlier
variables in the data set.
21
PALAWAN STATE UNIVERSITY
LABORATORY SENIOR HIGH SCHOOL
SCIENCE, TECHNOLOGY, ENGINEERING, AND MATHEMATICS
IV. VALIDITY
The validity of the study is perceptively non-questionable as the researchers have justified the
processes well and the study was able to derive valid conclusions. This is further affirmed by the
Science behind ‘puberty,’ which is considered to be relevant to this study. The correlation between
the Heights and Ages of the Respondents was observed to further fortify the concept that average
woman becomes taller within the age range of 10-16 years old, while the average man grows taller
at the age range of 13-19 years old. Hence, it is only right that the researchers chose a sample from
one Year Level only, specifically the Grade 11 Students of the STEM and ABM strands from the
Palawan State University – Laboratory Senior High School Department.
In addition to this, the online sources and tools that were used are reliable. Therefore, this study
should be held of great regard as it is very much valid.
From the study done by the researchers, they were able to reach a conclusion that the students
from Palawan State University – Laboratory Senior High School garner heights that follow a
Standard Normal Distribution when collected generally. However, when the heights of the students
are treated as two different data sets, which are gender-based, both the male and female data sets
have slightly deviated tails but somehow follow the normal distribution in a similar and relative
manner. Therefore, the researchers were able to conclude that the sampled students reflected normal
distribution in the most excellent way possible and it reflected scientific bases as well. This only
fortifies the statement: “Statistics is the grammar of Science.”
22
PALAWAN STATE UNIVERSITY
LABORATORY SENIOR HIGH SCHOOL
SCIENCE, TECHNOLOGY, ENGINEERING, AND MATHEMATICS
Appendix A
Online Questionnaire on Google Forms
23
PALAWAN STATE UNIVERSITY
LABORATORY SENIOR HIGH SCHOOL
SCIENCE, TECHNOLOGY, ENGINEERING, AND MATHEMATICS
24
PALAWAN STATE UNIVERSITY
LABORATORY SENIOR HIGH SCHOOL
SCIENCE, TECHNOLOGY, ENGINEERING, AND MATHEMATICS
25
PALAWAN STATE UNIVERSITY
LABORATORY SENIOR HIGH SCHOOL
SCIENCE, TECHNOLOGY, ENGINEERING, AND MATHEMATICS
Appendix B
Data Gathered From the 60 Participants
26
PALAWAN STATE UNIVERSITY
LABORATORY SENIOR HIGH SCHOOL
SCIENCE, TECHNOLOGY, ENGINEERING, AND MATHEMATICS
Female Innovativeness 17 63
Female Innovativeness 16 60
Male Commitment 17 63.6
Male Commitment 17 62.9
Male Commitment 16 66
Male Commitment 16 65
Male Commitment 16 66
Male Commitment 17 69
Male Courage 17 66.5
Male Courage 18 69
Male Courage 18 72
Male Courage 18 66
Male Courage 17 66
Male Courage 18 66
Male Courtesy 17 66
Male Courtesy 16 67.5
Male Courtesy 16 66.53
Male Courtesy 18 68
Male Courtesy 17 62.4
Male Courtesy 16 67.2
Male Creativity 16 64
Male Creativity 17 67
Male Creativity 16 67.2
Male Creativity 17 67
Male Creativity 17 63
Male Creativity 16 62
Male Innovativeness 17 67
Male Innovativeness 16 72
Male Innovativeness 18 71
Male Innovativeness 17 63
Male Innovativeness 16 64.8
Male Innovativeness 17 68.11
27
PALAWAN STATE UNIVERSITY
LABORATORY SENIOR HIGH SCHOOL
SCIENCE, TECHNOLOGY, ENGINEERING, AND MATHEMATICS
Appendix C
Normal Distribution Applet
28
PALAWAN STATE UNIVERSITY
LABORATORY SENIOR HIGH SCHOOL
SCIENCE, TECHNOLOGY, ENGINEERING, AND MATHEMATICS
Appendix D
Pearson’s r Interpretation Table
29
PALAWAN STATE UNIVERSITY
LABORATORY SENIOR HIGH SCHOOL
SCIENCE, TECHNOLOGY, ENGINEERING, AND MATHEMATICS
BIBLIOGRAPHY
Rumsey, Deborah. “What a p-value tells you about Statistical Data”. Statistics for Dummies.
https://www.dummies.com/education/math/statistics/what-a-p-value-tells-you-about-
statistical-data/. 2020. Web. March 2020.
Stöppler, Melissa Conrad. “Puberty First Signs, Symptoms, Ages, and Stages in Girls and Boys”.
https://www.medicinenet.com/puberty/article.htm.
WebMD, LLC, 12 Dec. 2019. Web. March 2020
30