Professional Documents
Culture Documents
Statistics and Probability: Lesson 1
Statistics and Probability: Lesson 1
Statistics and Probability: Lesson 1
Quarter 3 – Week 4
Module 2: Normal Distribution
Solution:
What’s New
Mrs. Bastillada conducted a test survey about determining the current level of students’ learning on
Geometry. After checking the test questionnaires, she was amazed that a student attained 50 correct
answers. What is the corresponding z-value of the student’s score if the sample mean of the test results
was 45 and standard deviation is 6?
Reflective Questions:
1. How will you find the probability of the data if data is too large?
____________________________________________________________________________
2. Can you make use of the normal curve to find for the probability of a large value? How?
____________________________________________________________________________
What Is It
Let us begin understanding the z-scores by acknowledging that for a given distribution,
it is more preferred to have a larger set of data in order to make good generalizations.
However, at times in a given distribution, raw scores may be composed of large values
and large values cannot be accommodated at the baseline of the normal curve. Thus,
the raw scores have to be transformed into z-scores in order to get meaningful decisions
relative to the concepts of finding the equivalent percentage and probability of the given
measure of value from the mean.
For any population, the mean and the standard deviation are fixed. This gives the way to understand that
for every raw score X, there corresponds an exactly one z-score value, and vice versa. Therefore, if we
wish to find the percentage associated with X, we must find its matched z-value using the z formula.
1|P age
The z formula is given by:
The area of the region under the normal curve represents the probability or percentage or proportion of a
given measurement value. It is computed by subtracting the measurement X to the sample Ẍ or population
mean, then dividing the result by the standard deviation. The resulting value gives the z-score. The z-
score indicates the distance between a given measurement X and the mean expressed in standard
deviations. It locates either within a sample or within a population. But for now, a readily available z-Table
is made to obtain the corresponding area given a z-score.
Steps in finding the z-score given the mean (µ) , standard deviation ( σ ) and the
measurement (X)
0.5
39 41 43 45 46 47 49
2|P age
2. Leslie got a score of 68 in an examination she participated. What is the corresponding z-
value of her score if the result has mean = 75 and standard deviation= 5.
Steps Solution
a. Use the formula for finding the z-
scores of population data
b. Check the given values µ=75,σ=5, X=68
c. Substitute the given values in the z = 68− 75
Formula 5
d. Compute the z- value z = -7 = -1.4
5
Therefore, the z-value that corresponds
to the raw score of 68 is -1.4.
3. Mrs. Bastillada conducted a test survey about determining the current level of students’
learning on Geometry. After checking the test questionnaires, she was amazed that a student
attained 50 correct answers. What is the corresponding z-value of the student’s score if the
sample mean of the test results was 45 and standard deviation is 6?
Steps Solution
a. Use the formula for finding the z-
scores of population data
What I Can Do
Activity 1
1. You are applying for a job that requires a score on either a P Test or an A Test. The
company needs only one test result. You have acquired scores on both tests. You’ve scored 98
on the P Test and 112 on the A Test. Which test results will you most likely to submit? Is the
information enough for you to convince yourself on submitting your chosen test result? Explain
why.
Activity 2
What Is It
A percentile is a measure used in statistics indicating the value below which a given
percentage of observations in a group of observations fall. It is a measure of relative standing as it
measures the relationship of a measurement with respect to the rest of the data.
For instance, you were told by your teacher that your Mathematics grade for the quarter
is 92 and your grade is at the 88 th percentile. This means 88% of your classmates scored lower than 92
and 21% scored more than 92 in the subject. This explains that 88 th percentile is located at a point such
that 88% of the total population lies below that point and 12% lies above.
At this point, we are given with the area or the proportion of the area of the region under
the normal curve. We wish to identify the corresponding z-value of the given area. With this, we need the
z-Table to find the z-value required corresponding to the given area and if not found, we take the nearest
value. The number that corresponds to the first column and first row of the intersection attained from the
locating the given area is the required z-value.
Example 1
1. Find the 86th percentile of the normal curve. By definition, the P86 means locating an
area before or below the point. We want to know what the z-value is at this point.
Steps Solution
By definition, the P32 means locating an area before or below the point. We want to
know what the z-value is at this point.
Steps Solution
g. Describe the shaded region Since 0.92 must be located at the left
side of the curve, it must contain a
negative sign indicating to where it
should be located. Thus, the shaded
region is the 32% percentile of the
distribution. Meaning, 32% of the
distribution lies below the z-value
0.92.
5|P age
What I Have Learned
➢
A standard normal curve is a normal probability distribution has mean
equal to 0 and standard deviation equal to 1.
➢
The normal probability distribution has the following properties:
1. The curve of the distribution is a bell-shaped.
2. The curve is symmetrical about the mean.
3. The mean, median and mode are of equal values and when sketched,
they coincide at the center of the graph.
4. The width of the curve is determined by the standard deviation of the
distribution.
5. The curve extends indefinitely approaching the x-axis but never
touching it. Thus, the curve is asymptotic to the line.
6. The area of the region under the curve is 1. It represents the probability
or percentage or proportion associated with the specific sets of
measurement values.
6|P age
Statistics and Probability
Quarter 3 – Week 5
Module 3: Sampling and Sampling Distribution
Lesson 1 Random Sampling
What’s New
1. Mrs. Cruz wants to do an analysis on students' final examination scores in her 40 Grade 11-
Baguio Mathematics class for the semester. Should she consider her all the students’ scores
or only some of the students’ scores?
2. A group of researchers want to determine the average Mathematics grade for the semester of
Grade 11 students in the Philippines. Should the group consider conducting a survey to all the
Grade 11 students in the Philippines or random students of selected places?
Analysis: For case number 1, this task is only dealing with the students' final examination scores in
one specific class with 40 students. The intent is not to the students' final examination scores of students
in a much bigger population. Since the teacher has the entire population available for this situation, she
should use the all of them. In case number 2, the population is extremely large. There is actually an
impractical and an inconvenient way of obtaining all of the data in the population. You simply will not
have all of the data available for your use especially if you only have a limited time. You will need to
use a sample of the population.
What is It
Population is the group you want to generalize. It consists of all the members of the group you
are interested in. Sample is the subset from the population you want to examine. A population
commonly contains too many individuals to study conveniently and practically, so an investigation is
often restricted to one or more samples drawn from it. A well-chosen sample will contain most of the
information about a particular population parameter but the relation between the sample and the
population must be such as to allow true inferences to be made about a population from that sample.
Sampling is a process used in statistical analysis in which a predetermined number of
observations are taken from a larger population. There are various sampling methods that allow all the
units in the population to have an equal chance of being selected. These sampling methods are
discussed below.
Page | 7
To determine the desired number of samples needed given a certain number of population, there are
different formulas can be used, one of which is Slovin’s Formula.
𝑁
Slovin’s Formula: 𝑛 = 𝑁𝑒 2
𝑁
𝑛=
𝑁𝑒 2
1000
𝑛=
1000(0.05)2
1000
𝑛=
1000(0.0025)
1000
𝑛=
2.5
𝒏 = 𝟒𝟎𝟎 (sample size)
2. Systematic Random Sampling
This can be done by listing all the elements in the population and selecting every kth element in
your population list. This is equally precise as the simple random sampling. It is often used on
long population lists. To determine the interval to be used in identifying the samples to who will
𝑁
participate in the study, use the formula K = (population/sample size).
𝑛
Example:
𝑁
If Population (N) = 2000, sample size (n) = 500, K = 𝑛 , so k = 2000/500 = 4th. Use a table of random
numbers to determine the starting point for selecting every 4th subject. With list of the 2000 subjects
in the sampling frame, go to the starting point, and select every 4th name on the list until the sample
size is reached. Probably will have to return to the beginning of the list to complete the selection of
the sample.
3. Stratified Random Sampling
This can be done by first dividing the elements in the population into strata and then
samples are randomly selected from each stratum ensuring that each selected element is
proportionately represented in the total population. Sampling fraction: n/N (desired sample size
divided by the population size)
Example: Assume you have a population of 1000 students with 500 from grade school, 300 from high
school, and 200 from senior high school. Determine the how many samples you need or you can use
the Slovin’s Formula or any other formula for computing the sample size. In this example, Slovin’s
Formula is used and a sample size of 400 is computed. To get the samples from each stratum, divide
400 by 1000 and the answer is 0.4. Multiply 0.4 to each of the number of students per stratum (e.g.
0.4 x 500 grade school is 200).
Page | 8
More examples are given below.
Stratified Sampling
A teacher who is conducting a research on the effects of using mobile phones in teaching English
decided to divide her students into male and female and then she selected students from each gender
group.
Systematic Sampling
The school office personnel gave the researcher a list of 2000 Grade 10 students. The researcher
selected every 25th name on the list.
Cluster Sampling
A researcher surveyed all dengue patients in each of the 10 randomly selected hospitals in Misamis
Oriental.
Let’s Summarize!
___________________________________________________________________
Assessment
Identify the type of sampling method used by the researcher in each situation: simple random
sampling, systematic sampling, stratified sampling, or cluster sampling.
_______________1. A researcher chose the participants of his study by selecting every 8 th member of
the population.
_______________2. A researcher interviewed all the teachers in each of 15 randomly selected
private schools in Cagayan de Oro City.
_______________3. A researcher interviewed people from each barangay in the municipality of
Alubijid for his research on population.
Page | 9
_______________4. A researcher is doing a research work on the students’ reaction to the newly
implemented curriculum in mathematics and interviewed every 5 th student
entering the gate of the school.
_______________5. A researcher randomly selected 15 barangays in a town for her study. She
did this by writing the names of each barangay on a piece of paper which she
folded and put in a bowl then she draw 15 pieces of paper from the bowl.
_______________6. A researcher selected a sample of 𝑛 = 300 from a population of 900 by using
the Table of Random Numbers.
_______________7. A researcher interviewed all the nurses in each of the 5 randomly selected
private hospitals in Northern Mindanao.
_______________8. A statistician selected a sample of 𝑛 = 500 high school students from a
private school with 2,500 students. He randomly selected the students from
each grade level.
_______________9. A Statistics student did a research on the time spent by Grade 11 and 12
students in playing mobile legends. He randomly selected his subjects by
using the Table of Random Numbers.
_______________10. A teacher conducted a study in her school to determine who were better in
mathematics: the males or the females.
Study the cases below. Identify which of the cases involves measures from a population and a
sample.
1. A researcher randomly selected a sample of 1600 people in Cagayan de Oro City and asked if they
use a certain detergent brand and 40% of them said yes.
2. A researcher interviewed all the members of the Mathematics department with 10 female teachers,
15 male teachers and 1 department head. He wants to know the average hours per day they spend
in training students for competitions and found out that they spend an average of 2 hours per day
for training.
Analysis: The first case contains a measure for a sample. It indicated that 40% of the 1600 samples
said yes. On the other hand, the second case contains a measure from a population because an
average of 2 hours per day spend in training is from all of the people in the Mathematics department.
What is It
A measure that describes a population is called parameter. A measure that describes a sample
is called statistic. In population parameter, mean is represented by 𝜇, 𝜎 2 represents variance, and 𝜎
represents standard deviation. In sample statistics, mean is represented by 𝑥̅ ,𝑠 2 represents variance,
and 𝑠represents standard deviation. Data from the sample are used to calculate statistics. Statistics are
estimates of the corresponding parameters.
Examples:
Parameter: 50% of the 24 Philippine senators agreed to support a certain measure.
Statistic: A researcher found out that 25% of the students in the Philippines reported to have internet
connection at home.
Explanation:
1. The example indicates a parameter since there are only 24 senators in the Philippines.
2. Researchers cannot ask millions of students if they have internet connection at home, so they
take samples from the target population and calculate.
What’s More
Page | 10
Read each statistical study below. For each study, identify both the parameter and the statistic.
Refer to an example given.
Example: A researcher wants to estimate the average monthly allowance of the Grade 11 female
students. From a random sample of 50 female students, the researcher obtains a sample mean monthly
allowance of 75 pesos.
1. A teacher wants to determine the average score in the first periodic examination in General
Mathematics of his 5 classes. From a random sample of 120 students. The teacher obtains an
average score of 84.
Parameter: _______________________________________________________
Statistic: _________________________________________________________
2. A teacher wants to know the average hours spend on social media of his advisory class. He
randomly selected and asked 35 students and found out that they spend an average of 3 hours per
day on social media.
Parameter: _______________________________________________________
Statistic: _________________________________________________________
Let’s Summarize!
• Parameter is a measure that describes a population. Parameter includes population mean 𝜇,
population variance 𝜎 2 , and population standard deviation 𝜎.
• Statistic is a measure that describes a sample. Statistics include sample mean 𝑥̅ , sample
variance 𝑠 2 and sample standard deviation 𝑠.
Assessment
__________1. The researcher found out that the 29 senior high school teachers of a certain school
spend an average of 2 hours preparing their lessons.
__________2. Based on a sample of 900 elementary students, it was found out 30% of them do not
know multiplication facts.
__________3. Based on sample of 1,200 surveyed students, it was found out that 20% of them needed
financial assistance.
__________4. A teacher surveyed all 50 students under his advisory class about their learning styles
and found out that most them are visual learners.
__________5. The Statistics teacher wants to know the average score of the students in the final
exam. He randomly selected 35 students and obtained an average score of 43.
Page | 11
Statistics and Probability
Quarter 3 – Week 6
Module 3: Sampling and Sampling Distribution
Lesson 3 Sampling Distribution of the Sample Means
What’s New
Activity
1. A population consists of 2, 3, and 4. List all possible samples of size 3 which can be drawn with
replacement from this population and compute the mean of each sample. One possible sample is
given as your guide.
Sample Mean
2, 3 2.5
What is It
If you list all the possible samples of size 3 drawn from a population of 4 with elements 2, 3,
and 4 you will have 9 samples. The table below shows the 9 samples and their corresponding means.
Sample Sample Mean
2, 2 2.0
2, 3 2.5
2, 4 3.0
3, 2 2.5
3, 3 3.0
3, 4 3.5
4, 2 3.0
4, 3 3.5
4, 4 4.0
This let us learn how to make a probability distribution of the sample means. We shall call this
distribution, the sampling distribution of sample means.
̅
Sample Mean 𝒙 Frequency Probability P(𝒙̅)
1
2 1
9
2
2.5 2
9
1
3 3
3
2
3.5 2
9
1
4 1
9
Total 𝐧=𝟗 1.00
A sampling distribution of sample means is a probability distribution that describes the
probability for each mean of all samples with the same sample size 𝑛.
Finding the Mean and Variance of the Sampling Distribution of Sample Means
The following are formulae needed to compute the mean, variance and standard deviation of a
population and mean, variance, and standard deviation of the sampling distribution of sample means.
Page | 12
Sampling Distribution of Sample
Population
Means
∑𝑥 ∑ 𝑥̅
Mean 𝜇= 𝜇𝑥̅ =
𝑁 𝑛
2
∑(𝑥 − 𝜇) ∑(𝑥̅ − 𝜇𝑥̅ )2
Variance 𝜎2 = 𝜎 2 𝑥̅ =
𝑁 𝑛
Standard ∑(𝑥 − 𝜇)2 ∑(𝑥̅ − 𝜇𝑥̅ )2
Deviation 𝜎=√ 𝜎𝑥̅ = √
𝑁 𝑛
d. List all the possible samples of size 2 with replacement and their corresponding means.
Observation Samples ̅
𝒙
1 2, 2 2.0
2 2, 5 3.5
3 2, 8 5.0
4 5, 2 3.5
5 5, 5 5.0
6 5, 8 6.5
7 8, 2 5.0
8 8, 5 6.5
9 8, 8 8.0
Page | 13
e. Find the mean of the sampling distribution of means.
Observation Samples ̅
𝒙
1 2, 2 2.0
2 2, 5 3.5
3 2, 8 5.0
4 5, 2 3.5
5 5, 5 5.0
6 5, 8 6.5
7 8, 2 5.0
8 8, 5 6.5
9 8, 8 8.0
̅ = 𝟒𝟓
∑𝒙
∑ 𝑥̅ 45
𝜇𝑥̅ = =
𝑛 9
𝝁𝒙̅ = 𝟓 Hence, the mean of the sampling distribution of sample means is 5.
f. Find the variance of the sampling distribution of means.
Observation Samples ̅
𝒙 ̅ − 𝝁𝒙̅
𝒙 ̅ − 𝝁𝒙̅ )𝟐
(𝒙
1 2, 2 2.0 −3 9
2 2, 5 3.5 −1.5 2.25
3 2, 8 5.0 0 0
4 5, 2 3.5 −1.5 2.25
5 5, 5 5.0 0 0
6 5, 8 6.5 1.5 2.25
7 8, 2 5.0 0 0
8 8, 5 6.5 1.5 2.25
9 8, 8 8.0 3 9
̅ = 𝟒𝟓
∑𝒙 ̅ − 𝝁𝒙̅ )𝟐 = 𝟐𝟕
∑(𝒙
∑(𝑥̅ − 𝜇𝑥̅ )2 27
𝜎 2 𝑥̅ = =
𝑛 9
𝝈𝟐 𝒙̅ = 3 Hence, the variance of the sampling distribution of sample means is 3.
g. Find the standard deviation of the sampling distribution.
∑(𝑥̅ − 𝜇𝑥̅ )2
𝜎𝑥̅ = √ = √3
𝑛
𝝈𝒙̅ = 𝟏. 𝟕𝟑
Hence, the standard deviation of the sampling distribution of sample means is 1.73.
Page | 14
𝒙 𝒙−𝝁 (𝒙 − 𝝁)𝟐
1 −2 4
3 0 0
5 2 4
2
∑ (𝑥 − 𝜇 ) = 8
∑(𝑥 − 𝜇)2
𝜎2 =
𝑁
2
8
𝜎 =
3
𝝈𝟐 = 𝟐. 𝟔𝟕
Hence, the population variance is 2.67.
d. List all the possible samples of size 2 with replacement and their corresponding means.
Observation Samples ̅
𝒙
1 1, 3 2
2 1, 5 3
3 3, 1 2
4 3, 5 4
5 5, 1 3
6 5, 3 4
e. Find the mean of the sampling distribution of means.
Observation Samples ̅
𝒙
1 1, 3 2
2 1, 5 3
3 3, 1 2
4 3, 5 4
5 5, 1 3
6 5, 3 4
̅ = 𝟏𝟖
∑𝒙
∑ 𝑥̅ 18
𝜇𝑥̅ = =
𝑛 6
𝝁𝒙̅ = 𝟑 Hence, the mean of the sampling distribution of sample means is 3.
f. Find the variance of the sampling distribution of means.
Observation Samples ̅
𝒙 ̅ − 𝝁𝒙̅
𝒙 ̅ − 𝝁𝒙̅ )𝟐
(𝒙
1 1, 3 2 −1 1
2 1, 5 3 0 0
3 3, 1 2 −1 1
4 3, 5 4 1 1
5 5, 1 3 0 0
6 5, 3 4 1 1
̅ = 𝟏𝟖
∑𝒙 ̅ − 𝝁𝒙̅ )𝟐 = 𝟒
∑(𝒙
∑(𝑥̅ − 𝜇𝑥̅ )2 4
𝜎 2 𝑥̅ = =
𝑛 6
𝝈𝟐 𝒙̅ = 𝟎. 𝟔𝟕 Hence, the variance of the sampling distribution of sample means is 0.67.
Page | 15
g. Find the standard deviation of the sampling distribution.
∑(𝑥̅ − 𝜇𝑥̅ )2
𝜎𝑥̅ = √ = √0.67
𝑛
𝝈𝒙̅ = 𝟎. 𝟖𝟐
Hence, the standard deviation of the sampling distribution of sample means is 0.82.
1. What do you notice about the population mean and the mean of the sampling distribution of
sample means? How do you compare them?
2. How do you compare the population variance and the variance of the sampling distribution of
sample means?
Let us summarize the example above by comparing the means and variances of population and
the sampling distribution of the sample means.
If all possible samples of size 𝑛 that can be drawn from the population of size N with mean 𝜇 and
variance 𝜎 2 , then the sampling distribution of the sample means has the following properties.
With Replacement
• The mean of the sampling distribution of means is equal to the mean of the population.
𝜇𝑥̅ = 𝜇
• The variance of the sampling distribution of means is equal to the population variance
divided by the size of 𝑛 of the samples. That is,
𝜎2
𝜎 2 𝑥̅ =
𝑛
• The standard deviation of the sampling distribution of means is equal to the population
standard deviation divided by the square root of the sample size of 𝑛 of the samples. That
is,
𝜎
𝜎𝑥̅ =
√𝑛
Without Replacement
• The mean of the sampling distribution of means is equal to the mean of the population.
𝜇𝑥̅ = 𝜇
• The variance of the sampling distribution of means is equal to the population variance
divided by the size of 𝑛 of the samples. That is,
𝜎2 𝑁 − 𝑛
𝜎 2 𝑥̅ = ( )
𝑛 𝑁−1
• The standard deviation of the sampling distribution of means is equal to the population
standard deviation divided by the square root of the sample size of 𝑛 of the samples. That
is,
𝜎 𝑁−𝑛
𝜎𝑥̅ = √( )
√𝑛 𝑁−1
Page | 16
Let’s Summarize!
• The probability distribution of the sample means is called the sampling distribution of sample
means.
• A sampling distribution of sample means is a probability distribution that describes the probability
for each mean of all samples with the same sample size 𝑛.
• If all possible samples of size 𝑛 that can be drawn from the population of size N with mean 𝜇 and
variance 𝜎 2 , then the sampling distribution of the sample means has the following properties.
If all possible samples of size 𝑛 that can be drawn from the population of size N with mean 𝜇
and variance 𝜎 2 , then the sampling distribution of the sample means has the following properties.
With Replacement
• The mean of the sampling distribution of means is equal to the mean of the population.
𝜇𝑥̅ = 𝜇
• The variance of the sampling distribution of means is equal to the population variance divided
𝜎2
by the size of 𝑛 of the samples. That is, 𝜎 2 𝑥̅ =
𝑛
• The standard deviation of the sampling distribution of means is equal to the population
standard deviation divided by the square root of the sample size of 𝑛 of the samples. That is,
𝜎
𝜎𝑥̅ = 𝑛
√
Without Replacement
• The mean of the sampling distribution of means is equal to the mean of the population.
𝜇𝑥̅ = 𝜇
• The variance of the sampling distribution of means is equal to the population variance
divided by the size of 𝑛 of the samples. That is,
𝜎2 𝑁 − 𝑛
𝜎 2 𝑥̅ = ( )
𝑛 𝑁−1
• The standard deviation of the sampling distribution of means is equal to the population
standard deviation divided by the square root of the sample size of 𝑛 of the samples. That
is,
𝜎 𝑁−𝑛
𝜎𝑥̅ = √( )
√𝑛 𝑁−1
Assessment
Page | 17
3. A random sample of 𝑛 = 65 measurements is obtained from a population with 𝜇 = 20 and 𝜎 2 = 400.
Describe the sampling distribution for the sample means by computing 𝜇𝑥̅ 𝑎𝑛𝑑 𝜎 2 𝑥̅ . Consider a
sampling without replacement.
Page | 18