Professional Documents
Culture Documents
MATH03-CO4-Lesson1-Sampling and Sampling Distribution
MATH03-CO4-Lesson1-Sampling and Sampling Distribution
Prepared by:
Engr. Diamante, Lenolan L.
Revised by:
Ipapo, Redwaynne Jester T.
Objective: Discuss and apply comprehensively the concepts of sampling and sampling distribution.
Subject Matters:
Lesson 1. Sampling and Sampling Distribution
Lesson 1.1. Random Sample
Lesson 1.2. Sampling Distribution
Lesson 1.3. Sampling and Non-Sampling Errors
Lesson 1.4. Central Limit Theorem
Learning Competencies:
I. Apply the concepts of sampling and sampling distribution to real world problems.
Evaluation:
Performance Task: PT4 to be deployed on Blackboard (Week 13)
1. Which of the following refers to the standard error of the sampling distribution of the sample mean?
a. Mean c. Standard Deviation
b. Sample d. Variance
2. Which of the following represents the mean of the sampling distribution?
a. 𝜇𝑋̅ c. 𝜎
b. 𝑋̅ d. 𝜎𝑋̅
3. What is the equivalent of the variance of the sampling distribution of sample mean?
𝜎2
a. 𝜎 2 c. 𝑛
𝜇2
b. 𝜇2 d.
𝑛
4. Which of the following is the formula for sampling error?
a. 𝜇 + 𝑥̅ c. 𝑥̅ − 𝜇
b. 𝜇 d. 𝑥̅
5. Given the scores of five students in Biology Exam (35, 40, 28, 32, 42), if a sample of three students is
taken, which set of scores will lead to the smallest sampling error?
a. {35, 40, 42} c. {35, 32, 28}
b. {35, 28, 42} d. {40, 28, 32}
6. Given the scores of five students in Biology Exam (35, 40, 28, 32, 42), a sample of three students
were taken (35, 40, 28). Unfortunately, 28 is mistakenly written as 38. What is the nonsampling error?
a. 3.34 c. 3.67
b. 2.34 d. 2.67
7. Dogs in a dog show weigh an average of 12 pounds, with a standard deviation of 8 pounds. If 16 dogs
are chosen at random, what is the standard error of the sample mean?
a. 𝜎𝑋̅ = 2 c. 𝜎𝑋̅ = 1
b. 𝜎𝑋̅ = 3 d. 𝜎𝑋̅ = 4
8. Dogs in a dog show weigh an average of 12 pounds, with a standard deviation of 8 pounds. If 16 dogs
are chosen at random, what is the probability that they weigh an average of more than 10 pounds?
a. 0.3413 c. 0.1587
b. 0.8413 d. 0.3174
9. The mean number of minutes of app engagement of tablet users is 10.3 minutes with a standard
deviation of 1.2 minutes. Given a sample of 36 tablet users, what is the standard error of the sampling
distribution?
a. 0.3 c. 0.4
b. 0.1 d. 0.2
10. The mean number of minutes of app engagement of tablet users is 10.3 minutes with a standard
deviation of 1.2 minutes. Given a sample of 36 tablet users, what is the probability that the average
app engagement is less than 10 minutes?
a. 0.0655 c. 0.0548
b. 0.0668 d. 0.0577
What is it?
I. Sampling and Sampling Distribution
Sampling Theory is a study of relationships existing between a population and samples drawn
from the population. This theory is useful in (1) Estimating unknown population parameters
(such as mean, variance or proportion) from a knowledge of a certain sample quantities
(sample statistics) and in (2) determining whether the observed differences between two
samples are due to chance variation or whether they are really significant. (Is a certain
production process better than the other?)
Technically, a simple random sample is a set of n objects in a population of N objects where all
possible samples are equally likely to happen. Here’s a basic example of how to get a simple
random sample: put 100 numbered bingo balls into a bowl (this is the population N). Select 10
balls from the bowl without looking (this is your sample n). Note that it’s important not to look
as you could (unknowingly) bias the sample. While the “lottery bowl” method can work fine for
MATH03 |Core | CO4
E-mail address: shs@mapua.edu.ph
Telephone number: (02) 8247 - 5000
MATH03 | Statistics and Probability
Simple random sampling of a sample “n” of 3 from a population “N” of 12. Image: Dan Kernler
|Wikimedia Commons
Imagine that the people illustrated in the image above are game pieces. Place the 12 game pieces
in a bowl and choose 3. This is simple random sampling. A simple random sample is chosen in
such a way that every set of individuals has an equal chance to be in the selected sample.
For our random sample of 𝑛 observations, we have:
∑𝑥𝑖
𝑥̅ =
𝑛
∑(𝑥𝑖 − 𝑥̅ )2
𝑠2 =
𝑛−1
Alternatively, the sample variance can be computed as follows:
𝑛∑𝑥𝑖2 − (∑𝑥𝑖 )2
𝑠2 =
𝑛(𝑛 − 1)
plays a significant role in obtaining the values of a sample statistic. For example, if we calculate
the mean 𝜇 of a population of a data set, there will be no randomness involved, as all the data
values in the population are included in the calculation of mean 𝜇. However, if we select a sample
from this population and calculate the sample mean 𝑥̅, it will assume one of the numerous
possible values. Assuming we select this sample randomly, the value of 𝑥̅ depends on which
values from the population are included in the sample.
Consider all possible samples of three scores each that can be selected, without replacement,
from that population. The total number of possible samples, given by the combination’s formula
discussed in Chapter 4, is 10; that is,
These 10 samples and their respective means are listed in the table below (7.3). Note that the
first two samples have the same three scores. The reason for this is that two of the students (C
and D) have the same score of 80, and, hence, the samples ABC and ABD contain the same
values. The mean of each sample is obtained by dividing the sum of the three scores included in
70+78+80
that sample by 3. For instance, the mean of the first sample is = 76. Note that he values
3
of the means of the samples in the table are rounded to two decimal places.
By using the values of 𝑥̅ given in the table, we record the frequency distribution of 𝑥̅ by dividing the
frequencies of the various values of 𝑥̅ by the sum of all frequencies, we obtain the relative
MATH03 |Core | CO4
E-mail address: shs@mapua.edu.ph
Telephone number: (02) 8247 - 5000
MATH03 | Statistics and Probability
frequencies of the classes, which are listed in the third column of the table below(7.4). These
relative frequencies are used as probabilities and listed in the next table (7.5)
If we select just one sample of three scores from the population of five scores, we may draw any
of the 10 possible samples. Hence, the sample mean, 𝑥̅, can assume any of the values listed in
table 7.5 with the corresponding probability. For instance, the probability that the mean of a
randomly selected sample of three scores is 81.67 is 0.20. This probability can be written as
Sampling error is the difference between the value of a sample statistic and the value of the
corresponding population parameter. In the case of t mean,
𝑆𝑎𝑚𝑝𝑙𝑖𝑛𝑔 𝑒𝑟𝑟𝑜𝑟 = 𝑥̅ − 𝜇
Assuming that the sample is random and no sampling error has been made. The sampling
error occurs only in a sample survey, and not in a census.
Note:
in the real world, it is not possible to find the sampling error because 𝜇 is not known. If 𝜇 is known
then we do not need to find 𝑥̅. But the concept of sampling error is very important in making
inferences in later chapters.
It is important to remember that a sampling error occurs because of chance. The errors that occur
for other reasons, such as errors made during collection, recording, and tabulation of data, are
called nonsampling errors. These errors occur because of human mistakes, and not chance. Note
that there is only one kind of sampling error—the error that occurs due to chance. However,
there is not just one nonsampling error, but there are many nonsampling errors that may occur
for different reasons.
Nonsampling errors are the errors that occur in the collection, recording and tabulation of the
data. The nonsampling errors can occur both in a sample survey and in a census.
Examples
Problem 1. There are only five students in an advanced statistics class and the midterm scores of
these five students are:
70 78 80 80 95
a. Suppose one sample of three scores is selected from the population, and this sample includes
the scores 70, 80 and 95. Find the sampling error.
b. If it is mistakenly recorded the second score of the sample as 82 instead of 80, what is the
value of non-sampling error?
Solutions:
a.
∑𝑥𝑖 70 + 78 + 80 + 80 + 95
𝜇= = = 80.6
𝑛 5
∑𝑥𝑖 70 + 80 + 95
𝑥̅ = = = 81.67
𝑛 3
𝑆𝑎𝑚𝑝𝑙𝑖𝑛𝑔 𝑒𝑟𝑟𝑜𝑟 = 𝑥̅ − 𝜇 = 81.67 − 80.6 = 1.07
b.
∑𝑥𝑖 70 + 82 + 95
𝑥̅ ′ = = = 82.33
𝑛 3
𝑁𝑜𝑛 − 𝑠𝑎𝑚𝑝𝑙𝑖𝑛𝑔 𝑒𝑟𝑟𝑜𝑟 = 𝑥̅ ′ − 𝑥̅ = 82.33 − 81.67 = 0.66
population (either finite or infinite) with mean μ and variance 𝜎2, and if 𝑋̅ is the sample mean,
the limiting form of the distribution of
𝑋̅ − 𝜇
𝑧= 𝜎
ξ𝑛
as 𝑛 → ∞, is the standard normal distribution.
Examples
1. An electrical firm manufactures light bulbs that have a length of life that is approximately
normally distributed, with mean equal to 800 hours and a standard deviation of 40 hours.
Find the probability that a random sample of 16 bulbs will have an average life of less than
775 hours.
Solution:
According to the problem, 𝜇 = 800 ℎ𝑟𝑠, 𝜎 = 40 ℎ𝑟𝑠 𝑎𝑛𝑑 𝑛 = 16 𝑏𝑢𝑙𝑏𝑠
Solution:
According to the problem, 𝜇 = 7 𝑦𝑟𝑠, 𝜎 = 1 𝑦𝑟 𝑎𝑛𝑑 𝑛 = 9 𝑏𝑢𝑙𝑏𝑠
𝑃(6.4 < 𝑋̅ < 7.2) =?
In solving this problem, Convert into standard random variable.
𝑋̅ − 𝜇 6.4 − 7
𝑧1 = 𝜎 = = −1.8
1
ξ𝑛 ξ9
𝑋̅ − 𝜇 7.2 − 7
𝑧2 = 𝜎 = = 0.6
1
ξ𝑛 ξ9
Therefore, we can say that 𝑃(6.4 < 𝑋̅ < 7.2) is the same as 𝑃(−1.8 < 𝑧 < 0.6).
Using the Standard normal distribution table, the value equivalent to 𝑃(−1.8 < 𝑧 < 0.6) is
𝑃 (−1.8 < 𝑧 < 0.6)
= 𝑃(𝑧 < 0.6) − 𝑃(𝑧 < −1.8)
= 0.7257 − 0.0359
= 0.6898
Hence,
𝑃(−1.8 < 𝑧 < 0.6) = 0.6898
3. If a certain machine makes electrical resistors having a mean resistance of 40 ohms and
a standard deviation of 2 ohms, what is the probability that a random sample of 36
resistors will have a combined resistance of more than 1458 ohms?
Solution:
According to the problem, 𝜇 = 40 ohms, 𝜎 = 2 𝑜ℎ𝑚𝑠,
𝑐𝑜𝑚𝑏𝑖𝑛𝑒𝑑 𝑟𝑒𝑠𝑖𝑠𝑡𝑎𝑛𝑐𝑒 = 1458 𝑜ℎ𝑚𝑠 𝑎𝑛𝑑 𝑛 = 36 𝑏𝑢𝑙𝑏𝑠
Since the given is combined resistance for 36 bulbs, solve first for the sample mean resistance of
those 36 bulbs.
∑𝑥 1458
𝑋̅ = = = 40.5
𝑛 36
̅ > 40.5 𝑜ℎ𝑚𝑠) =?
𝑃(𝑋
In solving this problem, Convert into standard random variable.
𝑋̅ − 𝜇 40.5 − 40
𝑧= 𝜎 = = 1.5
2
ξ𝑛 ξ36
̅ > 40.5 𝑜ℎ𝑚𝑠) is the same as 𝑃(𝑧 > 1.5).
Therefore, we can say that 𝑃(𝑋
Using the Standard normal distribution table, the value equivalent to 𝑃(𝑧 > 1.5)
𝑃(𝑧 > 1.5)
= 1 − 𝑃(𝑧 < 1.5)
= 1 − 0.9332
= 0.0668
Hence,
̅ > 40.5 𝑜ℎ𝑚𝑠 ) . = 0.0668
𝑃 (𝑋
What I can do
Solve the following problems
1. The population average milk production of cows per day is 16 L with a standard
deviation of 3.4 L. For a sample of 36 cows, what is the probability that the sample
average milk production of cows is:
a. Less than 15 L?
b. Between 14.5 L to 16.5 L?
MATH03 | Statistics and Probability
Assessment (Post-test)
Choose the letter of the correct answer for each item.
1. Which of the following refers to the standard error of the sampling distribution of the sample
mean?
a. Mean c. Standard Deviation
b. Sample d. Variance
2. Which of the following represents the mean of the sampling distribution?
a. 𝜇𝑋̅ c. 𝜎
̅
b. 𝑋 d. 𝜎𝑋̅
3. What is the equivalent of the variance of the sampling distribution of sample mean?
𝜎2
a. 𝜎 2 c. 𝑛
𝜇2
b. 𝜇2 d.
𝑛
4. Which of the following is the formula for sampling error?
a. 𝜇 + 𝑥̅ c. 𝑥̅ − 𝜇
b. 𝜇 d. 𝑥̅
5. Given the scores of five students in Biology Exam (35, 40, 28, 32, 42), if a sample of three students
is taken, which set of scores will lead to the smallest sampling error?
a. {35, 40, 42} c. {35, 32, 28}
b. {35, 28, 42} d. {40, 28, 32}
6. Given the scores of five students in Biology Exam (35, 40, 28, 32, 42), a sample of three students
were taken (35, 40, 28). Unfortunately, 28 is mistakenly written as 38. What is the nonsampling
error?
a. 3.34 c. 3.67
b. 2.34 d. 2.67
7. Dogs in a dog show weigh an average of 12 pounds, with a standard deviation of 8 pounds. If 16
dogs are chosen at random, what is the standard error of the sample mean?
a. 𝜎𝑋̅ = 2 c. 𝜎𝑋̅ = 1
b. 𝜎𝑋̅ = 3 d. 𝜎𝑋̅ = 4
8. Dogs in a dog show weigh an average of 12 pounds, with a standard deviation of 8 pounds. If 16
dogs are chosen at random, what is the probability that they weigh an average of more than 10
pounds?
a. 0.3413 c. 0.1587
b. 0.8413 d. 0.3174
9. The mean number of minutes of app engagement of tablet users is 10.3 minutes with a standard
deviation of 1.2 minutes. Given a sample of 36 tablet users, what is the standard error of the
sampling distribution?
a. 0.3 c. 0.4
b. 0.1 d. 0.2
10. The mean number of minutes of app engagement of tablet users is 10.3 minutes with a standard
deviation of 1.2 minutes. Given a sample of 36 tablet users, what is the probability that the
average app engagement is less than 10 minutes?
a. 0.0655 c. 0.0548
b. 0.0668 d. 0.0577
Answer Key
10. b 10. b
9. d 9. d
8. c 8. c
7. a 7. a
6. a b. approx. 1 6. a
5. c 2. a. 0.3446 b. 0.9215 5. c
4. c 2. a. 0.0764 4. c
3. c c. 0.1003 3. c
2. a b. 0.9896 b. 0.8066 2. a
1. c 1. a. 0.1003 1. a. 0.0392 1. c
What do I know? What’s more? What I can do Assessment
References
Elementary Statistics: A Step By Step Approach 10th Edition by Allan Bluman. McGraw-Hill
Education.
16
MATH03 |Core | CO4
E-mail address: shs@mapua.edu.ph
Telephone number: (02) 8247 - 5000