Download as pdf or txt
Download as pdf or txt
You are on page 1of 10

STATISTICS AND PROBABILITY

Name: Date:
Section: Q3 – Week 7- Week 8

MEASURES OF CENTRAL TENDENCY

Background Information
What is the Central Limit Theorem (CLT)?
In the study of probability theory, the central limit theorem (CLT) states that the distribution of
sample approximates a normal distribution (also known as a “bell curve”) as the sample size becomes
larger, assuming that all samples are identical in size, and regardless of the population distribution
shape.

Said another way, CLT is a statistical theory stating that given a sufficiently large sample size
from a population with a finite level of variance, the mean of all samples from the same population will
be approximately equal to the mean of the population. Furthermore, all the samples will follow an
approximate normal distribution pattern, with all variances being approximately equal to the variance
of the population, divided by each sample's size.

Key Terms and Illustrations to Review and Recall:

1. Inferential Statistics – it is a technique, used to draw conclusions and trends about a population based from a
sample.
2. Population – is a collection of all possible observations under a particular study.
3. Sample – is a chosen portion of the whole population.
4. Population Parameter – is a summary measure to describe the characteristic of the whole population.
5. Sample Statistic – is a summary measure computed from a sample to describe the characteristic of the whole
population.
6. Z-score – the corresponding value of a statistic of a normally distributed sample in the standard normal
distribution.

The diagram illustrates the relationship of the population and sample

Statistic
POPULATION SAMPLE
𝜇 𝑥̅
𝜎 Parameter 𝑠

Commonly Used Statistics and their Corresponding Parameters

Statistic Parameter
Measure
(sample) (population)
Mean 𝑥̅ 𝜇
Variance 𝑠2 𝑜𝑟 𝜎𝑥̅ 2 𝜎2
Standard Deviation 𝑠 𝑜𝑟 𝜎𝑥̅ 𝜎
Proportion 𝑃𝑥̅ 𝑃

Sampling Distribution of the Sample Means:

In a given population, suppose we draw all the possible samples each of size 𝒏. Suppose further that
we compute the mean of each sample. The probability distribution of these means is called
Sampling Distribution of the Sample Means.

1
Example 1:
Consider the possible outcomes of rolling a six-
sided die. In rolling a die, the number of dots that 1 2 3 4 5 6
will appear can be any one of these:
There six possible outcomes and each of them has an equal chance of being selected so each
1
outcome has a probability of or approximately 0.167.
6

OUTCOM PROBABILIT
MEAN
E Y
1
1 1 𝑜𝑟 0.167
6
1
2 2 𝑜𝑟 0.167
6
1
3 3 𝑜𝑟 0.167
6
1
4 4 𝑜𝑟 0.167
6
1
5 5 𝑜𝑟 0.167
6
1
6 6 𝑜𝑟 0.167
6
Results and Means obtained from
Rolling a Die and Their Corresponding Probability Distribution of the Outcome of Rolling a Die
Probabilities

Example 2:
Now suppose rolling 2 dice and take Die 1 Die 2 Outcome
the mean of each set of the sample
Outcom
points. In this experiment there are 1 2 3 4 5 6
e
36 sample points and they are:
1 1 1.5 2 2.5 3 3.5
(1, 1), (1, 2), (1, 3), (1, 4), (1, 5), (1, 6), (2, 1), 2 1.5 2 2.5 3 3.5 4
(2, 2), (2, 3), (2, 4), … … … , (6, 4), (6, 5), (6, 6). The
mean for each set of sample points can either be 3 2 2.5 3 3.5 4 4.5
1, 1.5, 2, 2.5, 3, 3.5, 4, 4.5, 5, 5.5 𝑜𝑟 6. 4 2.5 3 3.5 4 4.5 5
There can only be one possible way to obtain a
1 5 3 3.5 4 4.5 5 5.5
sample mean of 1. Thus, its probability is .
36 6 3.5 4 4.5 5 5.5 6
Moreover, the sample points conatining1 dot and
The mean of each possible outcome in rolling 2 dice
2 dots give a sample mean of 1.5. There are two
possible arrangements for that: these are the
sample points (1, 2) and (2, 1). Hence the
probability of obtaining a sample mean of 1.5 is
2 1
or .
36 18
There is only one way in obtaining a sample
1
mean of 1 and its probability is . Moreover, only
36
two sample points can result to a sample mean
2
of 1.5, therefore its probability is . The
36
3
probability of obtaining a sample mean of 2 is ,
36
and so on.
Sampling Distribution of the Means from Rolling 2 Dice

MEAN 1 1.5 2 2.5 3 3.5 4 4.5 5 5.5 6


PRO- 1 2 3 4 5 6 5 4 3 2 1
𝑜𝑟 0.028 𝑜𝑟 0.056 𝑜𝑟 0.083 𝑜𝑟 0.111 𝑜𝑟 0.139 𝑜𝑟 0.167 𝑜𝑟 0.139 𝑜𝑟 0.111 𝑜𝑟 0.083 𝑜𝑟 0.056 𝑜𝑟 0.028
BABILITY 36 36 36 36 36 36 36 36 36 36 36

Sampling Means from Rolling 2 Dice and their Corresponding Probabilities

2
Example 3:
Five children have the following weights. Child’s Name Weight (kg.)

1. Get all the possible sample sizes of 3. Jose 25


2. Construct the sampling distribution of Yuan 21
the means. Kevin 19
3. Construct the graph that shows its
Maxine 23
probability distribution.
Alexa 18
1.) Weights of 5 Children

10 POSSIBLE COMPOSITIONS Sample Values (Weights of


Sample Mean
Samples (Group of 3 Children) Children)
1. Jose, Yuan, Kevin 25 21 19 21.67
2. Jose, Yuan, Maxine 25 21 23 23
3. Jose, Yuan, Alexa 25 21 18 21.33
4. Jose, Kevin, Maxine 25 19 23 22.33
5. Jose, Kevin, Alexa 25 19 18 20.67
6. Jose, Maxine, Alexa 25 23 18 22
7. Yuan, Kevin, Maxine 21 19 23 21
8. Yuan, Kevin, Alexa 21 19 18 19.33
9. Yuan, Maxine, Alexa 21 23 18 20.67
10. Kevin, Maxine, Alexa 19 23 18 20
Possible Sample Means of Weights

2.)
̅ (𝑴𝒆𝒂𝒏)
𝒙 19.33 20 20.67 21 21.33 21.67 22 22.33 23
1 1 2 1 1 1 1 1 1
̅)
𝑷(𝒙 𝑜𝑟 0.1 𝑜𝑟 0.1 𝑜𝑟 0.2 𝑜𝑟 0.1 𝑜𝑟 0.1 𝑜𝑟 0.1 𝑜𝑟 0.1 𝑜𝑟 0.1 𝑜𝑟 0.1
10 10 10 10 10 10 10 10 10
Sampling Distribution of the Sample Means of Weights of Size 3

3.) Graph of the Probability Distribution.

 𝑥̅ does not have a normal distribution. Reason:


the sample size is small.

Sampling Distribution of the Sample Means of Weights of


Size 3

KEY TAKEAWAYS:
1. The central limit theorem (CLT) states that the distribution of sample means approximates a normal
distribution as the sample size gets larger. Central
2. Sample sizes equal to or greater than 30 are considered sufficient for the CLT to hold. Limit
3. A key aspect of CLT is that the average of the sample means and standard deviations will equal the Theorem
population mean and standard deviation. (CLT)
4. A sufficiently large sample size can predict the characteristics of a population accurately.

3
If random sizes of 𝒏 are drawn from a finite or
infinite population with mean 𝝁 and variance The expected mean of the sampling means and the
𝝈𝟐 , then the sampling distribution of sample variance of the sample means in a sampling
mean 𝒙̅ ̅ is approximately normally distributed distribution of the sample means are as follows:
having an expected value of 𝝁𝒙̅ = 𝝁 and
𝝈 ̅)
̅ 𝒇(𝒙
𝝁̅𝒙 = ∑ 𝒙 𝝈𝟐
standard deviation 𝝈𝒙̅ = . Hence the 𝝈𝒙̅ 𝟐 = ∑(𝒙 − 𝝁𝒙̅ )𝟐 𝒇(𝒙
̅) =
√𝒏 𝒏
=𝝁
transformation to standard normal variable 𝒛
(Table A) is given by the formula:

̅−𝝁
𝒙
𝒛= 𝝈
√𝒏

which approximately follows the Standard Normal Distribution.

1. The SUMMARY OF THE SAMPLING


normal approximation of 𝑥̅ DISTRIBUTION OF THE MEANS
[Rolling
depends on the sampleone(1)
size 𝑛.Die and two(2) Dice Experiment]
Example, the results of using
the experiment of throwing a
die

a. The distribution obtained


for throws of a six-sided fair
die.
b. The distribution obtained of
𝑥̅ becomes closer to normal
as 𝑛 grows larger.
Probability Distribution for a sample size of 𝑛 = 1
2. For any population, the normal
approximation of 𝑥̅ will be
generally good if 𝑛 ≥ 30.

3. The sample size of 30 is a


guideline to use for the Central
Limit Theorem.

Probability Distribution for a sample size of 𝑛 = 2

Example 4:
Compute the mean and variance of the sampling distribution given:

MEAN 1 1.5 2 2.5 3 3.5 4 4.5 5 5.5 6


PRO- 1 2 3 4 5 6 5 4 3 2 1
𝑜𝑟 0.028 𝑜𝑟 0.056 𝑜𝑟 0.083 𝑜𝑟 0.111 𝑜𝑟 0.139 𝑜𝑟 0.167 𝑜𝑟 0.139 𝑜𝑟 0.111 𝑜𝑟 0.083 𝑜𝑟 0.056 𝑜𝑟 0.028
BABILITY 36 36 36 36 36 36 36 36 36 36 36

Sampling Means from Rolling 2 Dice and their Corresponding Probabilities

4
The mean and variance of the sampling distribution are calculated as follows:

Solution:

Sample Mean
̅ 𝒇(𝒙
𝝁𝒙̅ = ∑ 𝒙 ̅) → 𝑭𝒐𝒓𝒎𝒖𝒍𝒂 𝒇𝒓𝒐𝒎 𝑪𝒆𝒏𝒕𝒓𝒂𝒍 𝑳𝒊𝒎𝒊𝒕 𝑻𝒉𝒆𝒐𝒓𝒆𝒎
1 2 3 4 5 16 5 4 3
𝜇𝑥̅ = 1 ( ) + 1.5 ( ) + 2 ( ) + 2.5 ( ) + 3 ( ) + 3.5 ( ) + 4 ( ) + 4.5 ( ) + 5 ( )
36 36 36 36 36 36 36 36 36
2 1
+ 5.5 ( ) + 6 ( )
36 36
𝝁𝒙̅ = 𝟑. 𝟓

Sample Variance
𝝈𝒙̅ 𝟐 = ∑(𝑥̅ − 𝜇𝑥̅ )2 𝑓(𝑥̅ ) → 𝑭𝒐𝒓𝒎𝒖𝒍𝒂 𝒇𝒓𝒐𝒎 𝑪𝒆𝒏𝒕𝒓𝒂𝒍 𝑳𝒊𝒎𝒊𝒕 𝑻𝒉𝒆𝒐𝒓𝒆𝒎
1 2 3 4 5
𝜎𝑥̅ 2 = (1 − 3.5)2 ( ) + (1.5 − 3.5)2 ( ) + (2 − 3.5)2 ( ) + (2.5 − 3.5)2 ( ) + (3 − 3.5)2 ( )
36 36 36 36 36
16 5 4 3
+ (3.5 − 3.5)2 ( ) + (4 − 3.5)2 ( ) + (4.5 − 3.5)2 ( ) + (5 − 3.5)2 ( )
36 36 36 36
2 1
+ (5.5 − 3.5)2 ( ) + (6 − 3.5)2 ( )
36 36
𝝈𝒙̅ 𝟐 = 𝟏. 𝟒𝟓𝟖

Example 5:
A school district has 500 Eight Graders. In that district, the average weight of an Eight Grader
is 35 kilograms, with a standard deviation of 5 kilograms. Drawing a sample of 50 students. What is
the probability that the average weight of the sampled students will be less than 34 kilograms?

Solution:
Sample Standard Deviation: Standard Normal Variable 𝒛:
𝝈 ̅
𝒙−𝝁
𝝁 = 𝟑𝟓 𝝈𝒙̅ = 𝒛= 𝝈
√𝒏
√𝒏
5 34−35
̅ = 𝟑𝟒
𝒙 𝜎𝑥̅ = 𝑧=
√50 0.707
−1
𝒏 = 𝟓𝟎 𝝈𝒙̅ = 𝟎. 𝟕𝟎𝟕 𝑧=
0.707
𝝈= 𝟓 𝒛 = −𝟏. 𝟒𝟏
Thus 𝑃(𝑥̅ < 34) = 𝑃(𝑧 < −1.41) = 0.0793. Value obtained from Table A (Areas under the Standard Normal
Distribution).
Therefore, the probability that the sampled students will have an average weight of less than 34 kilograms is
7. 93%.
Example 6:
An electrical parts factory manufactures resistors that have a resistance of 100 ohms. The
distribution resistance is normal. Find the probability that a random sample of 50 resistors will have
an average resistance greater than 95 ohms but less than 105 ohms, as the standard deviation of the
resistors is 14 ohms.

Solution:
Sample Standard Deviation: Standard Normal Variable 𝒛:
𝝈 ̅ −𝝁
𝒙 ̅𝟐−𝝁
𝒙
𝝁𝒙̅ = 𝝁 = 𝟏𝟎𝟎 𝝈𝒙̅ = 𝒛𝟏 = 𝟏𝝈 𝒛𝟐 = 𝝈
√𝒏
√𝒏 √𝒏
14 95−100 105−100
̅𝟏 = 𝟗𝟓
𝒙 𝜎𝑥̅ = 𝑧1 = 𝑧2 =
√50 1.98 1.98
−5 5
̅𝟐 = 𝟏𝟎𝟓
𝒙 𝝈𝒙̅ = 𝟏. 𝟗𝟖 𝑧1 = 𝑧2 =
1.98 1.98
𝒏 = 𝟓𝟎 𝒛𝟏 = −𝟐. 𝟓𝟑 𝒛𝟐 = 𝟐. 𝟓𝟑
𝝈 = 𝟏𝟒

5
Thus 𝑃(95 < 𝑥̅ < 105) = 𝑃(𝑧2 < 2.53) − 𝑃(𝑧1 < −2.53) = 0.9943 − 0.0057 = 0.9886. Value obtained
from Table A (Areas under the Standard Normal Distribution).
Therefore, 98.86% of the sample size 50 will have an average resistance between 95 and 105 ohms.

Example 7:
Maliwanag E-Lights Co. produces LED bulbs that have a life span that is approximately
normally distributed, with the mean equal to 10,000 hours and a standard deviation of 3,600 hours.
Find the probability that a random sample of 35 bulbs will have an average lifespan greater than 8,500
hours.
Solution:
Sample Standard Deviation: Standard Normal Variable 𝒛:
𝝈 ̅
𝒙−𝝁
𝝁𝒙̅ = 𝝁 = 𝟏𝟎, 𝟎𝟎𝟎 𝝈𝒙̅ = 𝒛= 𝝈
√𝒏
√𝒏
3,600 8,500−10,000
̅ = 𝟖, 𝟓𝟎𝟎
𝒙 𝜎𝑥̅ = 𝑧=
√35 608.51
−1,500
𝒏 = 𝟑𝟓 𝝈𝒙̅ = 𝟔𝟎𝟖. 𝟓𝟏 𝑧=
608.51
𝝈 = 𝟑, 𝟔𝟎𝟎 𝒛 = −𝟐. 𝟒𝟕
Thus 𝑃(𝑥̅ > 8,500) = 1 − 𝑃(𝑧 < −2.47) = 1 − 0.0068 = 0.9932. Value obtained from Table A (Areas under
the Standard Normal Distribution).
Therefore, the probability that the 35 randomly chosen LED bulbs will last for more than 8,500 hours on the
average is 99.32%.

Learning Competencies:

1. Illustrates the Central Limit Theorem. (M11/12SP-IIIe-2)


2. Defines the sampling distribution of the sample mean using the Central Limit Theorem.
(M11/12SP-III-3)
3. Solves problems involving sampling distributions of the sample mean. (M11SP-IIIe-f-1)

GENERAL DIRECTIONS: Read carefully the directions indicated in the following Activities and
Reflection, then provide what is needed or asked in each item. Write the solutions and answers
on a separate sheet of paper.

Activity 1: Identify the following.


1. States that the distribution of sample means approximates a normal distribution as the
sample size gets larger.
2. Drawing all the possible samples each of size n in a population, and further computing the
mean of each sample. The probability distribution of these means is called …….
3. It is a summary measure to describe the characteristic of the whole population.
4. It is considered the sufficient sample size for the CLT to hold.
5. It is the corresponding value of a statistic of a normally distributed sample in the standard
normal distribution.

Activity 2: Fill the table with the correct answer.

The table below consists of 6 people aged 25 years old, where the response ‘yes’ indicates that the
individual has traveled abroad at least once during the past 3 years. While the response ‘no’
indicates that the person has no overseas trip.

6
All the possible sample sizes of 4 and the sampling distribution of ‘yes’ responses.
Combinations (Groups of
Person Response Values Mean (YES)
4)
A YES 1. A, B, C, D YES NO NO YES 2
B NO 2.
C NO 3.
D YES 4. A, B, D, E YES NO YES NO 2
E NO 5.
F YES 6.
7.
8.
9. A, C, D, F YES NO YES YES 3
10.
11.
12. B, C, D, E NO NO YES NO 1
13.
14.
15. B, D, E, F NO YES NO YES 2
16.
Activity 3:
Directions: Solve the worded problems with COMPLETE SOLUTIONS and ANSWERS.

1. Here say, you randomly selected a sample of 100 measurements from a normally distributed
population, having a mean of 68, and standard deviation of 7.

a.) Find the mean and the standard deviation of the sampling distribution of the sample
mean.
b.) What is the probability that the mean of the sample is greater than 68?
c.) What is the probability that the mean of the sample is less than 67?

2. The scores of the Senior High School entrance exam in a DEPED high school in 2014 has a
normal distribution, with a mean of 65, and a standard deviation of 3.2.

a.) What is the probability that a student randomly chosen scored more than 70?
b.) What are the mean and standard deviation of the sample mean score of one hundred
students (who took the test) that were randomly chosen?

3. Suppose the age of cars on the road is normally distributed with a mean of 7.2 years. If the
standard deviation is known to be 2.1 years, what is the probability that 12 randomly selected
cars have been on the road for between 6 and 8 years?

Activity 3: Rubrics
CATEGORY 4 3 2 1
Student shows Student shows Student shows very
Student shows some
complete substantial limited understanding
understanding of the
Mathematical understanding of the understanding of the of the underlying
mathematical concepts
Concepts mathematical concepts mathematical concepts concepts needed to
needed to solve the
used to solve the used to solve the solve the problem(s) or
problem(s).
problem(s). problem(s). no knowledge.
Almost all (85-89%) of Most (75-84%) of the More than 75% of the
90-100% of the steps
Mathematical the steps and solutions steps and solutions steps and solutions
and solutions have no
Errors have no mathematical have no mathematical have mathematical
mathematical errors.
errors. errors. errors.

7
Sometimes uses an
Typically, uses an
Typically, uses an effective strategy to Rarely uses an
Strategy efficient and effective
effective strategy to solve problems but effective strategy to
/Procedures strategy to solve the
solve the problem(s). does not do it solve problems.
problem(s).
consistently.

The work appears


The work is presented The work is presented The work is presented sloppy and
Neatness and in a neat, clear, in a neat and in an organized unorganized. It is hard
Organization organized fashion that organized fashion that fashion but may be to know what
is easy to read. is usually easy to read. hard to read at times. information goes
together.

All but one of the All but two of the Several of the
All problems are
Completion problems are problems are problems are not
completed.
completed. completed. completed.

Total:

Reflection: What have you learned from the topics and how can you relate them to real life situation?

References:

Simon L. Chua, et al., Soaring 21st Century Mathematics – Statistics and Probability (K to 12), Quezon
City: The Phoenix Publishing House Inc., copyright 2016, Chapter 3, pages 2-36

Jericho B. Bacani, et al., Statistics and Probability for Grade 11, Quezon City: EPHESIANS Publishing
Inc., copyright 2018, pages 53-74

Akhilesh Ganti, September 13, 2019, https://www.investopedia.com/ terms/c/central_limit_theorem.asp

Learning Resource Portal, https://lrmds.deped.gov.ph/search?filter=&search_param=all&query=


statistics&education_use_type_id=8&page=3

Solutions to Central Limit Theorem Problems , http://btravers.weebly.com/uploads/6/7/2/9/6729909/


central _limit_theorem_problems_solutions.pdf

Deped Tambayan. “Rubric Assessment Tool for Problem Solving in Math”


https://drive.google.com/open?id=1TYVrymZuZoxhmb_jXd7xH0VzFUtM_CZ1

Z Table, https://www.ztable.net/

Z-score 5.

8
Z TABLE (Table A)

Negative Z score table

Use the negative Z score table below to find values on the left of the mean
as can be seen in the graph alongside. Corresponding values which are
less than the mean are marked with a negative score in the z-table and
represent the area under the bell curve to the left of z.

9
Positive Z score table
Use the positive Z score table below to find values on the right of the
mean as can be seen in the graph alongside. Corresponding values
which are greater than the mean are marked with a positive score in
the z-table and represent the area under the bell curve to the left of z.

10

You might also like