Professional Documents
Culture Documents
Q4 W1 Focused and Consolidated With Annotation
Q4 W1 Focused and Consolidated With Annotation
Probability
Quarter 4 Module 1:
Testing Hypothesis
2. What is the average daily usage of social media of her friends? Compare
it with the previous average usage.
3. Which of the two claims could probably be true? Why?
4. If Sofia computed the average daily internet usage of her friends to be
higher than the global survey, do you think it would be significantly
higher?
5. What is your idea of an average value being significantly higher than the
global average value?
6. What do you think is the difference between simple comparison of data
and hypothesis testing?
What Is It
Here are the examples of questions you can answer with a hypothesis test:
Does the mean height of Grade 12 students differ from 66 inches?
Do male and female Grade 7 and Grade 12 students differ in height
on average?
Note: You can think of the null hypothesis as the current value of the
population parameter, which you hope to disprove in favor of your
alternative hypothesis.
6
Take a look at this example.
The school record claims that the mean score in Math of the incoming
Grade 11 students is 81. The teacher wishes to find out if the claim is true.
She tests if there is a significant difference between the batch mean score
and the mean score of students in her class.
Solution:
Let be the population mean score and be the mean score of
students in her class.
You may select any of the following statements as your null and
alternative hypothesis as shown in Option 1 and Option 2.
Option 1:
: The mean score of the incoming Grade 11 students is 81 or = 81.
: The mean score of the incoming Grade 11 students is not 81 or 81.
Option 2:
: The mean score of the incoming Grade 11 students has no significant
difference with the mean score of her students or = .
: The mean score of the incoming Grade 11 students has a significant
difference with the mean score of her students or .
formulate two hypotheses about the global average usage ( ) and the average
usage of her friends ( ) on the blanks provided below.
: _____________________________________________
: _____________________________________________
You can verify your answer to your teacher and start working on the next
activity.
Here is another key term you should know!
Level of Significance
The level of significance denoted by alpha or refers to the degree of
significance in which we accept or reject the null hypothesis.
100% accuracy is not possible in accepting or rejecting a hypothesis.
is also the probability of making the wrong decision
when the null hypothesis is true.
Do you know that the most common levels of significance used are 1%, 5%,
or 10%?
Some statistics books can provide us table of values for these levels of
significance.
7
Take a look at this example.
Maria uses 5% level of significance in proving that there is no
significant change in the average number of enrollees in the 10 sections for
the last two years. It means that the chance that the null hypothesis ( )
would be rejected when it is true is 5%.
If Sofia used a 0.10 level of significance, what are the chances that she
would have a wrong conclusion if the two values have no significant
difference?
8
However, if the school registrar believes that the average number of enrollees
this school year is less than the previous school year, then you will have:
:
:
On the other hand, if the school registrar believes that the average number
of enrollees this school year is greater than the previous school year, then
you will have:
:
:
Now back to the two claims of Sofia, what do you think should be the type of
test in her following claims?
Claim A: The average daily usage of social media of her friends is
the same as the global average usage.
Non-Rejection
Region Rejection Region
Critical Value
Illustrative Example 1:
She computed for the t-value using the formula where = 142,
= 152, s = 19.855, and n = 10.
Use a scientific
This t-test formula calculator to
was discussed in verify the
the last chapter. computed t-
value.
10
From the table of t-values, determine the critical value. Use df = n-1 = 9,
one-tailed test at 5% level of significance.
The critical t-value is 1.833.
How did we get that value?
Look at this illustration!
Now, you can sketch a t distribution curve and label showing the rejection
area (shaded part), the non-rejection region, the critical value, and the
computed t-value. This is how your t distribution curve should look like!
Rejection
Region
Non-Rejection
Region
1.593 1.833
(Computed Value) (Critical Value)
11
Illustrative Example 2:
A medical trial is conducted to test whether or not a certain drug reduces
cholesterol level. Upon trial, the computed z-value of 2.715 lies in the
rejection area.
Illustrative Example 3:
Sketch the rejection region of the test hypothesis with critical values of
and determine if the computed t-value of 1.52 lies in that region.
Solution:
Draw a t-distribution curve. Since there are two critical values, it is a two
tailed test. Locate the critical values and shade the rejection regions.
Now, locate the computed t-value of 1.52. You can clearly see that it is not
at the rejection region as shown in the following figure. The computed t-value
is at the non-rejection region. Therefore, we fail to reject the null hypothesis,
.
1.52
1.753 1.753
(critical value) (critical value)
12
Type I and Type II Errors
Region where is
false
To summarize the difference between the Type I and Type II errors, take a
look at the table below.
13
Now, complete the statements that follow.
Type I
Error, Type II Error, or a Correct Decision.
1. true and she fails to reject it, then she commits a ____________________.
2. true and she rejects it, then she commits a _____________________.
3. false and she fails to reject it, then she commits a __________________.
4. false and she rejects it, then she commits a _____________________.
Your answers should be: 1) Correct Decision, 2) Type I Error, 3) Type II
Error, and 4) Correct Decision.
Illustrative Example:
The Type I error is the first statement because he rejected the true
null hypothesis.
14
15. If the computed z-value is 1.915 and the critical value is 1.812, which of
the following statements could be true?
A. It lies in the rejection region, must be rejected.
B. It lies in the rejection region, hence we fail to reject .
C. It lies in the non-rejection region, must be rejected.
D. It lies in the non-rejection region, hence we fail to reject .
Additional Activities
A medical trial is conducted to test whether or not a certain drug can treat a
certain allergy. Upon trial, the t-value is computed as 1.311. Sketch and
complete the table below to discuss the findings of the medical trial.
23
Statistics and
Probability
Quarter 4 Module 2:
Identifying Parameters for
Testing in Given Real-Life
Problems
Activity 2: Grouping!
Directions: Group the following symbols into two. Place the first group
inside Box A and the second group in Box B.
A B
Guide Questions:
1. What are the symbols that you placed in Box A? Box B?
2. How did you categorize each symbol or notation?
3. What mathematical principle did you consider in answering the
activity?
4. Which symbols seemed to be familiar to you and which are not?
What Is It
5
Different symbols are used to denote parameters. Based on Activity 2,
symbols are grouped as indicated in the table below.
Measure Statistic Parameter
(x-bar) (myu)
(sigma squared)
(sigma)
(p hat)
In this claim, there are different parameters used but the parameter
to be tested in this hypothesis would be the average allowance of Senior
High School students since it relates to the population, not in sample.
Statistical hypothesis is a conjecture abo
why you will look for the population mean, population standard deviation, or
population proportion but not sample mean.
6
Activity 3. Translate It!
Directions: Determine the notation of the given parameter, inequality
symbol, or value of the parameter.
1. The television habits of children were observed and found out that the
standard deviation is 12.4 hours per week.
2. A newspaper article stated that students in the country take an average
of 4 years to finish their undergraduate degrees. Suppose that you
believe the mean time is longer, you conducted survey on 49 students.
The result obtained a sample mean of 5 with a sample standard deviation
of 1.2.
3. According to DOLE, registered nurses in government earned an average
monthly salary of 9,700. For that same year, a survey was conducted on
41 registered nurses to determine if the mean salary is higher than the
previous survey. The sample average was 10,000 with a sample
standard deviation of 2,500.
4. Records of the Department of Health (DOH) revealed that 14.7% of the
country's Filipino smokers have maintained their habit of smoking.
7
11
What I Know What's In
1. B 10. C Activity 1 Additional Activities
2. D 11. D 1. A
2. B Activity 6
3. D 12. D 3. B
4. C 13. A 1. average life of
4. D
2,600 hours (µ)
5. D 14. C 5. D
6. C 15. D 2. average price of
7. D What's New Honda Vios is at
8. B Activity 2 least 662,000.00
A or B and vice versa (µ)
9. D
{
{
Assessment
Activity 3 Activity 5
1. 1. 1. C
2. A
2. 2.
3. C
3. 3.
4. A
4. 4. p = 0.147
5. B
5.
6. C
7. B
Activity 4 8. B
1. 9. D
habits hours per week is 12.4 10.B
2. an average of 4 years to finish undergraduate 11.C
degrees 12.C
3. an average monthly salary of registered 13.D
14.B
government nurse is 9,700
15.A
4.
smokers maintain their smoking habits
Answer Key
evidence to reject the dealer s claim at
640,000.00 and standard deviation of 24,000.00. Is there enough
that random sample of 15 similar vehicles has the mean price of
Statistics
Quarter 4 Module 3:
Formulating Appropriate Null
and Alternative Hypotheses on a
Population Mean
Guide Questions:
What Is It
In statistical hypothesis testing, there are always two hypotheses: the null
and alternative hypotheses. Below is a comparison between the two.
6
Hypothesis-Testing Common Phrases
is equal to is not equal to
is the same as is not the same
is exactly the same as is different from
has not changed from has changed from
is increased is decreased
is greater than is less than
is higher than is lower than
is above is below
is bigger than is smaller than
is longer than is decreased or reduced from
is more than is not more than
is at least is at most
is not less than is not more than
is greater than or equal to is less than or equal to
The claim used the word less than which as seen in the table above,
corresponds to the symbol . Therefore, the answer is n<20.
Note:
always has = symbol in it. never has an = symbol in it. The choice of
symbol depends on the wording of the hypothesis test. However, be aware
that many researchers use = (equal sign) in the null hypothesis, even with
> or < as the symbol in the alternative hypothesis. Notice also that the
notation of alternative hypothesis complements the null hypothesis.
Illustrative Examples:
Solution: First, identify the parameter which is the mean height of all
Grade 11 students. Since it is a population mean, use the notation .
The claim in this example is that the average weight is 169 cm which
translates to and is considered as null hypothesis. To formulate
7
the alternative hypothesis, write the complement/opposite of the null
hypothesis which is the average weight is not equal to 169 cm.
Solution: In this example, the parameter is the average and the claim
is that the average is at least 730,000. The word at least has the
notation of which means that the claim is at the null hypothesis. In
8
the alternative hypothesis, you will use (<) as its complement.
Therefore:
or (claim)
5.
time is at most 240 minutes per day, on average. Another survey
was conducted to find whether the claim is true. The group took a
random sample of 30 students and found a mean study time of 300
minutes with standard deviation of 90 minutes. What are the null
and alternative hypotheses?
Solution: The parameter used in this example is average (µ) and the
claim is that average is at most 240 minutes. The word has
the notation of which means that claim is at the null hypothesis.
The null hypothesis would be . To formulate the alternative,
use the notation as the complement of . Therefore, alternative
hypothesis is .
or (claim)
On the other hand, some hypotheses predict only that one value will
be different from another, without additionally predicting which will be
higher. The test of such a hypothesis is nondirectional or two-
tailed because an extreme test statistic in either tail of the distribution
(positive or negative) will lead to the rejection of the null hypothesis of no
difference.
One-Tailed Two-Tailed
Alternative hypothesis contains Alternative contains the
the greater than (>) or less than symbol.
(<) symbols
It is directional (either right-tailed It has no direction.
or left-tailed)
9
The table below shows the null and alternative hypotheses stated
together with the directional test.
2. A piggery owner believes that using organic feeds on his pigs will
yield greater income. His average income from the previous year
was 120, 000. State the hypothesis and identify the directional
test.
In this example, the null hypothesis is . You may
notice that the hypothesis used the phrase greater income that is
associated with greater than. Therefore, . This
hypothesis uses inequality symbol so it is one-tailed test and it uses
greater than which specifically called for the right-tailed test.
and (right-tailed test)
10
Activity 4. One-Tailed or Two-Tailed!
1. A used car dealer says that the mean price car in the Philippines is at
least 350,000.
Activity 5. Formu-Tail
13
4. The average price of a certain type of car is greater than 600,000.
_________________ _________________ _______- tailed test
6. A study claims that the mean survival period for certain cancer patients
treated immediately with chemotherapy and radiation is 24 months.
_________________ _________________ _______- tailed test
7. The average pre-school cost for tuition fees last year was 15,500. The
following year, 20 schools had a mean of 13, 100 and standard
deviation of 2,500.
_________________ _________________ _______- tailed test
9. The principal of Mabundok High School claims that the students in his
IQ scores have a mean score of 113. The mean population IQ is 100 with
a standard deviation of 15. Is there an evidence to support his claim?
________________ __________________ _______-tailed test
10. The owner of BYD manufacturer claims that their batteries last an
average of at most 350 hours under normal use. A researcher randomly
selected 20 batteries from the production line and tested them. The
tested batteries had a mean life span of 270 hours with a standard
deviation of 50 hours.
________________ __________________ _______-tailed test
14
19
What I Know
1. C 11. D What's In Activity 2
2. A 12. A 1. Mean,
3. C 13. C Activity 1 2. ,
4. C 14. D
5. B 15. D 1. A 3. Average,
6. B 2. B 4. mean weight time is
7. C 3. D at most 8.7,
8. C 4. C
9. D 5. A 5. ,
10. A
Activity 3 Activity 5 Additional
1. , 1. , Activities
two-tailed 1. a. ,
2. , 2. or
,
right-tailed
3. , b. Right-tailed test
3. or
2. a. ,
4. , left-tailed ,
4. or
b. Left-tailed test
5. , right-tailed
5. , Assessment
two-tailed
Activity 4 1. B
6.
two-tailed 2. D
1. ONE
7. , 3. B
2. ONE
two-tailed 4. D
3. TWO
5. A
4. ONE 8. or
6. C
5. ONE
7. D
left-tailed 8. A
9. 9. B
10.A
right-tailed 11.C
10. or 12.D
13.A
right-tailed 14.C
15.A
Answer Key
Statistics and
Probability
Quarter 4 Module 4:
Identifying Appropriate Test
Statistics Involving Population
Mean
What Is It
Example:
Now you already know how to get the data needed in choosing test
statistics. This time, you will determine what test statistic is appropriate in
computing test value in the hypothesis testing.
7
A test statistic is a random variable that is calculated from sample
data and used in a hypothesis test. You can use test statistics to determine
whether to reject or accept the null hypothesis. The test statistic compares
your data with what is expected under the null hypothesis.
To identify the test statistic, you must consider whether the
population standard deviation/variance is known or unknown. If the
population standard deviation is known, then the mean has a normal
distribution. Use z-test. If the population standard deviation is unknown,
then the mean has a t- distribution. Use t-test. Instead of the population
standard deviation, use the sample standard deviation.
z-test
In a z-test, the sample is assumed to be normally distributed. A z-score
is calculated with population parameters such as
and . It is used to validate a
hypothesis that the sample drawn belongs to the same population. When the
variance is known and either the distribution is normal or sample size is
large, use a z-test statistic.
t-test
Like a z-test, a t-test also assumes a normal distribution of the
sample. A t-test is used when the population variance or standard deviation
are not known. When the variance is unknown and a sample size is less
than 30, use a t-test statistic assuming that the population is normal or
approximately normal.
8
The table shows what test statistic is appropriate when:
Population Variance Is Population Variance Is Central Limit Theorem
Known Unknown (CLT)
Population is normal or Population may not be
Population is normally
nearly normally normally distributed.
distributed.
distributed.
or considered
sufficiently large
Population standard Sample standard
Variance is known/
deviation ( ) is known. deviation (s) is known.
unknown.
Population standard
deviation ( ) is unknown.
Use z-test by replacing
population standard
z-test t-test deviation ( by sample
standard deviation in
the formula.
Identifying Appropriate Test Statistic
Illustrative Examples:
1. A manufacturer claimed that the average life of batteries used in their
electronic games is 150 hours. It is known that the standard deviation of
this type of battery is 20 hours. A consumer wished to test the
the battery. It was found out that the mean is equal to 144 hours.
Here, the sample size (n) is 100 (extremely large) and population
standard deviation (20 hours) is known, then the appropriate test
statistic to be used is z-test.
9
The sample size (n) is 12 which is less than 30 and sample
standard deviation (5 words per minute) was given. Therefore, the
appropriate test is t-test.
Note:
The illustrative examples above used standard deviations instead of
variances. Variance is the square of the standard deviation and conversely,
the standard deviation is the square root of the variance. Hence, if the
standard deviation is known in the problem, then basically, variance is also
known.
.
2. An electric lamps manufacturer is testing a new production method that
will be considered acceptable if the lamps produced by this method result
in a normal population with an average life of 1,300 hours and a
standard deviation equal to 120. A sample of 100 lamps produced by this
method has an average life of 1,250 hours.
10
students. Among the sampled students, the average IQ is 108 with a
standard deviation of 10.
5. A new energy-efficient lawn mower engine was developed by a well-known
inventor. He claims that the engine will run continuously for 5 hours on
a single gallon of regular gasoline. From his stock of 2,000 engines, the
inventor selects a simple random sample of 50 engines for testing. The
engines run for an average of 295 minutes with a standard deviation of
20 minutes.
Activity 4. Check It Out!
Directions: Read and analyze each problem. On the table below, put a
check on the columns of the criteria that correspond to the given problem.
11
is known. is unknown. z-test t-test
1.
2.
3.
4.
5.
___________2. Based on the report of the school nurse, the average height of
Grade 11 students has increased. Five years ago, the average height of
Grade 11 students was 170cm with standard deviation of 38cm. She took a
random sample of 150 students and derived the average height of 165cm.
12
17
Activity 4
is is z-test t-test
known unknown
1.
2.
3.
4.
5.
Activity 5
1. t-test
2. z-test
3. t-test
4. t-test
5. z-test
Assessment
Additional
Activities 1. B
2. D
Activity 6 3. B
1. a. df=11 4. C
b. t-test 5. A
6. C
2. a. left-tailed 7. B
b. z-test 8. A
9. A
10. B
11. B
12. B
13. A
14. A
15. A
Answer Key