Professional Documents
Culture Documents
7b Z-Test
7b Z-Test
and
The Z-test
Dr. Robert Randall
Z-Score
X = Raw Score
X − μ𝑋 𝝁𝑿 =
∑ 𝑿
z= = Population Mean for Distribution 𝑵
σ𝑋 of Individual Scores.
√
𝟐
= Population Standard Deviation for
𝝈𝑿 =
∑ (𝑿 − 𝑿)
Distribution of Individual Scores. 𝑵
Z-Test
= Sample Mean
X − μ X = Population Mean for Distribution 𝝁 𝑿 =𝝁 𝑿
z= of Sample Means.
σX
= Population Standard Deviation for
Distribution of Sample Means.
Example 1A Using Z-Test
Population Mean IQ is 100 and Population Standard Deviation is 16
Dr. Tom Petty wrote a book he claims will significantly Raise IQ Scores.
A researcher wants to test this book. She selects the 5% (α = 0.05) Level of
Significance to test the book. She selects 256 individuals and has them
read Dr. Petty’s Book. Then she administers an IQ exam to each of the 256
individuals and finds a sample mean of 102.
= = + 2.0
= = + 1.25
Dr. Kris Kristofferson has invented an Inversion Table he claims will increase Height.
A researcher wants to test this Inversion Table. She selects the 1% (α = 0.01) Level of
Significance for this hypothesis test. She selects 400 men and has each of them
use the inversion table for 12 months. At the end of the 12-month period, she measures
each man’s height and finds a sample mean of 70.6”.
Example 2 Using α = .01
Population Mean Female Height is 65” and Population Standard Deviation is 3.5”
Dr. Grace Slick has invented a Pill she claims will “make you 10 feet tall”.
Dr. Exene Cervenka does not believe it will make people 10 feet tall, but wants to test
whether it will make women any taller. She selects the 5% (α = 0.05) Level of
Significance for this hypothesis test. She selects 100 women and has each of them
take the pill once per day for 6 months. At the end of the 6-month period, she measures
each woman’s height and finds a sample mean of 65.7”.
Example 3 Using α = .05
Population Mean Sat Score is 500” and the Population Standard Deviation is 100.
Dr. Riff Raff has created a study program he claims will increase SAT scores.
A researcher tests whether this program works. She selects the 1% (α = 0.01) Level of
Significance for this hypothesis test. She selects 1600 individuals and has each of them
complete the study program. Then, she has each take the SAT test and finds a sample
mean of 508.
Example 4 Using α = .01
Population Mean for Amateur Golfer is 90 and Population Standard Deviation is 28.
Dr. Anne Murray has developed a Golf Strategy she claims will lower golf scores.
A researcher wants to test this Golf Strategy. She selects the 5% (α = 0.05) Level of
Significance for this hypothesis test. She selects 144 Amateur Golfers has each of them
use Dr. Murray’s Golf Strategy for 8 months. At the end of the 8-month period, she
measures has each Amateur Golfer play 18 rounds of golf and finds a Mean Golf Score
of 81.
Example 5 Using α = .05
Best Single-Point Estimate = Sample Mean (e.g. 70.6” from Last Example)
But, Not Very Confident Treated Population Mean Exactly Equals Sample Mean.
The Confidence Interval is a Range of Scores for which we can be More Confident that
the True Population Mean Falls Somewhere within this Interval.
For this example, if we Fail to Reject Ho, the Population Mean of the Treated Individuals
is Equal to 70.0 (Ho: μ = 70); You Do Not Calculate a Confidence Interval.
(3) Once You Reject Ho, Then What?
Therefore, we know that the Population Mean for the Treated Individuals is NOT Equal
to 70, which is what it is Equal to for the Untreated Individuals.
We do NOT KNOW what it is Equal to, But we Can Estimate What it is Equal to.
(1) Digression, Male Height
Let’s say you were sitting in the classroom, you were blindfolded, a man entered the
room, and I asked you to estimate how tall he was.
Your Best Single-Point Estimate is the Population Mean for Male Height; 5’10” (70”).
However, you would not be very Confident that he was Exactly 5’10”.
Therefore, you could Create an Interval for which you would be More Confident that his
height was somewhere between a Range of Heights from a Height Below the Population
Mean to a Height Above the Mean.
For Example that his height was between 5’8” and 6’0”.
(2) Digression, Male Height
You would be More Confident (68.26% Confident) that his height was somewhere
between 5’6” and 6’2”. That is, that his height was between 1 SD Below the Population
Mean and 1 SD Above the Population Mean.
You would be Even More Confident (95.44% Confident) that his height was somewhere
between 5’2” and 6’6”. That is, that his height was between 2 SD Below the Population
Mean and 2 SD Above the Population Mean.
You would be Even MORE Confident (99.74% Confident) that his height was
somewhere between 4’8” and 6’10”. That is, that his height was between 2 SD Below
the Population Mean and 2 SD Above the Population Mean.
Distribution of Sample Means - 68.25% Confidence Interval
68.25%
Confidence
Interval
68.25%
Z = -1.0 Z = +1.0
Raw Score = 5’6” 5’10” 6’2”
Distribution of Sample Means - 95.44% Confidence Interval
95.44%
Confidence
Interval
95.44%
Z = -2.0 Z = +2.0
Raw Score = 5’2” 5’10” 6’6”
Distribution of Sample Means – 99.74% Confidence Interval
99.74%
Confidence
Interval
99.74%
Z = -3.0 Z = +3.0
Raw Score = 4’8” 5’10” 6’10”
Confidence Intervals
95%
Confidence
Interval
95%
Z = -1.96 Z = +1.96
Raw Score = 5’2.16” 5’10” 6’6.84”
Distribution of Sample Means – 99% Confidence Interval
99%
Confidence
Interval
99.74%
Z = -2.576 Z = +2.576
Raw Score = 4’9.75” 5’10” 6’6.92”
(1) Confidence Interval Using Example 2
On Previous Slide, you see that Ho was Rejected. Therefore, Conclude that the
Population Mean for the Treated Population is NOT 70”. We estimate that the
Population Mean for the Treated Population is somewhere around the Sample Mean.
Z = -1.96 Z = +1.96
Raw Score = 70.21” 70.6” 70.99”
Confidence Interval Using Example 2 – 99% Confidence Interval
Z = -2.576 Z = +2.576
Raw Score = 70.08” 70.6” 71.12”
(1) Confidence Interval Using Example 3
On Previous Slide, you see that Ho was Rejected. Therefore, Conclude that the
Population Mean for the Treated Population is NOT 65”. We estimate that the
Population Mean for the Treated Population is somewhere around the Sample Mean.
Z = -1.96 Z = +1.96
Raw Score = 65.01” 65.7” 65.69”
Confidence Interval Using Example 3 – 99% Confidence Interval
Z = -2.576 Z = +2.576
Raw Score = 64.8” 65.7” 65.9”
(1) Confidence Interval Using Example 4
On Previous Slide, you see that Ho was Rejected. Therefore, Conclude that the
Population Mean for the Treated Population is NOT 500. We estimate that the
Population Mean for the Treated Population is somewhere around the Sample Mean.
Z = -1.96 Z = +1.96
Raw Score = 503.1 508 512.9
Confidence Interval Using Example 4 – 99% Confidence Interval
Z = -2.576 Z = +2.576
Raw Score = 501.56 508 514.44
(1) Confidence Interval Using Example 5
Z = -1.96 Z = +1.96
Raw Score = 76.43 81 85.57
Confidence Interval Using Example 5 – 99% Confidence Interval
Z = -2.576 Z = +2.576
Raw Score = 74.99 81 87.01
Effect Size
(1) Effect Size
It has been argued that any difference between the Population Mean and the Sample
Mean can be found to be significant if the Sample Size is Large Enough.
For Example:
Population Mean IQ = 100, Population Standard Deviation = 16
Research selects sample of 1,000,000 and has them read her new book that
supposedly raises IQ. The Sample Mean for this sample of 1,000,000 people
is 100.1
= = + 6.25
; Therefore, is divided by
Then, you compare the Observed d-value (the one you calculate above) to critical
values of d to determine the Size of the Effect.
Small = 0.00 – 0.20; Medium = 0.21 – 0.79; Large = 0.80 and Above
(3) Effect Size
Note: When we Fail to Reject Ho, DO NOT CALCULATE A TREATMENT EFECT SIZE
because we know the Exact Value of the Treatment Effect Size; it is EQUAL TO ZERO,
there is NO TREATMENT EFFECT.
Then, you compare the Observed d-value (the one you calculate above) to
critical values of d to determine the Size of the Treatment Effect.
Small = 0.00 – 0.20; Medium = 0.21 – 0.79; Large = 0.80 and Above
(1) Effect Size For Example 3
Then, you compare the Observed d-value (the one you calculate above) to
critical values of d to determine the Size of the Treatment Effect.
Small = 0.00 – 0.20; Medium = 0.21 – 0.79; Large = 0.80 and Above
(1) Effect Size For Example 4
Then, you compare the Observed d-value (the one you calculate above) to
critical values of d to determine the Size of the Treatment Effect.
Small = 0.00 – 0.20; Medium = 0.21 – 0.79; Large = 0.80 and Above
(1) Effect Size For Example 5
Then, you compare the Observed d-value (the one you calculate above) to
critical values of d to determine the Size of the Treatment Effect.
Small = 0.00 – 0.20; Medium = 0.21 – 0.79; Large = 0.80 and Above
Power
(1) Calculating Power: Example 1A Using Z-Test Revisited
Let’s say the Population Mean for those who read Dr. Tom Petty’s Book is really
103, not 100, Meaning Ho is False (H1 is True).
(2) Calculating Power: Example 1A Using Z-Test Revisited
(1) Calculate the Raw Scores (e.g. IQ Scores) associated with the Z-Critical
Values Assuming Ho is True:
(2) Calculate the Z-Scores for these “Critical” Raw Scores Assuming Some Other
Population Mean (e.g. 103) Exists for the “Treated” Population:
(3) Look up these New Z-Values Assuming Some Other Population Mean
(e.g. 103) Exists for the “Treated” Population and Find the Area that
Falls Above These Values. This is Power:
Therefore, for this example, Power Drops from ,8508 using Alpha = .05 to
.5628 using Alpha = .01.
This means that as the Probability of Making a Type Error is Reduced from
Alpha = .05 (5% Probability) to Alpha = .01 (1% Probability), The Power of
Finding a Significant Difference when A significant Difference Truly Exists
is Reduced from Power = .8508 (85% Power) to Power = ,5628 (56% Power).
(6) Calculating Power: Example 1A Using Z-Test Revisited
Note: When the True Population Mean under Ho is Higher than the Population
Mean under H0, you Most Likely only Reject if the Sample Mean is Above the
Population Mean under Ho.
If the True Population mean under Ho is Lower than the Population Mean under
Ho, you Most Likely will only Reject if the Sample Mean is Below the Population
Mean under Ho.
Decision – 95%
Fail to Reject Ho Reject Ho
Remember,