AP Review Part IV - Harrison

AP Review IV - Hypothesis Tests/Confidence Intervals (30% – 40%)
I. Why do we do statistics? To answer a question. So the first step is to decide what question we’re trying to
answer. If we don’t have a burning question, then statistics is not needed. Now the only way to really know the
truth is to study the entire population, but we don’t have time for that. So studying a sample to answer our
question works for us.
II. Confidence Interval or Hypothesis Test?
 Confidence Interval – When you don’t know the population characteristic you want and you wish you
did, this is your choice. Used to estimate the population parameter. The CI is an interval of plausible
values that we hope will capture the value of the population characteristic. Helps us to get an idea of
what we want to know.
 Hypothesis Test – When you have a set standard and want to know if your sample data meets that
standard (or not), this is your choice. Helps to settle arguments of the type “yes, it does”, “no it
doesn’t”, “yes, it does”, “no it doesn’t”. You should expect a bit of a difference, but a hypothesis test
will help you decide if the difference is significant.
III. Questions to ask:

1. Will the data be categorical or quantitative?
2. How many samples are we dealing with? How many variables?
3. What statistics do we need to carry out our test or make a confidence interval?
4. What procedures must we follow so our answer will be credible?
5. Most important: What question are you trying to answer?
Are the data categorical or quantitative?

proportions
means
categorical quantitative
How many samples? How many samples?
1 2 1 2
1-proportion z-test How many

2-proportion z-test 2-sample t-test
or variables?
or
χ2 Goodness of Fit test χ2 Test of Homogeneity
or
2
χ Test of Independence
1 2
t-test linear
or regression
paired t-test t-test
TO RECEIVE FULL CREDIT FOR A HYPOTHESIS TEST YOU MUST:
1) Write the null and alternative hypothesis and define each variable
2) Write which test you are using in words or with the appropriate formula and why you chose that test
3) Write and check all conditions for that test
4) Give the test statistic and the p-value and df , if applicable
5) Reject or fail to reject Ho based on the p-value (or critical value)
6) Write a conclusion in terms of the problem
You either have enough evidence to claim whatever the alternative hypothesis represents (reject H o)or you do
not have enough evidence to claim whatever the alternative hypothesis represents (fail to reject H o)
TO RECEIVE FULL CREDIT FOR A CONFIDENCE INTERVAL YOU MUST:

1) Correctly identify the type of interval by name or formula
2) Write and check all conditions for that interval
3) Correctly calculate the interval – show work
4) Correctly interpret the interval in terms of the problem
5) You may also be required to correctly interpret the confidence level
NOTE: All confidence intervals have the same assumptions of the corresponding hypothesis test.
*******************************************************************
Type I errors, Type II errors, and Power
Type I error – (α ) – when Ho is true but you go with Ha.

Type II error – (β) – when the alternative, Ha, is true but you go with Ho
Type I and Type II errors are inversely related; as one increases the other decreases.
Power = 1 – β, so Type II errors and power are inversely related. Type I errors and power are directly related.
Type I Error = α
Power = 1 – β
Type II Error = β
For each of the following scenarios, determine the type of inference procedure to use. Then, proceed with the
inference procedure. (will need to do this on another page)
1. (2011B #5) During a flu vaccine shortage in the United States, it was believed that 45 percent of vaccine-eligible
people received a flu vaccine. The results of a survey given to a random sample of 2,350 vaccine-eligible people
indicated that 978 of the 2,350 people had received flu vaccine.
(a) Construct a 99 percent confidence interval for the proportion of vaccine-eligible people who had received flu
vaccine. Use your confidence interval to comment on the belief that 45 percent of the vaccine-eligible people had
received flu vaccine.
(b) Suppose a similar survey will be given to vaccine-eligible people in Canada by Canadian health officials. A 99
percent confidence interval for the proportion of people who will have received flu vaccine is to be constructed. What
is the smallest sample size that can be used to guarantee that the margin of error will be less than or equal to 0.02?
Categorical or Quantitative?_____ Number of samples? ______ Number of variables? _____
Which inference procedure?_____________________________________________________
2. (2013 #1b) An environmental group conducted a study to determine whether crows in a certain region were ingesting
food containing unhealthy levels of lead. A biologist classified lead levels greater than 6.0 parts per million (ppm) as
unhealthy. The lead levels of a random sample of 23 crows in the region were measured and recorded. The data are
shown in the stemplot below.
The mean lead level of the 23 crows in the sample was 4.90 ppm and the standard deviation was 1.12 ppm. Construct and
interpret a 95 percent confidence interval for the mean lead level of crows in the region.
3. (2015 #4) A researcher conducted a medical study to investigate whether taking a low-dose aspirin reduces the chance
of developing colon cancer. As part of the study, 1,000 adult volunteers were randomly assigned to one of two groups.
Half of the volunteers were assigned to the experimental group that took a low-dose aspirin each day, and the other half
were assigned to the control group that took a placebo each day. At the end of six years, 15 of the people who took the
low-dose aspirin had developed colon cancer and 26 of the people who took the placebo had developed colon cancer. At
the significance level α = 0.05, do the data provide convincing evidence that taking a low-dose aspirin each day would
reduce the chance of developing colon cancer among all people similar to the volunteers?

4. (2011 #5c) Windmills generate electricity by transferring energy from wind to a turbine. A study was conducted to
examine the relationship between wind velocity in miles per hour (mph) and electricity production in amperes for one
particular windmill. For the windmill, measurements were taken on twenty-five randomly selected days, and the
computer output for the regression analysis for predicting electricity production based on wind velocity is given below.
The regression model assumptions were checked and determined to be reasonable over the interval of wind speeds
represented in the data, which were from 10 miles per hour to 40 miles per hour.
Is there statistically convincing evidence that electricity production by the windmill is related to wind velocity? Explain.
5. (1998 #5) A large university provides housing for 10 percent of its graduate students to live on campus. The
university's housing office thinks that the percentage of graduate students looking for housing on campus may be more
than 10 percent. The housing office decides to survey a random sample of graduate students, and 62 of the 481
respondents say that they are looking for housing on campus.
On the basis of the survey data, would you recommend that the housing office consider increasing the amount of housing
on campus available to graduate students? Give appropriate evidence to support your recommendation.
6. (2006 #4) Patients with heart-attack symptoms arrive at an emergency room either by ambulance or self-transportation
provided by themselves, family, or friends. When a patient arrives at the emergency room, the time of arrival is recorded.
The time when the patient’s diagnostic treatment begins is also recorded.
An administrator of a large hospital wanted to determine whether the mean wait time (time between arrival and diagnostic
treatment) for patients with heart-attack symptoms differs according to the mode of transportation. A random sample of
150 patients with heart-attack symptoms who had reported to the emergency room was selected. For each patient, the
mode of transportation and wait time were recorded. Summary statistics for each mode of transportation are shown in the
table below.
(a) Use a 99 percent confidence interval to estimate the difference between the mean wait times for ambulance-
transported patients and self-transported patients at this emergency room.
(b) Based only on this confidence interval, do you think the difference in the mean wait times is statistically significant?
Justify your answer.

7. (2006B #4) The developers of a training program designed to improve manual dexterity claim that people who
complete the 6-week program will increase their manual dexterity. A random sample of 12 people enrolled in the training
program was selected. A measure of each person’s dexterity on a scale from 1 (lowest) to 9 (highest) was recorded just
before the start of and just after the completion of the 6-week program. The data are shown in the table below.
Can one conclude that the mean manual dexterity for people who have completed the 6-week training program has
significantly increased? Support your conclusion with appropriate statistical evidence.
8. (2013 #4) The Behavioral Risk Factor Surveillance System is an ongoing health survey system that tracks health
conditions and risk behaviors in the United States. In one of their studies, a random sample of 8,866 adults answered the
question “Do you consume five or more servings of fruits and vegetables per day?” The data are summarized by response
and by age-group in the frequency table below.
Do the data provide convincing statistical evidence that there is an association between age-group and whether or not a
person consumes five or more servings of fruits and vegetables per day for adults in the United States?

9. (2009 #5 b & c) For many years, the medically accepted practice of giving aid to a person experiencing a heart attack
was to have the person who placed the emergency call administer chest compression (CC) plus standard mouth-to-mouth
resuscitation (MMR) to the heart attack patient until the emergency response team arrived. However, some researchers
believed that CC alone would be a more effective approach.
In the 1990s a study was conducted in Seattle in which 518 cases were randomly assigned to treatments: 278 to CC plus
standard MMR and 240 to CC alone. A total of 64 patients survived the heart attack: 29 in the group receiving CC plus
standard MMR, and 35 in the group receiving CC alone. A test of significance was conducted on the following
hypotheses.
H0: The survival rates for the two treatments are equal.
Ha: The treatment that uses CC alone produces a higher survival rate.
This test resulted in a p-value of 0.0761.
(b) Based on the p-value and study design, what conclusion should be drawn in the context of this study? Use a
significance level of α = 0.05.
(c) Based on your conclusion in part (b), which type of error, Type I or Type II, could have been made? What is one
potential consequence of this error?
Multiple Choice Practice:

10. Automobile manufacturer claims that the average gas mileage of a new model is 35 miles per gallon (mpg).
A consumer group is skeptical of this claim and thinks the manufacturer may be overstating the average gas
mileage. If µ represents the true average gas mileage for this new model, which of the following gives the
null and alternative hypotheses that the consumer group should test?
(A) H 0 : μ<35 mpg (B) H 0 : μ ≤35 mpg (C) H 0 : μ=35 mpg
H a : μ ≥35 mpg H a : μ>35 mpg H a : μ>35 mpg
(D) H 0 : μ=35 mpg (E) H 0 : μ=35 mpg

H a : μ<35 mpg H a : μ ≠35 mpg
11. Which of the following is a criterion for choosing a t-test rather than a z-test when making an inference
about the mean of a population?
(A) The standard deviation of the population is unknown.
(B) The mean of the population is unknown.
(C) The sample may not have been a simple random sample.
(D) The population is not normally distributed.
(E) The sample size is less than 100.
12. A large-sample 98 percent confidence interval for the proportion of hotel reservations that are canceled on the
intended arrival day is (0.048, 0.112). What is the point estimate for the proportion of hotel reservations that are canceled
on the intended arrival day from which this interval was constructed?
(A) 0.032
(B) 0.064
(C) 0.080
(D) 0.160
(E) It cannot be determined from the information given.
13. When using a one-sample t-procedure to construct a confidence interval for the mean of a finite population, a
condition is that the population size be at least 10 times the sample size. The reason for the condition is to ensure that
(A) the sample size is large enough

(B) the central limit theorem is applicable for the sample mean
(C) the sample standard deviation is a good approximation of the population standard deviation
(D) the degree of dependence among observations negligible
(E) the sampling method is not biased.
14. A random sample of 50 students at a large high school resulted in a 95 percent confidence interval for the mean
number of hours of sleep per day of (6.73, 7.67). Which of the following statements best summarizes the meaning of this
confidence interval?
(A) About 95% of all random samples of 50 students from this population would result in a 95% confidence interval
of (6.73, 7.67).
(B) About 95% of all random samples of 50 students from this population would result in a 95% confidence interval
that covered the population mean number of hours of sleep per day.
(C) 95% of the students in the survey reported sleeping between 6.73 and 7.67 hours per day.
(D) 95% of the students in this high school sleep between 6.73 and 7.67 hours per day.
(E) A student selected at random from this population sleeps between 6.73 and 7.67 hours per day for 95% of the
time.
15. In order to plan its next advertising campaign, the Trendy Motor Vehicle company is investigating whether the type
of vehicle and the color of vehicle are related. Each person in a random sample of size 275 selected from the company’s
mailing list was classified according to the type (car or truck) and the color of vehicle he or she drove. The data are
shown in the table below.
Which of the following procedures would be most appropriate to use for investigating whether there is a relationship
between vehicle type and color?
(A) A two-sample t-test

(B) A two-sample z-test
(C) A matched pairs t-test
(D) A chi-square goodness-of-fit test
(E) A chi-square test of independence
16. A random sample of 432 voters revealed that 100 are in favor of a certain bond issue. A 95 percent confidence
interval for the proportion of the population of voters who are in favor of the bond issue is
(A) 100 ± 1.96

√ 0.5( 0.5)
432
(B) 100 ± 1.645
√ 0.5(0.5)
432
(C) 100 ± 1.96
√ 0.231(0.769)
432
(D) 0.231 ±1.96

√ 0.231(0.769)
432
(E) 0.231 ±1.645
√ 0.231(0.769)
432
16. As part of a class project at a large university, Amber selected a random sample of 12 students in her major field of
study. All students in the sample were asked to report their number of hours spent studying for the final exam and their
score on the final exam. A regression analysis on the data produced the following partial computer output.
Amber wants to compute a 95 percent confidence interval for the slope of the least squares regression line in the
population of all students in her major field of study. Assuming that conditions for inference are satisfied, which of the
following gives the margin of error for the confidence interval?
(A) (2.228)(0.745) (B) (2.228) ( 0.745

√ 12 )
(C) (2.228)(5.505)
(D) (2.228) ( 5.505

√ 12 )
(E) (2.228)(2.697)
17. Perchlorate is a chemical used in rocket fuel. People who live near a former rocket-testing site are concerned that
perchlorate is present in unsafe amounts in their drinking water. Drinking water is considered safe when the average level
of perchlorate is 24.5 parts per billion (ppb) or less. A random sample of 28 water sources in this area produces a mean
perchlorate measure of 25.3 ppb. Which of the following is an appropriate alternative hypothesis that addresses their
concern?
(A) Ha : µ < 25.3 (B) Ha : µ > 25.3 (C) Ha : µ < 24.5
(D) Ha : µ > 24.5 (E) Ha : µ ≠ 24.5
18. A manufacturer claims its Brand A battery lasts longer than its competitor’s Brand B battery. Nine batteries of each
brand are tested independently, and the hours of battery life are shown in the table below.
Provided that the assumptions for inference are met, which of the following tests should be conducted to determine if
Brand A batteries do, in fact, last longer than Brand B batteries?
(A) A one-sided, paired t-test

(B) A one-sided, two-sample t-test
(C) A two-sided, two-sample t-test
(D) A one-sided, two sample z-test
(E) A two-sided two sample z-test

AP Review Part IV - Harrison

Uploaded by

Document Information

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

AP Review Part IV - Harrison

Uploaded by

Copyright:

Available Formats

AP Review IV - Hypothesis Tests/Confidence Intervals (30% – 40%)

II. Confidence Interval or Hypothesis Test?

III. Questions to ask:

Are the data categorical or quantitative?

How many samples? How many samples?

1-proportion z-test How many

TO RECEIVE FULL CREDIT FOR A CONFIDENCE INTERVAL YOU MUST:

Type I error – (α ) – when Ho is true but you go with Ha.

Categorical or Quantitative?_____ Number of samples? ______ Number of variables? _____

Which inference procedure?_____________________________________________________

Categorical or Quantitative?_____ Number of samples? ______ Number of variables? _____

Which inference procedure?_____________________________________________________

Categorical or Quantitative?_____ Number of samples? ______ Number of variables? _____

Which inference procedure?_____________________________________________________

Which inference procedure?_____________________________________________________

Categorical or Quantitative?_____ Number of samples? ______ Number of variables? _____

Which inference procedure?_____________________________________________________

Categorical or Quantitative?_____ Number of samples? ______ Number of variables? _____

Which inference procedure?_____________________________________________________

Categorical or Quantitative?_____ Number of samples? ______ Number of variables? _____

Which inference procedure?_____________________________________________________

Categorical or Quantitative?_____ Number of samples? ______ Number of variables? _____

Which inference procedure?_____________________________________________________

Categorical or Quantitative?_____ Number of samples? ______ Number of variables? _____

Which inference procedure?_____________________________________________________

Multiple Choice Practice:

(D) H 0 : μ=35 mpg (E) H 0 : μ=35 mpg

(A) the sample size is large enough

(A) A two-sample t-test

(A) 100 ± 1.96

(D) 0.231 ±1.96

(A) (2.228)(0.745) (B) (2.228) ( 0.745

(D) (2.228) ( 5.505

(A) Ha : µ < 25.3 (B) Ha : µ > 25.3 (C) Ha : µ < 24.5

(D) Ha : µ > 24.5 (E) Ha : µ ≠ 24.5

(A) A one-sided, paired t-test

You might also like

Categorical or Quantitative?_ Number of samples? Number of variables? ___

Categorical or Quantitative?_ Number of samples? Number of variables? ___

Categorical or Quantitative?_ Number of samples? Number of variables? ___

Categorical or Quantitative?_ Number of samples? Number of variables? ___

Categorical or Quantitative?_ Number of samples? Number of variables? ___

Categorical or Quantitative?_ Number of samples? Number of variables? ___

Categorical or Quantitative?_ Number of samples? Number of variables? ___

Categorical or Quantitative?_ Number of samples? Number of variables? ___