Professional Documents
Culture Documents
Chapter 10 - 2 - 2
Chapter 10 - 2 - 2
Chapter 10 - 2 - 2
INTRODUCTION TO
STATISTICS & PROBABILITY
Take SRS of 20 eruptions from the population and calculate LSRL. How does the slope
of the sample regression line (LSRL) relate to the slope of the population regression
line? (green points in each graph are the selected points)
my = b 0 + b 1x
The intercept b 0 , the slope b 1 , and the standard deviation s of y are the
unknown parameters of the regression model. We rely on the random sample
data to provide unbiased estimates of these parameters.
➢ The value of ŷ from the least-squares regression line is really a prediction of the
mean value of y (m y) for a given value of x.
➢ The least-squares regression line (ŷ = b0 + b1x) obtained from sample data is the
best estimate of the true population regression line (my = b0 + b1x).
sy
b1 = r ( ) = 0.40(8.75 / 3.95) = 0.886
sx
Before you can trust the results of inference, you must check the conditions for
inference one by one.
A line has been fit to data representing cholesterol readings for 28 individuals starting
a cholesterol-reducing drug. The computer provides the following output:
c. 0.6627 ± 0.1428.
Infants who cry easily may be more easily stimulated than others. This may be a sign of higher
IQ. Child development researchers explored the relationship between the crying of infants 4 to
10 days old and their later IQ test scores. A snap of a rubber band on the sole of the foot
caused the infants to cry. The researchers recorded the crying and measured its intensity by the
number of peaks in the most active 20 seconds. They later measured the children’s IQ at age 3
years using the Stanford-Binet IQ test. A scatterplot and Minitab output for the data from a
random sample of 38 infants is below.
• The scatterplot suggests a moderately positive linear relationship between crying peaks
and IQ.
The test statistic and P-value can be found in the Minitab output.
b1 1.4929
t= = = 3.07
SE b1 0.4870
The Minitab output gives P = 0.004 as the P-value
for a two-sided test. The P-value for the one-sided
test is half of this,
P = 0.002.
The P-value, 0.002, is less than our α = 0.05 significance level, so we have enough evidence to
reject H0 and conclude that there is a positive linear relationship between intensity of crying
and IQ score in the population of infants.
Copyright© Nahid Sultana 2017-2018 1/24/2023
Confidence Interval for Mean Response
23
We can also calculate a confidence interval for the population mean μy of all
responses y when x takes the value x* (within the range of data tested).
One use of regression is for predicting the value of y at some value of x within
the range of data tested. Reliable predictions require statistical inference.
Total n−1
Analysis of Variance
Source DF SS MS F P
Regression 1 1754.6 1754.6 34.60 0.000
Residual Error 13 659.2 50.7
Total 14 2413.7
What is the value of s, the estimated standard deviation about the regression line?
Example
A realtor is trying to assess the prices of homes in a new
development. She wants to know if the age of the house (x)
can explain the selling price (y) of a home, in thousands of
dollars.
x 1 2 3 4 5 6 7 8 9 10
y 245 180 200 200 171 120 115 69 60 47
Ans: 8
Example
For the below ANOVA table, what is the missing
value?
Source Df SS MS
Regression 5 6,500 1,300
Error 94 ? 37.234
Total ? 10,000
ANOVA
df SS MS F
Regression 1 17200
Error 15 3400
Total 16 20600
Ans: 75.88