Professional Documents
Culture Documents
Lecture10 Correlation
Lecture10 Correlation
Research Problem:
1
I. THE SCATTERPLOT
7
6
5
Y 4
(Score) 3
2
1
0
0 1 2 3 4 5 6 7 8
X (Hours Studying)
2
II. Pearson Correlation Coefficeint
Symbol: r
Closer to 0, weaker
3
Figure 16-3 (p. 524)
Examples of positive and negative relationships. (a) Beer sales are positively related to
temperature. (b) Coffee sales are negatively related to temperature.
4
Pearson Correlation Coefficient
-1.0 0 +1.0
Perfect No Linear Perfect
Negative Relationship Positive
Relationship Relationship
Closer to 0 = weaker
5
r = .90 r = -.40
r = .00
r = -1.0
6
What does r represent?
r = covariance of X & Y
variance of X & Y
SP
r= SS XSSY
SP = “Sum of Products”
SP = (X- X )(Y- Y )
SSX=(X- X )2
SSy=(Y- Y )2
7
VARIANCE INTERPRETATION OF r :
r 2 = “Coefficient of determination”
% of variance in Y predicted by X
8
III. Factors that affect the size of r
r 0 could mean many things:
No relationship at all between X & Y
Non-linear relationship between X & Y
Restricted range on X and/or Y
Outlier may be causing problem
Non-linear relationships
Curvilinear relationship
Restricted range
Outliers
9
Examples of how restricted range can distort a correlation
(a) In this example, the full range of X and Y values shows a strong, positive correlation, but the restricted
range of scores produces a correlation near zero.
(b) An example in which the full range of X and Y values shows a correlation near zero, but the scores in the
restricted range produce a strong, positive correlation.
10
Example of how an outlier can distort a correlation
A demonstration of how one extreme data point (an outlier) can influence the value of a correlation.
11
IV. CORRELATION VS. CAUSALITY:
X Y
Y X
X Third variable problem
Z
Y
example:
12
V. OTHER CORRELATION COEFFICIENTS
13