Professional Documents
Culture Documents
Linear Regression and Correlation
Linear Regression and Correlation
2
Definition: Dependent and Independent Variable
4
Scatter Plot (or scatter diagram)
5
Positive Linear Correlation
y y y
x x x
(a) Positive (b) Strong (c) Perfect
positive positive
6
Negative Linear Correlation
y y y
x x x
(d) Negative (e) Strong (f) Perfect
negative negative
7
No Linear Correlation
y y
x x
(g) No Correlation (h) Nonlinear Correlation
8
Example: Correlation
10
Pearson Correlation Coefficient
12
Different Level of the Strength of
Relationship
13
What does r represent?:
14
Solving for correlation coefficient
Sales Intelligence Sales (‘000 Tk.) X2 Y2 XY
Test Score (X) (Y)
1 45 2 2025 4 90
2 75 6.5 5625 42 488
3 50 3.5 2500 12 175
4 60 5 3600 25 300
5 80 4.5 6400 20 360
6 90 6 8100 36 540
7 85 6.5 7225 42 553
8 40 2.5 1600 6.3 100
9 80 5.5 6400 30 440
10 55 4.5 3025 20 248
∑X2 = ∑Y2= ∑XY=
N=10 ΣX=660 ΣY=46.5 46500 2239 3293
15
X . Y
XY
n
r
( X )
2
( Y )
2
X 2
Y 2
n n
660 46 . 5
3293
10
2 2
660 46 . 5
( 46500 )( 2239 )
10 10
0 . 869
Comment: Correlation coefficient value of 0.869 implies that
there exist a strong and positive statistical association
between intelligent test score of the sales person and their
sales performance.
16
Coefficient of Determination
17
Linear Regression Model
Regression analysis is a technique used to develop an
equation
to express the linear (straight line) relationship between two
variables and provide the estimates. The objective of this
analysis is to estimate a regression equation which is used to
estimate the change in the dependent variable due to any
change in independent variable.
18
a
Y b
X
n n
XY X .Y / n
b
X
2
X 2
n
b
XY X .Y / n and a
Y
b
X
X
2
n n
X
2
n
19
Computing the Slope of the Line
X
2
n
660 46 . 5
3293
10
2
660
46500
10
0 . 076
20
Computing the Y-Intercept
a
Y b
X
n n
46 . 5 660
( 0 . 076 )
10 10
0 . 366
Hence, the regression equation of sales on test score is
y=-0.366+0.076x. This regression equation can be explained
as if the intelligence test score increased by 1 marks, sales
performance is expected to increase by 0.076 thousand or
Tk. 76.
21
Predicting Values with Regression Equation
22
Difference Between Correlation and
Regression
23
Difference Between Correlation and
Regression
24
The Standard Error of Estimate
Y 2 a Y b XY
sy.x
n 2
In our example, the standard error of the regression
estimates is 0.847865. Here, the lower the value of the
standard error, greater the statistical reliability of the
regression estimates.
25
Charles Spearman’s Coefficient of Correlation
(Rank Correlation)
26
Charles Spearman’s Coefficient of Correlation
(Rank Correlation)
28
End of Chapter 6
29