Professional Documents
Culture Documents
Correlation
Correlation
Correlation
Objectives: At the end of the lesson, the students are expected to:
- discuss correlation;
- discuss the relationship between two Parametric Data set using Correlation
coefficient;
- Test the significance of the relationship.
Lesson Proper
Test of correlation is one of the vital tools in Statistics that describes the relationship
between two data sets.
The most common tool for correlation is devised and named after Karl Pearson which known as
Pearson’s r.
nΣxy−ΣxΣy
r=
√¿¿¿
The Pearson product-moment correlation coefficient (or Pearson correlation coefficient, for
short) is a measure of the strength of a linear association between two variables and is denoted
by r. Basically, a Pearson product-moment correlation attempts to draw a line of best fit through
the data of two variables, and the Pearson correlation coefficient, r, indicates how far away all these
data points are to this line of best fit (how well the data points fit this new model/line of best fit)
Example:
Find the relationship exists between the scores of 10 students in English and in
Mathematics examination
Studen
t English (X) Math (Y) X2 Y2 XY
1 40 45 1600 2025 1800
2 45 45 2025 2025 2025
3 43 42 1849 1764 1806
4 41 40 1681 1600 1640
5 32 25 1024 625 800
6 35 26 1225 676 910
7 37 39 1369 1521 1443
8 38 36 1444 1296 1368
9 35 35 1225 1225 1225
10 41 34 1681 1156 1394
Total 387 367 15123 13913 14411
nΣxy−ΣxΣy
r=
√¿¿¿
2081
r=
√ [1461][4441]
2081
r=
2547.214
r =0.817
Now use the line below to determine how strong is the relationship between x and y.
-1.0 0 +1.0
Strong Negative No Relationship Strong Negative
Relationship Relationship
“If the r-value is positive, it implies that as x increases, y also increases. Conversely, as x decreases, y also
decreases.”
“If the r-value is negative, it implies that as x variable increases, y decreases. Conversely, as x decreases, y
increases.”
In the above example, the r-value of 0.817 tells that those students who got high
scores in English exam also got high scores in Mathematics.
The ratio of the explained variation to the total variation is called the coefficient of
determination. If there is zero explained variation (i.e., the total variation is all
unexplained), this ratio is 0. If there is zero unexplained variation (i.e., the total variation is
all explained), the ratio is 1. In other cases the ratio lies between 0 and 1.
The coefficient of determination (COD) in the given example is
COD = r2×100
COD = (0.817)2×100
COD = .6675 ×100
COD = 66.75%
0.817 √ 10−2
t=
√(1−0.6675)
t=4.007
To test the significance of relationship, one may follow the hypothesis testing procedure.
2. Set the appropriate alpha level of significance and find the critical/tabular value
a. At 5% alpha level of significance with 8 df, the tabular value of t is 2.306. (Take
note that if the computed value is greater than the tabular value, then the researcher has to reject
the null hypothesis.)
As computed previously, t-value is 4.007 which is greater than the critical value of 2.306.
a. Since the t-computed is greater than the critical value, the decision is to reject
the null hypothesis
a. The finding implies that English scores and Mathematics scores of students are
highly significantly correlated which means that those who are good in
comprehension, grammar and communication are also good in terms of
manipulative skills.
Score:
Activity
CORRELATION
_________
B. Problem solving
Given the data set below, determine if there is a significant relationship between
students’ age and their academic performance in Mathematics. (Please follow the
hypothesis testing procedure)