Professional Documents
Culture Documents
15 Correlation
15 Correlation
• This formula has the drawback of requiring that each score be transformed
into a z score, and consequently, an equivalent formula that uses raw scores is
used more often to calculate the Pearson correlation coefficient.
• Writing this formula in terms of the sum of the squares gives us the
following correlation coefficient:
• To illustrate how we can use this formula to measure the distance that points
fall from the regression line, consider the example, plotted in Figure 15 .6, in
which we seek to determine the relationship between the number of months
that students attended college and the number of classes they missed.
• Notice that some data points fall on the line and others fall some distance
from the line. The correlation coefficient measures the variance in the
distance that data points fall from the regression line.
• The value in the numerator of the Pearson correlation coefficient reflects the
extent to which values on the x-axis (X) andy-axis ( Y) vary together.
• The extent to which the values of two factors vary together is called
covariance.
• The extent to which values of X and Y vary independently, or separately, is
placed in the denominator.
The formula for r can be stated as follows:
• The correlation coefficient, r, measures the variance of X and the variance of
Y, which constitutes the total variance that can be measured.
• The total variance is placed in the denominator of the formula for r.
• The covariance in the numerator is the amount or proportion of the total
variance that is shared by X and Y.
• The larger the covariance, the closer data points will fall to the regression line.
• When all data points for X and Y fall exactly on a regression line, the
covariance equals the total variance, making the formula for r equal to +1.0
or -1.0, depending on the direction of the relationship.
• The farther that data points fall from the regression line, the smaller the
covariance will be compared to the total variance in the denominator,
resulting in a value of r closer to 0.
We compute the Pearson correlation coefficient
for data measured on an interval or ratio scale
of measurement, following these steps:
• Step 1: Compute preliminary calculations.
• Step 2: Compute the Pearson correlation coefficient (r).
Sample Problem
• An area of research of particular interest is the relationship between mood and
appetite (Hammen & Keenan-Miller, 2013; Privitera, Antonelli, & Creary, 2013;
Wagner, Boswell, Kelley, & Heatherton, 2012). As an example of one such study
from this area of research, suppose a health psychologist tests if mood and eating
are related by recording data for each variable in a sample of 8 participants. She
measures mood using a 9-point rating scale in which higher ratings indicate better
mood. She measures eating as the average number of daily calories that each
participant consumed in the previous week. The results for this study are listed in
Figure 1 5.8.
• We will compute the Pearson correlation coefficient using these data.
Step 1 : Compute preliminary calculations.
• We begin by making the preliminary calculations. The signs (+ and -) of the
values we measure for each factor are essential to making accurate
computations. The goal is to find the sum of squares needed to complete the
formula for r.
Effect Size: The Coefficient of Determination