Download as docx, pdf, or txt
Download as docx, pdf, or txt
You are on page 1of 4

Correlation and Regression

-correlation one of the easiest things


- Regression is just reading tables but a little harder
- 0-0.3 is weak
- 0.3-0.8 is moderate
- 0.8-1 is strong

Using data to identify patterns and Make Predictions


Correlation (use the words related and relationship)
- Identify a relationship between two variables
- Generally interval/ratio
- Correlation coefficient ranges from -1.0-+1/0
- Sign indicates direction of slope
- Positive correlation (0< r is less than or equal to +1)
- Negative correlation (-1.0 is less than or equal to r < 0)
- Zero correlation (r=0) indicates NO linear relationship
- Closer to +/- 1 indicates stronger relationship
- Finding nothing in correlation means nothing in regression as well
- Pearson’s Correlation Coefficient (Done in SPSS)
- Measures strength and direction of linear relationship between variables
(interval or ratio level)

- EXAMPLE: Does mood (specifically depression) impact the number of calories


consumed?
- Mood index: Higher sores= more depressed (1-10 scale)
- Mood is interval, Calories are ratio
- Alternative: There is a relationship between calories consumed and the
mood
- Null: There is not a relationship between calories consumed and the
mood
- 1. Analyze
- 2. Correlate
- 3. Bivariate (2 variables)
- 4. Move to the right -> Mood score and calorie consumed
- 5. Okay
- Yes it is significant because it is less than 0.05, so it does have
correlation
- Pearson's r= -0.932 so it does have correlation because it is very close
to -1
- Strong negative relationship
- There is a negative relationship because it is -0.932
- The strength is a strong relationship because it is -0.932
- 0.3 is moderate relationship, anything below is weak
- Reject the Null hypothesis because there is a significant relationship
between mood and calories consumed

- Proportion of Variance (coefficient of determination)


- Square the r value
- (-0.932)^2 = 0.87
- 87% of the variability in calories consumed can be explained by a
participant's mood rating.
- Degrees of freedom= n-2
- n is sample size
- 20-2= 18
- **** Give Pearson’s R and P value
-
Homoscedasticity
- Homo- equal
- Scedasticity- scatter

Linearity
- The relationship has to be linear

Linear Regression
- Predict values of one factor, given known values of a second factor in the population l
- Does give you a correlation
- Generally interval/ratio level data
- “Line of best fit”
- Y = bx + a
- A= y-intercept
- b= slope
- If slope, b is (-) then correlation is negative
- r squared has special meaning- it represents the proportion of variance in Y (DV) and X
(IV)
- Establishes how X causes Y to change
- EXAMPLE:
- Correlation with Therapy sessions and number symptoms
- r = -0.731
- This is significant because it is less than 0.05.
- Moderately strong negative relationship
- Negative means one is increasing, one is decreasing
1. Analyze
2. Regression
3. Linear
Dependent variable is number of symptoms
independent variable is therapy sessions (therapy impacts number of symptoms)
- Regression equation y= 4.314- 0.46x
- 0.46= slope
- 4.31= b
- x= 4 because of 4 therapy sessions (got 4 from the question not SPSS)
- Null (H0): the variance in the number of symptoms expressed (Y) is not related
to changes in the number of therapy sessions attended (X)
- Alternative (H1): the variance in the number of symptoms expressed (Y) is
related to changes in the number of therapy sessions attended (X)
- Set criteria
- Level of significance = 0.05
- Df (regression variation/ numerator)= each source of variance (# of
variables predicting variance) = 1
- Df (sample size- 2) 20-2= 18

Effect size look at r squared not r

April 19th, 2022


Sleepiness score and Multiple IV’s
Regression
1. Analyze
2. Regression
3. Linear
4. Dependent= sleepy
5. Independent= weight rate, smoke, satisfaction with sleep, stop breathing during
sleep, stress
6. Ok
7. We have mode summary and anova and all coefficients
8. Reject the null bc it is significant

When interpreting beta


- For every one unit increase in IV, the outcome variable will increase or decrease
by whatever the beta is.

Chi Square Analysis


1. Analyze
2. Descriptive statistics
3. Crosstabs
4. Looking at two variables only (Gender and smoking) you must his stats and then
Chi-squared is CHOSEN!!!!!!!!!!!

Phi and Cramers V

You might also like