Professional Documents
Culture Documents
NOTES
NOTES
INFERENTIAL STATISTICS
WHAT TO CONSIDER
- Problem
- Nature of the data
Shape of the distribution (normal or skewed)
Level of measurement (nominal, ordinal, interval, and ratio)
- The number of variables involved
Univariate (1 variable)
Bivariate ( variables)
Multivariate (3 or more variables)
PURPOSE: COMPARISON
EX. IF YOU WANT TO DETERMINE IF THERE IS A SIGNIFICANT DIFFERENCE BETWEEN 3 SAMPLES AND IT
IS NORMALL DISTRIBUTED AND ATLEAST INTERVAL--- POSSIBLE YUNG T-TEST FOR INDEPENDENT
SAMPLES PERO THRICE GAGAMITIN PERO PWEDE RIN ONE-WAY ANOVA LANG—IF THERE’S
SGINIFICANT DIFFERENCE-POST HOC
STATISTICAL ASSUMPTIONS
Assumptions
- Requirements of inferential test
- Directions for appropriate use of a product
- Explain when it is and isn’t reasonable to perform a specific statistical test
Parametric Test
- T-test, ANOVA
- Designed for data that have certain characteristics, including approximately normal distributions
- Check for the nature of data—is it at least interval data?
- More assumptions here compared to non-parametric
Non Parametric Test
- Often used when the assumptions of a parametric test are violated
- For categorical
COMMON ASSUMPTIONS
HOMOGENEITY OF VARIANCES
- Is it equal or not equal?—Critical assumptions; Levene’s Test (not equal-critical assumption
because it affects result)
NORMALITY
- A symmetrical bell-shaped or normal curve
INDEPENDENCE OF OBSERVATIONS
- Assumptions that there is no relationship between the scores for one person and those of
another person—affected by using non-random sampling like snowball sampling---it can be that
one individual recommends with same characteristics as him/her; selecting a class homogenous,
cut off grades
LINEARITY
- Two variables are related in a linear function; directly correlated
DETERMINING THE SHAPE OF DISTRIBUTION
Considerations in determining the appropriate test to use in the data
Research Problem
- Descriptive
- Inferential problem
Shape of distribution
- Normal/skewed
Number of variables
- Uni/bi/multi-variate
Shape of distribution
- If the skewness is between -0.5 and 0.5, the data are fairly symmetrical
- If the skewness is between -1 and -0.5 or between 0.5 and 1, the data are moderately skewed.
- If the skewness is less than -1 or greater than 1, the data are highly skewed
Kolmogorov-Smirnov Test (K-S test)
- Compares your data with a known distribution and lets you know if they have the same
distribution
- It Is commonly used as a test for normality to see if your data is normally distributed. It’s also
used to check the assumption of normality in Analysis of Variance
Shapiro-Wilk Test
- Is a way to tell if a random sample comes from a normal distribution. The test gives you a W
value; small values indicate your sample is not normally distributed (you can reject the null
hypothesis that your population is normally distributed if your values are under a certain
threshold)
USE KOLMOGOROV SMIRNOV TEST TO TEST NORMALITY OF THE DATA
Problem: Is there a significant difference between the mean content of colas given to the pupils during
the outreach activity and its label (8oz)?
Objective: to compare the mean content of colas with the content claimed by the Cola’s company
Ho: There is no significant difference between the mean content of colas given to the pupils during the
outreach activity and its label (8oz).
Ha: There is significant difference between the mean content of colas given to the pupils during the
outreach activity and its label (8oz).
With the preliminary test, the appropriate test for the data is a one-sample t-test.
- In case the data is below 0.01, try to regenerate the result by trying 0.01 as the level of
significance.
Step 5: Decision
Ex. Reject the null HO. There is significant difference between the mean content of colas given to the
pupils during the outreach activity and its label.
Step 6: Interpretation
Table 1 displays the one-sample t-test result on the significant difference between the mean content of
colas given to the pupils during the outreach activity and its label
As gleaned in the table 1, the mean content of colas given during the outreach program was 7.47 oz. This
mean content was lower than the content labeled in the cola’s bottle. Moreover, the one-sample t-test
result shows that there was a significant difference between the sample mean content and the 8oz cola’s
label. This indicates that the colas given to the pupils were significantly lower than 8oz. This further
means that the colas given did not pass the quality control of the Cola’s company. Thus, the colas were
donated instead of bringing back to the company.
STEP 5: REJECT HO
STEP 6:
T-TEST FOR INDEPENDENT SAMPLES
(KATULOY SA 2: BY KS, p=.58023 and p=.52666 male and female groups respectively)
(katuloy sa 3- f is 0.0029, p=.957184. with the preliminary test, the appropriate test for the data s a t-test
for independent samples)
Step 4:
One-way ANOVA