Professional Documents
Culture Documents
Cac Chi So Mean, Median
Cac Chi So Mean, Median
Cac Chi So Mean, Median
Original scores: 1, 2, 3, 5, 4, 4, 2, 3, 1, 3, 2, 3, 2, 2
Frequency table:
1 was the lowest reported
score. Two people had a
2 was the most commonly
score of 1.
reported score. Five
people had a score of 2.
Section A-1
Simple Frequency
•Ratio
= = = = 0.24 (people)
X 100
Ratio
Simple
Proportion
frequencies
Percentage
Relative frequency =
Relative frequency =
80 5 5 : 60 0.08
70 15 15 : 60 0.25
60 20 20 : 60 0.34
50 15 15 : 60 0.25
40 5 5 : 60 0.08
N = 60
RATE
• Rate is used to show how often an event happens compared
with how often it might happen.
• For example; we might use rate to show the number of
people who do learn a language compared with the number
of people who might learn the language.
- There are more ESL speakers who are in the age group 25-44
- However, there is a greater proportion of ESL speakers in the 6-16
age group.
For example:
o Your exam score is X = 43
o 60% of the class had scores of 43 or lower
o Your score has a percentile rank of 60% and is called the
60th
percentile.
Example: student placement in English classes at the university
Placement
f F Percentile
repuirement
= 74th
=> 74% of the students who took the test scored at or lower than that level
Score Frequency Relative Cumulative Percentile
(f) Frequency Frequency (F)
80 5 0.08 60 100
70 15 0.25 55 92
60 20 0.34 40 67
50 15 0.25 20 33
40 5 0.08 5 8
Relative Frequency =
Percentile = (100)
Section A-2
Table forms
Features
• One half of the curve looks like the other half (symrnetric distribution)
• Most of the scores fall in the middle of this curve, the high part.
• There is a nice gradual slope of scores below and scores
above that one midpoint.
The Shape of a Frequency
Distribution
Skewed
Distributions
Positively Skewed Negatively Skewed
• Scores tend to pile up on the • Scores tend to pile up on the
left side right side
• Tail “points” to the right” • Tail “points” to the left
• The “skew” is on the • The “skew” is on the
“positive” side of the curve “negative” side of the curve
Name That Distribution!
Pretest Posttest
It tells you:
At the time of the pretest there appears to be no difference between
the groups.
At the time of the posttest (after the treatment). there appears to be
a large difference between the groups (your treatment worked).
The difference is in favor of you r experimental group, not the
control. Thus, the most typical score is both useful and crucial to
your research.
Descriptive statistics
• Only with ordinal and interval variables
MEAN
The formula for obtaining the mean is
MEDIAN
• The median score is also easy to find. Arrange your scores
in rank order. The median is the score which is at the center
of the distribution.
• Half of the scores are above the median and half are
below.
If the number of scores is odd, the median is the middle
score: 4 4 5 7 9 10 11.
If the number of scores is even, use the midpoint between
the two middle scores as the median:
4 5 7 9- 10 12 (7 + 9 2 = 8)
Ex: There were 20 items and the scores you obtained were:
16 10 5 6 8 15 20 14 16 10
MODE
• The mode is the most frequently obtained score in the
data. For example, in the following data the mode is
25:
20 22 23 23 25 25 25 25 27 29 30
In bimodal · distributions there.are two values which are
obtained most often, e.g.:
2 3 4 4 4 4 5 7 7 9
10 10 10 10 12 12 13 15
This distribution has two modes, 4 and 10.
The mode is the easiest measure of central tendency to
identify
MEASURES OF VARIABILITY
• Just as there are three ways of talking about the
most typical score in your data, there are three major
ways, too, to show how the data are spread out from
that point the range, the standard deviation, and
variance.
Range
• The easiest, most informal way to talk about the
spread of the distribution of scores is the range.
• Arrange the scores from the highest to the lowest.
• Subtract the lowest score from the highest score.
95%
Confidence
Interval of the
Difference
Mean -6.0
Std. Deviation 1.188
Paired differences Std. Error Mean .266
95% Confidence Interval of Lower -1.16
the Difference Upper - .04
t -2.259
Df 19
Sig. (2-tailed) .036
Conclusion
As Sig. (2-tailed) = .036<0.05, there is significant
difference between pre-test and post-test
scores. Specifically, the post-test scores are 6%
higher than pre-test scores.
Analysis of Variance (ANOVA)
ANOVA: Compare ONE value among 3 or above
subjects. Ex. Compare HAPPINESS among people
aged 10-15, 15-20, 20-25, etc.
ONE WAY ANOVA
1= white people
2= black people
3= other people
Descriptive
95% Confidence
Interval for Mean
Lower Upper
Std. Bound Bound
N Mean Deviation Std. Error Minimum Maximum
1 1256 1.77 .604 .017 1.73 1.80 1 3
Sum of Mean F
Squares df Square Sig.
Between 7.680 2 3.840 10.225 .000
Groups
1 2
race N
1
1256 1.77
3
47 1.94
2
201 1.97
Sig.
1.000 .725
Conclusion
• Between white and black people, there is no
difference in HAPPINESS since Sig= (.725) >
0.05 (the significance).
Chi-square test
• Used to indicate whether there is a relationship
between VARIABLES to ONE VARIABLE.
• Variables: TWO NOMINAL VARIABLES or ONE
NOMINAL VARIABLE and ONE ORDINAL
VARIABLE
• Ex. To indicate whether there is a relationship
between MALE AND FEMALE to EDUCATIONAL
LEVEL.
Sex race Crosstabulation
race Total
High University Post-
school graduate graduate
sex Male Count 545 71 20 636
Expected 529.9 85.5 20.5 636.0
count
Residual 15.1 -14.5 -5
Female Count 719 133 29 881
Expected 734.1 188.5 28.5 881.0
count
Residual -15.1 14.5 .5
Total Count 1264 204 49 1517
Expected 1264.0 204.0 49.0 1517.0
count
Chi-square test
• Those figures in RED are actual values.
Chi-square test
Value Df Asymp. Sig. (2-sided)