Professional Documents
Culture Documents
Descriptive Statistics - Presentation
Descriptive Statistics - Presentation
3
TABLE 1. Age Distribution within the Sample
17 4 6,7
18 22 36,7
19 20 33,3
20 8 13,3
21 6 10,0
4
TABLE 2. Age by Gender Distribution within the Sample (N = 60)
Females Males
(n = 30, 50 %) (n = 30, 50 %)
Age f % f %
17 0 0 4 13,3
18 16 53,3 6 20,0
19 6 20,0 14 46,7
20 5 16,7 3 10,0
21 3 10,0 3 10,0
5
TABLE 3. Grouped Frequency Distribution of Number of English MP3 Songs in
Cellphone (N = 60)
Number of
Songs Frequency Percentage
0 – 99 13 21,7
>900 3 5,0
6
TABLE 4. Grouped Frequency Distribution of Number of English MP3 Songs in Cellphone
by Gender (N = 60)
Females Males
(n = 30, 50 %) (n = 30, 50 %)
Number of f % f %
Songs
0 – 99 7 23,3 6 20
7
TABLE 5. Cumulative Frequency Distribution of Region (N = 60)
Number of Participants Percentage of
Region Participants Cumulative Percentage
8
TABLE 6. Cumulative Frequency Distribution of Region by Gender (N = 60)
Females Males
(n = 30, 50 %) (n = 30, 50 %)
Cumltv. Cumltv.
Region f % % f % %
9
A stem-and-leaf display is similar to a
grouped frequency distribution but it
contains no loss of information
In a stem-and-leaf display:
◦ first, the score intervals are set up on the left side
of a vertical line
◦ these intervals (stem) contain all but the last digit
of the scores falling into each interval
◦ then, to the right of the vertical line, the final digit
of each score in the interval is given (leaf)
10
TABLE 7. Stem-and-Leaf Display of Proficiency Scores
0 367
2 799
3 235
4 459
5 12233467799
6 1222223344677799
7 11222336
8 0134688
9 0122
10 0 0
11
In a histogram:
◦ vertical columns indicate how many times any given
score (or score intervals) appears in the data set
◦ the horizontal axis (x axis) is labeled with scores on
the dependent variable
◦ the vertical axis (y axis) is labeled with frequencies
◦ a tall bar indicates a high frequency of occurrence
◦ a short bar indicates a low frequency of occurrence
12
Figure 1. Distribution of Proficiency Scores in the Sample (N = 60)
13
A frequency polygon (line graph):
◦ is similar to a histogram
◦ has a horizontal axis labeled with individual scores
or score intervals
◦ has a vertical axis labeled with frequencies
◦ first a single dot is put for the frequency of each
score on the horizontal axis
◦ then the dots are connected with straight lines
14
Figure 2. Frequency Polygon for Age Distribution in the Sample (N = 60)
15
A bar graph is different from a histogram
16
Figure 3. Distribution of Participants According to Region (N = 60)
17
Figure 4. Pie Graph Percentage Distribution of Participants According to
Region (N = 60)
18
The mode (Mo) is the most frequently
occurring score in a data set.
19
Sample mode, median, mean calculation:
6 2 5 1 2 9 3 6 2
1 2 2 2 3 5 6 6 9
20
A measure of variability
◦ indicates the degree of dispersion among the
scores;
◦ indicates how spread out the scores are.
21
The range is the difference between the lowest and
highest scores.
◦ the numerical value that separates the scores into two equal
halves is the median (Q2).
16 38 43 19 6 45 47 41 26 8 51 31 12 61 46 67 14 17 44 19 37 32 19
6 8 12 14 16 17 19 19 19 26 31 32 37 38 41 43 44 45 46 47 51 61 67
Q1 Q2 Q3
24
With a box-and-whisker plot, the degree of variability within a
data set is summarized with a picture.
The positions of the top and bottom sides of the rectangle are
determined by Q3 and Q1.
If there are any scores further out than the whiskers, they are
considered to be outliers, and their positions are indicated by
small circles or asterisks.
25
26
27
28
The standard deviation ( SD, s, σ , ∓, sigma)
◦ is based on all the scores in a group of scores;
◦ is determined by
figuring how much each score deviates from the mean
and putting these deviation scores into a computational
formula;
29
N Minimum Maximum Mean Std. Dev iation Variance
AGE 60 17 21 18,83 1,076 1,158
PROFICIENCY SCORE 60 3 100 61,78 21,744 472,817
LLS USE SCORE 60 1,99 4,22 3,2400 ,43710 ,191
ANXIETY SCORE 60 1,08 5,96 3,3480 1,03192 1,065
NUMBER of MP3s 60 0 3245 306,58 464,637 215887,6
Valid N (listwise) 60
PROFICIENCY SCORE
St andard
Count Mean Dev iation
PROFICIENCY SCORE GENDER FEMALE 30 65 17
St andard MALE 30 58 25
Count Mean Dev iation
REGION MARMARA 16 63 21
AEGEAN 10 62 21
MEDITERRANEAN 9 61 16 PROFICIENCY SCORE
BLACK SEA 8 67 27 St andard
CENTRAL ANT. 7 68 25 Count Mean Dev iation
EASTERN ANT. 6 49 30 S. E.S. HIGH S.E. S. 9 61 28
SOUTHEASTERN ANT. 4 59 9 MID S.E.S. 34 61 22
LOW S.E.S. 17 64 20
30
Almost all the techniques used for describing
data describe features of the entire data set.
31
With z-scores, the mean is fixed at 0 and the
standard deviation is fixed at 1.
For example,
◦ a z-score of +3 indicates that that person’s score
was 3 standard deviations above the group mean;
◦ a z-score of -1.4 indicates that that person’s score
was 1.4 standard deviations below the group mean.
32
Some proficiency scores from
the Data Set 1 z-scores
(mean: 61,78; SD: 21,74)
67 + 0,23
92 + 1,38
44 - 0,81
63 + 0,05
3 - 2,70
81 + 0,88
62 + 0,009
100 + 1,75
33
34
35
36
37
38
39