Professional Documents
Culture Documents
Data Descriptors
Data Descriptors
PRESENTATION
WWW.CESARRITZCOLLEGES.EDU
Data Descriptors
Central Dispersion
Tendency Shape
or Variation
Range
Mean Skewness
Interquartile
Range
Mode
Kurtosis
Variance
Standard
Median Deviation
Central tendency
• Mean x: Average (50th percentile)
• Percentiles: a value on a scale of 100 that
indicates the percent of a distribution that is
equal or below it.
• Median: Middle number (for uneven
number, average of two numbers around
middle for even)
• Mode: Number occurring the most (can be
more than 1 or none)
Dispersion or variation:
• Range: Max – min. Measure of dispersion
of the observations
• IQR: Q3 – Q1. Ignores extremes and
focuses on the middle 50% of the data.
Good measure of spread for skewed
distributions
• Standard deviation σ: how closely is data
grouped about the mean
• Variance: The square of SD
Standard Deviation
- Mean +
• Working on both sides of the mean
• The positives and negatives will cancel each
other out.
• Therefor use the mean of the squared
values and then get the square root
Steps
Total: ….
Standard deviation:
Standard deviation for frequency
data
Example with frequency data
Number of returns (x) 0 3 4 5 6 7 8 9 10 11 12 13 14
• Pg 44
Number of days (f) 2 7 9 12 24 40 75 98 56 25 14 2 1
• X = 8.40
x f X–X (x – x)2 f(x – x)2
• Total =
• Total / n =
• =
Shape
Shape
• Skewness: the measure of the degree of
asymmetry of a distribution
– Reporting the median along with Mean in
skewed distributions is a good idea.
– Skewness = zero. Indicates symmetry, normal
distribution
– Positive value indicates scores bunched up on
low en of score scale
– Negative value: scores bunched up on high end
of scale
– Skewness > ± 1.36 suggest severe skewness
Measure of skewness