Professional Documents
Culture Documents
Review Lecture 3 Descriptive Statistics, Part 2
Review Lecture 3 Descriptive Statistics, Part 2
Descriptive Statistics
Descriptive statistics
Describe or summarize a large set of data with a graph and/or just a few n
Online Review Course of Undergraduate Probability and Statistics Applies mainly to univariate data
The statistics that can be used depend on the measurement scale (nom
Review Lecture 3 Descriptive
Statistics, part 2
Chris A. Mack
Adjunct Associate Professor
Descriptive Statistics
40
How do we describe
35Lecture2&3 .xlsx , this data?
Data Set 2 Shape
30 Central tendency
>- Spread
25 Other measures could be important as wel I (e.g., skew)
C1J Aggregating data and looking only at summary statistics can hide important features of the data
:i 20
er
u.. 15
10
Maoss _,...,._,...,-
380 388 396 404 412 420 428 436
Below lOg (mg)
:hrisMock, 2014
Distribution Shape
N = 100
• Median:
- rank order the data, pick out the middle value (odd
number of data points) or the average of the two
middle values (even number of data points)
• Mode:
- Most frequent data value
• 10% Trimmed Mean:
- Rank order the data, throw away the 10% of data with
the largest values and the 10% of data with the
smallest values, then calculate mean of the rest
' r l!> Mac 2014
2
7/21/2014
Measures of Spread
Measures of Spread
• Range: max - min
- An extremely unstable metric! • Sample Variance (mean squared deviation from
the mean) : N
• Interquartile Range (IQR) :
-
-
rank order the data, divide it in half
pick out the 251h percentile value (lower quartile, 01)
s2 = N 1
i =1
L
(x; - x)2
- pick out the 75th percentile value (upper quartile, 03) • Standard deviation is the square root of the
- IOR = 03 - 01 N
variance
• Mean absolute deviation = ,L1 x; -x i • N-1 is the number of degrees of freedom (DoF)
, since the sample mean uses up one DoF
- This results in an unbiased estimator for the
variance
• Standard i =1
• For a population variance , use N rather than N-1
deviation 1 r Mack, 2014
Summary Statistics
The five-number summary: Box and Whiskers Plot
120 --
Display graphically (box & whiskers plot) and/or in table form Test a Test b Test c Test d
' rll> Mac 2014 This makes for very quick comparison between the four tests.
Practice
Using the two data sets in
Lecture2&3 .xlsx , practice using Excel to calculate a variety of statistics
Measures of central tendency
Measures of spread
Create a box & whiskers plot for data set 1 (or data set 2)
:hrisMock, 2014
3
7/21/2014
12