Professional Documents
Culture Documents
Chapter 3 - Numberical Summary
Chapter 3 - Numberical Summary
JOHN S. LOUCKS
St. Edward’s University
Chapter 3
Descriptive Statistics: Numerical Methods
Measures of Location Measures of Relative Location
• Mean and Detecting Outliers
• Median • z-Scores
• Mode • Chebyshev’s Theorem
• Percentiles • Empirical Rule
• Quartiles • Detecting Outliers
Measures of Variability Exploratory Data Analysis
• Range • Box plot
• Interquartile Range Measures of Association
• Variance Between Two Variables
• Standard Deviation The Weighted Mean and
• Coefficient of Variation (risk Working with Grouped Data
to reward ratio)
Slide 2
1
Measures of Location
Mean
Median
Mode
Percentiles
Quartiles
Slide 3
Slide 4
2
Mean
Slide 5
Mean
Slide 6
3
Median
Slide 7
Median
Slide 8
4
Example: Apartment Rents
Median
Median = 50th percentile
i = (p/100)n = (50/100)70 = 35.5
Averaging the 35th and 36th data values:
Median = (475 + 475)/2 = 475
Slide 9
Mode
Slide 10
10
5
Example: Apartment Rents
Mode
450 occurred most frequently (7 times)
Mode = 450
Slide 11
11
Percentiles
Slide 12
12
6
Percentiles
The pth percentile of a data set is a value such that at
least p percent of the items take on this value or less
and at least (100 - p) percent of the items take on this
value or more.
• Arrange the data in ascending order.
• Compute index i, the position of the pth percentile.
i = (p/100)n
• If i is not an integer, round up. The pth percentile is
the value in the ith position.
• If i is an integer, the pth percentile is the average of
the values in positions i and i+1.
Slide 13
13
90th Percentile
i = (p/100)n = (90/100)70 = 63
Averaging the 63rd and 64th data values:
90th Percentile = (580 + 590)/2 = 585
Slide 14
14
7
Quartiles
Slide 15
15
Third Quartile
Third quartile = 75th percentile
i = (p/100)n = (75/100)70 = 52.5 = 53
Third quartile = 525
Slide 16
16
8
Measures of Variability
Slide 17
17
Measures of Variability
Range
Interquartile Range
Variance
Standard Deviation
Coefficient of Variation
Slide 18
18
9
Range
Slide 19
19
Range
Range = largest value - smallest value
Range = 615 - 425 = 190
Slide 20
20
10
Interquartile Range
Slide 21
21
Interquartile Range
3rd Quartile (Q3) = 525
1st Quartile (Q1) = 445
Interquartile Range = Q3 - Q1 = 525 - 445 = 80
Slide 22
22
11
Variance
Slide 23
23
Variance
Slide 24
24
12
Standard Deviation
Slide 25
25
Coefficient of Variation
Slide 26
26
13
Example: Apartment Rents
Variance
Standard Deviation
Coefficient of Variation
Slide 27
27
Slide 28
28
14
z-Scores
Slide 29
29
Slide 30
30
15
Chebyshev’s Theorem
Slide 31
31
Chebyshev’s Theorem
Slide 32
32
16
Example: Apartment Rents
Slide 33
33
Empirical Rule
Slide 34
34
17
Empirical Rule
Slide 35
35
Empirical Rule
Slide 36
36
18
Example: Apartment Rents
Empirical Rule
Interval % in Interval
Within +/- 1s 436.06 to 545.54 48/70 = 69%
Within +/- 2s 381.32 to 600.28 68/70 = 97%
Within +/- 3s 326.58 to 655.02 70/70 = 100%
Slide 37
37
Detecting Outliers
Slide 38
38
19
Example: Apartment Rents
Detecting Outliers
The most extreme z-scores are -1.20 and 2.27.
Using |z| > 3 as the criterion for an outlier,
there are no outliers in this data set.
Standardized Values for Apartment Rents
Slide 39
39
Five-Number Summary
Box Plot
Slide 40
40
20
Five-Number Summary
Smallest Value
First Quartile
Median
Third Quartile
Largest Value
Slide 41
41
Five-Number Summary
Lowest Value = 425 First Quartile = 450
Median = 475
Third Quartile = 525 Largest Value = 615
Slide 42
42
21
Box Plot
Slide 43
43
Slide 44
44
22
Example: Apartment Rents
Box Plot
Slide 45
45
Measures of Association
Between Two Variables
Covariance
Correlation Coefficient
Slide 46
46
23
Covariance
Slide 47
47
Covariance
Slide 48
48
24
Correlation Coefficient
Slide 49
49
Slide 50
50
25
Weighted Mean
Slide 51
51
Weighted Mean
x = wi xi
wi
where:
xi = value of observation i
wi = weight for observation i
Slide 52
52
26
Grouped Data
Slide 53
53
Sample Data
Population Data
where:
fi = frequency of class i
Mi = midpoint of class i
Slide 54
54
27
Example: Apartment Rents
Slide 55
55
This approximation
differs by $2.41 from
the actual sample
mean of $490.80.
Slide 56
56
28
Variance for Grouped Data
Sample Data
Population Data
Slide 57
57
Slide 58
58
29
Supplementary Problems for Chapter Three
Slide 59
59
Slide 60
60
30
End of Chapter 3
Slide 61
61
31