Professional Documents
Culture Documents
Chapter 3 - The Nature of Statistics
Chapter 3 - The Nature of Statistics
800000
700000
600000
500000
400000
300000
200000
100000
0
1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 2010 2011 2012 2013 2014 2015 2016 2017 2018
ITEMS -
Series3 TOTAL EXPORTS OF GOODS
Now try these!
Color Frequency
Blue 15
Green 25
Red 30
White 18
Yellow 12
• Arithmetic Mean
• The mean of a set of measurements is
the sum of a sample of measurements
divided by the number of data points.
If there are data points in a data set,
and the data points are represented as ,
then the mean may be computed as
Example: Arithmetic Mean
• Median
• The median of a set of data arranged
according to size (ascending or
descending) is the value of the middle
data point if the number of data points
is odd, and the mean of the two most
middle data points if the number of
data points is even.
Example: Median
• Mode
• The mode is simply the value in a data
set that occurs with the highest
frequency and more than once.
• It is possible that in a set of data, there
is no mode, or more than 1 mode
Example: Mode
Values Frequency
20 2
29 4
30 4
39 3
44 2
Example: Mean (Frequency Distribution Table)
20 2 40
29 4 116
30 4 120
39 3 117
44 2 88
15 481
Now try these!
• A teacher records scores on a 20-point quiz
for the 30 students in his class. The scores are:
19 20 18 18 17 18 19 17 20 18
20 16 20 15 17 12 18 19 18 19
17 20 18 16 15 18 20 5 0 0
(a) Construct a frequency distribution
table.
(b) Find the mean, median, and
mode.
Now try these!
• Coefficient of Variation
• The range is R = 17 – 8 = 9
• The variance is:
Hours, x ( 𝑥 − 𝑥) ( 𝑥 − 𝑥 )2
12 0 0
15 3 9
8 -4 16
• The standard 9 -3 9
14 2 4
deviation is 8
17
-4
5
16
25
14 2 4
8 -4 16
15 3 9
120 108
Sample: Measures of Variation
• Wind Speed
100 3
115 5
120 10
125 9
130 7
135 2
Now try this!
• The Internet was used to make a survey of the per
quart of synthetic motor oil for the high-performance
go-cart. The sample data, converted to pesos per
quart, were summarized in the following table.
Determine the variance and the standard deviation of
the price.
• Percentiles
• Assume that the elements in a data set are rank
ordered from the smallest to the largest. The values that
divide a rank-ordered set of elements into 100 equal parts
are called percentiles.
• An element having a percentile rank of Pi would have
a greater value than i percent of all the elements in the
set. Thus, the observation at the 50th percentile would be
denoted P50, and it would be greater than 50 percent of
the observations in the set. An observation at the 50th
percentile would correspond to the median value in the
set.
Measures of Relative Standing (Position)
• Quartiles
• Quartiles divide a rank-ordered data set into
four equal parts. The values that divide each
part are called the first, second, and third
quartiles; and they are denoted by Q1, Q2, and
Q3, respectively.
• Note the relationship between quartiles and
percentiles. Q1 corresponds to P25,
Q2 corresponds to P50, Q3 corresponds to P75.
Q2 is the median value in the set.
Measures of Relative Standing (Position)
• Standard Scores (z-Scores)
• A standard score (z-score) indicates how many standard deviations an element is
from the mean.
• A standard score can be calculated using the following formula.
where z is the z-score, X is the value of the element, μ is the mean of the population,
and σ is the standard deviation.
Here is how to interpret z-scores.
A z-score less than 0 represents an element less than the mean.
A z-score greater than 0 represents an element greater than the mean.
A z-score equal to 0 represents an element equal to the mean.
A z-score equal to 1 represents an element that is 1 standard deviation greater than
the mean; a z-score equal to 2, 2 standard deviations greater than the mean; etc.
A z-score equal to -1 represents an element that is 1 standard deviation less than
the mean; a z-score equal to -2, 2 standard deviations less than the mean; etc.
Measures of Relative Standing (Position)
X = (z)(σ) + 100
X = 18 + 100 = 118
Outliers
0
0 1 2 3 4
Height (m)
Negative Correlation Scatter Diagram
80
60
40
20
0
0 10 20 30 40 50 60
Codling moths trapped per acre
No Correlation Scatter Diagram
10
0
0 100 200 300
Pillbugs/Square Meter
Correlation Coefficient
𝑛 ∑ 𝑥𝑦 − ( ∑ 𝑥 )( ∑ 𝑦 )
𝑟=
√[ 𝑛( ∑ 𝑥 ) − (∑ 𝑥 ) ][ 𝑛 (∑ 𝑦 ) − (∑ 𝑦 ) ]
2 2 2 2
The area to the left of z = -2.43 is 0.0075 and the area to the
left of z =1.84 is 0.9671. To find the required area subtract the
two areas, giving
0.9671- 0.0075 = 0.9596.
Example: Applications
(a) what is the probability that the player weighs less than 225
pounds?
(b) what is the probability that the player weighs more than 225
pounds?