Professional Documents
Culture Documents
Business Statistics (BUSA 3101) Dr. Lari H. Arjomand Lariarjomand@clayton - Edu
Business Statistics (BUSA 3101) Dr. Lari H. Arjomand Lariarjomand@clayton - Edu
1
Slide
Slide
Business Statistics (BUSA 3101)
Business Statistics (BUSA 3101)
Dr.
Dr.
Lari
Lari
H.
H.
Arjomand
Arjomand
lariarjomand@clayton.edu
lariarjomand@clayton.edu
2
2
Slide
Slide
Chapter 4 (Part A)
Chapter 4 (Part A)
Descriptive Statistics: Numerical Measures
Descriptive Statistics: Numerical Measures
Measures of Location
Measures of Location
Measures of Variability
Measures of Variability
Numerical Data
Properties
Mean
Median
Mode
Midrange
Midhinge
Central
Tendency
Range
Interquartile
Range
Variance
Standard Deviation
Coeff. of Variation
Variation
Skew
Kurtosis
Shape
Numerical Data
Properties
Mean
Median
Mode
Midrange
Midhinge
Central
Tendency
Range
Interquartile
Range
Variance
Standard Deviation
Coeff. of Variation
Variation
Skew
Kurtosis
Shape
3
3
Slide
Slide
Measures of Location
Measures of Location
If the measures are computed
If the measures are computed
for data from a sample,
for data from a sample,
they are called
they are called
sample statistics
sample statistics
.
.
If the measures are computed
If the measures are computed
for data from a population,
for data from a population,
they are called
they are called
population parameters
population parameters
.
.
A sample statistic is referred to
A sample statistic is referred to
as the
as the
point estimator
point estimator
of the
of the
corresponding population parameter.
corresponding population parameter.
For example,
For example,
the
the
sample mean is a
sample mean is a
point estimator of the population mean.
point estimator of the population mean.
Mean
Mean
Median
Median
Mode
Mode
Percentiles
Percentiles
Quartiles
Quartiles
4
4
Slide
Slide
Mean
Mean
The
The
mean
mean
of a data set is the average of all the data
of a data set is the average of all the data
values.
values.
As we said, the sample mean is the point estimator
As we said, the sample mean is the point estimator
of the population mean
of the population mean
.
.
x
x
5
5
Slide
Slide
Sample Mean
Sample Mean
x
x
Number of
observations
in the sample
Number of
Number of
observations
observations
in the sample
in the sample
Sum of the values
of the n
observations
Sum of the values
Sum of the values
of the
of the
n
n
observations
observations
i
x
x
n
=
i
x
x
n
=
6
6
Slide
Slide
Population Mean
Population Mean
Number of
observations in
the population
Number of
Number of
observations in
observations in
the population
the population
Sum of the values
of the N
observations
Sum of the values
Sum of the values
of the
of the
N
N
observations
observations
i
x
N
=
i
x
N
=
7
7
Slide
Slide
Seventy efficiency apartments
Seventy efficiency apartments
were randomly sampled in
were randomly sampled in
a small college town. The
a small college town. The
monthly rent prices for
monthly rent prices for
these apartments are listed
these apartments are listed
in ascending order on the next slide.
in ascending order on the next slide.
Sample Mean
Sample Mean
Example:
Example:
Apartment Rents
Apartment Rents
8
8
Slide
Slide
425 430 430 435 435 435 435 435 440 440
440 440 440 445 445 445 445 445 450 450
450 450 450 450 450 460 460 460 465 465
465 470 470 472 475 475 475 480 480 480
480 485 490 490 490 500 500 500 500 510
510 515 525 525 525 535 549 550 570 570
575 575 580 590 600 600 600 600 615 615
Sample Mean
Sample Mean
Example Continued
Example Continued
Monthly Rent for 70 Apartments
Monthly Rent for 70 Apartments
9
9
Slide
Slide
34, 356
490.80
70
i
x
x
n
= = =
34, 356
490.80
70
i
x
x
n
= = =
425 430 430 435 435 435 435 435 440 440
440 440 440 445 445 445 445 445 450 450
450 450 450 450 450 460 460 460 465 465
465 470 470 472 475 475 475 480 480 480
480 485 490 490 490 500 500 500 500 510
510 515 525 525 525 535 549 550 570 570
575 575 580 590 600 600 600 600 615 615
Sample Mean
Sample Mean
Example Continued
Example Continued
Monthly Rent for 70 Apartments
Monthly Rent for 70 Apartments
10
10
Slide
Slide
1
1
-
-
Every set of interval
Every set of interval
-
-
level and ratio
level and ratio
-
-
level data has a
level data has a
mean.
mean.
2
2
-
-
All the values are included in computing the mean.
All the values are included in computing the mean.
3
3
-
-
A set of data has a unique mean.
A set of data has a unique mean.
4
4
-
-
The mean is affected by unusually large or small data
The mean is affected by unusually large or small data
values.
values.
5
5
-
-
The arithmetic mean is the only measure of central
The arithmetic mean is the only measure of central
tendency where the
tendency where the
sum of the deviations of each value
sum of the deviations of each value
from the mean is zero.
from the mean is zero.
Properties of the Arithmetic Mean
Properties of the Arithmetic Mean
Properties of the Arithmetic Mean
( ) X X =
0
See next
Slide for
An example
11
11
Slide
Slide
Illustration of Item
Number 5
on Previous Slide
Illustration of Item
Illustration of Item
Number
Number
5
5
on Previous Slide
on Previous Slide
Consider the set of values: 3, 8, and 4. The
Consider the set of values: 3, 8, and 4. The
mean
mean
is 5.
is 5.
So (3
So (3
-
-
5) + (8
5) + (8
-
-
5) + (4
5) + (4
-
-
5) =
5) =
-
-
2 + 3
2 + 3
-
-
1 = 0.
1 = 0.
Symbolically we write:
Symbolically we write:
( ) X X =
0
12
12
Slide
Slide
Median
Median
The
The
median
median
of a data set is the value in the middle
of a data set is the value in the middle
when the data items are arranged in ascending order
when the data items are arranged in ascending order
.
.
Positioning Point =
+ n 1
2
Positioning Point =
+ n 1
2
13
13
Slide
Slide
Median
Median
12
12
14
14
19
19
26
26
27
27
18
18 27
27
For an
For an
odd number
odd number
of observations:
of observations:
in ascending order
in ascending order
26
26
18
18
27
27
12
12
14
14
27
27
19
19
7 observations
7 observations
the median is the middle value.
the median is the middle value.
Median = 19
Median = 19
14
14
Slide
Slide
12
12
14
14
19
19
26
26
27
27
18
18 27
27
Median
Median
For an
For an
even number
even number
of observations:
of observations:
in ascending order
in ascending order
26
26
18
18
27
27
12
12
14
14
27
27
30
30
8 observations
8 observations
the median is the average of the middle two values.
the median is the average of the middle two values.
Median = (19 + 26)/2 = 22.5
Median = (19 + 26)/2 = 22.5
19
19
30
30
15
15
Slide
Slide
Median:
Median:
Example
Example
425 430 430 435 435 435 435 435 440 440
440 440 440 445 445 445 445 445 450 450
450 450 450 450 450 460 460 460 465 465
465 470 470 472 475 475 475 480 480 480
480 485 490 490 490 500 500 500 500 510
510 515 525 525 525 535 549 550 570 570
575 575 580 590 600 600 600 600 615 615
Averaging the 35th and 36th data values:
Averaging the 35th and 36th data values:
Median = (475 + 475)/2 = 475
Median = (475 + 475)/2 = 475
Monthly Rent for 70 Apartments
Monthly Rent for 70 Apartments
16
16
Slide
Slide
Mode
Mode
The
The
mode
mode
of a data set is the value that occurs with
of a data set is the value that occurs with
greatest frequency
greatest frequency
.
.
If the data have more than two modes, the data are
If the data have more than two modes, the data are
multimodal
multimodal
.
.
17
17
Slide
Slide
Mode:
Mode:
Example
Example
425 430 430 435 435 435 435 435 440 440
440 440 440 445 445 445 445 445 450 450
450 450 450 450 450 460 460 460 465 465
465 470 470 472 475 475 475 480 480 480
480 485 490 490 490 500 500 500 500 510
510 515 525 525 525 535 549 550 570 570
575 575 580 590 600 600 600 600 615 615
450 occurred most frequently (7 times)
450 occurred most frequently (7 times)
Mode = 450
Mode = 450
Monthly Rent for 70 Apartments
Monthly Rent for 70 Apartments
18
18
Slide
Slide
Mode: Another Example
Mode:
Mode:
Another Example
Another Example
No Mode
No Mode
Raw Data:
Raw Data:
10.3
10.3
4.9
4.9
8.9
8.9
11.7
11.7
6.3
6.3
7.7
7.7
One Mode
One Mode
Raw Data:
Raw Data:
6.0
6.0
4.9
4.9
6.0 8.9
6.0 8.9
6.3
6.3
4.9
4.9
4.9
4.9
More Than 1 Mode
More Than 1 Mode
Raw Data:
Raw Data:
21
21
28
28
28
28
41
41
43
43
43
43
19
19
Slide
Slide
Use Excel to Compute
Use Excel to Compute
the Mean, Median, and Mode
the Mean, Median, and Mode
of the Following Data and Explain the Answers:
of the Following Data and Explain the Answers:
425 430 430 435 435 435 435 435 440 440
440 440 440 445 445 445 445 445 450 450
450 450 450 450 450 460 460 460 465 465
465 470 470 472 475 475 475 480 480 480
480 485 490 490 490 500 500 500 500 510
510 515 525 525 525 535 549 550 570 570
575 575 580 590 600 600 600 600 615 615
STUDENTS
20
20
Slide
Slide
Percentiles
Percentiles
A percentile
A percentile
provides information about how the
provides information about how the
data are spread over the interval
data are spread over the interval
from the smallest
from the smallest
value to the largest value.
value to the largest value.
At least 80%
At least 80%
of the items
of the items
take on a value
take on a value
of 542 or less.
of 542 or less.
At least 20%
At least 20%
of the items
of the items
take on a value
take on a value
of 542 or more.
of 542 or more.
56/70 = .8 or 80%
56/70 = .8 or 80%
14/70 = .2 or 20%
14/70 = .2 or 20%
425 430 430 435 435 435 435 435 440 440
440 440 440 445 445 445 445 445 450 450
450 450 450 450 450 460 460 460 465 465
465 470 470 472 475 475 475 480 480 480
480 485 490 490 490 500 500 500 500 510
510 515 525 525 525 535 549 550 570 570
575 575 580 590 600 600 600 600 615 615
25
25
Slide
Slide
A B C D E
1
Apart-
ment
Monthl y
Rent ($) 80
th
Percenti l e
2 1 525 =PERCENTILE(B2:B71,.8)
3 2 440
4 3 450
5 4 615
6 5 480
Use Excel to Find 80
Use Excel to Find 80
th th
Percentile
Percentile
First Quartile =
First Quartile =
25th Percentile
25th Percentile
Second Quartile =
Second Quartile =
50th Percentile
50th Percentile
=
=
Median
Median
Third Quartile =
Third Quartile =
75th Percentile
75th Percentile
Unless the sample size is large, percentiles may not make
Unless the sample size is large, percentiles may not make
sense, since percentiles divide the data into 100 groups.
sense, since percentiles divide the data into 100 groups.
In smaller samples, we might divide the data into four
In smaller samples, we might divide the data into four
groups
groups
(
(
quartiles
quartiles
).
).
Since almost any sample can be
Since almost any sample can be
divided into four groups, the quartiles are important
divided into four groups, the quartiles are important
descriptive statistics to explain.
descriptive statistics to explain.
29
29
Slide
Slide
A B C D E
1
Apart-
ment
Monthl y
Rent ($) Thi rd Quarti l e
2 1 525 =QUARTILE(B2:B71,3)
3 2 440
4 3 450
5 4 615
6 5 480
The
The
range
range
of a data set is the difference between the
of a data set is the difference between the
largest and smallest data values.
largest and smallest data values.
It is the
It is the
simplest measure
simplest measure
of variability.
of variability.
It is
It is
very sensitive
very sensitive
to the smallest and largest data
to the smallest and largest data
values.
values.
35
35
Slide
Slide
Range:
Range:
Example
Example
425 430 430 435 435 435 435 435 440 440
440 440 440 445 445 445 445 445 450 450
450 450 450 450 450 460 460 460 465 465
465 470 470 472 475 475 475 480 480 480
480 485 490 490 490 500 500 500 500 510
510 515 525 525 525 535 549 550 570 570
575 575 580 590 600 600 600 600 615 615
Range = largest value
Range = largest value
-
-
smallest value
smallest value
Range = 615
Range = 615
-
-
425 = 190
425 = 190
Monthly Rent for 70 Apartments
Monthly Rent for 70 Apartments
36
36
Slide
Slide
Interquartile Range
Interquartile Range
or
or
Midspread
Midspread
The
The
interquartile range
interquartile range
of a data set is the difference
of a data set is the difference
between the
between the
third quartile
third quartile
and the
and the
first quartile
first quartile
.
.
it is
it is
not effected by the extreme values.
not effected by the extreme values.
Interquartil e Range = Q Q
3 1
Interquartil e Range = Q Q
3 1
37
37
Slide
Slide
Interquartile Range:
Interquartile Range:
Example
Example
425 430 430 435 435 435 435 435 440 440
440 440 440 445 445 445 445 445 450 450
450 450 450 450 450 460 460 460 465 465
465 470 470 472 475 475 475 480 480 480
480 485 490 490 490 500 500 500 500 510
510 515 525 525 525 535 549 550 570 570
575 575 580 590 600 600 600 600 615 615
3rd Quartile (
3rd Quartile (
Q
Q
3) = 525
3) = 525
1st Quartile (
1st Quartile (
Q
Q
1) = 445
1) = 445
Interquartile Range =
Interquartile Range =
Q
Q
3
3
-
-
Q
Q
1 = 525
1 = 525
-
-
445 = 80
445 = 80
Monthly Rent for 70 Apartments
Monthly Rent for 70 Apartments
38
38
Slide
Slide
Given the following data, use Excel to
Given the following data, use Excel to
find the Interquartile Range :
find the Interquartile Range :
357 550
357 550
654 290
654 290
763 700
763 700
621 789
621 789
900 605
900 605
EXAMPL
EXAMPL
39
39
Slide
Slide
The
The
variance
variance
is a measure of variability that utilizes
is a measure of variability that utilizes
all the data.
all the data.
Variance
Variance
It is based on the difference between the value of
It is based on the difference between the value of
each observation (
each observation (
x
x
i i
) and the mean ( for a sample,
) and the mean ( for a sample,
for a population).
for a population).
x
x
40
40
Slide
Slide
Variance
Variance
The variance is computed as follows:
The variance is computed as follows:
The variance is the
The variance is the
average of the squared
average of the squared
differences
differences
between each data value and the mean.
between each data value and the mean.
for a
for a
sample
sample
for a
for a
population
population
o
2
2
=
( ) x
N
i
o
2
2
=
( ) x
N
i
s
x
i
x
n
2
2
1
=
( )
s
x
i
x
n
2
2
1
=
( )
41
41
Slide
Slide
Standard Deviation
Standard Deviation
The
The
standard deviation
standard deviation
of a data set is the positive
of a data set is the positive
square root of the variance.
square root of the variance.
It is measured in the
It is measured in the
same units as the data
same units as the data
,
,
making
making
it more easily interpreted than the variance.
it more easily interpreted than the variance.
42
42
Slide
Slide
The standard deviation is computed as follows:
The standard deviation is computed as follows:
for a
for a
sample
sample
for a
for a
population
population
Standard Deviation
Standard Deviation
s s =
2
s s =
2
o o =
2
o o =
2
43
43
Slide
Slide
The coefficient of variation is computed as follows:
The coefficient of variation is computed as follows:
Coefficient of Variation
Coefficient of Variation
100 %
s
x
| |
|
\ .
100 %
s
x
| |
|
\ .
The
The
coefficient of variation
coefficient of variation
indicates how large the
indicates how large the
standard deviation is in relation to the mean.
standard deviation is in relation to the mean.
for a
for a
sample
sample
for a
for a
population
population
100 %
o
| |
|
\ .
100 %
o
| |
|
\ .
44
44
Slide
Slide
Measure of
Measure of
relative
relative
dispersion
dispersion
Always a %
Always a %
CV is the standard deviation expressed as percent of
CV is the standard deviation expressed as percent of
the mean
the mean
Used to compare two or more groups
Used to compare two or more groups
Weakness: CV is undefined if the mean is zero or if
Weakness: CV is undefined if the mean is zero or if
data are negative.
data are negative.
Thus, CV is used only for variables whose values are
Thus, CV is used only for variables whose values are
X>=0
X>=0
Coefficient of Variation
Coefficient of Variation
(Continued)
(Continued)
45
45
Slide
Slide
425 430 430 435 435 435 435 435 440 440
440 440 440 445 445 445 445 445 450 450
450 450 450 450 450 460 460 460 465 465
465 470 470 472 475 475 475 480 480 480
480 485 490 490 490 500 500 500 500 510
510 515 525 525 525 535 549 550 570 570
575 575 580 590 600 600 600 600 615 615
Example Continued
Example Continued
Monthly Rent for 70 Apartments
Monthly Rent for 70 Apartments
Given the following monthly rent prices for 70 apartments, find
Given the following monthly rent prices for 70 apartments, find
variance, standard deviation, and the coefficient of variation.:
variance, standard deviation, and the coefficient of variation.:
use
use
equations & Excel
equations & Excel
46
46
Slide
Slide
| | | |
| | = =
| |
\ . \ .
54.74
100 % 100 % 11.15%
490.80
s
x
| | | |
| | = =
| |
\ . \ .
54.74
100 % 100 % 11.15%
490.80
s
x
2
2
( )
2, 996.16
1
i
x x
s
n
= =
2
2
( )
2, 996.16
1
i
x x
s
n
= =
2
2996.47 54.74 s s = = =
2
2996.47 54.74 s s = = =
the standard
the standard
deviation is
deviation is
about 11% of
about 11% of
of the mean
of the mean
Variance
Variance
Standard Deviation
Standard Deviation
Coefficient of Variation
Coefficient of Variation
Solutions
Solutions
Note that
Note that
CV is the standard deviation expressed as percent of
CV is the standard deviation expressed as percent of
the mean.
the mean.
48
48
Slide
Slide
Given the following data, use Excel to
Given the following data, use Excel to
find the followings:
find the followings:
357 550
357 550
654 290
654 290
763 700
763 700
621 789
621 789
900 605
900 605
EXAMPL
EXAMPL
49
49
Slide
Slide
EXAMPLE
EXAMPLE
Given the
Given the
following data:
following data:
357 550
357 550
654 290
654 290
763 700
763 700
621 789
621 789
900 605
900 605
Use Excel to find:
A. The mean
B. The mode
C. The median
D. The 75th percentile
E. The first and the third
quartile
F. The range
G. The interquartile range or
midspread
H. The standard deviation
I. The coefficient of variation
Use Excel to find:
Use Excel to find:
A. The mean
B. The mode
C. The median
D. The 75th percentile
E. The first and the third
quartile
F. The range
G. The interquartile range or
midspread
H. The standard deviation
I. The coefficient of variation
If you need help with
this, see next slides.
If you need help with
If you need help with
this, see next slides.
this, see next slides.
50
50
Slide
Slide
A Problem Using Excel
A Problem Using Excel
A private research
A private research
organization studying
organization studying
families in various
families in various
countries reported
countries reported
the following data for
the following data for
the amount of time 4
the amount of time 4
-
-
year old children
year old children
spent alone with their
spent alone with their
fathers each day.
fathers each day.
Country
Time with Dad
(minutes)
Belgium 30
Canada 44
China 54
Finland 50
Germany 36
Nigeria 42
Sweden 46
U.S.A. 42
51
51
Slide
Slide
Use Excel, answer the following questions and explain
your answers (round all numbers into two decimal
places):
A. The mean
B. The mode
C. The median
D. The 75th percentile
E. The first and the third quartile
F. The range
G. The interquartile range or midspread
H. The standard deviation
I. The coefficient of variation
A Problem Using Excel
A Problem Using Excel
(Continued)
(Continued)
Note:
Note:
All results are rounded to two decimal places.
All results are rounded to two decimal places.
52
52
Slide
Slide
Using SWStat
Using SWStat
+ +
(
(
Creating Data Area
Creating Data Area
)
)
Data Area
Data Area
53
53
Slide
Slide
Using SWStat
Using SWStat
+ +
(Choose Statistics; Ungrouped Data; Choose Measures)
(Choose Statistics; Ungrouped Data; Choose Measures)
54
54
Slide
Slide
Using SWStat
+
(Numerical Data, Summary Measures (Sample); Calculate)
55
55
Slide
Slide
Using SWStat
Using SWStat
+ +
(Results)
(Results)
56
56
Slide
Slide
Using SWStat
Using SWStat
+ +
(Numerical Data; Percentile; Calculate)
(Numerical Data; Percentile; Calculate)
57
57
Slide
Slide
Using SWStat
Using SWStat
+ +
(Results)
(Results)
58
58
Slide
Slide
End of Chapter 4,
End of Chapter 4,
Part A
Part A