Download as pdf or txt
Download as pdf or txt
You are on page 1of 54

Mathematics in the Modern World – UNIT 4

Descriptive Statistics

Mathematics & Physics Department


University of Santo Tomas

Prepared by:
Asst. Prof. Xandro Alexi A. Nieto
MEASURES OF CENTRAL TENDENCY

also known as “average”

• MEAN
• MEDIAN
• MODE
MEASURES OF CENTRAL TENDENCY

• MEAN (arithmetic mean)


the sum of observations divided by the number of observations.
MEASURES OF CENTRAL TENDENCY

• Example 1:
A marketing specialist gathered five
randomly selected customers and
their age (years) are
19, 25, 32, 27 and 41.
Find the mean age of the customers.

The mean age of the customers is 28.8 years.


MEASURES OF CENTRAL TENDENCY

• Example 2.1:
An researcher wants to determine the
cholesterol level (mg/dL) of all the six
residents of Guyan Island. Observations
are as follows:
120, 120, 140, 150, 160, 190.
Compute for its mean.

The mean cholesterol level of the


residents is 146.67 mg/dL.
MEASURES OF CENTRAL TENDENCY

• Example 2.2:
Suppose that the researcher wants to
determine the cholesterol level (mg/dL) of all
the six residents of Guyan Island. But due to
some constraints, he can only get three of
the population. His observations are as
follows:
120, 140, 160.
Compute for its mean.

The mean cholesterol level of the


residents is 140 mg/dL.
140 mg/dL is an estimate of the cholesterol level of all the residents,
which is 146.67 mg/dL.
STATISTICS is an estimate of the PARAMETER. .
MEASURES OF CENTRAL TENDENCY
MEASURES OF CENTRAL TENDENCY
MEASURES OF CENTRAL TENDENCY
MEASURES OF CENTRAL TENDENCY

• MEDIAN
the middle value of ordered observations
MEASURES OF CENTRAL TENDENCY

• Example 1:
Arranging the observations ascendingly:
19, 25, 27, 32, 41.
A marketing specialist gathered five
randomly selected customers and
middle value
their age (years) are
19, 25, 32, 27 and 41. The middle value is 27.
What is the median age of the
customers?

The median age of customers is 27 years.


MEASURES OF CENTRAL TENDENCY

• Example 2: Arranging the observations ascendingly:


A researcher wants to determine the 120, 120, 140, 150, 160, 190.
cholesterol level (mg/dL) of all the six
residents of Guyan Island. Observations
are as follows: middle values
120, 120, 140, 150, 160, 190.
Find its median. The middle values are 140 and 150.

Just take the middle value of 140 and 150.


x෤ = 145

The median cholesterol level of the


residents is 145 mg/dL.
MEASURES OF CENTRAL TENDENCY

In the previous examples,


Example 1: 19, 25, 27, 32, 41
x1, x2, x3, x4, x5
x෤ = x3=27

Example 2: 120, 120, 140, 150, 160, 190


x1, x2, x3, x4, x5, x6
x෤ = x3.5 = 145
MEASURES OF CENTRAL TENDENCY

Thus,
if x1< x2 < x 3 < x4 < x5 x෤ = x3
if x1< x2 < x3 < x4 < x5 < x6 x෤ = x3.5
if x1< x2 < x3 < x4 < x5 < x6 < x7 x෤ = x4
if x1< x2 < x3 < x4 < x5 < … < x8 x෤ = x4.5
if x1< x2 < x3 < x4 < x5 < … < x11 x෤ = x6
MEASURES OF CENTRAL TENDENCY

Given that
x1< x2< … < xn,
The median is
𝐱෤ = 𝐱 𝟏(𝐧+𝟏) .
𝟐
MEASURES OF CENTRAL TENDENCY

Find the median of the data presented in the frequency


table below:

𝑥෤ = 30
MEASURES OF CENTRAL TENDENCY

• MODE
the most frequent observation(s)
xො
MEASURES OF CENTRAL TENDENCY

• Example 2:
A researcher wants to determine the
cholesterol level (mg/dL) of all the six
residents of Guyan Island. Observations
xො = 120 mg/dL
are as follows:
120, 120, 140, 150, 160, 190.
What is its mode?
MEASURES OF CENTRAL TENDENCY

• Example 1:
A marketing specialist gathered five
randomly selected customers and
their age (years) are
19, 25, 32, 27 and 41. xො does not exist.
What is the modal age of the
customers?
MEASURES OF CENTRAL TENDENCY

• Example 3:
A social media specialist wanted to
determine the daily number of tweets
of first year college students. Results
from seven randomly selected
students are
12, 12, 40, 40, 50, 70, and 75. xො = 12 & 40.
Find its mode.
MEASURES OF CENTRAL TENDENCY

A set of observations with


one mode is called unimodal,
two modes is called bimodal,
three modes is trimodal, and
more than three modes is
multimodal.
MEASURES OF CENTRAL TENDENCY

Find the mode of the data presented in the


frequency table below:

xො = 29 and 30
MEASURES OF CENTRAL TENDENCY

When do we use the different


measures of central tendency?
Mean Median Mode
Ratio/Interval ✓✓ ✓ ✓

Ordinal X ✓✓ ✓

Nominal X X ✓
MEASURES OF OTHER POSITION

also known as “quantiles”

• QUARTILES
• DECILES
• PERCENTILES
MEASURES OF OTHER POSITION
MEASURES OF OTHER POSITION

Given that x1< x2< … < xn,


MEASURES OF OTHER POSITION

• Example 1:
Arranging the observations ascendingly:
A marketing specialist gathered five 19, 25, 27, 32, 41.
randomly selected customers and
their age (years) are Q3 = x3(n+1)
19, 25, 32, 27 and 41. 4
What is the third quartile (Q3) of the
= x3(5+1) = x4.5 = 36.5
customers’ age? 4

Alternatively,

𝑥4.5 = 𝑥4 + 0.5(𝑥5 − 𝑥4 )
= 32 + 0.5 (41 - 32)
= 36.5
MEASURES OF OTHER POSITION

• Example 1:
Arranging the observations ascendingly:
A marketing specialist gathered five 19, 25, 27, 32, 41.
randomly selected customers and
their age (years) are Q1 = x1(n+1)
19, 25, 32, 27 and 41. 4
What is the first quartile (Q1) of the
= x1(5+1) = x1.5
customers’ age? 4

x1.5 = x1 + 0.5(x2 − x1 )
= 19+ 0.5 (25 – 19)
= 22
MEASURES OF OTHER POSITION

• Example 1:
Arranging the observations ascendingly:
A marketing specialist gathered five 19, 25, 27, 32, 41.
randomly selected customers and
their age (years) are
19, 25, 32, 27 and 41.
P70 = x 70 (n+1)
What is the 70th percentile (P70) of
100
the customers’ age? = x 70 = x4.2
100(5+1)
x4.2 = x4 + 0.2(x5 − x4 )
= 32+ 0.2 (41 – 32)
= 33.8
MEASURES OF OTHER POSITION

• Example 2:
A researcher wants to determine the
P73 = x 73
cholesterol level (mg/dL) of all the six 100 n+1
residents of Guyan Island. Observations
are as follows: = x 73 6+1
= x5.11
120, 120, 140, 150, 160, 190. 100
What is its 73rd Percentile (P73)?

x5.11 = x5 + 0.11(x6 − x5 )
= 160+ 0.11 (190 - 160)
= 163.3
MEASURES OF VARIATION

• RANGE
• INTERQUARTILE RANGE (IQR)
• MEAN ABSOLUTE DEVIATION
• VARIANCE
• STANDARD DEVIATION
• COEFFICIENT OF VARIATION
MEASURES OF VARIATION

• RANGE
the difference between the lowest & highest observations
MEASURES OF VARIATION

• Example 1:
A marketing specialist gathered five
randomly selected customers and
their age (years) are
19, 25, 32, 27 and 41.
Find its range.

Range = 41 − 19 = 22
MEASURES OF VARIATION

• INTERQUARTILE RANGE (IQR)


the difference between the Q1 and Q3.
MEASURES OF VARIATION

• Example 1:
A marketing specialist gathered five
randomly selected customers and
their age (years) are
19, 25, 32, 27 and 41.
Find its IQR.
Since Q1 = 22 and Q3 = 36.5,
then IQR = Q3 – Q1 = 36.5 - 22 = 14.5
MEASURES OF VARIATION

• Example 1: Additionally,
A marketing specialist gathered five
randomly selected customers and LB = Q1 – 1.5(IQR)
their age (years) are = 22 – 1.5(14.5)
19, 25, 32, 27 and 41. = 0.25
UB = Q3 + 1.5(IQR)
= 36.5+ 1.5(14.5)
= 58.25
MEASURES OF VARIATION

• Example 1:
Summary: UB = 58.25
A marketing specialist gathered five Highest = 41
randomly selected customers and 𝑄3 = 36.5
their age (years) are 𝑄2 = 27
19, 25, 32, 27 and 41.
𝑄1 = 22
Lowest = 19
LB = 0.25
MEASURES OF VARIATION

𝐓𝐡𝐞 𝐁𝐨𝐱𝐩𝐥𝐨𝐭
also known as the
Box and Whiskers plot
MEASURES OF VARIATION

• Example 1: Summary: UB = 58.25


Highest = 41
A marketing specialist gathered five
𝑄3 = 36.5
randomly selected customers and
their age (years) are 𝑄2 = 27
19, 25, 32, 27 and 41. 𝑄1 = 22
Lowest = 19
LB = 0.25
MEASURES OF VARIATION

Boxplot
with outliers
MEASURES OF VARIATION

• MEAN ABSOLUTE DEVIATION (MAD)


- the average distance of each observation from the mean

σ 𝑥−𝜇 σ 𝑥 − 𝑥ҧ
MAD = MAD =
N n
MEASURES OF OTHER POSITION

• Example 4: Compute first for its mean.


All the five students of students in 𝜇 = σN𝑥 = 11 hrs
section A were asked by the
teacher to record the number of MAD =
σ 𝑥−𝜇
N
hours each spent studying for a =
9 − 11 + 15 − 11 + 8 − 11 + 9 − 11 + 14 − 11
5
given exam from the time the 14
=
exam was announced in class. 5
= 2.8 hrs
The following
observations were
the recorded (in hrs):
9, 15, 8, 9, 14.
Compute for MAD.
MEASURES OF VARIATION

• VARIANCE

2 σ x − xത 2
σ x−μ 2
2
σ = s =
N n−1
MEASURES OF OTHER POSITION

• Example 4: Compute first for its mean.


All the five students of students in 𝜇 = σN𝑥 = 11 hrs
section A were asked by the
teacher to record the number of
hours each spent studying for a σ x−μ 2
2
given exam from the time the σ =
exam was announced in class. 9 − 11 2 N+ 15 − 11 2
+ 8 − 11 2
+ 9 − 11 2
+ 14 − 11 2
=
5
The following
42
observations were = = 8.4 hrs2
5
the recorded (in hrs):
9, 15, 8, 9, 14.
Compute for its variance.
MEASURES OF VARIATION

• Example 1: Compute first for its mean:


σ x 144
A marketing specialist gathered five
n
xത =
= = 28.8 years
randomly selected customers and 5
their age (years) are 2
σ 𝑥 − 𝑥ҧ
19, 25, 32, 27 and 41. 𝑠2 =
Compute for its variance. n−1
2 2 2 2 2
19 − 28.8 + 25 − 28.8 + 32 − 28.8 + 27 − 28.8 + 41 − 28.8
=
5−1
272.8
= = 68.2 yrs 2
4
MEASURES OF OTHER POSITION

If x1, x2, … , xn are random


samples from a population
with variance σ2 ,
2 σ x−തx 2
then s = is an
n−1
2
unbiased estimate of σ .
MEASURES OF OTHER POSITION

• STANDARD DEVIATION

𝜎= 𝜎2 𝑠= 𝑠2
MEASURES OF OTHER POSITION

• Example 4:
All the five students of students in
section A were asked by the Since 𝜎 2 = 8.4 hrs2 ,
teacher to record the number of
then 𝜎 = 8.4 hrs2
hours each spent studying for a
= 2.90 hrs
given exam from the time the
exam was announced in class.
The following
observations were
the recorded (in hrs):
9, 15, 8, 9, 14.
Compute for the
standard deviation.
MEASURES OF VARIATION

• Example 1:
A marketing specialist gathered five Since s 2 = 68.2 yrs2 ,
randomly selected customers and
their age (years) are then s = 68.2 yrs2
19, 25, 32, 27 and 41. = 8.26 yrs
Compute for its standard deviation.
MEASURES OF VARIATION

Which variable
(A or B) has more
variation?
MEASURES OF VARIATION

Which variable
(A or B) has more
variation?
𝜎 𝑠
CV = 100% or CV = 100%
𝜇 𝑥ҧ
MEASURES OF VARIATION

Which variable
(A or B) has more
variation?
𝜎 𝑠
CV = 100% or CV = 100%
𝜇 𝑥ҧ
MEASURES OF OTHER POSITION

COEFFICIENT OF VARIATION (CV)


𝜎 𝑠
CV = 100% CV = 100%
𝜇 𝑥ҧ

Used to compare the variability of two or more variables


with different means.

Used to compare the variability of two or more variables


with different units of measurement.
END OF UNIT 4.2

You might also like