Professional Documents
Culture Documents
Quartiles and Percentiles
Quartiles and Percentiles
Quartiles and Percentiles
UNGROUPED
Quartiles
- Are natural extension of the median idea in that they are values
which divide a set of data into equal parts.
- While the median divides the distribution into two parts, the
quartiles divide into four, or ten for deciles, and hundred for
percentiles. The quantiles that divide the distribution into four equal
part s are called quartiles. These values are denoted by Q1 ,Q2 , and
Q3 .Twenty five percent fall below the first quantile ( Q 1 ) ,
50 % arebelow the second quantile ( Q 2 ) ,∧75 % are less
than the third quantile (Q3 ). Those which divide the distribution into
ten parts are called deciles. The data set has nine deciles which are
denoted by D 1 , D2 , … … . D 9 . D1 is the number that∣thebutton 10% of the data
from the top 90%and so on. Those which divide the distribution into
100 equal parts are called Percentiles. A data has 99
percentiles which is denoted by P1 , P2 , … … .. P99 .
Quartiles
x
Formula: Q x = ( n+1 ) ;
4
n=number of data set .
x=the required quartile / position .
Example:
Given: 12, 14, 16, 21,31,36, 44, n=7
X1 = 12, x2 = 14, x3 = 16, x4 = 21, x5=31, x6 = 36, and x7 = 44
Required: a) Q1 b ¿Q 2 c ¿Q 3
x
Solution: Q x = ( n+1 ) ;
4
a) Q1
1
Q1= (n+1)
4
1
Q1= ( 7+1 )
4
1
Q1= (8)
4
2
b) Q2= (n+1)
4
1
Q2= ( n+1 )
2
1
Q2= (7+1)
2
1
Q2= (8)
2
8
Q 2=
2
Q2=4 means position number X 4 , which is
Q2=21 Answer , same as Median.; 50%
c) Q3
3
Q3= ( n+ 1 )
4
3
Q3= (7+1)
4
3
Q3= (8)
4
Q3=.75 ( 8 )=X 6 ;means( pwesto) ;Q3=36 Answer
Q4 = 4/4(n+1)
Q4 = 1(7+1) = 8 , none ; Q2 = d5 = P50 = Median
DECILES(10) = 10/10(n+1)
Formula:
x
D x= ( n+1 )
10
Examples: 12,16,22,31,22,19,35
Required: a ¿ D ¿2 b ¿ D 5 c ¿ D8
5
b) D 5=
10
(n+1)
5
D5= ( 7+ 1 )
10
1
D 5= ( 8 )
2
D5=4
Means position X4 , remember D5=Median ;
D5=22 .
8
c) D 8=
10
( n+1 ) ; 12,16,19,22,22,31,35
8
D8= (7+1)
10
8
D8= (8)
10
64
D 8=
10
D8=6.4 ; position data 6 and 7, therefore.
D8=31+.4∗(35−31 )
D8=31+ 0.4∗4
D8=31+1.6
D8=32.6 Answer
PERCENTILE☹100
Formula:
x
P x= ( n+1 )
100
Example:
Given: 20,30,40,50,60,70
Required: a ¿ P ¿20 b ¿ P50 c ¿ P75
Solution:
20
a ¿ P ¿20= ( 6+1 )
100
Given: 20,30,40,50,60,70
20
P20 = ( 7)
100
140
P20 =
100
P20 =1.4=20+0.4 ( 30−20 )
P20 =20+ 4=24 Answer
50
b) P50 =
100
( 6+1 )
50
P50= ( 7)
100
P50=.50 ( 7 )
P50 =3.5
P50 =40+.5(10)
Given: 20,30,40,50,60,70
75
c) P75=
100
( n+1 )
75
P75= ( 6+1 )
100
75
P75= ( 7)
100
P75=5.25
P75=60+ 0.25(70−60)
P75=60+2.5
MEASURE OF VARIABILITY
Descriptive measures that are used to indicate the amount of variation in a data set are
called measures of variability, dispersion, or spread. When descriptive statistics are presented,
there is usually at least one measure of central tendency and at least one measure of variability
reported. The measure of dispersion are as follows:
1. RANGE
- The range of a data set is defined to be the difference between the highest and
lowest values in the data set.
Range ( R )=Highest value−Lowest value
1.a The characteristics of the Range
It is easy to compute and understand. It emphasizes the extreme values.
However, it is the most unstable or unreliable measure because its value easily
changes or fluctuates with the change in the extreme values.
1.b Some Uses of the Range
The range is used to report the movement of stock prices over a time
period and the weather reports typically state the high and low temperature
readings for 24 hours period.
Example:
Find the range in sets A, B, and C
Set A: 13,19,22,30,40
Set B: 1, ½, 3, 5, 4, 6
Set C: 12,102,2,70,80,35
Solution:
Set A: 13,19,22,30,40
Range ( R )=Highest value−Lowest value
Range ( R )=40−13
Range ( R )=27 Answer
Set B: 1, ½, 3, 5, 4, 6
Range ( R )=Highest value−Lowest value
1
Range ( R )=6−
2
1
Range ( R )=5 Answer
2
Set C: 12,102,2,70,80,35
Range ( R )=Highest value−Lowest value
Range ( R )=1 0 2−2
Range ( R )=100 Answer
Based on the computed range for set A, B, and C, it can be concluded that C
has greater variability as compared to A and B
MAD=
∑|x −x|
n
where : MAD=Mean Absolute Deviation
x=a particular data
x=sample mean−Simple
n=total number of observations .
||=absolute value .
EXAMPLE:
Consider the hourly rate of the randomly selected teachers in three
different Universities in Metro Manila.
X=
∑ X = 2000 =400 ;
n 5
MAD=
∑|x −x| 300
= 5 =60
n
The MAD for these data of five items is 60. This mean that, on the average, the
values deviated from the mean value of 400 by 60.
X=
∑ X = 2,200 =440 ;
n 5
MAD=
∑|x −x| 460
= 5 =92
n
The MAD for these data of five items is 92. This mean that, on the average,
the values deviated from the mean value of 440 by 92.
CIIT
X X −X |x−x|
250 250 – 315= -65 65
275 275 – 315 = -40 40
300 300 – 315=-15 15
350 350 – 315 = 35 35
400 400 – 315 = 85 85
∑ X=1575 0 ∑|x−x|=¿ 240 ¿
X=
∑ X = 1575 =315;
n 5
MAD=
∑|x −x| 240
= 5 =48
n
The MAD for these data of five items is 48. This mean that, on the average,
the values deviated from the mean value of 315 by 48.
Based on the computed MAD for schools UE, FEU, and CIIT, it can be concluded that
FAR EASTERN UNIVERSITY has a greater variability as compared to UNIVERSITY
OF THE EAST, and CIIT.
Formulas:
Q3−Q1
QD=
2
IQR=Q3 −Q1 ; Includes approximately the middle 50 % of the
values arranged ∈array .
Example:
Given :
4, 8, 10, 12, 16
X 1 ¿ 4 , X 2=8 , X 3=10 , X 4=12 , X 5=16
Required: IQR, QD
Solution:
Q 1= X 1 ; Q1 =X 1
(n +1) (5+1)
4 4
Q 1= X 1
(6 )
4
Q 1= X 6
4
Q1= X 1.5
X 1 + X 2 4 +8 12
Q 1= = = =6
2 2 2
Q1=4+.5 ( 8−4 )=4+0.5 ( 4 )=4+ 2=6
28
Q 3=
2
Q3=14 Answer
Q3−Q1
QD=
2
IQR=Q3 −Q1
Q3−Q1
QD=
2
IQR=14−6
IQR=8 Answer
Q3−Q1
QD=
2
14−6
QD=
2
8
QD=
2
QD=4 Answer
That means middle 50% of the data lies between 4 and 14.
S =∑ ¿ ¿¿ ; S= √∑ ¿¿ ¿ ¿
2
where
2
S =variance of a population
µ = population mean
x=values of observation
2
S =variance of a population
x = mean sample
x=values of observation
Example:
Let us use our previous example the University of the East for their hourly
rate.
UNIVERSITY OF THE EAST
X X −X ¿
300 300 – 400= -100 10000
350 350 – 400 = -50 2500
400 400 – 400 = 0 0
450 450 – 400 = 50 2500
500 500 – 400 = 100 10000
∑ X=2000 ∑ ¿¿25,000
X=
∑ X = 2000 =400 ;
n 5
S =∑ ¿ ¿¿ ;
2
25000
S2 = ;
5−1
2 25,000
S=
4 ; S2=6250 ;
S= √∑ ¿¿ ¿ ¿
S=
√ 25000
5−1
S=
√ 25000
4
S=79.06 Answer
SHORT-CUT FORMULA:
S =n ∑ x −¿¿ ¿ ;
2 2
S= √ n ∑ x 2−¿ ¿ ¿ ¿
X 2
x
300 90000
350 122500
400 160000
450 202500
500 250000
∑ X=2000 ∑ x 2=825000
VARIANCE:
S =n ∑ x −¿¿ ¿
2 2
2
S =5 ( 825000 )−¿ ¿
125000
S2 =
20
2
S =6250 Answer
STANDARD DEVIATION:
S =n ∑ x −¿¿ ¿
2 2
S= √ n ∑ x −¿ ¿ ¿ ¿
2
√
S= n ∑ x −¿ ¿ ¿ ¿
2
S= √5 ( 825000 ) −¿ ¿ ¿
S=¿
√ 125000
20
S=¿ √ 6250
Assignment :
Given: Ungrouped.
10,12,34,36,45,50,55
Required:
Q2 ,Q3 , D5 , D7 , P25 , P50 , P75 , , QD , IQR , MAD , Variance ,∧¿
Standard deviation
Submission July 20,2020 till 2:00 pm. through canvas…