Professional Documents
Culture Documents
Skewness & Kurtosis
Skewness & Kurtosis
Chapter 4
Measures of Shape and Relative Location
A4. MEASURES OF SHAPE Thus:- "Lack of asymmetry or symmetr÷ of a given
distribution of a random variable is called skewness". The
idea about the shape of the curve which can be drawn with
The principal measures of distribution shape used in-
the help of the given data is provided by skewness.
statistics are skewness and kurtosis
A distributionis said to be skewed if:
Measures of Shape 1) The values of mean, median and mode are not equal,
ie., Mean # Median # Mode.
2) Quartiles are not equi-distant from median, and
Skewness Moments 3) The curve which is drawn with the help of the given
data is not symmetrical and stretched more to one side
Kurtosis in comparison to other side.
X = Med= Mode X
X Med MÍ
Med
Symmetrical Distribution Positively Skewed Distribution Negatively Skewed Distribution
Figure 4.1
) Symmetrical Distribution: It is clear from the given 3) Negatively Skewed Distribution: The value of mode
igure that the values of mean, median and mode are is maximum and the value of mnean is minimum in
equal in a 'symmetric distribution'. Equal number of case of negatively skewed distribution. Median lies in
ltems is spread on both sides of the mid-point of the between the mean and mode.
CUrve.
It should be noted that in
lia distribution of a random variable is not symmetric moderately symmetrical
distributions, the interval between the mean
then it is referred as skewed distribution and this type and the
median is approximately one-third of the interval between
of distribution could either be positively skewed or the mean and the mode. It is the
negatively skewed. provides ameans of measuring the degreerelationship which
of skewness.
4) Positively Skewed Distribution: It is clear from the
8iven figure that the value of mean is maximum and 4,1.1.1. Objectives of Skewness
he value of mode is minimum in a positively skewed 1) Measures of skewness tell about the degree
distribuion Median lies in between the mean and Concentration. of
mode.
Statistics al
Semester (Business
MBA First
58 (Unit-I)
facility to
compare twO or
by
more frequency
two methods: distributions.
the basis of the We can find the
skewness
Moderately skewed distribution is mode. The
mean, median and
2) Measure of Skewness
empirical relations of tells about the extent to which
measure of skewness
holds true.
the empirical relationship about the normality of the
Karl Pearson's
Coefficient of Skewness Bowley'sCoefficient of
3) Skewness helps to know such as error Skewness
distribution.Various statistical measures assumption
are based on the Pearson's Coefficient
of mean, sampling error 4.1.1:5. Karl of
of a normal distribution. nature
gives an idea about the Skewness
4) Measures of skewness value. Measure, Sk = Mean -
Mode
of variation of the item
the
from central 1) Absolute Mean - Mode
help in knowing about the
5) Measures of skewness
Or
differs in the Re lative Measure ,J= Stan dard Deviation
dispersion on either side of mode, which
arrangement of their frequencies. X-Z
4.1.1.2. Tests of Skewness
Following are the tests which are applied to find that a Where, X =Arithmetic mean; Z Mode: and
distribution is skewed or not: G=:standard deviation.
1) If a distribution is skewed then the value of mean,
median and mode would not coincide. The value of 2) If mode is not determined, then
median generally lies between the mean and the Absolute Measure,
mode. In a moderately asymmetrical distribution, Sk = 3(Mean - Median) or 3(X-M)
Mean Mode = 3 (Mean -Median)
Relative measure, J =
3(Mean Median ) or
’3 Median =3 Mean + Mode - Mean
Stan dard Deviation
=2 Mean + Mode
=2 Mean + Mode + 2 Mode -
3(X-M)
2 Mode Where, M= Median.
=3Mode + 2 (Mean Mode) Note: Mathematically there are no limits for J, but
’ Median = Mode +(Mean - Mode ) practically
Mean Mode
J= takes values between ± 1;
2) If a distribution is skewed then the two quartiles S tan dard Deviation
would not be equi-distant from the median. In other 3(Mean - Median) takes values between ±3
words, it can be said that (Q - M) (M-Q) #0. J=
S tan dard Deviation
3) If a distribution is skewed then its graph would not
give a symmetrical bell-shaped curve.
4) The sum of positive deviations and the sum of negative
Example 1: From the following data find out the K
Pearson's Coefficient of Skewness:
deviations from the median would not be equal.
Measurement 20 21 22 24 25
5) At various points, frequencies are not equally
distributed which are equi-distant from the mode. In 1 3 8 1 6 1
Frequency
an asymmetrical distribution (Mean - Median) = 3
(Mean - Mode). Solution: Let the assumed mean (A) = 22
Calculation of Karl Pearson's Coefficient of Skewness
4.K13.
1)
Advantages of Skewness Measurement| Frequency|d, fd,
a Af given distribution is normal then the skewness (X) () (X
would be zero 'and it is called symmetric distribution. 22)
But generally, data points are not perfectly 20 -2 -2 4
2) With the help of skewness, we
symmetric. 21 3 -3
know
from the mean is whether positive or that the deviation 22 8 0
negative. 23 + 1 + 11 1
3) D'Agostino's K-squared test is 24
a goodness-of-fit 24 6 +2 + 12
normality test based on sample skewness and sample 25 1 +3 +3 9
kårtosis. N=30 fd, -51
Q =L +L4 f -xi
Solution: Bowley's Coefficient of skewness is:
Q3 +Q-2M Here, L =300, = 400, f =20, c= 28, i= (400 -
Sk 300) = 100
Substituting the values in the above formula, Q, =300+ 37.5-28 -x100 = 300 + 9.5 x100
20 20
SkB Q3+16.4-(2x24.2)
Q3-16.4 =300 +
95
= 300 +47.5 =347.5
2
-0.56 Q3+16.4 48.4
th
Q3-16.4 3) Upper Quartile (Q3) is the value of 3N
Or l12.5th
or
-0.56 (Q3-16.4) =Q3-32 4
or
-0.56 Q3 +9.184=Q3-32 item which is lies in the
class-interval (600-700).
or -0.56 Q3 -Q,=-32 -9.184 3N
- 1.56 O% =-41.184
Q3 = L + x i
Q3 = -41.184 = 26.6 f
-1.56 Here, L= 600, y=700, f = 28, c= 104, i= 100
Now, we have the values of both the upper and the
quartiles. lower Q3 =600 +112.5104 -x100 = 600 + 8.5x100
28 28
Coefficient of quartile deviation = Q3-Q1 = 600 + 30.35 = 630.35
Q3 +Q
26.4 -16.4 10
Bowley'sCoefficient of Skewness or
26.4 +16.4 Q, +Q-2M 630.35 +347.5-2x503.33
42.8 SK
=0.234 approx. 630.35 -347.5
Example 5: Calculate Bowley's
Coefficient of 977.85-1006.66 28.81
from the data given below: Skewness 282.85 282.85
=-0.102
Profits per Shop ) |100-200-300-
200 300 400 400-500-600-700
500 600 700
800
Example 6: Compute an appropriate measure of skewness
Number of Shops 10 18 20 26 30 28 18 for the following data:
Solution: Sales ( lac) Number of Companies
Below 50 12
Calculation of Coefficient of Skewness
Profits per Number of 50-60 30
Shop ) (X) Shops (f) Cumulative 60-70 65
100-200 10 Frequency
10
(c.£.) 70-80 78
200-300 18 80-90 80
28
300-400 20 90-100
48 55
400-500 26
100-110 45
500-600 30
104 110-120 25
600-700 28
132 Above 120 10
700-800 18 150
Measures of Shape and Relative Location (Chapter 4)
61
Skewness
he shape of the
series. the composition of the series. Where, N=2f
62 (Unit-) MBA First Semester (Business Statistics and Analytics) AKTU
4.1.2.2. Coefficients Based on Moments curve is (ermed as 'Leptokurtic" and a flat topped curve is
On the basis of relative proportions of different moments, termed as "Platykurtic'.
Alpha () Beta () and Gamma () cocfficicnts are 1) Leptokurtic: A frequency curve
which is more
calculated with the hclp of the following formulae: peaked than the normal curve is
known as
Leptokurtic. In comparison to
Alpha Beta Cocfficicnts Gamma the
Coefficients Coefficients distribution, the Leptokurtic distribution has normal
higher
peak around the mean which leads to thick taila
B, ==a? both sides. These peaks result from the data
highly concentrated around the mean which ie
because the
variations are lower within the observations.
a,= s l VB, 3/2a Y2 =B, -3 2) Platykurtic: A frequency curve
H4-3u,
2
peak than the normal curve is which has flatter
2 In comparison to the known as Platykurtic
normal distribution. the
Platykurtic distribution have flatter peak around the
=O4 mean which leads to thin tails on
peaks result from the data which is both
less
sides. These
variationsconcentrated
around the mean because the
2 within the observations. are large
3) Mesokurtic: A frequency curve with a
These coefficients are used to measure is known as normal
The kurtosis coefficientcurve
Kurtosis.
skewness and
normal Mesokurtic. of a
distribution
is 3.
high then the distribution will
Ifthe kurtosis coefficient is
4.1.3. Kurtosis the mean. When the be more peaked around
In than 3 then the kurtosis coefficient is greater
probability theory and statistics, measurement of the distribution is called leptokurtic and if
"peakedness" of the it is less than 3 then it is called
valued probability
random variable is known asdistribution of a real platykurtic.
kurtosis. In the figure 4.3, curve No. 1
is
The form of any distribution is given by curve No. 2 is more peaked than thenormal normal
or mesokurtic,
distribution is plotted on the graph then it kurtosis. If any is leptokurtic and curve No. 3 is curve, hence
more flat than the normal
a normal curve or a curve more flat than the represents either curve, and is platykurtic.
or a curve more peaked than the norma curve,
normal curve. Kurtosis is
a measure which tells how
much 'peaked' or flat' the data Lepto-kurtic Curve No. 2
is in relation to a normal
distribution. In other words,
kurtosis depicts a distinct peak near the mean, shows a
rapid decline and ends up with a heavy Curve No. 1
thin tail. tail, ending up with Meso-kurtic
5400
Solution: = 54
Calculation of Moments 100
Weight No. of fX d(X-X) fd fd fd fd Sfd
Workers =0,
(X) 100
()
60 3 180 -5 -15 75-375| 1875
16 64 256 1024 43700
61 4 244 4 :437
7 434 3 -21 63 189 567 100
62
63 9 567 -2 36 -72 144
0
442800
65 12 780 =-4428,
70 6 420 30 150750 3750 100
75 75 10 10 100 1000| 10000
160 15 30 450 6750|101250 43866500
80 2 =438665
Sf= 44 SfX 100
=0 118610
|2860 9387608 (4428)? 19607184
=0.235;
(437) 83453453
2860 438665 438665
X= 65 = 2,297
44 (437)2 190969