Download as pptx, pdf, or txt
Download as pptx, pdf, or txt
You are on page 1of 40

Lecture-2

Review of Descriptive Statistic


● Skewness and Kurtosis
● Graphical representation of data
● Learning through SPSS

Larson & Farber, Elementary Statistics: Picturing the World, 3e 1


Measures of Shape: Skewness and Kurtosis
• skewness tells you the amount and direction of skew (departure
from horizontal symmetry)
• kurtosis tells you how tall and sharp the central peak is, relative
to a standard bell curve.
• Skewness is a statistical number that tells us if a distribution is
symmetric or not.
• A distribution is symmetric if the right side of the distribution is
similar to the left side of the distribution. If a distribution is
symmetric, then the Skewness value is 0. i.e. If a distribution is
Symmetric (normal distribution): median= mean= mode, (Skewness
value is 0)
• If Skewness is greater than 0, then it is called right-skewed or that
the right tail is longer than the left tail.
• If Skewness is less than 0, then it is called left-skewed or that the left
tail is longer than
Larson the right
& Farber, tail. Statistics: Picturing the World, 3e
Elementary 2
Skewness
 Skewness. Tiltness of distribution,
When they are displayed graphically,
some distributions have many more
observations on one side of the graph
than the other. Distributions with
fewer observations on the right
(toward higher values) are said to be
skewed right; and distributions with
fewer observations on the left
(toward lower values) are said to
be skewed left.
 When the distribution is symmetric,
the value of skewness should be zero.

IS 310 – Business Statistics Slide


3
Skewed & Symmetrical Distributions
Skewed Distribution:
 In a skewed distribution, the mode will be located at the peak on one side
and the mean usually will be displaced toward the tail on the other side.
 The median is usually located between the mean and the mode.
 If a normal distribution has a skewness of 0, right skewed is greater then 0
and left skewed is less than 0.
Symetrical Distribution:
 In a symmetrical distribution, the mean and median will always be equal.
 Because the mean, the median, and the mode are all measuring central
tendency, the three measures are often systematically related to each other.
 If a symmetrical distribution has only one mode, the mode, mean, and
median will all have the same value.
In Symetrical: mean = median = mode = Normal Distribution
When the distribution is symmetric, the value of skewness should be zero

IS 310 – Business Statistics Slide


4
Skewed & Symmetrical Distributions
Difference between Variance and Skewness :
The following two points of difference between variance and skewness should
be carefully noted.
1. Variance tells us about the amount of variability while skewness gives the
direction of variability.
2. 2. In business and economic series, measures of variation have greater
practical application than measures of skewness. However, in medical and
life science field measures of skewness have greater practical applications
than the variance.

IS 310 – Business Statistics Slide


5
VARIOUS MEASURES OF SKEWNESS
Absolute Measures of Skewness: Following are the absolute measures of
skewness:
 1. Skewness (Sk) = Mean – Median
 2. Skewness (Sk) = Mean – Mode
 3. Skewness (Sk) = (Q3 - Q2) - (Q2 - Q1)

IS 310 – Business Statistics Slide


6
 Example1: For a distribution Karl Pearson’s coefficient of
skewness is 0.64, standard deviation is 13 and mean is 59.2
Find mode and median.
 Solution: We have given

Sk = 0.64, SD= σ = 13 and Mean = 59.2


Therefore by using formulae:

0.64 = (59.2 – Mode) / 13 Ungrouped


Mode = 59.20 – 8.32 = 50.88
Mode = 3 Median – 2 Mean
50.88 = 3 Median - 2 (59.2)
Median = (50.88 + 118.4) / 3
Median = (169.28) / 3
Median = 56.42 Ans
IS 310 – Business Statistics Slide
7
 Example2: The first four raw moments of a distribution are 2,
136, 320, and 40,000. Find out SD.
 Solution:

SD= σ = ? Sk = 1.28, Mean = 164, Mode = 100


Therefore by using formulae:

1.28 = (164 -100) / σ


σ = (64 / 1.28) = 50

IS 310 – Business Statistics Slide


8
Kurtosis
 Kurtosis is a statistical number that
tells us if a distribution is taller or
shorter than a normal distribution.
 If a distribution is similar to the normal
distribution, the Kurtosis value is 0.
 If Kurtosis is greater than 0, then it has
a higher peak compared to the normal
distribution.
 If Kurtosis is less than 0, then it is
flatter than a normal distribution.
 There are three types of distributions:
 Leptokurtic: Sharply peaked with fat
tails, and less variable.
 Mesokurtic: Medium peaked
 Platykurtic: Flattest peak and highly
dispersed.
IS 310 – Business Statistics Slide
9
Measures of Shape: Skewness and Kurtosis
Key Differences Between Skewness and Kurtosis This is the
fundamental differences between skewness and kurtosis:
 The characteristic of a frequency distribution that ascertains its
symmetry about the mean is called skewness. On the other hand,
Kurtosis means the relative pointedness of the standard bell
curve, defined by the frequency distribution.
 Skewness is a measure of the degree of lopsidedness in the
frequency distribution. Conversely, kurtosis is a measure of
degree of tailedness in the frequency distribution.
 Skewness is an indicator of lack of symmetry, i.e. both left and
right sides of the curve are unequal, with respect to the central
point. As against this, kurtosis is a measure of data, that is either
peaked or flat, with respect to the probability distribution.
 Skewness shows how much and in which direction, the values
deviate from the mean? In contrast, kurtosis explain how tall and
sharp
IS 310the centralStatistics
– Business peak is Slide
10
Examples: Calculate Sample Skewness and Sample Kurtosis from the following
grouped data.

Class Mid f f⋅ x (x-ˉx) f⋅(x-ˉx)2 f⋅(x-ˉx)3 f⋅(x-ˉx)4


es value
(x) Mean=5.2
Classes Mid value (x) f f⋅x (x-ˉx) f⋅(x-ˉx)2
2-4 3 3 f⋅(x-ˉx)3
3×3= 9 3-5.2=-2.2 3×-2.2×-2.2=14.52 1
f⋅(x-ˉx)4 -31.944 70.27
2 - 4 3 3 3×3= 9 3-5.2=-2.2 3×-2.2×-
4-6 5 4 2.2=14.52
20 14.52×-2.2=
0.2 0.16 70.27
-31.944 -0.032 0.0064
4 - 6 5 4 4×5= 20 5-5.2=-0.2 4×-0.2×-
6-8 7 2 14 1.8 6.48 11.664 20.98
0.2=0.16 0.16×-0.2= -0.032 0.0064
8 - 10 9 1 6 - 8 79 2 2×7= 14 3.87-5.2=1.814.44 54.872 208.5
2×1.8×1.8=6.48 6.48×1.8=11.664 20.98
--- --- ---8 - 10---
9 1 1×9=--- 9 9-5.2=3.8--- ---
1×3.8×3.8=14.44 14.44×3.8= 54.872 208.5
- --- n=---
10--- ∑f
---⋅x=52 ------
--- --- --- 35.6 34.56 𝟐99.79
TOT -TOTAL- -- n=10 ∑f⋅x=52 `-- =35.6 =34.56
AL- =𝟐𝟐𝟐𝟐𝟐𝟐.𝟕𝟕�

IS 310 – Business Statistics Slide


11
Class Mid f f⋅ x (x-ˉx) f⋅(x-ˉx)2 f⋅(x-ˉx)3 f⋅(x-ˉx)4
es value
(x) Mean=5.2
2-4 3 3 3×3= 9 3-5.2=-2.2 3×-2.2×-2.2=14.52 1 -31.944 70.27

4-6 5 4 20 0.2 0.16 -0.032 0.0064

6-8 7 2 14 1.8 6.48 11.664 20.98

8 - 10 9 1 9 3.8 14.44 54.872 208.5

--- --- --- --- --- --- ---

- --- ∑f=10 ∑f⋅x=52 --- 35.6 34.56 𝟐99.79


TOTAL

IS 310 – Business Statistics Slide


12
Class Mid f f⋅ x (x-ˉx) f⋅(x-ˉx)2 f⋅(x-ˉx)3 f⋅(x-ˉx)4
es value
(x) Mean=5.2
2-4 3 3 3×3= 9 3-5.2=-2.2 3×-2.2×-2.2=14.52 1 -31.944 70.27

4-6 5 4 20 0.2 0.16 -0.032 0.0064

6-8 7 2 14 1.8 6.48 11.664 20.98

8 - 10 9 1 9 3.8 14.44 54.872 208.5

--- --- --- --- --- --- ---

- --- ∑f=10 ∑f⋅x=52 --- 35.6 34.56 𝟐99.79


TOTAL

IS 310 – Business Statistics Slide


13
Symetrical Distribution:

IS 310 – Business Statistics Slide


14
Negatively skewed distributions-(Left)

 Negatively skewed distributions, skewed to the left, occur


when most of the scores are toward the high end of the
distribution.
 In a negatively skewed distribution, the mode > median >
mean.

IS 310 – Business Statistics Slide


15
IS 310 – Business Statistics Slide
16
Positively skewed distributions (Rt)

 Positively skewed distributions occur when most of the scores


are toward the low end of the distribution.
 In a positively skewed distribution, mode< median< mean.

IS 310 – Business Statistics Slide


17
IS 310 – Business Statistics Slide
18
IS 310 – Business Statistics Slide
19
Graphical representation

Graphical representation refers to the use of charts and


graphs to visually display, analyze, clarify, and
interpret numerical data, functions, and other
qualitative structures.
Graphical representation is a form of visually displaying
data through various methods like graphs, diagrams, charts,
and plots. It helps in sorting, visualizing, and presenting
data in a clear manner through different types of graphs.
Statistics mainly use graphical representation to show data.

Larson & Farber, Elementary Statistics: Picturing the World, 3e 20


Graphical representation

© 2015 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
21
or duplicated, or posted to a publicly accessible website, in whole or in part.
Graphical representation
Graphical Summary: Histogram
 Example: Hudson Auto

© 2015 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
22
or duplicated, or posted to a publicly accessible website, in whole or in part.
SPSS

© 2015 Cengage Learning. All Rights Reserved. May not be scanned, copied
Slide
23
or duplicated, or posted to a publicly accessible website, in whole or in part.
Box plot
 A box and whisker plot—also called a box plot—displays the five-number summary
of a set of data. The five-number summary is the minimum, first quartile, median,
third quartile, and maximum.
 In a box plot, we draw a box from the first quartile to the third quartile. A vertical
line goes through the box at the median. The whiskers go from each quartile to the
minimum or maximum.
 What is a box plot? In descriptive statistics, a box plot or boxplot (also known as box
and whisker plot) is a type of chart often used in explanatory data analysis. Box
plots visually show the distribution of numerical data and skewness through
displaying the data quartiles (or percentiles) and averages.
 In descriptive statistics, a box plot or boxplot is a method for graphically
demonstrating the locality, spread and skewness groups of numerical data through
their quartiles.
 Two of the most commonly used variation of Box Plot are: variable-width Box Plots
and notched Box Plots.
 Box plots visually show the distribution of numerical data and skewness through
displaying the data quartiles (or percentiles) and averages.

IS 310 – Business Statistics Slide


24
IS 310 – Business Statistics Slide
25
Box plot
Example : The following data are the heights of 40 students in a
statistics class.
59; 60; 61; 62; 62; 63; 63; 64; 64; 64; 65; 65; 65; 65; 65; 65; 65; 65; 65; 
66; 66; 67; 67; 68; 68; 69; 70; 70; 70; 70; 70; 71; 71; 72; 72; 73; 74; 74; 
75; 77.
Construct a box plot with the following properties; the calculator
instructions for the minimum and maximum values as well as
the quartiles follow the example.
 Minimum value = 59
 Maximum value = 77
 Q1: First quartile = 64.5
 Q2: Second quartile or median= 66
 Q3: Third quartile = 70

IS 310 – Business Statistics Slide


26
Solution (Explanation):
 Each quarter has approximately 25% of the data.
 The spreads of the four quarters are 64.5–59=5.5 (first quarter), 
66,64.5=1.5 (second quarter), 70–66=4 (third quarter) & 77–70 =7  (fourth
quarter). So, the second quarter has the smallest spread and the fourth
quarter has the largest spread.
 Range = maximum value – the minimum value = 77 – 59 = 18
 Interquartile Range: IQR = Q3 – Q1 = 70–64.5=5.5.
 The interval 59–65 has more than 25% of the data so it has more data in it
than the interval 66 through 70 which has 25% of the data.
 The middle 50% (middle half) of the data has a range of 5.5 inches.

IS 310 – Business Statistics Slide


27
Box plot
Example : Finding the number summary of given Data
5,10,16,17, 18,20,32.
 Step 1: Order the data from smallest to largest.
 Step 2: Find the median. 17
 Step 3: Find the quartiles. Q1= 10 & Q3 = 20
 Step 4: Finding the min and the max data value. 5, 32
 Draw an arrow line

IS 310 – Business Statistics Slide


28
IS 310 – Business Statistics Slide
29
Graphs
 In math, a graph can be defined as a pictorial representation or a diagram
that represents data or values in an organized manner. The points on the
graph often represent the relationship between two or more things. Like,
The representation of the information through pictures is called pictograph
 Types of Graphs: Three commonly used types of graphs are bar graphs,
circle graphs, and line graphs.
 Bar Chart/Graph.
 Pie / circle graphs or Chart.
 Line Graph or Chart.
 Histogram Chart.
 Area Chart.
 Dot Graph or Plot.
 Scatter Plot.
 Flowchart. ...
 Pie chart. ...
 Gantt chart. ...
 Funnel chart.
IS 310 – Business Statistics Slide
30
Key features include: 
 Intercepts; intervals where the function is increasing,
decreasing, positive, or negative; relative maximums and
minimums; symmetries; end behavior; and periodicity.
Building Bar Graphs may requires:
 The Title. The title offers a short explanation of what is in your
graph. ...
 The Source. The source explains where you found the
information that is in your graph. ...
 X-Axis. Bar graphs have an x-axis and a y-axis. ...
 Y-Axis. ...
 The Data. ...
 The Legend.

IS 310 – Business Statistics Slide


31
Graphs
 Line graph. A line graph, also known as a line chart or a line
plot, is commonly drawn to show information that changes
over time. Line graphs illustrate how related data changes over
a specific period of time. A line graph uses dots connected by
lines to show the changes over a period of time.

 Histogram. A histogram is another type of bar graph that


illustrates the distribution of numeric data across categories.
 Bar Graph. A bar graph is the representation of numerical data
by rectangles (or bars) of equal width and varying height. The
gap between one bar and another is uniform throughout. Bar
graphs can be either horizontal or vertical. The height or length
of each bar relates directly to its value.

IS 310 – Business Statistics Slide


32
Chart is a graphical representation for data visualization, in
which "the data is represented by symbols, such as bars in a bar
chart, lines in a line chart, or slices in a pie chart". A chart can
represent tabular numeric data, functions or some kinds of
quality structure and provides different info. 
 Pie chart is also known as a circle graph. It shows how a whole
is divided into different parts. The pie chart shows the relative
size of each data set in proportion to the entire data set.
Percentages are used to show how much of the whole each
category occupies.

IS 310 – Business Statistics Slide


33
IS 310 – Business Statistics Slide
34
Solution

IS 310 – Business Statistics Slide


35
IS 310 – Business Statistics Slide
36
Problem-3

IS 310 – Business Statistics Slide


37
IS 310 – Business Statistics Slide
38
IS 310 – Business Statistics Slide
39
End

IS 310 – Business Statistics Slide


40

You might also like