Professional Documents
Culture Documents
Faculty Name: Dr. M. Massarrat Ali Khan Course Name: Introduction To Statistics Email: Mokhan@iba - Edu.pk
Faculty Name: Dr. M. Massarrat Ali Khan Course Name: Introduction To Statistics Email: Mokhan@iba - Edu.pk
Week 2
Frequency Distribution
What Does It Mean? The reason for grouping is to organize the data into a
sensible number of classes in order to make the data more accessible and
understandable.
With this method, each class consists of a range of values. The smallest
value that could go in a class is called the lower limit of the class, and the
largest value that could go in the class is called the upper limit of the
class.
A third way to group quantitative data is to use class cut point. As with limit grouping, each
class consists of a range of values. The smallest value that could go in a class is called the
lower cut point of the class, and the smallest value that could go in the next higher class is
called the upper cut point of the class.
Note that the lower cut point of a class is the same as its lower limit and that the upper
cut point of a class is the same as the lower limit of the next higher class.
This method is particularly useful when the data are continuous and are expressed with
decimals.
Example
Histogram
Requirements:
• Mid value
• Class Boundaries
• Less than cumulative frequency
• More than cumulative frequency
Example
Marks No. of students (f) Mid value (x) Class Boundaries
Histogram
35
30
25
20
15
10
0
49.5 59.5 69.5 79.5 89.5 99.5
Frequency Polygon
Frequency Polygon
35
30
25
20
15
10
0
44.5 54.5 64.5 74.5 84.5 94.5 104.5
Column2
Frequency Curve:
Frequency Curve
35
30
25
20
15
10
0
44.5 54.5 64.5 74.5 84.5 94.5 104.5
Column2
Dotplots:
• Dotplots
Another type of graphical display for quantitative data is the dotplot.
• DEFINITION Dotplot
A dotplot is a graph in which each observation is plotted as a dot at an
appropriate place above a horizontal axis. Observations having equal
values are stacked vertically.
Dotplots are particularly useful for showing the relative positions of the
data in a data set or for comparing two or more data sets.
Procedure 2.6
To Construct a Dotplot
Step 1 Draw a horizontal axis that displays the possible values of the
quantitative data.
Step 2 Record each observation by placing a dot over the appropriate
value on the horizontal axis.
Step 3 Label the horizontal axis with the name of the variable.
Step 2 Record each observation by placing a dot over the appropriate
value on the horizontal axis.
The first price is $210, which calls for a dot over the “210” on the horizontal
axis in Fig. 2.7. Continuing in this manner, we get all the dots shown in Fig.
2.7.
Step 3 Label the horizontal axis with the name of the variable.
The variable here is “Price,” with which we label the horizontal axis in Fig.
2.7.
Dotplots are similar to histograms. In fact, when data are grouped using
singlevalue grouping, a dotplot and a frequency histogram are essentially
identical. However, for single-value grouped data that involve decimals,
dotplots are generally preferableto histograms because they are easier to
construct and use.
Stem-and-Leaf Diagram
• Symmetry
Skewness:
• Skewness
• The frequency of an observation tells you the number of times the
observation occurs in the data. For example, in the following list of
numbers, the frequency of the number 9 is 5 (because it occurs 5
times):
• 1, 2, 3, 4, 6, 9, 9, 8, 5, 1, 1, 9, 9, 0, 6, 9.
• Tables can show either categorical variables (sometimes called
qualitative variables) or quantitative variables (sometimes called
numeric variables). You can think of categorical variables as categories
(like eye color or brand of dog food) and quantitative variables as
numbers.
Measure of Central Tendency
• Mean
• Median
• Mode
Measure of Central Tendency
•
Arithmetic Mean:
• Population A.M is denoted by µ
µ= (ungrouped data)
µ= ∑ fx/ ∑ f (grouped data)
• The Median Another frequently used measure of center is the median. Essentially,
the median of a data set is the number that divides the bottom 50% of the data
from the top 50%. A more precise definition of the median follows.
• 300 300 300 300 300 400 400 450 940 1050
What is Median?
• The number of observations is 10, so
(n + 1)/2 = (10 + 1)/2 = 5.5.
Consequently, the median is halfway between the fifth and sixth
observations (shown in boldface) in the ordered list, which is 350
Median for Grouped Data
35 – 45 2 18
Mode for Group Data
• Mode
example
C.B f C.F
•Mode=
*h
5 - 15 5 5
15 - 25 8 13 Mode = 15 * 10
25-35 3 16
=15+3.75
= 18.75
35 – 45 2 18