Professional Documents
Culture Documents
Applied Statistics in Business & Economics,: David P. Doane and Lori E. Seward
Applied Statistics in Business & Economics,: David P. Doane and Lori E. Seward
McGraw-Hill/Irwin Copyright © 2013 by The McGraw-Hill Companies, Inc. All rights reserved.
Chapter 3
Describing Data Visually
Chapter Contents
3.1 Stem-and-Leaf Displays and Dot Plots
3.2 Frequency Distributions and Histograms
3.3 Excel Charts
3.4 Line Charts
3.5 Bar Charts
3.6 Pie Charts
3.7 Scatter Plots
3.8 Tables
3.9 Deceptive Graphs
3-2
Chapter 3
Describing Data Visually
3-3
Chapter 3
Describing Data Visually
LO3-6: Know the rules for effective bar charts and pie charts.
LO3-7: Make and interpret a scatter plot using Excel.
LO3-8: Make simple tables and pivot tables.
LO3-9: Recognize deceptive graphing techniques.
3-4
Chapter 3
3.1 Stem-and-Leaf Displays and
Dot Plots
• Methods of organizing, exploring and summarizing data include:
3-5
Chapter 3
3.1 Stem-and-Leaf Displays and
Dot Plots
• Begin with univariate data (a set of n observations on one variable)
and consider the following:
3-6
Chapter 3
3.1 Stem-and-Leaf Displays and
Dot Plots
• Measurement
• Look at the data and visualize how they were collected and
measured.
3-7
Chapter 3
LO3-1 3.1 Stem-and-leaf Displays and
Dot Plots
The type of graph you use to display your data is dependent on the
type of data you have. Some charts are better suited for quantitative
data, while others are better for displaying categorical data.
3-8
Chapter 3
LO3-1 3.1 Stem-and-Leaf Displays
and Dot Plots
For the 44 P/E ratios, the stem-and-leaf plot is given below.
For example, the data values in the fourth stem are 31, 37, 37, 38. We always use equally
spaced stems (even if some stems are empty). The stem-and-leaf can reveal central tendency
(24 of the 44 P/E ratios were in the 10–19 stem) as well as dispersion (the range is from 7 to 59).
In this illustration, the leaf digits have been sorted, although this is not necessary. The stem-and-
leaf has the advantage that we can retrieve the raw data by concatenating a stem digit with each
of its leaf digits. For example, the last stem has data values 50 and 59.
3-9
Chapter 3
LO3-1 3.1 Stem-and-Leaf Displays and
Dot Plots
Dot Plots
• A dot plot is the simplest graphical display of n individual values of numerical
data.
- Easy to understand.
- It reveals dispersion, central tendency, and the shape of the distribution.
• Steps in Making a Dot Plot
1. Make a scale that covers the data range.
3. Plot each data value as a dot above the scale at its approximate location.
Note: If more than one data value lies at about the same axis
location, the dots are stacked vertically.
3-10
Chapter 3
LO3-1 3.1 Stem-and-Leaf Displays and
Dot Plots
3-11
Chapter 3
LO3-1 3.1 Stem-and-Leaf Displays and
Dot Plots
Comparing Groups
• A stacked dot plot compares two or more groups using a common
X-axis scale.
3-12
Chapter 3
3.2 Frequency Distributions and
LO3-2 Histograms
LO3-2: Create a frequency distribution for a data set
3-13
Chapter 3
3.2 Frequency Distributions and
LO3-2 Histograms
Constructing a Frequency Distribution
- Herbert Sturges proposed the following rule:
3-14
Chapter 3
3.2 Frequency Distributions and
LO3-2 Histograms
3-15
Chapter 3
3.2 Frequency Distributions and
LO3-2 Histograms
Histograms
3-16
Chapter 3
3.2 Frequency Distributions and
LO3-3 Histograms
LO3-3: Make a histogram with appropriate bins.
• Consider 3 histograms for the P/E ratio data with different bin
widths. What do they tell you?
3-17
Chapter 3
3.2 Frequency Distributions and
LO3-3 Histograms
3-18
Chapter 3
3.2 Frequency Distributions and
LO3-3 Histograms
Modal Class
• A histogram bar that is higher than those on either side.
• Modal classes may be artifacts of the way bin limits are chosen.
3-19
Chapter 3
3.2 Frequency Distributions and
LO3-4 Histograms
LO3-4: Identify skewness, modes, and outliers in a histogram.
Shape
• A histogram may suggest the shape of the population.
• It is influenced by the number of bins and bin limits.
• Skewness – indicated by the direction of the longer tail of the
histogram.
Left-skewed – (negatively skewed) a longer left tail.
3-21
Chapter 3
3.2 Frequency Distributions and
Histograms
Frequency Polygons and Ogives
• A frequency polygon is a line graph that connects the midpoints of
the histogram intervals, plus extra intervals at the beginning and
end so that the line will touch the X-axis.
• It serves the same purpose as a histogram, but is attractive when
you need to compare two data sets (since more than one
frequency polygon can be plotted on the same scale).
• An ogive (pronounced “oh-jive”) is a line graph of the cumulative
frequencies.
• It is useful for finding percentiles or in comparing the shape of the
sample with a known benchmark such as the normal distribution
(that you will be seeing in the next chapter).
3-22
Chapter 3
3.2 Frequency Distributions and
Histograms
3-23
Chapter 3
3.3 Excel Charts
3-24
Chapter 3
LO3-5 3.4 Line Charts
LO3-5: Make an effective line chart using Excel.
3-25
Chapter 3
LO3-5 3.4 Line Charts
Simple Line Charts
• Two-scale line chart – used to compare variables that differ in
magnitude or are measured in different units.
3-26
Chapter 3
LO3-5 3.4 Line Charts
Log Scales
• Arithmetic scale – distances on the Y-axis are proportional to the
magnitude of the variable being displayed.
• Use a log scale for the vertical axis when data vary over a wide
range, say, by more than an order of magnitude.
3-27
Chapter 3
LO3-5 3.4 Line Charts
Log Scales
A log scale is useful for time series data that might be expected to grow at a
compound annual percentage rate (e.g., GDP, the national debt, or your
future income). It reveals whether the quantity is growing at an
increasing percent (concave upward),
constant percent (straight line), or
declining percent (concave downward)
3-28
Chapter 3
LO3-6 3.5 Bar Charts
LO3-6: Know the rules for effective bar charts and pie charts.
3-29
Chapter 3
LO3-6 3.5 Bar Charts
Pareto Charts
• Special type of bar chart used in quality management to display the
frequency of defects or errors of different types.
• Categories are
displayed in
descending order
of frequency.
• Focus on
significant few
(i.e., few
categories that
account for most defects or errors).
3-30
Chapter 3
LO3-6 3.5 Bar Charts
Stacked Bar Chart
3-31
Chapter 3
LO3-6 3.6 Pie Charts
LO3-6: Know the rules for effective bar charts and pie charts.
An Oft-Abused Chart
• A pie chart can only convey a general idea of the data.
• Pie charts should be used to portray data which sum to a total
(e.g., percent market shares).
• A pie chart should only have a few (i.e., 2 or 3) slices.
• Each slice can be labeled with data values or percents.
3-32
Chapter 3
LO3-6 3.6 Pie Charts
An Oft-Abused Chart
• Consider the following charts used to illustrate an article from the Wall Street
Journal. Which type appears to be better?
3-33
Chapter 3
LO3-6 3.6 Pie Charts
Pie Chart Options
• Exploded and 3-D pie charts add strong visual impact.
3-34
Chapter 3
LO3-7 3.7 Scatter Plots
LO3-7: Make and interpret a scatter plot using Excel.
• Scatter plots can convey patterns in data pairs that would not be
apparent from a table.
Refer to
the text for
EXCEL
outputs.
3-35
Chapter 3
3.8 Tables
• Tables are the simplest form of data display.
• A compound table is a table that contains time series data down the
columns and variables across the rows.
• The data can be viewed by focusing on the time pattern (down the
columns) or by comparing the variables (across the rows).
3-36
Chapter 3
3.8 Tables
Example: School Expenditures
Deceptive Correct
3-39
Chapter 3
LO3-9 3.9 Deceptive Graphs
Error 2: Elastic Graph Proportions
3-40
Chapter 3
LO3-9 3.9 Deceptive Graphs
Error 4: 3-D and Novelty Graphs
3-41
Chapter 3
LO3-9 3.9 Deceptive Graphs
Error 5: 3-D and Rotated Graphs
3-42
Chapter 3
LO3-9 3.9 Deceptive Graphs
Error 8: Complex Graphs
3-43
Chapter 3
LO3-9 3.9 Deceptive Graphs
Error 11: Area Trick
3-44
Chapter 3
LO3-9 3.9 Deceptive Graphs
3-45