Professional Documents
Culture Documents
2.3.2 Blob Diagrams and Box-And-Whisker Plots: Fig. 2.3 Blob Diagram For Valve Liner Data
2.3.2 Blob Diagrams and Box-And-Whisker Plots: Fig. 2.3 Blob Diagram For Valve Liner Data
These indications are better shown by a frequency curve (Fig. 2.2(b)) than a
polygon (Fig. 2.2(a)) which merely joins the tops of the histogram columns (and
in fact does not even properly portray the sample, as the areas are incorrectly
distributed over the classes).
•• • ••
• •••• • •• ••
• • • • •
• •• • ••
• •••••••••••••••••••••••• •
1 1 1 r r 1 1 1 1 1 1 1 1 1 r t 1 r 1 1 1 1 1 1 r 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 1 r r • 1 1 1 1 1 r 1
236 237 238 239 240 241 242
• • ••• •• •
•
• • •• •• •• •• •• •• •• •• • • • A
• • •• •• •• •• •• ••• •• • •• • •• • B
• • •• •
Fig. 2.4 Back-to-hack blob diagram .
Graphical presentation 15
We here assume that the reader is familiar with the median as a measure of
location. When all individual values are available, it is defined as the middle
value, or the average of the middle two, when the values are arranged in order.
We also introduce the first and third quartiles, the values which separate the
lower and upper 25% of the data from the central 50%. Note that the median
is also the second quartile.
To locate the median and quartiles in the ordered data, for example in a stem-
and-leaf table or a blob diagram, use the following expressions. The rationale
will be explained in connection with probability plotting in Chapter 13.
Q 1 =X;, where i = i-(n + 1);
M =X;, i=!(n + 1);
Q3 =X;, i = }(n + 1).
Thus, for the 52 items in the data set forming the main example in this chapter,
For Q1, i = i(52 + 1) = 13.25; so Q1 lies between the thirteenth and fourteenth
values from the lower end. These are in fact 238.9 and 239.0, so we may take
Q1 as 238.9 approximately (second decimal place accuracy is not justified).
ForM, i =!(52+ 1) = 26.5; with x 26 and x 27 both 239.5, the median is also
239.5.
x,
ei------------------~ 1--------------·x52
•,____________________________--4~----------------~~~·
(a)
·-·-----·--------·-------------,···-·-···-·
(b)
•
Fig. 2.6 Box-and-whisker plot expanded to blob diagram. A box-and-whisker plot (a)
would suggest some peculiarity in the original data. A follow-up diagram (b) reveals
values clustered at the upper end with wide gaps between values below the median.
16 Data collection and graphical summaries
244
Digidot
237 ~ v
1 236
236
Fig. 2.7 Run chart and d igidot for va lve li ner da ta.
18 Data collection and graphical summaries
@ = 2 result
2.5 2.5
• •
2.0
·--·• • •
•
••
••
2.0
~ • • ••
~ ·~· @ .
• • • ·-=--+· •••••
c 1.5
••••••• 1.5
e. • • •••••
0
• e@ • • • •••••••
0
• • • @ • • @• • .
@
• •••••••••
·-·
••••••
(J)
.0
•••••••••
(G
0
40 45 50 55 60
i
65 70
•
0
the cause variable is assigned to x and the effect variable to y. In fact the run
chart uses this convention in that time is often regarded as a cause of changes
in a system.
Each pair of x, y values is plotted at the appropriate coordinates, and any
relationship will appear as a pattern. If the data arise from different sources
(e.g. machines, operators, days, material batches) symbols or colour coding for
these subgroupings may reveal further relationships. It may also be useful to
compile blob diagrams for x and y on their respective axes, as shown in Fig. 2.8.