Professional Documents
Culture Documents
L02 Quantiles PDF
L02 Quantiles PDF
T R(1 U ) T (X) = Y .
21
(R1) R is nondecreasing.
..
.....
.....
..........................................
.....
.
.
.
.
....
......
F (u) F (u+)
(1)
(2)
22
(R1) R is nondecreasing.
(R2) U Uniform(0, 1) = R(U ) X.
? F (u) R(u) for all u (0, 1): Indeed, let u (0, 1) be given.
By
(by (R2))
P[U u]
=u
(by (R1))
(since U is uniform).
? R(u) F (u+) for all u (0, 1): Indeed, let u (0, 1) be given.
By Exercise 1.8, F (u+) R(u) u F ((R(u))). Now use
Example 1. Suppose X takes the values 0, 1/2, and 1 with probabilities 1/3 each. We have seen that F has the following graph:
1
F (u)
0,
r1 ,
R(u) = 1/2,
r ,
2
1,
if
if
if
if
(R1) R is nondecreasing.
1
2
0
0
1
3
23
2
3
1 u
19:31 10/01/2000
Case B: X Uniform on 1, 2, 3
...................
......
........
.....
......
.....
.....
.....
.....
.
.
.
.
.....
...
.
.
.....
.
...
.....
.
.
.
..
.....
.
.
.
.
.....
...
.
.
.
.....
.
...
......
.
.
.
.
..
.......
.
.
.
.
.
.
........
....
.
.
.
.
.
.............
.
.
.
..
..............
......................
Area
u
1/3
1/3
1/3
Area
1u
(3)
(4)
(5)
(6)
1 u P [ X x] = 1 P [ X < x ] = 1 F (x)
u F (x) F (u+) x;
(7)
and
here (6) used the switching formula (1.5) and (7) used the alternate
switching formula given in Exercise 1.8. Consequently,
Q is a quantile function for X
F (u) Q(u) F (u+) for all u (0, 1)
Q is a representing function for X,
the last step following from Theorem 2.
Theorems 1 and 3 imply
Theorem 4. Suppose Q is a quantile function for X and Y = T (X).
, if 6= 0
R (u) :=
(8)
u
log
,
if = 0
1u
for 0 < u < 1. Note that
d
R (u) = u1 + (1 u)1 > 0
du
= lim
d
d
u log u (1 u) log(1 u)
= lim
0
1
u
= log
= R0 (u).
1u
In statistics, R0 is known as logit transformation.
27
3 . The Tukey family contains (or almost contains) some common distributions.
For = 1, we have
u (1 u)
R1 (u) =
= 2u 1
1
for each u. Thus X1 = R1 (U ) Uniform(1, 1).
1
R1 (u)
0
1
..
...
...
..
.
..
..
...
...
..
.
...
...
...
...
.
.
u1
u (1 u)
0
d (u (1 u) )
R (u) = u (1 u) / for 6= 0.
ex
d
F0 (x) =
= F0 (x) 1 F0 (x) ,
(10)
f0 (x) :=
x
2
dx
(1 + e )
again for < x < . This distribution is called the (standard)
logistic distribution.
T(1) is very close to the distribution of T1 , where T1 has a
t-distribution with 1 degree of freedom; see Figure 5 below.
4 . As goes from 1 to 1 (and the distribution goes (modulo
scaling) from uniform, to almost normal, to logistic, to almost t1 ), the
tails of the T() distribution go from very light to very heavy. This
is another reason why the family is good for robustness studies.
28
19:31 10/01/2000
Interpreting Q/Q-plots. Suppose X1 and X2 are two random variables such that
X2 a + bX1
(11
for some constants a and b with b > 0; this says that the distribution
of X2 is obtained from that of X1 by a change of scale (multiplication
by b) and a change in location (addition of a). Let Q1 be the quantile
function for X1 . Since the transformation x a + bx is increasing,
Theorem 4 implies that the quantile function for X2 is given by
Q2 (u) = a + bQ1 (u).
(12
Probabilities
Quantiles of X2
8
6
4
2
0
0.25
0.5
0.75
..
......
.....
.....
....
.
.
.
.
.
.....
.....
.....
......
.
.
.
.
.
..
......
......
......
......
.
.
.
.
.
....
......
....... ......
......
.
.
.
.
1
2
.
.
.......
........
........
..
.
.
.
.
.
.
.
.........
..........
...........
1
2
............
......
.
.
.
.
.
.
.
.
.
.
.
.
.
.........
..................
........................
Figure 1
Q/Q-plot of X2 23
versus X1 N (0, 1)
(Q (0.75), Q (0.75))
(Q (0.5), Q (0.5))
1.5
1.0
0.5
0.0
0.5
0.95
1.0
1.5
Quantiles of X2
0.05
.......
..........
..........
.........
.
.
.
.
.
.
.
.
.
........
.........
..........
.........
..........
.
.
.
.
.
.
.
.
......
.........
2
..........
..........
.........
.
.
.
.
.
.
.
.
.
......
.
.
.
.
.
.
.
.
..
..........
.........
..........
..........
Q -intercept a
slope b
1
1
Quantiles of X1
then the plot displays Q1 (u), u for u (0, 1); this is essentially just
a plot ofthe df of X1 . (B) if X1 is standard uniform, then the plot
displays u, Q2 (u) for u (0, 1); this is a graph of the inverse df F2 .
29
2 10
Quantiles of X1
Figure 4
We have seen that linearity in a Q/Q-plot means that the distributions of the two variables involved are related by a change in
location and scale, in the sense that (12) and, equivalently, (11) hold.
What does curvature mean? Consider the following three Q/Q-plots;
in each plot the quantiles of some random variable X2 are plotted
against the quantiles of some X1 .
0.01
0.1
0.25
Probabilities
0.5
0.75
0.9
0.99
0.999 0.9999
The Tukey family, revisited. The next three pages exhibit Q/Qplots for various members of the Tukey family. In each case you should
think about what the plots say about the Tukey distributions. In Figure 4 the solid, nearly-diagonal line is the Q/Q-plot of T(0.14) versus
standard normal. The dotted line is the least squares linear
fit to the
solid line; the slope of the least squares line is almost 2. In Figure 5 the solid line is the Q/Q-plot of T(1) versus the t distribution
with 1 degree of freedom. The dotted line is the straight line with
Q2 -intercept 0 and slope . Figure 6 is a simultaneous Q/Q-plot of
the Tukey distributions for = 1 to 1 by 0.2, each against standard
normal.
2 11
In case (A), the quantiles of X2 grow ever more rapidly than do those
of X1 as u tends to 0 and to 1; this means that X2 has heavier tails
than does X1 . In case (B), X2 has lighter tails than X1 . In case
(C), the right tail of X2 is heavier than that of X1 , but the left tail is
lighter. Looking back at Figure 1, you can see that the right tail of the
23 is somewhat heavier than normal, and the left tail is somewhat
lighter.
(C)
-2
(B)
..
...
..
..
.
.
...
...
...
..
.
...
..
...
....
.
.
.
.
...
....
.....
.....
.....
.
.
.
.
.
........
..........
.......................
-4
(A)
...
.........
.......
.....
.....
.
.
.
....
....
...
...
..
.
.
...
...
..
..
.
.
.
...
....
....
....
.
.
.
.
.
......
.......
...........
-4
-2
0
Normal quantiles
2 12
19:31 10/01/2000
Figure 5
Figure 6
0.9975
0.005 0.01
0.05
0.1
0.25
Probabilities
0.5
0.75
0.9
0.95
0.990.995
1000
0.001
Probabilities
0.0050.01 0.5 0.990.995
Tukey quantiles
0
la= 0.2
la= 0.4
la= 0.6
la= 0.8
la= 1
-1000
-4
-500
-2
Tukey quantiles
0
500
-1
-0.8
-0.6
-0.4
-0.2
0
0.2
0.4
0.6
0.8
1
-300
-200
-100
0
t quantiles
2 13
100
200
300
-2
-1
0
Normal quantiles
2 14
(13)
Exercise 6. The text pointed out several reasons why the Tukey
family is good for robustness studies. Give at least one reason why it
is not good.
Exercise 7. Show that T(1) and T(2) are the same distribution, up
to a change in scale. Is there a generalization of this fact?
Exercise 8. Below Figure 1, the text states that a plot of (Q1 (u), u)
for u (0, 1) is essentially just a plot of the df of X1 . The term
essentially just was used because the two plots can in fact be slightly
different. Explain how.
Exercise 9. Figure 3 shows the Q/Q-plots for three random variables, X1 , X2 , and X3 , each plotted against Z N (0, 1). The
plots are drawn to different scales. In scrambled order, the distributions of the Xi s are: (i) standard uniform, (ii) standard exponential, and (iii) t with 1 degree of freedom. Match the plots with the
distributions.
Exercise 3. Show that (3) holds if and only if both (4) and (5) hold.
Exercise 10. Let X1 be a standard exponential random variable
(see Exercise 4) and let X2 be a random variable distributed like X.
(i) Draw a Q/Q-plot of X2 against X1 , by plotting Q2 (u) against
Q1 (u). (ii) Now plot Q2 (1 u) versus Q1 (u). What is the general
lesson to be learned from (i) and (ii)?
2 16