
CSCI 319/739 Bayesian statistics

Homework 2
Solution
Saad Mneimneh
Visiting Professor
Hunter College of CUNY

Problem 0: Readings
Read pages 12 to 22 and pages 34 to 42 in the Bayesian statistics book. Also
read the notes posted on the course website for lectures 4, 5 and 6.
Problem 1: Polya's urn
Polya's urn represents, to some extent, a generalization of a binomial random
variable. Consider the following scheme: an urn contains b black and r red
balls. The ball drawn is always replaced, and, in addition, c balls of the color
drawn are added to the urn.
Note that if c = 0, the drawings are equivalent to independent Bernoulli trials
with $p = \frac{b}{b+r}$. However, with $c \neq 0$, the Bernoulli trials are dependent, each
with a parameter that depends on the sequence of previous drawings.
For instance, if the first ball is black, the (conditional) probability of a black
ball at the second drawing is $(b+c)/(b+c+r)$. The probability of the sequence
black, black is, therefore,

$$\frac{b}{b+r} \cdot \frac{b+c}{b+c+r}$$
Let $B_i$ be the event that a black ball is drawn at the ith drawing. Define $R_i$
similarly. For instance, $B_1 R_2 B_3$ is the event that a black ball is drawn first,
then a red one, then a black one.
(a) Show that the probability of a sequence of Bs and Rs remains the same
if we permute the Bs and the Rs in any way. Derive an expression for the
probability of k black balls in $n \ge k$ drawings.
Solution: The probability that the nth ball is black, conditioned on k − 1 black
balls in the first n − 1 rounds (k ≤ n), is

$$\frac{b+(k-1)c}{b+r+(n-1)c}$$

Therefore, if the ith B appears in the jth position in a sequence, it contributes
a probability:

$$\frac{b+(i-1)c}{b+r+(j-1)c}$$

The same is true for red balls, with b and r switched. Therefore, a sequence with
k black balls will contribute the terms $b, b+c, \ldots, b+(k-1)c$ and $r, r+c, \ldots, r+(n-k-1)c$
in the numerator, and the terms $b+r, b+r+c, \ldots, b+r+(n-1)c$
in the denominator.
This makes the probability of k black balls in n rounds equal to the following
expression (regardless of the order in which they were drawn):

$$\binom{n}{k} \frac{\prod_{i=1}^{k}[b+(i-1)c] \; \prod_{i=1}^{n-k}[r+(i-1)c]}{\prod_{i=1}^{n}[b+r+(i-1)c]}$$
In light of what we have seen lately, this can be rewritten as:

$$\binom{n}{k} \frac{\prod_{i=1}^{k}\left[\frac{b}{c}+i-1\right] \; \prod_{i=1}^{n-k}\left[\frac{r}{c}+i-1\right]}{\prod_{i=1}^{n}\left[\frac{b+r}{c}+i-1\right]} = \binom{n}{k} \frac{\Gamma(\frac{b}{c}+k)\,\Gamma(\frac{r}{c}+n-k)}{\Gamma(\frac{b}{c})\,\Gamma(\frac{r}{c})} \cdot \frac{\Gamma(\frac{b+r}{c})}{\Gamma(\frac{b+r}{c}+n)}$$

Therefore, the important parameters are b/c and r/c.
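As a quick numerical check (not part of the original solution), the following Python sketch computes the probability of a specific draw sequence directly and compares it with the closed-form expression above; the function names and the small example values are illustrative.

```python
from math import comb

def seq_prob(seq, b, r, c):
    """Probability of a specific sequence of draws ('B'/'R') from the urn."""
    prob, nb, nr = 1.0, b, r
    for s in seq:
        prob *= (nb if s == 'B' else nr) / (nb + nr)
        if s == 'B':
            nb += c
        else:
            nr += c
    return prob

def k_blacks_prob(k, n, b, r, c):
    """Closed form above: probability of k black balls in n drawings."""
    num = den = 1.0
    for i in range(k):
        num *= b + i * c
    for i in range(n - k):
        num *= r + i * c
    for i in range(n):
        den *= b + r + i * c
    return comb(n, k) * num / den

# All orderings of 2 Bs and 1 R have the same probability (exchangeability)...
print(seq_prob('BBR', 2, 3, 1), seq_prob('BRB', 2, 3, 1), seq_prob('RBB', 2, 3, 1))
# ...and the closed form equals that common value times the number of orderings.
print(k_blacks_prob(2, 3, 2, 3, 1))
```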


(b) Using part (a), show that the probability of drawing a black ball is always
b/(b + r).
Solution: Consider the event of drawing a black ball in round i. This corresponds to all sequences where B appears in the ith position. From part (a), we
can switch Bi with the first letter of the sequence without altering the probability. It follows that the probability of having a B in the ith position is equal to
the probability of having a B in the first position, which in turn corresponds to
the event of drawing a black ball in round 1, an event of probability b/(b + r).
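This fact is easy to check by simulation. Below is a minimal Python sketch (illustrative, not from the original solution) that estimates $P(B_i)$ for several rounds i; each estimate should be close to b/(b+r) regardless of i.

```python
import random

def polya_draws(n, b, r, c, rng):
    """Simulate n draws from Polya's urn; True means a black ball was drawn."""
    out = []
    for _ in range(n):
        black = rng.random() < b / (b + r)
        out.append(black)
        if black:
            b += c
        else:
            r += c
    return out

rng = random.Random(0)
trials = [polya_draws(5, 2, 3, 1, rng) for _ in range(100_000)]
for i in range(5):
    est = sum(t[i] for t in trials) / len(trials)
    print(f"P(B_{i+1}) ~ {est:.3f}")   # each estimate close to b/(b+r) = 0.4
```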
(c) Using part (b) and Bayes' rule, show an interesting symmetry of Polya's urn:

$$P(B_i|B_j) = P(B_j|B_i)$$
$$P(R_i|R_j) = P(R_j|R_i)$$
$$P(B_i|R_j) = P(B_j|R_i)$$
$$P(R_i|B_j) = P(R_j|B_i)$$
Solution:

$$P(B_i|B_j) = \frac{P(B_j|B_i)P(B_i)}{P(B_j)} \qquad P(R_i|R_j) = \frac{P(R_j|R_i)P(R_i)}{P(R_j)}$$

Since $P(B_i) = P(B_j) = b/(b+r)$ and similarly $P(R_i) = P(R_j) = r/(b+r)$, the
first two equalities follow immediately.
$$P(B_i|R_j) = \frac{P(B_i, R_j)}{P(R_j)} = \frac{P(B_j, R_i)}{P(R_i)} = P(B_j|R_i)$$

The first equality follows from the definition of conditioning, the second equality
follows from the fact that we can permute the sequence (part (a)) and that
$P(R_i) = P(R_j)$ (part (b)), and the last equality follows again from the definition of
conditioning.
The last case is symmetric. The proof of the first two identities is actually identical to
this one, since $P(X|Y)P(Y)$ in Bayes' rule is nothing but $P(X,Y)$.
Problem 2: Splitting a Poisson
Consider a Poisson random variable Z with

$$P(Z=k) = \frac{\lambda^k e^{-\lambda}}{k!}$$

Consider also the following two random variables X and Y = Z − X:

$$P(X=k|Z=n) = \binom{n}{k} p^k (1-p)^{n-k}$$

$$P(Y=k|Z=n) = \binom{n}{k} (1-p)^k p^{n-k}$$

(a) Show that X and Y are both Poisson random variables with parameters $\lambda p$
and $\lambda(1-p)$ respectively.
Solution: We need to find P(X = k).

$$P(X=k) = \sum_{n=0}^{\infty} P(X=k|Z=n)P(Z=n)$$

Since P(X = k|Z = n) = 0 for n < k, the above sum can start at n = k.

$$P(X=k) = \sum_{n=k}^{\infty} \binom{n}{k} p^k (1-p)^{n-k} \frac{\lambda^n e^{-\lambda}}{n!}$$

$$= \sum_{n=k}^{\infty} \frac{n!}{k!(n-k)!} p^k (1-p)^{n-k} \frac{\lambda^n e^{-\lambda}}{n!}$$

$$= \sum_{n=k}^{\infty} \frac{p^k (1-p)^{n-k}}{k!(n-k)!} \lambda^n e^{-\lambda}$$

$$= \sum_{n=k}^{\infty} \frac{(\lambda p)^k e^{-\lambda p}}{k!} \cdot \frac{[\lambda(1-p)]^{n-k} e^{-\lambda(1-p)}}{(n-k)!}$$

$$= \frac{(\lambda p)^k e^{-\lambda p}}{k!} \sum_{n=k}^{\infty} \frac{[\lambda(1-p)]^{n-k} e^{-\lambda(1-p)}}{(n-k)!}$$

$$= \frac{(\lambda p)^k e^{-\lambda p}}{k!} \left[\frac{[\lambda(1-p)]^0 e^{-\lambda(1-p)}}{0!} + \frac{[\lambda(1-p)]^1 e^{-\lambda(1-p)}}{1!} + \ldots\right]$$

The bracketed sum represents the sum of all Poisson probabilities with parameter λ(1 − p).
Therefore, the sum should evaluate to 1. We end up with:

$$P(X=k) = \frac{(\lambda p)^k e^{-\lambda p}}{k!}$$

The same can be done for $P(Y = k)$.
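For intuition, this splitting property is easy to check by simulation. The sketch below (assuming numpy; not part of the original solution) draws Z ~ Poisson(λ), thins it binomially, and compares the empirical moments of X and Y with λp and λ(1 − p).

```python
import numpy as np

rng = np.random.default_rng(0)
lam, p, N = 5.0, 0.3, 200_000

z = rng.poisson(lam, size=N)   # Z ~ Poisson(lambda)
x = rng.binomial(z, p)         # X | Z = n ~ Binomial(n, p)
y = z - x                      # Y = Z - X

# A Poisson's mean equals its variance; both should be lam*p and lam*(1-p).
print(x.mean(), x.var())       # both ~1.5 = lam * p
print(y.mean(), y.var())       # both ~3.5 = lam * (1 - p)
```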


(b) Show that X and Y are independent (note: they are not if conditioned on
Z). Hint: compute $P(X = k_1, Y = k_2 \mid Z = k_1 + k_2)$; by Bayes' rule, what must
this be equal to?

Solution:

$$P(X=k_1, Y=k_2|Z=k_1+k_2) = P(X=k_1, Z-X=k_2|Z=k_1+k_2)$$

$$= P(X=k_1|Z=k_1+k_2) = \binom{k_1+k_2}{k_1} p^{k_1} (1-p)^{k_2}$$

Now by Bayes' rule:

$$P(X=k_1, Y=k_2|Z=k_1+k_2) = \frac{P(Z=k_1+k_2|X=k_1, Y=k_2)\,P(X=k_1, Y=k_2)}{P(Z=k_1+k_2)} = \frac{1 \cdot P(X=k_1, Y=k_2)}{P(Z=k_1+k_2)}$$

Therefore,

$$P(X=k_1, Y=k_2) = \binom{k_1+k_2}{k_1} p^{k_1} (1-p)^{k_2} \frac{\lambda^{k_1+k_2} e^{-\lambda}}{(k_1+k_2)!} = \frac{(\lambda p)^{k_1} e^{-\lambda p}}{k_1!} \cdot \frac{[\lambda(1-p)]^{k_2} e^{-\lambda(1-p)}}{k_2!} = P(X=k_1)P(Y=k_2)$$
This shows that X and Y are independent.
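A simulation can make the independence plausible as well. The following sketch (numpy assumed, illustrative values) compares empirical joint probabilities $P(X=k_1, Y=k_2)$ with the product of the empirical marginals on a few cells.

```python
import numpy as np

rng = np.random.default_rng(1)
lam, p, N = 4.0, 0.5, 500_000

z = rng.poisson(lam, size=N)
x = rng.binomial(z, p)   # X | Z = n ~ Binomial(n, p)
y = z - x

for k1, k2 in [(0, 0), (1, 2), (2, 1)]:
    joint = np.mean((x == k1) & (y == k2))          # empirical joint
    prod = np.mean(x == k1) * np.mean(y == k2)      # product of marginals
    print(k1, k2, round(joint, 4), round(prod, 4))  # joint ~ product
```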


Problem 3: Merging two Poissons
Consider two independent Poisson random variables X and Y with parameters
λ and μ respectively.
(a) Show that Z = X + Y is Poisson with parameter λ + μ. Hint: you might
find the following binomial theorem useful:

$$(a+b)^n = \sum_{k=0}^{n} \binom{n}{k} a^k b^{n-k}$$

Solution: $P(Z=n) = P(X=0, Y=n) + P(X=1, Y=n-1) + \ldots + P(X=n, Y=0)$,
and since X and Y are independent:

$$P(Z=n) = \sum_{k=0}^{n} P(X=k)P(Y=n-k)$$

Replacing P(X = k) and P(Y = n − k) with their corresponding expressions,
we get:

$$P(Z=n) = \sum_{k=0}^{n} \frac{\lambda^k e^{-\lambda}}{k!} \cdot \frac{\mu^{n-k} e^{-\mu}}{(n-k)!}$$

$$= \frac{1}{n!} e^{-(\lambda+\mu)} \sum_{k=0}^{n} \frac{n!}{k!(n-k)!} \lambda^k \mu^{n-k} = \frac{1}{n!} e^{-(\lambda+\mu)} \sum_{k=0}^{n} \binom{n}{k} \lambda^k \mu^{n-k}$$

$$P(Z=n) = \frac{1}{n!} e^{-(\lambda+\mu)} (\lambda+\mu)^n$$

This shows that Z is Poisson with parameter λ + μ.
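Again, a small simulation (numpy assumed; the parameter values are illustrative) can confirm that the sum of two independent Poissons matches the Poisson(λ + μ) pmf:

```python
import numpy as np
from math import exp, factorial

rng = np.random.default_rng(2)
lam, mu, N = 2.0, 3.0, 200_000

z = rng.poisson(lam, size=N) + rng.poisson(mu, size=N)   # Z = X + Y

for n in range(8):
    emp = (z == n).mean()                                # empirical P(Z = n)
    thr = (lam + mu) ** n * exp(-(lam + mu)) / factorial(n)
    print(n, round(emp, 4), round(thr, 4))
```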

(b) Show that, conditional on Z = n, X and Y are binomial random variables
with index n and parameters $p = \frac{\lambda}{\lambda+\mu}$ and $q = \frac{\mu}{\lambda+\mu}$ respectively. Hint: find
P(X = k|Z = n).
Solution:

$$P(X=k|Z=n) = \frac{P(Z=n|X=k)P(X=k)}{P(Z=n)} = \frac{P(X+Y=n|X=k)P(X=k)}{P(Z=n)}$$

$$= \frac{P(Y=n-k)P(X=k)}{P(Z=n)} = \frac{\mu^{n-k} e^{-\mu}}{(n-k)!} \cdot \frac{\lambda^k e^{-\lambda}}{k!} \cdot \frac{n!}{(\lambda+\mu)^n e^{-(\lambda+\mu)}}$$

$$= \binom{n}{k} \left(\frac{\lambda}{\lambda+\mu}\right)^k \left(\frac{\mu}{\lambda+\mu}\right)^{n-k}$$
The same can be done for Y .
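The conditional binomial result can also be checked empirically. A minimal sketch, assuming numpy and illustrative parameters:

```python
import numpy as np
from math import comb

rng = np.random.default_rng(3)
lam, mu, N = 2.0, 3.0, 500_000

x = rng.poisson(lam, size=N)
y = rng.poisson(mu, size=N)
z = x + y

n = 5
cond = x[z == n]                  # values of X on the event Z = n
p = lam / (lam + mu)
for k in range(n + 1):
    emp = (cond == k).mean()      # empirical P(X = k | Z = n)
    thr = comb(n, k) * p**k * (1 - p) ** (n - k)
    print(k, round(emp, 4), round(thr, 4))
```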


Problem 4: Sample variance
Let X be a random variable with unknown mean μ and unknown variance
σ². Let $x_1, \ldots, x_n$ be n independent samples. Let $\bar{x} = \sum_{i=1}^n x_i / n$. Note that
$E[\bar{x}] = \mu$. Define:

$$S_n^2 = \frac{\sum_{i=1}^{n} (x_i - \bar{x})^2}{n-1}$$

Show that $E[S_n^2] = \sigma^2$.
PS: This suggests that $S_n^2$ should be used as an estimate of $\sigma^2$ rather than
$\frac{n-1}{n} S_n^2$ (which is more intuitive, but wrong).
Solution:

$$E[S_n^2] = \frac{1}{n-1} \sum_{i=1}^{n} E[(x_i - \bar{x})^2]$$

Since the $x_i$ are identically distributed, we can rewrite this as:

$$E[S_n^2] = \frac{n}{n-1} E[(x - \bar{x})^2]$$

Let us compute $E[(x - \bar{x})^2]$. Note that this is not the variance $\sigma_x^2$ because $\bar{x}$ is
not E[x].

$$E[(x - \bar{x})^2] = E[x^2] + E[\bar{x}^2] - 2E[x\bar{x}]$$

Let's compute $E[\bar{x}^2]$ first.

$$E[\bar{x}^2] = \frac{1}{n^2} E[(x_1 + \ldots + x_n)(x_1 + \ldots + x_n)]$$

If we expand, we have n terms of the form $x_i x_i$ and $n(n-1)$ terms of the form
$x_i x_j$ ($i \neq j$). Since $x_i$ and $x_j$ are independent, $E[x_i x_j] = E[x_i]E[x_j]$. And since
all $x_i$ are identically distributed, we end up with:

$$E[\bar{x}^2] = \frac{1}{n} E[x^2] + \frac{n-1}{n} E[x]^2$$

Now let's compute $E[x\bar{x}]$.

$$E[x\bar{x}] = \frac{1}{n} E[x(x_1 + \ldots + x_n)] = \frac{1}{n} E[x^2] + \frac{n-1}{n} E[x]^2$$

Combining the two results, we have:

$$E[(x - \bar{x})^2] = E[x^2] + \frac{1}{n} E[x^2] + \frac{n-1}{n} E[x]^2 - \frac{2}{n} E[x^2] - 2\,\frac{n-1}{n} E[x]^2$$

$$= \frac{n-1}{n} E[x^2] - \frac{n-1}{n} E[x]^2 = \frac{n-1}{n} \left(E[x^2] - E[x]^2\right) = \frac{n-1}{n} \sigma_x^2$$

Therefore,

$$E[S_n^2] = \sigma_x^2$$
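numpy exposes exactly this choice through the ddof argument of var. A minimal sketch (illustrative parameters) showing that dividing by n − 1 is unbiased while dividing by n underestimates σ²:

```python
import numpy as np

rng = np.random.default_rng(4)
sigma2, n, reps = 4.0, 10, 100_000

samples = rng.normal(0.0, np.sqrt(sigma2), size=(reps, n))
s2 = samples.var(axis=1, ddof=1)        # S_n^2: divide by n - 1
s2_naive = samples.var(axis=1, ddof=0)  # divide by n (intuitive, but biased)

print(s2.mean())        # ~4.0 = sigma^2
print(s2_naive.mean())  # ~3.6 = (n-1)/n * sigma^2
```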
Problem 5: Law of large numbers
Consider flipping a fair coin 2n times and let $S_{2n}$ be the number of heads. We
can prove the following result ($0 \le t \le n$):

$$P(n - t \le S_{2n} \le n + t) \ge 1 - e^{-t^2/(n+t)}$$

(a) How large must n be to guarantee that the fraction of heads is within 0.01
of 1/2 with probability at least 0.99?
Solution: Given the above inequality, we can rewrite it as follows:

$$P\left(\left|\frac{S_{2n}}{2n} - \frac{1}{2}\right| \le \frac{t}{2n}\right) \ge 1 - e^{-t^2/(n+t)}$$

Therefore, we need $t/2n = 0.01$, i.e. $t = 0.02n$. In this case, $e^{-t^2/(n+t)} = e^{-0.02^2 n/(1+0.02)}$.
We want this quantity to be $\le 0.01$, i.e. $n \ge 11743.183$.
Therefore, a choice of n = 11744 is appropriate.
(b) Answer the same question using Chebyshev's inequality.
Solution: Chebyshev's inequality states that:

$$P\left(\left|\frac{S_{2n}}{2n} - \frac{1}{2}\right| \ge \epsilon\right) \le \frac{\sigma^2}{2n\epsilon^2}$$

We want $\epsilon = 0.01$ and, therefore, $\sigma^2/(2n \cdot 0.01^2) \le 0.01$. Since $\sigma^2 = 1/4$,
$n \ge 1/(8 \cdot 0.01^3) = 125000$.

(c) Answer the same question using the central limit theorem.
Solution:

$$P\left(\left|\frac{S_{2n}}{2n} - \frac{1}{2}\right| \ge 0.01\right) = P\left(\left|\frac{S_{2n} - n}{2n}\right| \ge 0.01\right) = P\left(\left|\frac{S_{2n} - n}{\sqrt{2n}}\right| \ge \frac{2n \cdot 0.01}{\sqrt{2n}}\right)$$

By the central limit theorem, $\frac{S_{2n}-n}{\sqrt{2n}} \sim N(0,1)$ for large n. Therefore,

$$P\left(\left|\frac{S_{2n} - n}{\sqrt{2n}}\right| \ge \frac{2n \cdot 0.01}{\sqrt{2n}}\right) \approx 2\left(1 - \Phi\left(\frac{2n \cdot 0.01}{\sqrt{2n}}\right)\right) \le 0.01$$

Therefore, we need $\Phi\left(\frac{2n \cdot 0.01}{\sqrt{2n}}\right) \ge 0.995$, or $\frac{2n \cdot 0.01}{\sqrt{2n}} \ge 3.2903$ (from tables).
This means $n \ge 54130.37$. So n = 54131 should be enough.
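The three thresholds can be reproduced with a few lines of arithmetic. This sketch simply re-evaluates the bounds as written in parts (a) through (c), including the table value 3.2903 quoted above:

```python
from math import log, ceil

# (a) exponential tail bound: exp(-(0.02**2) * n / 1.02) <= 0.01
n_a = log(100) * 1.02 / 0.02**2
print(n_a, ceil(n_a))        # 11743.18..., 11744

# (b) Chebyshev: (1/4) / (2 * n * 0.01**2) <= 0.01
n_b = 1 / (8 * 0.01**3)
print(n_b)                   # 125000.0

# (c) CLT, as computed above: 0.01 * sqrt(2n) >= 3.2903
n_c = (3.2903 / 0.01) ** 2 / 2
print(n_c, ceil(n_c))        # 54130.3..., 54131
```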

Problem 6: Cutting a circle


Consider a circle. We are interested in cutting the circle in two pieces using the
following procedure. First, an angle x is chosen uniformly at random from 0 to
2π. Then, given x, we choose an angle y uniformly at random from x to 2π.
Then we cut the piece defined by the two angles. Find the following:
(a) f(x)
Solution:

$$f(x) = \frac{1}{2\pi}, \quad 0 \le x \le 2\pi$$

(b) f(y|x)
Solution:

$$f(y|x) = \frac{1}{2\pi - x}, \quad x \le y \le 2\pi$$

(c) f(x, y)
Solution:

$$f(x,y) = f(y|x)f(x) = \frac{1}{2\pi(2\pi - x)}, \quad 0 \le x \le y \le 2\pi$$

(d) f(y)
Solution:

$$f(y) = \int_0^{2\pi} f(y|x)f(x)\,dx = \int_0^{y} f(x,y)\,dx$$

$$= \int_0^{y} \frac{1}{2\pi(2\pi - x)}\,dx = -\frac{1}{2\pi}\log(2\pi - x)\Big|_0^y$$

$$= \frac{1}{2\pi}\log(2\pi) - \frac{1}{2\pi}\log(2\pi - y) = \frac{1}{2\pi}\log\frac{2\pi}{2\pi - y}, \quad 0 \le y \le 2\pi$$

(e) f(x|y)
Solution:

$$f(x|y) = \frac{f(y|x)f(x)}{f(y)} = \frac{f(x,y)}{f(y)} = \frac{1}{(2\pi - x)\log\frac{2\pi}{2\pi - y}}, \quad 0 \le x \le y$$

(f) the expected cut

Solution:

$$E[X] = \pi \quad \text{(uniform)}$$

$$E[Y] = E_X\big[E[Y|X=x]\big] = E_X\left[\frac{2\pi + x}{2}\right] = \frac{2\pi + \pi}{2} \quad \text{(uniform)}$$

Therefore, $E[Y] = 3\pi/2$. The expected cut is given by the quadrant $[\pi, 3\pi/2]$.
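A Monte Carlo check of the expected cut (numpy assumed, illustrative sample size): draw X uniform on [0, 2π], then Y uniform on [X, 2π], and compare the sample means with π and 3π/2.

```python
import numpy as np

rng = np.random.default_rng(5)
N = 1_000_000

x = rng.uniform(0.0, 2 * np.pi, size=N)   # X ~ Uniform(0, 2*pi)
y = rng.uniform(x, 2 * np.pi)             # Y | X = x ~ Uniform(x, 2*pi)

print(x.mean(), np.pi)                    # sample mean of X vs pi
print(y.mean(), 3 * np.pi / 2)            # sample mean of Y vs 3*pi/2
```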

Problem 7: Computing binomial probabilities
In the lab session, we saw how to compute the posterior probabilities using a
discrete prior and binomial observations.

$$P(p|x) \propto P(x|p)P(p)$$

$$P(x|p) \propto p^s (1-p)^f$$

When s and f are large, $p^s(1-p)^f$ becomes hard to compute (very small number).
Find a solution to this problem. Hint 1: $e^{s \log p + f \log(1-p)}$ will not work
either. Hint 2: As long as the values of P(p|x) are proportionally correct, we
don't care about the exact values of P(p|x).
Solution: We need to multiply $p^s(1-p)^f$ by a large number. However, there
are two issues to consider:
how large?
how to multiply?
One strategy is to choose a number c that is large enough to make the largest
$c\,p^s(1-p)^f$ equal to 1. Therefore,

$$c = \frac{1}{\max_i\, p_i^s (1-p_i)^f} = \min_i\, \frac{1}{p_i^s (1-p_i)^f}$$
But how do we compute c if we cannot compute $p_i^s(1-p_i)^f$ to start with? We
can look at $\log c = \min_i\,[-s \log p_i - f \log(1-p_i)]$ to determine which $p_i$ maximizes
$p_i^s(1-p_i)^f$. Then c can be determined by computing the exponential of log c.
But what if c is too large? On the other hand, even if c is known, multiplying
by c is not straightforward, as the multiplication process must go through $p^s$
and $(1-p)^f$ anyway. This may be avoided by starting with c and successively
multiplying by either p or 1 − p. But, again, if c is too large, we lose precision.
To solve these two problems, we perform everything in log space. Consider the
following expression:

$$e^{s \log p_i + f \log(1-p_i) + \log c} = c\,p_i^s (1-p_i)^f$$

Therefore, we only need to compute $\log c = \min_i\,[-s \log p_i - f \log(1-p_i)]$, then
for each $p_i$, compute $e^{s \log p_i + f \log(1-p_i) + \log c}$.
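A possible implementation of this log-space trick is sketched below in Python/numpy (the lab may have used a different language; the grid and the counts are illustrative). Subtracting the maximum log value is the same as adding log c as defined above; folding the log prior in as well is a small extension of the same idea.

```python
import numpy as np

def posterior(p_grid, prior, s, f):
    """Posterior over a discrete grid of p values, given s successes and
    f failures, computed entirely in log space to avoid underflow."""
    logs = s * np.log(p_grid) + f * np.log(1 - p_grid) + np.log(prior)
    logs -= logs.max()           # add log c: the largest term becomes exactly 0
    w = np.exp(logs)             # safe: every value is now in (0, 1]
    return w / w.sum()           # normalize; proportionality is all we need

p_grid = np.linspace(0.01, 0.99, 99)
prior = np.full_like(p_grid, 1 / len(p_grid))    # uniform discrete prior
post = posterior(p_grid, prior, s=7000, f=3000)  # naive p**s * (1-p)**f underflows
print(p_grid[post.argmax()])                     # ~0.7 = s / (s + f)
```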
