Professional Documents
Culture Documents
Bayesian Credible Sets: C (x) θ∈C (x)
Bayesian Credible Sets: C (x) θ∈C (x)
Bayesian Credible Sets: C (x) θ∈C (x)
Lecture 12:
The last lecture introduced the idea of using data X to estimate by reporting a random
set C(X) that we think is likely to contain the true value of . We now consider a Bayesian
approach for constructing such a set, which is called a credible set.
Posterior Probabilities
Bayesian inference is based on the posterior distribution of the unknown parameter given
the observed data X = x, which we typically write as ( x). A credible set for with
posterior probability is a set C(x) such that P [ C(x) X = x] = . The posterior
probability is typically reported as a percentage and the credible set is identified by this
percentage (e.g., a 95% credible set for when = 0.95). As with confidence intervals, it is
common to write this posterior probability as 1 instead of .
Computing Credible Intervals
The posterior probability in the definition above is simply is simply
P [ C(x) X = x] =
C(x)
( x) d
or
P [ C(x) X = x] = ( x)
C(x)
according to whether is continuous or discrete. Note that for any particular value of 1
and any particular observed data X = x, there will typically be many sets C(x) such that
P [ C(x) X = x] = 1 . There are two commonly used approaches for choosing a
particular set C(x).
Equal-Tailed Credible Intervals
One approach is to to choose the set C(x) to be an interval {0 C1 (x) < 0 < C2 (x)}
such that P [ C1 (x) X = x] = P [ C2 (x) X = x] = /2, i.e.,
C1 (x)
( x) d =
C2 (x)
( x) d =
or
( x) =
C1 (x)
( x) =
C2 (x)
i=1
i=1
x Beta(a + xi , b + n xi ).
Since we cannot integrate the pdf of the beta distribution in closed form, there is no direct
formula for an equal-tailed credible interval with probability 1. However, for any particular
values of a, b, n, and x, it is trivial to compute such an interval using statistical software
or tables of the beta distribution. For example, if a = b = 1, n = 25, and ni=1 xi = 24, then a
95% equal-tailed credible interval for is approximately (0.803, 0.991).
Example 12.0.2: For the posterior in Example 12.0.1, a highest posterior density credible
interval is not as easy to obtain numerically as an equal-tailed credible interval. Since the
beta pdf is unimodal, the problem reduces to finding two endpoints for which (i) the beta pdf
takes the same value and (ii) the integral of the beta pdf between the two endpoints is 1 .
(If the beta pdf were not unimodal, the task would be more difficult.) This problem can
still be solved numerically, and we find that a 95% HPD credible interval is approximately
(0.829, 0.998).
The following theorem provides an intuitively sensible property of highest posterior density
credible sets. To state the theorem concisely, we first define some notation. For any subset H
of the parameter space , let A(H) denote the total length of the set (if is continuous)
or the total number of points in the set (if is discrete), i.e., A(H) = H d or A(H) = H.
Note: If you are familiar with the notions of Lebesgue measure and counting measure,
then you recognize that we are simply taking A(H) to be the measure of the set H
under whichever of these measures is appropriate.
Theorem 12.0.3. Let C(x) be a highest posterior density credible set for with posterior
P [ C(x) C(x)
X = x] = P [ C(x) X = x] P [ C(x) C(x)
X = x]
= P [ C(x)
X = x] P [ C(x) C(x)
X = x]
= P [ C(x)
C(x) X = x],
P [ C(x) C(x)
X = x] k A[C(x) C(x)]
since ( x) > k for all C(x), and
P [ C(x)
C(x) X = x] k A[C(x)
C(x)]
A[C(x) C(x)]
A[C(x)
C(x)].
Then the result follows from the fact that A[C(x)] = A[C(x) C(x)]
+ A[C(x) C(x)]
and A[C(x)]
= A[C(x) C(x)]
+ A[C(x)
C(x)].
The basic idea of Theorem 12.0.3 is simpler than it may appear from the proof. Essentially,
to capture a specified posterior probability in the smallest possible set, we should choose
a set on which the posterior pdf or pmf is as large as possible. Such a set is precisely the
definition of a highest posterior density credible set.
Numerical or Approximate Calculations
As we saw in Example 12.0.1 and Example 12.0.2, simple formulas for credible intervals often
cannot be found since the posterior distribution is often a pdf that cannot be integrated in
closed form. Thus, we typically must rely on either numerical solutions (as in Example 12.0.1
and Example 12.0.2) or approximations.