Professional Documents
Culture Documents
Vono ICASSP2019
Vono ICASSP2019
ABSTRACT methods, has not benefited from these recent advances in opti-
mization making them less attractive. In particular, their high
In recent years, much research has been devoted to the restoration
computational cost compared to variational methods can be pro-
of Poissonian images using optimization-based methods. On the
hibitive in high-dimensional applications (e.g. in astronomy) where
other hand, the derivation of efficient and general fully Bayesian
the size of the observations can be of the order of the megapixel
approaches is still an active area of research and especially if stan-
(resp. megavoxel). As a result, few efficient simulation-based
dard regularization functions are used, e.g. the total variation (TV)
methods exist to tackle image restoration under Poisson noise. For
norm. This paper proposes to use the recent split-and-augmented
instance, [20, 21] dealt with the Poissonian likelihood by using a
Gibbs sampler (SPA) to sample efficiently from an approximation of
Taylor series expansion leading to a quadratic approximation of
the initial target distribution when log-concave prior distributions are
the former. In a similar vein, [22] used a Metropolis-within-Gibbs
used. SPA embeds proximal Markov chain Monte Carlo (MCMC)
algorithm with a Gaussian proposal while the Riemmanian manifold
algorithms to sample from possibly non-smooth log-concave full
Hamiltonian Monte Carlo method [23] has been proposed in [24].
conditionals. The benefit of the proposed approach is illustrated on
Finally, a recent MCMC approach has been proposed in [25] ex-
several experiments including different regularizers, intensity levels
ploiting the Poisson-Gamma conjugacy which appears to compete
and with both analysis and synthesis approaches.
with standard regularization functions such as TV.
Index Terms— Bayesian inference, image restoration, Poisson However, up to our knowledge, no fully Bayesian approach con-
noise, split-and-augmented Gibbs sampler. sidered the sampling from a posterior distribution built on a Poisso-
nian likelihood and standard convex and possibly non-smooth reg-
1. INTRODUCTION ularizers. This paper proposes an approach relying on the recent
split-and-augmented Gibbs sampler (SPA) [26] to tackle this prob-
Poisson noise can appear in a lot of image processing problems lem. In particular, the log-concave property of the prior permits to
where observations are obtained through a count of particles (e.g. derive log-concave full conditionals that can be sampled by proxi-
photons) emitted by the object of interest and arriving in the im- mal MCMC algorithms [27, 28] which rely on proximity operators
age plane where a detector (e.g. a charged coupled device (CCD) of convex funtions. This approach permits to fulfill a number of
camera) is located [1]. For instance, this statistical property of the important criteria for the practitioner. It yields good and reliable
noise occurs in emission tomography (ET) [2], fluorescence mi- reconstruction results, a moderate computational time compared to
croscopy [3] or astronomy [4, 5]. In particular, a growing interest in optimization-based methods, the computation of credibility intervals
restoring astronomical Poissonian images can be traced back at least and the ability to approximate several Bayesian estimators in a single
to the Hubble Space Telescope optical aberration in the early 90’s. run.
Methods to tackle such image restoration problems in these years To this purpose, Section 2 presents the considered image pro-
were mainly based on Tikhonov-Miller inverse filter and maximum cessing problem and the main sampling issues associated to the de-
likelihood (ML) estimation via the expectation-maximization (EM) rived posterior. Section 3 presents how SPA can handle complicated
algorithm [2, 6, 7]. This is for instance the case for the classical probability distributions of the form (7) with an approximate but
Richardson-Lucy algorithm [8, 9] used and revisited in a lot of ap- controlled sampling scheme. Then, Section 4 derives SPA for the
plications [10, 11] since noise amplification tended to appear. We considered Poissonian image restoration problem where full con-
refer the interested reader to [12] for a review of Poissonian image ditionals are sampled using efficient state-of-the-art MCMC meth-
restoration methods up to 2006. Since then, much research has been ods. The proposed approach is illustrated on several experiments
devoted to these problems and a lot of advances have been made in and compared to its determistic counterpart described in [16] as well
optimization-based methods. Among other works, [13] considered as to the proximal Moreau-Yoshida unadjusted Langevin algorithm
a forward-backward splitting algorithm using the Anscombe vari- (P-MYULA) [28] although its direct application is not straightfor-
ance stabilizing transform (VST) [14] while [15–17] used alternate ward and necessitates approximation schemes. Finally, Section 5
minimization schemes in order to tackle the exact Poissonian like- draws concluding remarks.
lihood function. The optimization-based algorithms derived by the
aforementioned authors appear to be efficient in various scenarios 2. PROBLEM FORMULATION
ranging from analysis to synthesis approaches [18] with different
regularization functions (e.g. total variation (TV) [19] or ℓ1 norm) This section presents the Bayesian inference problem related to Pois-
in low or high signal-to-noise ratio (SNR) cases. sonian observations with a log-concave prior distribution. The main
On the other hand, although variational Bayes methods have properties of the derived posterior distribution are presented before
been also considered, Poissonian image restoration with fully giving a brief review of existing MCMC methods that could be used
Bayesian approaches, such as Markov chain Monte Carlo (MCMC) to sample from this posterior.
2.1. Model of using proximal MCMC approaches [27, 28] to tackle the non-
differentiability of the potential function associated to π thanks to
In this section, we consider the observation of some image y ∈ N , m
Moreau-Yoshida regularization [31]. However, these approaches re-
damaged (e.g. blurred or with missing pixels) and contaminated by quire the existence of a smooth gradient-Lipschitz continuous term
Poisson noise. We assume that each individual observation yi , i ∈ in the potential function which is not the case in the problem consid-
J1, mK corresponds to an independent realization of a Poisson ran- ered, see property 3 in Prop. 1. Note that it is possible to tackle this
dom variable, that is for all i ∈ J1, mK, issue by using the Anscombe variance stabilizing transform [14],
i.i.d. ( ) see Section 4. Additionally, proximal MCMC algorithms assume
yi ∼ P [Hx + b]i , (1) that the proximal operator associated to the non-smooth potential,
here f2 + f3 , is available. This is not the case for the considered
where x ∈ Rd+ stands for an original image to recover, H ∈ Rm×d potentials defined in (5) if one considers a synthesis frame-based
is an operator related to the point spread function (PSF) and b ∈ Rd+ approach, see Section 4.1. Therefore, one has to resort to a more
stands for some background noise, e.g. accidental coincidences in complicated scheme (e.g. the splitting method of maximal mono-
positron emission tomography (PET). We assume in the sequel that tone operators [32,33]) to compute this proximity operator. To avoid
for all x ∈ Rd+ , Hx ∈ Rd+ . In order to make an easier link with the the above issues, the considered sampling problem is dealt with by
proposed approach, the likelihood distribution is rewritten as follows resorting to a variable-splitting inspired MCMC algorithm, namely
( ) the split-and-augmented Gibbs sampler (SPA, see Section 3) that we
p(y|x) ∝ exp −f1 (Hx; y) , (2)
recently proposed in [26]. Then, each full conditional distribution
where involves a smooth gradient-Lipschitz continuous potential function
along with a possibly non-smooth potential enabling the use of prox-
∑
m
( ) imal MCMC for each of them.
f1 (Hx; y) = −yi log [Hx + b]i + [Hx + b]i . (3)
i=1
3. SPLIT-AND-AUGMENTED GIBBS SAMPLER
Following a Bayesian approach [29], the prior distribution is de-
fined as follows for all x ∈ Rd , This section presents a particular type of variable splitting-inspired
( ) hierarchical Bayes models in order to solve the inference problem
p(x) ∝ exp −f2 (x) − f3 (x) , (4)
presented in Section 2.1. The associated joint distribution, namely
where f2 (x) = τ ψ(x) and f3 (x) = ιRd (x), (5) the split-and-augmented distribution is presented along with its main
+
properties and will be targetted by SPA. In the sequel, the object x
to infer is not necessarily the original image (e.g. x could be coef-
where ψ : Rd → R stands for a proper, coercive, convex, lower
ficients associated to a frame). Thereby, in order to embed a large
semicontinuous (l.s.c), possibly non-differentiable function, τ is a
number of cases, the variable splitting-based approach is presented
positive parameter; ιC is the indicator function of set C, i.e. ιC (x) =
below under a general formulation.
0 if x ∈ C and ιC (x) = +∞ otherwise. The functions f2 (via ψ)
and f3 being convex, the potential f2 + f3 is also convex and the
prior in (4) is log-concave. The function f2 stands for some regu- 3.1. Hierarchical Bayes model
larization term (e.g. ℓ1 norm) while the function f3 guarantees the
Let us start from the initial target distribution π under the very gen-
non-negativity of the original image x since the latter can be viewed
eral form
as an intensity. By the application of Bayes’ rule, the posterior dis-
tribution writes ∑ b
( ) π(x) ∝ exp − fi (Ki x) , (7)
π(x) ∝ exp −f1 (Hx) − f2 (x) − f3 (x) , (6) i=1
4.3. Results
90
proaches. Although the first approximation can be controlled with a 70
80
60
single parameter [28], the second one is not justified in all scenarios 50 10
70
60
30
experiments.
Credibility intervals – Additionally to compute pointwise Bayesian Fig. 1. Original images (1st row), noisy and blurred observations
estimates (e.g. MMSE estimate, see fig. 1), the proposed approach (2nd row), MMSE estimates computed with SPA (3rd row) and as-
has the benefit of proposing credibility intervals by exploring the sociated 95% credibility intervals (4th row).
whole posterior distribution of the variable of interest x.
5. CONCLUSION
Thus, fig. 1 shows the 95% credibility intervals computed by
SPA for a couple of experiments. For the cameraman and Saturn This paper derives a novel MCMC approach to restore Poissonian
images, one can remark that the credibility intervals associated to images under a log-concave prior distribution. Along with efficient
highly Poisson contaminated regions appear to have a noise struc- optimization-based algorithms, the proposed approach has the ben-
ture. On the other hand, the range of credibility intervals associated efit of completing the inference task by providing uncertainty quan-
to the neuron image are roughly piecewise constants. This difference tification or by performing model selection. Moreover, the proposed
is mainly due to the amount of regularization (via the parameter τ ) approach has reasonable computation time w.r.t. optimization-based
set for each image. Indeed, for the cameraman and Saturn image, τ methods thanks to the embedding of efficient proximal MCMC algo-
was smaller since more details were present in the image. As pointed rithms. This work paves the way toward efficient fully Bayesian ap-
out by [28], we believe that simulation and optimization-based algo- proaches for even more complicated models (e.g. Poisson-Gaussian
rithms should be considered together to conduct image processing noise) or richer models using sophisticated regularization functions
and analysis tasks. Indeed, first a quick MAP estimation can be per- (e.g. total generalized variation).
formed by optimization techniques with standard regularizers. Then,
the proposed approach can be used to conduct uncertainty quantifica-
tion, hypothesis testing or even model selection by comparing model
evidences since the potential of the target πρ,α in (8) corresponds to
an arbitrarily small approximation of the initial potential function.
6. REFERENCES [20] A. L. M. Howard and M. Fowler, “Sampling-based uncertainty
quantification in deconvolution of X-ray radiographs,” J. Com-
[1] M. Bertero et al., “Image deblurring with Poisson data: from put. Appl. Math., vol. 270, pp. 43–51, 2014.
cells to galaxies,” Inv. Probl., vol. 25, no. 12, 2009.
[21] M. J. Fowler et al., “A stochastic approach to quantifying the
[2] L. A. Shepp and Y. Vardi, “Maximum likelihood reconstruc- blur with uncertainty estimation for high-energy X-ray imag-
tion for emission tomography,” IEEE Trans. Med. Imag., vol. 1, ing systems,” Inverse Probl. Sci. Eng., vol. 24, no. 3, pp. 353–
no. 2, 1982. 371, 2016.
[3] D. A. Agard and J. W. Sedat, “Three-dimensional architecture [22] J. M. Bardsley and A. Luttman, “A Metropolis-Hastings
of a polytene nucleus,” Nature, vol. 302, pp. 676–681, 1983. method for linear inverse problems with Poisson likelihood and
[4] R. J. Hanisch and R. L. W. (ed.), The restoration of HST images Gaussian prior,” Int. J. Unc. Quant, vol. 6, no. 1, pp. 35–55,
and spectra, 1991. 2016.
[5] R. J. Hanisch and R. L. W. (ed.), The restoration of HST images [23] M. Girolami and B. Calderhead, “Riemann manifold Langevin
and spectra II, 1994. and Hamiltonian Monte Carlo methods,” J. Roy. Stat. Soc. Ser.
B, vol. 73, no. 2, pp. 123–214, 2011.
[6] M. Bertero et al., “Three-dimensional image restoration and
super-resolution in fluorescence confocal microscopy,” Journal [24] C. C. S. Pedemonte and K. V. Leemput, “Bayesian to-
of Microscopy, vol. 157, no. 1, pp. 3–20, 1990. mographic reconstruction using Riemannian MCMC,” in
Med. Image Comp. Computer-Assisted Intervention (MICCAI),
[7] T. J. Holmes, “Maximum-likelihood image restoration adapted
2015.
for noncoherent optical imaging,” J. Opt. Soc. Am. A, vol. 5,
no. 5, pp. 666–673, 1988. [25] Y. Altmann et al., “A Bayesian approach to denoising of single-
photon binary images,” IEEE Trans. Comput. Imag., vol. 3,
[8] W. H. Richardson, “Bayesian-based iterative method of image
no. 3, pp. 460–471, 2017.
restoration,” J. Opt. Soc. Am. A, vol. 62, no. 1, pp. 55–59, 1972.
[26] M. Vono, N. Dobigeon, and P. Chainais, “Split-and-augmented
[9] L. B. Lucy, “An iterative technique for the rectification of ob-
Gibbs sampler - Application to large-scale inference prob-
served distributions,” Astron. J., vol. 79, pp. 745–754, 1974.
lems,” IEEE Trans. Signal Process., vol. 67, no. 6, pp. 1648–
[10] J.-L. Starck, A. Bijaoui, and F. Murtagh, “Multiresolution sup- 1661, 2019.
port applied to image filtering and deconvolution,” Graphical
[27] M. Pereyra, “Proximal Markov chain Monte Carlo algorithms,”
Models and Image Processing, vol. 57, pp. 420–431, 1995.
Stat. Comput., vol. 26, no. 4, pp. 745–760, July 2016.
[11] N. Dey et al., “Richardson-Lucy algorithm with total variation
[28] A. Durmus, E. Moulines, and M. Pereyra, “Efficient Bayesian
regularization for 3D confocal microscope deconvolution,” Mi-
computation by proximal Markov chain Monte Carlo: When
crosc. Res. Tech., vol. 69, no. 4, pp. 260–266, 2006.
Langevin meets Moreau,” SIAM J. Imag. Sci., vol. 11, no. 1,
[12] P. Sarder and A. Nehorai, “Deconvolution methods for 3-D pp. 473–506, 2018.
fluorescence microscopy images,” IEEE Signal Process. Mag.,
vol. 23, no. 3, pp. 32–45, 2006. [29] C. P. Robert, The Bayesian Choice: a decision-theoretic moti-
vation. Springer, 2001.
[13] F. Dupé, J. M. Fadili, and J. Starck, “A proximal iteration for
deconvolving Poisson noisy images using sparse representa- [30] S. Duane et al., “Hybrid Monte Carlo,” Phys. Lett. B, vol. 195,
tions,” IEEE Trans. Image Process., vol. 18, no. 2, pp. 310– no. 2, pp. 216 – 222, 1987.
321, 2009. [31] J. J. Moreau, “Fonctions convexes duales et points proximaux
[14] F. J. Anscombe, “The transformation of Poisson, binomial and dans un espace Hilbertien,” C. R. Acad. Sci. Paris Sér. A Math.,
negative-binomial data,” Biometrika, vol. 35, pp. 246–254, vol. 255, pp. 2897–2899, 1965.
1948. [32] J. Eckstein and D. P. Bertsekas, “On the douglas—rachford
[15] N. Pustelnik, C. Chaux, and J.-C. Pesquet, “Parallel proximal splitting method and the proximal point algorithm for maximal
algorithm for image restoration using hybrid regularization,” monotone operators,” Mathematical Programming, vol. 55,
IEEE Trans. Image Process., vol. 20, no. 9, pp. 2450–2462, no. 1, pp. 293–318, April 1992.
2011. [33] P. Lions and B. Mercier, “Splitting algorithms for the sum of
[16] M. A. T. Figueiredo and J. M. Bioucas-Dias, “Restoration of two nonlinear operators,” SIAM J. Numer. Anal., vol. 16, no. 6,
Poissonian images using alternating direction optimization,” pp. 964–979, 1979.
IEEE Trans. Image Process., vol. 19, no. 12, pp. 3133–3145, [34] G. Papandreou and A. L. Yuille, “Gaussian sampling by local
2010. perturbations,” in Adv. in Neural Information Process. Systems,
[17] F. Dupé, J. M. Fadili, and J. Starck, “Deconvolution under 2010, pp. 1858–1866.
Poisson noise using exact data fidelity and synthesis or anal- [35] A. R. Barron and T. M. Cover, “Minimum complexity density
ysis sparsity priors,” Statistical Methodology, vol. 9, no. 1, pp. estimation,” IEEE Trans. Inf. Theory, vol. 37, no. 4, pp. 1034–
4–18, 2012. 1054, 1991.
[18] M. Elad, P. Milanfar, and R. Rubinstein, “Analysis versus syn-
thesis in signal priors,” Inv. Probl., vol. 23, pp. 947–968, 2007.
[19] L. I. Rudin, S. Osher, and E. Fatemi, “Nonlinear total variation
based noise removal algorithms,” Phys. Rev. D, vol. 60, no. 1-4,
pp. 259–268, Nov. 1992.