Professional Documents
Culture Documents
ArticleMDC EURASIP JASP
ArticleMDC EURASIP JASP
Abstract
We consider the problem of efcient image/video transmission over wireless channels. Such a problem involves good
compression rates and effectiveness in presence of channel failures. In this work, we use the multiple description coding
(MDC) techniques, based on Wavelet Transforms, that have been shown to be powerful against channel failures. We
propose a bit allocation procedure that dispatches the source redundancy between the different channels when
compressing to a target bit rate with a bounded side distortion. In this way, we develop an MDC scheme well adapted
to channel noise. Then, we extend this bit allocation to video. This extension uses the 3D scan-based DWT of Parisot
et al. (MMSP, IEEE, New York, 2001); that involves scan-based MDC with rate or quality control. In this work we
propose to adapt redundancy between the descriptors in function of channel model and state (BER). Furthermore, the
channels can have time-varying states. We consider BSC channels, Gaussian channels, and UMTS channels. The
method can be efciently extended to other channels when the channel model that matches the channel behavior is
known.
We evaluate the performance of the proposed MDC image/video coder for two descriptions. In a rst group of
simulations, we compare our application with some different MDC techniques presented by Vaishampayan (IEEE
Trans. Inform. Theory 39 (3) (1993) 821) and Servetto et al. (ICIP, IEEE, Chicago, USA, 1998; IEEE Trans. Image
Process. 9 (5) (2000) 813826). In a second group of simulations, we evaluate the robustness of the proposed MDC
when submitted to different kinds of channels noise.
r 2003 Elsevier B.V. All rights reserved.
Keywords: Joint source channel coding; Multiple description coding; Bit allocation; Adaptability to channel model and state; Error
resilience video coding; Video over 3G wireless systems
1. Introduction
*Corresponding author.
E-mail addresses: pereira@i3s.unice.fr (M. Pereira), am@
i3s.unice.fr (M. Antonini), barlaud@i3s.unice.fr (M. Barlaud).
URLs: http://www/i3s.unice.fr/pereira, http://www.i3s.uni
ce.fr/barlaud.
1
On leave from Universidade da Beira Interior, Portugal.
Research partially supported by PRAXIS XXI Grant SFRH/
BD/1234/2000.
The use of mobile communication and multimedia communication knew an enormous increase
in the latest decade, being the wireless channels
considered as a transport medium for various
types of multimedia information. However, the
scarcity of wireless bandwidth, the time-varying
characteristics of the channel, and the power
0923-5965/$ - see front matter r 2003 Elsevier B.V. All rights reserved.
doi:10.1016/j.image.2003.08.009
ARTICLE IN PRESS
926
limitations of wireless devices, imposes tremendous challenges on wireless multimedia communications. The present work involves image and
video transmission over wireless channels.
Given the bandwidth limitations, compression
will be required for image and video transmission
over wireless channels. Wireless video transmission is specially difcult because the huge volume
of data required to describe a video greatly slows
down transmission and then, involves the use of
lossy compression at low bit rates. The transmission delay is very important since information that
arrives too late at decoder is considered as lost.
Moreover, data are corrupted by channel noise.
The channel noise can occur in the form of
random bit errors, bursty bit errors or packet
losses. Both problems result on losses of perceptual quality at decoder. This made robust compression schemes very useful especially for
transmission at low bit rates.
1.1. Error control techniques or multiple
description coding (MDC)?
The classic technique to combat transmission
errors is forward error correction (FEC)
[5,6,8,27,34,35,38]. FEC involves the addition of
redundant data to the compressed signal, which
allows the decoder to correct errors up to a certain
level. This redundancy increases the total number
of bits required and thus reduces compression.
Moreover, FEC code must be designed with a
worst case channel scenario in mind. For channels
that have a highly variable quality, this worst case
may imply the need for a very powerful code, and
hence highly or even prohibitive amount of
redundancy, which will severally reduce the
compression performance. In the case of burst
errors, the error correction capability is often
exceeded or the block is error-free in which case
additional redundancy is wasted. To overcome this
limitation, FEC is often enhanced by a technique
known as interleaving. For burst errors, this
effectively reduces concentration of errors in single
code words, more precisely, a burst of b consecutive symbol errors causes a maximum of b=M
symbol errors in each code word. Thought
interleaving can be implemented with low com-
ARTICLE IN PRESS
M. Pereira et al. / Signal Processing: Image Communication 18 (2003) 925945
927
descriptions, using a generalized Lloyd-like clustering algorithm that minimizes the Lagrangian of the
rates and expected distortions R1 ; R2 ; D1 ; D2 ; D0
[31,32,4042]: In the second approach, pioneered by
Wang, Orchard, and Reibman [44], MD quantizers
are constructed by separately describing (i.e.,
quantizing and coding) the N coefcients of an N
N block linear transform, which has been designed
to introduce a controlled amount of correlation
between the transform coefcients [10,9]. In the
third approach, pioneered by Goyal, Kovacevic,
and Vetterli, MD quantizers are constructed by
separately describing the N coefcients of an overcomplete N K tight frame expansion [10,11].
Other works were developed by Jiang et al. who
propose an MD extension to SPIHT coder
presented in [30], by separating Zerotrees into
polyphase components [13]. Rogers et al. propose
to rearrange bits at the output of one conguration
of the SPIHT coder, in such a way that the loss of
one packet results in an error that does not
propagate beyond the image region contained in
that packet [29]. Mohr et al. propose the use of
error correcting codes of different strengths applied
to different portions of a progressive bit stream
such as that generated by SPIHT coder [20].
Previous MD coding dedicated to video was
proposed by Vaishampayan in [43]. A predictive
MD system was applied along with transform
coding to construct an interframe balanced MD
video coder based on the H.263 standard. In [2],
Apostolopoulos and Wee show that MD coding
and path diversity provide improved reliability in
systems with multiple paths with equal or unequal bandwidths. In [28], Reibman et al. proposed MD video coders which use motion
compensated predictions. In [23], we propose an
extension of [25] for video that uses the 3D scanbased DWT.
ARTICLE IN PRESS
M. Pereira et al. / Signal Processing: Image Communication 18 (2003) 925945
928
2. Problem statement
Our MD scheme focus on the special case in
which there are two channels of equal capacity
between a transmitter and a receiver. In such a
scheme, a sequence of source symbols is given to
an encoder to produce two independent bitstreams
of equal importance. These bitstreams are transmitted to three decoders over two noisy channels.
One decoder (the central decoder) receives information sent over both channels while the remaining two decoders (the side decoders) receive
information only from their respective channel.
The amount of redundancy is dispatched on the
different descriptions by taking into account
the channel model and state (or bit error rate).
The general coder is presented in Fig. 1.
Given the side rate Rl and the side distortion
Dl ; the generation of the two descriptions is
constrained to some conditions that we detail in
the following.
( R l , Dl )
Sequence
Video
3D
Scan Based
DWT
( R 1 , D1)
MDC
( R 2 , D2)
Quality Control
or
Rate Control
bitstream
Side
Decoder
( R 1 , D1)
Central
Decoder
( R 0 , D0)
Side
Decoder
( R 2 , D2)
noisy channel
bitstream
BER
ARTICLE IN PRESS
M. Pereira et al. / Signal Processing: Image Communication 18 (2003) 925945
Condition 1. The central decoder has to reconstruct the original sequence from the two descriptors with minimal central distortion D0 :
Condition 2. An MDC coder must generate two
descriptors each with a side rate R1 R2 Rl :
This allocation problem is a constrained problem which can be solved by introducing the
Lagrange operators. The Lagrangian functional
for the constrained optimization problem, is
given by
Jfqi;1 ; qi;2 g D0
929
2
X
lj Rj pRl
j1
2
X
mj Dj pDl ;
j1
Recall that the central distortion is the distortion of the decoded image when using both
descriptors at decoding. When the decoder receives
both descriptors, each subband appears twice,
with different bit rates (different associated quantization steps). In this case, if the subbands are
noiseless, the central decoder chooses the subbands with the smaller quantization step and the
others, that we will call the redundant ones, are
only considered for side decoders. Hence, we can
calculate, the
central distortion of the decoded
P#SB
image as
i1 minDi;1 ; Di;2 ; where Di;1 ; Di;2
are the distortions of subband i for descriptors
1 and 2, respectively. In the general case we
have to take into account channel noise. So, we
cannot despise the redundant subbands as in
the noiseless case. Actually, the level of redundancy should increase when the BER increases,
such in the case of very noisy channels where the
redundant subbands are as important as the
others. In this case the central
P#SB distortion can
simply be written
as
i1 minDi;1 ; Di;2
P
maxDi;1 ; Di;2 #SB
D
Di;2 :
i;1
i1
ARTICLE IN PRESS
M. Pereira et al. / Signal Processing: Image Communication 18 (2003) 925945
930
1
s
s
N
i;2
>
i;0
<
s2i;1 Di;1 ;
>
>
>
2
>
1
qi;2
> si;2
>
D
>
i;2
>
2
>
si;2
si;0 rN 1
>
>
>
>
2
>
>
si;1 rN
qi;1
>
>
otherwise:
D
:
i;1
si;1
s2i;0 rN 1
Hy x
;
maxpx Hx
px
px
4
The amount of redundancy, i.e., the importance of
the redundant subbands, depends on the channel
BER and we propose in the following some
strategies for the choice of rN :
We
know
that
0pHy xpHx:
Thus,
0pCpmaxpx Hx: Using (7) we can conclude
that 0prN p1 as pretended.
We will show (in Section 6) that the channel
capacity is known for several interesting channel
models. Thus, more details about the calculi of rN
parameters for different channel models will be
done in that section.
rN
maxpx HX C
:
maxpx Hx
3.4. Constraints
Conditions 2 Rj1;2 pRl and 3 Dj1;2 pDl
have to be dened for each descriptor. For the
different descriptors j 1; 2; we write Condition 2
ARTICLE IN PRESS
M. Pereira et al. / Signal Processing: Image Communication 18 (2003) 925945
10
This penalty
allows us to nd a solution
P#SB function
2
with
i1 Di si;j Di;j qi;j =si;j pDl : In this case we
say that the penalty is veried.
3.5. Solution of the problem
Considering the central distortion given by (3)
the constraints (8) and (10), the Lagrangian
functional (1) can be rewritten as
Jfqi;1 ; qi;2 g
qi;1 qi;2
;
si;1 si;2
i1
!
2
#SB
X
X
qi;j
lj
ai Ri;j
Rl
si;j
j1
i1
2
X
jDj Dl j Dj Dl 2
mj
:
2
2
j1
#SB
X
Di s2i;0 Di;0
931
0:
m1
@qi;1
2
2
12a
12b
12c
8
>
s2i;1 1
@
qi;1
>
>
D
>
i;1
2
>
>
r
@q
1
s
s
i;1
i;1
>
< i;0 N
>
>
>
>
s2i;1 rN
@
qi;1
>
>
D
>
i;1
: s2 r 1 @q
s
N
i;1
i;1
i;0
otherwise:
s2i;1 Di;1 ;
13
In this way, the derivation of the Lagrangian
functional (12) becomes
@Jfqi;1 ; qi;2 g
@qi;1
Ci;1
@
qi;1
2
Di si;1
Di;1
@qi;1
1 rN
si;1
@
qi;1
l1 ai
Ri;1
@qi;1
si;1
@ jD1 Dl j D1 Dl 2
0; 14
m1
@qi;1
2
2
where, from (13), Ci;1 is dened by
(
1
if mins2i;1 Di;1 ; s2i;2 Di;2 s2i;1 Di;1 ;
Ci;1
rN otherwise:
15
11
Solution
of
(11)
is
obtained
when
@Jfqi;1 ; qi;2 g=@qi;1 0;
@Jfqi;1 ; qi;2 g=@qi;2 0
and @Jfqi;1 ; qi;2 g=l 0: In the following, we
ARTICLE IN PRESS
M. Pereira et al. / Signal Processing: Image Communication 18 (2003) 925945
932
Di si;1
Di;1
@qi;1
1 rN
si;1
@
qi;1
l1 ai
Ri;1
@qi;1
si;1
@
qi;1
2
m1 E1 Di si;1
Di;1
0;
@qi;1
si;1
we have
pa;s x aejbxj
with
1
b
s
17
Ci;j
2
@Ri;j si;j
Di s
m Ej
j
i;j 1rN
X#SB
aR
i1 i i;j
qi;j
si;j
Rl 0:
19b
@Di;j
q
*
@Ri;j
s
G3=a
G1=a
and
ba
:
2G1=a
where
Prm
mqq=2
pa;s x dx
mqq=2
mqq=2
@f1;m
@f ;m
*
*
*
*
*2 0
@q* a;qf1;m a;qf0;m a;qf1;m a;q @q* a;q
m1
*2
f0;m a;q
P @f0;m
*
pa;1 q=2
ln f0;0 a; q
* 1 N
*
f0;m a; q
*
m1 @q* a; qln
2
PN 2
4. Model-based R and D
In each subband the probability density function of the wavelet coefcients can be approximated with generalized Gaussian. Therefore,
ln 2
20
with
@fn;m
a; q
*
@q*
"
q*
pa;1 mq*
2
n1
1
q*
m
pa;1 mq*
q* n :
2
2
m
1
2
n1
21
ARTICLE IN PRESS
M. Pereira et al. / Signal Processing: Image Communication 18 (2003) 925945
933
5. Proposed algorithm
ARTICLE IN PRESS
M. Pereira et al. / Signal Processing: Image Communication 18 (2003) 925945
934
Wavelet
maxpx HX C
maxpx Hx
p log2 p 1 plog2 1 p:
Scalar
Quantization
Entropy
Coder
( R 1 , D1)
Scalar
Quantization
Entropy
Coder
( R 2 , D2)
( R l , Dl )
Bit
Allocation
coefficient
subband
BER
22
ARTICLE IN PRESS
M. Pereira et al. / Signal Processing: Image Communication 18 (2003) 925945
935
2
maxpx Hx
log2 1 S=N
:
23
1
2
log2 e e
N=S
e ln S=N N=S
: 24
2
7. Results
7.1. Scan-based DWT video coder
For spatial decomposition our coder uses 9-7
biorthogonal lter [1] and three level of decomposition. For temporal decomposition it uses the
(2,2) lter and performs two level of decomposition. The frames of the video sequence are
acquired and processed on the y to generate the
3D wavelet coefcients and the data are stored in
ARTICLE IN PRESS
936
Fig. 7. Lena image coded at 1 bpp: BSC channel at BER 0:001 for left image and 0.01 for right image. Left image PSNR is 33:89 dB;
right image PSNR is 24:99 dB:
ARTICLE IN PRESS
M. Pereira et al. / Signal Processing: Image Communication 18 (2003) 925945
937
Fig. 9. Lena image coded at 1 bpp: Gaussian channel at BER 0:001 for left image and 0.01 for right image. Left image PSNR is
34:90 dB; right image PSNR is 25:49 dB:
Table 1
Mean PSNR (dB) results of QCIF silent video, coded at
200 kbits=s: Channel transmission at BER 0:01; same rN for
the two cases (with and without noise)
Channel
BER
BER
BER
2
10
2
10
Table 2
Mean PSNR (dB) results of QCIF silent video, coded at
200 kbits=s: Channel transmission at BER 0:001; same rN for
the two cases (with and without noise)
Channel
BER
BER
3
BER
3
10
10
33.17
31.54
31.54
31.54
31.50
30.08
31.45
31.45
40.72
39.11
39.11
39.11
40.00
39.07
38.99
39.04
103
41.51
40.09
40.09
40.09
41.15
40.09
40.04
40.060
2
10
UMTSIndoor
31.47 25.57 39.06 38.56 40.06 39.76
UMTSPedestrian 31.47 28.02 39.06 38.67 40.06 39.89
UMTSVehicular 31.47 26.26 39.06 38.58 40.06 39.68
Gaussian
UMTSIndoor
UMTSPedestrian
UMTSVehicular
ARTICLE IN PRESS
M. Pereira et al. / Signal Processing: Image Communication 18 (2003) 925945
938
Y
35
30
PSNR
25
20
15
10
0
1
7 10 13 16 19 22 25 28 31 34 37 40 43 46 49 52 55 58 61 64 67 70 73 76 79 82 85 88 91 94
Frame
Fig. 10. Mean PSNRs of each frame for Y component for Gaussian channel at BER 0:001; pink: no noise; dark blue: proposed
MDC; yellow: SDC; light blue: SDC TC:
U
45
40
35
PSNR
30
25
20
15
10
5
0
1
7 10 13 16 19 22 25 28 31 34 37 40 43 46 49 52 55 58 61 64 67 70 73 76 79 82 85 88 91 94
Frame
Fig. 11. Mean PSNRs of each frame for U component for Gaussian channel at BER 0:001; pink: no noise; dark blue: proposed
MDC; yellow: SDC; light blue: SDC TC:
ARTICLE IN PRESS
M. Pereira et al. / Signal Processing: Image Communication 18 (2003) 925945
939
V
45
40
35
PSNR
30
25
20
15
10
5
0
Frame
Fig. 12. Mean PSNRs of each frame for V component for Gaussian channel at BER 0:001; pink: no noise; dark blue: proposed
MDC; yellow: SDC; light blue: SDC TC:
35
30
25
PSNR
20
15
10
94
91
88
85
82
79
76
73
70
67
64
61
58
55
52
49
46
43
40
37
34
31
28
25
22
19
16
13
10
Frame
Fig. 13. Mean PSNRs of each frame for Y component for UMTS Indoor channel at BER 0:01; pink: no noise; blue: proposed
MDC; yellow: SDC case.
ARTICLE IN PRESS
M. Pereira et al. / Signal Processing: Image Communication 18 (2003) 925945
940
Y
35
30
PSNR
25
20
15
10
94
91
88
85
82
79
76
73
70
67
64
61
58
55
52
49
46
43
40
37
34
31
28
25
22
19
16
13
10
Frame
Fig. 14. Mean PSNRs of each frame for Y component for UMTS Pedestrian channel at BER 0:01; pink: no noise; blue: proposed
MDC; yellow: SDC case.
Y
35
30
PSNR
25
20
15
10
94
91
88
85
82
79
76
73
70
67
64
61
58
55
52
49
46
43
40
37
34
31
28
25
22
19
16
13
10
Frame
Fig. 15. Mean PSNRs of each frame for Y component for UMTS Vehicular channel at BER 0:01; pink: no noise; blue: proposed
MDC; yellow: SDC case.
ARTICLE IN PRESS
M. Pereira et al. / Signal Processing: Image Communication 18 (2003) 925945
8. Conclusion
In this paper, we proposed an MDC scheme
for noisy time-varying channels. It includes
941
Fig. 16. Gaussian channel at BER 0:001: Two left columns using the proposed MDC and two right columns with the SDC TC:
ARTICLE IN PRESS
942
Fig. 17. Gaussian channel at BER 0:001: Two left columns using the proposed MDC and two right columns with the SDC TC:
Acknowledgements
The authors want to acknowledge the anonymous reviewers for their advice which improved
the quality of the paper.
ARTICLE IN PRESS
M. Pereira et al. / Signal Processing: Image Communication 18 (2003) 925945
943
Fig. 18. UMTS Pedestrian channel at BER 0:01: Two left columns using the proposed MDC and two right columns with the SDC.
References
[1] M. Antonini, M. Barlaud, P. Mathieu, I. Daubechies,
Image coding using wavelet transform, IEEE Trans. Image
Process. 4 (8) (1992) 10531060.
[2] J. Apostolopoulos, S. Wee, Unbalanced multiple description video communication using path diversity, in: International Conference on Image Processing, IEEE, Greece,
2001.
ARTICLE IN PRESS
944
[6] Q. Chen, T. Fisher, Image coding using robust quantization for noisy digital transmission, IEEE Trans. Image
Process. 7 (1998) 496505.
[7] ETSI (European Telecommunications Standards Institute), Selection procedures for the choice of radio
transmission technologies of the UMTS, tr 101 112
v3.2.0 Edition, April 1998.
[8] J. Garcia-Frias, J. Villasenor, An analytical treatment of
channel-induced distortion in entropy coded image subbands, in: Data Compression Conference, Snowbird, UT,
1997.
[9] V. Goyal, J. Kovacevic, Optimal multiple description
transform coding of gaussian vectors, in: International
Conference on Image Processing, Chicago, IL, 1998.
[10] V. Goyal, J. Kovacevic, R. Arean, M. Vetterli, Multiple
description transform of coding of image, in: International
Conference on Image Processing, Chicago, USA, 1998.
[11] V. Goyal, J. Kovacevic, M. Vetterli, Multiple description
transform coding: Robustness to erasures using tight
frame expansions, in: Proceedings of the International
Symposium Information Theory, Cambridge, MA, 1998,
p. 408.
[12] C. Hsu, A. Ortega, M. Khansari, Rate control for robust
video transmission over burst-error wireless channels,
IEEE J. Select. Areas Commn. 17 (5) (1999) 117.
[13] W. Jiang, A. Ortega, Multiple description coding via
polyphase transform and selective quantization, in: Proceedings of the SPIE Visual Communications and Image
Processing Conference (VCIP), San Jose, CA, 1999.
[14] N. Kamaci, Y. Altunbasak, R. Mersereau, Multiple
description coding with multiple transmit and receive
antennas for wireless channels: the case of digital modulation, in: GLOBECOM, San Antonio, TX, 2001,
pp. 32723276.
[15] K. Kintzley, An application of multiple description scalar
quantizers to speech coding on correlated fading channels,
Masters Thesis, Electrical Engineering Department, Texas
A&M University, College Station, TX, August 1995.
[16] W. Lee, Estimate of channel capacity in Rayleigh fading
environment, IEEE Trans. Veh. Technol. 39 (1990) 187189.
[17] S. Lin, D. Costello, M. Miller, Automatic repeat request
error control schemes, IEEE Comm. Mag. 22 (12) (1984)
517.
[18] H. Liu, M. Zarki, Performance of h263 video transmission
over wireless channels using hybrid ARQ, IEEE J. Select.
Areas Comm. 15 (9) (1997) 17751785.
[19] A. Miguel, A. Mohr, E. Riskin, Spith for generalized
multiple description coding, in: International Conference
on Image Processing, Vol. 3, IEEE, New York, 1999,
pp. 842846.
[20] A. Mohr, E. Riskin, R. Ladner, Graceful degradation
over packet erasures channels through forward error
correcting, in: Proceedings of the IEEE Data Compression
Conference (DCC), Snowbird, UT, 1999.
[21] C. Parisot, Allocations bases mod!eles et transforme en
ondelettes au l de 1eau pour le codage dimages et de
[22]
[23]
[24]
[25]
[26]
[27]
[28]
[29]
[30]
[31]
[32]
[33]
[34]
[35]
[36]
[37]
ARTICLE IN PRESS
M. Pereira et al. / Signal Processing: Image Communication 18 (2003) 925945
[38] N. Tanabe, N. Farvardin, Subband image coding using
entropy coded quantization over noisy channels, IEEE J.
Select. Areas Comm. 10 (1992) 926943.
[39] D. Taubman, M. Marcellin, Jpeg2000: Image Compression
Fundamentals, Standards, and Practice, Kluwer International Series in Engineering and Computer Science,
Vol. 642, Kluwer Academic Publishers, Boston, MA, 2001.
[40] V. Vaishampayan, Vector quantizer design for diversity
systems, in: CISS, Baltimore, MD, 1991.
[41] V. Vaishampayan, Design of multiple description scalar
quantizers, IEEE Trans. Inform. Theory 39 (3) (1993) 821834.
[42] V. Vaishampayan, Design of entropy-constrained multiple
description scalar quantizers, IEEE Trans. Inform. Theory
40 (1) (1994) 245250.
945
[43] V. Vaishampayan, S. John, Interframe balanced-multipledescription video compression, in: Packet Video Workshop, Columbia University, NY, 1998.
[44] Y. Wang, M. Orchard, A. Reibman, Multiple description
image coding for noisy channels by paring transform
coefcients, in: IEEE 1997 First Workshop on Multimedia
Signal Processing, Princeton, NJ, 1997.
[45] Y. Wang, Q. Zhu, Error control and concealment for
video communication: a review, Proc. IEEE 86 (5) (1998)
974997.
[46] S. Yang, V. Vaishampayan, Low delay communication for
Rayleigh fading channels: an application of the multiple
description quantizer, IEEE Trans. Comm. 43 (1995)
27712783.