Advanced de Interlacing Techniques 1

You might also like

Download as pdf or txt
Download as pdf or txt
You are on page 1of 14

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/250138476

Advanced de-interlacing techniques

Article · January 1996

CITATIONS READS

38 788

2 authors, including:

Erwin Bellers
V-Silicon
51 PUBLICATIONS 677 CITATIONS

SEE PROFILE

All content following this page was uploaded by Erwin Bellers on 29 October 2018.

The user has requested enhancement of the downloaded file.


Advanced de-interlacing techniques
E.B. Bellers and G. de Haan
Philips Research Laboratories
Television Systems Group
Prof. Holstlaan 4
5656 AA Eindhoven
The Netherlands
tel: +31-40-2744285 fax:+31-40-2742630

Abstract: This paper presents an overview of de- transformed to lower frequencies or even DC at the
interlacing techniques and describes some advanced motion retina. Consequently, suppression of these frequencies
compensated de-interlacing techniques in more detail. These with a temporal lowpass filter, results in artifacts for this
motion compensated methods are analyzed and mutually observer.
compared. It is concluded that the success of motion compen- Many de-interlacing algorithms have been proposed
sated de-interlacing depends critically on a protection mecha- in order to find a good balance between (hardware)
nism. A good example is presented. cost and quality of the de-interlacing process. These
de-interlacing algorithms range from simple intra-field
interpolation methods to Motion Compensated (MC)
1 Introduction methods, sometimes applying a generalization of the
sampling theorem (GST). This paper presents a brief
Historically, interlacing has been introduced in video
overview of some simple de-interlacing algorithms and
signals to reduce bandwidth. A major drawback of the
describes in more detail some interesting MC de-
interlaced scanning format on current bright high reso-
interlacing methods. The evaluation of these algorithms
lution displays is the line flicker and serration of mov-
shows a preference for the algorithm based on a gen-
ing edges. In the literature, several de-interlacing algo-
eralization of the sampling theorem which is extended
rithms have been proposed to reduce these artifacts, or
with a protection mechanism.
to serve as a base for other scan rate conversions.
Section 2 provides an overview, without pretending
De-interlacing may seem a straightforward applica-
to be complete, of several de-interlacing algorithms.
tion of general Sample Rate Conversion (SRC) theory,
Section 3 briefly introduces the 3D Recursive Search
by just realizing an vertical upconversion of a factor
(3D-RS) block matcher of [1], which will be used in the
two. However, such an upconversion is only valid if
analysis for the MC de-interlacing methods. Section 4
the signal satisfies the Nyquist criterion. Interlaced sig-
shows the result of experiments with these algorithms
nals are not vertically and temporally filtered prior to
and finally in section 5 some conclusions are drawn.
subsampling in order to satisfy this Nyquist criterion.
Sampling in vertical and temporal direction is realized
in the camera, which means that prefiltering should be
realized in the optical path, which is very complicated
2 De-interlacing techniques
in practice. Existing algorithms can be categorized in, either spatial,
In addition to this practical problem, an even more or spatio-temporal de-interlacing techniques. Spatial
fundamental problem exists. The two-dimensional up- de-interlacing algorithms are by definition intra-field al-
conversion cannot be solved correctly, since we do gorithms. Similarly, spatio-temporal algorithms are by
not know the temporal frequencies at the retina of a definition inter-field techniques. This chapter presents
movement-tracking observer. For a tracking observer, an overview of some de-interlacing methods for both
very high temporal frequencies on the screen can be categories.

1
2.1 Spatial de-interlacing techniques 2.2.1 Inter-field line averaging
Spatial de-interlacing algorithms form the simplest cat- Instead of vertical line averaging as realized in the spa-
egory, and do not demand high hardware costs. These tial linear averaging de-interlacer of subsection 2.1.2,
algorithms exploit the high correlation between the it is also possible to perform averaging in the temporal
samples in the current field and the ones to be inter- domain as shown in figure 1.
polated.
y-1 original sample
A
2.1.1 Line repetition y
median resulted sample
y+1
sample to be interpolated

Line repetition is the simplest method, and defined by: y-1


n-1 x
 y
 f (E
x , n)  y mod 2 = n mod 2
f out (E
x , n) 0 (1) y-1

 f (E
x− , n) n x-1 x+1 B
else x y
1 field number y+1
y+2
y+3
n+1 x
with xE = (x, y)t the spatial position and t for transpose,
f (E
x , n) the input field and f out (E
x , n) the de-interlaced
Figure 1: Inter-field line averaging
output frame. Note that y mod 2 = n mod 2 is true
for odd lines in odd fields and even lines in even fields
only, which will be called original lines in this paper The output sample is determined by:
(the remaining lines of the de-interlaced image will be
called interpolated lines). Note that f (E x , n) is unknown 
f (E
x , n) y mod 2 = n mod 2
for ∀E x |(((y mod 2 = 0) ∧ (n mod 2 = 1)) ∨ ((y mod f out (E
x , n) = f (E
x ,n−1)+ f (E
x ,n+1)
else
2 = 1) ∧ (n mod 2 = 0))). 2
(4)

2.1.2 Linear filtering


Linear vertical filtering can be a simple averaging filter 2.2.2 Vertical-temporal median filtering
defined by:
 Median filters, which belong to the class of order sta-
 f (E
 x ,n) y mod  2=
n mod 2 tistical filters, are very popular in image processing.
f out (E
x , n) = 0 0 (2) Experimentally, it has also proven to be a very attrac-
x − ,n)+ f (E
f (E x + ,n)

 1 1 tive method for de-interlacing [2]. The form most en-
2 else countered in this application is the three-tap vertical-
temporal median filter illustrated in figure 2.
or a larger vertical filter with impulse response de-
fined by:
y-1 original sample
 y
C result from median sample

 f (E
x , n)  y mod  2 = n mod 2 y+1 sample to be interpolated
 P
0 A
f out (E
x , n) = k f (E
x+ , n)h(k) else (3) y-1

 2k n-1 x

y

(k ∈ {. . . , −1, 0, 1, 2, . . .}) B
y+1
y+2
y+3
field number x-1
n x+1
x
2.2 Spatio-temporal de-interlacing tech-
niques Figure 2: Vertical-temporal median filtering
Spatio-temporal de-interlacing algorithms require at
least one field memory. This generally increases the The interpolated samples are found by median filter-
cost of these methods, and the increased freedom in al- ing the spatial neighbours in the vertical direction (A
gorithms increases the quality of the de-interlacing. and B) and the corresponding sample in the previous

2
field (C): with
        
f (E
x , n)  1 0


   2 = nmod 2
y mod A= f xE −
,n B= f xE − ,n

  0   1    1  

 
 f (E
x− , n), 
 −1 −1

 1 
 C = f xE − ,n D = f xE + ,n (7)
fout (E
x , n) =   (5)

 median 0 else   1    1 

 
 f (E
x+ , n), 
 0 1

 
 1 
 E = f xE + ,n F = f xE + ,n
   1 1
f (E
x , n − 1)
and α and β the integer weights where α A indicates
The underlaying assumption is that in case of station-
the number of A‘s that occur in expression (6). (For
arity, f (E
x , n − 1) is expected to have a value in be-
example 3A means A, A, A).
tween the vertical neighbours in the current field result-
A motion detector is used to determine the weights
ing in temporal interpolation, whereas in case of mo-
[4]. A large value of α increases the probability of field
tion, intra-field interpolation results, since in that case,
insertion, whereas a large β increases the probability of
the correlation between the samples in the current field
line averaging at the output.
is expected to be the highest.
Doyle et al. [6] use the same neighbourhood of sam-
ples, but incorporate edge information. If intra-field in-
2.2.3 Weighted and edge dependent median filter- terpolation turns out to be necessary because of motion
ing in the image sequence, then preferably, the interpolation
Median filtering is an effective method for de- should preserve the frequency content of the image as
interlacing. The filter of subsection 2.2.2 is based on good as possible. After determining the most stationary
the assumption of the highest correlation in the vertical direction, the signal is interpolated in that direction.
The interpolated sample X, as shown in figure 3, is
direction. However, this is not always optimal. With
determined by a luminance gradient indication calcu-
a high correlation in the diagonal direction, a diagonal lated from its direct neighbourhood:
interpolation is better. To cope with all possibilities, a  
larger neighbourhood is required. 
 (|A − F| < |C − D|)∧

 XA
(|A − F| < |B − E|)
In [3] and [4], a seven point spatio-temporal median 



filter was suggested applying an additional motion de-  
f out (E
x , n) = (|C − D| < |A − F|)∧ (8)
tector to control the importance or ‘weight’ of the indi- 
 X


C
(|C − D| < |B − E|)
vidual pixels at the input of the median filter. Justusson 



[5] called this method weighted median filtering. Figure 
Median{B, E, f (E
x , n − 1), } else
3 shows the positions of the samples used in the median
filter. where X A and X C are defined by:
y-2 original sample
XA = A+F
2 XC = C+D
2
(9)
y-1 A B C interpolated sample

y X
It is uncertain whether a zero difference between
y+1 D E F
pairs of neighbouring samples, indicates the spatial di-
y+2 rection in which the signal is stationary. For example,
x-1 x x+1 noise, or more fundamentally alias, can negatively in-
fluence the decision. An edge detector can be applied to
Figure 3: Spatial position of the samples participating switch or fade between at least two processing modes,
in the weighted median filtering each of them optimal for interpolation of a certain ori-
entation of the spatial edge.
The output is defined by: Some variations to this algorithm have also been pro-
 posed.

 f (E
x , n)  y mod 2 = nmod 2

 A, B, C, D, E, F, 
f out (E
x , n) = 2.2.4 Edge detection based de-interlacing

 median α f (E x , n − 1), else
  
β(B + E) Hentschel [7, 8] proposed to detect edges within a field,
(6) indicating vertical high frequencies, rather than motion.

3
The edge detector E D signal is defined by: motion compensated de-interlacing methods have the
         strongest physical background, as due to their inertia,
0 0 it always takes time for objects to completely disap-
E D(Ex , n) = g f xE − , n − f xE + ,n
1 1 pear, or change geometry, resulting a strong correlation
y mod 2 6= n mod 2 of successive images. This in contrast to spatial inter-
(10) polation for which there is a statistical but no physical
with g() a (non-linear) function which determines background.
when an edge is detected. The output of g() is either
0 or 1. MC median filtering
If a transient is detected, the output is determined Vertical-temporal median filtering as discussed in
by field insertion, otherwise intra-field line averaging subsection 2.2.2, generally performs line repetition in
is used as defined by: case of motion. If motion vectors are known, the corre-
 lation in the temporal domain can be increased by shift-

 f (E
x , n) y mod 2 = n mod 2

 ing the current spatial sample position, but in the previ-

 

 (y mod 2 6= n mod 2)∧ ous field, over the motion vector. This is illustrated in

 f (E
x , n − 1)
f out (E
x , n) = (E D(Ex , n) = 1) figure reffig:4.


      


 f xE−0,n+ f xE+0,n
original sample

 result from median sample

 1 1 y-2 sample to be interpolated

2
else motion vector
y
(11) y-1
n-1 x y
y+1
y+2
y+3
2.2.5 Vertical-temporal filtering field number
n x-1
x
x+1

Linear vertical-temporal filtering can be applied for de-


interlacing purposes as defined by: Figure 4: Motion compensated median filtering
 E x , n), the (sub-pixel accurate) motion vector
 f (E
x , n) y mod 2 = n mod 2 With d(E

 at spatial position xE in field n, the output is defined by:

     
f out (E
x , n) = P 0
 k, j f xE − , n − j h(k, j ) 
 f (E
x , n) y mod 2 = n mod 2

 2k 


 
 
else (k integer) 
 E x , n), n − 1
 
 f (E
x − d(E
  
(12) 

f out (E
x , n) =  0
with k = −L . . . L and vertical filter length of 2L +1 
 f xE − ,n

 median  1  else
taps, j = −J . . . 0 and the temporal filter length or 
 


 
 0
depth. Since field memories are still relatively expen-   f xE + ,n
1
sive in consumer electronics applications and tempo- (13)
ral interpolation may result in severe artifacts, J is re-
stricted to a small number. Temporal backward projection
Woods et al. [9] describe a de-interlacing algorithm
2.2.6 MC de-interlacing that attempts to find the interpolated sample in the pre-
vious fields (backward projection), which is in this pa-
MC de-interlacing algorithms attempt to interpolate in per addressed as the Temporal Backward Projection
the direction with the highest correlation, i.e. interpola- (TBP) deinterlacing algorithm. The algorithm is illus-
tion along the motion trajectory1 . However, not all tem- trated in figure 5.
poral information changes can be adequately described If the motion vector is known, the missing sample can
with object velocities. Fades, concealed or obscured be searched for in the previous field. If it is not within
background, are some of the difficulties. Nevertheless, the vicinity of an existing pixel, the motion vector is
1 The motion trajectory is defined by the line that connects identi- extended into the pre-previous field. If still the vector
cal picture parts from two successive pictures. is not addressing in the vicinity of an existing sample,

4
Woods et al. [9] propose to calculate the average be- the previous field. The output is defined by:
tween the nearest existing samples in the previous field. 
This algorithm implicitly assumes uniform motion over 
 f (E
x , n) y mod 2 = n mod 2


a two field period. 
 

 E x , n), n − 1),
 

  (E
f out x −d(E  
f out (E
x , n) = 
 0
MC time-recursive de-interlacing 
 f xE − ,n ,

 median  1  else

 

The MC Time-Recursive (TR) de-interlacing as pro- 
 
 0
  f xE + ,n ,
posed by Wang et al. [10] applies the motion compen- 1
sated previously de-interlaced field, according to: (15)
Although this is a very effective method, the median
filter can introduce alias in the de-interlaced image.

 f (E
x , n) y mod 2 = n mod 2
f out (E
x , n) = (14) Generalized Sampling Theorem based de-inter-
 E x , n), n − 1) else lacing
fout (Ex − d(E
From the sampling theorem, it is known that a ban-
dlimited signal with maximum frequency 2fs can ex-
actly be reconstructed if this signal is sampled with a
Once a perfect de-interlaced image is available, and frequency of at least f s . In 1956, Yen [11] showed
the motion vectors are accurate, standard sample rate a generalization of this theorem. Yen proved that any
conversion theory can be used to calculate the lines to signal that is bandlimited by 0.5 f s can exactly be re-
be interpolated in the current field. However, in prac- constructed by N independent sets of samples, sam-
tice, de-interlacing and motion vectors are not perfect. pled with frequency f s /N. This theorem can effectively
Besides, samples interpolated in the current frame are, be used to perform de-interlacing as first presented by
generally, (partly) based on samples interpolated in the Delogne [12] and Vandendorpe [13]. We will call it
de-interlacing process of the previous field. As an im- GST de-interlacer method (Generalized Sampling The-
plication, errors originating in an output frame, can orem).
propagate into later output frames. This is inherent to Figure 6 shows the calculation of the samples to
the recursive approach, and the most important draw- be interpolated. Samples from the previous field are
back of this method. shifted over the motion vector towards the current field
in order to create two independent sets of samples valid
To prevent serious errors from propagating, several
at the same temporal instance. An appropriate filter that
solution have been described in [10]. Particularly, the
matches the desired interpolator can calculate the out-
median filter is recommended to solve the problem. As
put sample.
a consequence, the TR de-interlacing is very similar to
the motion compensated median filter approach, how- odd even odd

ever, differs from this method in that the previous image existing samples
consists of a pre- viously de-interlaced field instead of y+4
motion compensated samples
vertical position y

y+3
calculated samples
y+2
y+1
y
original sample

sample to be interpolated y-1


y-1 field number
n-2 n-1 n
y

y+1 Figure 6: De-interlacing using a generalization of the


y+2 sampling theorem

n-2 n-1 n The calculation of the filter coefficients is explained


in the papers of Delogne [12] and Vanderdorpe [13].
Figure 5: Temporal backward projection Kalker [14] shows a generalization of this concept

5
which does not require the translation via the Fourier Using the following set of characteristics:
domain. X
According to Kalker [14], the even output samples at x o (z, n) = x(k, n) · z −k
temporal instance , are calculated by (for vertical mo- k is odd
X
tion only2 ): x e (z, n) = x(k, n) · z −k
X   k is even
0
f e (E
x , n) = f (E
x− , n)h 1 (k) + (22)
2k + 1
k
  (X (z, n) · Y (z, n))e = X o (z, n) · Yo (z, n) +
X 0
f (E
x− , n − 1)h 2 (m) (16) X e (z, n) · Ye (z, n)
2m
m (X (z, n) · Y (z, n))o = X o (z, n) · Ye (z, n) +
were f e (E x , n) is the sample from the even field, with = X e (z, n) · Yo (z, n)
field number n at position (x, y), f (E x , n − 1) the sam-
equation 21 results in:
ple from field n − 1 at vertical position (x, y), h(k) the
desired filter impulse response which models the shift Fe (z, n) = Fo (z, n − 1) · Ho (z) + Fe (z, n − 1) · He (z)
due to motion and the interpolator (which is splitted (23)
into h 1 (k) and h 2 (k)), and ( f )0 the odd field of f . Note Similarly, the odd field at temporal instance n can be
that, referring to figure 6, f (E x , n − 2) is unknown for calculated according to:
y mod 2 = 0 and similarly f (E x , n − 1) is unknown for
y mod 2 = 1. Fo (z, n) = (Fp (z, n − 1) · Ho (z))o
Referring to the vertical component only, expres- = Fo (z, n − 1) · He (z) + (24)
sion 16 can be rewritten into:
Fe (z, n − 1) · Ho (z)
X
f e (y, n) = f (y − (2k + 1), n)h 1 (k) + Solving Fe (z, n) from equations 23 and 24 results in:
k
X
f (y − 2m, n − 1)h 2 (m) (17) Fe (z, n) = a(z) · Fe (z, n − 1) + b(z) · Fo (z, n) (25)
m
with
In the z-domain, equation 17 is rewritten into:
Ho2 (z)
a(z) = He (z) − (26)
Fe (z, n) = (F(z, n)· H1 (z)+ F(z, n−1)· H2 (z))e (18) He (z)
Ho (z)
with P b(z) =
H1 (z) = Pk h 1 (k)z −(2k+1) He (z)
−2m (19)
H2 (z) = m h 2 (m)z If we assume a bilinear interpolator and a shift over 1 −
If we assume, we have the complete frame Fp (z, n − 1) α modelling the motion:
available, (from which field n − 1 is extracted), then:
H (z) = α + (1 − α)z −1 (27)
Fe (z, n − 1) = (Fp (z, n − 1))e (20) the filter coefficients are defined by:

Field n can be reconstructed shifting the samples from (1 − α)2 −2


frame n −1 over the motion vector, applying the desired a(z) = α − z (28)
α
interpolator, and extracting the desired field samples. 1 − α −1
So, b(z) = z
α
Fe (z, n) = (Fp (z, n − 1) · H (z))e (21)
As an example, consider the situation of a motion of
where H (z) describes the motion over one field period 0.5 pixels per field and a bilinear interpolator, then α =
and the desired interpolation in the z-domain. 1
2 , and the output can be calculated according to:
2 Horizontal motion is irrelevant for this explanation, since it can
   
be solved with simple sample rate conversion theory. Therefore, it is 0
set to zero for clarity
f out (E
x , n) = f xE + ,n +
1

6
1 E x , n), n − 1) + Since a median filter has shown before to be effective
f (E
x − d(E (29)
2     in elimination of such errors, it would provide a suf-
1 E x , n) + 0 , n − 1 ficient solution. However, median filtering introduces
f xE − d(E
2 2 alias in the output signal. Furthermore, only interpo-
y mod 2 6= n mod 2 lation for nearcritical velocities demands a protection
mechanism. In order to accomplish these characteris-
This is illustrated in figure 7. tics, we introduced a selective median filter in [15].
odd even odd
The output f outG ST seM (E x , n) is accordingly defined by:
calculated samples

existing samples 
 f (Ex , n)   y mod 2 = n mod 2


y+4
motion compensated samples 
 
 f outG ST (E
x ,
   n), 
  
vertical position y

y+3 1/2

 
 0 
 (near-critical
f (Ex− , n),
calculated samples

   velocities) ∧
y+2 median 1
1

 
 
y+1

 
 0  (y mod 2 6= n mod 2)
 f (E x+ , n)
-1/2

 
y

 1

y-1
field number
f outG ST (E
x , n) else
n-2 n-1 n
(31)
with
Figure 7: Calculation of the missing samples using gen-
eralized sampling f outG ST (E
x , n) =  
P 0
k f (E
x − d(E x , n) − , n)h 1 (k)+
The output samples are determined by the origi- (32) 2k + 1
nal samples of the current and the previous field only. P 0
m f (Ex − d(Ex , n) −
2m
, n − 1)h 1 (m)
Therefore, errors, due to incorrect de-interlacing, will
not propagate, which is a major advantage compared to as in expression 30.
the TR-algorithm. This approach will be referred to as the GST with
The de-interlaced output is defined by3 : selective median.
 A block-diagram of a circuit to implement this algo-
 f (E
 x , n) y mod 2 = n mod 2

 rithm is shown in figure 8.
   


 P 0
f (E
x − d(E
x , n) −
, n)h 1 (k)+
f out (E
x , n) = k
2k +1

 P

GST

 0
 m f (E
x − d(E
x , n) − , n − 1)h 2 (m)
deinterlacer


field


 2m memory


else median
de-interlaced
output
(30) mux
field
memory
interlaced
input

Selective-median GST de-interlacing = line memory


motion
estimator
The GST method is effective, but error-sensitive for
near-critical velocities as discussed in [15]. For near-
critical velocities, shifted samples are mapped closely Figure 8: Block-diagram of a possible implementation
to original sample positions. The difference between
the corresponding sample values greatly influences the
value of the output sample. As a consequence, this
difference is amplified, which boosts the noise level. 3 The 3D Recursive-search block
Therefore, inaccuracies can occur, resulting in unde- matcher
sired artifacts for which a remedy turned out to be nec-
essary. The presented MC de-interlacing algorithms have been
3 The input of the motion estimator, however, will be median fil- compared with each other, using the 3D RS block
tered in order to avoid severe errors from propagating via the estima- matcher of [1] as a basis for the common motion es-
tor, but not the output of the interlacer timator.

7
This block matcher uses a small number of candidate set C S(bEc , n), which is defined by:
vectors per block of pixels and realizes a quarter pixel    

 W 
accuracy. Furthermore, due to the inherent smooth- 

E E
(d(bc − , n) + U1 (bc , n)), 
E E 

ness constraint, it yields very coherent vector fields that 
  H  

 
closely correspond to the true-motion of objects. This E E E −W E E
C S(bc , n) = (d(bc − , n) + U2 (bc , n)),
makes this method also suitable for scan rate conver- 
  H 


 

sion. This section briefly summarizes its characteristics. 
 E E 0 

(d( b c − , n − 1)), 
2H
In block-matching motion estimation algorithms, a (34)
E bEc , n) is as-
displacement vector (or motion vector) d( where the update vectors UE1 (bEc , n) and UE2 (bEc , n) are
signed to the center bEc = (x c , yc )t , with t for trans- selected from an update set U S, defined as:
pose, of a block of pixels B(bEc ) in the current n field by
searching a similar block within a search area S A(bEc ), U S(bEc , n) = U Si (bEc , n) ∪ U S f (bEc , n) (35)
also centered at bEc , but in the previous field n − 1. This
similar block has a center which is shifted with respect with the integer updates U Si (bEc , n) defined by:
to bEc over the displacement vector d( E bEc , n). To find
d(bc , n), a number of candidate vectors CE are evaluated
E E   
applying an error measure e(C, E bEc , n to quantify block 
 0 


 , 
similarity. Figure 9 illustrates the procedure. 
  0        


 
 0
 0 1 −1  
 , , , ,

E  1 −1 0  0 
U Si (bc , n) =
 0
 0 2 −2  

 , , , ,

  2  −2   0   0 


 
−3 
_
C

 0 0 3 

yc  , , , 
W 3 −3 0 0
(36)
n-1 and the fractional updates U S f (bE c , n), necessary to re-
xc B(bc )
yc alize sub-pixel accuracy, are defined by:
image number      

 0 0 
n H 
 , , 



  0.25   −0.25   

 
−0.25 
xc

 0.25 
 , , 

0 0
Figure 9: Illustration of block-matching U S f (bEc , n) =     (37)

 0 0 


 , , 


 0.5 0.5  

 


 0.5 0.5 
 , ,  
The block of pixels (positions) is defined by: 0 0
 
0
    Either UE1 (bEc , n) or UE2 (bEc , n) equals the zero update, .
x c − W2 ≤ x ≤ x c + W2  ∧ 0
E
B(bc ) = (x, y)| From these equations it can be concluded that the
yc − H2 ≤ y ≤ yc + H2
(33) candidate set consists of spatial and spatio-temporal
with W and H the block width and block height4 , re- ‘prediction vectors’ from a 3D neighbourhood and an
spectively, and the spatial position in the image. updated prediction vector. This implicitly assumes spa-
tial and/or temporal consistency. The updating process
The candidate vectors are selected from the candidate involves updates added to either of the spatial predic-
tions. Figure 10 shows where the spatial and spatio-
temporal prediction vectors are located relative to the
4 In out experiments, W was set to 8 pixels and H to 8 frames. current block.
(Block sub-sampling is neglected for clarity. A detailed description The displacement vector d( E bEc , n) resulting from
can be found in [1] and [16]). the block-matching process, is a candidate vector CE

8
current block tive evaluation. We preferred to use the objective mea-
block in curren field surement based on the mean-square-error, since it is
yc --> spatial prediction
block in previous field used in the papers of the described de-interlacing meth-
--> spatio-temporal prediction
ods. However, it is not always a reliable indicator. New
H
tools which better reflect the relation between the mea-
xc W surement and the perception are still desired.
Several sequences with different characteristics were
Figure 10: Positions, relative to the current block, from processed in order to evaluate the discussed methods.
which the prediction vectors are taken in the 3D RS As an objective measure, the interlaced Mean-Square-
block-matcher. Error, M S E i , is calculated according to:
1 X E x , n), n − 1))2
M S E i (n) = ( f (E
x , n) − fout (E
x − d(E
N MW xE∈MW
which yields the minimum value of the error function
E bEc , n):
e(C,
(40)
with M W indicating the Measurement Window, N MW
E bEc , n) = {CE ∈ C S|e(C,
d( E bEc , n) ≤ e(VE , bEc , n)) (38) the number of samples within the measurement win-
∀(VE ∈ C S(bEc , n))} dow, and f (E x , n) the original samples in field n. All
field lines within M W contribute to the interlaced MSE.
The error function is a cost function of the luminance As an additional criterion, the Motion Trajectory In-
values, f (E x , n) with spatial position xE = (x, y)t , of the consistency, M T I [17, 15], will be calculated:
pixels in the current block and those of the shifted block 1 X E x , n), n − 1) − f out (E
from the previous field, summed over the block B(bEc ). MT I (n) = N MW ( f out (E
x − d(E x , n))2
xE∈MW
A common choice, which we also use, is the Sum of (41)
the Absolute Differences (S AD). The error function is All frame lines within M W contribute to the M T I .
defined by: A low M T I score indicates a high correlation be-
E E E E tween the previous de-interlaced image and the cur-
e(C, bc , n) = S AD(C, bc , n) (39)
X rently calculated de-interlaced image, or more specific,
= | f (E
x , n) − f out (E E
x − C, n − 1)| it is a measure for the temporal consistency along the
xE∈B(bEc ) estimated motion trajectory.
with f out (Ex , n −1) the de-interlaced output frame n −1. As indicated in [17], a problem with this measure is
that a good score is a necessary but insufficient con-
straint. Switching the output to zero, forces the M T I to
4 Evaluation zero, while the picture is seriously degraded. However,
a lower M T I coupled to an almost stable (interlaced)
Spatial de-interlacing algorithms do not require a field M S E is a strong indication for quality improvement.
memory, and are therefore relative cheap to implement. These measurements together form a useful tool to
Spatio-temporal de-interlacing algorithms require at evaluate the alternative algorithms.
least a field memory, which increases the hardware
costs, but the performance is also better then for the spa- 4.2 Results
tial algorithms. The best quality can be achieved by the
MC spatio-temporal de-interlacing methods.However, The sequences used in the evaluation are Renata (am-
the implementation cost increases again. ple vertical detail, horizontal velocities), Mobile (both
In this section, the presented MC techniques will slow horizontal and vertical velocities, including criti-
be mutually compared. As a simple spatio-temporal cal ones), Shopping (ample vertical and horizontal de-
de-interlacing algorithm, the non-MC median filtering tail, with zoom and critical velocities), RenataSpeed
technique is used as a reference. (same as Renata, but accelerated 3 times), Tokyo (slow
vertical and horizontal motion) and Bicycle (rotation,
covering and uncovering). A snapshot of these se-
4.1 Tools
quences are shown in figure 11.
Several tools can be used to evaluate the de-interlaced The results are categorized into two groups; one with
results ranging from objective measurements to subjec- (near) uniform motion (Mobile, Shopping and Tokyo),

9
(a) Renata (b) Mobile (c) Shopping

(d) RenataSpeed (e) Tokyo (f) Bicycle


Figure 11: Images from the used sequences. Images b,c,e have nearly uniform motion, whereas a,d,f are typical
non-uniform

and non-uniform motion (Renata, RenataSpeed, and 160

Bicycle).
MSE / MTI

140

4.2.1 Selective median filter 120


i

The selective median filter is activated in case of near- 100


critical velocities. The deviation around the critical ve-
locity defining the region of median filter activity is 80

called the aperture of the selective median filter. The


distance between two samples is normalized to one. 60

In a first attempt to find the optimum aperture of


|

0 0.5 1 1.5 2 2.5 3 3.5 4

the selective median, experiments were performed with aperture (*1/4)


the selective median switched on in case of near criti-
cal velocities which demand an interpolation of a quar- Figure 12: Aperture size for selective median versus
ter pixel position difference to an original sample, half M S E i (solid line) and M T I (dashed line)
pixel position difference, and three-quarter position dif-
ference. Furthermore, experiments were realized with of this paper, an aperture of a quarter pixel distance to
no selective median (no median), and maximum aper- an original sample position is used for the selective me-
ture (always median filtering). dian in the GST de-interlacer.
The average figures for all sequences have been plot
in figure 12.
4.2.2 Error measurements
From figure 12 it is possible to conclude that the opti-
mum aperture is between a quarter and half the distance The results of the different de-interlacing algorithms in
from an original sample position. In the remaining part terms of M S E i and M T I have been plot in figure 13

10
30 40

%MTI improvement
%MSEi improvement

20 20

10 Med 0
TBP
0 TR -20

GST
-10 -40
GST sel Med
-20 -60

-30 -80
Uniform Non-uniform . _ Uniform Non-uniform

Figure 13: Percentage of M S E i and M T I improvement compared to the MC median algorithm


50 100

%MTI improvement
%MSEi improvement

30 60

Med
10 TBP 20

TR
-10 GST -20

GST sel Med


-30 -60

-100
-50 R M S RS B T
R M S RS B T . _

Figure 14: Percentage M S E i and M T I improvement compared to the MC median method (R=Renata,
M=Mobile, S=Shopping, RS=RenataSpeed, B=Bicycle and T=Tokyo).

and 14. In this situation, the TR method is superior to the


Some observations from these results: other methods. The selective-median GST does
• MC de-interlacing techniques are potentially su- not always profit from the median. In the TR de-
perior to non MC techniques. In the figures 13 interlacer, the median filter is always active. Be-
and 14, this is reflected in the comparison of the sides, Bicycle does not contain vertical high fre-
median (non-MC) de-interlacer with the MC tech- quencies. Consequently, it can not benefit from the
niques. fact that the selective-median GST is capable of re-
• The selective-median GST generally outperforms constructing such frequencies. The TBP method is
the other de-interlacing techniques. Apart from invalid in this case, since the motion is not uniform
the sequence Bicycle, the M S E i and M T I scores over 2 field period for many part of the image.
are the best of all methods. This selective-median
GST method is, due to elimination of a permanent • The TBP method has a good score for sequences
median filter, able to reconstruct vertical high fre- with small velocities.
quencies in many cases, and also benefits from the
median filter in case of high risks. • The TR method performs generally better than the
MC median approach, however, this difference is
• Especially for non-uniform motion sequences, a relative small in the score of these objective crite-
protection mechanism is useful. As shown in fig- ria.
ure 13, only the protected algorithms have a good
score.
• The improvement of the GST approach due to the
• The Bicycle sequence contains complex motion. protection mechanism is significant.

11
5 Conclusions [5] B.I. Justusson, “Median filtering: Statistical
properties,” in Topics in Applied Physics, T.S.
Several de-interlacing techniques have been evaluated. Huang, Ed., vol. 43 of Two-dimensional Digi-
It has been shown that the MC de-interlacing meth- tal Signal Processing II, pp. 161–196. Springer-
ods are generally superior over the non-MC methods Verlag, Berlin, 1981.
for moving sequences. The GST algorithm with the
selective median outperforms the other de-interlacing [6] T. Doyle and M. Looymans, “Progressive Scan
methods. Only one sequence was found in which an al- Conversion using Edge Information,” in Proc.
ternative method performs better. For complex motion 3rd Int. Workshop on HDTV and beyond, Torino,
sequences (with no vertical high frequencies) the MC 1989.
median and TR method are found to be superior over
the other methods. [7] C. Hentschel, “Comparison between median fil-
It can also be concluded that the objective improve- tering and vertical edge controlled interpolation
ment of the TR method compared to the MC median for flicker reduction,” in IEEE Tr. on Consumer
filtering is relatively small for the evaluated sequences. Electronics, August 1989, vol. VE-35,3, pp. 279–
The major difference, in favour of the TR method, is ex- 289.
pected for sequences with vertical velocities, since the
[8] C. Hentschel, Fernsehen mit erhöhten
de-interlaced field in the TR algorithm allows better in- Bildqualität, Flimmerreduction durch erhöhter
terpolation filters to be used compared to field interpo- Vertikalfrequenz im Empfänger, Ph.D. thesis,
lation in the MC median filter algorithm. This is partly Technische Universität Braunschweig, Institut fur
validated in the evaluation. Nachrichtentechnik, February 1990, Dissertation.
Another important conclusion of the evaluation is
based on the fact that from the MC spatio-temporal al- [9] J.W. Woods and Soo-Chul Han, “Hierarchical
gorithms, the high correlation between pixels in the cur- Motion Compensated De-interlacing,” SPIE Vi-
rent field and the interpolated ones is only exploited in sual Communication and Image Processing VI,
the GST method. The benefit is obvious. However, it is November 1991, Boston.
concluded that the success of MC de-interlacing algo-
rithms critically depends on a protection mechanism. [10] F.M. Wang, D. Anastassiou, and A.N. Netravali,
“Time-Recursive Deinterlacing for IDTV and
Pyramid Coding,” in Signal processing: Image
References Communications 2. 1990, pp. 365–374, Elsevier.

[1] G. de Haan and P.W.A.C. Biezen, “Sub-pixel mo- [11] J.L. Yen, “On Nonuniform Sampling of
tion estimation with 3-D recursive search block Bandwidth-Limited Signals,” IRE Tr. on Circuit
matching,” in Signal Processing: Image Commu- Theory, vol. CT-3, pp. 251–257, December 1956.
nication 6. 1994, pp. 229–239, Elsevier.
[12] P. Delogne, L. Cuvelier, B. Maison, B. Van Cail-
[2] M.J.J.C. Annegarn, T. Doyle, P.H. Frencken, and lie, and L. Vandendorpe, “Improved Interpolation,
D.A. van Hees, “Video signal processing circuit Motion Estimation and Compensation for Inter-
for processing an interlaced video signal,” Euro- laced Pictures,” IEEE Tr. on Image Processing,
pean Patent Application no. EP-A 0 192 292. vol. 3, no. 5, pp. 482–491, September 1994.

[3] J. Juhula, A. Nieminen, J. Salo, and Y. Nuevo, [13] L. Vanderdorpe, L. Cuvelier, B. Maison,
“Scan conversions using weighted median filter- P. Quelez, and P. Delogne, “Motion-compensated
ing,” in Proc. IEEE Int. Conf. on Circuits and conversion from interlaced to progressive
Systems, Portland OR, May 1989, pp. 433–436. formats,” in Signal Processing: Image Communi-
cation 6. 1994, pp. 193–211, Elsevier.
[4] P. Haavisto, J. Juhola, and Y. Neuvo, “Scan rate
up-conversion using adaptive weighted median fil- [14] A.A.C. Kalker, “Motion Estimation and Compen-
tering,” in Proc. 3rd Int. Workshop on HDTV and sation for Interlaced Video,” to be published in
beyond, Torino, 1989. IEEE Tr. on Signal Processing.

12
[15] E.B. Bellers and G. de Haan, “Advanced
motion estimation and motion compensated de-
interlacing,” in Proc. of the Int. Workshop on
HDTV, Los Angeles, USA, October 1996.
[16] G. de Haan, P.W.A.C. Biezen, H. Huijgen, and
O.A. Ojo, “True-Motion Estimation with 3-D Re-
cursive Search Block Matching,” IEEE Tr. on cir-
cuits and systems for video technology, vol. 3, no.
5, pp. 368–379, October 1993.
[17] G. de Haan and P.W.A.C. Biezen, “Time-
recursive de-interlacing for high-quality television
receivers,” in Proc. of the Int. Workshop on HDTV
and the Evolution of Television, Taipai, Taiwan,
November 1995, pp. 8B25–8B33.

13

View publication stats

You might also like