Download as docx, pdf, or txt
Download as docx, pdf, or txt
You are on page 1of 8

LINEAR FILTERING FOR NOISE REDUCTION AND INTERFERENCE REJECTION

Mehrez Souden, Jacob Benesty, and Sofie`ne Affes


INRS-E´ MT, 800, de la Gauchetie`re Ouest, Suite 6900, Montre´al, H5A 1K6, QC, Canada.
{souden,benesty,affes}@emt.inrs.ca

ABSTRACT
A widely known example consists in teleconferencing
We study the linearly constrained minimum variance (LCMV) and environments where hands-free full-duplex communication devices
the minimum variance distortionless response (MVDR) filters are deployed for speech acquisition. In this situation, the target
when multiple interferers and unknown (ambient) noise coexist signal is gener- ated by one speaker while the interference is more
with a tar- get speech signal. Precisely, the LCMV is designed to likely to be gen- erated by other participants or devices located
remove all the interference signals while preserving the desired within the same enclo- sure. In addition, ambient noise is
speech and attempt- ing to reduce the ambient noise components. ubiquitous in these environments and it has to be considered. The
The MVDR is simply formulated such that the overall ambient- proposed study is motivated by the need to understand the pros and
noise-plus-interference are reduced while satisfying a distortionless cons of two main beamformers that are commonly utilized in this
constraint. We provide sim- plified expressions for both scenario: the MVDR that simulta- neously reduces overall
beamformers and show their relation- ship. Furthermore, we interference-plus-ambient-noise energy and the LCMV that totally
underline the limitations of the LCMV when the ambient noise is rejects all the interferers while attempting to reduce the ambient
present. When the latter is absent, we also prove that the MVDR noise. Both filters preserve the target speech sig- nal as well.
degenerates to the LCMV. Numerical examples are provided to
support our study. 2. DATA MODEL AND ASSUMPTIONS
Index Terms— Noise reduction, interference rejection, micro-
phone array, beamforming, linearly constrained minimum variance We consider the following frequency domain representation of the
(LCMV), minimum variance distortionless response (MVDR). investigated data model [9]
L
Yn(jω)= Gn(jω)S(jω)+ Dn,l(jω)Ψl(jω)+ Vn(jω), (1)
1. INTRODUCTION l=1

where Yn(jω), Gn(jω), S(jω), Dn,l(jω), Ψl(jω), and V n(jω) are


Microphone array signal processing has attracted a significant discrete-time Fourier transforms of the output of microphone n (n =
amount of research attention over the last few decades. Indeed, its 1, ..., N ), the channel TF between the target source and mi-
inherent spatial aperture allows for additional key functions among crophone n, the channel TF between the lth interfering source and
which is the celebrated noise reduction with no speech distortion microphone n, the target speech signal, the original lth interference
[1]–[10]. signal, and the ambient noise component seen by microphone n, re-
Earlier efforts devoted to gain insight into the functioning of spectively. Also, we define X n(jω) = Gn(jω)S(jω), In,l(jω) =
multichannel noise reduction techniques include [11] where Bitzer Dn,l (jω)Ψl(jω) for l = 1, ..., L, and I n (jω) = l=1 In,l(jω). From
now on, the frequency dependence will be removed for L the
et al. investigated the theoretical performance limits of the gener-
alized sidelobe canceller (GSC) in the case of a spatially diffuse sake of clarity.
noise. In [12], Doclo and Moonen investigated the effect of gain Our aim is to recover one of the noise-free speech components,
say X1, the best way we can (along some criteria to be defined
and phase errors in microphone arrays on broadband beamforming
later) by applying a linear filter h to the overall observation vector
and proposed robust design procedures of beamformers. In [13], y = [Y1 Y2 YN ]T . ( )T denotes the transpose operator. For no-
the theoretical equivalence between the LCMV and its GSC ···
tational convenience, we also define x, i, and v as y and w = v + i.
counterpart was demonstrated. In [8], Habets et al. studied the The output of h is
effectiveness of the MVDR when designed to remove both additive Z = hH y
noise and rever- beration. It was found that a tradeoff between = hH x + hH i + hH v, (2)
noise reduction and dereverberation has to be made. However,
where hH x is the filtered speech component, hH i is the residual in-
dereverberation still re- mains an open field for future research
terference part, hH v is the residual noise, and ( )H is the transpose-
and we would rather focus on noise reduction here. Another ·
conjugate operator. The vector containing all the channel TFs be-
notable effort to understand the functioning of the multichannel
tween the desired source and microphones’ locations is defined as
linear processing for noise reduction was published in [10]. T
g = [G1 G2 ... GN ] , while D = [d1 d2 ... dL] is the N L
Therein, Gannot and Cohen studied the noise reduction ability of ×
matrix where dl = [D1,l D2,l ... DN,l]T is the vector containing all
the channel transfer function (TF) ratios-based GSC beamformer the channel TFs associated with the lth interferer. Also, we de-
in [10]. They found that it is theoretically possible to achieve
infinite noise reduction when only one spatially coherent noise fine φaa = E AA∗{ }and Φaa = E aaH , respectively, as the
overlaps with the desired speech. In [5, 6], analytical results PSD and PSD matrix of a given random process A and a vector
showing the tradeoff between noise reduction and speech distortion of random processes a. We assume that all the interferers are
in the parameterized multichannel Wiener filtering were uncor- related which means that Φψψ = diag[φψ1 ψ1 φψ2 ψ2 ... φψLψL
established. In this paper, we are interested in the analysis of the ].
potential of microphone arrays to reduce the ambient noise and
reject multiple interferers when both types of signals overlap with
a target speech.

Authorized licensed use limited to: NATIONAL INSTITUTE OF TECHNOLOGY DURGAPUR. Downloaded on April 30,2022 at 06:44:26 UTC from IEEE Xplore. Restrictions apply.
978-1-4244-4296-6/10/$25.00 ©2010 IEEE 89 ICASSP 2010

Authorized licensed use limited to: NATIONAL INSTITUTE OF TECHNOLOGY DURGAPUR. Downloaded on April 30,2022 at 06:44:26 UTC from IEEE Xplore. Restrictions apply.
As performance measures, we use the SNR and SIR at the output 5. PERFORMANCE ANALYSIS
H
of a given filter h which are defined as SNRo (h) = h Φxxh and
H
vv
hH Φ h
In order to understand the behaviors of both filters and outline their
Φxxh
SIRo (h) = h .iiIn our processing, we are only interested in relationship, we consider the cases of spatially white ambient noise
hH Φ
reducing the additive noise w as in [4, 5, 6, 7, 9, 10] in contrast to and single interferer (L = 1).
[3, 8]. Thus, we take the first microphone as a reference and Spatially White Ambient Noise: In this scenario, the PSD ma-
accord- trix of the ambient noise is given by Φvv = σ2I where σ2 is the
φx x φx x
ingly define the input SNR = 1 1 and input SIR = 1 1 .
φ v1 v1 φi1 i1
noise power. Consequently, the MVDR is expressed as
3. MINIMUM VARIANCE DISTORTIONLESS RESPONSE −1
I − D σ2Φ−1
ψ +D D
H
DH g, (11)
hMVDR = G∗1
The optimization problem leading to the MVDR noise reduction fil-
ter is known as [1, 4, 5, 6, 7, 10] ψ
−1

H
gH I − D σ2Φ−1 + DH D DH g
hMVDR = arg min h Φww h
h while the LCMV’s expression becomes
subject to gH h = G∗1 . (3) ∗ −1
hLCMV = G 1
I − D D HD D
H
g. (12)
The MVDR solution is then written as g I − D (DH D)−1 DH g
−1ww
hMVDR = G ∗ Φ g . (4)
1 H
g Φ−1 g We see from (11) and (12) that the LCMV performs a projection 1
w
We use the following matrix inversion lemma onto the subspace orthogonal to the L-dimensional one spanned
−1
−1 −1 −1 −1 H −1 H −1 by the2 columns of D regardless of the level of the ambient noise,
Φ =Φ −Φ DΦ +D Φ D D Φ i.e., σ , and interferers, i.e., Φ . Conversely, the MVDR takes
ww vv vv ψψ vv vv ψψ
into account the energy2 terms by modifying the projection ma-
to obtain Pg trix by a term σ Φ−1 = diag 1 ... 1 , where we de-
hMVDR = G1 H ∗ , (5) ψψ INR1 INRL
where P = Φ−1 . Since we are onlyg interested
Pg in noise reduction, fine INRl ψlψ
l
φ
as the lth interference-to-ambient-noise ratio.
ww = σ2
If we further assume that the interferers have equal energies, i.e.,
it is possible to directly use the noise and noisy data statistics to INR1 = INR2 = ·· · INRL = INR, and suppose that INR → ∞
implement the above expression as [1, 5, 6]
−1
Φ Φyy − I u1 (equivalently supposing that σ2 → 0 for given levels of
interference) lim h
hMVDR = ww , (6) INR→∞ MVDR
= hLCMV . (13)
tr Φ−1 Φyy −
w N
T
where u1 = [1 0 ... 0] is an N -dimensional vector. In the sequel, This result means that the MVDR (with its simple and efficient for-
mulation) is able to perform perfect source extraction from a
we will be interested in the analysis of the MVDR’s behavior with
mixture of multiple competing speakers. Resorting to the LCMV
respect to both interference and ambient noise components that can
be better seen using (5). may not be justified if we consider the required knowledge of the
all propagation paths of the interferers and potential amplification
of ambient noise that will be shown below. When the energy of the
4. LINEARLY CONSTRAINED MINIMUM VARIANCE interferers is too low as compared to the ambient noise, we have

Here, the LCMV beamformer that we are interested in aims at re- g


lim h MVDR = G∗ , (14)
jecting all the interferers while preserving the target signal and at- INR→0 1ǁgǁ 2
tempting to reduce the ambient noise. Mathematically, we have the which is the matched beamformer that coherently adds up the
following optimization problem [2, 9] signal components to enhance the desired signal and attenuate the
H
hLCMV = arg min h Φvv h ambient noise.
h Single Interferer: In this scenario, we have shown in [7] that
subject to CH h = G∗1 u˜ 1 , (7) the MVDR can be written as a linear combination of the LCMV
where u˜ 1 = [1 0 ... 0]T is an (L + 1)–dimensional vector, and (to- tally focussed on interference removal) and a matched filter
C = [g D]. The solution to this optimization problem is known as (totally focussed on ambient noise removal) and allows for a
[9, 13] certain tradeoff between both functions by properly weighting both
−1
h = G∗Φ−1C CH Φ−1C u. (8) components de- pending on the INR. The decomposition in [7] is a
LCM 1 v v 1 particular case of
This form is not easy to interpret. Nevertheless, we can prove that (5) even though no simplified decomposition is available when
(8) can be written as (see Appendix) mul- tiple interferers are present. However, one can expect a similar
Qg
h = G∗ , (9) be- havior where all the interferers locations and energies are taken
into account by the MVDR. The analysis of the single interferer
scenario seems to be sufficient and very insightful. For the sake of
simplic-
LCMV 1
gH Qg ity here, we further assume that the ambient noise is white and that
where the environment is anechoic. Consequently, the SIR and SNR at the
−1
output of the LCMV and MVDR filters are given by [7]
Q = Φ−1 − Φ−1D DH Φ−1D DH Φ−1. (10)
It is interesting to see
v how vsimilar both expressions
v (5) and
v (9) are. SNRo (hMVDR) = N SNR, (15)
N 2 INR2 κ(1−κ)
1+
[1+N INR(1−κ)]2
Indeed, they only differ by the choice of the projection-like matrix
1
P or Q. Further interpretations are given below. At the same time, no distortion is introduced to the desired signal.
Authorized licensed use limited to: NATIONAL INSTITUTE OF TECHNOLOGY DURGAPUR. Downloaded on April 30,2022 at 06:44:26 UTC from IEEE Xplore. Restrictions apply.
90

Authorized licensed use limited to: NATIONAL INSTITUTE OF TECHNOLOGY DURGAPUR. Downloaded on April 30,2022 at 06:44:26 UTC from IEEE Xplore. Restrictions apply.
[1 + N INR (1 − κ)]2 MVDR’s beampattern also exhibits larger sidelobes when Δθ is
SIRo (hMVDR) = SIR, (16)
κ de- creased. To explain these behaviors, recall that in the
formulation of the optimization problems leading to the LCMV
and MVDR, the ar-
SNRo (hLCMV) = N (1 − κ) SNR, (17) ray response towards the source direction is forced to the unity
gain. This constraint is always satisfied (the maxima of the LCMV
beam- pattern correspond to values larger than one and the results
SIRo (hLCMV) = +∞, (18) presented in Fig. 2 are normalized with respect to the largest
value). When the interferers move towards the target source, it
becomes harder for the
2
where κ = H
g d LCMV to satisfy two contradictory constraints: switching the array
|
ǁgǁ2 ǁdǁ2 is the collinearity factor that has a critical gain from zero to one. This fact causes some instabilities that trans-
effect on the performance of both beamformers. Essentially, when late into the appearance of sidelobes and displacement of the
this factor is increased toward 1, an amplified ambient noise can maxi- mum far from the interference, thereby leading the array to
(h )
be obtained at the output of the LCMV (i.e., SNRo LCMV = N (1 capture
κ) < 1). The MVDR can also amplify the ambient noise S if the
− and collinearity factor are sufficiently high (in this the ambient noise which spans the whole space. Finally, it is obvi-
input INR
ous that when Δθ increases, the two filters perform relatively well
case, the ambient noise level takes very small value and its in terms of noise (for the LCMV) and interference (for the
amplifi- cation is not disastrous as it may be the case for the MVDR) re- moval. In the second setup, we consider a reverberant
LCMV that may dramatically amplify it regardless of its energy). enclosure and show the resulting output SNR and SIR for Δθ = 30◦
When multiple interferers are present, the likelihood of having and 10◦ for
one of them in the vicinity of the target source signal becomes the frequency span 0 to 4 kHz in Fig. 3. We notice that the infinite
higher. Consequently, it is more likely that the ambient noise SIR gain achieved by the LCMV may come at the price of very
becomes am- plified, especially by the LCMV. In addition, the low output SNR as compared to the MVDR, especially for the low
estimation of the channel TFs associated with the interferences fre- quency range (lower than 1000 Hz). When we compare Figs.
becomes more and more complicated. Conversely, the MVDR 3 (a) and (c) to Figs. 3 (b) and (d), respectively, we notice that
allows for blind interfer- ence and ambient noise removal since the when the
knowledge of the channel TFs of the interferers is not required as interference is placed near the target source, a remarkable perfor-
shown in (6) where only the PSD matrices of the interference- mance degradation is observed in terms of output SNR, especially
plus-ambiant-noise and noisy data are used. for the LCMV filter, and in terms of output SIR for the MVDR.

6. NUMERICAL EXAMPLE 7. CONCLUSIONS

For illustration purposes, we consider a uniform linear array In this paper, we studied the ability of the MVDR and LCMV
(ULA) of 4 microphones with δ being the inter-microphone beamformers to simultaneously reduce the ambiant noise and
spacing, located in a reverberant enclosure. The microphone reject the multiple interferers when both types of signals overlap
elements are placed on the axis (y0 = 1.016, z0 = 1.016) m with the to the target speech signal. We elaborated simplified expressions
center of the ar- ray being at (x0 = 1.524 m, y0, z0) and the nth for both beamformers to illustrate the tradeoff of noise reduction
microphone at (x0 N−2n+1 δ, y0, z0) with n = 1, ..., N . In the first and interference rejection. Specifically, while the LCMV totally
example,
− we2 consider the case of an anechoic room (T60 = 0 ms) focusses on interference rejection and may dramatically increase
where the channel transfer functions are pure delay-based. In the the ambiant noise, the MVDR seems to be more promising since
second sce- it tradeoffs both functions by taking into account the energies of
nario, we consider a reverberant enclosure simulated using the both types of signals. In the particular case where the ambient
image method [14] with a reverberation time T60 = 300 ms. The noise is weak (as compared to the level of the interferers), the
source and the two interferers are located in the same enclosure at MVDR is more focussed on interference rejection and degenerates
a dis- tance 2.5 m away from the array center and at the azimuthal to the LCMV without requiring the knowledge of the channel TFs
angles
θs = 90◦, θi1 = θs Δθ and θi2 = θs + Δθ which are measured counter- associated to the interferers. Conversely, when the interferers are
− axis. Δθ will be chosen depend- ing on
clockwise from the array weak (as compared to the ambient noise), the MVDR allows for
the examples investigated below. An additive spatially white better ambient noise reduction in contrast to the LCMV.
noise was added to model the ambient noise. We choose the input
SIR and SNR to be equal to 3 dB and 0 dB, respectively. APPENDIX: PROOF OF THE NEW LCMV EXPRESSION

In the first setup, we investigate the effect of the angular
separa-
In order to prove (9)–(10), we first see that
tion Δθ on the performance of the MVDR and LCMV g Φ−1 g gH Φ−1vD
CH Φ−1C = Hv
beamformers
at a frequency f = 1 kHz. The inter-microphones spacing is
v
DH Φ−1g DH Φ−1D
. (19)
such that δ = c (c = 343 ms−1 is the speed of sound). Figs. 1 vv vv
2f
(a) and (b) show the effect of Δθ on the SIR and SNR at the out- By using (19), we can compute (20) that we plug into (8) to ob-
put of both beamformers. When Δθ decreases, the output SNR of tain (21). Finally, we use the matrix inversion lemma to expand gH
−1
the LCMV is decreased. It is even much lower than the input SNR Φ−1gDH Φ−1D DH Φ−1ggH Φ−1D and use it in (21) to obtain
for Δθ < 15◦. In contrast, the output SNR of the MVDR is al- v v
the LCMV expression in−(9)–(10). v v

most unaffected while very low output SIR values are obtained for
small Δθ. To gain a better understanding of these results, we pro-
vide the normalized beampatterns of both beamformers in Fig. 2 −1
for Δθ = 30◦ and 10◦. We see that when Δθ is small, two ma- CH Φ−1vC u 1 = (20)
jor behaviors of the LCMV emerge: displacement of the main −1
gH Φ−1g − gH Φ−1D DH Φ−1D DH Φ−1g −
H−1 v H −1 v H −1 v
H −1 −1 H v −1
away from the source location and appearance of sidelobes. The − g Φ gD Φ D − D Φ gg Φ D D Φ g ,
v v v v v
Authorized licensed use limited to: NATIONAL INSTITUTE OF TECHNOLOGY DURGAPUR. Downloaded on April 30,2022 at 06:44:26 UTC from IEEE Xplore. Restrictions apply.
91

Authorized licensed use limited to: NATIONAL INSTITUTE OF TECHNOLOGY DURGAPUR. Downloaded on April 30,2022 at 06:44:26 UTC from IEEE Xplore. Restrictions apply.
hLCMV
= Φ−1g (21)
vv
G∗1 −1 −1
g Φ g − g Φ D DH Φ D
H H −1 −1 −1
DH Φ −1g 6
−1 H −1 v H −1 v H −1 v H −1 v −1
H 1 5 LCMV (SIR = )
−Φvv D g Φvv gD Φvv D − D Φvv gg Φvv D D Φvv g. 0
40 MVDR
Input SIR
30
−10

8. REFERENCES 20
−20
10

[1] J. Benesty, J. Chen, and Y. Huang, Microphone Array Signal −30


LCMV
MVDR 0
Processing. Springer-Verlag, Berlin, Germany, 2008. Input SNR
−40 −10
0 20 40 60 80 100 0 20 40 60 80 100
[2] J. Benesty, J. Chen, Y. Huang, and J. Dmochowski, “On Δ [degree] Δ [degree]

microphone-array beamforming from a MIMO acoustic (a) Output SNR (b) Output SIR
signal
processing perspective,” IEEE Trans. Audio, Speech, Fig. 1. Performance of the MVDR and LCMV vs. Δθ: anechoic
Language Process., vol. 15, pp. 1053–1065, Mar. 2007. room.
[3] S. Affes and Y. Grenier, “A signal subspace tracking algo-
rithm for microphone array processing of speech,” IEEE
Trans. Speech, Audio Process., vol. 5, pp. 425–437, Sept.
1997.
[4] S. Gannot, D. Burshtein, and E. Weinstein, “Signal enhance-
ment using beamforming and nonstationarity with 90 1 90 1
applications to speech,” IEEE Trans. Signal Process., vol. 49, 120 0.8 60 120 0.8 60
0.6 0.6
pp. 1614– 1626, Aug. 2001. 150 0.4 30 150 0.4 30
0.2 0.2
[5] M. Souden, J. Benesty, and S. Affes, “New insights into non-
180 0 180 0
causal multichannel linear filtering for noise reduction,” in
Proc. IEEE ICASSP, 2009, pp. 141–144.
[6] M. Souden, J. Benesty, and S. Affes, “On optimal frequency- 210330 210330
domain multichannel linear filtering for noise reduction,” LCMV
240 MVDR 270
3 LCMV
240 MVDR 270
3

IEEE Trans. Audio, Speech, Language Process., vol. 18, pp. (a) Δθ = 30 ◦
(b) Δθ = 10◦
260–
276, Feb. 2010.
Fig. 2. Beampatterns of the MVDR and LCMV filters; the source is
[7] M. Souden, J. Benesty, and S. Affes, “On optimal at 90◦ and two interferences at 90◦ ± Δθ: anechoic room.
beamform- ing for noise reduction and interference
rejection,” in Proc. IEEE WASPAA, 2009, pp. 109–112.
[8] E. Habets, J. Benesty, I. Cohen, and S. Gannot, “New
insights into the MVDR beamformer in room acoustics,”
IEEE Trans.
Audio, Speech, Language Process., vol. 18, pp. 158–170, 10 10
Jan. 2010. 6 6
0 0
[9] S. Markovich, S. Gannot, and I. Cohen, “Multichannel −5 −5

eigenspace beamforming in a reverberant noisy environment −10 −10


−15 −15
with multiple interfering speech signals,” IEEE Trans. Audio, −20 −20
Speech, Language Process. vol. 17, no. 6, pp. 1071– −25
LCMV
−25
LCMV
1086,Aug. 2009. −30
−35
MVDR
Input SNR
−30
−35
MVDR
Input SNR

[10] S. Gannot and I. Cohen, “Adaptive beamforming and −40


0 1000 2000 3000 4000
−40
0 1000 2000 3000 4000
Frequency [Hz] Frequency [Hz]
postfiter- ing,” in Springer Handbook of Speech Processing,
J. Benesty,
M.M. Sondhi, and Y. Huang, editors, Springer-Verlag, 2007, (a) Δθ = 30◦ (b) Δθ = 10◦
Chapter 47, pp. 945–978.
25 25 LCMV (+)
[11] J. Bitzer, K.U. Simmer, and K.-D. Kammeyer, “Theoreti- MVDR
LCMV (+)
MVDR
20 20 Input SIR
cal noise reduction limits of the generalized Input SIR
15 15
sidelobecanceller (GSC) for speech enhancement,” in Proc.
10 10
IEEE ICASSP, 1999, vol. 5, pp. 2965–2968.
5 5
[12] S. Doclo and M. Moonen, “Design of broadband beamform- 0 0
ers robust against gain and phase errors in the microphone −5 −5
ar- ray characteristics,” IEEE Trans. Signal Process., vol. 51, −10 −10
pp. 2511–2526, Oct. 2003.
0 1000 2000 3000 4000 0 1000 2000 3000 4000
[13] B. R. Breed and J. Strauss, “A short proof of the equivalence Frequency [Hz] Frequency [Hz]

of LCMV and GSC beamforming,” IEEE Signal Process.


(c) Δθ = 30◦ (d)Δθ = 10◦
Lett., vol. 9, pp. 168–169, Jun. 2002.
[14] J.B. Allen and D.A. Berkley, “Image method for efficiently Fig. 3. SNR and SIR at the output of the LCMV and MVDR filters:
simulating small-room acoustics,” J. Acoust. Soc. Amer., vol. reverberant room.
65, pp. 943–950, Apr. 1979.

Authorized licensed use limited to: NATIONAL INSTITUTE OF TECHNOLOGY DURGAPUR. Downloaded on April 30,2022 at 06:44:26 UTC from IEEE Xplore. Restrictions apply.
92

Authorized licensed use limited to: NATIONAL INSTITUTE OF TECHNOLOGY DURGAPUR. Downloaded on April 30,2022 at 06:44:26 UTC from IEEE Xplore. Restrictions apply.

You might also like