Professional Documents
Culture Documents
5 Guti Crespo Ieee It 08
5 Guti Crespo Ieee It 08
Abstract—For the engineering community, Gray’s tutorial he succeeded in conveying the main ideas of the Szegö theory
monograph on Toeplitz and circulant matrices has been, and to a wider audience.
remains, the best elementary introduction to the Szegö theory on In the paper of Gazzah, Regalia, and Delmas [3], the Szegö
large Toeplitz matrices. In this paper, the most important results
of the cited monograph are generalized to block Toeplitz (BT) theorem for Toeplitz matrices with symbols in the Wiener al-
matrices by maintaining the same mathematical tools used by gebra is generalized to block Toeplitz (BT) matrices by using
Gray, that is, by using asymptotically equivalent sequences of asymptotically equivalent sequences of matrices (i.e., Gray’s
matrices. As applications of these results, the geometric minimum tools) and the concept of vec-permutation matrix.
mean square error (MMSE) for both an infinite-length multi- In this paper, we go a step further. By using again the concept
variate linear predictor and an infinite-length decision feedback
equalizer (DFE) for multiple-input–multiple-output (MIMO) of asymptotically equivalent sequences of matrices, we extend
channels, are obtained as a limit of the corresponding finite-length the most important results (the Szegö theorem among others) of
cases. Similarly, a short derivation of the well-known capacity of the Gray tutorial [2], to BT matrices with continuous symbols
a time-invariant MIMO Gaussian channel with intersymbol inter- (i.e., the with are given by the Fourier coefficients of a
ference (ISI) and fixed input covariance matrix is also presented. continuous matrix function), which is a less restrictive
Index Terms—Block circulant matrices, block Toeplitz (BT) condition than being in the Wiener algebra.
matrices, channel capacity, Frobenius norm, functions of ma- In [4], Bose and Boo also derived asymptotic results of BT
trices, Hermitian matrices, multiple-input–multiple-output matrices based on Gray’s tools. However, they look at a com-
(MIMO) minimum mean square error decision feedback equal-
pletely different problem to the one considered in our paper.
izer (MMSE-DFE), (MMSE) linear predictor, multivariate
stationary processes, spectral norm. Here, is an BT matrix with blocks, and we look,
among other things, at the asymptotic behavior of the eigen-
values of as goes to infinity while remains fixed (specifi-
I. INTRODUCTION cally, we study the Szegö theorem for continuous matrix-valued
symbols). On the contrary, in [4], is an block Toeplitz
matrix with Toeplitz blocks (BTTB) of order , and Bose
T HE most famous theorem on large Toeplitz matrices is
the Szegö theorem [1]. This result deals with the asymp-
totic behavior of the eigenvalues of an Hermitian Toeplitz
and Boo look at the asymptotic behavior of the eigenvalues of
when both and go to infinity (specifically, they study
matrix , where the complex numbers with the Szegö theorem for certain two-variate symbols). It should
are the Fourier coefficients of a bounded function called be pointed out that their derivation cannot be particularized to
the symbol of the sequence . Unfortunately, the level of the case where is kept fixed.
mathematical sophistication for understanding [1] is often be- Finally, it should be mentioned that, based on matrix analysis
yond the level provided by a typical engineering background. but without using the Gray concept of asymptotically equiva-
For this reason, Gray simplified the Szegö theory in [2] and by lent sequences of matrices, Tilli [5] and Serra-Capizzano [6] de-
assuming more restrictive conditions, namely, Toeplitz matrices rive results of Szegö type for Lebesgue integrable multivariate
matrix-valued symbols that are stronger than the results given
with symbols in the Wiener algebra (i.e., ).
Since his approach is purely matrix-analytic (mainly based on by Bose and Boo [4] and the ones presented here. Such gener-
his concept of asymptotically equivalent sequences of matrices), ality is based on introducing another concept of approximation
of matrix sequences, namely, the notion of approximating class
of sequences (a.c.s.), which is more general than that of asymp-
Manuscript received February 26, 2007; revised May 13, 2008. Current
version published November 21, 2008. This work was supported in part by
totically equivalent sequences. In the Appendix, we report some
the Spanish Ministry of Education and Science, by the European Social Fund, analysis based on this new approach, since it has the potential of
and by the European Regional Development Fund through the Torres-Quevedo becoming a further tool available in the engineering community.
program, the MultiMIMO Project TEC2007-68020-C04-03, and the program
CONSOLIDER-INGENIO 2010 CSD2008–00010 COMONSES.
Therefore, the main contribution of our paper is to present
The authors are with Electronics and Communications Department, R&D a plain generalization to BT matrices of Gray’s famous tuto-
Center CEIT, 20018 San Sebastián, Spain and University of Navarra (Tecnun), rial [2], by maintaining the Gray tools. The theory on BT ma-
20018 San Sebastián, Spain (e-mail: jgutierrez@ceit.es; pcrespo@ceit.es). trices is, in general, useful in signal processing, communica-
Communicated by A. Høst-Madsen, Associate Editor for Detection and Es-
timation. tions, and information theory, since this kind of matrices arise,
Digital Object Identifier 10.1109/TIT.2008.2006401 for instance, as covariance matrices of multivariate stationary
0018-9448/$25.00 © 2008 IEEE
Authorized licensed use limited to: Pedro M Crespo. Downloaded on December 1, 2008 at 03:32 from IEEE Xplore. Restrictions apply.
5672 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 54, NO. 12, DECEMBER 2008
where denotes trace, denotes conjugate transpose, and is C. Block Circulant Matrices
a column vector. We now review a special type of BT matrices.
The Frobenius norm and the spectral norm are special in-
stances of unitarily invariant norms (see, e.g., [7] or [8]). We Definition 5: An block circulant matrix with
recall that a norm is unitarily invariant if blocks is one having the form
for every matrix and for every unitary matrices
and . .. ..
. .
The next definition is due to Gray (see [2] or [9]).
..
.
Definition 2: Let and be matrices for all . .. .. .. ..
We say that the sequences and are asymptotically . . . .
equivalent, and write , if
Authorized licensed use limited to: Pedro M Crespo. Downloaded on December 1, 2008 at 03:32 from IEEE Xplore. Restrictions apply.
GUTIÉRREZ-GUTIÉRREZ AND CRESPO: ASYMPTOTICALLY EQUIVALENT SEQUENCES OF MATRICES 5673
where is the Kronecker product, is the From Lemma 1, the theorem holds for any polynomial .
Fourier unitary matrix Now, let be an arbitrary continuous function on . Since
the spectral norm is unitarily invariant, we deduce that
III. A NEW RESULT ON ASYMPTOTICALLY EQUIVALENT for every . This ends the proof.
SEQUENCES OF MATRICES This result will be very useful as shown in the following sec-
Theorem 1: Consider two asymptotically equivalent se- tion. It should be mentioned that Theorem 1 can be stated in a
quences of Hermitian matrices and . Let be more general way (see the Appendix).
a closed interval containing the eigenvalues of the matrices of
these sequences. Then IV. LARGE HERMITIAN BLOCK TOEPLITZ MATRICES
In this section, we will derive several fundamental theorems
on Hermitian BT matrices with blocks, for large
where denotes the set of all continuous complex func- .
tions on . Fix , and let and be matrices for all
Proof: Because and are sequences of Her- . We say that the sequences and are asymp-
mitian matrices, (1) implies that there is such that totically equivalent, and write , if they satisfy the
. There- expressions (1) and (2). Note that Lemma 1 and Theorem 1 still
fore, there exists such an interval , for instance, . hold in this more general context.
Authorized licensed use limited to: Pedro M Crespo. Downloaded on December 1, 2008 at 03:32 from IEEE Xplore. Restrictions apply.
5674 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 54, NO. 12, DECEMBER 2008
In the sequel, let be an matrix tian. By reasoning as in [2] and for the scalar case, it is not dif-
function, which is continuous and -periodic. We denote by ficult to show (see, e.g., [14, p. 871]) that the eigenvalues of the
the sequence of Fourier coefficients of matrices are contained in the closed interval with finite
endpoints
(4)
(7)
(5)
Therefore, there exists such that
for all , where denotes the zero matrix.
Suppose moreover that is Hermitian for all . Con-
(9)
sequently, the and matrices will also be Hermi-
Authorized licensed use limited to: Pedro M Crespo. Downloaded on December 1, 2008 at 03:32 from IEEE Xplore. Restrictions apply.
GUTIÉRREZ-GUTIÉRREZ AND CRESPO: ASYMPTOTICALLY EQUIVALENT SEQUENCES OF MATRICES 5675
Furthermore, because the Frobenius norm is unitarily invariant, By using the Cauchy–Schwarz inequality and the fact that
we deduce that is a unitary matrix, we see that
The product of two BT matrices and is not, in As in the earlier case, based on advanced tools from func-
general, BT. However, the next theorem shows that this product tional analysis, a stronger result linking to
is asymptotically equivalent to the BT matrix of the product of can be obtained (see [15, Sec. 2.7]).
the two symbols, i.e., . By particularizing in Theorem 3 to
, and , we obtain the next result.
Theorem 2 (Product of BT Matrices): Let
be Hermitian matrix functions, which are continuous and Theorem 4 (Exponential, Inverse, and Square Root of BT Ma-
-periodic. If is also Hermitian, then trices): Let be as in Lemma 5.
. 1) Then .
Proof: By combining Lemmas 1 and 5, we have 2) If , then .
3) If , then .
. We now derive the Szegö theorem for BT matrix sequences
It should be mentioned that Widom, by using advanced tools with continuous symbols.
from functional analysis, proved a stronger and more general Theorem 5 (Trace of Functions of BT Matrices): Let be as
result on the product of two BT matrices that can be found, for in Lemma 5. Then
instance, in [6, Sec. 3.3.1], [15, p. 192], or [16].
From Theorem 2, with
, i.e., . Based on Theorem
1 and Lemma 5, we will show that this is also true for more
general functions. for all .
Theorem 3 (Functions of BT Matrices): If is as in Lemma Proof: Because for all matrix
5, then . , from Theorem 3, we have
Proof: Fix . We begin by showing
that is continuous on . We
will denote by
an eigenvalue decomposition of for all . By
the Stone–Weierstrass approximation theorem, there exists a
sequence of polynomials that converges uniformly to The extension of Theorem 5 to Lebesgue integrable symbols
on . Therefore, given , we can find can be found in [5] and [6]. It should be mentioned
such that that when is bounded , the assumption of bounded
support for the test function is not necessary. Moreover, for
, the assumption of bounded support
Authorized licensed use limited to: Pedro M Crespo. Downloaded on December 1, 2008 at 03:32 from IEEE Xplore. Restrictions apply.
5676 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 54, NO. 12, DECEMBER 2008
can be replaced by an assumption of mild growth at infinity. the theorem for the Frobenius norm. Using the Cauchy–Schwarz
More precisely, it is required that the test functions are inequality, we have
continuous and is bounded (see [17]).
The following theorem is a straightforward consequence of
Theorem 5, and it deals with determinants instead of traces.
Theorem 6 (Determinant of Functions of BT Matrices): Let
be as in Lemma 5. Then
Authorized licensed use limited to: Pedro M Crespo. Downloaded on December 1, 2008 at 03:32 from IEEE Xplore. Restrictions apply.
GUTIÉRREZ-GUTIÉRREZ AND CRESPO: ASYMPTOTICALLY EQUIVALENT SEQUENCES OF MATRICES 5677
0 1 taps.
Fig. 2. MIMO communication system: discrete time channel and equalizer.
Fig. 1. Multivariate forward linear predictor with n
.. .. .. .. .. .. ..
. . . . . . . (13)
.. .. .. .. ..
. . . . .
Authorized licensed use limited to: Pedro M Crespo. Downloaded on December 1, 2008 at 03:32 from IEEE Xplore. Restrictions apply.
5678 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 54, NO. 12, DECEMBER 2008
by , where the are matrices. The perhaps for the upper left corner, and therefore,
channel output can be written as . The matrix can be partitioned as
Thus
where and
denote the input and noise, respectively. The input components
symbols are discrete uncorrelated random variables with
and . The observation noise is un-
Combining Lemmas 3 and 4, we deduce that
correlated with the input, and its samples are uncorrelated
random variables with zero mean and variance .
The equalizer block is a standard DFE that comprises an
tapped delay line feedforward filter (FFF), and an infinite
tapped delay line feedback filter (FBF). We linearly estimate the
input vector , where is the decision (14)
delay, from the channel outputs and
from the inputs (assuming Without any loss of generality, suppose that
correct previous decisions). Thus, such an estimate has where is a function such that
the form , and (for example,
). Consequently, for sufficiently large , the
matrix is equal to and by using (14), we have
Authorized licensed use limited to: Pedro M Crespo. Downloaded on December 1, 2008 at 03:32 from IEEE Xplore. Restrictions apply.
GUTIÉRREZ-GUTIÉRREZ AND CRESPO: ASYMPTOTICALLY EQUIVALENT SEQUENCES OF MATRICES 5679
, where is given by expression All the Schatten norms are special instances of unitarily in-
(13), variant norms (see, e.g., [7] or [8]).
and . Using these norms, the definition of asymptotically -equiv-
Assuming we restrict the covariance of the input random alent sequences of matrices is as follows.
vector to be , the maximum
Definition 7: Consider a strictly increasing sequence of nat-
of the normalized mutual information is ural numbers . Let and be matrices for all
bits per channel use, and it . We say that the sequences and are asymp-
is obtained when the input process is Gaussian, with zero mean totically -equivalent, , and write , if
[23]. By denoting , the signal-to-noise ratio at the
input of the MIMO channel, the channel capacity is now given (15)
by the limiting expression and
(16)
with
APPENDIX
OTHER CONCEPTS OF APPROXIMATION OF MATRIX SEQUENCES where and depend only on and, moreover
In this Appendix, we present the notions of asymptotically
-equivalent sequences of matrices and that of a.c.s. Both are
concepts of approximation of matrix sequences that generalize
Gray’s concept of asymptotically equivalent sequences. We The a.c.s. notion is more general than that of asymptotically
present these two notions to show that Theorem 1 can be -equivalent sequences. In fact, we show next that if
stated in a more general way, by proving that this theorem can , then is an a.c.s. for . Moreover, the
be considered as a special instance of certain known results, requirement of uniform boundedness in (15) is not necessary.
namely, [25, Th. 2.1 and 2.2]. Lemma 6: Consider and a strictly increasing se-
We begin with the definition of asymptotically -equivalent quence of natural numbers . Let and be ma-
sequences of matrices that can be obtained from the definition trices for all . If expression (16) holds, then
of asymptotically equivalent sequences of matrices by substi- is an a.c.s. for .
tuting the Frobenius norm for any other Schatten -norm. We Proof: Let be a sin-
first review these well-known norms (see, e.g., [7] or [8]). gular value decomposition of the matrix with
Definition 6: The Schatten -norm of an matrix . We define
, is the norm on of the vector of the singular
values of , i.e.,
if
if
where , with , are the singular values of
counted with their multiplicities. Obviously, , because
Notice that (the spectral radius if is Her- . On the one hand
mitian) and (the norm on of the vector
containing the eigenvalues of when is Hermitian).
Authorized licensed use limited to: Pedro M Crespo. Downloaded on December 1, 2008 at 03:32 from IEEE Xplore. Restrictions apply.
5680 IEEE TRANSACTIONS ON INFORMATION THEORY, VOL. 54, NO. 12, DECEMBER 2008
Authorized licensed use limited to: Pedro M Crespo. Downloaded on December 1, 2008 at 03:32 from IEEE Xplore. Restrictions apply.