Professional Documents
Culture Documents
Wiesel 2006
Wiesel 2006
Wiesel 2006
Invited Paper
motivated the design of transmit MF precoders and transmit In the present paper, we integrate the ideas above in the con-
rakes, which perform better in low SNR. In addition, transmit text of MIMO precoding for fixed receivers. The design of most
minimum mean-squared error (MMSE)1 precoders that tried to of the previous precoders is based on minimizing variants of the
compensate for the performance in the different SNR regions common MSE criterion. This criterion is usually computation-
were derived [5]–[10]. ally attractive and performs quite well. However, as far as the ap-
Other precoders using different kinds of optimization criteria plications are concerned, the interesting and relevant criteria are
were also derived. Variants of the previous precoders were dis- BER and capacity, which are intimately associated with maxi-
cussed in [22]–[27]. Linear precoders based on an approximate mizing SINR [1]. Unlike joint optimization, optimizing the pre-
maximum-likelihood approach and maximum asymptotic mul- coder to minimize MSE does not necessarily maximize SINR
tiuser efficiency with different power constraints were derived when the receiver is fixed. Thus, following the transmit beam-
in [28]. A linear precoding technique based on a decomposition forming approach, we focus on SINR-based criteria, and, in par-
approach was proposed in [29], and a linear precoder design for ticular, try to optimize the worst SINR. We consider two design
nonlinear maximum-likelihood (ML) receivers was discussed strategies: The first maximizes the worst SINR subject to an av-
in [30]. Among the nonlinear precoders are the Tomlinson Ha- erage power constraint, and the second minimizes the required
rashima precoder (THP) [31], the “Dirty Paper” precoder [32], average power subject to a constraint on the worst SINR. We
and the vector perturbation precoder [9]. Another nonlinear pre- prove that the proposed precoders have the attractive property
coder that optimizes the transmitted symbols vector itself was of equal performance among all the subchannels.
derived in [33]. Our precoder design is based on the powerful framework of
The problem of precoder design is highly related to other convex optimization theory [21], which allows for efficient nu-
problems in the literature. In this paper, we consider the design merical solutions using standard optimization packages [46]. A
of linear precoders for fixed linear receivers. A related problem brief review of such programs and their standard forms is pro-
is the problem of jointly optimizing the precoder/transmitter and vided in Section III. We then cast the precoder design problems
the receiver, which has been treated, e.g., in [12]–[20]. The de- as standard conic optimization packages. Specifically, we show
sign of the optimal signatures (which can be considered as a that the power optimization problem can be formulated as a
linear precoding scheme) for matched MMSE receivers was dis- second-order cone program (SOCP) [47] or a semidefinite pro-
cussed in [34] and [35], whereas signature design for matched gram (SDP) [48] [otherwise known as a linear matrix inequal-
decision feedback receivers was explored in [36]. One of the ities (LMI) program]. The SINR optimization can also be for-
interesting properties of these joint designs is that maximizing mulated as a standard conic program known as the generalized
the signal-to-interference-plus-noise ratios (SINRs) is related to eigenvalue problem (GEVP) [49].
minimizing the MSE [16]. Thus, although different criteria have Next, we derive optimality conditions for both of the de-
been explored, most of the research was dedicated to variants of sign problems by analyzing the Karush–Kuhn–Tucker (KKT)
the MMSE criterion. conditions for conic programs. We derive a simple expres-
Another related problem is the joint design of rank-one sion for the structure of the optimal precoder as a function
transmit beamforming design and optimal power control of the dual variables. The conditions can be used to verify
[37]–[39]. This problem is equivalent to precoding when there whether a proposed solution is optimal. For example, using
is no transmitter, i.e., the precoder itself is the transmitter. At these conditions, it is easy to show that the MMSE precoder
first glance, it seems that the precoding problem can be solved proposed in [5]–[7] does not necessarily maximize the worst
by addressing the transmit beamforming problem and then SINR, except in the case of a symmetric channel. Another
compensating for the fixed transmitter. Unfortunately, this is use for these conditions is as a stopping criteria in previous
not possible when the transmitter is rank deficient and cannot iterative optimization algorithms.
be inverted. In this aspect,2 our problem is more general. Unlike Probably the most important use of the optimality conditions
the previous references regarding precoding, which usually is in deriving new design algorithms. Using the conditions,
dealt with the MSE criterion and its variants, the beamforming we provide a simple fixed-point iteration that is guaranteed to
community has successfully managed to optimize SINR-based converge to the solution of the power optimization. As a special
criteria, which are more related to practical performance mea- case, this simple iteration can solve the well-known rank-one
sures, such as bit error rate (BER) and capacity. This problem beamforming problem. This allows a simple solution to the
is mathematically more difficult than MSE optimization. It problem without the need for special optimization packages.
was solved using an interesting duality between downlink A similar fixed-point iteration is derived for the SINR opti-
and uplink beamforming [40], [41]. The uplink beamforming mization problem without a convergence proof. In comparison
problem has been solved before in [42] and [43]. Using the to the downlink–uplink duality-based solutions, our simple
duality, the downlink SINR problem can be handled as well fixed-point iterations are considerably more appealing. In
[38], [44]. Recently, a nonlinear THP version of these papers addition, following [39], we derive an alternative approach for
was presented in [45]. satisfying the optimality conditions in the power optimization
1The name transmit MMSE is borrowed from the terminology of receiver de- through a dual SDP/LMI program.
sign. These receivers do not minimize the mean-squared-error (MSE) between One of the advantages of our proposed algorithms is their ro-
the received vector and the symbol vector. On the contrary, the transmit ZF min-
imizes it [3]. bustness to the rank of the effective channels. Most of the previous
2On the other hand, the above references deal with beamforming for rank precoders assume a full-rank effective channel. For example, one
r> 1 and are therefore more general than precoding. cannot decorrelate the channel in [3] if the channel is rank defi-
WIESEL et al.: LINEAR PRECODING VIA CONIC OPTIMIZATION FOR FIXED MIMO RECEIVERS 163
cient, as is the case when the number of users is greater than the block of symbols is modulated and transmitted over the chan-
spreading factor (or the number of transmit antennas). Our design nels. The possibly distorted output is then processed at the re-
algorithms, both the conic solutions and the fixed-point iterations, ceivers in a linear fashion, as depicted in Fig. 1. Denoting by
are indifferent to the rank of the channel, and are therefore appli- the length output of the th receiver, for , we
cable to such scenarios as well. In addition, following [34] and have that
[35], which addressed this problem in the context of optimal se-
quences design for MMSE receivers, we provide an upper bound .. .. .. (1)
for the maximal feasible SINRs in these cases. . . .
An interesting result of our precoders is their performance
in symmetric systems. In this case, our precoders admit simple
closed-form expressions that have already been derived in where the matrices and denote the receiver and
[5]–[9] through different considerations and for general chan- channel associated with the th user, the matrix is the
nels. Using our optimality conditions, it is easy to show that centralized transmitter, is the length vector of
these precoders maximize the worst SINR in symmetric chan- independent, and unit variance transmitted symbols, and are
nels. A realistic example of such systems is a CDMA scheme, the noise vectors. The noise vectors may be correlated, and the
using pseudonoise (PN) sequences as signatures. We analyt- channels are completely arbitrary. The only restriction is that the
ically show that the achievable SINRs using these precoders transmitter is centralized and has access to all of the transmit
with MF receivers is identical to those obtained by using components.
MMSE receivers with no precoders. This result is interesting, Our problem formulation assumes a single stream per MIMO
as it allows for each user to use a simple receiver that does not dimension, i.e., the length of is equal to the length of . How-
require neither the knowledge of all the other signatures nor a ever, these streams can be dedicated to a single user or to mul-
matrix inversion. It is important to note that this feature does tiple users. Three specific examples for which this model holds
not extend to nonsymmetric channels. are given below.
The paper is organized as follows. We begin in Section II by • Point-to-point multiple antenna system—A single user,
introducing the problem formulation. A brief review of conic point-to-point communication system using multiple
optimization is provided in Section III. The power optimization receive and transmit antennas is a special case of (1)
problem is explored in Section IV, in which we discuss its feasi- with .
bility and provide standard conic optimization solutions. In order • CDMA system—The downlink channel of a CDMA
to improve our design algorithms and in order to gain more insight system with users is a special case of (1) with ,
into the problem, we then provide optimality conditions and sug- where is a signature matrix whose columns are the
gest a simple fixed-point iteration for finding the variables that signatures of each of the users, and are
satisfy them. Next, in Section V, we follow the same steps for row vectors representing the linear receive filters of each
the SINR optimization problem. A few special cases for which a of the users.
closed-form solution exists are explored in Section VI. In Section • Transmit beamforming—A multiuser system in which
VII, we illustrate the use of the aforementioned precoders in the transmit antennas signal to users each using a single re-
context of multiple-user communication systems. ceive antenna is a special case of (1) with . Here,
The following notation is used. Boldface upper-case letters is a beamforming matrix whose columns are the an-
denote matrices, boldface lower-case letters denote column vec- tenna weights of each of the users, are row vec-
tors, and standard lower-case letters denote scalars. The super- tors that represent the paths from the transmit antennas to
scripts , and denote the transpose, the th receive antenna, and are arbitrary scalars.
the complex conjugate, the Hermitian, the matrix inverse opera- In the sequel, we will assume that the transmitter , the
tors, and the Moore Penrose pseudoinverse, respectively. channels and the receivers are fixed and cannot be
denotes the ( th, th) element of the matrix . By , we altered due to budget restrictions, standardization, or physical
denote a diagonal matrix with being the ( th, th) element; problems. Given this fixed structure, we will try to improve the
by , we denote stacking the elements of in one long performance by introducing a linear precoder. The precoder, de-
column vector; by , we denote a zeros vector with a one at the noted by , linearly transforms the original symbol vector prior
th element; by , we denote an all ones vector; and by , we de- to the transmission so that the outputs of the receiver are now
note the identity matrix of appropriate size. , given by
and denote the trace operator, the real part, the abso-
lute value, the standard Euclidean norm, and the induced row
sum matrix norm, respectively. Finally, denotes that the .. .. ..
. . .
matrix is a Hermitian positive semidefinite matrix, and
denotes the Null space operator.
(2)
II. PROBLEM FORMULATION
For ease of representation, we will use the following notation:
Consider a general, block oriented, MIMO communication
system with a centralized transmitter. At each time instant, a (3)
164 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 54, NO. 1, JANUARY 2006
where and the rest of the variables are defined One of the main observations of our work is that both op-
in (2). timization problems (6) and (7) can be solved using standard
Our goal is to improve the system performance by optimally conic optimization algorithms. Therefore, in the next section,
designing the precoder. The system performance is usually we review these algorithms.
quantified by its quality of service (QoS) and the resources it
uses. The most common QoS metrics are BER and capacity, III. REVIEW OF CONIC OPTIMIZATION
both of which are highly related to the output SINRs, and in
In recent years, there has been considerable progress and de-
particular to the worst SINR. In our model, the output SINR of
velopment of efficient algorithms for solving a variety of opti-
the th subchannel is defined as
mization problems. In order to use these algorithms, one must
reformulate the problem into a standard form that the algorithms
SINR (4) are capable of dealing with. In this section, we will briefly re-
view the three formulations that we use in the paper: SOCP, SDP,
for , where . Another range and GEVP programming.
of criteria deal with the use of system resources, e.g., peak-to- The most widely researched field in optimization is convex
average ratio, or maximal transmitted power. The most common optimization. A convex program is a program with a convex ob-
resource measure is average transmitted power, which is defined jective function and convex constraints. It is well known that in
as such programs a local minimum is also a global minimum. The
most common convex program is probably the linear program
(5) (LP) [21], i.e., an optimization with a linear objection func-
tion and linear (affine) constraints. Recent advances in convex
It is easy to see that the SINR metric and average power metric optimization generalize the results and algorithms of LPs to
conflict. One cannot maximize the SINRs while also minimizing more complicated convex programs. Special attention is given
the power, and vice versa. Depending on the application, the de- to conic programs, i.e., LPs with generalized inequalities. The
signer must decide which criteria is stricter. We therefore con- two standard conic programs are SOCP and SDP optimization.
sider one of the following two complementary strategies. The The standard form of an SOCP is [47]
first optimization strategy seeks to minimize the average trans-
mitted power subject to QoS constraints. This criterion is inter-
SOCP
esting from a system-level perspective. Given the required QoS, s.t.
the system tries to satisfy it with minimum transmitted power
(8)
[17], [38], as follows:
where the optimization variable is the vector of length and
(6) , and for are the data parameters
s.t. of appropriate sizes. The notation denotes the following
generalized inequality:
where is the given worst SINR constraint.
The second strategy is maximizing the minimal SINR subject (9)
to a power constraint [16], [44]. This problem formulation is in-
teresting when the power constraint is a strict system restriction The standard form of an SDP is [48]
that cannot be relaxed. In this case, the problem can be formu-
lated as
SDP (10)
s.t.
(7) where is a Hermitian matrix that
s.t. depends affinely on . The data parameters are the Hermitian
matrices for . The notation denotes the pos-
where is the given power constraint.
itive semidefinite generalized inequality. A simple case of an
Note that although we are optimizing the minimum SINR
SDP is an SOCP. For example, each of SOC constraints in (8)
in both problems, it is easy to see that at the optimal solution
can be written as an LMI [21], as follows:
of both problems all users will attain equal SINRs (see also
[38]). In other words, the above design criteria both promise
fairness among all the substreams. This is an important prop- (11)
erty in MIMO communication systems. In systems where some
streams demand different QoS, e.g., systems with voice and data A common optimization package designed to solve SOCP and
streams, the designer can replace each SINR in the optimiza- SDP is SEDUMI [46].
tions with SINR , where are constant weights that denote Although most of the research in the field of optimization
the importance of the substreams. This will ensure weighted concerns convex programs, due to their importance, some cases
fairness among the streams. of nonconvex problems have also been investigated. Among
WIESEL et al.: LINEAR PRECODING VIA CONIC OPTIMIZATION FOR FIXED MIMO RECEIVERS 165
them is the GEVP [49], which is not convex but can still be ef- Proposition 1: There exists a such that
ficiently solved. Its standard form is
(16)
s.t.
GEVP (12) only if
(17)
where is a real-valued optimization variable and
and
Proof: In order to prove the proposition, we must upper
are Hermitian matrices that depend affinely
bound the minimal SIR, as follows:
on . The data parameters are the Hermitian matrices
and for . The name of the GEVP arises from
its resemblance to the well-known problem of minimizing the (18)
maximal generalized eigenvalue of the pencil , i.e., min-
imizing the largest such that . It is easy to show where , and we have used
that this problem can be expressed as the monotonicity of in . Due to
monotonicity, we can bound by bounding its argument.
(13) Thus, we now develop a bound on the minimum . Let
s.t. have a singular value decomposition (SVD) ,
where and are semi-unitary matrices, is an
which is, of course, a simple SDP. The GEVP generalizes this diagonal matrix, and rank . Then
program to the case where and also depend on the opti-
mization variables.
(19)
IV. POWER OPTIMIZATION
In this section, we consider the power optimization subject to where and are the th columns of and , respec-
SINR constraints, i.e., the problem of (6). We begin in Section tively. For every , we can bound (19) by applying
IV-A by discussing its feasibility and then provide a few alterna- the Cauchy–Schwarz inequality to the vectors and
tive approaches for its solution. In particular, in Section IV-B,
(20)
we derive a solution to the problem that is based on standard
SOCP or SDP optimization packages. Next, in Section IV-C, Since , we conclude that
we develop optimality conditions for this problem and use them
to derive two alternative solutions. For completeness, in Section (21)
IV-D, we discuss the uplink–downlink duality in the context of
the power optimization. Thus, the minimum is bounded by
A. Feasibility
The first important property of any optimization problem is
its feasibility (admissibility), i.e., whether a solution exists. In
other words, we need to verify whether for a given there exists
a such that (22)
is only necessary. In general, we cannot always attain the bound Since for , we can take the square
when the receiver is fixed. Two simple examples for channels root of , resulting in
in which the bound cannot be achieved are a diagonal with
rank diagonal zeros, or a channel with two identical (28)
rows. In both of these examples, it is easy to see that, no matter
what the precoder is, we will not attain the bound. which can be written as the SOCs
Nonetheless, experimenting with arbitrary channels shows
that in almost all practical channels the bound can be achieved
even for a fixed suboptimal receiver. For example, consider a (29)
rank channel with the normalized null vector
. Except for the case in which for some
, the bound can always be attained by choosing Similarly, the power constraint in (26) can be reformulated
using the operator as , which is
(23) equivalent to the SOC
(25)
C. Optimality Conditions
In this section, we will derive the KKT optimality conditions
(27) for the power optimization. These conditions provide more in-
WIESEL et al.: LINEAR PRECODING VIA CONIC OPTIMIZATION FOR FIXED MIMO RECEIVERS 167
(37) .. ..
. .
(38)
.. (47)
for . This structure of is unique within the .
range of . At this optimal solution, all the constraints are
active, i.e., there are equal SINRs for all the subchannels. The
optimal objective value is The elements of are nonnegative. Therefore,
the elements of the sum will also be nonnegative, and we
(39) can take the element wise squared roots and solve for for
.
Thus, we have shown that the solution in (36)–(38) exists.
Proof: The proof consists of two parts. First, we show that Plugging this solution into the SINR constraints satisfies all the
if (35) holds, then the problem is strictly feasible. Next, as- constraints with equality. Therefore, the problem is feasible.
suming it is strictly feasible, we will use the KKT optimality Moreover, since for , we can always scale
conditions to show that the proposed solution is necessary and the solution by and satisfy the constraints with strict
sufficient. inequalities, i.e., the problem is strictly feasible.
We begin by proving that if (35) holds, then the proposed so- In the next part of the proof, we will show that if (35) holds,
lution in (36)–(38) is feasible. First, let us prove that this solu- then the solution in (36)–(38) is necessary and sufficient for op-
tion exists, i.e., that the matrix in (37) is timality. The power optimization problem can be written as fol-
invertible and that the argument of the squared root is nonnega- lows:
tive. The matrix is invertible since the maximal eigenvalue of
is less than , as follows:
s.t.
(40)
(48)
(41)
The above program is not written in convex form (in order to
write it in convex form, conic inequalities must be used). In
(42) general, the KKT conditions are not sufficient for optimality in
168 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 54, NO. 1, JANUARY 2006
nonconvex programs. However, in Appendix I, we show that satisfies (54). In addition, it has the structure of (53) and is there-
in this special case, if the program is strictly feasible, then its fore sufficient. Moreover, it is easy to show that this structure is
KKT conditions are necessary and sufficient for optimality. The also necessary (within the range of ). Plugging from
Lagrangian associated with program (48) is (53) into (54) yields
(55)
(49)
for , where is the matrix defined by (38).
Rewriting in matrix form, we have
where are the Lagrange dual variables. As we have
shown in the first part of the proof, if (35) holds, then the
.. .. (56)
problem is strictly feasible. Therefore, its primal and dual . .
variables are optimal if and only if the following conditions are
satisfied.
Since for , the unique solution to this
1) Feasibility: The variable is feasible
set of equations is given by (37) and (38). Finally, the optimal
objective value in (39) can be easily found using (5) and (52).
(50) For completeness, it should be noted that when the problem is
solvable, there always exist for such that (35)
for , and the dual variables are dual feasible, holds. This can be shown since if we left multiply both sides
i.e., for . of (53) by and examine the diagonal elements, then (35) is a
2) Complementary slackness: For each , either direct consequence of (53) (which is a necessary condition for
or optimality).
Theorem 1 provides a simple strategy for designing the pre-
(51) coder. Given a feasible , all one has to do is find which
satisfy (35). Once these are found, can be derived through
(36)–(38). As we will show in Section VI, in some special cases,
3) Zero derivative: The derivative of with respect to is
these variables can be derived in closed form. Otherwise, we
zero, resulting in
now propose two alternative methods for finding these variables.
In Section IV-C-1), we present a simple fixed-point iteration,
(52) and in Section IV-C-2), we propose an SDP dual program.
1) Fixed-Point Iteration for Finding : The structure of
At the optimal solution, all the constraints are active, i.e., (51) (35) motivates a fixed-point iteration for finding . By rear-
holds with equality for . As proof, note that if one ranging (35), we arrive at the following simple iteration:
constraint does not hold with an equality, then we can always
scale the row in associated with it and arrive with a feasible
solution that results in a lower objective value, which is a con-
tradiction. (57)
Another important property of the optimal solution, is that Clearly, the optimal satisfy this fixed point. As we now show,
all the dual variables are strictly positive. As proof, assume the if is feasible, then the above iteration will converge from
contrary, i.e., there exists an such that . Then, multi- any to a set that satisfies (35). The convergence
plying (52) by on the left and examining the th diagonal proof is based on the standard function approach introduced in
element, we have , which holds if and [51], which can be summarized as follows. Consider the fixed-
only if and , in which case the th point iteration
SINR is clearly zero. However, since , this contradicts
the SINR constraints. (58)
In general, the that satisfies (52) is not unique. Nonetheless,
expressing as , where where . If (58) has a fixed point, and the
, and , we can find , which is unique functions obey the following properties:
within the range of . Using , we arrive with • (positivity) if for all , then for all ;
the following necessary and sufficient conditions: • (monotonicity) if for all , then
for all ;
(53) • (scalability) if , then for all
then starting in any initial , the iteration will converge to this
(54) unique fixed point. In Appendix II, we show that if the problem
is feasible and for ,
where for . As then the functions in (57) satisfy these properties, and the itera-
already explained, if (35) holds, then the solution in (36)–(38) tion will converge.
WIESEL et al.: LINEAR PRECODING VIA CONIC OPTIMIZATION FOR FIXED MIMO RECEIVERS 169
s.t.
1
b
Fig. 2. Block diagram of a downlink (broadcast) system. The matrices
(59) for m = 1; . . . ; M are diagonal matrices with the s associated with .
s.t. (60)
170 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 54, NO. 1, JANUARY 2006
Moreover, previous attempts for solving the problem are Proof: We begin by proving (64) by contradiction. As-
based on this duality [38], [42]. As we have shown in the pre- sume the contrary, i.e., and are the optimal value and ar-
vious section, the problem can be solved without the duality gument of , and and are the optimal value and
using the optimality conditions. However, for completeness, we argument of . If , then this is a contradiction for the
now present the duality-based approach as well. This approach optimality of for , since is feasible for it and provides
confronts the problem by addressing the problem first and a larger objective value . Otherwise, if , then this is a
then adjusting the solution based on Theorem 2. Fortunately, contradiction for the optimality of for , since ,
there is an intuitive iterative solution to program . The problem and we can always find such that will still be feasible
can be solved by iteratively solving for each of the parameters, but will result in a smaller objective.
while keeping the others fixed. Next, we prove (66) by contradiction. Assume the contrary,
i.e., and are optimal for , and and are optimal
for . We can always multiply by so that it will
1 repeat
still achieve the SINRs constraints of , with an effective power
2
constraint . This contradicts the assumption that
3 was optimal for . The continuity can be verified using similar
. arguments to those in Lemma 2 of [52]. The proofs of (65) and
4 until convergence (67) are similar and are therefore omitted.
Using the properties in Theorem 3, we can solve for a
Line 2 optimizes the receive matrix to maximize the SINRs
given by iteratively solving for different s. Due to
for fixed . Line 3 optimizes the power allocation weights
the inversion property, if , then its solution will be
for fixed [51]. In [38], it was shown that the above algorithm
optimal also for . The strict monotonicity and continuity
always converges to the optimal solution. It is similar to our
guarantees that a simple one-dimensional bisection search will
simple fixed-point iteration in (57), except for the fact that in the
efficiently find the required . This procedure is summarized
above algorithm and are independently optimized at each
in the following algorithm (see also [43]).
iteration, whereas in (57) both are optimized together. Thus, our
simple fixed-point iteration is more appealing.
1
2
V. SINR OPTIMIZATION 3 repeat
We now consider the problem of maximizing the worst SINR 4
subject to a power constraint, i.e., the program of (7). As be- 5
fore, we begin by examining its feasibility. Fortunately, it is easy 6 if
to verify that the program is always feasible, as we can always 7 then
scale so that it satisfies the power constraint. In Section V-A, 8 else
we discuss the connection between the power optimization and 9 until
the SINR optimization and explain how this connection can be 10 return
used to solve the SINR optimization. Then, we follow the steps where MinSINR and MaxSINR define a range of relevant
we took before in the context of the power optimization and re- SINRs for a specific application, and where we have used the
peat them in the context of the SINR optimization. In Section convention that if it is infeasible.
V-B, we formulate the SINR problem as a standard GEVP conic Theoretically, this means that the SINR optimization problem
program; in Section V-C, we provide a fixed point iteration; and can be solved through the previous results concerning the power
in Section V-D, we discuss its uplink–downlink duality. optimization. Nonetheless, due to its importance and in order to
obtain more efficient numerical solution, we now provide direct
A. Connection With Power Optimization solutions for the SINR optimization through conic optimization,
The most interesting property of the SINR optimization via the optimality conditions, and through the uplink–downlink
program is its relation to the power optimization program. In duality.
order to mathematically define this relation, we introduce the
following theorem. B. Conic Optimization Solution
Theorem 3: The power optimization problem of (6) and the The SINR optimization can be cast as a standard GEVP pro-
SINR optimization problem of (7) are inverse problems: gram. Using a real-valued slack variable , the problem can be
rewritten as
(64)
(65) s.t.
(68)
In addition, the optimal objective value of each program is con-
tinuous and strictly monotonic increasing in its input argument
At first glance, (68) seems similar to (26). However, it turns
(66) out to be considerably more complicated. This is because the
(67) SINR matrix inequalities in (33) are linear in
WIESEL et al.: LINEAR PRECODING VIA CONIC OPTIMIZATION FOR FIXED MIMO RECEIVERS 171
or in , but not in both simultaneously. Thus, when is an op- D. Interpretation via Uplink–Downlink Duality
timization variable and not a parameter, these constraints are no Following the success of the uplink–downlink duality in the
longer LMIs. In fact, the sets which they define are not convex.3 power optimization, the duality was recently used to confront
Nonetheless, we can still express them using generalized ma- the SINR optimization [44]. The uplink–downlink duality in the
trix inequalities as in (32) and (33). If we rewrite the ’s case of the SINR optimization can be stated as follows.
in (33) and separate out the terms which are linear, we have Theorem 4: Consider the uplink program in (74), shown at
(69) the bottom of the page. Program is the dual of the pro-
gram of (7) in the sense that if the optimal arguments
where and are matrices that depend affinely on and objective value of are and , then the optimal
, as follows: objective value of is also , and its optimal argument is
, where are appropriate scaling coeffi-
cients.
Proof: The proof is similar to the other proofs in this paper
and is therefore omitted. A detailed proof in the case of
(70) can be found in [44].
The downlink beamforming SINR optimization problem
Using (69), we can express in the standard GEVP form was solved using duality in [44]. The algorithm iteratively
optimizes each of the optimization variables while keeping the
others fixed, as follows.
s.t.
1 repeat
(71) 2
s.t. (74)
172 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 54, NO. 1, JANUARY 2006
(75)
(76)
Fig. 5. BERs of a nonsymmetric three-user system with cross correlations Fig. 6. Worst output SINR in a system with K = 4 and N = 3.
= 0:8; = 0:9; = 0:7.
There are many interesting extensions to this paper that
precoders are presented in Fig. 6. Using our precoder sig- are worth pursuing. The first concerns the extension of our
nificantly increases the SINR compared to a system with no results to the case of partial channel state information (CSI). In
precoder. Using the precoder, the SINRs asymptotically con- many practical systems, the transmitter does not have access
verge to the bound in (17), i.e., for to perfect CSI, and needs to resort to noisy channel estimates,
. Interestingly, the performance using the precoder and/or delayed feedback. In this case, robust optimization
of [5] is even worse than not using a precoder at all. For fairness, algorithms should be applied. Another possible direction is
we must note that the SINR of the best user using this precoder to consider fixed nonlinear receivers, such as the successive
are much higher. However, from a systems prespective, the in- canceling receiver. It is well known that such receivers outper-
teresting metric is the performance of the worst user, and in this form the linear receivers explored in our paper. Therefore, by
sense our precoder is more appealing. designing the precoder to optimally work with such receivers,
the performance can significantly improve.
VIII. CONCLUSION
APPENDIX I
In this paper, we addressed the problem of designing linear OPTIMALITY CONDITIONS FOR PROGRAMS
precoders for fixed MIMO receivers. We considered two com- WITH SOC CONSTRAINTS
plementary design criteria, and proposed several alternative al-
gorithmic solutions for these optimization problems. In this appendix, we derive optimality conditions for opti-
It is first observed that in precoder design, maximizing the mization programs with SOC constraints. The conditions are
worst SINR is advantageous to minimizing MSE. Most of the summarized in the following proposition.
previous work regarding precoders is based on optimizing vari- Proposition 3: Consider a nonconvex program of the struc-
ants of the MMSE criterion. These ad hoc criteria are usually ture
computationally attractive and perform quite well. However, the
(79)
ever increasing demand for better performance, as well as the s.t.
considerable progress in optimization theory, suggests that up-
coming research should focus on design criteria, which are more where is convex in , and and for
related to practical performance measures, such as maximizing are affine functions of . Let us associate the dual
the worst SINR. variables for with this program. If the program
Our second important observation is that by using conic is strictly feasible, then the following KKT conditions are nec-
optimization theory and algorithms, the precoder design prob- essary and sufficient conditions for optimality of and .
lems can be solved in a straight forward manner without using 1) Feasibility: The variable is feasible
uplink–downlink duality. This duality is remarkable and has
(80)
enabled solutions to problems which were unsolvable before.
Nonetheless, we believe that understanding the precoder design and the dual variables are dual feasible for
using first principles, and not as a byproduct of the uplink .
problem, is also important. For example, in future work, the 2) Complementary slackness: For each , one of
simple optimality conditions may help in analyzing the perfor- the following conditions holds:
mance of these systems, or in improving the design criteria,
without the need to resort to the virtual uplink problem. (81)
174 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 54, NO. 1, JANUARY 2006
Proof: For simplicity, we will only deal with real-valued In this appendix, we will prove some properties of . For
variables and functions. The extension to complex values is simplicity and due to space limitations, we will only deal with
straightforward. The KKT conditions are necessary for opti- real-valued variables and functions. The proofs rely on the fol-
mality of any optimization problem [21]. If the program in (79) lowing proposition.
was in convex form, then the conditions were also sufficient Proposition 4: If and is in the range of ,
for optimality. Unfortunately, the program is not expressed in then
convex form, and therefore we must prove the sufficiency. Let
us begin by rewriting (79) in convex form (90)
where we have used Proposition 4 with [2] A. Goldsmith, S. A. Jafar, N. Jindal, and S. Vishwanath, “Capacity limits
and . Due to the semidefiniteness, the di- of MIMO channels,” IEEE J. Sel. Areas Commun., vol. 21, no. 5, pp.
684–702, Jun. 2003.
agonal elements of are nonnegative. [3] B. R. Vojcic and W. M. Jang, “Transmitter precoding in synchronous
Therefore, if they are finite, then their inverses are strictly multiuser communication,” IEEE Trans. Commun., vol. 46, no. 10, pp.
1346–1355, Oct. 1998.
positive, and the property holds. [4] R. F. H. Fischer, Precoding and Signal Shaping for Digital Transmis-
sion. New York: Wiley, 2002.
[5] M. Joham, K. Kusume, M. H. Gzara, and W. Utschick, “Transmit wiener
• Monotonicity—If for , then filter for the downlink of TDD DS-CDMA systems,” in Proc. IEEE 7th
for . Int. Symp. Spread Spectrum Techniques Applications (ISSSTA), vol. 1,
Sep. 2002, pp. 9–13.
[6] F. Wathan, R. Irmer, and G. Fettweis, “On transmitter-based interference
Proof. Observe the following chain: mitigation in TDD-downlink with frequency selective fading environ-
ment,” presented at the Asia–Pacific Conf. Communications (APCC),
Indonesia, Sep. 17–20, 2002.
[7] R. L. U. Choi and R. D. Murch, “New transmit schemes and simplified
receivers for MIMO wireless communications systems,” IEEE Trans.
Wireless Commun., vol. 2, no. 6, pp. 1217–1230, Nov. 2003.
[8] L. U. Choi and R. D. Murch, “Transmit-preprocessing techniques with
simplified receivers for the downlink of MISO TDD-CDMA systems,”
IEEE Trans. Veh. Technol., vol. 53, no. 2, pp. 285–295, Mar. 2004.
[9] C. Peel, B. Hochwald, and L. Swindlehurst, “A vector-perturbation tech-
nique for near-capacity multi-antenna multi-user communication,” pre-
sented at the 41st Allerton Conf. Communication, Control, Computing,
IL, Oct. 1–3, 2003.
(96) [10] R. Esmailzadeh, E. Sourour, and M. Nakagawa, “Pre-rake diversity com-
bining in time division duplex CDMA mobile communications,” IEEE
where we have used Proposition 4 with and Trans. Veh. Technol., vol. 48, no. 3, pp. 795–801, May 1999.
[11] K. N. Modi, K. Reynolds, and X. Wang, “Linear precoding for
. multi antenna CDMA onver multipath ISI channels,” presented at
• Scalability—If , then for the Conf. Information Sciences Systems (CISS-2004), Princeton,
NJ, Mar. 2004.
. [12] Y. Ding, T. N. Davidson, Z.-Q. Luo, and K. M. Wong, “Minimum BER
block precoders for zero-forcing equalization,” IEEE Trans. Signal
Process., vol. 51, no. 9, pp. 2410–2423, Sep. 2003.
Proof. Observe the following chain: [13] K. Ban, M. Katayama, T. Yamazato, and A. Ogawa, “Transmitter pre-
coding with multiple transmit/receive antennas for high data rate com-
munication in bandwidth-limited channels,” in Proc. Vehicular Tech-
nology Conf. (VTC 1999—Fall), vol. 3, Sep. 1999, pp. 1946–1950.
[14] J. Yang and S. Roy, “Joint transmitter receiver optimization for multi
input multi output systems with decision feedback,” IEEE Trans. Inf.
Theory, vol. 40, no. 5, pp. 1334–1347, Sep. 1994.
[15] A. Scaglione, P. Stoica, S. Barbarossa, G. B. Giannakis, and H.
S. Sampath, “Optimal designs for space time linear precoders and
decoders,” IEEE Trans. Signal Process., vol. 50, no. 5, pp. 1051–1064,
May 2002.
[16] D. P. Palomar, J. M. Cioffi, and M. A. Lagunas, “Joint Tx-Rx beam-
forming design for multicarrier MIMO channels: A unified framework
for convex optimization,” IEEE Trans. Signal Process., vol. 51, no. 9,
(97) Sep. 2003.
[17] D. P. Palomar, M. A. Lagunas, and J. M. Cioffi, “Optimum linear joint
where we have used Proposition 4 with and transmit-receive processing for MIMO channels with QoS constraints,”
. The inequality is non strict, i.e., IEEE Trans. Signal Process., vol. 52, no. 5, pp. 1179–1197, May 2004.
[18] Z. Q. Luo, T. N. Davidson, G. B. Giannakis, and K. M. Wong, “Trans-
holds with equality if and only if ceiver optimization for block-based multiple access through ISI chan-
. Multiplying by on the left, yields nels,” IEEE Trans. Signal Process., vol. 52, no. 4, pp. 1037–1052, Apr.
2004.
. Therefore, , and [19] S. Serbetli and A. Yener, “Transceiver optimization for multiuser MIMO
due to the symmetry of , we also have systems,” IEEE Trans. Signal Process., vol. 52, no. 8, pp. 214–226, Aug.
. In addition, due to the semidefiniteness, this 2004.
[20] H. Sampath, P. Stoica, and A. Paulraj, “Generalized linear precoder and
means that and decoder design for MIMO channels using the weighted MMSE crite-
. Therefore, , rion,” IEEE Trans. Commun., vol. 49, no. 12, pp. 2198–2206, Dec. 2001.
[21] S. Boyd and L. Vandenberghe, Introduction to Convex Optimization
and . Consequently, the problem is with Engineering Applications. Stanford, CA: Cambridge Univ. Press,
infeasible. 2003.
[22] M. Brandt-Pearce and A. Dharap, “Transmitter based multiuser inter-
ference rejection for the down-link of a wireless CDMA system in a
ACKNOWLEDGMENT multipath environment,” IEEE J. Sel. Areas Commun., vol. 18, no. 3, pp.
407–417, Mar. 2000.
A. Wiesel would like to thank Prof. A. Nemirovski for intro- [23] M. Meurer, P. W. Baier, T. Weber, Y. Lu, and A. Papathanassiou, “Joint
transmission: Advantageous downlink concept for CDMA mobile radio
ducing him to the GEVP, and Dr. D. P. Palomar, Dr. A. Dor, and systems using time division duplexing,” Electron. Lett., vol. 36, no. 10,
the anonymous referees for their insightful comments and sug- pp. 900–901, May 2000.
gestions that helped improve this paper. [24] S. Georgoulis and D. Cruickshank, “Pre-equalization, transmitter
precoding, and joint transmission techniques for time division duplex
CDMA,” in Proc. Inst. Elect. Eng. 3G Mob. Communications Tech-
REFERENCES nology, London, U.K., Mar. 2001, pp. 257–261.
[25] A. N. Barreto and G. Fettweis, “Joint signal precoding in the downlink
[1] S. Verdu, Multiuser Detection. Cambridge, U.K.: Cambridge Univ. of spread spectrum systems,” IEEE Trans. Wireless Commun., vol. 2, no.
Press, 1998. 3, pp. 511–518, May 2003.
176 IEEE TRANSACTIONS ON SIGNAL PROCESSING, VOL. 54, NO. 1, JANUARY 2006
[26] S. L. Georgoulis and D. G. M. Cruickshank, “Transmitter-based inverse Ami Wiesel (S’02) received the B.Sc. degree in
filters for reducing MAI and ISI in CDMA-TDD downlink,” IEEE Trans. electrical engineering and the M.Sc. degree in elec-
Wireless Commun., vol. 3, no. 2, pp. 353–358, Mar. 2004. trical engineering from Tel-Aviv University (TAU),
[27] J. Choi and S. Perreau, “MMSE multiuser downlink multiple antenna Tel-Aviv, Israel, in 2000 and 2002, respectively.
transmission for CDMA channels,” IEEE Trans. Signal Process., vol. He is currently working toward the Ph.D. degree
52, no. 6, pp. 1564–1573, Jun. 2004. at the Department of Electrical Engineering, The
[28] S. Wang and J. Caffery, “Linear multiuser precoding for synchronous Technion—Israel Institute of Technology, Haifa,
CDMA,” in Proc. IEEE Wireless Communications Networking Conf.
Israel.
(WCNC2003), vol. 1, Mar. 2003, pp. 403–407.
[29] L. U. Choi and R. D. Murch, “A transmit preprocessing technique From June 2002 to March 2003, he was with
for multiuser MIMO systems using a decomposition approach,” IEEE the Department of Signal Theory and Communica-
Trans. Wireless Commun., vol. 3, no. 1, pp. 20–24, Jan. 2004. tions of the Universitat Politecnica de Catalunya,
[30] L. Collin, O. Berder, P. Rostaing, and G. Burel, “Optimal minimum dis- Barcelona, Spain. His main areas of interests include signal processing algo-
tance based precoder for MIMO spatial multiplexing systems,” IEEE rithms, convex optimization methods, and detection and estimation theory.
Trans. Signal Process., vol. 52, no. 3, pp. 617–627, Mar. 2004. Mr. Wiesel was awarded the Weinstein Study Prize in 2002 and the Intel
[31] R. Fischer, C. Windpassinger, A. Lampe, and J. Huber, “Space time Award in 2005. He received the Student Paper Award for an SPAWC-2005 paper.
transmission using Tomlinson-Harashima precoding,” in Proc. 4th ITG
Conf. Source Channel Coding, Berlin, Germany, Jan. 2002, pp. 139–147.
[32] M. Schubert and H. Boche, “Joint ‘dirty paper’ pre-coding and downlink
beamforming,” in Proc. IEEE Int. Symp. Spread Spectrum Techniques
Applications (ISSSTA 2002), vol. 2, Sep. 2002, pp. 536–540.
[33] E. S. Hons, A. K. Khandani, and W. Tong, “An optimized transmitter Yonina C. Eldar (S’98–M’02) received the B.Sc.
precoding scheme for synchronous DS-CDMA,” in Proc. IEEE Int. degree in physics and the B.Sc. degree in elec-
Communications Conf., vol. 3, Apr. 2002, pp. 1818–1822. trical engineering from Tel-Aviv University (TAU),
[34] P. Viswanath and D. N. C. Tse, “Optimal sequences, power control, and Tel-Aviv, Israel, in 1995 and 1996, respectively,
user capacity of synchronous CDMA systems with linear MMSE mul-
and the Ph.D. degree in electrical engineering and
tiusers receivers,” IEEE Trans. Inf. Theory, vol. 45, pp. 1968–1983, Sep.
1999. computer science from the Massachusetts Institute
[35] S. Ulukus and R. Yates, “Iterative construction of optimum signature of Technology (MIT), Cambridge, in 2001.
sequence sets in synchronous cdma systems,” IEEE Trans. Inf. Theory, From January 2002 to July 2002, she was a
vol. 47, no. 5, pp. 1989–1998, Jul. 2001. Postdoctoral Fellow at the Digital Signal Processing
[36] T. Guess, “Optimal sequences for CDMA with decision feedback re- Group, MIT. She is currently an Associate Professor
ceivers,” IEEE Trans.n Inf. Theory, vol. 49, no. 4, pp. 886–900, Apr. 2003. with the Department of Electrical Engineering, The
[37] J. H. Chang, L. Tassiulas, and F. R. Farrokhi, “Joint transmitter receiver Technion—Israel Institute of Technology, Haifa, Israel. She is also a Research
diversity for efficient space division multiple access,” IEEE Trans. Wire- Affiliate with the Research Laboratory of Electronics at MIT. Her current
less Commun., vol. 1, pp. 16–27, Jan. 2002. research interests are in the general areas of signal processing, statistical signal
[38] E. Visotsky and U. Madhow, “Optimum beamforming using transmit processing, and quantum information theory.
antenna arrays,” in Proc. IEEE Vehicular Technology Conf., vol. 1, May Dr. Eldar is a member of the IEEE Signal Processing Theory and Methods
1999, pp. 851–856. technical committee and an Associate Editor for the IEEE TRANSACTIONS ON
[39] M. Bengtsson and B. Ottersten, “Optimal downlink beamforming using SIGNAL PROCESSING. She was in the program for outstanding students at TAU
semidefinite optimization,” in Proc. 37th Annu. Allerton, Sep. 1999, pp.
from 1992 to 1996. In 1998, she held the Rosenblith Fellowship for study in
987–996.
[40] H. Boche and M. Schubert, “A general duality theory for uplink and Electrical Engineering at MIT, and in 2000, she held an IBM Research Fellow-
downlink beamforming,” in Proc. IEEE Vehicular Technology Conf. ship. She is currently a Horev Fellow of the Leaders in Science and Technology
(VTC-2002 Fall), Sep. 2002, pp. 87–91. program at The Technion and an Alon Fellow. In 2004, she was awarded the Wolf
[41] P. Viswanath and D. N. C. Tse, “Sum capacity of the vector Gaussian Foundation Krill Prize, and in 2005 the Andre and Bella Meyer Lectureship.
broadcast channel and uplink downlink duality,” IEEE Trans. Inf.
Theory, vol. 49, no. 8, pp. 1912–1921, Aug. 2003.
[42] F. R. Farrokhi, L. Tassiulas, and K. J. R. Liu, “Joint optimal power con-
trol and beamforming in wireless networks using antenna arrays,” IEEE
Trans. Commun., vol. 46, no. 10, pp. 1313–1324, Oct. 1998.
[43] T.-L. Tung and K. Yao, “Optimal downlink power-control design Shlomo Shamai (Shitz) (S’80–M’82–SM’89–F’94)
methodology for a mobile radio DS-CMDA system,” in Proc. IEEE received the B.Sc., M.Sc., and Ph.D. degrees in elec-
Workshop Signal Processing Systems (SIPS-2002), 2002, pp. 165–170. trical engineering from The Technion—Israel Insti-
[44] M. Schubert and H. Boche, “Solution of the multiuser downlink beam- tute of Technology, Haifa, Israel, in 1975, 1981, and
forming problem with individual SINR constraints,” IEEE Trans. Veh. 1986, respectively.
Technol., vol. 53, no. 1, pp. 18–28, Jan. 2004. From 1975 to 1985, he was a Senior Research
[45] S. Shi and M. Schubert, “Precoding and power loading for multi antenna Engineer with the Communications Research Lab-
broadcast channels,” presented at the Conf. Information Sciences Sys-
oratories, Tel Aviv, Israel. Since 1986, he has been
tems (CISS-2004), Princeton, NJ, Mar. 17–19, 2004.
[46] J. F. Sturm, “Using SEDUMI 1.02, a Matlab toolbox for optimizations with the Department of Electrical Engineering,
over symmetric cones,” Optimization Meth. Software, vol. 11–12, pp. The Technion—Israel Institute of Technology,
625–653, 1999. where he is now the William Fondiller Professor
[47] M. S. Lobo, L. Vandenberghe, S. Boyd, and H. Lebret, “Applications of Telecommunications. His research interests include topics in information
of second order cone programming,” Linear Algebra App., vol. 284, pp. theory and statistical communications. He is especially interested in theoretical
193–228, Nov. 1998. limits in communication with practical constraints; multiuser information
[48] L. Vandenberghe and S. Boyd, “Semidefinite programming,” SIAM Rev., theory and spread-spectrum systems; multiple-input-multiple-output (MIMO)
vol. 38, no. 1, pp. 49–95, Mar. 1996. communications systems; information theoretic models for wireless networks
[49] S. Boyd and L. E. Ghaoui, “Method of centers for minimizing general- and systems; information theoretic aspects of magnetic recording, channel
ized eigenvalues,” Linear Algebra and Its Applications, vol. 188–189, coding, combined modulation and coding, turbo codes, and low-density parity
pp. 63–111, 1993. codes in channel, source, and combined source–channel applications; iterative
[50] C. D. Meyer, Matrix Analysis and Applied Linear Algebra. Philadel- detection and decoding algorithms; coherent and noncoherent detection; and
phia, PA: SIAM, 2000. information theoretic aspects of digital communication in optical channels.
[51] R. Yates, “A framework for uplink power control in cellular radio sys-
Dr. Shamai (Shitz) is a member of the Union Radio Scientifique Internationale
tems,” IEEE J. Sel. Areas Commun., vol. 13, no. 7, pp. 1341–1347, Sep.
1995. (URSI). He is the recipient of the 1999 van der Pol Gold Medal of URSI and
[52] Y. C. Eldar, “Minimax MSE-ratio estimation with bounded data uncer- a corecipient of the 2000 IEEE Donald G. Fink Prize Paper Award, and the
tainties,” IEEE Trans. Signal Process., Dec. 2003, submitted for publi- 2003 and the 2004 joint IT/COM societies paper award. He is also the recipient
cation. of the 2000 Technion Henry Taub Prize for Excellence in Research. He has
[53] W. Yang and G. Xu, “Optimal downlink power assignment for smart served as Associate Editor for the Shannon Theory of the IEEE TRANSACTIONS
antenna systems,” in Proc. IEEE Int. Conf. Acoustics, Speech, Signal ON INFORMATION THEORY and also serves on the Board of Governors of the
Processing (ICASSP), vol. 6, May 1998, pp. 3337–3340. Information Theory Society.