Professional Documents
Culture Documents
Pilot Precoding and Combining in Multiuser MIMO Networks
Pilot Precoding and Combining in Multiuser MIMO Networks
Pilot Precoding and Combining in Multiuser MIMO Networks
Abstract—Although the benefits of precoding and combining standard can employ up to eight antenna elements [4]. For
arXiv:1611.01950v2 [cs.IT] 16 Nov 2016
data signals are widely recognized, the potential of these tech- 5G systems, we expect high-end UEs supporting high order
niques for pilot transmission is not fully understood. This is of modulation and coding schemes and a greater number of
particularly relevant for multiuser multiple-input multiple-output
(MU-MIMO) cellular systems using millimeter-wave (mmWave) receive and transmit antennas [5]. Moreover, due to the high
communications, where multiple antennas have to be used both at path loss at the mmWave frequencies, exploiting multiple
the transmitter and the receiver to overcome the severe path loss. antennas in addition to the spatial precoding and combining
In this paper, we characterize the gains of pilot precoding and is considered to have an essential role for establishing and
combining in terms of channel estimation quality and achievable maintaining a robust communication link [6]. Nonetheless,
data rate. Specifically, we consider three uplink pilot transmission
scenarios in a mmWave MU-MIMO cellular system: 1) non- most of the significant investigations in massive multiuser
precoded and uncombined, 2) precoded but uncombined, and MIMO (MU-MIMO) assume that the BSs or access points
3) precoded and combined. We show that a simple precoder that serve a lower number of single-antenna UEs. In those studies,
utilizes only the second-order statistics of the channel reduces such an assumption is considered non-restrictive because the
the variance of the channel estimation error by a factor that is spatial precoding of the user data streams boosts the achieved
proportional to the number of user equipment (UE) antennas.
We also show that using a linear combiner designed based on signal-to-noise-and-interference ratio (SINR) [7], [8]. While
the second-order statistics of the channel significantly reduces transmit precoding for the downlink transmission is the key
multiuser interference and provides the possibility of reusing to achieve high spectral efficiency, precoding in the uplink
some pilots. Specifically, in the large antenna regime, pilot direction has not been considered in these works. This is
precoding and combining help to accommodate a large number of true not only for uplink data transmission, but also for the
UEs in one cell, significantly improve channel estimation quality,
boost the signal-to-noise ratio of the UEs located close to the cell transmission of uplink pilot signals that are used to acquire
edges, alleviate pilot contamination, and address the imbalanced both channel state information at the transmitter (CSIT) and
coverage of pilot and data signals. channel state information at the receiver (CSIR) at the BS. A
Index Terms—multiuser MIMO, multiple antenna UEs, chan- direct consequence of the lack of precoding of pilot signals
nel estimation, millimeter-wave, transceiver design. is that the majority of the existing schemes consider one
orthogonal pilot sequence per transmit antenna, which gives
several systematic problems – illustrated in the sequel – and
I. I NTRODUCTION may limit the efficiency and future use cases of MU-MIMO
Multiple-input multiple-output (MIMO) systems that em- systems, especially in mmWave networks.
ploy a large number of antenna ports at wireless access Acquiring accurate channel state information (CSI), either
points are a rapidly maturing technology. The 3rd Generation at the transmitter or at the receiver, is among the main
Partnership Project (3GPP) is currently studying the details of bottlenecks of massive MIMO systems and faces three main
technology enablers and performance benefits of deploying challenges: i) scalability of the number of pilots, ii) per-
large scale antenna systems that support up to 64 antenna formance at low signal-to-noise ratio (SNR), and iii) pilot
ports at cellular base stations (BSs) [1]. Moreover, higher contamination [9], [10]. The length of training sequences for
frequency bands, such as millimeter-wave (mmWave), will nat- channel estimation, in the traditional “one orthogonal pilot
urally employ large-scale antenna systems [2]. With mmWave sequence per transmit antenna” scheme scales up (at least)
antennas, the physical array size can be greatly reduced due linearly with the number of transmit antennas [11]. Assuming
to the decrease in wavelength. Therefore, it is expected that that the number of BS antennas is larger than the combined
wireless systems employing even greater number of antenna number of UE antennas, channel estimation in uplink imposes
ports will be deployed in mmWave bands, making massive shorter training sequences. However, this scheme requires the
MIMO systems a practical reality. principle of channel reciprocity to hold, which is valid only for
In addition to the usage of many antenna ports at the time division duplexing (TDD) mode and when the duplexing
BS, also user equipments (UEs) compliant with existing and time is much shorter than the coherence time of the channel.
emerging wireless standards are also employing a growing Thus, realizing massive MIMO systems in frequency division
number of receive and transmit antennas. For example, current duplexing (FDD) mode is a well-known challenge [9]. Even in
UEs of the 3GPP Long Term Evolution systems can employ TDD mode, this scheme may not be feasible when a massive
up to four antennas for transmit and receive diversity as well number of multiple-antenna UEs are present, which is an
as for spatial multiplexing [3]. Clients of the IEEE 802.11ac important use case of mmWave networks. In such a case, the
2
entire coherence budget may be used only for the channel estimation quality and the achievable data rate. Our extensive
estimation procedure, depriving the data transmission phase mathematical and numerical analysis results in the following
from valuable coherent time and frequency resources. Thus, a key findings:
pilot transmission scheme that is scalable with the number of
• Pilot precoding in PuC substantially improves the channel
transmit antennas is especially desirable in mmWave systems.
estimation quality that can be achieved by the baseline
Moreover, due to the central role of the uplink pilot signals in
nPuC scenario. In particular, we show that the channel
the CSI acquisition, the constrained UE power and the lack of
estimation error variance can be reduced by a factor of
uplink precoding gains may limit the performance of mmWave
δ = N/L for large values of N , where N is the number of
systems with large antenna arrays. This leads to an imbalanced
UE antenna elements, and L is the rank of the channel
coverage of pilot and data signals (a.k.a imbalance between
between each UE and the BS. Hence, pilot precoding
uplink and downlink coverage [12], [13]), where the range at
leads to a large improvement in the channel estimation
which reasonable data rates can be maintained differs from
performance of systems with a large number of antennas
the one at which pilot signals can be detected. This problem
and low rank channels (which is the case in mmWave
is particularly important in mmWave networks due to severe
networks) where δ is large.
channel attenuations [14]. Therefore, a good pilot transmission
• A simple combiner of the pilot sequence, in the PC
scheme should address the imbalance in pilot-data coverage.
scenario that uses only the second-order statistics of the
The above-mentioned technical challenges, which limit the
channel substantially reduces intra-cell multiuser inter-
achievable data rate of massive MIMO systems, are exacer-
ference. In the systems with a large number of antennas
bated by pilot contamination, defined as the interference in the
at the BS, such as in mmWave networks, this interfer-
pilot signals [7], [8], [15]. Several methods to alleviate pilot
ence can be canceled completely. Consequently in PC,
contamination have been proposed and demonstrated [9], [16]–
pilots can be reused within one cell, as opposed to
[18]. In the context of multi-cell networks, the results of [18]
orthogonal pilot sequences used in the nPuC and PuC
suggest that channel covariance-aware pilot assignment to UEs
scenarios. This potential translates into the possibility
can completely remove the pilot contamination effects in the
of channel estimation using shorter training sequences
limit of large number of BS antennas. The intuition is that
(or equivalently serving more UEs without extra training
if a selected UE exhibits multipath angles of arrival (AoA)
sequences), which in turn enhances the network spectral
at its serving BS, which do not overlap with the AoAs of
efficiency.
UEs in the neighboring cells, these UEs can reuse the same
• Unlike the baseline scenario, nPuC, the number of pilot
pilot sequence as the selected UE, without any contamination
symbols needed in PuC and PC, is not dependent on
among the pilots of different cells. In [17], the UEs within
the number of antenna elements at the transmitters. This
each cell employ the same pilot sequence while the UEs in
benefit enables the realization of massive MIMO systems
the different cells are assigned orthogonal pilot sequences. The
in both FDD and TDD deployments.
impact of intra-cell pilot contamination is mitigated later in
• In PC, when the number of antennas at the BS and UEs
the downlink data transmission phase. However, in this case
goes to infinity, which may resemble a wireless back-
extra precoding matrices should be designed at the BSs, which
hauling scenario, we conclude the following asymptotic
add to the complexity of data transmission, specially when the
results:
number of antennas grows large. The other studies have not
investigated the pilot reuse within one cell, despite that such 1) the effects of pilot contamination vanish;
a reuse, together with employing multiple antenna elements at 2) a multi-cell network can be modeled by multiple unco-
the UE, has the potential to substantially improve the spectral ordinated single-cell systems with no performance loss
efficiency. (that is, without a performance penalty due to the lack
In this paper, we argue that all the aforementioned problems of coordination); and
– scalability, poor performance at low SNR, and pilot contam- 3) channel estimation in the entire network can be done
ination – can be substantially alleviated by employing multiple with only one pilot symbol. We also characterize the
antennas at the UEs, together with pilot precoding and com- above gains in the finite antenna regime.
bining. We show the benefits of pilot precoding and combining Our investigations can contribute to answering the following
for general massive MU-MIMO systems and for mmWave fundamental questions related to large scale antenna systems
systems in particular, where these benefits are substantial. Our in general and mmWave networks in particular [19]: How
investigation is motivated not only by the ongoing standards can we improve the quality of channel estimation for a fixed
development and the need to boost the uplink SNR, but also (finite) number of BS antennas? Can we mitigate the effects of
by the expectation that pilot precoding may achieve better pilot contamination and can we reduce the need for multicell
spatial separation of UEs served in the same and surrounding coordination? Can we utilize massive antenna arrays in FDD
cells. Specifically, we focus on the uplink of a MU-MIMO systems? How many orthogonal pilot sequences do we need
system and consider three pilot transmission scenarios: 1) non- for a given number of – possibly multiple antenna – UEs?
precoded and uncombined (nPuC), which serves as a baseline The rest of this paper is organized as follows. Section II
scenario, 2) precoded but uncombined (PuC), and 3) precoded describes the system model, including the channel and the
and combined (PC). We use these three scenarios to study the signal model. Section III analyzes three distinct channel es-
gains of pilot precoding and combining in terms of channel timation techniques that differ in terms of complexity and
3
B. Signal Model
A. Channel Model
Within one fading block, the baseband received signal
We assume a narrow-band block-fading channel between the
vector at the BS is
BS and each UE, where the channels are relatively constant K
for one fading block, with duration Tc channel uses, and X
y(t) = Hk xk (t) + z(t) , (5)
they change to statistically independent values in the next k=1
4
increases the dimension of the received signal. These extra TDD mode (in which the channel reciprocity principle holds),
observations, while having a constant number of unknown whereas PuC can be used in both TDD and FDD modes.
parameters, generally improve the channel estimation perfor- Note that the number of unique pilots required by PuC does
mance. On the other hand, increasing N reduces the received still scale by the number of UEs. Moreover, due to the lack
SNR, formulated in the previous paragraph. These two effects of receiver antenna gain, it may not completely solve the
cancel each other, making the average normalized MSE almost imbalanced pilot-data coverage problem, though substantially
independent of the number of UE antennas N . On the contrary, alleviate it compared to the nPuC scheme.
comparing Fig. 2(a) and Fig. 2(b) shows that adding more For mathematical tractability, we assume the availability of
antenna elements to the BS enhances the channel estimation of perfect second-order statistics information which allows us
performance. This can also be related to the increase in the to gain insights about the impact of different parameters on
dimension of the received signal by increasing the number the network performance (e.g., channel estimation quality and
of BS antennas, M . Another observation from Fig. 2 is that network throughput). In this section, we choose the precoding
the channel estimation performance improves for networks matrix Vk = Uk for each UE k, which simplifies the
with sparser scattering environments (smaller L), where this mathematical analysis and, at the same time, is asymptotically
improvement is almost linear in the large antenna regimes; see optimal in terms of maximizing the SNR [27]. No combining
Corollary 1. filter is considered at the BS in this scenario, that is Wk = IM .
Substituting the training matrix and spatial filters of scenario
B. Precoded and Uncombined Pilot Transmission (PuC) PuC into (10), the MMSE estimate of the channel becomes
H −1
In the second pilot transmission scenario, the pilots are (PuC) (PuC) (PuC) (PuC)
vec Hk b = Rk P̆k P̆k Rk P̆k 2
+ σz I
precoded using spatial filters at the UEs. However, no pilot
combining is performed at the BS. The channel estimation
quality in this scenario depends on the statistical information · vec Yτ(PuC) , (18)
of the channel available prior to pilot transmission. Assuming (PuC) (PuC) ∗ (PuC)
that Uk is available at UE k, either perfectly or with some where P̆k = Uk Pk ⊗ IM . {P̆k }K k=1 inherit the
(PuC) K
unbiased errors [26], the spatial filters can be designed to orthogonality property of {Pk }k=1 , similar to nPuC, and
help focusing the pilot energy along the strongest multipath therefore the received signals from UE j 6= k are canceled out
components between the UEs and the BS. Hence, precoding at the receiver in the estimation process of Hk .
the pilots boosts the SNR in the pilot transmission phase, e (PuC) = Hk −H
Define the estimation error matrix as H b (PuC) .
k k
which is specially beneficial for the UEs at the cell edges. The following propositions characterize the accuracy of the
Considering that there are L ≤ N paths between each channel estimation under precoded but uncombined pilot trans-
(PuC) (nPuC)
UE and the BS, Tτ = KL ≤ Tτ pilot symbols mission:
suffices for transmitting orthogonal training sequences through
Proposition 3. Consider the system model of scenario PuC.
all the paths.1 In other words, unlike scenario nPuC, where
The covariance of the channel estimation error matrix using
orthogonal pilots are assigned to the UE antennas, the number
orthogonal pilot transmissions given by (17) and an MMSE
of required orthogonal pilot sequences in scenario PuC is
estimator given by (18) is
equal to the number of multipath components. Clearly, this H
pilot transmission scheme leaves longer time for the data e (PuC)= E vec H e (PuC) vec H e (PuC)
R k k k
transmission phase, compared to the baseline scenario nPuC.
This brings a significant gain for the data transmission time, = δM σk2 (U∗k ⊙ Bk ) (19)
especially in wireless networks with small coherence time such T −1
H
as mmWave networks [14]. · IL +δM ζk R2Uk ◦RBk (U∗k ⊙Bk ) .
(PuC) (PuC)
Let Pk ∈ CL×Tτ be the pilot symbols transmitted by
UE k in scenario PuC, then orthogonality of training sequences Proof: A proof is given in Appendix B.
and the energy constraint (6) imply that Proposition 4. Consider the covariance of the channel esti-
(
H ρτ
IL k = j , mation error for UE k given by Proposition 3. The normalized
(PuC) (PuC) (PuC) e (PuC) )/tr(Rk ), is bounded as
Pk Pj = L (17) MSE, defined as ek = tr(R k
0 otherwise .
" (PuC)
#+
Note that although in this paper we are investigating the λ−1
max ǫ (PuC) λ−1
min
1− k ≤ ek ≤ , (20)
uplink channel estimation, pilot transmission scenario PuC 1+δM ζk δM ζk 1+δM ζk
entails the same complexity for downlink channel estimation.
In contrast, the complexity of the downlink channel estimation where λmin and λmax represent the minimum and maximum
in scenario nPuC is substantially higher than that of uplink eigenvalues of RUk , respectively and
channel estimation if M ≫ KN . As a result, in massive T 2
MIMO systems, the nPuC scheme is suitable only for the 1 − κ IL + δM ζk R2Uk ◦ RBk
(PuC)
ǫk = T ≥ 0.
1 It is shown in [24] that in the case of correlated channels between UE k and 4κ IL + δM ζk R2Uk ◦ RBk
the BS, the actual length of training sequence needed for channel estimation (21)
can be less than the number of transmitting antennas.
7
−5 comparing Fig. 3(a) and Fig. 3(b), we can see that when M
increases from 8 to 1024, a gain of 10 log10 (1024/8) ≈ 21 dB
−10
can be achieved.
−15 The gain of pilot precoding in PuC can be understood by
−20 comparing Fig. 2 and Fig. 3. Specifically, when the number
−25 L=2 of antenna elements at both the UE and BS sides is large, say
L=4 M = N = 1024, Fig. 3(b) shows 21, 24 and 27 dB lower
−30
L=8 MSE compared with Fig. 2(b) for the curves corresponding
−35 to L = 2, 4 and 8, respectively. This fact is in accordance
101 102 103
Number of UE antenna elements N
with (22).
(a) Network with M = 8. C. Precoded and Combined Pilot Transmission (PC)
−20
In the third scenario, in addition to precoding the pilots
Average normalized MSE (dB)
0 0
−5 −10
−10 −20
nPuC, Tτ =64
−15 −30 PuC, Tτ =8
PC, Tτ =4
−20 Tτ = 1 −40 PC, Tτ =2
Tτ = 4 PC, Tτ =1
−25 −50
10 20 30 40 50 −20 −15 −10 −5 0 5 10 15 20
Number of UEs in the cell K Pilot transmition energy ρτ (dB)
Fig. 5: Channel estimation performance in the precoded and combined pilot Fig. 6: The impact of pilot energy and number of transmitted pilot symbols on
transmission scenario (PC) as a function of number of UEs in the cell. the channel estimation performance in the three pilot transmission scenarios.
the cell for a network with M = 128, N = 32 and L = 4. where zeff is the effective noise at the BS which combines the
For the sake of simplicity, all the UEs are located at the receiver noise and the residual channel estimation errors.
same distance from the BS and therefore experience the same In the following, we first present the performance metric
path loss. The AoDs and AoAs are drawn independently from used for the performance evaluation of the network. Subse-
uniform random distributions in [−π/6, π/6] and [−π/3, π/3], quently, we study the precoding methods that are used in the
respectively. The results are averaged over 50 different re- data transmission phase.
alizations of AoAs and AoDs. Two pilot sequence length,
namely Tτ = 4, 1, are considered in this figure. In the case
A. Performance Metric
of Tτ = 4, no intra-UE interference exists, and only the inter-
UE interference degrades the channel estimation performance Although the optimal distribution of {xk }K
k=1 is not known,
when K increases. However, when Tτ = 1 both intra-UE using (28), the following lower bound for the sum-rate of the
and inter-UE interference contaminate the pilot symbols which network can be found as [28]:
leads to at least 5 dB poorer performance compared to the case " K
!#
Td X
when Tτ = 4. r= E log det IM + Rzeff −1 b k Rx H
H b H
, (29)
k k
Tc
k=1
Fig. 6 shows the average normalized MSE against the total
pilot transmission power using the same setup as in Fig. 5 where Rxk and Rzeff are the covariance matrices of xk and
when there are K = 2 UEs in the network. According to zeff , respectively, and are defined as
this figure, pilot precoding with N = 32 can achieve the
same estimation error as nPuC with almost 5 dB less power. H H ρd
Rxk = E xk xH k = Fk E sk sk Fk = Fk FHk , (30)
Alternatively, for a given total pilot transmission power, pilot Td
precoding can improve the estimation error by 6 dB compared XK h i
with the conventional nPuC. Moreover, the performance of Rzeff = E zeff zH
eff = E e
H k R x
e
H H 2
k + σz IM . (31)
k
PC with only 4 orthogonal pilots is almost identical to that k=1
of PuC with 64 orthogonal pilots. In other words, these
16 x shorter training length allows channel estimation of B. Data Precoding
substantially higher number of UEs within the same coherence For a single link MIMO network, when only imperfect CSI
budget, while also leaving more time for the data transmission. is available at the transmitter, it is shown that the eigenvectors
Therefore, in PC we expect a boost in the achievable rate with of the channel estimate covariance matrix represent the optimal
4 orthogonal pilots compared to that of PuC. transmit directions [29, Theorem 1]. More specifically, the
So far, we have analyzed the channel estimation quality precoder of UE k is designed in a way that its signal is
of three pilot transmission scenarios. In the next section, we transmitted in the direction of the right singular vectors of
investigate their effects on the achievable data rate. the estimated channel matrix H b k . For the sake of simplicity,
10
2 0.45
0.4
PuC PuC N =4
1.5 PC, Tτ = 1 PC , Tτ = 1
0.3
1
0.2
0.5
0.1
0 0
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 8 16 32 64 128
Normalized pilot energy ρ̄τ Number of BS antenna elements M
(a) Total energy ρτ + ρd = 0.1. (a)
PuC nPuC
6 PC, Tτ = 1 20 PuC
PC, Tτ = 1
4
10
2
0
0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1 0
Normalized pilot energy ρ̄τ 8 16 32 64 128 256
Number of BS antenna elements M
(b) Total energy ρτ + ρd = 0.5.
(b)
Fig. 7: Spectral efficiency as a function of normalized pilot energy ρ̄τ =
ρτ /(ρτ + ρd ), M = 128 and N = 32. The maximum of the curves are Fig. 8: Impact of number of antenna elements on the optimal pilot energy and
marked by circles. maximum spectral efficiency when ρτ + ρd = 4.
we choose to allocate equal powers to different eigendirections scenarios; see Fig. 6. Acquiring more accurate CSI, better
for all three pilot precoding scenarios. Note that considering precoders can be designed for the data transmission phase.
optimal power allocation does not change the insights that However, since the total energy is fixed, increasing the pilot
we gain from the comparative performance analysis of this energy leaves less energy for the data transmission, leading
paper, but complicates the mathematical analysis; see [29] to a well-known trade-off between the amount of energy
for the optimal power allocation algorithm. Formally, denote allocated to pilot and data transmission [11]. This trade-off is
the eigen-value decomposition of Rxk by Rxk = F̄k Λk F̄H k illustrated in Fig. 7 for three pilot transmission scenarios. From
b HHb bHb H
and that of H k k by Hk Hk = Ek Σk Ek . Here, we set both Figs. 7(a) and 7(b), PC and PuC always lead to higher
F̄k = Ek and Λk = ρd /(Td L)TL where TL is a diagonal spectral efficiencies than the conventional nPuC. Moreover, the
matrix whose first L diagonal elements are one and the rest maximum spectral efficiency that PC can reach is higher than
are zero. Comparing the designed covariance matrix of the PuC, though PuC has a better channel estimation performance;
data signal with the one in (30), the data precoding matrix see Fig. 6. Another observation from Fig. 7 is that the optimal
corresponding to UE k is found as pilot energy of PC and PuC are substantially smaller than that
r r of nPuC. Indeed, by pilot precoding and combining, a minimal
1 1
Fk = F̄k TL = Ek TL . (32) amount of pilot power that is allocated to a few (possibly
L L one) pilot symbols Tτ = 1 may lead to a very high spectral
Figs. 7 and 8 show the spectral efficiency of a network efficiency. Moreover, increasing the total energy shifts these
employing the aforementioned data precoding method. In optimal points to the left, implying that a smaller fraction of
these simulations, we assume that the channel estimation is the total energy needs to be allocated to the pilot transmission
performed according to the scenarios presented in Section III. phase.
We consider a cell with K = 2 UEs located at the same Fig. 8 shows the optimal pilot energy and the corresponding
distance from the BS, where the AoAs and AoDs are set maximum spectral efficiency against the number of antenna
similarly to the ones in Fig. 2. We also assume that each elements. From Fig. 8(a), the optimal pilot energy decreases
coherence block has length Tc = 128 channel uses and the with M in all three scenarios. On the contrary, the UEs with
number of paths between each UE and the BS is L = 4. larger number of antennas need more pilot transmission energy
As we increase the pilot transmission energy ρτ , the channel to reach the optimal performance (this can be observed by
estimation performance improves in all pilot transmission comparing solid lines to the dashed ones). This increment
11
TABLE II: Minimum number of unique pilots needed in the pilot transmission
is much higher in nPuC than other schemes. In general, the phase. “UL” and “DL” stand for uplink and downlink, respectively.
optimal pilot energy in PuC and PC is less sensitive to the
changes of M or of N . Fig. 8(b) presents the maximum nPuC PuC PC
spectral efficiency against M when N = 32. From this figure, N, M < ∞ KN KL KL
employing either PuC or PC for pilot transmission significantly N → ∞, M < ∞ KN K K
UL pilots
improves the maximum spectral efficiency compared to the N < ∞, M → ∞ KN KL L
conventional nPuC. In particular, the improvement is around N → ∞, M → ∞ KN K 1
80% with only 128 BS antennas. N, M < ∞ M KL KL
The following proposition gives a lower bound for the N → ∞, M < ∞ M KL L
achievable sum-rate using the aforementioned data precoding DL pilots
N < ∞, M → ∞ M K K
method and different pilot transmission scenarios of Sec-
N → ∞, M → ∞ M K 1
tion III.
Proposition 5. Consider the pilot transmission scenario x ∈
{nPuC, PuC, PC} and uplink data transmission using the data
precoding filter of (32) for UE k. Assume that {Uk }K
k=1 and entire network can be done with a single pilot symbol.
{Bk }K
k=1 are perfectly known. If M, N → ∞, then
ρd PK
! Corollary 5 suggests that although we have considered a
2
LTd Td k=1 σk
r≥ log 1 + . (33) single-cell scenario throughout this paper, our insights are
Tc Lσz2 valid in a multicell scenario, especially in the large antenna
Proof: A proof is given in Appendix B. regime. In fact, all the interference components in the pilot
transmission can be rejected either at the transmitter or at
Corollary 4. For given budgets of coherence time Tc and total
the receiver. In particular, from (23), the transmitter cancels
energy ρτ + ρd , the lower bound in (33) is maximized when
intra-UE interference, and the receiver cancels out inter-UE
ρτ → 0 and Td /Tc → 1 (or equivalently Tτ = 1).
interference (which can be readily extended to the inter-cell
Corollary 4 implies that in the large antenna regime, among interference). Notably, PC allows using fixed-length pilot se-
the three pilot transmission scenarios, only PC can reach the quences (almost) independently of the number of MU-MIMO
maximum spectral efficiency. users. As an extreme case, a single pilot symbol can be used
by multiple UEs. In real life deployments, even though spatial
V. F URTHER D ISCUSSIONS
separability based on channel covariance matrix knowledge
Equation (3) implies that the vectors b(θ) or u(θ) with is possible in PC, multiple antenna UEs benefit from code
different θ create an asymptotically orthonormal basis, which domain separability among pilot signals used to estimate the
can be used as orthogonal spatial signatures. More interest- channel of the different UE antennas. However, when the
ingly, asymptotically, there are infinitely many such spatial number of antennas at the BS grows large, full reuse of the
signatures (realized by changing θ). This leads to an interesting same pilot sequence among the UE antennas is possible. This,
and practically relevant consequence of pilot precoding and in turn, maximizes the number of symbols available for data
combining: maintaining orthogonality of pilots in the spatial transmission.
domain instead of code domain becomes possible. To appre-
ciate this aspect, we recall that in MU-MIMO systems with To generalize Corollary 5, Table II shows the number of
single antenna UEs, the orthogonality of pilot signals must unique pilots that the three pilot transmission scenarios need
be maintained in the code domain to avoid intra-cell pilot in different network settings. This table also includes the case
contamination effects. Therefore, there is an inherent trade- of downlink pilot transmission. From this table, it is clear
off between the number of symbols spent on constructing the that the training sequence lengths required in PC and PuC
pilot sequences and the number of symbols available for data are substantially smaller than that in nPuC. The minimum
transmission, as illustrated by Fig. 7. Due to the proposed pilot number of unique pilots in both nPuC and PuC scales up with
precoding schemes, this trade-off can be relaxed by creating the number of UEs; whereas there are situations in which
pilot signal separability in the spatial domain. We have the PC may need a constant number of pilots independently of
following conceptually important result: the number of transmit antenna elements or the number of
UEs. Specifically, when the number of receive antennas (M
Corollary 5. Consider the channel model in (1). Suppose in uplink and N in downlink) goes to infinity, a maximum of
that there are a limited number of paths between each UE L pilot sequences is enough to handle the channel estimation
and its serving BS and that the second-order statistics are of the entire network, irrespective of the number of UEs or
perfectly known. Let either the number of BS antennas M or BSs. Also, note that large values of N can model wireless
the number of UE antennas N tends to infinity, then 1) the pilot backhauling use cases. Moreover, this table illustrates that, for
contamination problem disappears, 2) a multi-cell network can example, as N → ∞ and M < ∞), PuC may benefit from
be modeled by multiple uncoordinated single-cell networks uplink pilots whereas PC may benefit from downlink pilots.
with no performance loss2 , and 3) channel estimation in the This disagreement indicates that employing pilot precoding
2 Note that inter-cell coordination may still bring gain to the resource and combining may relax the usual system design constraint
allocation performance [30]. of always relying on the uplink pilots for CSI acquisition.
12
VI. C ONCLUDING R EMARKS AND O UTLOOK The Khatri-Rao product of A of size M × L and B of size
The performance of MU-MIMO systems heavily dependents N × L is defined as
on the quality of the acquired CSI. The UEs can facilitate
A ⊙ B = [A]:,1 ⊗ [B]:,1 . . . [A]:,L ⊗ [B]:,L . (36)
the acquisition of high-quality CSI if they, contrarily to what
commonly assumed, exploit the multi-antenna capabilities. In Lemma 1 (Vectorization lemma [31]). For any three matrices
this paper, we showed that UEs can precode pilot signals A, B, and C with appropriate dimensions, we have
to drastically improve the quality of the CSI at the BS,
which in turn improves the spectral efficiency during data vec(ABC) = CT ⊗ A vec (B) . (37)
transmission. Moreover, the CSI further improves if pilot
precoding is carried along with pilot combining at the BS. The Lemma 2 ([31]). Consider matrices A ∈ CN ×M , B ∈ CR×S ,
aforementioned gains are more prominent when the channels C ∈ CM×L and D ∈ CS×L . The following equalities always
are sparse and the BS and UEs are equipped with a large hold:
number of antennas, both hold in mmWave networks.
(A ⊗ B) (C ⊙ D) = (AC ⊙ BD) ,
The insights of this work suggest a further development of
H H
the following major research questions (C ⊙ D) (A ⊗ B) = (AC ⊙ BD) ,
H
• The MMSE channel estimation used to obtain the esti- (C ⊙ D) (C ⊙ D) = CH C ◦ DH D .
mated channel in (12) (which is then used throughout
in Section III) inherently depends on exploiting side in- Lemma 3. For any positive definite matrix A ∈ CN ×N
formation lying in the second-order statistics (covariance N 2 N
X 1 1 + κ (A) X 1
matrices) of the channel vectors. As noted in [18], the role ≤ tr A −1
≤ ,
of covariance matrices is to capture structural information a
i=1 ii
4κ (A) a
i=1 ii
related to the distribution of the multi-path AoA at
the serving BS. Therefore, imperfect knowledge of the where aii and κ(A) are diagonal element i and the condition
channel covariance matrices entails channel estimation number of matrix A, respectively.
errors that result in precoding errors. Lemma 4 ([32]). For any two positive semi-definite (PSD)
• Since pilot precoding increases the spatial separation of
matrices A ∈ CN ×L and B ∈ CL×M ,
UEs, it potentially mitigates the affects of pilot con-
tamination by employing low rate multicell coordination λmin (A) tr (B) ≤ tr (AB) ≤ λmax (A) tr (B) ,
techniques proposed in, for example, [18]. This is because
the method proposed in [18] achieves better results if the where λmin (A) and λmax (A) are the smallest and largest
paths of different UEs do not overlap. In the asymptotic eigenvalues of A, respectively.
regime, if either the number of BS antennas NBS or Lemma 5. For any two real values 0 < a ≤ b, we have
the number of UE antennas NUE tends to infinity, pilot
contamination disappears since the UEs become perfectly 1+a a
≥ . (38)
orthogonal in the AoA domain. Thereby, a multi-cell net- 1+b b
work can be modeled by multiple uncoordinated single-
cell networks with no performance loss. A PPENDIX B: P ROOFS
• Although in this paper we considered the case of UL
pilot and data transmissions, the concept of precoded A. Proof of Lemma 3
pilots in PC and PuC can be employed for DL pilot The upper bound is given in [33], and the lower bound can
and data transmissions as well. As noted earlier, DL pilot be obtained by the Cauchy-Schwarz inequality. In particular,
precoding can be useful in FDD systems. let ei ∈ CN be the standard basis vector (with 1 in its i-th
entry and 0 otherwise). Then,
A PPENDIX A: P RELIMINARIES √ √ 2 (a)
In this subsection, we give preliminary linear algebra to 1 = e∗i A A−1 ei ≤ (e∗i Aei ) e∗i A−1 ei
prove the results in Appendix B. 1
≤ A−1 ii ,
Definition 1 (Products [31]). The Hadamard product of any [A]ii
two arbitrary matrices A and B, of the same size, is defined where (a) is due to the Cauchy-Schwarz inequality. The lower
as bound will be proved by summing both sides of the second
[A ◦ B]i,j = [A]i,j [B]i,j . (34) inequality over i.
The Kronecker product of A of size M × N and B of any
arbitrary size is defined as B. Proof of Proposition 1
[A]1,1 B . . . [A]1,N B From (11), it is straightforward to show that
.. .. ..
A⊗B= . . . . (35) H
(nPuC) (nPuC) ρτ
[A]M,1 B . . . [A]M,N B P̆k P̆k = I. (39)
N
13
(nPuC)
H
=Rk − Rk P̆k e
Rk
(PuC) e
=E vec Hk
(PuC) e
vec Hk
(PuC)
H −1 H
(nPuC) (nPuC) 2 (nPuC)
· P̆k Rk P̆k +σz I P̆k Rk =Rk − Rk P̆k
(PuC)
−1 H −1 H
(a) N σz2 · P̆k
(PuC)
Rk P̆k
(PuC)
+σz2 I P̆k
(PuC)
Rk
= Rk − Rk Rk + I Rk
ρτ
(b) ρτ (a) σz2 (PuC)
= (U∗k ⊙ Bk ) (U∗k ⊙ Bk )H (U∗k ⊙ Bk ) = δM σk2 (U∗k ⊙ Bk ) (U∗k ⊙ Bk )H P̆k
N σz2 δM σk2
−1 H −1
1 (PuC) ∗ H
+ I (U∗k ⊙ Bk )H , · P̆k (Uk ⊙ Bk ) + I (U∗k ⊙ Bk ) ,
δM σk2
where (a) follows by replacing for Rk from (4) and applying
where (a) is valid due to (39) and (b) holds by substituting
the matrix inversion lemma. The proof completes by noting
for Rk from (4) and applying matrix inversion lemma. The
that
proof will be completed using Lemma 2 and after some
straightforward algebraic manipulations. H ρτ
(PuC) (PuC) H
P̆k P̆k = (U∗k ⊗ I) (U∗k ⊗ I) ,
L
and applying Lemma 2.
C. Proof of Proposition 2
(a)
−1
where (a) follows from the commutative property of trace = tr Qk RT Uk ⊗ I ,
and after applying Lemma 2 and (b) holds since [RUk ]ii =
[RBk ]ii = 1 for i = 1, . . . , L. where Qk = (U∗k ⊗ I) Re (PuC) H
(U∗k ⊗ I) and (a) holds due to
k
Now, we are ready to prove Proposition 2. Using Proposi- commutative property of trace.
tion 1, the normalized channel estimation error is calculated −1
Now, since both Qk and RT Uk ⊗ I are PSD matrices,
as we can use Lemma 4 and write
ek
(nPuC)
= tr R e (nPuC) /tr (Rk ) tr (Qk ) −1 2 tr (Qk ) −1
k λ ≤ σH
e s2 ≤ λ , (42)
tr (Hk ) max tr (Hk ) min
k
(c) 1 −1 T
= tr IL +M ζk RT Uk ◦RBk RUk ◦RBk (41)
L where
1 1 −1
= 1− tr IL +M ζk RT Uk◦RBk , −1
M ζk L λ−1
max , λmin RT
Uk ⊗ I = (λmax (RUk ))−1 ,
−1 −1
λ−1
min , λmax RTUk ⊗ I = (λmin (RUk )) .
e (nPuC)
where (c) follows after substituting for tr (Rk ) and R k
from (40) and (13), respectively and applying Lemma 2.
Moreover, by taking the same steps as in proof of Proposi-
Considering that for i = 1, . . . , L
tion 2, it can be shown that
" #
IL + M ζk RT (PuC) +
Uk ◦ RBk ii
= 1 + M ζk , 1 ǫk tr (Qk ) 1
1− ≤ ≤ . (43)
1+δM ζk δM ζk tr (Rk ) 1+δM ζk
and applying Lemma 3 on the trace at the right hand side
(nPuC)
of (41) along with the fact that ek ≥ 0, the bounds in (15) By combining the results from (43) and (42), the proof is
follows after some straightforward algebraic manipulations. completed.
14
F. Proof of Corollary 3 [10] J. Choi, D. J. Love, and P. Bidigare, “Downlink Training Techniques for
FDD Massive MIMO Systems: Open-Loop and Closed-Loop Training
First note that when M → ∞, BH
= 0 for j 6= k. Now,
k Bj With Memory,” IEEE J. Sel. Top. Sign. Proces., vol. 8, no. 5, pp. 802–
(PC) 814, Oct. 2014.
by substituting for Rj and P̆kj into (24) from (4) and the
[11] B. Hassibi and B. Hochwald, “How much training is needed in multiple-
line after (23), respectively and then using Lemma 2, it is antenna wireless links?” IEEE Trans. Inf. Theory, vol. 49, no. 4, pp.
straightforward to show that limM→∞ Qk → 0. This implies 951–963, Apr. 2003.
that the inter-UE interference tends to zero when the number [12] S. Sesia, I. Toufik, and M. Baker, Eds., LTE - The UMTS Long Term
Evolution: From Theory to Practice, 2nd Edition Stefania Sesia, Issam
of BS antennas grows large. Toufik, Matthew Baker. WILEY, Jul. 2011, no. ISBN: 978-0-470-66025-
In PC, the non-orthogonality of the pilot sequences trans- 6.
mitted from different antennas of a UE leads to intra-UE [13] V. B. Yerrabommanahalli, “Method and Apparatus for Radio Link
Imbalance Compensation,” U.S. Patent US 2014/0 051 449 A1, 2014.
interference at the BS. However, RUk → IL when N → ∞, [14] H. Shokri-Ghadikolaei, C. Fischione, G. Fodor, P. Popovski, and
implying that the precoded pilots are transmitted through the M. Zorzi, “Millimeter wave cellular networks: A MAC layer perspec-
L paths without interfering with each others. Therefore the tive,” IEEE Trans. Commun., vol. 63, no. 10, pp. 3437–3458, Oct. 2015.
[15] O. Elijah, C. Leow, T. Rahman, S. Nunoo, and S. Z-Iliya, “A compre-
intra-UE interference tends to zero when the number of UE hensive survey of pilot contamination in massive MIMO - 5G system,”
antennas grows large. IEEE Commun. Surveys Tuts., vol. 18, no. 2, pp. 905–923, Nov. 2015.
[16] V. Saxena, G. Fodor, and E. Karipidis, “Mitigating pilot contamination
by pilot reuse and power control schemes for massive MIMO systems,”
G. Proof of Proposition 5 in IEEE Vehicular Technology Conference, Glasgow, Scotland, 2015, pp.
1–6.
Note that as M, N → ∞, H e (x) → 0 for x = nPuC, PuC and [17] B. Liu, S. Cheng, and X. Yuan, “Pilot contamination elimination
k precoding in multi-cell massive MIMO systems,” in Personal, Indoor,
PC according to Proposition 2, 4 and Corollary 3, respectively. and Mobile Radio Communications (PIMRC), 2015 IEEE 26th Annual
Therefore, independent from the pilot transmission scenario, International Symposium on, Aug. 2015, pp. 320–325.
in the large antenna regime Rzeff = σz2 IM . Substituting for [18] H. Yin, D. Gesbert, M. Filippou, and Y. Liu, “A coordinated approach
to channel estimation in large-scale multiple-antenna systems,” IEEE J.
Rzeff = σz2 IM and moreover Rxk and Fk from (30) and (32), Select. Areas Commun., vol. 31, no. 2, pp. 264–273, Feb. 2013.
respectively, in (29) yeilds [19] R. C. Daniels, J. N. Murdock, T. S. Rappaport, and R. W. H. Jr., “60
" !# GHz wireless: Up close and personal,” IEEE Microw. Mag., pp. S44–
K
Td ρd X S50, Dec. 2010.
r = E log det IM + Σk [20] O. E. Ayach, S. Rajagopal, S. Abu-Surra, Z. Pi, and R. H. Jr., “Spatially
Tc Td Lσz2 sparse precoding in millimeter wave MIMO systems,” IEEE Trans.
k=1
K
! Wireless Commun., vol. 13, no. 3, pp. 1499–1513, Mar. 2014.
(a) T
d ρd X [21] A. M. Sayeed, “Deconstructing multiantenna fading channels,” IEEE
≥ log det IM + E [Σk ] (44) Trans. Signal Processing, vol. 50, no. 10, pp. 2563–2579, Oct. 2002.
Tc Td Lσz2 [22] M. R. Akdeniz, Y. Liu, M. K. Samimi, S. Sun, S. Rangan, T. S.
k=1
PK ! Rappaport, and E. Erkip, “Millimeter wave channel modeling and
LTd ρd k=1 σk2 cellular capacity evaluation,” IEEE J. Sel. Areas Commun., vol. 32, no. 6,
= log 1 + , pp. 1164–1179, Jun. 2014.
Tc Td Lσz2 [23] J. Brewer, “Kronecker products and matrix calculus in system theory,”
IEEE Trans. Circuits Syst., vol. 25, no. 9, pp. 772–781, Sept. 1978.
where (a) is due to Jensen’s inequality. [24] E. Björnson and B. Ottersten, “A framework for training-based es-
timation in arbitrarily correlated Rician MIMO channels with rician
disturbance,” IEEE Trans. Signal Processing, vol. 58, no. 3, pp. 1807–
R EFERENCES 1820, Mar. 2010.
[25] S. Rangan, T. Rappaport, and E. Erkip, “Millimeter wave cellular
[1] 3GPP, “Study on elevation beamforming / full dimension (FD) multiple wireless networks: Potentials and challenges,” Proc. IEEE, vol. 102,
input multiple output (MIMO) for LTE, (release 13),” Technical Spec- no. 3, pp. 366–385, Mar. 2014.
ification Group Radio Access Networks, Technical Report TR 36.897, [26] 3GPP, TS 36.214 Evolved Universal Terrestrial Radio Access (E-UTRA)
Jun. 2015. Physical layer Measurements (Release 13), 3rd Generation Partnership
[2] S. Han, C.-l. I, Z. Xu, and C. Rowell, “Large-scale antenna systems Project Std., Jun. 2016.
with hybrid analog and digital beamforming for millimeter wave 5G,” [27] O. E. Ayach, R. W. Heath, S. Abu-Surra, S. Rajagopal, and Z. Pi, “The
IEEE Commun. Mag., vol. 53, no. 1, pp. 186–194, Jan. 2015. capacity optimality of beam steering in large millimeter wave MIMO
[3] 3GPP, User Equipment (UE) Radio Transmission and Reception (Re- systems,” in Proc. IEEE International Workshop on Signal Processing
lease 14), 3GPP Std. TS 36.101, Jun. 2016. Advances in Wireless Communications (SPACWC), Jun. 2012, pp. 100–
[4] O. Bejarano, E. Knightly, and M. Park, “IEEE 802.11 ac: from chan- 104.
nelization to multi-user MIMO,” IEEE Commun. Mag., vol. 51, no. 10, [28] T. Yoo and A. Goldsmith, “Capacity and power allocation for fading
pp. 84–90, Oct. 2013. MIMO channels with channel estimation error,” IEEE Trans. Inf. Theory,
[5] A. Osseiran, J. Monserrat, and P. Marsch, Eds., 5G Mobile and Wireless vol. 52, no. 5, pp. 2203–2214, May 2006.
Communications Technology. Cambridge University Press, 2016, iSBN [29] J. Wang and D. P. Palomar, “Worst-case robust MIMO transmission with
978-1-107-13009-8. imperfect channel knowledge,” IEEE Trans. Signal Processing, vol. 57,
[6] S. Kutty and D. Sen, “Beamforming for millimeter wave communica- no. 8, pp. 3086–3100, 2009.
tions: An inclusive survey,” IEEE Commun. Surveys Tuts., vol. 18, no. 2, [30] H. Shokri-Ghadikolaei, F. Boccardi, C. Fischione, G. Fodor, and
pp. 949–973, 2016. M. Zorzi, “Spectrum sharing in mmWave cellular networks via cell asso-
[7] F. Rusek, D. Persson, B. K. Lau, E. G. Larsson, T. L. Marzetta, ciation, coordination, and beamforming,” IEEE J. Sel. Areas Commun.,
O. Edfors, and F. Tufvesson, “Scaling up MIMO - opportunities and to be published.
challenges with very large arrays,” IEEE Signal Process. Mag., pp. 40– [31] S. Liu and G. Trenkler, “Hadamard, Khatri-Rao, Kronecker and other
60, Jan. 2013. matrix products,” Int. J. Inf. Syst. Sci, vol. 4, no. 1, pp. 160–177, 2008.
[8] T. Marzetta, “Noncooperative cellular wireless with unlimited numbers [32] R. A. Horn and C. R. Johnson, Matrix analysis. Cambridge university
of base station antennas,” IEEE Trans. Wireless Commun., vol. 9, no. 11, press, 2012.
pp. 3590–3600, Nov. 2010. [33] Z. Bai and G. H. Golub, “Bounds for the trace of the inverse and
[9] E. Björnson, E. G. Larsson, and T. Marzetta, “Massive MIMO: Ten the determinant of symmetric positive definite matrices,” Annals of
myths and one critical question,” IEEE Commun. Mag., pp. 114–123, Numerical Mathematics, vol. 4, pp. 29–38, 1996.
Feb. 2016.