Professional Documents
Culture Documents
Machine Learning For Reliable Mmwave Systems Blockage Prediction and Proactive Handoff
Machine Learning For Reliable Mmwave Systems Blockage Prediction and Proactive Handoff
Machine Learning For Reliable Mmwave Systems Blockage Prediction and Proactive Handoff
Reliability and latency are two main challenges for millimeter In this section, we describe the adopted mmWave system and chan-
wave (mmWave) wireless systems [1–3]: (i) The high sensitivity nel models. Consider the communication setup in Fig. 1, where a
of mmWave signal propagation to blockages and the large signal- mobile user is moving in a trajectory. At every step in this trajec-
to-noise ratio gap between LOS and non-LOS links greatly affect tory, the mobile user gets connected to one out of N candidate base
the link reliability, and (ii) the frequent search for new base stations stations (BSs). For simplicity, we assume that the mobile user has
(BSs) after link disconnections causes critical latency overhead [4]. a single antenna while the BS is equipped with M antennas. Ex-
This paper leverages machine learning tools to efficiently address tending the results of this paper to the case of multi-antenna users
these challenges in mobile mmWave systems. is straight-forward. Let hn,k denote the M × 1 uplink channel vec-
The coordination among multiple BSs to serve the mobile tor from the user to the nth BS at the kth subcarrier. If the user is
user has been the main approach for enhancing the reliability of connected to the nth BS, this BS applies a beamforming vector fn to
mmWave communication links [1–3]. In [1], extensive measure- serve this user. In the downlink transmission, the received signal at
ments were done for coordinated multi-point transmission at 73 the mobile user on the kth subcarrier can then be expressed as
GHz, and showed that simultaneously serving the user by a number
of BSs noticeably improves the network coverage. This cover- yk = h∗n,k fn s + v, (1)
age performance gain was also confirmed by [2] in heterogeneous
where data symbol s ∈ C satisfies E |s|2 = P , with P the to-
mmWave cellular networks using stochastic geometry tools. To
overcome the large training overhead and increase the effective tal transmit power, and v ∼ NC 0, σ is the receive noise at the
2
achievable rate in coordinated transmissions, especially for highly- mobile user. Due to the high cost and power consumption of mixed-
mobile applications, [3] proposed to use machine learning tools to signal components in mmWave large antenna array systems, beam-
predict the beamforming directions at the coordinating BSs from forming processing is normally done in the analog domain using net-
low-overhead features. Despite the interesting coverage gains of works of phase shifters [5]. The constraints on these phase shifters
The final goal of training the model is to find the parameters
embd, Wr , Wz , Wh , Ur , Uz , Uh , Wf , cr , cz , ch , cf that 0.95
and the BS, α is the angle between the direction of travel and the di- Deep
limit the beamforming vectors to be selected from quantized code- rection of the main scatterer/reflector
Fig. 3. (or the BS in the case of LOS),
books. Therefore, we assume that the BS beamforming vector fn is and Θn defines the beam-width of the beams used by BS n. Now, as-
Fig. 3.
selected from a quantized codebook F with size/cardinality |F| = suming that the current/serving BS n updates its beamforming vector
(t)
MCB . The codewords of this codebook are denoted as gm , m = every beam-coherence time and calling it a time step, we define fn
1, 2, ..., MCB . Further, we assume that the beamforming vector fn as the beamforming vector selected by the nth BS to serve the mo-
is selected from the codebook F to maximize the received signal bile user in the tth time step, with t = 1 representing the first time
power, i.e., according to the criterion step after the handing-over to the current BS. With this, we define
X ∗ the beam sequence of BS n until time step t, denoted Bt as
fn = arg max |hn,k gm |2 . (2)
n o
gm ∈F Bt = fn(1) , fn(2) , ..., fn(t) . (6)
k
We adopt a wideband geometric mmWave channel model [4] Further, we define st ∈ {1, 2, ..., N }, as the hand-off status at
with L clusters. Each cluster `, ` = 1, ..., L is assumed to contribute the tth time step, with st = n indicating the user will stay connected
to the current BS n in the next time step, and st 6= n indicating Fig. 4. that
with one ray that has a time delay τ` ∈ R, and azimuth/elevation an-
gles of arrival (AoA) θ` , φ` . Further, let ρn denote the path-loss be- the mobile user will hand-off to another BS in the t + 1 time step. It
tween the user and the n-th BS, and prc (τ ) represents a pulse shap- is important to note here that predicting a hand-off in the next time
ing function for TS -spaced signaling evaluated at τ seconds. With step is more general that predicting a blockage, as the hand-off can
Fig. 4.
this model, the delay-d channel between the user and the nth BS happen due to a sudden blockage or a better SNR. Therefore, Fig. 4. we
follows will generally adopt the hand-off prediction that implicitly include
link blockage prediction.
r L
In this paper, we will focus on stationary blockages, [1] F.
VI. B. Mismar and B.
Cwhich
ONCLUSION
M X
hn,d = α` p(dTS − τ` ) an (θ` , φ` ) , (3) leaves the user mobility as the main factor of the time-varying be- new radio aided by S
ρn
`=1 havior in the channel. To complete the formulation, we leverage R EFERENCES
ICC the Workshops, Kans
note that the beamforming[1]vector thatMismar
maximizes [2] A.signalAlkhateeb, O. El A
where an (θ` , φ` ) is the array response vector of the nth BS at the F. B. and the
B. received
L. Evans,
[1]andF. “Partially
hybrid
B. Mismar blind
and B.h
precoding
power, as defined in (2), heavily relies aided
new radio on thebyuser location
Sub-6 GHz and
LTE
AoAs θ` , φ` . Given the delay-d channel in (3), the frequency domain new radiosignaling,” Si
channel vector at subcarrier k, hn,k , can be written as the surrounding environment ICC geometry
Workshops,(including of
blockages)
Kansas City, ICC
MO, [3].USA, Maybyin
Selected aided
Topics
201
2014. Workshops, Kans
Therefore, we formulate [2]theA. problem
Alkhateeb, as O.
a El
prediction
Ayach, G.of the and R. Heath
[3][2]A. A.Leus,
Alkhateeb,
Alkhateeb, O.J. ElMA
D−1 hand-off status at time t + 1,and i.e.,hybrid
ŝt , given the beam-sequence
precoding for millimeter at
and hybridwaveprecoding
cellular s
2πk d
time t, Bt . Formally, the objective precoding and combin
X
hn,k = hn,d e−j K . (4) of this Topics
of Selected paper isintoSignal
maximize the
Processing,
of Selected vol.
Topics 8, in
no
probability of successful blockage/hand-off
2014. prediction defined Communications
as Mag
d=0 2014.
[3]
P [ŝtA.
= sAlkhateeb,
t |Bt ] .
J. Mo, [4] [3]R.
N. A.W.(7) Heath, N. J.
Gonzalez-Prelcic,
Alkhateeb, Gonz
and
Mo
Considering a block-fading channel model, {hn,k }K k=1 are assumed precoding and combining solutions “An overview
precoding for and ofcombin
signa
millimeter
to stay constant over the channel coherence time, denoted TC [6] . In this next section, we leverage machine learning
Communications Magazine,, systems,”
tools to ad-
vol. IEEE
52, no.
Communications 12, Journ
pp.
Mag 1
dress this problem. [4] R. W. Heath, N. Gonzlez-Prelcic, [4]no.R. 3,W.S.pp. 436–453,
Heath,
Rangan, N. W.
Gonz RA
3. PROBLEM DEFINITION AND FORMULATION “An overview of signal processing[5] V.“An Va,overview
J. Choi,
techniques T. mS
of for
signa
systems,”
4. DEEP LEARNING BASED PROACTIVE HAND-OFF IEEE Journal of multipath
systems,”
Selected fingerprintin
IEEE
Topics in Journ
Sign
Maintaining good link reliability is a key challenge for mmWave no. 3, pp. 436–453, April 2016. no. 3, pp. 436–453,
Transactions on Vehic A
communication systems, especially with mobility. This is mainly [5] V. Va, J. Choi, T. [6][5]
Shimizu,
In this section, we explain our proposed solution that uses deep A. V. G. Va, J.
Alkhateeb,
Bansal,Choi,
S.
and T.
AlexR S
due to the high sensitivity of mmWave signals to blockages, which multipath
learning tools, and more specifically fingerprinting
recurrent neural networks, multipath
for learning
millimeter fingerprintin
to ef- coordinated
wave v2i b
Transactions onhighlight
Vehicular Transactions vol.onPP,Vehic
can frequently cause link disconnections. Further, when the link ficiently predict hand-off/blockages. First, we theTechnology,
systems,”
key idea IEEE Accesno
[6] A. Alkhateeb, S. Alex, [7] P. [6] A. Alkhateeb,
Varkey,
K. Cho, Y. Li,
B. Q. Van
S.
Qu,Alex
Meran
learning coordinated beamforming learningforcoordinated
highly-mo
properties
systems,” IEEE Accesm
of neural
systems,” IEEE Access, pp. in 1–1, 2018.
[7] K.Proc. Cho,ofB.SSST,Van 201Mer
[7] K. Cho, B. Van Merriënboer, P.D.Kingma
[8] D.properties Bahdanau, and andJ. m B
of neural
properties of neural machinearXiv translation:
in preprint
Proc. Encoder
of SSST,arXiv:1
201
in Proc. of SSST, 2014. [8] D. P. Kingma and J. B
[8] D. P. Kingma and J. Ba, “Adam: A method for sto
⇥ yt ŝt where embdxt Sequence
The ismain
= embd (bprocessing:
a lookupt ) table
value Thelayer
we
of this central
learn iscomponent
during
(8) to ...the of our model
training.
XXX.
t-max
1 h (.)
ft-max
nnected Layer
ax
d ŝ(.)t U hytwhere aThelookup
U z ŝt isŝthen main value
where is of
table thisislearn
a we layer isduring
to ...Unit
aRecurrent
lookup XXX.
table
the we learn
training. during the training.
nected Layer
oft-max
connected Layer
ted Layer
onnected
lly-connected Layer
Fully-connected (c) 2014 Robert W Heath Jr.
soft-max
⇥ embd
h (.)
embd 3
) tthisbeam index, then our
Gated a model ofstarts with an embedding layer
yt tth beam index, our modelxstarts with
thisan embedding ...(GRU)
layer [7] — form neural
Wrh
tThe main
Sequence value = of
t Sequence
embd
processing: layer
processing:
(b
The tcentral to XXX.
The of
component our(8)
central component of our model
model The amain isvalue of this layer is training.
to
for...processing XXX.
h
Gated
The Recurrent
main ŝt value where Unit of (GRU)
this islayer lookup
atable
tovector we
...Unit XXX. xlearn during the
soft-maxLayer
1 h (.)
⇥ that maps every embd
beam index bt networks
to that is best
t .The that suited
maps every beam index variable bt tolength a vector xt .
yU
aSequence processing:
issequnces.a anGated ... Recurrent central Unit component (GRU) of[7]our at— model
of aour form of neural
t
U index, hThe main ismodel value Gated of this Recurrent layer isembedding
toGRU (GRU)
XXX. is alayer [7] —
recursive a form
network of neural
that runs every time
h (.) z
beam z then our Sequence
starts with processing: The central component model
Fully-connected Layer
tth Sequence whereprocessing: is a lookup
The centraltable we
component learn during of ourthe training.
model
soft-max
Uz ⇥
ŝ
+
t
embd
Sequence is
networks a Gated
processing: that Recurrent
is best suited Unit for (GRU) processing [7] —variable a form length of neural
that maps t every beam index b x to= a embd networks
vector step The (b of
x central
) . the that input,component is and best it ofmaintains
our
suited
(8) modelafor hidden processing
x state
= embd q variable
which
(b ) length (8)
U z U zht is Delay W a r Gated The
W z is main
Recurrent
a Gated value
sequnces.
networks is tof
Recurrent
W ha this
t
Unit Gated
GRU that layer
(GRU)
Unit
is isaisaRecurrent
t tis to ... XXX.
(GRU)
recursive
best
function [7] [7]
suited
of —network
—
the Unit
aaprevious
forform
form (GRU)
that ofof
processing neural
runs
state [7]
neural
at
and every —
variable a form
t
time
the current
t
length of neural
input.
t
In
+ ht
Sequence networks
processing: sequnces. that
The is
central GRU best is
suited
where a recursive for
The processing
final network
is model agoal lookup ofhelps that
variable
training
tablewith runs
wethe at
length
learn every
model during time
is tothe find the parameters
training.
ŝt acomponent ofmechanism
our
soft-max
where embd networks isxtstepasequnces.
=lookup
that of issuited table
best we
suited learn for during
processing the (8) training.
variable embd length
soft-max
Fully-connected Layer
1 indicating thethe will hand-off to 1,k
t
step of addition, input, it has anda it t
special maintains g
gating
r at hidden
t
mechanism
r t state
1 q r
that t which helps with long
ext
euralt +time
sequnces.sequences.
step, ŝtaGRU
,andgiven st More
isthe a2recursive
past formally, gin
network
t Fig.
GRU
2, that
Uand
runsis1is implemented
at every time
+described )q xt as
by h+
the Udepicted
qfollowing model equations
0.8
Embedding Layer
the
1 time
hand-off
step,
is status function at
=Delay
the
0of indicating
rt the = z(W
2 previous
that
= r xtg+ (W
state rzq xandtt +
r = + U
the czz rq t 1 is
)tcurrent
(W h+(W acxinput.
zfunction
+ U
N,k
In q of
(r(9) t the
+ c 1 )(10)
qt previous
)+ cq ) ,state index andbtthe
(11) to acurrent
vector
(9) xt . In
input.
step of the sequences. input, and it More maintains rtformally, = by a ghidden (W GRU xstate +cisU gqimplemented
tr q which r 1 t+ cfrn) as r tdepicted (9)
recurrent
ill past stay inneural
connected Fig.
addition, 2,to the and
itthe has is
current described
aztspecial= BSgq(W in
tgating=thezx (1tand +mechanism
U zthe rtfollowing
tz)qt current q1ttt+is1that z ) helps
t model
addition, with long itequations
has(10)
1 r
a special gating mechanism that helps with long
soft-max
g, given
hand-off the isuser astatus
the will
function
past athand-off
the intofFig. to2, previous
and iszstate described where the by x the the
followinginput.
input In
vector, model q+ t isequationsthe hidden state, ŝt GRU rt and zt (9)
0.75
the
ral
he
ast current
hand-off status thetime-varying
where xt is theand input the vector,
bias 1
vectors qt is cthe hidden state, rt parameters
r , cz , cq are model
and zt where
that are embd is a lookup table we learn during the training.
in To Fig.BS 2,in z and the = isthe described (W x by + the U followingq + modelc ) equations (10)
dhannel. step,
s
the = user and
0 complete
indicating
will s =
t hand-off
t r 0 = indicating
that
gz
where formulation,
gt to =
(W z x is(W
t + that
the U we
x
input q qz + = z
U
vector,
t + 1(1cq ) z
(W
is
z + +)
the c z x q
)
hidden + U (W
state, r (r = (9) x q
and (W
+ U ) x (10)
q (r
+ + U
ctqphase. ,rq
)de- qtt 11(11) )++crcSequence
)q ) , (11)
processing: (9)The central component of our model is a
hat the
status
tationary
t
thecurrent atblockages,
beamforming the t
which
vector
x are
rtg t ”gates”
that maximizes
z r t tthat t 1learnedt help
z rt the q h 1 model
tduring t q zthe t t learn
t machine 1 h from q t r
t qt long
learning g t training
t z 1tdistance
r t
Finally, g
0.65
Next, we describe the key elements of the proposed machine learning where the reference prediction vector p has 1 in the entry corre-
model which is based on recurrent neural networks. sponding to the index of the correct BS in st , and zero otherwise.
0.95
BS 2
0.9
BS 1
Blockage
0.8
0.75
0.7
Fig. 3. This figure illustrates the considered simulation setup where 0.65
two candidate BSs, each has ULA, serve one vehicle moving in a
street. 0.6
0.55
0 2 4 6 8 10 12 14 16 18
Further, the model prediction vector p̂ has the dth entry equals to Deep Learning Dataset Size (Thousand Samples)
softmax(a)d , ∀d.
The final goal of training the model is to find the parameters Fig. 4. This figure shows that the proposed deep-learning proactive
embd, Wr , Wz , Wh , Ur , Uz , Uh , Wf , cr , cz , ch , cf that mini- hand-off solution successfully predicts blockages/hand-off with high
mize this loss for all training instances. probabilities when the machine learning model is well-trained.
5. SIMULATION RESULTS
reliability of next-generation mmWave systems.
In this section, we evaluate the performance of the proposed deep-
learning based proactive hand-off solution. 6. CONCLUSION
Simulation setup: We adopt the mmWave system and chan-
nel models in Section 2, with two candidate BSs to serve one vehi- In this paper, we proposed a novel solution for the reliability and la-
cle/mobile user moving in a street, as depicted in Fig. 3. To generate tency problem in mobile mmWave systems. Our solution leveraged
realistic data for the channel parameters (AoAs/delay/etc.), we use deep learning tools to efficiently predict blockage and the need for
the commercial ray-tracing simulator, Wireless InSite [8], which is hand-off. This allows the mobile user to proactively hand-off to the
widely used in mmWave research [6, 9], and is verified with channel next BS without disconnecting the session or suffering from high la-
measurements [9]. Each BS is installed on one lamp post at height tency overhead due to sudden link disconnections. The simulation
4 m, and employs a 32-element uniform linear array (ULA) facing results showed that the developed proactive hand-off strategy can
the street. The mobile user is moving in straight-line trajectories successfully predicts blockage/hand-off with high probability when
in the street, that can be any where of the street width, and with the machine learning model is trained with reasonable dataset sizes.
maximum trajectory length of 160 m. The trajectory starting point is In the future, it is interesting to extend the developed solution to
randomly selected from the first 40m of the street, and the user speed multi-user systems and account for mobile blockages.
is randomly selected from {8, 16, 24, 32, 40} km/hr. The BS selects
its beamforming vector from a uniformly quantized beamsteering
7. REFERENCES
codebook with an oversampling factor of 4, i.e. the codebook size is
MCB = 128. At every beam coherence time, the BS beamforming [1] G. R. MacCartney, T. S. Rappaport, and A. Ghosh, “Base sta-
vector is updated to maximize the receive SNR at the user. During tion diversity propagation measurements at 73 GHz millimeter-
the uplink training, the MS is assumed to use 30dBm transmit power, wave for 5G coordinated multipoint (CoMP) analysis,” in 2017
and the noise variance corresponds to 1GHz system bandwidth. The IEEE Globecom Workshops (GC Wkshps), Dec 2017, pp. 1–7.
system is assumed to operate at 60GHz carrier frequency.
We consider the deep learning model described in Section 4.2. [2] D. Maamari, N. Devroye, and D. Tuninetti, “Coverage in
The neural network model has an embedding that outputs vectors of mmwave cellular networks with base station co-operation,”
length 20 to the GRU unit, with maximum sequence length of 454. IEEE Transactions on Wireless Communications, vol. 15, no.
Since we have only 2 BSs in out experiment, the fully-connected 4, pp. 2981–2994, April 2016.
layer has only two outputs that go to the softmax function. We use [3] A. Alkhateeb, S. Alex, P. Varkey, Y. Li, Q. Qu, and D. Tu-
the Adam optimizer [10]. In the deep learning experimental work, jkovic, “Deep learning coordinated beamforming for highly-
we used the Keras libraries [11] with a TensorFlow backend. mobile millimeter wave systems,” IEEE Access, pp. 1–1, 2018.
Hand-off/Blockage prediction: To evaluate the performance
[4] R. W. Heath, N. Gonzlez-Prelcic, S. Rangan, W. Roh, and
of the proposed deep-learning based proactive hand-off solution,
A. M. Sayeed, “An overview of signal processing techniques
Fig. 4 plots the blockage/hand-off successful prediction probability,
for millimeter wave MIMO systems,” IEEE Journal of Selected
defined in (7), versus the training size. Fig. 4 shows that with suf-
Topics in Signal Processing, vol. 10, no. 3, pp. 436–453, April
ficient dataset size (larger than 12 thousand samples), the machine
2016.
learning model successfully predicts the hand-off with more than
90% probability, given only the sequence of past beams Bt . This [5] A. Alkhateeb, Jianhua Mo, N. Gonzalez-Prelcic, and R.W.
illustrates the potential of the proposed solution in enhancing the Heath, “MIMO precoding and combining solutions for
millimeter-wave systems,” IEEE Communications Magazine,,
vol. 52, no. 12, pp. 122–131, Dec. 2014.
[6] V. Va, J. Choi, T. Shimizu, G. Bansal, and R. W. Heath, “In-
verse multipath fingerprinting for millimeter wave v2i beam
alignment,” IEEE Transactions on Vehicular Technology, vol.
PP, no. 99, pp. 1–1, 2017.
[7] Kyunghyun Cho, Bart Van Merriënboer, Dzmitry Bahdanau,
and Yoshua Bengio, “On the properties of neural machine
translation: Encoder-decoder approaches,” in Proc. of SSST,
2014.
[8] Remcom, “Wireless insite,” http://www.remcom.com/
wireless-insite.
[9] Q. Li, H. Shirani-Mehr, T. Balercia, A. Papathanassiou, G. Wu,
S. Sun, M. K. Samimi, and T. S. Rappaport, “Validation of a
geometry-based statistical mmwave channel model using ray-
tracing simulation,” in 2015 IEEE 81st Vehicular Technology
Conference (VTC Spring), May 2015, pp. 1–5.
[10] Diederik P Kingma and Jimmy Ba, “Adam: A method for
stochastic optimization,” arXiv preprint arXiv:1412.6980,
2014.
[11] Frdric Branchaud-Charron, Fariz Rahman, and Taehoon Lee,
“Keras,” .