Ming_May2020

C OMMUNICATIONS T HEORIES & S YSTEMS
Fully Connected Feedforward Neural Networks Based CSI

Feedback Algorithm
Ming Gao* , Tanming Liao, Yubin Lu
School of Telecommunication Engineering, Xidian University, Xi’an 710071, China

*
The corresponding author, email: mgao@mail.xidian.edu.cn
Abstract: In modern wireless communication sys- side [2]. However, to fully utilize the potential gains of
tems, the accurate acquisition of channel state infor- massive MIMO, knowledge of channel state informa-
mation (CSI) is critical to the performance of beam- tion (CSI) at the BS is essential. For example, many
forming, non-orthogonal multiple access (NOMA), works about NOMA have assumed perfect CSI or the
etc. However, with the application of massive MIMO order of the instantaneous channel gain which can ob-
in 5G, the number of antennas increases by hun- tained from CSI feedback [3–5]. However, imperfect
dreds or even thousands times, which leads to exces- CSI will seriously affect the performance of NOMA
sive feedback overhead and poses a huge challenge to system [6–8]. Due to the rapid increase of feedback
the conventional channel state information feedback overhead in massive MIMO systems, the conventional
scheme. In this paper, by using deep learning technol- CSI feedback methods are difficult to apply to massive
ogy, we develop a system framework for CSI feedback MIMO systems. As a result, many researches have
based on fully connected feedforward neural networks been motivated to reduce the CSI feedback overhead
(FCFNN), named CF-FCFNN. Through learning the in massive MIMO systems [9–13]. In these efforts,
training set composed of CSI, CF-FCFNN is able to [9–12] mainly uses the spatial or temporal correlation
recover the original CSI from the compressed CSI of the massive MIMO system to reduce the feedback
more accurately compared with the existing method overhead. Specifically, these methods design the algo-
based on deep learning without increasing the algorithm based on the idea of compressed sensing (CS),
rithm complexity. which converts the channel matrix into a sparse vec-
Keywords: massive MIMO; CSI feedback; deep tor that can be sparsely represented on some orthogo-
learning; fully connected feedforward neural network nal basis. Then, the sparse vector is compressed and
reconstructed so as to reduce the feedback overhead.
I. INTRODUCTION However, in practice, the sparse vector of the channel
matrix is only approximately sparse on certain orthog-
AS one of the critical candidate technologies of 5G onal basis, such as two-dimensional discrete Fourier
wireless communication [1], massive MIMO can fur- transform(2D-DFT) and two-dimensional discrete co-
ther improve the transmission efficiency and capacity sine transform (2D-DCT). This insufficient sparsity
of wireless communication systems and effectively re- makes it difficult for the recovery algorithm in CS to
duce the interference between users and the energy recover the channel matrix accurately from the sparse
consumption of the receiver or transmitter by deploy- vector.
ing a large number of antennas at the base station (BS) In [13], a novel CSI feedback scheme based on
deep learning is proposed. The compression and re-
Received: May 28, 2020 construction process of channel matrix is fulfilled by
Revised: Aug. 5, 2020
Editor: Wei Duan
a neural network called CsiNet, which is similar to the
China Communication · January 2021 43
Authorized licensed use limited to: MAULANA AZAD NATIONAL INSTITUTE OF TECHNOLOGY. Downloaded on February 05,2024 at 10:22:36 UTC from IEEE Xplore. Restrictions apply.
autoencoder [14]. By learning the characteristics of where hk ∈ C Nt ×1 , xk ∈ C and nk ∈ C denote
the channel matrix from the training set, this scheme the channel vector, transmit data symbol and addi-
can recover CSI with greatly improved reconstruction tive noise of the kth subcarrier, respectively, wk ∈
quality compared with existing CS-based methods. C Nt ×1 denotes the beamforming or precoding vector
In this paper, in order to further improve the re- designed by the BS based on the downlink CSI. By
construction accuracy of the channel matrix, we con- sequentially stacking Nc channel vectors, the channel
sider using the fully connected-feedforward neural matrix in the spatial-frequency domain is represented
networks (FC-FNN) to build a CSI feedback network, as H = [h1 , h2 , · · ·, hNc ] ∈ C Nt ×Nc .
named CF-FCFNN. Different from the CsiNet built by To reduce the feedback overhead and the complexity
a convolution neural network (CNN) which is more of the training set, we transform the spatial-frequency
suitable for image processing, CF-FCFNN built by domain channel matrix H into the angle-delay domain
FC-FNN can extract spatial features sufficiently, so channel matrix H 0 through 2D-DFT as follows:
as to further improve the reconstruction performance.
The simulation results show that the reconstruction H 0 = Ds HDf , (2)
quality of CF-FCFNN is better than CsiNet. In addi-
tion, our proposed network architecture is comparable where Ds ∈ C Nt ×Nt and Df ∈ C Nc ×Nc are both 2D-
to CsiNet in terms of the algorithm runtime. DFT matrices. Due to limited multipath time delay,
The rest of the paper is organized as follows. Sec- only the first N c0 (N c0 < Nc ) columns of H 0 contain
tion II provides the system model for CSI feedback we values. Therefore, within allowable range of accuracy,
0
considered in this paper. Section III explains the ar- we can truncate H 0 into H 00 ∈ C N t×Nc by retaining
chitecture of CF-FCFNN and the deep learning algo- the first N c0 columns of H 0 and removing remaining
rithms we used in detail. Simulation results that verify columns. The conversion from H to H 00 is called pre-
the effectiveness of CF-FCFNN are presented in Sec- processing. In the research, we focused on designing
tion IV. Finally, Section V provides the conclusions. the CSI encoder and the CSI decoder with excellent
performance. The CSI encoder is used to transform the
II. SYSTEM MODEL preprocessed complex channel matrix H 00 into an M-
dimensional (M N, N = Ntotal = 2 × Nt × N 0 c)
Consider a flat slow-fading downlink massive MIMO compressed vector v, that is,
system with Nt transmit antennas at a BS and a single
receive antenna at a UE operating in the FDD mode. It v = CSIEncoder(H 00 ). (3)
is assumed that the UE can obtain a perfect downlink
channel matrix by pilot-based training [15]. The chan- The data compression ratio is defined as follows:
nel feedback model of single-user MIMO system is
shown in Figure 1. The system is operated in orthogo-
CR = M/N . (4)
Base Station
Precoding
CSI
H UE
Channel
Preprocessing
CSI Then the compressed vector v is sent to the BS

Decoder Estimation Encoder

through the feedback link by the UE. The CSI decoder
Nt
is used to reconstruct H 00 from the compressed vector
Feedback Channel
v at BS, and the reconstructed channel matrix is repre-
sented by H b 00 , that is,
Figure 1. The channel feedback model of single-user MIMO
system.
b 00 = CSIDecoder(v).
H (5)
nal frequency division multiplexing (OFDM) over Nc
subcarriers. The received signal at the kth subcarrier Finally, the original spatial-frequency domain chan-
can be expressed as follows: nel matrix H is obtained by sequentially perform-
ing zero-padding and two-dimensional inverse dis-
yk = hH
k wk xk + nk , (1) crete Fourier transform (2D-IDFT) on H 00 . The encod-
44 China Communication · January 2021
CSI CSI
Encoder Decoder
UE Feedback link BS
M×1 M×1
the real and imaginary the real and imaginary
parts of channel matrix Compressed vector parts of reconstructed
˄2,32,32˅ channel matrix
v
˄2,32,32˅
N×1 N×1
reshape
Dense; Sigmoid
Dense; LeakyReLU(alpha=0.3)
copy
Figure 2. The CSI feedback network based on the fully connected feedforward neural network
ing process and decoding process can be understood as hidden layers to six. The input layer of the CSI de-
the operation of compressing and reconstructing the coder is composed of M neurons that considers the
complex channel matrix H respectively. compressed vector as input and outputs a vector of
size N , which serves as an input of subsequent hid-
III. CF-FCFNN den layers. For the number of neural network lay-
ers in CF-FCFNN is directly related to performance
We develop a CSI feedback network based on fully of CF-FCFNN, we introduce the residual block of the
connected feedforward neural networks for CSI com- deep residual network [16, 17] into the CSI decoder
pression and reconstruction, named CF-FCFNN. The to avoid the degradation of the performance caused
CSI feedback network based on the fully connected by the depth of CF-FCFNN. Consider splitting the six
feedforward neural network is shown in Figure 2. hidden layers into two residual blocks, each of which
CF-FCFNN consists of a CSI encoder and a CSI consists of three hidden layers. In each residual block,
decoder, which are both built from neural networks. there is a shortcut connection between the first layer
We exploit the CSI encoder to compress the complex and the third layer, so as to pass data flow to later
channel matrix. The CSI encoder consists of an input layers directly. Besides, the activation function be-
layer and an output layer, which are fully connected by tween the neural network layers of the two residuals
weighting values. We reshape the real and imaginary blocks is set as the leaky version of a rectified linear
parts of the preprocessed complex channel matrix H 00 unit (LeakyReLU), that is,
into a vector with dimension of N (N = 2×Nt ×Nc 0 ),
which serve as the input of the CSI encoder. The out-
n
x, if x>0,
LeakyReLU (x) = αx, otherwise, (6)
put of the CSI encoder is the compressed vector v,
which is obtained by reducing the dimension M of the
output layer. where α ∈ (0, 1). After two residual blocks, the CSI
Once the BS acquires the compressed vector v from decoder outputs the reconstruction of H 00 , which is
the UE through the feedback link, it uses the CSI de- b 00 .
represented by H
coder to reconstruct the original complex channel ma- We determine the number of hidden layers and
trix H 00 from the compressed vector. To fully extract residual blocks of the neural network in CF-FCFNN
the feature of the channel matrix, the CSI decoder is through experiments. The experimental results show
composed of an input layer, several hidden layers and that six hidden layers and two residual blocks can
an output layer. In this paper, we set the number of achieve good recovery performance. Further increas-
ing the hidden layers of CF-FCFNN or the number of Table 1. NMSE in dB, Cosine Similarity and Runtime in ms.
residual blocks will not significantly improve the qual- Feedback NMSE Runtime
ity of recovery, but will increase the computational Scenario CR
Model (dB) (ms)
complexity. CsiNet −18.71 0.99 1.3
1/4
CF-FCFNN −20.07 0.99 1.1
In addition, in order to accelerate the convergence CsiNet −14.05 0.98 1.3
1/8
speed of network, we introduce batch normalization CF-FCFNN −15.14 0.98 1.1
CsiNet −8.87 0.94 1.3
[18] to all layers in CF-FCFNN, which maps each Indoor 1/16
CF-FCFNN −12.35 0.97 1.1
layer of input to [0, 1]. Therefore, the output layer will CsiNet −7.88 0.92 1.3
1/32
map the output value to [0, 1] using the sigmoid func- CF-FCFNN −8.86 0.93 1.1
CsiNet −4.88 0.82 1.3
tion in the CSI decoder. 1/64
CF-FCFNN −6.6 0.89 1.1
To obtain all the weight and bias values of the CSI 1/4
CsiNet −9.01 0.92 1.3
CF-FCFNN −11.61 0.95 1.1
encoder and CSI decoder, we train them with end-to-
CsiNet −7.2 0.89 1.3
end learning methods of deep learning. We use f () 1/8
CF-FCFNN −10.08 0.93 1.1
to represent a well-trained CF-FCFNN, with CSI en- CsiNet −4.8 0.81 1.3
Outdoor 1/16
CF-FCFNN −9.12 0.92 1.1
coder fen and CSI decoder fde . When the input of CsiNet −3.14 0.71 1.3
1/32
CFN-FCFNN is H 00 , the reconstructed channel matrix CF-FCFNN −8.42 0.91 1.1
CsiNet −2.02 0.59 1.3
can be represented as follows: 1/64
CF-FCFNN −7.25 0.88 1.1
b 00 = f (H 00 ; W, b)
H
(7) obtain H 00 with dimension of 32 × 32. The training,
= fde (fen (H 00 ; Wen , ben ); Wde , bde ),
validation and testing sets contain 100000, 30000, and
20000 samples for offline training or online testing re-
where Wen and ben denote the set of weights and bi-
spectively. The epochs, learning rate and batch size
ases for the CSI encoder respectively, Wen and ben
are set as 1000, 0.001, and 200, respectively. The al-
denote the set of weights and biases for the CSI de-
pha of LeakyReLU function is set as 0.3. The per-
coder respectively. We choose Adam [19] as the op-
formance of the network architecture is measured by
timized algorithm for updating the set of parameters,
the normalized mean square error(NMSE) and the co-
and use the mean square error (MSE) as the loss func-
sine similarity [13]. The difference between the recon-
tion, which is defined as:
structed channel matrix H b and original channel matrix
T
H is quantified by NMSE, which is defined as follows:
1X c 2
L(W, b) = Ht − Ht 00 , (8)
T t=1 20

2

N M SE = E H −H
b / kHk22 . (9)
2
where T is the number of samples in the training set,
k·k2 is the Euclidean norm. The cosine similarity is defined as follows:
 
IV. SIMULATION RESULTS AND ANALY- 
 1 X Nc c H
hk hk  
SIS ρ=E , (10)
 Nc
 k=1 c
hk khk k2 

In this section, we describe the detailed setup of our 2
experiments and compare our approach with existing

deep learning-based approaches. Referring to the de- where hck is the reconstructed channel vector of the kth
fault settings for all parameters in [20], we gener- subcarrier.
ate channel matrices under two scenarios through the We performed the experiments with CR values of
COST 2100 channel model. We use uniform linear ar- 1/4, 1/8, 1/16, 1/32, and 1/64, respectively. In addi-
ray (ULA) with Nt = 32 and Nc = 1024 subcarriers. tion, CsiNet and CF-FCFNN are trained and tested on
In the preprocessing of the spatial-frequency channel the Nvidia GeForce GTX 1050 Ti GPU. In all exper-
matrix, we only retain the first 32 columns of H 0 to iments, all parameters of CsiNet follow their default
setting in [13]. The corresponding NMSE, ρ and run- tions on Wireless Communications, vol. 9, no. 11, 2010, pp.
time of CsiNet and CF-FCFNN for each CR value are 3590–3600.
[3] Z. Ding, P. Fan, et al., “Impact of user pairing on
summarized in Table 1. 5g nonorthogonal multiple-access downlink transmissions,”
It can be seen from the table that the performance IEEE Transactions on Vehicular Technology, vol. 65, no. 8,
of CF-FCFNN is significantly better than CsiNet with 2016, pp. 6010–6023.
[4] C. Li, Q. Zhang, et al., “Price-based power allocation for
different CR values, especially in outdoor scenarios.
non-orthogonal multiple access systems,” IEEE Wireless
For example, the reduction of CR will cause distortion Communications Letters, vol. 5, no. 6, 2016, pp. 664–667.
to increase, therefore, ρ will decrease. With the re- [5] P. Wu, J. Zeng, et al., “On energy efficiency optimization in
duction of CR, the performance of ρ of CF-FCFNN downlink mimo-noma,” in 2017 IEEE International Con-
ference on Communications Workshops (ICC Workshops),
scheme is obviously better than CsiNet scheme. Fur- 2017, pp. 399–404.
thermore, the online runtimes of the two networks are [6] X. Wang, J. Wang, et al., “Outage analysis for downlink
almost the same. Therefore, we conclude that CF- noma with statistical channel state information,” IEEE Wire-
FCFNN based on fully connected feedforward neural less Communications Letters, vol. 7, no. 2, 2018, pp. 142–
145.
networks can significantly improve the channel recon- [7] S. Arzykulov, T. A. Tsiftsis, et al., “Outage performance
struction quality without increasing the time complex- of cooperative underlay cr-noma with imperfect csi,” IEEE
ity. Communications Letters, vol. 23, no. 1, 2019, pp. 176–179.
[8] P. Li, Z. Ding, et al., “Enhanced receiver based on fec
code constraints for uplink noma with imperfect csi,” IEEE
V. CONCLUSION Transactions on Wireless Communications, vol. 18, no. 10,
2019, pp. 4790–4802.
In this paper, we propose a novel CSI feedback frame- [9] M. S. Sim, J. Park, et al., “Compressed channel feedback
for correlated massive mimo systems,” Journal of Commu-
work CF-FCFNN by using the fully connected feed- nications and Networks, vol. 18, no. 1, 2016, pp. 95–104.
forward neural networks. The framework can sig- [10] P. Kuo, H. T. Kung, et al., “Compressive sensing based
nificantly improves the channel reconstruction quality channel feedback protocols for spatially-correlated massive
antenna arrays,” in 2012 IEEE Wireless Communications
without increasing the time complexity. For accurate
and Networking Conference (WCNC), 2012, pp. 492–497.
CSI can avoid error propagation effectively in SIC re- [11] Z. Zhang, K. C. Teh, et al., “Application of compressive
ceivers in NOMA system, it is foreseeable that bet- sensing to limited feedback strategy in large-scale multiple-
ter system performance can be obtained using our CSI input single-output cellular networks,” IET Communica-
tions, vol. 8, no. 6, 2014, pp. 947–955.
feedback scheme. Furthermore, based on the existing [12] X. Rao and V. K. N. Lau, “Distributed compressive csit
CSI feedback framework, we believe that if the time estimation and feedback for fdd multi-user massive mimo
correlation of channels was taken into account, the systems,” IEEE Transactions on Signal Processing, vol. 62,
performance of channel matrix reconstruction would no. 12, 2014, pp. 3261–3271.
[13] C. Wen, W. Shih, et al., “Deep learning for massive
be further improved [21]. Therefore, we leave the ex- mimo csi feedback,” IEEE Wireless Communications Let-
ploitation of the time correlation across fast-varying ters, vol. 7, no. 5, 2018, pp. 748–751.
channels as a topic for future researches. [14] Y. B. I. Goodfellow and A. Courville, “Deep learning,”
CambridgeMA, USA: MIT Press, 2016.
[15] J. Choi, D. J. Love, et al., “Downlink training techniques
ACKNOWLEDGEMENT for fdd massive mimo systems: Open-loop and closed-loop
training with memory,” IEEE Journal of Selected Topics in
Signal Processing, vol. 8, no. 5, 2014, pp. 802–814.
This work was supported by the Key Research and De-
[16] K. He, X. Zhang, et al., “Deep residual learning for image
velopment Project of Shaanxi Province under Grant recognition,” in 2016 IEEE Conference on Computer Vision
no.2019ZDLGY07-07. and Pattern Recognition (CVPR), 2016, pp. 770–778.
[17] K. He, X. Zhang, et al., “Identity mappings in deep residual
networks,” Arxiv, 03 2016.
[18] S. Ioffe and C. Szegedy, “Batch normalization: Accelerating
References deep network training by reducing internal covariate shift,”
02 2015.
[1] F. Boccardi, R. W. Heath, et al., “Five disruptive technol- [19] D. Kingma and J. Ba, “Adam: A method for stochastic opti-
ogy directions for 5g,” IEEE Communications Magazine, mization,” International Conference on Learning Represen-
vol. 52, no. 2, 2014, pp. 74–80. tations, 12 2014.
[2] T. L. Marzetta, “Noncooperative cellular wireless with un- [20] L. Liu, C. Oestges, et al., “The cost 2100 mimo channel
limited numbers of base station antennas,” IEEE Transac-
model,” IEEE Wireless Communications, vol. 19, no. 6, Tanming Liao received the B.S. de-
2012, pp. 92–99. gree in electronic information engineer-
[21] T. Wang, C. Wen, et al., “Deep learning-based csi feedback ing from Anhui University of Finance and
approach for time-varying massive mimo channels,” IEEE Economics, Bengbu, China, in 2017 and
Wireless Communications Letters, vol. 8, no. 2, 2019, pp. is currently working toward the M.S. de-
416–419. gree in electronics and communication
engineering at Xidian University, Xi’an,
China. His research is mainly focus on ar-
tificial intelligence communication technology.
Biographies
Ming Gao received the Ph.D. degree in
telecommunications engineering from Xi- Yubin Lu received the B.S. degree
dian University, Xi’an, China, in 2009. in Communication Engineering from
He is currently an associate Professor in Taiyuan University of Technology,
Xidian University. From 2015 to 2016, Shanxi, China, in 2018 and is currently
he was a visiting scholar at the depart- working toward the M.S. degree in elec-
ment of electrical engineering and com- tronics and communication engineering
puter science in the University of Ten- at Xidian University, Xi an, China. Her
nessee, Knoxville, USA. His research is mainly focused on wire- research is mainly focus on authentication
less communication, information security and artificial intelli- protocol for RFID.
gence communication technology.

Ming_May2020

Uploaded by

Copyright:

Available Formats

You might also like

Ming_May2020

Uploaded by

Document Information

Copyright

Available Formats

Share this document

Share or Embed Document

Sharing Options

Did you find this document useful?

Is this content inappropriate?

Copyright:

Available Formats

Ming_May2020

Uploaded by

Copyright:

Available Formats

C OMMUNICATIONS T HEORIES & S YSTEMS

Fully Connected Feedforward Neural Networks Based CSI

Ming Gao* , Tanming Liao, Yubin Lu

School of Telecommunication Engineering, Xidian University, Xi’an 710071, China

China Communication · January 2021 43

Decoder Estimation Encoder

44 China Communication · January 2021

China Communication · January 2021 45

experiments and compare our approach with existing

46 China Communication · January 2021

China Communication · January 2021 47

48 China Communication · January 2021

You might also like